Science.gov

Sample records for conifer genomics resource

  1. Generation, functional annotation and comparative analysis of black spruce (Picea mariana) ESTs: an important conifer genomic resource

    PubMed Central

    2013-01-01

    Background EST (expressed sequence tag) sequences and their annotation provide a highly valuable resource for gene discovery, genome sequence annotation, and other genomics studies that can be applied in genetics, breeding and conservation programs for non-model organisms. Conifers are long-lived plants that are ecologically and economically important globally, and have a large genome size. Black spruce (Picea mariana), is a transcontinental species of the North American boreal and temperate forests. However, there are limited transcriptomic and genomic resources for this species. The primary objective of our study was to develop a black spruce transcriptomic resource to facilitate on-going functional genomics projects related to growth and adaptation to climate change. Results We conducted bidirectional sequencing of cDNA clones from a standard cDNA library constructed from black spruce needle tissues. We obtained 4,594 high quality (2,455 5' end and 2,139 3' end) sequence reads, with an average read-length of 532 bp. Clustering and assembly of ESTs resulted in 2,731 unique sequences, consisting of 2,234 singletons and 497 contigs. Approximately two-thirds (63%) of unique sequences were functionally annotated. Genes involved in 36 molecular functions and 90 biological processes were discovered, including 24 putative transcription factors and 232 genes involved in photosynthesis. Most abundantly expressed transcripts were associated with photosynthesis, growth factors, stress and disease response, and transcription factors. A total of 216 full-length genes were identified. About 18% (493) of the transcripts were novel, representing an important addition to the Genbank EST database (dbEST). Fifty-seven di-, tri-, tetra- and penta-nucleotide simple sequence repeats were identified. Conclusions We have developed the first high quality EST resource for black spruce and identified 493 novel transcripts, which may be species-specific related to life history and

  2. Generation, functional annotation and comparative analysis of black spruce (Picea mariana) ESTs: an important conifer genomic resource.

    PubMed

    Mann, Ishminder K; Wegrzyn, Jill L; Rajora, Om P

    2013-10-11

    EST (expressed sequence tag) sequences and their annotation provide a highly valuable resource for gene discovery, genome sequence annotation, and other genomics studies that can be applied in genetics, breeding and conservation programs for non-model organisms. Conifers are long-lived plants that are ecologically and economically important globally, and have a large genome size. Black spruce (Picea mariana), is a transcontinental species of the North American boreal and temperate forests. However, there are limited transcriptomic and genomic resources for this species. The primary objective of our study was to develop a black spruce transcriptomic resource to facilitate on-going functional genomics projects related to growth and adaptation to climate change. We conducted bidirectional sequencing of cDNA clones from a standard cDNA library constructed from black spruce needle tissues. We obtained 4,594 high quality (2,455 5' end and 2,139 3' end) sequence reads, with an average read-length of 532 bp. Clustering and assembly of ESTs resulted in 2,731 unique sequences, consisting of 2,234 singletons and 497 contigs. Approximately two-thirds (63%) of unique sequences were functionally annotated. Genes involved in 36 molecular functions and 90 biological processes were discovered, including 24 putative transcription factors and 232 genes involved in photosynthesis. Most abundantly expressed transcripts were associated with photosynthesis, growth factors, stress and disease response, and transcription factors. A total of 216 full-length genes were identified. About 18% (493) of the transcripts were novel, representing an important addition to the Genbank EST database (dbEST). Fifty-seven di-, tri-, tetra- and penta-nucleotide simple sequence repeats were identified. We have developed the first high quality EST resource for black spruce and identified 493 novel transcripts, which may be species-specific related to life history and ecological traits. We have also

  3. Towards decoding the conifer giga-genome.

    PubMed

    Mackay, John; Dean, Jeffrey F D; Plomion, Christophe; Peterson, Daniel G; Cánovas, Francisco M; Pavy, Nathalie; Ingvarsson, Pär K; Savolainen, Outi; Guevara, M Ángeles; Fluch, Silvia; Vinceti, Barbara; Abarca, Dolores; Díaz-Sala, Carmen; Cervera, María-Teresa

    2012-12-01

    Several new initiatives have been launched recently to sequence conifer genomes including pines, spruces and Douglas-fir. Owing to the very large genome sizes ranging from 18 to 35 gigabases, sequencing even a single conifer genome had been considered unattainable until the recent throughput increases and cost reductions afforded by next generation sequencers. The purpose of this review is to describe the context for these new initiatives. A knowledge foundation has been acquired in several conifers of commercial and ecological interest through large-scale cDNA analyses, construction of genetic maps and gene mapping studies aiming to link phenotype and genotype. Exploratory sequencing in pines and spruces have pointed out some of the unique properties of these giga-genomes and suggested strategies that may be needed to extract value from their sequencing. The hope is that recent and pending developments in sequencing technology will contribute to rapidly filling the knowledge vacuum surrounding their structure, contents and evolution. Researchers are also making plans to use comparative analyses that will help to turn the data into a valuable resource for enhancing and protecting the world's conifer forests.

  4. A white spruce gene catalog for conifer genome analyses.

    PubMed

    Rigault, Philippe; Boyle, Brian; Lepage, Pierre; Cooke, Janice E K; Bousquet, Jean; MacKay, John J

    2011-09-01

    Several angiosperm plant genomes, including Arabidopsis (Arabidopsis thaliana), rice (Oryza sativa), poplar (Populus trichocarpa), and grapevine (Vitis vinifera), have been sequenced, but the lack of reference genomes in gymnosperm phyla reduces our understanding of plant evolution and restricts the potential impacts of genomics research. A gene catalog was developed for the conifer tree Picea glauca (white spruce) through large-scale expressed sequence tag sequencing and full-length cDNA sequencing to facilitate genome characterizations, comparative genomics, and gene mapping. The resource incorporates new and publicly available sequences into 27,720 cDNA clusters, 23,589 of which are represented by full-length insert cDNAs. Expressed sequence tags, mate-pair cDNA clone analysis, and custom sequencing were integrated through an iterative process to improve the accuracy of clustering outcomes. The entire catalog spans 30 Mb of unique transcribed sequence. We estimated that the P. glauca nuclear genome contains up to 32,520 transcribed genes owing to incomplete, partially sequenced, and unsampled transcripts and that its transcriptome could span up to 47 Mb. These estimates are in the same range as the Arabidopsis and rice transcriptomes. Next-generation methods confirmed and enhanced the catalog by providing deeper coverage for rare transcripts, by extending many incomplete clusters, and by augmenting the overall transcriptome coverage to 38 Mb of unique sequence. Genomic sample sequencing at 8.5% of the 19.8-Gb P. glauca genome identified 1,495 clusters representing highly repeated sequences among the cDNA clusters. With a conifer transcriptome in full view, functional and protein domain annotations clearly highlighted the divergences between conifers and angiosperms, likely reflecting their respective evolutionary paths.

  5. A conifer genomics resource of 200,000 spruce (Picea spp.) ESTs and 6,464 high-quality, sequence-finished full-length cDNAs for Sitka spruce (Picea sitchensis).

    PubMed

    Ralph, Steven G; Chun, Hye Jung E; Kolosova, Natalia; Cooper, Dawn; Oddy, Claire; Ritland, Carol E; Kirkpatrick, Robert; Moore, Richard; Barber, Sarah; Holt, Robert A; Jones, Steven J M; Marra, Marco A; Douglas, Carl J; Ritland, Kermit; Bohlmann, Jörg

    2008-10-14

    Members of the pine family (Pinaceae), especially species of spruce (Picea spp.) and pine (Pinus spp.), dominate many of the world's temperate and boreal forests. These conifer forests are of critical importance for global ecosystem stability and biodiversity. They also provide the majority of the world's wood and fiber supply and serve as a renewable resource for other industrial biomaterials. In contrast to angiosperms, functional and comparative genomics research on conifers, or other gymnosperms, is limited by the lack of a relevant reference genome sequence. Sequence-finished full-length (FL)cDNAs and large collections of expressed sequence tags (ESTs) are essential for gene discovery, functional genomics, and for future efforts of conifer genome annotation. As part of a conifer genomics program to characterize defense against insects and adaptation to local environments, and to discover genes for the production of biomaterials, we developed 20 standard, normalized or full-length enriched cDNA libraries from Sitka spruce (P. sitchensis), white spruce (P. glauca), and interior spruce (P. glauca-engelmannii complex). We sequenced and analyzed 206,875 3'- or 5'-end ESTs from these libraries, and developed a resource of 6,464 high-quality sequence-finished FLcDNAs from Sitka spruce. Clustering and assembly of 147,146 3'-end ESTs resulted in 19,941 contigs and 26,804 singletons, representing 46,745 putative unique transcripts (PUTs). The 6,464 FLcDNAs were all obtained from a single Sitka spruce genotype and represent 5,718 PUTs. This paper provides detailed annotation and quality assessment of a large EST and FLcDNA resource for spruce. The 6,464 Sitka spruce FLcDNAs represent the third largest sequence-verified FLcDNA resource for any plant species, behind only rice (Oryza sativa) and Arabidopsis (Arabidopsis thaliana), and the only substantial FLcDNA resource for a gymnosperm. Our emphasis on capturing FLcDNAs and ESTs from cDNA libraries representing herbivore

  6. A conifer genomics resource of 200,000 spruce (Picea spp.) ESTs and 6,464 high-quality, sequence-finished full-length cDNAs for Sitka spruce (Picea sitchensis)

    PubMed Central

    Ralph, Steven G; Chun, Hye Jung E; Kolosova, Natalia; Cooper, Dawn; Oddy, Claire; Ritland, Carol E; Kirkpatrick, Robert; Moore, Richard; Barber, Sarah; Holt, Robert A; Jones, Steven JM; Marra, Marco A; Douglas, Carl J; Ritland, Kermit; Bohlmann, Jörg

    2008-01-01

    Background Members of the pine family (Pinaceae), especially species of spruce (Picea spp.) and pine (Pinus spp.), dominate many of the world's temperate and boreal forests. These conifer forests are of critical importance for global ecosystem stability and biodiversity. They also provide the majority of the world's wood and fiber supply and serve as a renewable resource for other industrial biomaterials. In contrast to angiosperms, functional and comparative genomics research on conifers, or other gymnosperms, is limited by the lack of a relevant reference genome sequence. Sequence-finished full-length (FL)cDNAs and large collections of expressed sequence tags (ESTs) are essential for gene discovery, functional genomics, and for future efforts of conifer genome annotation. Results As part of a conifer genomics program to characterize defense against insects and adaptation to local environments, and to discover genes for the production of biomaterials, we developed 20 standard, normalized or full-length enriched cDNA libraries from Sitka spruce (P. sitchensis), white spruce (P. glauca), and interior spruce (P. glauca-engelmannii complex). We sequenced and analyzed 206,875 3'- or 5'-end ESTs from these libraries, and developed a resource of 6,464 high-quality sequence-finished FLcDNAs from Sitka spruce. Clustering and assembly of 147,146 3'-end ESTs resulted in 19,941 contigs and 26,804 singletons, representing 46,745 putative unique transcripts (PUTs). The 6,464 FLcDNAs were all obtained from a single Sitka spruce genotype and represent 5,718 PUTs. Conclusion This paper provides detailed annotation and quality assessment of a large EST and FLcDNA resource for spruce. The 6,464 Sitka spruce FLcDNAs represent the third largest sequence-verified FLcDNA resource for any plant species, behind only rice (Oryza sativa) and Arabidopsis (Arabidopsis thaliana), and the only substantial FLcDNA resource for a gymnosperm. Our emphasis on capturing FLcDNAs and ESTs from c

  7. Early genome duplications in conifers and other seed plants.

    PubMed

    Li, Zheng; Baniaga, Anthony E; Sessa, Emily B; Scascitelli, Moira; Graham, Sean W; Rieseberg, Loren H; Barker, Michael S

    2015-11-01

    Polyploidy is a common mode of speciation and evolution in angiosperms (flowering plants). In contrast, there is little evidence to date that whole genome duplication (WGD) has played a significant role in the evolution of their putative extant sister lineage, the gymnosperms. Recent analyses of the spruce genome, the first published conifer genome, failed to detect evidence of WGDs in gene age distributions and attributed many aspects of conifer biology to a lack of WGDs. We present evidence for three ancient genome duplications during the evolution of gymnosperms, based on phylogenomic analyses of transcriptomes from 24 gymnosperms and 3 outgroups. We use a new algorithm to place these WGD events in phylogenetic context: two in the ancestry of major conifer clades (Pinaceae and cupressophyte conifers) and one in Welwitschia (Gnetales). We also confirm that a WGD hypothesized to be restricted to seed plants is indeed not shared with ferns and relatives (monilophytes), a result that was unclear in earlier studies. Contrary to previous genomic research that reported an absence of polyploidy in the ancestry of contemporary gymnosperms, our analyses indicate that polyploidy has contributed to the evolution of conifers and other gymnosperms. As in the flowering plants, the evolution of the large genome sizes of gymnosperms involved both polyploidy and repetitive element activity.

  8. Early genome duplications in conifers and other seed plants

    PubMed Central

    Li, Zheng; Baniaga, Anthony E.; Sessa, Emily B.; Scascitelli, Moira; Graham, Sean W.; Rieseberg, Loren H.; Barker, Michael S.

    2015-01-01

    Polyploidy is a common mode of speciation and evolution in angiosperms (flowering plants). In contrast, there is little evidence to date that whole genome duplication (WGD) has played a significant role in the evolution of their putative extant sister lineage, the gymnosperms. Recent analyses of the spruce genome, the first published conifer genome, failed to detect evidence of WGDs in gene age distributions and attributed many aspects of conifer biology to a lack of WGDs. We present evidence for three ancient genome duplications during the evolution of gymnosperms, based on phylogenomic analyses of transcriptomes from 24 gymnosperms and 3 outgroups. We use a new algorithm to place these WGD events in phylogenetic context: two in the ancestry of major conifer clades (Pinaceae and cupressophyte conifers) and one in Welwitschia (Gnetales). We also confirm that a WGD hypothesized to be restricted to seed plants is indeed not shared with ferns and relatives (monilophytes), a result that was unclear in earlier studies. Contrary to previous genomic research that reported an absence of polyploidy in the ancestry of contemporary gymnosperms, our analyses indicate that polyploidy has contributed to the evolution of conifers and other gymnosperms. As in the flowering plants, the evolution of the large genome sizes of gymnosperms involved both polyploidy and repetitive element activity. PMID:26702445

  9. Resource physiology of conifers: Acquisition, allocation, and utilization

    SciTech Connect

    Smith, W.K.; Hinckley, T.M.

    1995-03-01

    This book focuses on a synthetic view of the resource physiology of conifer trees with an emphasis on developing a perspective that can integrate across the biological hierarchy. This objective is in concert with more scientific goals of maintaining biological diversity and the sustainability of forest systems. The preservation of coniferous forest ecosystems is a major concern today. This volume deals with the topics of resource acquisition, allocation, and utilization in conifers. Selected papers are indexed separately for inclusion in the Energy Science and Technology Database.

  10. Improved white spruce (Picea glauca) genome assemblies and annotation of large gene families of conifer terpenoid and phenolic defense metabolism.

    PubMed

    Warren, René L; Keeling, Christopher I; Yuen, Macaire Man Saint; Raymond, Anthony; Taylor, Greg A; Vandervalk, Benjamin P; Mohamadi, Hamid; Paulino, Daniel; Chiu, Readman; Jackman, Shaun D; Robertson, Gordon; Yang, Chen; Boyle, Brian; Hoffmann, Margarete; Weigel, Detlef; Nelson, David R; Ritland, Carol; Isabel, Nathalie; Jaquish, Barry; Yanchuk, Alvin; Bousquet, Jean; Jones, Steven J M; MacKay, John; Birol, Inanc; Bohlmann, Joerg

    2015-07-01

    White spruce (Picea glauca), a gymnosperm tree, has been established as one of the models for conifer genomics. We describe the draft genome assemblies of two white spruce genotypes, PG29 and WS77111, innovative tools for the assembly of very large genomes, and the conifer genomics resources developed in this process. The two white spruce genotypes originate from distant geographic regions of western (PG29) and eastern (WS77111) North America, and represent elite trees in two Canadian tree-breeding programs. We present an update (V3 and V4) for a previously reported PG29 V2 draft genome assembly and introduce a second white spruce genome assembly for genotype WS77111. Assemblies of the PG29 and WS77111 genomes confirm the reconstructed white spruce genome size in the 20 Gbp range, and show broad synteny. Using the PG29 V3 assembly and additional white spruce genomics and transcriptomics resources, we performed MAKER-P annotation and meticulous expert annotation of very large gene families of conifer defense metabolism, the terpene synthases and cytochrome P450s. We also comprehensively annotated the white spruce mevalonate, methylerythritol phosphate and phenylpropanoid pathways. These analyses highlighted the large extent of gene and pseudogene duplications in a conifer genome, in particular for genes of secondary (i.e. specialized) metabolism, and the potential for gain and loss of function for defense and adaptation.

  11. Branching habit and the allocation of reproductive resources in conifers

    PubMed Central

    Leslie, Andrew B.

    2012-01-01

    Background and Aims Correlated relationships between branch thickness, branch density, and twig and leaf size have been used extensively to study the evolution of plant canopy architecture, but fewer studies have explored the impact of these relationships on the allocation of reproductive resources. This study quantifies pollen cone production in conifers, which have similar basic reproductive biology but vary dramatically in branching habit, in order to test how differences in branch diameter influence pollen cone size and the density with which they are deployed in the canopy. Methods Measurements of canopy branch density, the number of cones per branch and cone size were used to estimate the amount of pollen cone tissues produced by 16 species in three major conifer clades. The number of pollen grains produced was also estimated using direct counts from individual pollen cones. Key Results The total amount of pollen cone tissues in the conifer canopy varied little among species and clades, although vegetative traits such as branch thickness, branch density and pollen cone size varied over several orders of magnitude. However, branching habit controls the way these tissues are deployed: taxa with small branches produce small pollen cones at a high density, while taxa with large branches produce large cones relatively sparsely. Conclusions Conifers appear to invest similar amounts of energy in pollen production independent of branching habit. However, similar associations between branch thickness, branch density and pollen cone size are seen across conifers, including members of living and extinct groups not directly studied here. This suggests that reproductive features relating to pollen cone size are in large part a function of the evolution of vegetative morphology and branching habit. PMID:22782240

  12. Targeted isolation, sequence assembly and characterization of two white spruce (Picea glauca) BAC clones for terpenoid synthase and cytochrome P450 genes involved in conifer defence reveal insights into a conifer genome

    PubMed Central

    2009-01-01

    Background Conifers are a large group of gymnosperm trees which are separated from the angiosperms by more than 300 million years of independent evolution. Conifer genomes are extremely large and contain considerable amounts of repetitive DNA. Currently, conifer sequence resources exist predominantly as expressed sequence tags (ESTs) and full-length (FL)cDNAs. There is no genome sequence available for a conifer or any other gymnosperm. Conifer defence-related genes often group into large families with closely related members. The goals of this study are to assess the feasibility of targeted isolation and sequence assembly of conifer BAC clones containing specific genes from two large gene families, and to characterize large segments of genomic DNA sequence for the first time from a conifer. Results We used a PCR-based approach to identify BAC clones for two target genes, a terpene synthase (3-carene synthase; 3CAR) and a cytochrome P450 (CYP720B4) from a non-arrayed genomic BAC library of white spruce (Picea glauca). Shotgun genomic fragments isolated from the BAC clones were sequenced to a depth of 15.6- and 16.0-fold coverage, respectively. Assembly and manual curation yielded sequence scaffolds of 172 kbp (3CAR) and 94 kbp (CYP720B4) long. Inspection of the genomic sequences revealed the intron-exon structures, the putative promoter regions and putative cis-regulatory elements of these genes. Sequences related to transposable elements (TEs), high complexity repeats and simple repeats were prevalent and comprised approximately 40% of the sequenced genomic DNA. An in silico simulation of the effect of sequencing depth on the quality of the sequence assembly provides direction for future efforts of conifer genome sequencing. Conclusion We report the first targeted cloning, sequencing, assembly, and annotation of large segments of genomic DNA from a conifer. We demonstrate that genomic BAC clones for individual members of multi-member gene families can be isolated

  13. Targeted isolation, sequence assembly and characterization of two white spruce (Picea glauca) BAC clones for terpenoid synthase and cytochrome P450 genes involved in conifer defence reveal insights into a conifer genome.

    PubMed

    Hamberger, Björn; Hall, Dawn; Yuen, Mack; Oddy, Claire; Hamberger, Britta; Keeling, Christopher I; Ritland, Carol; Ritland, Kermit; Bohlmann, Jörg

    2009-08-06

    Conifers are a large group of gymnosperm trees which are separated from the angiosperms by more than 300 million years of independent evolution. Conifer genomes are extremely large and contain considerable amounts of repetitive DNA. Currently, conifer sequence resources exist predominantly as expressed sequence tags (ESTs) and full-length (FL)cDNAs. There is no genome sequence available for a conifer or any other gymnosperm. Conifer defence-related genes often group into large families with closely related members. The goals of this study are to assess the feasibility of targeted isolation and sequence assembly of conifer BAC clones containing specific genes from two large gene families, and to characterize large segments of genomic DNA sequence for the first time from a conifer. We used a PCR-based approach to identify BAC clones for two target genes, a terpene synthase (3-carene synthase; 3CAR) and a cytochrome P450 (CYP720B4) from a non-arrayed genomic BAC library of white spruce (Picea glauca). Shotgun genomic fragments isolated from the BAC clones were sequenced to a depth of 15.6- and 16.0-fold coverage, respectively. Assembly and manual curation yielded sequence scaffolds of 172 kbp (3CAR) and 94 kbp (CYP720B4) long. Inspection of the genomic sequences revealed the intron-exon structures, the putative promoter regions and putative cis-regulatory elements of these genes. Sequences related to transposable elements (TEs), high complexity repeats and simple repeats were prevalent and comprised approximately 40% of the sequenced genomic DNA. An in silico simulation of the effect of sequencing depth on the quality of the sequence assembly provides direction for future efforts of conifer genome sequencing. We report the first targeted cloning, sequencing, assembly, and annotation of large segments of genomic DNA from a conifer. We demonstrate that genomic BAC clones for individual members of multi-member gene families can be isolated in a gene-specific fashion. The

  14. Conifer genomics and adaptation: at the crossroads of genetic diversity and genome function.

    PubMed

    Prunier, Julien; Verta, Jukka-Pekka; MacKay, John J

    2016-01-01

    Conifers have been understudied at the genomic level despite their worldwide ecological and economic importance but the situation is rapidly changing with the development of next generation sequencing (NGS) technologies. With NGS, genomics research has simultaneously gained in speed, magnitude and scope. In just a few years, genomes of 20-24 gigabases have been sequenced for several conifers, with several others expected in the near future. Biological insights have resulted from recent sequencing initiatives as well as genetic mapping, gene expression profiling and gene discovery research over nearly two decades. We review the knowledge arising from conifer genomics research emphasizing genome evolution and the genomic basis of adaptation, and outline emerging questions and knowledge gaps. We discuss future directions in three areas with potential inputs from NGS technologies: the evolutionary impacts of adaptation in conifers based on the adaptation-by-speciation model; the contributions of genetic variability of gene expression in adaptation; and the development of a broader understanding of genetic diversity and its impacts on genome function. These research directions promise to sustain research aimed at addressing the emerging challenges of adaptation that face conifer trees. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  15. A White Spruce Gene Catalog for Conifer Genome Analyses1[W][OA

    PubMed Central

    Rigault, Philippe; Boyle, Brian; Lepage, Pierre; Cooke, Janice E.K.; Bousquet, Jean; MacKay, John J.

    2011-01-01

    Several angiosperm plant genomes, including Arabidopsis (Arabidopsis thaliana), rice (Oryza sativa), poplar (Populus trichocarpa), and grapevine (Vitis vinifera), have been sequenced, but the lack of reference genomes in gymnosperm phyla reduces our understanding of plant evolution and restricts the potential impacts of genomics research. A gene catalog was developed for the conifer tree Picea glauca (white spruce) through large-scale expressed sequence tag sequencing and full-length cDNA sequencing to facilitate genome characterizations, comparative genomics, and gene mapping. The resource incorporates new and publicly available sequences into 27,720 cDNA clusters, 23,589 of which are represented by full-length insert cDNAs. Expressed sequence tags, mate-pair cDNA clone analysis, and custom sequencing were integrated through an iterative process to improve the accuracy of clustering outcomes. The entire catalog spans 30 Mb of unique transcribed sequence. We estimated that the P. glauca nuclear genome contains up to 32,520 transcribed genes owing to incomplete, partially sequenced, and unsampled transcripts and that its transcriptome could span up to 47 Mb. These estimates are in the same range as the Arabidopsis and rice transcriptomes. Next-generation methods confirmed and enhanced the catalog by providing deeper coverage for rare transcripts, by extending many incomplete clusters, and by augmenting the overall transcriptome coverage to 38 Mb of unique sequence. Genomic sample sequencing at 8.5% of the 19.8-Gb P. glauca genome identified 1,495 clusters representing highly repeated sequences among the cDNA clusters. With a conifer transcriptome in full view, functional and protein domain annotations clearly highlighted the divergences between conifers and angiosperms, likely reflecting their respective evolutionary paths. PMID:21730200

  16. Sequenced genomes and rapidly emerging technologies pave the way for conifer evolutionary developmental biology

    PubMed Central

    Uddenberg, Daniel; Akhter, Shirin; Ramachandran, Prashanth; Sundström, Jens F.; Carlsbecker, Annelie

    2015-01-01

    Conifers, Ginkgo, cycads and gnetophytes comprise the four groups of extant gymnosperms holding a unique position of sharing common ancestry with the angiosperms. Comparative studies of gymnosperms and angiosperms are the key to a better understanding of ancient seed plant morphologies, how they have shifted over evolution to shape modern day species, and how the genes governing these morphologies have evolved. However, conifers and other gymnosperms have been notoriously difficult to study due to their long generation times, inaccessibility to genetic experimentation and unavailable genome sequences. Now, with three draft genomes from spruces and pines, rapid advances in next generation sequencing methods for genome wide expression analyses, and enhanced methods for genetic transformation, we are much better equipped to address a number of key evolutionary questions relating to seed plant evolution. In this mini-review we highlight recent progress in conifer developmental biology relevant to evo-devo questions. We discuss how genome sequence data and novel techniques might allow us to explore genetic variation and naturally occurring conifer mutants, approaches to reduce long generation times to allow for genetic studies in conifers, and other potential upcoming research avenues utilizing current and emergent techniques. Results from developmental studies of conifers and other gymnosperms in comparison to those in angiosperms will provide information to trace core molecular developmental control tool kits of ancestral seed plants, but foremost they will greatly improve our understanding of the biology of conifers and other gymnosperms in their own right. PMID:26579190

  17. Sequenced genomes and rapidly emerging technologies pave the way for conifer evolutionary developmental biology.

    PubMed

    Uddenberg, Daniel; Akhter, Shirin; Ramachandran, Prashanth; Sundström, Jens F; Carlsbecker, Annelie

    2015-01-01

    Conifers, Ginkgo, cycads and gnetophytes comprise the four groups of extant gymnosperms holding a unique position of sharing common ancestry with the angiosperms. Comparative studies of gymnosperms and angiosperms are the key to a better understanding of ancient seed plant morphologies, how they have shifted over evolution to shape modern day species, and how the genes governing these morphologies have evolved. However, conifers and other gymnosperms have been notoriously difficult to study due to their long generation times, inaccessibility to genetic experimentation and unavailable genome sequences. Now, with three draft genomes from spruces and pines, rapid advances in next generation sequencing methods for genome wide expression analyses, and enhanced methods for genetic transformation, we are much better equipped to address a number of key evolutionary questions relating to seed plant evolution. In this mini-review we highlight recent progress in conifer developmental biology relevant to evo-devo questions. We discuss how genome sequence data and novel techniques might allow us to explore genetic variation and naturally occurring conifer mutants, approaches to reduce long generation times to allow for genetic studies in conifers, and other potential upcoming research avenues utilizing current and emergent techniques. Results from developmental studies of conifers and other gymnosperms in comparison to those in angiosperms will provide information to trace core molecular developmental control tool kits of ancestral seed plants, but foremost they will greatly improve our understanding of the biology of conifers and other gymnosperms in their own right.

  18. Molecular Aspects of Conifer Zygotic and Somatic Embryo Development: A Review of Genome-Wide Approaches and Recent Insights.

    PubMed

    Trontin, Jean-François; Klimaszewska, Krystyna; Morel, Alexandre; Hargreaves, Catherine; Lelu-Walter, Marie-Anne

    2016-01-01

    Genome-wide profiling (transcriptomics, proteomics, metabolomics) is providing unprecedented opportunities to unravel the complexity of coordinated gene expression during embryo development in trees, especially conifer species harboring "giga-genome." This knowledge should be critical for the efficient delivery of improved varieties through seeds and/or somatic embryos in fluctuating markets and to cope with climate change. We reviewed "omics" as well as targeted gene expression studies during both somatic and zygotic embryo development in conifers and tentatively puzzled over the critical processes and genes involved at the specific developmental and transition stages. Current limitations to the interpretation of these large datasets are going to be lifted through the ongoing development of comprehensive genome resources in conifers. Nevertheless omics already confirmed that master regulators (e.g., transcription and epigenetic factors) play central roles. As in model angiosperms, the molecular regulation from early to late embryogenesis may mainly arise from spatiotemporal modulation of auxin-, gibberellin-, and abscisic acid-mediated responses. Omics also showed the potential for the development of tools to assess the progress of embryo development or to build genotype-independent, predictive models of embryogenesis-specific characteristics.

  19. Chloroplast genomes of two conifers lack a large inverted repeat and are extensively rearranged.

    PubMed Central

    Strauss, S H; Palmer, J D; Howe, G T; Doerksen, A H

    1988-01-01

    Chloroplast genomes of Douglas-fir [Pseudotsuga menziesii (Mirb.) Franco] and radiata (Monterey) pine [Pinus radiata D. Don], two conifers from the widespread Pinaceae, were mapped and their genomes were compared to other land plants. Douglas-fir and radiata pine lack the large (20-25 kilobases) inverted repeat that characterizes most land plants. To our knowledge, this is only the second recorded loss of this ancient and highly conserved inverted repeat among all lineages of land plants thus far examined. Loss of the repeat largely accounts for the small size of the conifer genome, 120 kilobase, versus 140-160 kilobases in most land plants. Douglas-fir possesses a major inversion of 40-50 kilobases relative to radiata pine and nonconiferous plants. Nucleotide sequence differentiation between Douglas-fir and radiata pine was estimated to be 3.8%. Both conifer genomes possess a number of rearrangements relative to Osmunda, a fern, Ginkgo, a gymnosperm, and Petunia, an angiosperm. Among land plants, structural changes of this degree have occurred primarily within tribes of the legume family (Fabaceae) that have also lost the inverted repeat. These results support the hypothesis that the presence of the large inverted repeat stabilizes the chloroplast genome against major structural rearrangements. PMID:2836862

  20. Conifer stored resources and resistance to a fungus associated with the spruce bark beetle Ips typographus.

    PubMed

    Lahr, Eleanor C; Krokene, Paal

    2013-01-01

    Bark beetles and associated fungi are among the greatest natural threats to conifers worldwide. Conifers have potent defenses, but resistance to beetles and fungal pathogens may be reduced if tree stored resources are consumed by fungi rather than used for tree defense. Here, we assessed the relationship between tree stored resources and resistance to Ceratocystis polonica, a phytopathogenic fungus vectored by the spruce bark beetle Ips typographus. We measured phloem and sapwood nitrogen, non-structural carbohydrates (NSC), and lipids before and after trees were attacked by I. typographus (vectoring C. polonica) or artificially inoculated with C. polonica alone. Tree resistance was assessed by measuring phloem lesions and the proportion of necrotic phloem around the tree's circumference following attack or inoculation. While initial resource concentrations were unrelated to tree resistance to C. polonica, over time, phloem NSC and sapwood lipids declined in the trees inoculated with C. polonica. Greater resource declines correlated with less resistant trees (trees with larger lesions or more necrotic phloem), suggesting that resource depletion may be caused by fungal consumption rather than tree resistance. Ips typographus may then benefit indirectly from reduced tree defenses caused by fungal resource uptake. Our research on tree stored resources represents a novel way of understanding bark beetle-fungal-conifer interactions.

  1. A high-resolution reference genetic map positioning 8.8 K genes for the conifer white spruce: structural genomics implications and correspondence with physical distance.

    PubMed

    Pavy, Nathalie; Lamothe, Manuel; Pelgas, Betty; Gagnon, France; Birol, Inanç; Bohlmann, Joerg; Mackay, John; Isabel, Nathalie; Bousquet, Jean

    2017-04-01

    Over the last decade, extensive genetic and genomic resources have been developed for the conifer white spruce (Picea glauca, Pinaceae), which has one of the largest plant genomes (20 Gbp). Draft genome sequences of white spruce and other conifers have recently been produced, but dense genetic maps are needed to comprehend genome macrostructure, delineate regions involved in quantitative traits, complement functional genomic investigations, and assist the assembly of fragmented genomic sequences. A greatly expanded P. glauca composite linkage map was generated from a set of 1976 full-sib progeny, with the positioning of 8793 expressed genes. Regions with significant low or high gene density were identified. Gene family members tended to be mapped on the same chromosomes, with tandemly arrayed genes significantly biased towards specific functional classes. The map was integrated with transcriptome data surveyed across eight tissues. In total, 69 clusters of co-expressed and co-localising genes were identified. A high level of synteny was found with pine genetic maps, which should facilitate the transfer of structural information in the Pinaceae. Although the current white spruce genome sequence remains highly fragmented, dozens of scaffolds encompassing more than one mapped gene were identified. From these, the relationship between genetic and physical distances was examined and the genome-wide recombination rate was found to be much smaller than most estimates reported for angiosperm genomes. This gene linkage map shall assist the large-scale assembly of the next-generation white spruce genome sequence and provide a reference resource for the conifer genomics community.

  2. NCBI viral genomes resource.

    PubMed

    Brister, J Rodney; Ako-Adjei, Danso; Bao, Yiming; Blinkova, Olga

    2015-01-01

    Recent technological innovations have ignited an explosion in virus genome sequencing that promises to fundamentally alter our understanding of viral biology and profoundly impact public health policy. Yet, any potential benefits from the billowing cloud of next generation sequence data hinge upon well implemented reference resources that facilitate the identification of sequences, aid in the assembly of sequence reads and provide reference annotation sources. The NCBI Viral Genomes Resource is a reference resource designed to bring order to this sequence shockwave and improve usability of viral sequence data. The resource can be accessed at http://www.ncbi.nlm.nih.gov/genome/viruses/ and catalogs all publicly available virus genome sequences and curates reference genome sequences. As the number of genome sequences has grown, so too have the difficulties in annotating and maintaining reference sequences. The rapid expansion of the viral sequence universe has forced a recalibration of the data model to better provide extant sequence representation and enhanced reference sequence products to serve the needs of the various viral communities. This, in turn, has placed increased emphasis on leveraging the knowledge of individual scientific communities to identify important viral sequences and develop well annotated reference virus genome sets. Published by Oxford University Press on behalf of Nucleic Acids Research 2014. This work is written by US Government employees and is in the public domain in the US.

  3. NCBI Viral Genomes Resource

    PubMed Central

    Brister, J. Rodney; Ako-adjei, Danso; Bao, Yiming; Blinkova, Olga

    2015-01-01

    Recent technological innovations have ignited an explosion in virus genome sequencing that promises to fundamentally alter our understanding of viral biology and profoundly impact public health policy. Yet, any potential benefits from the billowing cloud of next generation sequence data hinge upon well implemented reference resources that facilitate the identification of sequences, aid in the assembly of sequence reads and provide reference annotation sources. The NCBI Viral Genomes Resource is a reference resource designed to bring order to this sequence shockwave and improve usability of viral sequence data. The resource can be accessed at http://www.ncbi.nlm.nih.gov/genome/viruses/ and catalogs all publicly available virus genome sequences and curates reference genome sequences. As the number of genome sequences has grown, so too have the difficulties in annotating and maintaining reference sequences. The rapid expansion of the viral sequence universe has forced a recalibration of the data model to better provide extant sequence representation and enhanced reference sequence products to serve the needs of the various viral communities. This, in turn, has placed increased emphasis on leveraging the knowledge of individual scientific communities to identify important viral sequences and develop well annotated reference virus genome sets. PMID:25428358

  4. Ensembl comparative genomics resources

    PubMed Central

    Muffato, Matthieu; Beal, Kathryn; Fitzgerald, Stephen; Gordon, Leo; Pignatelli, Miguel; Vilella, Albert J.; Searle, Stephen M. J.; Amode, Ridwan; Brent, Simon; Spooner, William; Kulesha, Eugene; Yates, Andrew; Flicek, Paul

    2016-01-01

    Evolution provides the unifying framework with which to understand biology. The coherent investigation of genic and genomic data often requires comparative genomics analyses based on whole-genome alignments, sets of homologous genes and other relevant datasets in order to evaluate and answer evolutionary-related questions. However, the complexity and computational requirements of producing such data are substantial: this has led to only a small number of reference resources that are used for most comparative analyses. The Ensembl comparative genomics resources are one such reference set that facilitates comprehensive and reproducible analysis of chordate genome data. Ensembl computes pairwise and multiple whole-genome alignments from which large-scale synteny, per-base conservation scores and constrained elements are obtained. Gene alignments are used to define Ensembl Protein Families, GeneTrees and homologies for both protein-coding and non-coding RNA genes. These resources are updated frequently and have a consistent informatics infrastructure and data presentation across all supported species. Specialized web-based visualizations are also available including synteny displays, collapsible gene tree plots, a gene family locator and different alignment views. The Ensembl comparative genomics infrastructure is extensively reused for the analysis of non-vertebrate species by other projects including Ensembl Genomes and Gramene and much of the information here is relevant to these projects. The consistency of the annotation across species and the focus on vertebrates makes Ensembl an ideal system to perform and support vertebrate comparative genomic analyses. We use robust software and pipelines to produce reference comparative data and make it freely available. Database URL: http://www.ensembl.org. PMID:26896847

  5. Ensembl comparative genomics resources.

    PubMed

    Herrero, Javier; Muffato, Matthieu; Beal, Kathryn; Fitzgerald, Stephen; Gordon, Leo; Pignatelli, Miguel; Vilella, Albert J; Searle, Stephen M J; Amode, Ridwan; Brent, Simon; Spooner, William; Kulesha, Eugene; Yates, Andrew; Flicek, Paul

    2016-01-01

    Evolution provides the unifying framework with which to understand biology. The coherent investigation of genic and genomic data often requires comparative genomics analyses based on whole-genome alignments, sets of homologous genes and other relevant datasets in order to evaluate and answer evolutionary-related questions. However, the complexity and computational requirements of producing such data are substantial: this has led to only a small number of reference resources that are used for most comparative analyses. The Ensembl comparative genomics resources are one such reference set that facilitates comprehensive and reproducible analysis of chordate genome data. Ensembl computes pairwise and multiple whole-genome alignments from which large-scale synteny, per-base conservation scores and constrained elements are obtained. Gene alignments are used to define Ensembl Protein Families, GeneTrees and homologies for both protein-coding and non-coding RNA genes. These resources are updated frequently and have a consistent informatics infrastructure and data presentation across all supported species. Specialized web-based visualizations are also available including synteny displays, collapsible gene tree plots, a gene family locator and different alignment views. The Ensembl comparative genomics infrastructure is extensively reused for the analysis of non-vertebrate species by other projects including Ensembl Genomes and Gramene and much of the information here is relevant to these projects. The consistency of the annotation across species and the focus on vertebrates makes Ensembl an ideal system to perform and support vertebrate comparative genomic analyses. We use robust software and pipelines to produce reference comparative data and make it freely available. Database URL: http://www.ensembl.org. © The Author(s) 2016. Published by Oxford University Press.

  6. Highly rearranged and size-variable chloroplast genomes in conifers II clade (cupressophytes): evolution towards shorter intergenic spacers.

    PubMed

    Wu, Chung-Shien; Chaw, Shu-Miaw

    2014-04-01

    Although conifers are of immense ecological and economic value, bioengineering of their chloroplasts remains undeveloped. Understanding the chloroplast genomic organization of conifers can facilitate their bioengineering. Members of the conifer II clade (or cupressophytes) are highly diverse in both morphologic features and chloroplast genomic organization. We compared six cupressophyte chloroplast genomes (cpDNAs) that represent four of the five cupressophyte families, including three genomes that are first reported here (Agathis dammara, Calocedrus formosana and Nageia nagi). The six cupressophyte cpDNAs have lost a pair of large inverted repeats (IRs) and vary greatly in size, organization and tRNA copies. We demonstrate that cupressophyte cpDNAs have evolved towards reduced size, largely due to shrunken intergenic spacers. In cupressophytes, cpDNA rearrangements are capable of extending intergenic spacers, and synonymous mutations are negatively associated with the size and frequency of rearrangements. The variable cpDNA sizes of cupressophytes may have been shaped by mutational burden and genomic rearrangements. On the basis of cpDNA organization, our analyses revealed that in gymnosperms, cpDNA rearrangements are phylogenetically informative, which supports the 'gnepines' clade. In addition, removal of a specific IR influences the minimal rearrangements required for the gnepines and cupressophyte clades, whereby Pinaceae favours the removal of IRB but cupressophytes exclusion of IRA. This result strongly suggests that different IR copies have been lost from conifers I and II. Our data help understand the complexity and evolution of cupressophyte cpDNAs.

  7. MIPS plant genome information resources.

    PubMed

    Spannagl, Manuel; Haberer, Georg; Ernst, Rebecca; Schoof, Heiko; Mayer, Klaus F X

    2007-01-01

    The Munich Institute for Protein Sequences (MIPS) has been involved in maintaining plant genome databases since the Arabidopsis thaliana genome project. Genome databases and analysis resources have focused on individual genomes and aim to provide flexible and maintainable data sets for model plant genomes as a backbone against which experimental data, for example from high-throughput functional genomics, can be organized and evaluated. In addition, model genomes also form a scaffold for comparative genomics, and much can be learned from genome-wide evolutionary studies.

  8. WheatGenome.info: A Resource for Wheat Genomics Resource.

    PubMed

    Lai, Kaitao

    2016-01-01

    An integrated database with a variety of Web-based systems named WheatGenome.info hosting wheat genome and genomic data has been developed to support wheat research and crop improvement. The resource includes multiple Web-based applications, which are implemented as a variety of Web-based systems. These include a GBrowse2-based wheat genome viewer with BLAST search portal, TAGdb for searching wheat second generation genome sequence data, wheat autoSNPdb, links to wheat genetic maps using CMap and CMap3D, and a wheat genome Wiki to allow interaction between diverse wheat genome sequencing activities. This portal provides links to a variety of wheat genome resources hosted at other research organizations. This integrated database aims to accelerate wheat genome research and is freely accessible via the web interface at http://www.wheatgenome.info/ .

  9. Optimization of the genotyping-by-sequencing strategy for population genomic analysis in conifers.

    PubMed

    Pan, Jin; Wang, Baosheng; Pei, Zhi-Yong; Zhao, Wei; Gao, Jie; Mao, Jian-Feng; Wang, Xiao-Ru

    2015-07-01

    Flexibility and low cost make genotyping-by-sequencing (GBS) an ideal tool for population genomic studies of nonmodel species. However, to utilize the potential of the method fully, many parameters affecting library quality and single nucleotide polymorphism (SNP) discovery require optimization, especially for conifer genomes with a high repetitive DNA content. In this study, we explored strategies for effective GBS analysis in pine species. We constructed GBS libraries using HpaII, PstI and EcoRI-MseI digestions with different multiplexing levels and examined the effect of restriction enzymes on library complexity and the impact of sequencing depth and size selection of restriction fragments on sequence coverage bias. We tested and compared UNEAK, Stacks and GATK pipelines for the GBS data, and then developed a reference-free SNP calling strategy for haploid pine genomes. Our GBS procedure proved to be effective in SNP discovery, producing 7000-11 000 and 14 751 SNPs within and among three pine species, respectively, from a PstI library. This investigation provides guidance for the design and analysis of GBS experiments, particularly for organisms for which genomic information is lacking.

  10. The Complete Chloroplast Genome Sequence of a Relict Conifer Glyptostrobus pensilis: Comparative Analysis and Insights into Dynamics of Chloroplast Genome Rearrangement in Cupressophytes and Pinaceae

    PubMed Central

    Zheng, Renhua; Xu, Haibin; Zhou, Yanwei; Li, Meiping; Lu, Fengjuan; Dong, Yini; Liu, Xin; Chen, Jinhui; Shi, Jisen

    2016-01-01

    Glyptostrobus pensilis, belonging to the monotypic genus Glyptostrobus (Family: Cupressaceae), is an ancient conifer that is naturally distributed in low-lying wet areas. Here, we report the complete chloroplast (cp) genome sequence (132,239 bp) of G. pensilis. The G. pensilis cp genome is similar in gene content, organization and genome structure to the sequenced cp genomes from other cupressophytes, especially with respect to the loss of the inverted repeat region A (IRA). Through phylogenetic analysis, we demonstrated that the genus Glyptostrobus is closely related to the genus Cryptomeria, supporting previous findings based on physiological characteristics. Since IRs play an important role in stabilize cp genome and conifer cp genomes lost different IR regions after splitting in two clades (cupressophytes and Pinaceae), we performed cp genome rearrangement analysis and found more extensive cp genome rearrangements among the species of cupressophytes relative to Pinaceae. Additional repeat analysis indicated that cupressophytes cp genomes contained less potential functional repeats, especially in Cupressaceae, compared with Pinaceae. These results suggested that dynamics of cp genome rearrangement in conifers differed since the two clades, Pinaceae and cupressophytes, lost IR copies independently and developed different repeats to complement the residual IRs. In addition, we identified 170 perfect simple sequence repeats that will be useful in future research focusing on the evolution of genetic diversity and conservation of genetic variation for this endangered species in the wild. PMID:27560965

  11. The Complete Chloroplast Genome Sequence of a Relict Conifer Glyptostrobus pensilis: Comparative Analysis and Insights into Dynamics of Chloroplast Genome Rearrangement in Cupressophytes and Pinaceae.

    PubMed

    Hao, Zhaodong; Cheng, Tielong; Zheng, Renhua; Xu, Haibin; Zhou, Yanwei; Li, Meiping; Lu, Fengjuan; Dong, Yini; Liu, Xin; Chen, Jinhui; Shi, Jisen

    2016-01-01

    Glyptostrobus pensilis, belonging to the monotypic genus Glyptostrobus (Family: Cupressaceae), is an ancient conifer that is naturally distributed in low-lying wet areas. Here, we report the complete chloroplast (cp) genome sequence (132,239 bp) of G. pensilis. The G. pensilis cp genome is similar in gene content, organization and genome structure to the sequenced cp genomes from other cupressophytes, especially with respect to the loss of the inverted repeat region A (IRA). Through phylogenetic analysis, we demonstrated that the genus Glyptostrobus is closely related to the genus Cryptomeria, supporting previous findings based on physiological characteristics. Since IRs play an important role in stabilize cp genome and conifer cp genomes lost different IR regions after splitting in two clades (cupressophytes and Pinaceae), we performed cp genome rearrangement analysis and found more extensive cp genome rearrangements among the species of cupressophytes relative to Pinaceae. Additional repeat analysis indicated that cupressophytes cp genomes contained less potential functional repeats, especially in Cupressaceae, compared with Pinaceae. These results suggested that dynamics of cp genome rearrangement in conifers differed since the two clades, Pinaceae and cupressophytes, lost IR copies independently and developed different repeats to complement the residual IRs. In addition, we identified 170 perfect simple sequence repeats that will be useful in future research focusing on the evolution of genetic diversity and conservation of genetic variation for this endangered species in the wild.

  12. Testing of a heterologous, wound- and insect-inducible promoter for functional genomics studies in conifer defense.

    PubMed

    Godard, Kimberley-Ann; Byun-McKay, Ashley; Levasseur, Caroline; Plant, Aine; Séguin, Armand; Bohlmann, Jörg

    2007-12-01

    Large-scale sequencing of conifer cDNAs and targeted molecular cloning have identified many putative conifer defense genes. Expression of many of these genes is induced in response to biotic stress and some may be expressed only in a few specialized tissues or cells. Proven functional genomics approaches to test these genes involve expression of proteins in Escherichia coli or yeast for biochemical characterization or constitutive over-expression in transformed plants. Plant transformation to test functions of insect-, wound- or pathogen-induced conifer defense genes would benefit from the use of an inducible expression system. We describe here the development of a heterologous, wound- and insect-inducible gene expression system for conifers using the potato proteinase inhibitor II (pinII)-promoter. Using pinII::GUS and pinII::(E)-alpha-bisabolene synthase expression constructs we demonstrate localized induced gene expression in white spruce seedlings (Picea glauca). Testing of these constructs in Arabidopsis thaliana and tobacco illustrates the additional potential of the pinII-promoter to be used in tests of gene function that involve cell-specific and systemic induction.

  13. Genomic Resources for Cancer Epidemiology

    Cancer.gov

    This page provides links to research resources, complied by the Epidemiology and Genomics Research Program, that may be of interest to genetic epidemiologists conducting cancer research, but is not exhaustive.

  14. Resource-dependent growth models for Sierran mixed-conifer saplings

    Treesearch

    S.W. Bigelow; M.P. North; W.R. Horwath

    2009-01-01

    Thinning to reduce wildfire hazard is a common management practice in frequent-fire forests of the American west, but it is uncertain whether projects will help regenerate fire-resistant, shade-intolerant pines. We studied naturally established saplings of six conifer species in mixed-conifer forest in northern California, USA to...

  15. NorWood: a gene expression resource for evo-devo studies of conifer wood development.

    PubMed

    Jokipii-Lukkari, Soile; Sundell, David; Nilsson, Ove; Hvidsten, Torgeir R; Street, Nathaniel R; Tuominen, Hannele

    2017-02-10

    The secondary xylem of conifers is composed mainly of tracheids that differ anatomically and chemically from angiosperm xylem cells. There is currently no high-spatial-resolution data available profiling gene expression during wood formation for any coniferous species, which limits insight into tracheid development. RNA-sequencing data from replicated, high-spatial-resolution section series throughout the cambial and woody tissues of Picea abies were used to generate the NorWood.conGenIE.org web resource, which facilitates exploration of the associated gene expression profiles and co-expression networks. Integration within PlantGenIE.org enabled a comparative regulomics analysis, revealing divergent co-expression networks between P. abies and the two angiosperm species Arabidopsis thaliana and Populus tremula for the secondary cell wall (SCW) master regulator NAC Class IIB transcription factors. The SCW cellulose synthase genes (CesAs) were located in the neighbourhoods of the NAC factors in A. thaliana and P. tremula, but not in P. abies. The NorWood co-expression network enabled identification of potential SCW CesA regulators in P. abies. The NorWood web resource represents a powerful community tool for generating evo-devo insights into the divergence of wood formation between angiosperms and gymnosperms and for advancing understanding of the regulation of wood development in P. abies.

  16. Phylogeny of seed plants based on all three genomic compartments: Extant gymnosperms are monophyletic and Gnetales' closest relatives are conifers

    PubMed Central

    Bowe, L. Michelle; Coat, Gwénaële; dePamphilis, Claude W.

    2000-01-01

    Efforts to resolve Darwin's “abominable mystery”—the origin of angiosperms—have led to the conclusion that Gnetales and various fossil groups are sister to angiosperms, forming the “anthophytes.” Morphological homologies, however, are difficult to interpret, and molecular data have not provided clear resolution of relationships among major groups of seed plants. We introduce two sequence data sets from slowly evolving mitochondrial genes, cox1 and atpA, which unambiguously reject the anthophyte hypothesis, favoring instead a close relationship between Gnetales and conifers. Parsimony- and likelihood-based analyses of plastid rbcL and nuclear 18S rDNA alone and with cox1 and atpA also strongly support a gnetophyte–conifer grouping. Surprisingly, three of four genes (all but nuclear rDNA) and combined three-genome analyses also suggest or strongly support Gnetales as derived conifers, sister to Pinaceae. Analyses with outgroups screened to avoid long branches consistently identify all gymnosperms as a monophyletic sister group to angiosperms. Combined three- and four-gene rooted analyses resolve the branching order for the remaining major groups—cycads separate from other gymnosperms first, followed by Ginkgo and then (Gnetales + Pinaceae) sister to a monophyletic group with all other conifer families. The molecular phylogeny strongly conflicts with current interpretations of seed plant morphology, and implies that many similarities between gnetophytes and angiosperms, such as “flower-like” reproductive structures and double fertilization, were independently derived, whereas other characters could emerge as synapomorphies for an expanded conifer group including Gnetales. An initial angiosperm–gymnosperm split implies a long stem lineage preceding the explosive Mesozoic radiation of flowering plants and suggests that angiosperm origins and homologies should be sought among extinct seed plant groups. PMID:10760278

  17. Rat Genome and Model Resources.

    PubMed

    Shimoyama, Mary; Smith, Jennifer R; Bryda, Elizabeth; Kuramoto, Takashi; Saba, Laura; Dwinell, Melinda

    2017-07-01

    Rats remain a major model for studying disease mechanisms and discovery, validation, and testing of new compounds to improve human health. The rat's value continues to grow as indicated by the more than 1.4 million publications (second to human) at PubMed documenting important discoveries using this model. Advanced sequencing technologies, genome modification techniques, and the development of embryonic stem cell protocols ensure the rat remains an important mammalian model for disease studies. The 2004 release of the reference genome has been followed by the production of complete genomes for more than two dozen individual strains utilizing NextGen sequencing technologies; their analyses have identified over 80 million variants. This explosion in genomic data has been accompanied by the ability to selectively edit the rat genome, leading to hundreds of new strains through multiple technologies. A number of resources have been developed to provide investigators with access to precision rat models, comprehensive datasets, and sophisticated software tools necessary for their research. Those profiled here include the Rat Genome Database, PhenoGen, Gene Editing Rat Resource Center, Rat Resource and Research Center, and the National BioResource Project for the Rat in Japan. © The Author 2017. Published by Oxford University Press.

  18. Slow but not low: genomic comparisons reveal slower evolutionary rate and higher dN/dS in conifers compared to angiosperms

    PubMed Central

    2012-01-01

    Background Comparative genomics can inform us about the processes of mutation and selection across diverse taxa. Among seed plants, gymnosperms have been lacking in genomic comparisons. Recent EST and full-length cDNA collections for two conifers, Sitka spruce (Picea sitchensis) and loblolly pine (Pinus taeda), together with full genome sequences for two angiosperms, Arabidopsis thaliana and poplar (Populus trichocarpa), offer an opportunity to infer the evolutionary processes underlying thousands of orthologous protein-coding genes in gymnosperms compared with an angiosperm orthologue set. Results Based upon pairwise comparisons of 3,723 spruce and pine orthologues, we found an average synonymous genetic distance (dS) of 0.191, and an average dN/dS ratio of 0.314. Using a fossil-established divergence time of 140 million years between spruce and pine, we extrapolated a nucleotide substitution rate of 0.68 × 10-9 synonymous substitutions per site per year. When compared to angiosperms, this indicates a dramatically slower rate of nucleotide substitution rates in conifers: on average 15-fold. Coincidentally, we found a three-fold higher dN/dS for the spruce-pine lineage compared to the poplar-Arabidopsis lineage. This joint occurrence of a slower evolutionary rate in conifers with higher dN/dS, and possibly positive selection, showcases the uniqueness of conifer genome evolution. Conclusions Our results are in line with documented reduced nucleotide diversity, conservative genome evolution and low rates of diversification in conifers on the one hand and numerous examples of local adaptation in conifers on the other hand. We propose that reduced levels of nucleotide mutation in large and long-lived conifer trees, coupled with large effective population size, were the main factors leading to slow substitution rates but retention of beneficial mutations. PMID:22264329

  19. Slow but not low: genomic comparisons reveal slower evolutionary rate and higher dN/dS in conifers compared to angiosperms.

    PubMed

    Buschiazzo, Emmanuel; Ritland, Carol; Bohlmann, Jörg; Ritland, Kermit

    2012-01-20

    Comparative genomics can inform us about the processes of mutation and selection across diverse taxa. Among seed plants, gymnosperms have been lacking in genomic comparisons. Recent EST and full-length cDNA collections for two conifers, Sitka spruce (Picea sitchensis) and loblolly pine (Pinus taeda), together with full genome sequences for two angiosperms, Arabidopsis thaliana and poplar (Populus trichocarpa), offer an opportunity to infer the evolutionary processes underlying thousands of orthologous protein-coding genes in gymnosperms compared with an angiosperm orthologue set. Based upon pairwise comparisons of 3,723 spruce and pine orthologues, we found an average synonymous genetic distance (dS) of 0.191, and an average dN/dS ratio of 0.314. Using a fossil-established divergence time of 140 million years between spruce and pine, we extrapolated a nucleotide substitution rate of 0.68 × 10(-9) synonymous substitutions per site per year. When compared to angiosperms, this indicates a dramatically slower rate of nucleotide substitution rates in conifers: on average 15-fold. Coincidentally, we found a three-fold higher dN/dS for the spruce-pine lineage compared to the poplar-Arabidopsis lineage. This joint occurrence of a slower evolutionary rate in conifers with higher dN/dS, and possibly positive selection, showcases the uniqueness of conifer genome evolution. Our results are in line with documented reduced nucleotide diversity, conservative genome evolution and low rates of diversification in conifers on the one hand and numerous examples of local adaptation in conifers on the other hand. We propose that reduced levels of nucleotide mutation in large and long-lived conifer trees, coupled with large effective population size, were the main factors leading to slow substitution rates but retention of beneficial mutations.

  20. Fungal genome resources at NCBI

    PubMed Central

    Robbertse, B.; Tatusova, T.

    2011-01-01

    The National Center for Biotechnology Information (NCBI) is well known for the nucleotide sequence archive, GenBank and sequence analysis tool BLAST. However, NCBI integrates many types of biomolecular data from variety of sources and makes it available to the scientific community as interactive web resources as well as organized releases of bulk data. These tools are available to explore and compare fungal genomes. Searching all databases with Fungi [organism] at http://www.ncbi.nlm.nih.gov/ is the quickest way to find resources of interest with fungal entries. Some tools though are resources specific and can be indirectly accessed from a particular database in the Entrez system. These include graphical viewers and comparative analysis tools such as TaxPlot, TaxMap and UniGene DDD (found via UniGene Homepage). Gene and BioProject pages also serve as portals to external data such as community annotation websites, BioGrid and UniProt. There are many different ways of accessing genomic data at NCBI. Depending on the focus and goal of research projects or the level of interest, a user would select a particular route for accessing genomic databases and resources. This review article describes methods of accessing fungal genome data and provides examples that illustrate the use of analysis tools. PMID:22737589

  1. Fungal genome resources at NCBI.

    PubMed

    Robbertse, B; Tatusova, T

    2011-09-01

    The National Center for Biotechnology Information (NCBI) is well known for the nucleotide sequence archive, GenBank and sequence analysis tool BLAST. However, NCBI integrates many types of biomolecular data from variety of sources and makes it available to the scientific community as interactive web resources as well as organized releases of bulk data. These tools are available to explore and compare fungal genomes. Searching all databases with Fungi [organism] at http://www.ncbi.nlm.nih.gov/ is the quickest way to find resources of interest with fungal entries. Some tools though are resources specific and can be indirectly accessed from a particular database in the Entrez system. These include graphical viewers and comparative analysis tools such as TaxPlot, TaxMap and UniGene DDD (found via UniGene Homepage). Gene and BioProject pages also serve as portals to external data such as community annotation websites, BioGrid and UniProt. There are many different ways of accessing genomic data at NCBI. Depending on the focus and goal of research projects or the level of interest, a user would select a particular route for accessing genomic databases and resources. This review article describes methods of accessing fungal genome data and provides examples that illustrate the use of analysis tools.

  2. A spruce gene map infers ancient plant genome reshuffling and subsequent slow evolution in the gymnosperm lineage leading to extant conifers

    PubMed Central

    2012-01-01

    Background Seed plants are composed of angiosperms and gymnosperms, which diverged from each other around 300 million years ago. While much light has been shed on the mechanisms and rate of genome evolution in flowering plants, such knowledge remains conspicuously meagre for the gymnosperms. Conifers are key representatives of gymnosperms and the sheer size of their genomes represents a significant challenge for characterization, sequencing and assembling. Results To gain insight into the macro-organisation and long-term evolution of the conifer genome, we developed a genetic map involving 1,801 spruce genes. We designed a statistical approach based on kernel density estimation to analyse gene density and identified seven gene-rich isochors. Groups of co-localizing genes were also found that were transcriptionally co-regulated, indicative of functional clusters. Phylogenetic analyses of 157 gene families for which at least two duplicates were mapped on the spruce genome indicated that ancient gene duplicates shared by angiosperms and gymnosperms outnumbered conifer-specific duplicates by a ratio of eight to one. Ancient duplicates were much more translocated within and among spruce chromosomes than conifer-specific duplicates, which were mostly organised in tandem arrays. Both high synteny and collinearity were also observed between the genomes of spruce and pine, two conifers that diverged more than 100 million years ago. Conclusions Taken together, these results indicate that much genomic evolution has occurred in the seed plant lineage before the split between gymnosperms and angiosperms, and that the pace of evolution of the genome macro-structure has been much slower in the gymnosperm lineage leading to extent conifers than that seen for the same period of time in flowering plants. This trend is largely congruent with the contrasted rates of diversification and morphological evolution observed between these two groups of seed plants. PMID:23102090

  3. Gramene database: navigating plant comparative genomics resources

    USDA-ARS?s Scientific Manuscript database

    Gramene (http://www.gramene.org) is an online, open source, curated resource for plant comparative genomics and pathway analysis designed to support researchers working in plant genomics, breeding, evolutionary biology, system biology, and metabolic engineering. It exploits phylogenetic relationship...

  4. Whole genome duplication in coast redwood (Sequoia sempervirens) and its implications for explaining the rarity of polyploidy in conifers.

    PubMed

    Scott, Alison Dawn; Stenz, Noah W M; Ingvarsson, Pär K; Baum, David A

    2016-07-01

    Polyploidy is common and an important evolutionary factor in most land plant lineages, but it is rare in gymnosperms. Coast redwood (Sequoia sempervirens) is one of just two polyploid conifer species and the only hexaploid. Evidence from fossil guard cell size suggests that polyploidy in Sequoia dates to the Eocene. Numerous hypotheses about the mechanism of polyploidy and parental genome donors have been proposed, based primarily on morphological and cytological data, but it remains unclear how Sequoia became polyploid and why this lineage overcame an apparent gymnosperm barrier to whole-genome duplication (WGD). We sequenced transcriptomes and used phylogenetic inference, Bayesian concordance analysis and paralog age distributions to resolve relationships among gene copies in hexaploid coast redwood and close relatives. Our data show that hexaploidy in coast redwood is best explained by autopolyploidy or, if there was allopolyploidy, it happened within the Californian redwood clade. We found that duplicate genes have more similar sequences than expected, given the age of the inferred polyploidization. Conflict between molecular and fossil estimates of WGD can be explained if diploidization occurred very slowly following polyploidization. We extrapolate from this to suggest that the rarity of polyploidy in gymnosperms may be due to slow diploidization in this clade. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.

  5. The Contribution of Short Repeats of Low Sequence Complexity to Large Conifer Genomes

    Treesearch

    A. Schmidt; R.L. Doudrick; J.S. Heslop-Harrison; T. Schmidt

    2000-01-01

    Abstract: The abundance and genomic organization of six simple sequence repeats, consisting of di-, tri-, and tetranucleotide sequence motifs, and a minisatellite repeat have been analyzed in different gymnosperms by Southern hybridization. Within the gymnosperm genomes investigated, the abundance and genomic organization of micro- and...

  6. MADS goes genomic in conifers: towards determining the ancestral set of MADS-box genes in seed plants

    PubMed Central

    Gramzow, Lydia; Weilandt, Lisa; Theißen, Günter

    2014-01-01

    Background and Aims MADS-box genes comprise a gene family coding for transcription factors. This gene family expanded greatly during land plant evolution such that the number of MADS-box genes ranges from one or two in green algae to around 100 in angiosperms. Given the crucial functions of MADS-box genes for nearly all aspects of plant development, the expansion of this gene family probably contributed to the increasing complexity of plants. However, the expansion of MADS-box genes during one important step of land plant evolution, namely the origin of seed plants, remains poorly understood due to the previous lack of whole-genome data for gymnosperms. Methods The newly available genome sequences of Picea abies, Picea glauca and Pinus taeda were used to identify the complete set of MADS-box genes in these conifers. In addition, MADS-box genes were identified in the growing number of transcriptomes available for gymnosperms. With these datasets, phylogenies were constructed to determine the ancestral set of MADS-box genes of seed plants and to infer the ancestral functions of these genes. Key Results Type I MADS-box genes are under-represented in gymnosperms and only a minimum of two Type I MADS-box genes have been present in the most recent common ancestor (MRCA) of seed plants. In contrast, a large number of Type II MADS-box genes were found in gymnosperms. The MRCA of extant seed plants probably possessed at least 11–14 Type II MADS-box genes. In gymnosperms two duplications of Type II MADS-box genes were found, such that the MRCA of extant gymnosperms had at least 14–16 Type II MADS-box genes. Conclusions The implied ancestral set of MADS-box genes for seed plants shows simplicity for Type I MADS-box genes and remarkable complexity for Type II MADS-box genes in terms of phylogeny and putative functions. The analysis of transcriptome data reveals that gymnosperm MADS-box genes are expressed in a great variety of tissues, indicating diverse roles of MADS

  7. MADS goes genomic in conifers: towards determining the ancestral set of MADS-box genes in seed plants.

    PubMed

    Gramzow, Lydia; Weilandt, Lisa; Theißen, Günter

    2014-11-01

    MADS-box genes comprise a gene family coding for transcription factors. This gene family expanded greatly during land plant evolution such that the number of MADS-box genes ranges from one or two in green algae to around 100 in angiosperms. Given the crucial functions of MADS-box genes for nearly all aspects of plant development, the expansion of this gene family probably contributed to the increasing complexity of plants. However, the expansion of MADS-box genes during one important step of land plant evolution, namely the origin of seed plants, remains poorly understood due to the previous lack of whole-genome data for gymnosperms. The newly available genome sequences of Picea abies, Picea glauca and Pinus taeda were used to identify the complete set of MADS-box genes in these conifers. In addition, MADS-box genes were identified in the growing number of transcriptomes available for gymnosperms. With these datasets, phylogenies were constructed to determine the ancestral set of MADS-box genes of seed plants and to infer the ancestral functions of these genes. Type I MADS-box genes are under-represented in gymnosperms and only a minimum of two Type I MADS-box genes have been present in the most recent common ancestor (MRCA) of seed plants. In contrast, a large number of Type II MADS-box genes were found in gymnosperms. The MRCA of extant seed plants probably possessed at least 11-14 Type II MADS-box genes. In gymnosperms two duplications of Type II MADS-box genes were found, such that the MRCA of extant gymnosperms had at least 14-16 Type II MADS-box genes. The implied ancestral set of MADS-box genes for seed plants shows simplicity for Type I MADS-box genes and remarkable complexity for Type II MADS-box genes in terms of phylogeny and putative functions. The analysis of transcriptome data reveals that gymnosperm MADS-box genes are expressed in a great variety of tissues, indicating diverse roles of MADS-box genes for the development of gymnosperms. This study is

  8. Mitochondrial Genome Recombination in the Zone of Contact Between Two Hybridizing Conifers

    PubMed Central

    Jaramillo-Correa, Juan P.; Bousquet, Jean

    2005-01-01

    Variation in mitochondrial DNA was surveyed at four gene loci in and around the zone of contact between two naturally hybridizing conifers, black spruce (Picea mariana) and red spruce (P. rubens) in northeastern North America. Most of the mtDNA diversity of these species was found in populations next to or into the zone of contact, where some individuals bore rare mitotypes intermediate between the common mitotypes observed in the allopatric areas of each species. Sequence analysis and tests for mtDNA recombination point to this phenomenon, rather than to recurrent mutation, as the most tenable hypothesis for the origin of these rare mitotypes. From the 10 mitotypes observed, at least 4 would be the product of recombination between 4 of the 5 putative ancestral mitotypes. Tests for cytonuclear disequilibrium and geographical structure of the putative recombinant mitotypes suggest that mtDNA recombination is not frequent and relatively recent on the geological time scale. mtDNA recombination would have been promoted by transient heteroplasmy due to leakage of paternal mtDNA since the Holocene secondary contact between the two species. PMID:16118197

  9. Genomic education resources for nursing faculty.

    PubMed

    Tonkin, Emma; Calzone, Kathleen; Jenkins, Jean; Lea, Dale; Prows, Cynthia

    2011-12-01

    The increasing recognition regarding the relevance of genomics across the scope of nursing healthcare practice has resulted in the drive to integrate appropriate genomic knowledge and skills into nurse education and training. In this final article of the series Genetics-Genomics and Nursing Education, we will look at genetic and genomic education resources and the factors that influence both their creation and use. In considering nurse education from faculty and student perspectives, four identified areas of need have been used as the organizing constructs: guidance (what should be taught and at what level of complexity); support and training; access to genetics professionals and service users; and quality resources. This paper sets out to address the following points: (a) why there is a need for quality genomics education resources to support nurse education; (b) what is required from a resource to make it "useful" for the user; and (c) how the quality and impact of a resource can be measured. While not exhaustive, information is provided to a number of globally accessible resources, along with detailed descriptions of selected teaching or learning tools. Strategies for evaluating the suitability of a resource and suggestions on how genomic resources can be used within nurse education are provided. The use of clinically relevant resources that link theory to professional practice and which meet predefined learning outcomes and practice indicators for nurse education and training will facilitate the integration of genomics into curricula by nurse faculty. Providing clinically meaningful education and training in genomics is central to enabling every nurse to develop the appropriate knowledge and skills in genomics in order to provide optimum care to individuals and families now, and to facilitate the integration of new information and technology as it becomes available across mainstream healthcare services. No claim to original US government works.

  10. Resources | Office of Cancer Genomics

    Cancer.gov

    OCG provides a variety of scientific and educational resources for both cancer researchers and members of the general public. These resources are divided into the following types: OCG-Supported Resources: Tools, databases, and reagents generated by initiated and completed OCG programs for researchers, educators, and students. (Note: Databases for current OCG programs are available through program-specific data matrices)

  11. Gramene 2013: Comparative plant genomics resources

    USDA-ARS?s Scientific Manuscript database

    Gramene (http://www.gramene.org) is a curated online resource for comparative functional genomics in crops and model plant species, currently hosting 27 fully and 10 partially sequenced reference genomes in its build number 38. Its strength derives from the application of a phylogenetic framework fo...

  12. Avianbase: a community resource for bird genomics.

    PubMed

    Eöry, Lél; Gilbert, M Thomas P; Li, Cai; Li, Bo; Archibald, Alan; Aken, Bronwen L; Zhang, Guojie; Jarvis, Erich; Flicek, Paul; Burt, David W

    2015-01-29

    Giving access to sequence and annotation data for genome assemblies is important because, while facilitating research, it places both assembly and annotation quality under scrutiny, resulting in improvements to both. Therefore we announce Avianbase, a resource for bird genomics, which provides access to data released by the Avian Phylogenomics Consortium.

  13. Gramene: a growing plant comparative genomics resource

    USDA-ARS?s Scientific Manuscript database

    Gramene (www.gramene.org) is a curated genetic, genomic and comparative genome analysis resource for the major crop species, such as rice, maize, wheat and many other plant (mainly grass) species. Gramene is an open-source project, with all data and software freely downloadable through the ftp site ...

  14. The coffee genome hub: a resource for coffee genomes

    PubMed Central

    Dereeper, Alexis; Bocs, Stéphanie; Rouard, Mathieu; Guignon, Valentin; Ravel, Sébastien; Tranchant-Dubreuil, Christine; Poncet, Valérie; Garsmeur, Olivier; Lashermes, Philippe; Droc, Gaëtan

    2015-01-01

    The whole genome sequence of Coffea canephora, the perennial diploid species known as Robusta, has been recently released. In the context of the C. canephora genome sequencing project and to support post-genomics efforts, we developed the Coffee Genome Hub (http://coffee-genome.org/), an integrative genome information system that allows centralized access to genomics and genetics data and analysis tools to facilitate translational and applied research in coffee. We provide the complete genome sequence of C. canephora along with gene structure, gene product information, metabolism, gene families, transcriptomics, syntenic blocks, genetic markers and genetic maps. The hub relies on generic software (e.g. GMOD tools) for easy querying, visualizing and downloading research data. It includes a Genome Browser enhanced by a Community Annotation System, enabling the improvement of automatic gene annotation through an annotation editor. In addition, the hub aims at developing interoperability among other existing South Green tools managing coffee data (phylogenomics resources, SNPs) and/or supporting data analyses with the Galaxy workflow manager. PMID:25392413

  15. The coffee genome hub: a resource for coffee genomes.

    PubMed

    Dereeper, Alexis; Bocs, Stéphanie; Rouard, Mathieu; Guignon, Valentin; Ravel, Sébastien; Tranchant-Dubreuil, Christine; Poncet, Valérie; Garsmeur, Olivier; Lashermes, Philippe; Droc, Gaëtan

    2015-01-01

    The whole genome sequence of Coffea canephora, the perennial diploid species known as Robusta, has been recently released. In the context of the C. canephora genome sequencing project and to support post-genomics efforts, we developed the Coffee Genome Hub (http://coffee-genome.org/), an integrative genome information system that allows centralized access to genomics and genetics data and analysis tools to facilitate translational and applied research in coffee. We provide the complete genome sequence of C. canephora along with gene structure, gene product information, metabolism, gene families, transcriptomics, syntenic blocks, genetic markers and genetic maps. The hub relies on generic software (e.g. GMOD tools) for easy querying, visualizing and downloading research data. It includes a Genome Browser enhanced by a Community Annotation System, enabling the improvement of automatic gene annotation through an annotation editor. In addition, the hub aims at developing interoperability among other existing South Green tools managing coffee data (phylogenomics resources, SNPs) and/or supporting data analyses with the Galaxy workflow manager. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  16. Conifer DBMagic: A database housing multiple de novo transcriptome assemblies for twelve diverse conifer species

    Treesearch

    W. Walter Lorenz; Savavanaraj Ayyampalayam; John M. Bordeaux; Glenn T. Howe; Kathleen D. Jermstad; David B. Neale; Deborah L. Rogers; Jeffrey F.D. Dean

    2012-01-01

    Conifers comprise an ancient and widespread plant lineage of enormous commercial and ecological value. However, compared to model woody angiosperms, such as Populus and Eucalyptus, our understanding of conifers remains quite limited at a genomic level. Large genome sizes (10,000-40,000 Mbp) and large amounts of repetitive DNA...

  17. Genomics and Bioinformatics Resources for Crop Improvement

    PubMed Central

    Mochida, Keiichi; Shinozaki, Kazuo

    2010-01-01

    Recent remarkable innovations in platforms for omics-based research and application development provide crucial resources to promote research in model and applied plant species. A combinatorial approach using multiple omics platforms and integration of their outcomes is now an effective strategy for clarifying molecular systems integral to improving plant productivity. Furthermore, promotion of comparative genomics among model and applied plants allows us to grasp the biological properties of each species and to accelerate gene discovery and functional analyses of genes. Bioinformatics platforms and their associated databases are also essential for the effective design of approaches making the best use of genomic resources, including resource integration. We review recent advances in research platforms and resources in plant omics together with related databases and advances in technology. PMID:20208064

  18. Developing genomic resources for the apiaceae

    USDA-ARS?s Scientific Manuscript database

    The Apiaceae family includes carrot, celery, cilantro, dill, fennel and numerous other spice and medicinal crops. Carrot is the most economically important member of the Apiaceae with an annual value of $600 M in the United States alone. There are few genomic resources for carrot or other Apiaceae, ...

  19. Genome-scale resources for Thermoanaerobacterium saccharolyticum

    DOE PAGES

    Currie, Devin H.; Raman, Babu; Gowen, Christopher M.; ...

    2015-06-26

    Thermoanaerobacterium saccharolyticum is a hemicellulose-degrading thermophilic anaerobe that was previously engineered to produce ethanol at high yield. For this research, a major project was undertaken to develop this organism into an industrial biocatalyst, but the lack of genome information and resources were recognized early on as a key limitation.

  20. Seed plant phylogeny inferred from all three plant genomes: Monophyly of extant gymnosperms and origin of Gnetales from conifers

    PubMed Central

    Chaw, Shu-Miaw; Parkinson, Christopher L.; Cheng, Yuchang; Vincent, Thomas M.; Palmer, Jeffrey D.

    2000-01-01

    Phylogenetic relationships among the five groups of extant seed plants are presently quite unclear. For example, morphological studies consistently identify the Gnetales as the extant sister group to angiosperms (the so-called “anthophyte” hypothesis), whereas a number of molecular studies recover gymnosperm monophyly, and few agree with the morphology-based placement of Gnetales. To better resolve these and other unsettled issues, we have generated a new molecular data set of mitochondrial small subunit rRNA sequences, and have analyzed these data together with comparable data sets for the nuclear small subunit rRNA gene and the chloroplast rbcL gene. All nuclear analyses strongly ally Gnetales with a monophyletic conifers, whereas all mitochondrial analyses and those chloroplast analyses that take into account saturation of third-codon position transitions actually place Gnetales within conifers, as the sister group to the Pinaceae. Combined analyses of all three genes strongly support this latter relationship, which to our knowledge has never been suggested before. The combined analyses also strongly support monophyly of extant gymnosperms, with cycads identified as the basal-most group of gymnosperms, Ginkgo as the next basal, and all conifers except for Pinaceae as sister to the Gnetales + Pinaceae clade. According to these findings, the Gnetales may be viewed as extremely divergent conifers, and the many morphological similarities between angiosperms and Gnetales (e.g., double fertilization and flower-like reproductive structures) arose independently. PMID:10760277

  1. Saccharomyces Genome Database: the genomics resource of budding yeast

    PubMed Central

    Cherry, J. Michael; Hong, Eurie L.; Amundsen, Craig; Balakrishnan, Rama; Binkley, Gail; Chan, Esther T.; Christie, Karen R.; Costanzo, Maria C.; Dwight, Selina S.; Engel, Stacia R.; Fisk, Dianna G.; Hirschman, Jodi E.; Hitz, Benjamin C.; Karra, Kalpana; Krieger, Cynthia J.; Miyasato, Stuart R.; Nash, Rob S.; Park, Julie; Skrzypek, Marek S.; Simison, Matt; Weng, Shuai; Wong, Edith D.

    2012-01-01

    The Saccharomyces Genome Database (SGD, http://www.yeastgenome.org) is the community resource for the budding yeast Saccharomyces cerevisiae. The SGD project provides the highest-quality manually curated information from peer-reviewed literature. The experimental results reported in the literature are extracted and integrated within a well-developed database. These data are combined with quality high-throughput results and provided through Locus Summary pages, a powerful query engine and rich genome browser. The acquisition, integration and retrieval of these data allow SGD to facilitate experimental design and analysis by providing an encyclopedia of the yeast genome, its chromosomal features, their functions and interactions. Public access to these data is provided to researchers and educators via web pages designed for optimal ease of use. PMID:22110037

  2. Gramene 2013: comparative plant genomics resources

    PubMed Central

    Monaco, Marcela K.; Stein, Joshua; Naithani, Sushma; Wei, Sharon; Dharmawardhana, Palitha; Kumari, Sunita; Amarasinghe, Vindhya; Youens-Clark, Ken; Thomason, James; Preece, Justin; Pasternak, Shiran; Olson, Andrew; Jiao, Yinping; Lu, Zhenyuan; Bolser, Dan; Kerhornou, Arnaud; Staines, Dan; Walts, Brandon; Wu, Guanming; D’Eustachio, Peter; Haw, Robin; Croft, David; Kersey, Paul J.; Stein, Lincoln; Jaiswal, Pankaj; Ware, Doreen

    2014-01-01

    Gramene (http://www.gramene.org) is a curated online resource for comparative functional genomics in crops and model plant species, currently hosting 27 fully and 10 partially sequenced reference genomes in its build number 38. Its strength derives from the application of a phylogenetic framework for genome comparison and the use of ontologies to integrate structural and functional annotation data. Whole-genome alignments complemented by phylogenetic gene family trees help infer syntenic and orthologous relationships. Genetic variation data, sequences and genome mappings available for 10 species, including Arabidopsis, rice and maize, help infer putative variant effects on genes and transcripts. The pathways section also hosts 10 species-specific metabolic pathways databases developed in-house or by our collaborators using Pathway Tools software, which facilitates searches for pathway, reaction and metabolite annotations, and allows analyses of user-defined expression datasets. Recently, we released a Plant Reactome portal featuring 133 curated rice pathways. This portal will be expanded for Arabidopsis, maize and other plant species. We continue to provide genetic and QTL maps and marker datasets developed by crop researchers. The project provides a unique community platform to support scientific research in plant genomics including studies in evolution, genetics, plant breeding, molecular biology, biochemistry and systems biology. PMID:24217918

  3. The perennial ryegrass GenomeZipper: targeted use of genome resources for comparative grass genomics.

    PubMed

    Pfeifer, Matthias; Martis, Mihaela; Asp, Torben; Mayer, Klaus F X; Lübberstedt, Thomas; Byrne, Stephen; Frei, Ursula; Studer, Bruno

    2013-02-01

    Whole-genome sequences established for model and major crop species constitute a key resource for advanced genomic research. For outbreeding forage and turf grass species like ryegrasses (Lolium spp.), such resources have yet to be developed. Here, we present a model of the perennial ryegrass (Lolium perenne) genome on the basis of conserved synteny to barley (Hordeum vulgare) and the model grass genome Brachypodium (Brachypodium distachyon) as well as rice (Oryza sativa) and sorghum (Sorghum bicolor). A transcriptome-based genetic linkage map of perennial ryegrass served as a scaffold to establish the chromosomal arrangement of syntenic genes from model grass species. This scaffold revealed a high degree of synteny and macrocollinearity and was then utilized to anchor a collection of perennial ryegrass genes in silico to their predicted genome positions. This resulted in the unambiguous assignment of 3,315 out of 8,876 previously unmapped genes to the respective chromosomes. In total, the GenomeZipper incorporates 4,035 conserved grass gene loci, which were used for the first genome-wide sequence divergence analysis between perennial ryegrass, barley, Brachypodium, rice, and sorghum. The perennial ryegrass GenomeZipper is an ordered, information-rich genome scaffold, facilitating map-based cloning and genome assembly in perennial ryegrass and closely related Poaceae species. It also represents a milestone in describing synteny between perennial ryegrass and fully sequenced model grass genomes, thereby increasing our understanding of genome organization and evolution in the most important temperate forage and turf grass species.

  4. MycoCosm, an Integrated Fungal Genomics Resource

    SciTech Connect

    Shabalov, Igor; Grigoriev, Igor

    2012-03-16

    MycoCosm is a web-based interactive fungal genomics resource, which was first released in March 2010, in response to an urgent call from the fungal community for integration of all fungal genomes and analytical tools in one place (Pan-fungal data resources meeting, Feb 21-22, 2010, Alexandria, VA). MycoCosm integrates genomics data and analysis tools to navigate through over 100 fungal genomes sequenced at JGI and elsewhere. This resource allows users to explore fungal genomes in the context of both genome-centric analysis and comparative genomics, and promotes user community participation in data submission, annotation and analysis. MycoCosm has over 4500 unique visitors/month or 35000+ visitors/year as well as hundreds of registered users contributing their data and expertise to this resource. Its scalable architecture allows significant expansion of the data expected from JGI Fungal Genomics Program, its users, and integration with external resources used by fungal community.

  5. A simple and rapid method for isolation of high quality genomic DNA from fruit trees and conifers using PVP.

    PubMed Central

    Kim, C S; Lee, C H; Shin, J S; Chung, Y S; Hyung, N I

    1997-01-01

    Because DNA degradation is mediated by secondary plant products such as phenolic terpenoids, the isolation of high quality DNA from plants containing a high content of polyphenolics has been a difficult problem. We demonstrate an easy extraction process by modifying several existing ones. Using this process we have found it possible to isolate DNAs from four fruit trees, grape (Vitis spp.), apple (Malus spp.), pear (Pyrus spp.) and persimmon (Diospyros spp.) and four species of conifer, Pinus densiflora, Pinus koraiensis,Taxus cuspidata and Juniperus chinensis within a few hours. Compared with the existing method, we have isolated high quality intact DNAs (260/280 = 1.8-2.0) routinely yielding 250-500 ng/microl (total 7.5-15 microg DNA from four to five tissue discs). PMID:9023124

  6. Success stories in genomic medicine from resource-limited countries.

    PubMed

    Mitropoulos, Konstantinos; Al Jaibeji, Hayat; Forero, Diego A; Laissue, Paul; Wonkam, Ambroise; Lopez-Correa, Catalina; Mohamed, Zahurin; Chantratita, Wasun; Lee, Ming Ta Michael; Llerena, Adrian; Brand, Angela; Ali, Bassam R; Patrinos, George P

    2015-06-18

    In recent years, the translation of genomic discoveries into mainstream medical practice and public health has gained momentum, facilitated by the advent of new technologies. However, there are often major discrepancies in the pace of implementation of genomic medicine between developed and developing/resource-limited countries. The main reason does not only lie in the limitation of resources but also in the slow pace of adoption of the new findings and the poor understanding of the potential that this new discipline offers to rationalize medical diagnosis and treatment. Here, we present and critically discuss examples from the successful implementation of genomic medicine in resource-limited countries, focusing on pharmacogenomics, genome informatics, and public health genomics, emphasizing in the latter case genomic education, stakeholder analysis, and economics in pharmacogenomics. These examples can be considered as model cases and be readily replicated for the wide implementation of pharmacogenomics and genomic medicine in other resource-limited environments.

  7. Update on RefSeq microbial genomes resources

    PubMed Central

    Tatusova, Tatiana; Ciufo, Stacy; Federhen, Scott; Fedorov, Boris; McVeigh, Richard; O'Neill, Kathleen; Tolstoy, Igor; Zaslavsky, Leonid

    2015-01-01

    NCBI RefSeq genome collection http://www.ncbi.nlm.nih.gov/genome represents all three major domains of life: Eukarya, Bacteria and Archaea as well as Viruses. Prokaryotic genome sequences are the most rapidly growing part of the collection. During the year of 2014 more than 10 000 microbial genome assemblies have been publicly released bringing the total number of prokaryotic genomes close to 30 000. We continue to improve the quality and usability of the microbial genome resources by providing easy access to the data and the results of the pre-computed analysis, and improving analysis and visualization tools. A number of improvements have been incorporated into the Prokaryotic Genome Annotation Pipeline. Several new features have been added to RefSeq prokaryotic genomes data processing pipeline including the calculation of genome groups (clades) and the optimization of protein clusters generation using pan-genome approach. PMID:25510495

  8. The Brachypodium genome sequence: a resource for oat genomics research

    USDA-ARS?s Scientific Manuscript database

    Oat (Avena sativa) is an important cereal crop used as both an animal feed and for human consumption. Genetic and genomic research on oat is hindered because it is hexaploid and possesses a large (13 Gb) genome. Diploid Avena relatives have been employed for genetic and genomic studies, but only mod...

  9. Sources for Comparative Studies of Placentation. II. Genomic Resources

    PubMed Central

    Wildman, Derek E.

    2008-01-01

    The genomes of dozens of placental mammal species are now publicly available. These genome sequences have the potential to provide insight into the development and evolution of the placenta. In particular, the variable anatomy of the placenta has likely been affected by natural selection on the genomes of living and extinct mammals. In this note the current availability of mammal genome sequences is reviewed, and strengths and limitations of these data are discussed. Additionally, museums, zoos, and commercial entities are available to provide genomic resources to the placental research community. Recommendations for tissue storage conditions of placentas in genomic research are given. PMID:18155141

  10. Intraspecific growth and functional leaf trait responses to natural soil resource gradients for conifer species with contrasting leaf habit.

    PubMed

    Walters, Michael B; Gerlach, John P

    2013-03-01

    Interspecific relationships among species mean leaf traits, performance and species resource/climate distributions help provide the foundation for a predictive, functionally based plant ecology. Intraspecific responses of leaf traits and performance to resource gradients and how these vary among species may be equally important but have received less attention. Here, we examine relationships between proxies of soil resource availability, leaf traits and growth (height at 25 years, SI25) for winter deciduous Larix decidua Mill. and evergreen Pinus resinosa Ait. trees distributed over soil resource gradients in the Great Lakes region of North America. We predicted that (i) leaf trait responses to soil resources within species will be similar to reported distributions of mean leaf traits over soil resource gradients among species; (ii) soil resource-related variation in leaf traits can help explain SI25; and (iii) SI25 will be greater for Larix than Pinus at higher soil resources and greater for Pinus than Larix at lower soil resources and this pattern will be associated with species differences in leaf trait responses to soil resources. Among the measured leaf traits (live N, Mg, Ca, K, P, and Mn, litter N, N resorption, carbon isotope discrimination, specific leaf area, lifespan), soil resources only impacted live and litter N for both species and K for Pinus. In turn, only the leaf traits responsive to soil resources affected SI25 in the expected manner. Larix had greater SI25 than Pinus across soil resource gradients and both species had similar growth and leaf trait sensitivities to resources. In summary: (i) several leaf traits reported to be associated with performance and edaphic distributions across species were, within species, unresponsive to nitrogen and water availability and unrelated to growth; (ii) leaf N showed high plasticity to soil resources and this plasticity was functionally relevant to growth over its entire range of response; (iii) large

  11. BrucellaBase: Genome information resource.

    PubMed

    Sankarasubramanian, Jagadesan; Vishnu, Udayakumar S; Khader, L K M Abdul; Sridhar, Jayavel; Gunasekaran, Paramasamy; Rajendhran, Jeyaprakash

    2016-09-01

    Brucella sp. causes a major zoonotic disease, brucellosis. Brucella belongs to the family Brucellaceae under the order Rhizobiales of Alphaproteobacteria. We present BrucellaBase, a web-based platform, providing features of a genome database together with unique analysis tools. We have developed a web version of the multilocus sequence typing (MLST) (Whatmore et al., 2007) and phylogenetic analysis of Brucella spp. BrucellaBase currently contains genome data of 510 Brucella strains along with the user interfaces for BLAST, VFDB, CARD, pairwise genome alignment and MLST typing. Availability of these tools will enable the researchers interested in Brucella to get meaningful information from Brucella genome sequences. BrucellaBase will regularly be updated with new genome sequences, new features along with improvements in genome annotations. BrucellaBase is available online at http://www.dbtbrucellosis.in/brucellabase.html or http://59.99.226.203/brucellabase/homepage.html. Copyright © 2016 Elsevier B.V. All rights reserved.

  12. Plant database resources at The Institute for Genomic Research.

    PubMed

    Chan, Agnes P; Rabinowicz, Pablo D; Quackenbush, John; Buell, C Robin; Town, Chris D

    2007-01-01

    With the completion of the genome sequences of the model plants Arabidopsis and rice, and the continuing sequencing efforts of other economically important crop plants, an unprecedented amount of genome sequence data is now available for large-scale genomics studies and analyses, such as the identification and discovery of novel genes, comparative genomics, and functional genomics. Efficient utilization of these large data sets is critically dependent on the ease of access and organization of the data. The plant databases at The Institute for Genomic Research (TIGR) have been set up to maintain various data types including genomic sequence, annotation and analyses, expressed transcript assemblies and analyses, and gene expression profiles from microarray studies. We present here an overview of the TIGR database resources for plant genomics and describe methods to access the data.

  13. HopBase: a unified resource for Humulus genomics.

    PubMed

    Hill, Steven T; Sudarsanam, Ramcharan; Henning, John; Hendrix, David

    2017-01-01

    Hop (Humulus lupulus L. var lupulus) is a dioecious plant of worldwide significance, used primarily for bittering and flavoring in brewing beer. Studies on the medicinal properties of several unique compounds produced by hop have led to additional interest from pharmacy and healthcare industries as well as livestock production as a natural antibiotic. Genomic research in hop has resulted a published draft genome and transcriptome assemblies. As research into the genomics of hop has gained interest, there is a critical need for centralized online genomic resources. To support the growing research community, we report the development of an online resource "HopBase.org." In addition to providing a gene annotation to the existing Shinsuwase draft genome, HopBase makes available genome assemblies and annotations for both the cultivar "Teamaker" and male hop accession number USDA 21422M. These genome assemblies, gene annotations, along with other common data, coupled with a genome browser and BLAST database enable the hop community to enter the genomic age. The HopBase genomic resource is accessible at http://hopbase.org and http://hopbase.cgrb.oregonstate.edu. © The Author 2017. Published by Oxford University Press.

  14. CottonDB: A resource for cotton genome research

    USDA-ARS?s Scientific Manuscript database

    CottonDB (http://cottondb.org/) is a database and web resource for cotton genomic and genetic research. Created in 1995, CottonDB was among the first plant genome databases established by the USDA-ARS. Accessed through a website interface, the database aims to be a convenient, inclusive medium of ...

  15. Gramene 2016: comparative plant genomics and pathway resources

    USDA-ARS?s Scientific Manuscript database

    Gramene (http://www.gramene.org) is an online resource for comparative functional genomics in crops and model plant species. Its two main frameworks are genomes (collaboration with Ensembl Plants) and pathways (The Plant Reactome and archival BioCyc databases). Since our last NAR update, the data...

  16. The KEGG resource for deciphering the genome.

    PubMed

    Kanehisa, Minoru; Goto, Susumu; Kawashima, Shuichi; Okuno, Yasushi; Hattori, Masahiro

    2004-01-01

    A grand challenge in the post-genomic era is a complete computer representation of the cell and the organism, which will enable computational prediction of higher-level complexity of cellular processes and organism behavior from genomic information. Toward this end we have been developing a knowledge-based approach for network prediction, which is to predict, given a complete set of genes in the genome, the protein interaction networks that are responsible for various cellular processes. KEGG at http://www.genome.ad.jp/kegg/ is the reference knowledge base that integrates current knowledge on molecular interaction networks such as pathways and complexes (PATHWAY database), information about genes and proteins generated by genome projects (GENES/SSDB/KO databases) and information about biochemical compounds and reactions (COMPOUND/GLYCAN/REACTION databases). These three types of database actually represent three graph objects, called the protein network, the gene universe and the chemical universe. New efforts are being made to abstract knowledge, both computationally and manually, about ortholog clusters in the KO (KEGG Orthology) database, and to collect and analyze carbohydrate structures in the GLYCAN database.

  17. The KEGG resource for deciphering the genome

    PubMed Central

    Kanehisa, Minoru; Goto, Susumu; Kawashima, Shuichi; Okuno, Yasushi; Hattori, Masahiro

    2004-01-01

    A grand challenge in the post-genomic era is a complete computer representation of the cell and the organism, which will enable computational prediction of higher-level complexity of cellular processes and organism behavior from genomic information. Toward this end we have been developing a knowledge-based approach for network prediction, which is to predict, given a complete set of genes in the genome, the protein interaction networks that are responsible for various cellular processes. KEGG at http://www.genome.ad.jp/kegg/ is the reference knowledge base that integrates current knowledge on molecular interaction networks such as pathways and complexes (PATHWAY database), information about genes and proteins generated by genome projects (GENES/SSDB/KO databases) and information about biochemical compounds and reactions (COMPOUND/GLYCAN/REACTION databases). These three types of database actually represent three graph objects, called the protein network, the gene universe and the chemical universe. New efforts are being made to abstract knowledge, both computationally and manually, about ortholog clusters in the KO (KEGG Orthology) database, and to collect and analyze carbohydrate structures in the GLYCAN database. PMID:14681412

  18. The Comprehensive Phytopathogen Genomics Resource: a web-based resource for data-mining plant pathogen genomes.

    PubMed

    Hamilton, John P; Neeno-Eckwall, Eric C; Adhikari, Bishwo N; Perna, Nicole T; Tisserat, Ned; Leach, Jan E; Lévesque, C André; Buell, C Robin

    2011-01-01

    The Comprehensive Phytopathogen Genomics Resource (CPGR) provides a web-based portal for plant pathologists and diagnosticians to view the genome and trancriptome sequence status of 806 bacterial, fungal, oomycete, nematode, viral and viroid plant pathogens. Tools are available to search and analyze annotated genome sequences of 74 bacterial, fungal and oomycete pathogens. Oomycete and fungal genomes are obtained directly from GenBank, whereas bacterial genome sequences are downloaded from the A Systematic Annotation Package (ASAP) database that provides curation of genomes using comparative approaches. Curated lists of bacterial genes relevant to pathogenicity and avirulence are also provided. The Plant Pathogen Transcript Assemblies Database provides annotated assemblies of the transcribed regions of 82 eukaryotic genomes from publicly available single pass Expressed Sequence Tags. Data-mining tools are provided along with tools to create candidate diagnostic markers, an emerging use for genomic sequence data in plant pathology. The Plant Pathogen Ribosomal DNA (rDNA) database is a resource for pathogens that lack genome or transcriptome data sets and contains 131 755 rDNA sequences from GenBank for 17 613 species identified as plant pathogens and related genera. Database URL: http://cpgr.plantbiology.msu.edu.

  19. Update on RefSeq microbial genomes resources.

    PubMed

    Tatusova, Tatiana; Ciufo, Stacy; Federhen, Scott; Fedorov, Boris; McVeigh, Richard; O'Neill, Kathleen; Tolstoy, Igor; Zaslavsky, Leonid

    2015-01-01

    NCBI RefSeq genome collection http://www.ncbi.nlm.nih.gov/genome represents all three major domains of life: Eukarya, Bacteria and Archaea as well as Viruses. Prokaryotic genome sequences are the most rapidly growing part of the collection. During the year of 2014 more than 10,000 microbial genome assemblies have been publicly released bringing the total number of prokaryotic genomes close to 30,000. We continue to improve the quality and usability of the microbial genome resources by providing easy access to the data and the results of the pre-computed analysis, and improving analysis and visualization tools. A number of improvements have been incorporated into the Prokaryotic Genome Annotation Pipeline. Several new features have been added to RefSeq prokaryotic genomes data processing pipeline including the calculation of genome groups (clades) and the optimization of protein clusters generation using pan-genome approach. Published by Oxford University Press on behalf of Nucleic Acids Research 2014. This work is written by US Government employees and is in the public domain in the US.

  20. Fungal genetic resource centres and the genomic challenge.

    PubMed

    Ryan, Matthew J; Smith, David

    2004-12-01

    Fungal research and education has for many years been supported by public service genetic resource centres, whose roles have been to maintain, preserve and supply living cultures to the research community. In the genomic era, genetic resource centres are perhaps more important than ever before. The cultures held, many of which are described and validated by expert biosystematists, are valuable resources for the future. There is a need to supply genomic and proteomic research programmes with fully characterised organisms, as usage of organisms from unreliable sources can prove disastrous, not least in economical terms. However, mycologists often require more than just the organisms, for example, their associated information is vital for bioinformatic applications and some researchers may only require genomic DNA from the organism rather than the organism per se. Genetic resource centres are continually adapting to meet the needs of their users and the wider mycological research community, this associated with OECD international initiatives should ensure they exist to support research for many years to come. This review considers the impact of such initiatives, the current roles of fungal genetic resource centres, the mechanisms used to preserve organisms in a stable manner and the range of resources that are offered for genomic research.

  1. Genomic resources for the study of neuropsychiatric disorders.

    PubMed

    Senthil, G; Dutka, T; Bingaman, L; Lehner, T

    2017-03-21

    The National Institute of Mental Health (NIMH) has made sustained investments in the development of genomic resources over the last two decades. These investments have led to the development of the largest biorepository for psychiatric genetics as a centralized national resource. In the realm of genomic resources, NIMH has been supporting large team science (TS) consortia focused on gene discovery, fine mapping of loci, and functional genomics using state-of-the-art technologies. The scientific output from these efforts has not only begun to transform our understanding of the genetic architecture of neuropsychiatric disorders, but it has also led to a broader cultural change among the investigator community towards deeper collaborations and broad pre-publication sharing of data and resources. The NIMH supported efforts have led to a vast increase in the amount of genetic and genomic resources available to the mental health research community. Here we provide an account of the existing resources and estimates of the scale and scope of what will be available in the near future. All biosamples and data described are intended for broad sharing with researchers worldwide, as allowed by the subject consent and applicable laws.Molecular Psychiatry advance online publication, 21 March 2017; doi:10.1038/mp.2017.29.

  2. The Perennial Ryegrass GenomeZipper: Targeted Use of Genome Resources for Comparative Grass Genomics1[C][W

    PubMed Central

    Pfeifer, Matthias; Martis, Mihaela; Asp, Torben; Mayer, Klaus F.X.; Lübberstedt, Thomas; Byrne, Stephen; Frei, Ursula; Studer, Bruno

    2013-01-01

    Whole-genome sequences established for model and major crop species constitute a key resource for advanced genomic research. For outbreeding forage and turf grass species like ryegrasses (Lolium spp.), such resources have yet to be developed. Here, we present a model of the perennial ryegrass (Lolium perenne) genome on the basis of conserved synteny to barley (Hordeum vulgare) and the model grass genome Brachypodium (Brachypodium distachyon) as well as rice (Oryza sativa) and sorghum (Sorghum bicolor). A transcriptome-based genetic linkage map of perennial ryegrass served as a scaffold to establish the chromosomal arrangement of syntenic genes from model grass species. This scaffold revealed a high degree of synteny and macrocollinearity and was then utilized to anchor a collection of perennial ryegrass genes in silico to their predicted genome positions. This resulted in the unambiguous assignment of 3,315 out of 8,876 previously unmapped genes to the respective chromosomes. In total, the GenomeZipper incorporates 4,035 conserved grass gene loci, which were used for the first genome-wide sequence divergence analysis between perennial ryegrass, barley, Brachypodium, rice, and sorghum. The perennial ryegrass GenomeZipper is an ordered, information-rich genome scaffold, facilitating map-based cloning and genome assembly in perennial ryegrass and closely related Poaceae species. It also represents a milestone in describing synteny between perennial ryegrass and fully sequenced model grass genomes, thereby increasing our understanding of genome organization and evolution in the most important temperate forage and turf grass species. PMID:23184232

  3. proGenomes: a resource for consistent functional and taxonomic annotations of prokaryotic genomes

    PubMed Central

    Mende, Daniel R.; Letunic, Ivica; Huerta-Cepas, Jaime; Li, Simone S.; Forslund, Kristoffer; Sunagawa, Shinichi; Bork, Peer

    2017-01-01

    The availability of microbial genomes has opened many new avenues of research within microbiology. This has been driven primarily by comparative genomics approaches, which rely on accurate and consistent characterization of genomic sequences. It is nevertheless difficult to obtain consistent taxonomic and integrated functional annotations for defined prokaryotic clades. Thus, we developed proGenomes, a resource that provides user-friendly access to currently 25 038 high-quality genomes whose sequences and consistent annotations can be retrieved individually or by taxonomic clade. These genomes are assigned to 5306 consistent and accurate taxonomic species clusters based on previously established methodology. proGenomes also contains functional information for almost 80 million protein-coding genes, including a comprehensive set of general annotations and more focused annotations for carbohydrate-active enzymes and antibiotic resistance genes. Additionally, broad habitat information is provided for many genomes. All genomes and associated information can be downloaded by user-selected clade or multiple habitat-specific sets of representative genomes. We expect that the availability of high-quality genomes with comprehensive functional annotations will promote advances in clinical microbial genomics, functional evolution and other subfields of microbiology. proGenomes is available at http://progenomes.embl.de. PMID:28053165

  4. proGenomes: a resource for consistent functional and taxonomic annotations of prokaryotic genomes.

    PubMed

    Mende, Daniel R; Letunic, Ivica; Huerta-Cepas, Jaime; Li, Simone S; Forslund, Kristoffer; Sunagawa, Shinichi; Bork, Peer

    2017-01-04

    The availability of microbial genomes has opened many new avenues of research within microbiology. This has been driven primarily by comparative genomics approaches, which rely on accurate and consistent characterization of genomic sequences. It is nevertheless difficult to obtain consistent taxonomic and integrated functional annotations for defined prokaryotic clades. Thus, we developed proGenomes, a resource that provides user-friendly access to currently 25 038 high-quality genomes whose sequences and consistent annotations can be retrieved individually or by taxonomic clade. These genomes are assigned to 5306 consistent and accurate taxonomic species clusters based on previously established methodology. proGenomes also contains functional information for almost 80 million protein-coding genes, including a comprehensive set of general annotations and more focused annotations for carbohydrate-active enzymes and antibiotic resistance genes. Additionally, broad habitat information is provided for many genomes. All genomes and associated information can be downloaded by user-selected clade or multiple habitat-specific sets of representative genomes. We expect that the availability of high-quality genomes with comprehensive functional annotations will promote advances in clinical microbial genomics, functional evolution and other subfields of microbiology. proGenomes is available at http://progenomes.embl.de.

  5. PGSB/MIPS Plant Genome Information Resources and Concepts for the Analysis of Complex Grass Genomes.

    PubMed

    Spannagl, Manuel; Bader, Kai; Pfeifer, Matthias; Nussbaumer, Thomas; Mayer, Klaus F X

    2016-01-01

    PGSB (Plant Genome and Systems Biology; formerly MIPS-Munich Institute for Protein Sequences) has been involved in developing, implementing and maintaining plant genome databases for more than a decade. Genome databases and analysis resources have focused on individual genomes and aim to provide flexible and maintainable datasets for model plant genomes as a backbone against which experimental data, e.g., from high-throughput functional genomics, can be organized and analyzed. In addition, genomes from both model and crop plants form a scaffold for comparative genomics, assisted by specialized tools such as the CrowsNest viewer to explore conserved gene order (synteny) between related species on macro- and micro-levels.The genomes of many economically important Triticeae plants such as wheat, barley, and rye present a great challenge for sequence assembly and bioinformatic analysis due to their enormous complexity and large genome size. Novel concepts and strategies have been developed to deal with these difficulties and have been applied to the genomes of wheat, barley, rye, and other cereals. This includes the GenomeZipper concept, reference-guided exome assembly, and "chromosome genomics" based on flow cytometry sorted chromosomes.

  6. Resources for Biological Annotation of the Drosophila Genome

    SciTech Connect

    Gerald M. Rubin

    2005-08-08

    This project supported seed money for the development of cDNA and genetic resources to support studies of the Drosophila melanogaster genome. Key publications supported by this work that provide additional detail: (1) ''The Drosophila gene collection: identification of putative full-length cDNAs for 70% of D. melanogaster genes''; and (2) ''The Berkeley Drosophila Genome Project gene disruption project: Single P-element insertions mutating 25% of vital Drosophila genes''.

  7. Gramene 2016: comparative plant genomics and pathway resources.

    PubMed

    Tello-Ruiz, Marcela K; Stein, Joshua; Wei, Sharon; Preece, Justin; Olson, Andrew; Naithani, Sushma; Amarasinghe, Vindhya; Dharmawardhana, Palitha; Jiao, Yinping; Mulvaney, Joseph; Kumari, Sunita; Chougule, Kapeel; Elser, Justin; Wang, Bo; Thomason, James; Bolser, Daniel M; Kerhornou, Arnaud; Walts, Brandon; Fonseca, Nuno A; Huerta, Laura; Keays, Maria; Tang, Y Amy; Parkinson, Helen; Fabregat, Antonio; McKay, Sheldon; Weiser, Joel; D'Eustachio, Peter; Stein, Lincoln; Petryszak, Robert; Kersey, Paul J; Jaiswal, Pankaj; Ware, Doreen

    2016-01-04

    Gramene (http://www.gramene.org) is an online resource for comparative functional genomics in crops and model plant species. Its two main frameworks are genomes (collaboration with Ensembl Plants) and pathways (The Plant Reactome and archival BioCyc databases). Since our last NAR update, the database website adopted a new Drupal management platform. The genomes section features 39 fully assembled reference genomes that are integrated using ontology-based annotation and comparative analyses, and accessed through both visual and programmatic interfaces. Additional community data, such as genetic variation, expression and methylation, are also mapped for a subset of genomes. The Plant Reactome pathway portal (http://plantreactome.gramene.org) provides a reference resource for analyzing plant metabolic and regulatory pathways. In addition to ∼ 200 curated rice reference pathways, the portal hosts gene homology-based pathway projections for 33 plant species. Both the genome and pathway browsers interface with the EMBL-EBI's Expression Atlas to enable the projection of baseline and differential expression data from curated expression studies in plants. Gramene's archive website (http://archive.gramene.org) continues to provide previously reported resources on comparative maps, markers and QTL. To further aid our users, we have also introduced a live monthly educational webinar series and a Gramene YouTube channel carrying video tutorials. Published by Oxford University Press on behalf of Nucleic Acids Research 2015. This work is written by (a) US Government employee(s) and is in the public domain in the US.

  8. Gramene 2016: comparative plant genomics and pathway resources

    PubMed Central

    Tello-Ruiz, Marcela K.; Stein, Joshua; Wei, Sharon; Preece, Justin; Olson, Andrew; Naithani, Sushma; Amarasinghe, Vindhya; Dharmawardhana, Palitha; Jiao, Yinping; Mulvaney, Joseph; Kumari, Sunita; Chougule, Kapeel; Elser, Justin; Wang, Bo; Thomason, James; Bolser, Daniel M.; Kerhornou, Arnaud; Walts, Brandon; Fonseca, Nuno A.; Huerta, Laura; Keays, Maria; Tang, Y. Amy; Parkinson, Helen; Fabregat, Antonio; McKay, Sheldon; Weiser, Joel; D'Eustachio, Peter; Stein, Lincoln; Petryszak, Robert; Kersey, Paul J.; Jaiswal, Pankaj; Ware, Doreen

    2016-01-01

    Gramene (http://www.gramene.org) is an online resource for comparative functional genomics in crops and model plant species. Its two main frameworks are genomes (collaboration with Ensembl Plants) and pathways (The Plant Reactome and archival BioCyc databases). Since our last NAR update, the database website adopted a new Drupal management platform. The genomes section features 39 fully assembled reference genomes that are integrated using ontology-based annotation and comparative analyses, and accessed through both visual and programmatic interfaces. Additional community data, such as genetic variation, expression and methylation, are also mapped for a subset of genomes. The Plant Reactome pathway portal (http://plantreactome.gramene.org) provides a reference resource for analyzing plant metabolic and regulatory pathways. In addition to ∼200 curated rice reference pathways, the portal hosts gene homology-based pathway projections for 33 plant species. Both the genome and pathway browsers interface with the EMBL-EBI's Expression Atlas to enable the projection of baseline and differential expression data from curated expression studies in plants. Gramene's archive website (http://archive.gramene.org) continues to provide previously reported resources on comparative maps, markers and QTL. To further aid our users, we have also introduced a live monthly educational webinar series and a Gramene YouTube channel carrying video tutorials. PMID:26553803

  9. Genetic resources offer efficient tools for rice functional genomics research.

    PubMed

    Lo, Shuen-Fang; Fan, Ming-Jen; Hsing, Yue-Ie; Chen, Liang-Jwu; Chen, Shu; Wen, Ien-Chie; Liu, Yi-Lun; Chen, Ku-Ting; Jiang, Mirng-Jier; Lin, Ming-Kuang; Rao, Meng-Yen; Yu, Lin-Chih; Ho, Tuan-Hua David; Yu, Su-May

    2016-05-01

    Rice is an important crop and major model plant for monocot functional genomics studies. With the establishment of various genetic resources for rice genomics, the next challenge is to systematically assign functions to predicted genes in the rice genome. Compared with the robustness of genome sequencing and bioinformatics techniques, progress in understanding the function of rice genes has lagged, hampering the utilization of rice genes for cereal crop improvement. The use of transfer DNA (T-DNA) insertional mutagenesis offers the advantage of uniform distribution throughout the rice genome, but preferentially in gene-rich regions, resulting in direct gene knockout or activation of genes within 20-30 kb up- and downstream of the T-DNA insertion site and high gene tagging efficiency. Here, we summarize the recent progress in functional genomics using the T-DNA-tagged rice mutant population. We also discuss important features of T-DNA activation- and knockout-tagging and promoter-trapping of the rice genome in relation to mutant and candidate gene characterizations and how to more efficiently utilize rice mutant populations and datasets for high-throughput functional genomics and phenomics studies by forward and reverse genetics approaches. These studies may facilitate the translation of rice functional genomics research to improvements of rice and other cereal crops. © 2015 John Wiley & Sons Ltd.

  10. Enhancing faba bean (Vicia faba L.) genome resources

    PubMed Central

    Cooper, James W.; Wilson, Michael H.; Derks, Martijn F. L.; Smit, Sandra; Kunert, Karl J.; Cullis, Christopher

    2017-01-01

    Abstract Grain legume improvement is currently impeded by a lack of genomic resources. The paucity of genome information for faba bean can be attributed to the intrinsic difficulties of assembling/annotating its giant (~13 Gb) genome. In order to address this challenge, RNA-sequencing analysis was performed on faba bean (cv. Wizard) leaves. Read alignment to the faba bean reference transcriptome identified 16 300 high quality unigenes. In addition, Illumina paired-end sequencing was used to establish a baseline for genomic information assembly. Genomic reads were assembled de novo into contigs with a size range of 50–5000 bp. Over 85% of sequences did not align to known genes, of which ~10% could be aligned to known repetitive genetic elements. Over 26 000 of the reference transcriptome unigenes could be aligned to DNA-sequencing (DNA-seq) reads with high confidence. Moreover, this comparison identified 56 668 potential splice points in all identified unigenes. Sequence length data were extended at 461 putative loci through alignment of DNA-seq contigs to full-length, publicly available linkage marker sequences. Reads also yielded coverages of 3466× and 650× for the chloroplast and mitochondrial genomes, respectively. Inter- and intraspecies organelle genome comparisons established core legume organelle gene sets, and revealed polymorphic regions of faba bean organelle genomes. PMID:28419381

  11. PhytoPath: an integrative resource for plant pathogen genomics

    PubMed Central

    Pedro, Helder; Maheswari, Uma; Urban, Martin; Irvine, Alistair George; Cuzick, Alayne; McDowall, Mark D.; Staines, Daniel M.; Kulesha, Eugene; Hammond-Kosack, Kim Elizabeth; Kersey, Paul Julian

    2016-01-01

    PhytoPath (www.phytopathdb.org) is a resource for genomic and phenotypic data from plant pathogen species, that integrates phenotypic data for genes from PHI-base, an expertly curated catalog of genes with experimentally verified pathogenicity, with the Ensembl tools for data visualization and analysis. The resource is focused on fungi, protists (oomycetes) and bacterial plant pathogens that have genomes that have been sequenced and annotated. Genes with associated PHI-base data can be easily identified across all plant pathogen species using a BioMart-based query tool and visualized in their genomic context on the Ensembl genome browser. The PhytoPath resource contains data for 135 genomic sequences from 87 plant pathogen species, and 1364 genes curated for their role in pathogenicity and as targets for chemical intervention. Support for community annotation of gene models is provided using the WebApollo online gene editor, and we are working with interested communities to improve reference annotation for selected species. PMID:26476449

  12. Assembly: a resource for assembled genomes at NCBI

    PubMed Central

    Kitts, Paul A.; Church, Deanna M.; Thibaud-Nissen, Françoise; Choi, Jinna; Hem, Vichet; Sapojnikov, Victor; Smith, Robert G.; Tatusova, Tatiana; Xiang, Charlie; Zherikov, Andrey; DiCuccio, Michael; Murphy, Terence D.; Pruitt, Kim D.; Kimchi, Avi

    2016-01-01

    The NCBI Assembly database (www.ncbi.nlm.nih.gov/assembly/) provides stable accessioning and data tracking for genome assembly data. The model underlying the database can accommodate a range of assembly structures, including sets of unordered contig or scaffold sequences, bacterial genomes consisting of a single complete chromosome, or complex structures such as a human genome with modeled allelic variation. The database provides an assembly accession and version to unambiguously identify the set of sequences that make up a particular version of an assembly, and tracks changes to updated genome assemblies. The Assembly database reports metadata such as assembly names, simple statistical reports of the assembly (number of contigs and scaffolds, contiguity metrics such as contig N50, total sequence length and total gap length) as well as the assembly update history. The Assembly database also tracks the relationship between an assembly submitted to the International Nucleotide Sequence Database Consortium (INSDC) and the assembly represented in the NCBI RefSeq project. Users can find assemblies of interest by querying the Assembly Resource directly or by browsing available assemblies for a particular organism. Links in the Assembly Resource allow users to easily download sequence and annotations for current versions of genome assemblies from the NCBI genomes FTP site. PMID:26578580

  13. Assembly: a resource for assembled genomes at NCBI.

    PubMed

    Kitts, Paul A; Church, Deanna M; Thibaud-Nissen, Françoise; Choi, Jinna; Hem, Vichet; Sapojnikov, Victor; Smith, Robert G; Tatusova, Tatiana; Xiang, Charlie; Zherikov, Andrey; DiCuccio, Michael; Murphy, Terence D; Pruitt, Kim D; Kimchi, Avi

    2016-01-04

    The NCBI Assembly database (www.ncbi.nlm.nih.gov/assembly/) provides stable accessioning and data tracking for genome assembly data. The model underlying the database can accommodate a range of assembly structures, including sets of unordered contig or scaffold sequences, bacterial genomes consisting of a single complete chromosome, or complex structures such as a human genome with modeled allelic variation. The database provides an assembly accession and version to unambiguously identify the set of sequences that make up a particular version of an assembly, and tracks changes to updated genome assemblies. The Assembly database reports metadata such as assembly names, simple statistical reports of the assembly (number of contigs and scaffolds, contiguity metrics such as contig N50, total sequence length and total gap length) as well as the assembly update history. The Assembly database also tracks the relationship between an assembly submitted to the International Nucleotide Sequence Database Consortium (INSDC) and the assembly represented in the NCBI RefSeq project. Users can find assemblies of interest by querying the Assembly Resource directly or by browsing available assemblies for a particular organism. Links in the Assembly Resource allow users to easily download sequence and annotations for current versions of genome assemblies from the NCBI genomes FTP site. Published by Oxford University Press on behalf of Nucleic Acids Research 2015. This work is written by (a) US Government employee(s) and is in the public domain in the US.

  14. Genome resource for the Indonesian coelacanth, Latimeria menadoensis.

    PubMed

    Danke, Joshua; Miyake, Tsutomu; Powers, Thomas; Schein, Jacqueline; Shin, Heesun; Bosdet, Ian; Erdmann, Mark; Caldwell, Roy; Amemiya, Chris T

    2004-03-01

    We have generated a BAC library from the Indonesian coelacanth, Latimeria menadoensis. This library was generated using genomic DNA of nuclei isolated from heart tissue, and has an average insert size of 171 kb. There are a total of 288 384-well microtiter dishes in the library (110,592 clones) and its genomic representation is estimated to encompass > or = 7X coverage based on the amount of DNA presumably cloned in the library as well as via hybridization with probes to a small set of single copy genes. This genomic resource has been made available to the public and should prove useful to the scientific community for many applications, including comparative genomics, molecular evolution and conservation genetics.

  15. Building a model: developing genomic resources for common milkweed (Asclepias syriaca) with low coverage genome sequencing

    PubMed Central

    2011-01-01

    Background Milkweeds (Asclepias L.) have been extensively investigated in diverse areas of evolutionary biology and ecology; however, there are few genetic resources available to facilitate and compliment these studies. This study explored how low coverage genome sequencing of the common milkweed (Asclepias syriaca L.) could be useful in characterizing the genome of a plant without prior genomic information and for development of genomic resources as a step toward further developing A. syriaca as a model in ecology and evolution. Results A 0.5× genome of A. syriaca was produced using Illumina sequencing. A virtually complete chloroplast genome of 158,598 bp was assembled, revealing few repeats and loss of three genes: accD, clpP, and ycf1. A nearly complete rDNA cistron (18S-5.8S-26S; 7,541 bp) and 5S rDNA (120 bp) sequence were obtained. Assessment of polymorphism revealed that the rDNA cistron and 5S rDNA had 0.3% and 26.7% polymorphic sites, respectively. A partial mitochondrial genome sequence (130,764 bp), with identical gene content to tobacco, was also assembled. An initial characterization of repeat content indicated that Ty1/copia-like retroelements are the most common repeat type in the milkweed genome. At least one A. syriaca microread hit 88% of Catharanthus roseus (Apocynaceae) unigenes (median coverage of 0.29×) and 66% of single copy orthologs (COSII) in asterids (median coverage of 0.14×). From this partial characterization of the A. syriaca genome, markers for population genetics (microsatellites) and phylogenetics (low-copy nuclear genes) studies were developed. Conclusions The results highlight the promise of next generation sequencing for development of genomic resources for any organism. Low coverage genome sequencing allows characterization of the high copy fraction of the genome and exploration of the low copy fraction of the genome, which facilitate the development of molecular tools for further study of a target species and its relatives

  16. Building a model: developing genomic resources for common milkweed (Asclepias syriaca) with low coverage genome sequencing.

    PubMed

    Straub, Shannon C K; Fishbein, Mark; Livshultz, Tatyana; Foster, Zachary; Parks, Matthew; Weitemier, Kevin; Cronn, Richard C; Liston, Aaron

    2011-05-04

    Milkweeds (Asclepias L.) have been extensively investigated in diverse areas of evolutionary biology and ecology; however, there are few genetic resources available to facilitate and compliment these studies. This study explored how low coverage genome sequencing of the common milkweed (Asclepias syriaca L.) could be useful in characterizing the genome of a plant without prior genomic information and for development of genomic resources as a step toward further developing A. syriaca as a model in ecology and evolution. A 0.5× genome of A. syriaca was produced using Illumina sequencing. A virtually complete chloroplast genome of 158,598 bp was assembled, revealing few repeats and loss of three genes: accD, clpP, and ycf1. A nearly complete rDNA cistron (18S-5.8S-26S; 7,541 bp) and 5S rDNA (120 bp) sequence were obtained. Assessment of polymorphism revealed that the rDNA cistron and 5S rDNA had 0.3% and 26.7% polymorphic sites, respectively. A partial mitochondrial genome sequence (130,764 bp), with identical gene content to tobacco, was also assembled. An initial characterization of repeat content indicated that Ty1/copia-like retroelements are the most common repeat type in the milkweed genome. At least one A. syriaca microread hit 88% of Catharanthus roseus (Apocynaceae) unigenes (median coverage of 0.29×) and 66% of single copy orthologs (COSII) in asterids (median coverage of 0.14×). From this partial characterization of the A. syriaca genome, markers for population genetics (microsatellites) and phylogenetics (low-copy nuclear genes) studies were developed. The results highlight the promise of next generation sequencing for development of genomic resources for any organism. Low coverage genome sequencing allows characterization of the high copy fraction of the genome and exploration of the low copy fraction of the genome, which facilitate the development of molecular tools for further study of a target species and its relatives. This study represents a first

  17. Floral gene resources from basal angiosperms for comparative genomics research.

    PubMed

    Albert, Victor A; Soltis, Douglas E; Carlson, John E; Farmerie, William G; Wall, P Kerr; Ilut, Daniel C; Solow, Teri M; Mueller, Lukas A; Landherr, Lena L; Hu, Yi; Buzgo, Matyas; Kim, Sangtae; Yoo, Mi-Jeong; Frohlich, Michael W; Perl-Treves, Rafael; Schlarbaum, Scott E; Bliss, Barbara J; Zhang, Xiaohong; Tanksley, Steven D; Oppenheimer, David G; Soltis, Pamela S; Ma, Hong; DePamphilis, Claude W; Leebens-Mack, James H

    2005-03-30

    The Floral Genome Project was initiated to bridge the genomic gap between the most broadly studied plant model systems. Arabidopsis and rice, although now completely sequenced and under intensive comparative genomic investigation, are separated by at least 125 million years of evolutionary time, and cannot in isolation provide a comprehensive perspective on structural and functional aspects of flowering plant genome dynamics. Here we discuss new genomic resources available to the scientific community, comprising cDNA libraries and Expressed Sequence Tag (EST) sequences for a suite of phylogenetically basal angiosperms specifically selected to bridge the evolutionary gaps between model plants and provide insights into gene content and genome structure in the earliest flowering plants. Random sequencing of cDNAs from representatives of phylogenetically important eudicot, non-grass monocot, and gymnosperm lineages has so far (as of 12/1/04) generated 70,514 ESTs and 48,170 assembled unigenes. Efficient sorting of EST sequences into putative gene families based on whole Arabidopsis/rice proteome comparison has permitted ready identification of cDNA clones for finished sequencing. Preliminarily, (i) proportions of functional categories among sequenced floral genes seem representative of the entire Arabidopsis transcriptome, (ii) many known floral gene homologues have been captured, and (iii) phylogenetic analyses of ESTs are providing new insights into the process of gene family evolution in relation to the origin and diversification of the angiosperms. Initial comparisons illustrate the utility of the EST data sets toward discovery of the basic floral transcriptome. These first findings also afford the opportunity to address a number of conspicuous evolutionary genomic questions, including reproductive organ transcriptome overlap between angiosperms and gymnosperms, genome-wide duplication history, lineage-specific gene duplication and functional divergence, and

  18. Plant Genome Resources at the National Center for Biotechnology Information

    PubMed Central

    Wheeler, David L.; Smith-White, Brian; Chetvernin, Vyacheslav; Resenchuk, Sergei; Dombrowski, Susan M.; Pechous, Steven W.; Tatusova, Tatiana; Ostell, James

    2005-01-01

    The National Center for Biotechnology Information (NCBI) integrates data from more than 20 biological databases through a flexible search and retrieval system called Entrez. A core Entrez database, Entrez Nucleotide, includes GenBank and is tightly linked to the NCBI Taxonomy database, the Entrez Protein database, and the scientific literature in PubMed. A suite of more specialized databases for genomes, genes, gene families, gene expression, gene variation, and protein domains dovetails with the core databases to make Entrez a powerful system for genomic research. Linked to the full range of Entrez databases is the NCBI Map Viewer, which displays aligned genetic, physical, and sequence maps for eukaryotic genomes including those of many plants. A specialized plant query page allow maps from all plant genomes covered by the Map Viewer to be searched in tandem to produce a display of aligned maps from several species. PlantBLAST searches against the sequences shown in the Map Viewer allow BLAST alignments to be viewed within a genomic context. In addition, precomputed sequence similarities, such as those for proteins offered by BLAST Link, enable fluid navigation from unannotated to annotated sequences, quickening the pace of discovery. NCBI Web pages for plants, such as Plant Genome Central, complete the system by providing centralized access to NCBI's genomic resources as well as links to organism-specific Web pages beyond NCBI. PMID:16010002

  19. Integrating genomic resources of flatfish (Pleuronectiformes) to boost aquaculture production.

    PubMed

    Robledo, Diego; Hermida, Miguel; Rubiolo, Juan A; Fernández, Carlos; Blanco, Andrés; Bouza, Carmen; Martínez, Paulino

    2017-03-01

    Flatfish have a high market acceptance thus representing a profitable aquaculture production. The main farmed species is the turbot (Scophthalmus maximus) followed by Japanese flounder (Paralichthys olivaceous) and tongue sole (Cynoglossus semilaevis), but other species like Atlantic halibut (Hippoglossus hippoglossus), Senegalese sole (Solea senegalensis) and common sole (Solea solea) also register an important production and are very promising for farming. Important genomic resources are available for most of these species including whole genome sequencing projects, genetic maps and transcriptomes. In this work, we integrate all available genomic information of these species within a common framework, taking as reference the whole assembled genomes of turbot and tongue sole (>210× coverage). New insights related to the genetic basis of productive traits and new data useful to understand the evolutionary origin and diversification of this group were obtained. Despite a general 1:1 chromosome syntenic relationship between species, the comparison of turbot and tongue sole genomes showed huge intrachromosomic reorganizations. The integration of available mapping information supported specific chromosome fusions along flatfish evolution and facilitated the comparison between species of previously reported genetic associations for productive traits. When comparing transcriptomic resources of the six species, a common set of ~2500 othologues and ~150 common miRNAs were identified, and specific sets of putative missing genes were detected in flatfish transcriptomes, likely reflecting their evolutionary diversification. Copyright © 2016 Elsevier Inc. All rights reserved.

  20. Genomic resources for flatfish research and their applications.

    PubMed

    Cerdà, J; Douglas, S; Reith, M

    2010-10-01

    Flatfishes are a group of teleosts of high commercial and environmental interest, whose biology is still poorly understood. The recent rapid development of different 'omic' technologies is, however, enhancing the knowledge of the complex genetic control underlying different physiological processes of flatfishes. This review describes the different functional genomic approaches and resources currently available for flatfish research and summarizes different areas where microarray-based gene expression analysis has been applied. The increase in genome sequencing data has also allowed the construction of genetic linkage maps in different flatfish species; these maps are invaluable for investigating genome organization and identifying genetic traits of commercial interest. Despite the significant progress in this field, the genomic resources currently available for flatfish are still scarce. Further intensive research should be carried out to develop larger genomic sequence databases, high-density microarrays and, more detailed, complete linkage maps, using second-generation sequencing platforms. These tools will be crucial for further expanding the knowledge of flatfish physiology, and it is predicted that they will have important implications for wild fish population management, improved fish welfare and increased productivity in aquaculture.

  1. A public resource facilitating clinical use of genomes.

    PubMed

    Ball, Madeleine P; Thakuria, Joseph V; Zaranek, Alexander Wait; Clegg, Tom; Rosenbaum, Abraham M; Wu, Xiaodi; Angrist, Misha; Bhak, Jong; Bobe, Jason; Callow, Matthew J; Cano, Carlos; Chou, Michael F; Chung, Wendy K; Douglas, Shawn M; Estep, Preston W; Gore, Athurva; Hulick, Peter; Labarga, Alberto; Lee, Je-Hyuk; Lunshof, Jeantine E; Kim, Byung Chul; Kim, Jong-Il; Li, Zhe; Murray, Michael F; Nilsen, Geoffrey B; Peters, Brock A; Raman, Anugraha M; Rienhoff, Hugh Y; Robasky, Kimberly; Wheeler, Matthew T; Vandewege, Ward; Vorhaus, Daniel B; Yang, Joyce L; Yang, Luhan; Aach, John; Ashley, Euan A; Drmanac, Radoje; Kim, Seong-Jin; Li, Jin Billy; Peshkin, Leonid; Seidman, Christine E; Seo, Jeong-Sun; Zhang, Kun; Rehm, Heidi L; Church, George M

    2012-07-24

    Rapid advances in DNA sequencing promise to enable new diagnostics and individualized therapies. Achieving personalized medicine, however, will require extensive research on highly reidentifiable, integrated datasets of genomic and health information. To assist with this, participants in the Personal Genome Project choose to forgo privacy via our institutional review board- approved "open consent" process. The contribution of public data and samples facilitates both scientific discovery and standardization of methods. We present our findings after enrollment of more than 1,800 participants, including whole-genome sequencing of 10 pilot participant genomes (the PGP-10). We introduce the Genome-Environment-Trait Evidence (GET-Evidence) system. This tool automatically processes genomes and prioritizes both published and novel variants for interpretation. In the process of reviewing the presumed healthy PGP-10 genomes, we find numerous literature references implying serious disease. Although it is sometimes impossible to rule out a late-onset effect, stringent evidence requirements can address the high rate of incidental findings. To that end we develop a peer production system for recording and organizing variant evaluations according to standard evidence guidelines, creating a public forum for reaching consensus on interpretation of clinically relevant variants. Genome analysis becomes a two-step process: using a prioritized list to record variant evaluations, then automatically sorting reviewed variants using these annotations. Genome data, health and trait information, participant samples, and variant interpretations are all shared in the public domain-we invite others to review our results using our participant samples and contribute to our interpretations. We offer our public resource and methods to further personalized medical research.

  2. A public resource facilitating clinical use of genomes

    PubMed Central

    Ball, Madeleine P.; Thakuria, Joseph V.; Zaranek, Alexander Wait; Clegg, Tom; Rosenbaum, Abraham M.; Wu, Xiaodi; Angrist, Misha; Bhak, Jong; Bobe, Jason; Callow, Matthew J.; Cano, Carlos; Chou, Michael F.; Chung, Wendy K.; Douglas, Shawn M.; Estep, Preston W.; Gore, Athurva; Hulick, Peter; Labarga, Alberto; Lee, Je-Hyuk; Lunshof, Jeantine E.; Kim, Byung Chul; Kim, Jong-Il; Li, Zhe; Murray, Michael F.; Nilsen, Geoffrey B.; Peters, Brock A.; Raman, Anugraha M.; Rienhoff, Hugh Y.; Robasky, Kimberly; Wheeler, Matthew T.; Vandewege, Ward; Vorhaus, Daniel B.; Yang, Joyce L.; Yang, Luhan; Aach, John; Ashley, Euan A.; Drmanac, Radoje; Kim, Seong-Jin; Li, Jin Billy; Peshkin, Leonid; Seidman, Christine E.; Seo, Jeong-Sun; Zhang, Kun; Rehm, Heidi L.; Church, George M.

    2012-01-01

    Rapid advances in DNA sequencing promise to enable new diagnostics and individualized therapies. Achieving personalized medicine, however, will require extensive research on highly reidentifiable, integrated datasets of genomic and health information. To assist with this, participants in the Personal Genome Project choose to forgo privacy via our institutional review board- approved “open consent” process. The contribution of public data and samples facilitates both scientific discovery and standardization of methods. We present our findings after enrollment of more than 1,800 participants, including whole-genome sequencing of 10 pilot participant genomes (the PGP-10). We introduce the Genome-Environment-Trait Evidence (GET-Evidence) system. This tool automatically processes genomes and prioritizes both published and novel variants for interpretation. In the process of reviewing the presumed healthy PGP-10 genomes, we find numerous literature references implying serious disease. Although it is sometimes impossible to rule out a late-onset effect, stringent evidence requirements can address the high rate of incidental findings. To that end we develop a peer production system for recording and organizing variant evaluations according to standard evidence guidelines, creating a public forum for reaching consensus on interpretation of clinically relevant variants. Genome analysis becomes a two-step process: using a prioritized list to record variant evaluations, then automatically sorting reviewed variants using these annotations. Genome data, health and trait information, participant samples, and variant interpretations are all shared in the public domain—we invite others to review our results using our participant samples and contribute to our interpretations. We offer our public resource and methods to further personalized medical research. PMID:22797899

  3. Kiwifruit Information Resource (KIR): a comparative platform for kiwifruit genomics

    PubMed Central

    Yue, Junyang; Liu, Jian; Ban, Rongjun; Tang, Wei; Deng, Lin; Fei, Zhangjun; Liu, Yongsheng

    2015-01-01

    The Kiwifruit Information Resource (KIR) is dedicated to maintain and integrate comprehensive datasets on genomics, functional genomics and transcriptomics of kiwifruit (Actinidiaceae). KIR serves as a central access point for existing/new genomic and genetic data. KIR also provides researchers with a variety of visualization and analysis tools. Current developments include the updated genome structure of Actinidia chinensis cv. Hongyang and its newest genome annotation, putative transcripts, gene expression, physical markers of genetic traits as well as relevant publications based on the latest genome assembly. Nine thousand five hundred and forty-seven new transcripts are detected and 21 132 old transcripts are changed. At the present release, the next-generation transcriptome sequencing data has been incorporated into gene models and splice variants. Protein–protein interactions are also identified based on experimentally determined orthologous interactions. Furthermore, the experimental results reported in peer-reviewed literature are manually extracted and integrated within a well-developed query page. In total, 122 identifications are currently associated, including commonly used gene names and symbols. All KIR datasets are helpful to facilitate a broad range of kiwifruit research topics and freely available to the research community. Database URL: http://bdg.hfut.edu.cn/kir/index.html. PMID:26656885

  4. Kiwifruit Information Resource (KIR): a comparative platform for kiwifruit genomics.

    PubMed

    Yue, Junyang; Liu, Jian; Ban, Rongjun; Tang, Wei; Deng, Lin; Fei, Zhangjun; Liu, Yongsheng

    2015-01-01

    The Kiwifruit Information Resource (KIR) is dedicated to maintain and integrate comprehensive datasets on genomics, functional genomics and transcriptomics of kiwifruit (Actinidiaceae). KIR serves as a central access point for existing/new genomic and genetic data. KIR also provides researchers with a variety of visualization and analysis tools. Current developments include the updated genome structure of Actinidia chinensis cv. Hongyang and its newest genome annotation, putative transcripts, gene expression, physical markers of genetic traits as well as relevant publications based on the latest genome assembly. Nine thousand five hundred and forty-seven new transcripts are detected and 21 132 old transcripts are changed. At the present release, the next-generation transcriptome sequencing data has been incorporated into gene models and splice variants. Protein-protein interactions are also identified based on experimentally determined orthologous interactions. Furthermore, the experimental results reported in peer-reviewed literature are manually extracted and integrated within a well-developed query page. In total, 122 identifications are currently associated, including commonly used gene names and symbols. All KIR datasets are helpful to facilitate a broad range of kiwifruit research topics and freely available to the research community. Database URL: http://bdg.hfut.edu.cn/kir/index.html.

  5. The international Genome sample resource (IGSR): A worldwide collection of genome variation incorporating the 1000 Genomes Project data

    PubMed Central

    Clarke, Laura; Fairley, Susan; Zheng-Bradley, Xiangqun; Streeter, Ian; Perry, Emily; Lowy, Ernesto; Tassé, Anne-Marie; Flicek, Paul

    2017-01-01

    The International Genome Sample Resource (IGSR; http://www.internationalgenome.org) expands in data type and population diversity the resources from the 1000 Genomes Project. IGSR represents the largest open collection of human variation data and provides easy access to these resources. IGSR was established in 2015 to maintain and extend the 1000 Genomes Project data, which has been widely used as a reference set of human variation and by researchers developing analysis methods. IGSR has mapped all of the 1000 Genomes sequence to the newest human reference (GRCh38), and will release updated variant calls to ensure maximal usefulness of the existing data. IGSR is collecting new structural variation data on the 1000 Genomes samples from long read sequencing and other technologies, and will collect relevant functional data into a single comprehensive resource. IGSR is extending coverage with new populations sequenced by collaborating groups. Here, we present the new data and analysis that IGSR has made available. We have also introduced a new data portal that increases discoverability of our data—previously only browseable through our FTP site—by focusing on particular samples, populations or data sets of interest. PMID:27638885

  6. The international Genome sample resource (IGSR): A worldwide collection of genome variation incorporating the 1000 Genomes Project data.

    PubMed

    Clarke, Laura; Fairley, Susan; Zheng-Bradley, Xiangqun; Streeter, Ian; Perry, Emily; Lowy, Ernesto; Tassé, Anne-Marie; Flicek, Paul

    2017-01-04

    The International Genome Sample Resource (IGSR; http://www.internationalgenome.org) expands in data type and population diversity the resources from the 1000 Genomes Project. IGSR represents the largest open collection of human variation data and provides easy access to these resources. IGSR was established in 2015 to maintain and extend the 1000 Genomes Project data, which has been widely used as a reference set of human variation and by researchers developing analysis methods. IGSR has mapped all of the 1000 Genomes sequence to the newest human reference (GRCh38), and will release updated variant calls to ensure maximal usefulness of the existing data. IGSR is collecting new structural variation data on the 1000 Genomes samples from long read sequencing and other technologies, and will collect relevant functional data into a single comprehensive resource. IGSR is extending coverage with new populations sequenced by collaborating groups. Here, we present the new data and analysis that IGSR has made available. We have also introduced a new data portal that increases discoverability of our data-previously only browseable through our FTP site-by focusing on particular samples, populations or data sets of interest. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  7. Rooting of conifer propagules

    Treesearch

    R.L. Mott

    1977-01-01

    An outline of the general problems involved with the propagation of elite conifer clones by rooted cuttings is drawn from published reports. New approaches for resolving these problems can come from studies of clone production through tissue culture methods. Probable extension of tissue culture techniques will permit the establishment of clones from adult, proven trees...

  8. Resources for Functional Genomics Studies in Drosophila melanogaster

    PubMed Central

    Mohr, Stephanie E.; Hu, Yanhui; Kim, Kevin; Housden, Benjamin E.; Perrimon, Norbert

    2014-01-01

    Drosophila melanogaster has become a system of choice for functional genomic studies. Many resources, including online databases and software tools, are now available to support design or identification of relevant fly stocks and reagents or analysis and mining of existing functional genomic, transcriptomic, proteomic, etc. datasets. These include large community collections of fly stocks and plasmid clones, “meta” information sites like FlyBase and FlyMine, and an increasing number of more specialized reagents, databases, and online tools. Here, we introduce key resources useful to plan large-scale functional genomics studies in Drosophila and to analyze, integrate, and mine the results of those studies in ways that facilitate identification of highest-confidence results and generation of new hypotheses. We also discuss ways in which existing resources can be used and might be improved and suggest a few areas of future development that would further support large- and small-scale studies in Drosophila and facilitate use of Drosophila information by the research community more generally. PMID:24653003

  9. Building a model: developing genomic resources for common milkweed (Asclepias syriaca) with low coverage genome sequencing

    Treesearch

    Shannon C.K. Straub; Mark Fishbein; Tatyana Livshult; Zachary Foster; Matthew Parks; Kevin Weitemier; Richard C. Cronn; Aaron. Liston

    2011-01-01

    Milkweeds (Asclepias L.) have been extensively investigated in diverse areas of evolutionary biology and ecology; however, there are few genetic resources available to facilitate and compliment these studies. This study explored how low coverage genome sequencing of the common milkweed (Asclepias syriaca L.) could be useful in...

  10. Genomic resources in mungbean for future breeding programs

    PubMed Central

    Kim, Sue K.; Nair, Ramakrishnan M.; Lee, Jayern; Lee, Suk-Ha

    2015-01-01

    Among the legume family, mungbean (Vigna radiata) has become one of the important crops in Asia, showing a steady increase in global production. It provides a good source of protein and contains most notably folate and iron. Beyond the nutritional value of mungbean, certain features make it a well-suited model organism among legume plants because of its small genome size, short life-cycle, self-pollinating, and close genetic relationship to other legumes. In the past, there have been several efforts to develop molecular markers and linkage maps associated with agronomic traits for the genetic improvement of mungbean and, ultimately, breeding for cultivar development to increase the average yields of mungbean. The recent release of a reference genome of the cultivated mungbean (V. radiata var. radiata VC1973A) and an additional de novo sequencing of a wild relative mungbean (V. radiata var. sublobata) has provided a framework for mungbean genetic and genome research, that can further be used for genome-wide association and functional studies to identify genes related to specific agronomic traits. Moreover, the diverse gene pool of wild mungbean comprises valuable genetic resources of beneficial genes that may be helpful in widening the genetic diversity of cultivated mungbean. This review paper covers the research progress on molecular and genomics approaches and the current status of breeding programs that have developed to move toward the ultimate goal of mungbean improvement. PMID:26322067

  11. Floral gene resources from basal angiosperms for comparative genomics research

    PubMed Central

    Albert, Victor A; Soltis, Douglas E; Carlson, John E; Farmerie, William G; Wall, P Kerr; Ilut, Daniel C; Solow, Teri M; Mueller, Lukas A; Landherr, Lena L; Hu, Yi; Buzgo, Matyas; Kim, Sangtae; Yoo, Mi-Jeong; Frohlich, Michael W; Perl-Treves, Rafael; Schlarbaum, Scott E; Bliss, Barbara J; Zhang, Xiaohong; Tanksley, Steven D; Oppenheimer, David G; Soltis, Pamela S; Ma, Hong; dePamphilis, Claude W; Leebens-Mack, James H

    2005-01-01

    Background The Floral Genome Project was initiated to bridge the genomic gap between the most broadly studied plant model systems. Arabidopsis and rice, although now completely sequenced and under intensive comparative genomic investigation, are separated by at least 125 million years of evolutionary time, and cannot in isolation provide a comprehensive perspective on structural and functional aspects of flowering plant genome dynamics. Here we discuss new genomic resources available to the scientific community, comprising cDNA libraries and Expressed Sequence Tag (EST) sequences for a suite of phylogenetically basal angiosperms specifically selected to bridge the evolutionary gaps between model plants and provide insights into gene content and genome structure in the earliest flowering plants. Results Random sequencing of cDNAs from representatives of phylogenetically important eudicot, non-grass monocot, and gymnosperm lineages has so far (as of 12/1/04) generated 70,514 ESTs and 48,170 assembled unigenes. Efficient sorting of EST sequences into putative gene families based on whole Arabidopsis/rice proteome comparison has permitted ready identification of cDNA clones for finished sequencing. Preliminarily, (i) proportions of functional categories among sequenced floral genes seem representative of the entire Arabidopsis transcriptome, (ii) many known floral gene homologues have been captured, and (iii) phylogenetic analyses of ESTs are providing new insights into the process of gene family evolution in relation to the origin and diversification of the angiosperms. Conclusion Initial comparisons illustrate the utility of the EST data sets toward discovery of the basic floral transcriptome. These first findings also afford the opportunity to address a number of conspicuous evolutionary genomic questions, including reproductive organ transcriptome overlap between angiosperms and gymnosperms, genome-wide duplication history, lineage-specific gene duplication and

  12. MorusDB: a resource for mulberry genomics and genome biology

    PubMed Central

    Li, Tian; Qi, Xiwu; Zeng, Qiwei; Xiang, Zhonghuai; He, Ningjia

    2014-01-01

    Mulberry is an important cultivated plant that has received the attention of biologists interested in sericulture and plant–insect interaction. Morus notabilis, a wild mulberry species with a minimal chromosome number is an ideal material for whole-genome sequencing and assembly. The genome and transcriptome of M. notabilis were sequenced and analyzed. In this article, a web-based and open-access database, the Morus Genome Database (MorusDB), was developed to enable easy-to-access and data mining. The MorusDB provides an integrated data source and an easy accession of mulberry large-scale genomic sequencing and assembly, predicted genes and functional annotations, expressed sequence tags (ESTs), transposable elements (TEs), Gene Ontology (GO) terms, horizontal gene transfers between mulberry and silkworm and ortholog and paralog groups. Transcriptome sequencing data for M. notabilis root, leaf, bark, winter bud and male flower can also be searched and downloaded. Furthermore, MorusDB provides an analytical workbench with some built-in tools and pipelines, such as BLAST, Search GO, Mulberry GO and Mulberry GBrowse, to facilitate genomic studies and comparative genomics. The MorusDB provides important genomic resources for scientists working with mulberry and other Moraceae species, which include many important fruit crops. Designed as a basic platform and accompanied by the SilkDB, MorusDB strives to be a comprehensive platform for the silkworm–mulberry interaction studies. Database URL: http://morus.swu.edu.cn/morusdb. PMID:24923822

  13. Tetrahymena functional genomics database (TetraFGD): an integrated resource for Tetrahymena functional genomics.

    PubMed

    Xiong, Jie; Lu, Yuming; Feng, Jinmei; Yuan, Dongxia; Tian, Miao; Chang, Yue; Fu, Chengjie; Wang, Guangying; Zeng, Honghui; Miao, Wei

    2013-01-01

    The ciliated protozoan Tetrahymena thermophila is a useful unicellular model organism for studies of eukaryotic cellular and molecular biology. Researches on T. thermophila have contributed to a series of remarkable basic biological principles. After the macronuclear genome was sequenced, substantial progress has been made in functional genomics research on T. thermophila, including genome-wide microarray analysis of the T. thermophila life cycle, a T. thermophila gene network analysis based on the microarray data and transcriptome analysis by deep RNA sequencing. To meet the growing demands for the Tetrahymena research community, we integrated these data to provide a public access database: Tetrahymena functional genomics database (TetraFGD). TetraFGD contains three major resources, including the RNA-Seq transcriptome, microarray and gene networks. The RNA-Seq data define gene structures and transcriptome, with special emphasis on exon-intron boundaries; the microarray data describe gene expression of 20 time points during three major stages of the T. thermophila life cycle; the gene network data identify potential gene-gene interactions of 15 049 genes. The TetraFGD provides user-friendly search functions that assist researchers in accessing gene models, transcripts, gene expression data and gene-gene relationships. In conclusion, the TetraFGD is an important functional genomic resource for researchers who focus on the Tetrahymena or other ciliates. Database URL: http://tfgd.ihb.ac.cn/

  14. Genetic Transformation and Genomic Resources for Next-Generation Precise Genome Engineering in Vegetable Crops

    PubMed Central

    Cardi, Teodoro; D’Agostino, Nunzio; Tripodi, Pasquale

    2017-01-01

    In the frame of modern agriculture facing the predicted increase of population and general environmental changes, the securement of high quality food remains a major challenge to deal with. Vegetable crops include a large number of species, characterized by multiple geographical origins, large genetic variability and diverse reproductive features. Due to their nutritional value, they have an important place in human diet. In recent years, many crop genomes have been sequenced permitting the identification of genes and superior alleles associated with desirable traits. Furthermore, innovative biotechnological approaches allow to take a step forward towards the development of new improved cultivars harboring precise genome modifications. Sequence-based knowledge coupled with advanced biotechnologies is supporting the widespread application of new plant breeding techniques to enhance the success in modification and transfer of useful alleles into target varieties. Clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated protein 9 system, zinc-finger nucleases, and transcription activator-like effector nucleases represent the main methods available for plant genome engineering through targeted modifications. Such technologies, however, require efficient transformation protocols as well as extensive genomic resources and accurate knowledge before they can be efficiently exploited in practical breeding programs. In this review, we revise the state of the art in relation to availability of such scientific and technological resources in various groups of vegetables, describe genome editing results obtained so far and discuss the implications for future applications. PMID:28275380

  15. Genetic Transformation and Genomic Resources for Next-Generation Precise Genome Engineering in Vegetable Crops.

    PubMed

    Cardi, Teodoro; D'Agostino, Nunzio; Tripodi, Pasquale

    2017-01-01

    In the frame of modern agriculture facing the predicted increase of population and general environmental changes, the securement of high quality food remains a major challenge to deal with. Vegetable crops include a large number of species, characterized by multiple geographical origins, large genetic variability and diverse reproductive features. Due to their nutritional value, they have an important place in human diet. In recent years, many crop genomes have been sequenced permitting the identification of genes and superior alleles associated with desirable traits. Furthermore, innovative biotechnological approaches allow to take a step forward towards the development of new improved cultivars harboring precise genome modifications. Sequence-based knowledge coupled with advanced biotechnologies is supporting the widespread application of new plant breeding techniques to enhance the success in modification and transfer of useful alleles into target varieties. Clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated protein 9 system, zinc-finger nucleases, and transcription activator-like effector nucleases represent the main methods available for plant genome engineering through targeted modifications. Such technologies, however, require efficient transformation protocols as well as extensive genomic resources and accurate knowledge before they can be efficiently exploited in practical breeding programs. In this review, we revise the state of the art in relation to availability of such scientific and technological resources in various groups of vegetables, describe genome editing results obtained so far and discuss the implications for future applications.

  16. The Transcriptomics of Secondary Growth and Wood Formation in Conifers

    PubMed Central

    Carvalho, Ana; Paiva, Jorge; Louzada, José; Lima-Brito, José

    2013-01-01

    In the last years, forestry scientists have adapted genomics and next-generation sequencing (NGS) technologies to the search for candidate genes related to the transcriptomics of secondary growth and wood formation in several tree species. Gymnosperms, in particular, the conifers, are ecologically and economically important, namely, for the production of wood and other forestry end products. Until very recently, no whole genome sequencing of a conifer genome was available. Due to the gradual improvement of the NGS technologies and inherent bioinformatics tools, two draft assemblies of the whole genomes sequence of Picea abies and Picea glauca arose in the current year. These draft genome assemblies will bring new insights about the structure, content, and evolution of the conifer genomes. Furthermore, new directions in the forestry, breeding and research of conifers will be discussed in the following. The identification of genes associated with the xylem transcriptome and the knowledge of their regulatory mechanisms will provide less time-consuming breeding cycles and a high accuracy for the selection of traits related to wood production and quality. PMID:24288610

  17. The transcriptomics of secondary growth and wood formation in conifers.

    PubMed

    Carvalho, Ana; Paiva, Jorge; Louzada, José; Lima-Brito, José

    2013-01-01

    In the last years, forestry scientists have adapted genomics and next-generation sequencing (NGS) technologies to the search for candidate genes related to the transcriptomics of secondary growth and wood formation in several tree species. Gymnosperms, in particular, the conifers, are ecologically and economically important, namely, for the production of wood and other forestry end products. Until very recently, no whole genome sequencing of a conifer genome was available. Due to the gradual improvement of the NGS technologies and inherent bioinformatics tools, two draft assemblies of the whole genomes sequence of Picea abies and Picea glauca arose in the current year. These draft genome assemblies will bring new insights about the structure, content, and evolution of the conifer genomes. Furthermore, new directions in the forestry, breeding and research of conifers will be discussed in the following. The identification of genes associated with the xylem transcriptome and the knowledge of their regulatory mechanisms will provide less time-consuming breeding cycles and a high accuracy for the selection of traits related to wood production and quality.

  18. EuPathDB: the eukaryotic pathogen genomics database resource

    PubMed Central

    Aurrecoechea, Cristina; Barreto, Ana; Basenko, Evelina Y.; Brestelli, John; Brunk, Brian P.; Cade, Shon; Crouch, Kathryn; Doherty, Ryan; Falke, Dave; Fischer, Steve; Gajria, Bindu; Harb, Omar S.; Heiges, Mark; Hertz-Fowler, Christiane; Hu, Sufen; Iodice, John; Kissinger, Jessica C.; Lawrence, Cris; Li, Wei; Pinney, Deborah F.; Pulman, Jane A.; Roos, David S.; Shanmugasundram, Achchuthan; Silva-Franco, Fatima; Steinbiss, Sascha; Stoeckert, Christian J.; Spruill, Drew; Wang, Haiming; Warrenfeltz, Susanne; Zheng, Jie

    2017-01-01

    The Eukaryotic Pathogen Genomics Database Resource (EuPathDB, http://eupathdb.org) is a collection of databases covering 170+ eukaryotic pathogens (protists & fungi), along with relevant free-living and non-pathogenic species, and select pathogen hosts. To facilitate the discovery of meaningful biological relationships, the databases couple preconfigured searches with visualization and analysis tools for comprehensive data mining via intuitive graphical interfaces and APIs. All data are analyzed with the same workflows, including creation of gene orthology profiles, so data are easily compared across data sets, data types and organisms. EuPathDB is updated with numerous new analysis tools, features, data sets and data types. New tools include GO, metabolic pathway and word enrichment analyses plus an online workspace for analysis of personal, non-public, large-scale data. Expanded data content is mostly genomic and functional genomic data while new data types include protein microarray, metabolic pathways, compounds, quantitative proteomics, copy number variation, and polysomal transcriptomics. New features include consistent categorization of searches, data sets and genome browser tracks; redesigned gene pages; effective integration of alternative transcripts; and a EuPathDB Galaxy instance for private analyses of a user's data. Forthcoming upgrades include user workspaces for private integration of data with existing EuPathDB data and improved integration and presentation of host–pathogen interactions. PMID:27903906

  19. EuPathDB: the eukaryotic pathogen genomics database resource.

    PubMed

    Aurrecoechea, Cristina; Barreto, Ana; Basenko, Evelina Y; Brestelli, John; Brunk, Brian P; Cade, Shon; Crouch, Kathryn; Doherty, Ryan; Falke, Dave; Fischer, Steve; Gajria, Bindu; Harb, Omar S; Heiges, Mark; Hertz-Fowler, Christiane; Hu, Sufen; Iodice, John; Kissinger, Jessica C; Lawrence, Cris; Li, Wei; Pinney, Deborah F; Pulman, Jane A; Roos, David S; Shanmugasundram, Achchuthan; Silva-Franco, Fatima; Steinbiss, Sascha; Stoeckert, Christian J; Spruill, Drew; Wang, Haiming; Warrenfeltz, Susanne; Zheng, Jie

    2017-01-04

    The Eukaryotic Pathogen Genomics Database Resource (EuPathDB, http://eupathdb.org) is a collection of databases covering 170+ eukaryotic pathogens (protists & fungi), along with relevant free-living and non-pathogenic species, and select pathogen hosts. To facilitate the discovery of meaningful biological relationships, the databases couple preconfigured searches with visualization and analysis tools for comprehensive data mining via intuitive graphical interfaces and APIs. All data are analyzed with the same workflows, including creation of gene orthology profiles, so data are easily compared across data sets, data types and organisms. EuPathDB is updated with numerous new analysis tools, features, data sets and data types. New tools include GO, metabolic pathway and word enrichment analyses plus an online workspace for analysis of personal, non-public, large-scale data. Expanded data content is mostly genomic and functional genomic data while new data types include protein microarray, metabolic pathways, compounds, quantitative proteomics, copy number variation, and polysomal transcriptomics. New features include consistent categorization of searches, data sets and genome browser tracks; redesigned gene pages; effective integration of alternative transcripts; and a EuPathDB Galaxy instance for private analyses of a user's data. Forthcoming upgrades include user workspaces for private integration of data with existing EuPathDB data and improved integration and presentation of host-pathogen interactions.

  20. Fluorescence-based resource for semi-automated genomic analyses

    SciTech Connect

    Kiser, M.B.; Dragwa, C.; Jedlicka, A.E.

    1994-09-01

    To facilitate the practical application of highly efficient semi-automated methods for general application in genomic analyses, we have developed a fluorescence-based marker resource. Ninety highly polymorphic simple tandem repeat markers were combined to provide a rapid, accurate, and highly efficient initial genome-wide screening system. These markers are spaced on average every 33 recombination units, with a mean heterozygosity of 81% (range 65-94%), covering 22 autosomes and the X and Y chromosomes. Less than 3% of the genome lies beyond 30 cM of the nearest marker. Markers were placed in a vertical ladder that we have termed a SET according to the size of the PCR fragments they produce during electrophoresis. Each SET was designed to avoid overlap between loci during gel separations to assure accuracy when scoring genotypes. We have constructed 15 SETS of markers. Three SETS, each labelled with one of three fluors, were combined into what we have termed a GROUP, which is co-electrophoresed with internal size standards that are labelled with a fourth flour. Five GROUPS of markers were assembled that contain a total of 15 SETS of markers. Each GROUP cover 18 regions of the genome that can be detected simultaneously, since this genomic analysis system is fully compatible with automated fragment analyzers using simultaneous four-color fluorescence-based detection systems. This allows for multiplex detection and a throughput of 1,944 genotypes daily per instrument. This system will be highly beneficial in a number of clinical and research applications including: linkage, cancer genetics, forensics, and cytogenetics.

  1. Learning about the Human Genome. Part 2: Resources for Science Educators. ERIC Digest.

    ERIC Educational Resources Information Center

    Haury, David L.

    This ERIC Digest identifies how the human genome project fits into the "National Science Education Standards" and lists Human Genome Project Web sites found on the World Wide Web. It is a resource companion to "Learning about the Human Genome. Part 1: Challenge to Science Educators" (Haury 2001). The Web resources and…

  2. Genomic MRI - a Public Resource for Studying Sequence Patterns within Genomic DNA

    PubMed Central

    Prakash, Ashwin; Bechtel, Jason; Fedorov, Alexei

    2011-01-01

    Non-coding genomic regions in complex eukaryotes, including intergenic areas, introns, and untranslated segments of exons, are profoundly non-random in their nucleotide composition and consist of a complex mosaic of sequence patterns. These patterns include so-called Mid-Range Inhomogeneity (MRI) regions -- sequences 30-10000 nucleotides in length that are enriched by a particular base or combination of bases (e.g. (G+T)-rich, purine-rich, etc.). MRI regions are associated with unusual (non-B-form) DNA structures that are often involved in regulation of gene expression, recombination, and other genetic processes (Fedorova & Fedorov 2010). The existence of a strong fixation bias within MRI regions against mutations that tend to reduce their sequence inhomogeneity additionally supports the functionality and importance of these genomic sequences (Prakash et al. 2009). Here we demonstrate a freely available Internet resource -- the Genomic MRI program package -- designed for computational analysis of genomic sequences in order to find and characterize various MRI patterns within them (Bechtel et al. 2008). This package also allows generation of randomized sequences with various properties and level of correspondence to the natural input DNA sequences. The main goal of this resource is to facilitate examination of vast regions of non-coding DNA that are still scarcely investigated and await thorough exploration and recognition. PMID:21610667

  3. The Mouse Genome Database (MGD): comprehensive resource for genetics and genomics of the laboratory mouse

    PubMed Central

    Eppig, Janan T.; Blake, Judith A.; Bult, Carol J.; Kadin, James A.; Richardson, Joel E.

    2012-01-01

    The Mouse Genome Database (MGD, http://www.informatics.jax.org) is the international community resource for integrated genetic, genomic and biological data about the laboratory mouse. Data in MGD are obtained through loads from major data providers and experimental consortia, electronic submissions from laboratories and from the biomedical literature. MGD maintains a comprehensive, unified, non-redundant catalog of mouse genome features generated by distilling gene predictions from NCBI, Ensembl and VEGA. MGD serves as the authoritative source for the nomenclature of mouse genes, mutations, alleles and strains. MGD is the primary source for evidence-supported functional annotations for mouse genes and gene products using the Gene Ontology (GO). MGD provides full annotation of phenotypes and human disease associations for mouse models (genotypes) using terms from the Mammalian Phenotype Ontology and disease names from the Online Mendelian Inheritance in Man (OMIM) resource. MGD is freely accessible online through our website, where users can browse and search interactively, access data in bulk using Batch Query or BioMart, download data files or use our web services Application Programming Interface (API). Improvements to MGD include expanded genome feature classifications, inclusion of new mutant allele sets and phenotype associations and extensions of GO to include new relationships and a new stream of annotations via phylogenetic-based approaches. PMID:22075990

  4. The Mouse Genome Database (MGD): comprehensive resource for genetics and genomics of the laboratory mouse.

    PubMed

    Eppig, Janan T; Blake, Judith A; Bult, Carol J; Kadin, James A; Richardson, Joel E

    2012-01-01

    The Mouse Genome Database (MGD, http://www.informatics.jax.org) is the international community resource for integrated genetic, genomic and biological data about the laboratory mouse. Data in MGD are obtained through loads from major data providers and experimental consortia, electronic submissions from laboratories and from the biomedical literature. MGD maintains a comprehensive, unified, non-redundant catalog of mouse genome features generated by distilling gene predictions from NCBI, Ensembl and VEGA. MGD serves as the authoritative source for the nomenclature of mouse genes, mutations, alleles and strains. MGD is the primary source for evidence-supported functional annotations for mouse genes and gene products using the Gene Ontology (GO). MGD provides full annotation of phenotypes and human disease associations for mouse models (genotypes) using terms from the Mammalian Phenotype Ontology and disease names from the Online Mendelian Inheritance in Man (OMIM) resource. MGD is freely accessible online through our website, where users can browse and search interactively, access data in bulk using Batch Query or BioMart, download data files or use our web services Application Programming Interface (API). Improvements to MGD include expanded genome feature classifications, inclusion of new mutant allele sets and phenotype associations and extensions of GO to include new relationships and a new stream of annotations via phylogenetic-based approaches.

  5. IGD: a resource for intronless genes in the human genome.

    PubMed

    Louhichi, Amel; Fourati, Ahmed; Rebaï, Ahmed

    2011-11-15

    Intronless genes (IGs) fraction varies between 2.7 and 97.7% in eukaryotic genomes. Although many databases on exons and introns exist, there was no curated database for such genes which allowed their study in a concerted manner. Such a database would be useful to identify the functional features and the distribution of these genes across the genome. Here, a new database of IGs in eukaryotes based on GenBank data was described. This database, called IGD (Intronless Gene Database), is a collection of gene sequences that were annotated and curated. The current version of IGD contains 687 human intronless genes with their protein and CDS sequences. Some features of the entries are given in this paper. Data was extracted from GenBank release 183 using a Perl script. Data extraction was followed by a manual curation step. Intronless genes were then analyzed based on their RefSeq annotation and Gene Ontology functional class. IGD represents a useful resource for retrieval and in silico study of intronless genes. IGD is available at http://www.bioinfo-cbs.org/igd with comprehensive help and FAQ pages that illustrate the main uses of this resource.

  6. Development of peanut expessed sequence tag-based genomic resources and tools

    USDA-ARS?s Scientific Manuscript database

    U.S. Peanut Genome Initiative (PGI) has widely recognized the need for peanut genome tools and resources development for mitigating peanut allergens and food safety. Genomics such as Expressed Sequence Tag (EST), microarray technologies, and whole genome sequencing provides robotic tools for profili...

  7. MorusDB: a resource for mulberry genomics and genome biology.

    PubMed

    Li, Tian; Qi, Xiwu; Zeng, Qiwei; Xiang, Zhonghuai; He, Ningjia

    2014-01-01

    Mulberry is an important cultivated plant that has received the attention of biologists interested in sericulture and plant-insect interaction. Morus notabilis, a wild mulberry species with a minimal chromosome number is an ideal material for whole-genome sequencing and assembly. The genome and transcriptome of M. notabilis were sequenced and analyzed. In this article, a web-based and open-access database, the Morus Genome Database (MorusDB), was developed to enable easy-to-access and data mining. The MorusDB provides an integrated data source and an easy accession of mulberry large-scale genomic sequencing and assembly, predicted genes and functional annotations, expressed sequence tags (ESTs), transposable elements (TEs), Gene Ontology (GO) terms, horizontal gene transfers between mulberry and silkworm and ortholog and paralog groups. Transcriptome sequencing data for M. notabilis root, leaf, bark, winter bud and male flower can also be searched and downloaded. Furthermore, MorusDB provides an analytical workbench with some built-in tools and pipelines, such as BLAST, Search GO, Mulberry GO and Mulberry GBrowse, to facilitate genomic studies and comparative genomics. The MorusDB provides important genomic resources for scientists working with mulberry and other Moraceae species, which include many important fruit crops. Designed as a basic platform and accompanied by the SilkDB, MorusDB strives to be a comprehensive platform for the silkworm-mulberry interaction studies. Database URL: http://morus.swu.edu.cn/morusdb. © The Author(s) 2014. Published by Oxford University Press.

  8. Phenotypic and genomic analysis of a fast neutron mutant population resource in soybean

    USDA-ARS?s Scientific Manuscript database

    Mutagenized populations have become indispensable resources for introducing variation and studying gene function in plant genomics research. We utilized fast neutron radiation to induce deletion mutations in the soybean genome and phenotypically screened the resulting population. We exposed approxim...

  9. Genomic resources for sea lice: analysis of ESTs and mitochondrial genomes.

    PubMed

    Yasuike, Motoshige; Leong, Jong; Jantzen, Stuart G; von Schalburg, Kristian R; Nilsen, Frank; Jones, Simon R M; Koop, Ben F

    2012-04-01

    Sea lice are common parasites of both farmed and wild salmon. Salmon farming constitutes an important economic market in North America, South America, and Northern Europe. Infections with sea lice can result in significant production losses. A compilation of genomic information on different genera of sea lice is an important resource for understanding their biology as well as for the study of population genetics and control strategies. We report on over 150,000 expressed sequence tags (ESTs) from five different species (Pacific Lepeophtheirus salmonis (49,672 new ESTs in addition to 14,994 previously reported ESTs), Atlantic L. salmonis (57,349 ESTs), Caligus clemensi (14,821 ESTs), Caligus rogercresseyi (32,135 ESTs), and Lernaeocera branchialis (16,441 ESTs)). For each species, ESTs were assembled into complete or partial genes and annotated by comparisons to known proteins in public databases. In addition, whole mitochondrial (mt) genome sequences of C. clemensi (13,440 bp) and C. rogercresseyi (13,468 bp) were determined and compared to L. salmonis. Both nuclear and mtDNA genes show very high levels of sequence divergence between these ectoparastic copepods suggesting that the different species of sea lice have been in existence for 37-113 million years and that parasitic association with salmonids is also quite ancient. Our ESTs and mtDNA data provide a novel resource for the study of sea louse biology, population genetics, and control strategies. This genomic information provides the material basis for the development of a 38K sea louse microarray that can be used in conjunction with our existing 44K salmon microarray to study host-parasite interactions at the molecular level. This report represents the largest genomic resource for any copepod species to date.

  10. CNVs into the wild: screening the genomes of conifer trees (Picea spp.) reveals fewer gene copy number variations in hybrids and links to adaptation.

    PubMed

    Prunier, Julien; Caron, Sébastien; MacKay, John

    2017-01-18

    Copy number variations (CNVs) have been linked to different phenotypes in human, including many diseases. A genome-scale understanding of CNVs is available in a few plants but none are wild species, leaving a knowledge gap regarding their genome biology and evolutionary role. We developed a reliable CNV detection method for species lacking contiguous reference genome. We selected multiple probes within 14,078 gene sequences and developed comparative genome hybridization on arrays. Gene CNVs were assessed in three full-sib families from species with 20 Gb genomes, i.e., white and black spruce, and interior spruce - a natural hybrid. We discovered hundreds of gene CNVs in each species, 3612 in total, which were enriched in functions related to stress and defense responses and narrow expression profiles, indicating a potential role in adaptation. The number of shared CNVs was in accordance with the degree of relatedness between individuals and species. The genetically mapped subset of these genes showed a wide distribution across the genome, implying numerous structural variations. The hybrid family presented significantly fewer CNVs, suggesting that the admixture of two species within one genome reduces the occurrence of CNVs. The approach we developed is of particular interest in non-model species lacking a reference genome. Our findings point to a role for CNVs in adaptation. Their reduced abundance in the hybrid may limit genetic variability and evolvability of hybrids. We propose that CNVs make a qualitatively distinct contribution to adaptation which could be important for short term change.

  11. Wheat EST resources for functional genomics of abiotic stress

    PubMed Central

    Houde, Mario; Belcaid, Mahdi; Ouellet, François; Danyluk, Jean; Monroy, Antonio F; Dryanova, Ani; Gulick, Patrick; Bergeron, Anne; Laroche, André; Links, Matthew G; MacCarthy, Luke; Crosby, William L; Sarhan, Fathey

    2006-01-01

    Background Wheat is an excellent species to study freezing tolerance and other abiotic stresses. However, the sequence of the wheat genome has not been completely characterized due to its complexity and large size. To circumvent this obstacle and identify genes involved in cold acclimation and associated stresses, a large scale EST sequencing approach was undertaken by the Functional Genomics of Abiotic Stress (FGAS) project. Results We generated 73,521 quality-filtered ESTs from eleven cDNA libraries constructed from wheat plants exposed to various abiotic stresses and at different developmental stages. In addition, 196,041 ESTs for which tracefiles were available from the National Science Foundation wheat EST sequencing program and DuPont were also quality-filtered and used in the analysis. Clustering of the combined ESTs with d2_cluster and TGICL yielded a few large clusters containing several thousand ESTs that were refractory to routine clustering techniques. To resolve this problem, the sequence proximity and "bridges" were identified by an e-value distance graph to manually break clusters into smaller groups. Assembly of the resolved ESTs generated a 75,488 unique sequence set (31,580 contigs and 43,908 singletons/singlets). Digital expression analyses indicated that the FGAS dataset is enriched in stress-regulated genes compared to the other public datasets. Over 43% of the unique sequence set was annotated and classified into functional categories according to Gene Ontology. Conclusion We have annotated 29,556 different sequences, an almost 5-fold increase in annotated sequences compared to the available wheat public databases. Digital expression analysis combined with gene annotation helped in the identification of several pathways associated with abiotic stress. The genomic resources and knowledge developed by this project will contribute to a better understanding of the different mechanisms that govern stress tolerance in wheat and other cereals. PMID

  12. Wheat EST resources for functional genomics of abiotic stress.

    PubMed

    Houde, Mario; Belcaid, Mahdi; Ouellet, François; Danyluk, Jean; Monroy, Antonio F; Dryanova, Ani; Gulick, Patrick; Bergeron, Anne; Laroche, André; Links, Matthew G; MacCarthy, Luke; Crosby, William L; Sarhan, Fathey

    2006-06-13

    Wheat is an excellent species to study freezing tolerance and other abiotic stresses. However, the sequence of the wheat genome has not been completely characterized due to its complexity and large size. To circumvent this obstacle and identify genes involved in cold acclimation and associated stresses, a large scale EST sequencing approach was undertaken by the Functional Genomics of Abiotic Stress (FGAS) project. We generated 73,521 quality-filtered ESTs from eleven cDNA libraries constructed from wheat plants exposed to various abiotic stresses and at different developmental stages. In addition, 196,041 ESTs for which tracefiles were available from the National Science Foundation wheat EST sequencing program and DuPont were also quality-filtered and used in the analysis. Clustering of the combined ESTs with d2_cluster and TGICL yielded a few large clusters containing several thousand ESTs that were refractory to routine clustering techniques. To resolve this problem, the sequence proximity and "bridges" were identified by an e-value distance graph to manually break clusters into smaller groups. Assembly of the resolved ESTs generated a 75,488 unique sequence set (31,580 contigs and 43,908 singletons/singlets). Digital expression analyses indicated that the FGAS dataset is enriched in stress-regulated genes compared to the other public datasets. Over 43% of the unique sequence set was annotated and classified into functional categories according to Gene Ontology. We have annotated 29,556 different sequences, an almost 5-fold increase in annotated sequences compared to the available wheat public databases. Digital expression analysis combined with gene annotation helped in the identification of several pathways associated with abiotic stress. The genomic resources and knowledge developed by this project will contribute to a better understanding of the different mechanisms that govern stress tolerance in wheat and other cereals.

  13. openSNP–A Crowdsourced Web Resource for Personal Genomics

    PubMed Central

    Greshake, Bastian; Bayer, Philipp E.; Rausch, Helge; Reda, Julia

    2014-01-01

    Genome-Wide Association Studies are widely used to correlate phenotypic traits with genetic variants. These studies usually compare the genetic variation between two groups to single out certain Single Nucleotide Polymorphisms (SNPs) that are linked to a phenotypic variation in one of the groups. However, it is necessary to have a large enough sample size to find statistically significant correlations. Direct-To-Consumer (DTC) genetic testing can supply additional data: DTC-companies offer the analysis of a large amount of SNPs for an individual at low cost without the need to consult a physician or geneticist. Over 100,000 people have already been genotyped through Direct-To-Consumer genetic testing companies. However, this data is not public for a variety of reasons and thus cannot be used in research. It seems reasonable to create a central open data repository for such data. Here we present the web platform openSNP, an open database which allows participants of Direct-To-Consumer genetic testing to publish their genetic data at no cost along with phenotypic information. Through this crowdsourced effort of collecting genetic and phenotypic information, openSNP has become a resource for a wide area of studies, including Genome-Wide Association Studies. openSNP is hosted at http://www.opensnp.org, and the code is released under MIT-license at http://github.com/gedankenstuecke/snpr. PMID:24647222

  14. Microbial Genome Analysis and Comparisons: Web-based Protocols and Resources

    USDA-ARS?s Scientific Manuscript database

    Fully annotated genome sequences of many microorganisms are publicly available as a resource. However, in-depth analysis of these genomes using specialized tools is required to derive meaningful information. We describe here the utility of three powerful publicly available genome databases and ana...

  15. Management of Westside Washington Conifer Stands Infected with Heterobasidion annosum

    Treesearch

    Elvira Young

    1989-01-01

    Timber stands in western Washington are managed for many purposes. Such stands yield commodity and noncommodity outputs, such as recreation, wildlife, visual resources, or watershed benefits, and often from the same stands. Annosus root disease is widespread in many mixed conifer stands within the western hemlock, mountain hemlock, and Pacific silver fir zones west of...

  16. Mouse Genome Informatics (MGI): Resources for Mining Mouse Genetic, Genomic, and Biological Data in Support of Primary and Translational Research.

    PubMed

    Eppig, Janan T; Smith, Cynthia L; Blake, Judith A; Ringwald, Martin; Kadin, James A; Richardson, Joel E; Bult, Carol J

    2017-01-01

    The Mouse Genome Informatics (MGI), resource ( www.informatics.jax.org ) has existed for over 25 years, and over this time its data content, informatics infrastructure, and user interfaces and tools have undergone dramatic changes (Eppig et al., Mamm Genome 26:272-284, 2015). Change has been driven by scientific methodological advances, rapid improvements in computational software, growth in computer hardware capacity, and the ongoing collaborative nature of the mouse genomics community in building resources and sharing data. Here we present an overview of the current data content of MGI, describe its general organization, and provide examples using simple and complex searches, and tools for mining and retrieving sets of data.

  17. The Aspergillus Genome Database (AspGD): recent developments in comprehensive multispecies curation, comparative genomics and community resources.

    PubMed

    Arnaud, Martha B; Cerqueira, Gustavo C; Inglis, Diane O; Skrzypek, Marek S; Binkley, Jonathan; Chibucos, Marcus C; Crabtree, Jonathan; Howarth, Clinton; Orvis, Joshua; Shah, Prachi; Wymore, Farrell; Binkley, Gail; Miyasato, Stuart R; Simison, Matt; Sherlock, Gavin; Wortman, Jennifer R

    2012-01-01

    The Aspergillus Genome Database (AspGD; http://www.aspgd.org) is a freely available, web-based resource for researchers studying fungi of the genus Aspergillus, which includes organisms of clinical, agricultural and industrial importance. AspGD curators have now completed comprehensive review of the entire published literature about Aspergillus nidulans and Aspergillus fumigatus, and this annotation is provided with streamlined, ortholog-based navigation of the multispecies information. AspGD facilitates comparative genomics by providing a full-featured genomics viewer, as well as matched and standardized sets of genomic information for the sequenced aspergilli. AspGD also provides resources to foster interaction and dissemination of community information and resources. We welcome and encourage feedback at aspergillus-curator@lists.stanford.edu.

  18. The Aspergillus Genome Database, a curated comparative genomics resource for gene, protein and sequence information for the Aspergillus research community.

    PubMed

    Arnaud, Martha B; Chibucos, Marcus C; Costanzo, Maria C; Crabtree, Jonathan; Inglis, Diane O; Lotia, Adil; Orvis, Joshua; Shah, Prachi; Skrzypek, Marek S; Binkley, Gail; Miyasato, Stuart R; Wortman, Jennifer R; Sherlock, Gavin

    2010-01-01

    The Aspergillus Genome Database (AspGD) is an online genomics resource for researchers studying the genetics and molecular biology of the Aspergilli. AspGD combines high-quality manual curation of the experimental scientific literature examining the genetics and molecular biology of Aspergilli, cutting-edge comparative genomics approaches to iteratively refine and improve structural gene annotations across multiple Aspergillus species, and web-based research tools for accessing and exploring the data. All of these data are freely available at http://www.aspgd.org. We welcome feedback from users and the research community at aspergillus-curator@genome.stanford.edu.

  19. CucCAP - Developing genomic resources for the cucurbit community

    USDA-ARS?s Scientific Manuscript database

    The U.S. cucurbit community has initiated a USDA-SCRI funded cucurbit genomics project, CucCAP: Leveraging applied genomics to increase disease resistance in cucurbit crops. Our primary objectives are: develop genomic and bioinformatic breeding tool kits for accelerated crop improvement across the...

  20. Genomic resources and their influence on the detection of the signal of positive selection in genome scans.

    PubMed

    Manel, S; Perrier, C; Pratlong, M; Abi-Rached, L; Paganini, J; Pontarotti, P; Aurelle, D

    2016-01-01

    Genome scans represent powerful approaches to investigate the action of natural selection on the genetic variation of natural populations and to better understand local adaptation. This is very useful, for example, in the field of conservation biology and evolutionary biology. Thanks to Next Generation Sequencing, genomic resources are growing exponentially, improving genome scan analyses in non-model species. Thousands of SNPs called using Reduced Representation Sequencing are increasingly used in genome scans. Besides, genome sequences are also becoming increasingly available, allowing better processing of short-read data, offering physical localization of variants, and improving haplotype reconstruction and data imputation. Ultimately, genome sequences are also becoming the raw material for selection inferences. Here, we discuss how the increasing availability of such genomic resources, notably genome sequences, influences the detection of signals of selection. Mainly, increasing data density and having the information of physical linkage data expand genome scans by (i) improving the overall quality of the data, (ii) helping the reconstruction of demographic history for the population studied to decrease false-positive rates and (iii) improving the statistical power of methods to detect the signal of selection. Of particular importance, the availability of a high-quality reference genome can improve the detection of the signal of selection by (i) allowing matching the potential candidate loci to linked coding regions under selection, (ii) rapidly moving the investigation to the gene and function and (iii) ensuring that the highly variable regions of the genomes that include functional genes are also investigated. For all those reasons, using reference genomes in genome scan analyses is highly recommended. © 2015 John Wiley & Sons Ltd.

  1. The Human OligoGenome Resource: a database of oligonucleotide capture probes for resequencing target regions across the human genome.

    PubMed

    Newburger, Daniel E; Natsoulis, Georges; Grimes, Sue; Bell, John M; Davis, Ronald W; Batzoglou, Serafim; Ji, Hanlee P

    2012-01-01

    Recent exponential growth in the throughput of next-generation DNA sequencing platforms has dramatically spurred the use of accessible and scalable targeted resequencing approaches. This includes candidate region diagnostic resequencing and novel variant validation from whole genome or exome sequencing analysis. We have previously demonstrated that selective genomic circularization is a robust in-solution approach for capturing and resequencing thousands of target human genome loci such as exons and regulatory sequences. To facilitate the design and production of customized capture assays for any given region in the human genome, we developed the Human OligoGenome Resource (http://oligogenome.stanford.edu/). This online database contains over 21 million capture oligonucleotide sequences. It enables one to create customized and highly multiplexed resequencing assays of target regions across the human genome and is not restricted to coding regions. In total, this resource provides 92.1% in silico coverage of the human genome. The online server allows researchers to download a complete repository of oligonucleotide probes and design customized capture assays to target multiple regions throughout the human genome. The website has query tools for selecting and evaluating capture oligonucleotides from specified genomic regions.

  2. An EST dataset for Metasequoia glyptostroboides buds: the first EST resource for molecular genomics studies in Metasequoia.

    PubMed

    Zhao, Ying; Thammannagowda, Shivegowda; Staton, Margaret; Tang, Sha; Xia, Xinli; Yin, Weilun; Liang, Haiying

    2013-03-01

    The "living fossil" Metasequoia glyptostroboides Hu et Cheng, commonly known as dawn redwood or Chinese redwood, is the only living species in the genus and is valued for its essential oil and crude extracts that have great potential for anti-fungal activity. Despite its paleontological significance and economical value as a rare relict species, genomic resources of Metasequoia are very limited. In order to gain insight into the molecular mechanisms behind the formation of reproductive buds and the transition from vegetative phase to reproductive phase in Metasequoia, we performed sequencing of expressed sequence tags from Metasequoia vegetative buds and female buds. By using the 454 pyrosequencing technology, a total of 1,571,764 high-quality reads were generated, among which 733,128 were from vegetative buds and 775,636 were from female buds. These EST reads were clustered and assembled into 114,124 putative unique transcripts (PUTs) with an average length of 536 bp. The 97,565 PUTs that were at least 100 bp in length were functionally annotated by a similarity search against public databases and assigned with Gene Ontology (GO) terms. A total of 59 known floral gene families and 190 isotigs involved in hormone regulation were captured in the dataset. Furthermore, a set of PUTs differentially expressed in vegetative and reproductive buds, as well as SSR motifs and high confidence SNPs, were identified. This is the first large-scale expressed sequence tags ever generated in Metasequoia and the first evidence for floral genes in this critically endangered deciduous conifer species.

  3. GDR (Genome Database for Rosaceae): integrated web resources for Rosaceae genomics and genetics research

    PubMed Central

    Jung, Sook; Jesudurai, Christopher; Staton, Margaret; Du, Zhidian; Ficklin, Stephen; Cho, Ilhyung; Abbott, Albert; Tomkins, Jeffrey; Main, Dorrie

    2004-01-01

    Background Peach is being developed as a model organism for Rosaceae, an economically important family that includes fruits and ornamental plants such as apple, pear, strawberry, cherry, almond and rose. The genomics and genetics data of peach can play a significant role in the gene discovery and the genetic understanding of related species. The effective utilization of these peach resources, however, requires the development of an integrated and centralized database with associated analysis tools. Description The Genome Database for Rosaceae (GDR) is a curated and integrated web-based relational database. GDR contains comprehensive data of the genetically anchored peach physical map, an annotated peach EST database, Rosaceae maps and markers and all publicly available Rosaceae sequences. Annotations of ESTs include contig assembly, putative function, simple sequence repeats, and anchored position to the peach physical map where applicable. Our integrated map viewer provides graphical interface to the genetic, transcriptome and physical mapping information. ESTs, BACs and markers can be queried by various categories and the search result sites are linked to the integrated map viewer or to the WebFPC physical map sites. In addition to browsing and querying the database, users can compare their sequences with the annotated GDR sequences via a dedicated sequence similarity server running either the BLAST or FASTA algorithm. To demonstrate the utility of the integrated and fully annotated database and analysis tools, we describe a case study where we anchored Rosaceae sequences to the peach physical and genetic map by sequence similarity. Conclusions The GDR has been initiated to meet the major deficiency in Rosaceae genomics and genetics research, namely a centralized web database and bioinformatics tools for data storage, analysis and exchange. GDR can be accessed at . PMID:15357877

  4. Response mechanisms of conifers to air pollutants

    SciTech Connect

    Matyssek, R.; Reich, P.; Oren, R.; Winner, W.E.

    1995-07-01

    Conifers are known to respond to SO{sub 2}, O{sub 3}, NO{sub x} and acid deposition. Of these pollutants, O{sub 3} is likely the most widespread and phytotoxic compound, and therefore of great interest to individuals concerned with forest resources Direct biological responses have a toxicological effects on metabolism which can then scale to effects on tree growth and forest ecology, including processes of competition and succession. Air pollution can cause reductions in photosynthesis and stomatal conductance, which are the physiological parameters most rigorously studied for conifers. Some effects air pollutants can have on plants are influenced by the presence of co-occurring environmental stresses. For example, drought usually reduces vulnerability of plants to air pollution. In addition, air pollution sensitivity may differ among species and with plant/leaf age. Plants may make short-term physiological adjustments to compensate for air pollution or may evolve resistance to air pollution through the processes of selection. Models are necessary to understand how physiological processes, growth processes, and ecological processes are affected by air pollutants. The process of defining the ecological risk that air pollutants pose for coniferous forests requires approaches that exploit existing databases, environmental monitoring of air pollutants and forest resources, experiments with well-defined air pollution treatments and environmental control/monitoring, modeling, predicting air pollution-caused changes in productivity and ecological processes over time and space, and integration of social values.

  5. The 19 Genomes of Drosophila: A BAC Library Resource for Genus-Wide and Genome-Scale Comparative Evolutionary Research

    PubMed Central

    Song, Xiang; Goicoechea, Jose Luis; Ammiraju, Jetty S. S.; Luo, Meizhong; He, Ruifeng; Lin, Jinke; Lee, So-Jeong; Sisneros, Nicholas; Watts, Tom; Kudrna, David A.; Golser, Wolfgang; Ashley, Elizabeth; Collura, Kristi; Braidotti, Michele; Yu, Yeisoo; Matzkin, Luciano M.; McAllister, Bryant F.; Markow, Therese Ann; Wing, Rod A.

    2011-01-01

    The genus Drosophila has been the subject of intense comparative phylogenomics characterization to provide insights into genome evolution under diverse biological and ecological contexts and to functionally annotate the Drosophila melanogaster genome, a model system for animal and insect genetics. Recent sequencing of 11 additional Drosophila species from various divergence points of the genus is a first step in this direction. However, to fully reap the benefits of this resource, the Drosophila community is faced with two critical needs: i.e., the expansion of genomic resources from a much broader range of phylogenetic diversity and the development of additional resources to aid in finishing the existing draft genomes. To address these needs, we report the first synthesis of a comprehensive set of bacterial artificial chromosome (BAC) resources for 19 Drosophila species from all three subgenera. Ten libraries were derived from the exact source used to generate 10 of the 12 draft genomes, while the rest were generated from a strategically selected set of species on the basis of salient ecological and life history features and their phylogenetic positions. The majority of the new species have at least one sequenced reference genome for immediate comparative benefit. This 19-BAC library set was rigorously characterized and shown to have large insert sizes (125–168 kb), low nonrecombinant clone content (0.3–5.3%), and deep coverage (9.1–42.9×). Further, we demonstrated the utility of this BAC resource for generating physical maps of targeted loci, refining draft sequence assemblies and identifying potential genomic rearrangements across the phylogeny. PMID:21321134

  6. The 19 genomes of Drosophila: a BAC library resource for genus-wide and genome-scale comparative evolutionary research.

    PubMed

    Song, Xiang; Goicoechea, Jose Luis; Ammiraju, Jetty S S; Luo, Meizhong; He, Ruifeng; Lin, Jinke; Lee, So-Jeong; Sisneros, Nicholas; Watts, Tom; Kudrna, David A; Golser, Wolfgang; Ashley, Elizabeth; Collura, Kristi; Braidotti, Michele; Yu, Yeisoo; Matzkin, Luciano M; McAllister, Bryant F; Markow, Therese Ann; Wing, Rod A

    2011-04-01

    The genus Drosophila has been the subject of intense comparative phylogenomics characterization to provide insights into genome evolution under diverse biological and ecological contexts and to functionally annotate the Drosophila melanogaster genome, a model system for animal and insect genetics. Recent sequencing of 11 additional Drosophila species from various divergence points of the genus is a first step in this direction. However, to fully reap the benefits of this resource, the Drosophila community is faced with two critical needs: i.e., the expansion of genomic resources from a much broader range of phylogenetic diversity and the development of additional resources to aid in finishing the existing draft genomes. To address these needs, we report the first synthesis of a comprehensive set of bacterial artificial chromosome (BAC) resources for 19 Drosophila species from all three subgenera. Ten libraries were derived from the exact source used to generate 10 of the 12 draft genomes, while the rest were generated from a strategically selected set of species on the basis of salient ecological and life history features and their phylogenetic positions. The majority of the new species have at least one sequenced reference genome for immediate comparative benefit. This 19-BAC library set was rigorously characterized and shown to have large insert sizes (125-168 kb), low nonrecombinant clone content (0.3-5.3%), and deep coverage (9.1-42.9×). Further, we demonstrated the utility of this BAC resource for generating physical maps of targeted loci, refining draft sequence assemblies and identifying potential genomic rearrangements across the phylogeny.

  7. A collection of plant-specific genomic data and resources at NCBI.

    PubMed

    Tatusova, Tatiana; Smith-White, Brian; Ostell, James

    2007-01-01

    The National Center for Biotechnology Information (NCBI) provides a data-rich environment in support of genomic research by collecting the biological data for genomes, genes, gene expressions, gene variation, gene families, proteins, and protein domains and integrating the data with analytical, search, and retrieval resources through the NCBI Web site. Entrez, an integrated search and retrieval system, enables text searches across various diverse biological databases maintained at NCBI. Map Viewer, the genome browser developed at NCBI, displays aligned genetic, physical, and sequence maps for eukaryotic genomes including those of many plants. A specialized plant query page allows maps from all plant genomes available in the Map Viewer to be searched to produce a display of aligned maps from several species. Customized Plant Basic Local Alignment Search Tool (PlantBLAST) allows the user to perform sequence similarity searches in a special collection of mapped plant sequence data and to view the resulting alignments within a genomic context using Map Viewer. In addition, pre-computed sequence similarities, such as those for proteins offered by BLAST Link (BLink), enable fluid navigation from un-annotated to annotated sequences, quickening the pace of discovery. Plant Genome Central (PGC) is a Web portal that provides centralized access to all NCBI plant genome resources. Also, there are links to plant-specific Web resources external to NCBI such as organism-specific databases, genome-sequencing project Web pages, and homepages of genomic bioinformatics organizations.

  8. Seasonal and age-related variation in the needle quality of five conifer species.

    PubMed

    Hatcher, Paul E

    1990-12-01

    Age changes of foliage resource quality (water, nitrogen, fibre, phenolics and toughness) were studied in five species of conifer (Pinus sylvestris L.), Picea abies (L.) Karsten, Tsuga heterophylla (Rafinesque) Sargent (all Pinaceae), Chamaecyparis lawsonian (Murray) Parlatore and Thuja plicata D. Don (both Cupressaceae) over a 2-year period in an English forest.Mature foliage of Pinus sylvestris was characterized by higher levels of nitrogen, fibre and toughness, and lower phenolics, and that of Tsuga heterophylla by higher levels of phenolics, and lower fibre and toughness levels, than the mature needles of the other species studied. Immature needles had higher levels of water and nitrogen, and lower levels of fibre and toughness, than older needles. Immature needles of Picea abies and Tsuga heterophylla had a high concentration of phenolics, which decreased with needle maturity. By mid-August, the levels of most of the foliar constituents in current-year needles had stabilized at levels maintained for the next year. Sampling revealed a fall in the concentration of phenolics, fibre and water in mature needles between March and June. Possible reasons for this seasonal trend are discussed. The levels of conifer foliar constituents were compared with levels recorded in broadleaf trees. Conifers had greater concentrations of all measured foliar constituents, but, with the exception of the six fold greater toughness of conifer needles, the differences between broadleaves and conifers were less than those between the immature and mature conifer needles. Previous studies have related the abundance of Lepidoptera on conifers to hostplant taxonomic relationships. However, the foliar constituents measured in this study were poor predictors of taxonomic relationships between the conifers. It is suggested that the abundance of Lepidoptera on conifers is not determined by levels of general foliar constituents and the role of other hostplant factors in shaping lepidopteran

  9. Community standards for genomic resources, genetic conservation, and data integration

    Treesearch

    Jill Wegrzyn; Meg Staton; Emily Grau; Richard Cronn; C. Dana. Nelson

    2017-01-01

    Genetics and genomics are increasingly important in forestry management and conservation. Next generation sequencing can increase analytical power, but still relies on building on the structure of previously acquired data. Data standards and data sharing allow the community to maximize the analytical power of high throughput genomics data. The landscape of incomplete...

  10. Whole genome sequences in pulse crops: a global community resource to expedite translational genomics and knowledge-based crop improvement.

    PubMed

    Bohra, Abhishek; Singh, Narendra P

    2015-08-01

    Unprecedented developments in legume genomics over the last decade have resulted in the acquisition of a wide range of modern genomic resources to underpin genetic improvement of grain legumes. The genome enabled insights direct investigators in various ways that primarily include unearthing novel structural variations, retrieving the lost genetic diversity, introducing novel/exotic alleles from wider gene pools, finely resolving the complex quantitative traits and so forth. To this end, ready availability of cost-efficient and high-density genotyping assays allows genome wide prediction to be increasingly recognized as the key selection criterion in crop breeding. Further, the high-dimensional measurements of agronomically significant phenotypes obtained by using new-generation screening techniques will empower reference based resequencing as well as allele mining and trait mapping methods to comprehensively associate genome diversity with the phenome scale variation. Besides stimulating the forward genetic systems, accessibility to precisely delineated genomic segments reveals novel candidates for reverse genetic techniques like targeted genome editing. The shifting paradigm in plant genomics in turn necessitates optimization of crop breeding strategies to enable the most efficient integration of advanced omics knowledge and tools. We anticipate that the crop improvement schemes will be bolstered remarkably with rational deployment of these genome-guided approaches, ultimately resulting in expanded plant breeding capacities and improved crop performance.

  11. Integrating Genomic Resources with Electronic Health Records using the HL7 Infobutton Standard

    PubMed Central

    Overby, Casey Lynnette; Del Fiol, Guilherme; Rubinstein, Wendy S.; Maglott, Donna R.; Nelson, Tristan H.; Milosavljevic, Aleksandar; Martin, Christa L.; Goehringer, Scott R.; Freimuth, Robert R.; Williams, Marc S.

    2016-01-01

    Summary Background The Clinical Genome Resource (ClinGen) Electronic Health Record (EHR) Workgroup aims to integrate ClinGen resources with EHRs. A promising option to enable this integration is through the Health Level Seven (HL7) Infobutton Standard. EHR systems that are certified according to the US Meaningful Use program provide HL7-compliant infobutton capabilities, which can be leveraged to support clinical decision-making in genomics. Objectives To integrate genomic knowledge resources using the HL7 infobutton standard. Two tactics to achieve this objective were: (1) creating an HL7-compliant search interface for ClinGen, and (2) proposing guidance for genomic resources on achieving HL7 Infobutton standard accessibility and compliance. Methods We built a search interface utilizing OpenInfobutton, an open source reference implementation of the HL7 Infobutton standard. ClinGen resources were assessed for readiness towards HL7 compliance. Finally, based upon our experiences we provide recommendations for publishers seeking to achieve HL7 compliance. Results Eight genomic resources and two sub-resources were integrated with the ClinGen search engine via OpenInfobutton and the HL7 infobutton standard. Resources we assessed have varying levels of readiness towards HL7-compliance. Furthermore, we found that adoption of standard terminologies used by EHR systems is the main gap to achieve compliance. Conclusion Genomic resources can be integrated with EHR systems via the HL7 Infobutton standard using OpenInfobutton. Full compliance of genomic resources with the Infobutton standard would further enhance interoperability with EHR systems. PMID:27579472

  12. Genomic resources for valuable woody ornamental landscape plants such as hydrangea macrophylla

    USDA-ARS?s Scientific Manuscript database

    New and improved ornamental landscape plants are typically produced by conventional breeding. Unfortunately, long generation times for woody plants can significantly slow progress. Incorporating genomic and biotechnology resources provides more information when designing breeding strategies and ac...

  13. Conifers have a unique small RNA silencing signature

    PubMed Central

    Dolgosheina, Elena V.; Morin, Ryan D.; Aksay, Gozde; Sahinalp, S. Cenk; Magrini, Vincent; Mardis, Elaine R.; Mattsson, Jim; Unrau, Peter J.

    2008-01-01

    Plants produce small RNAs to negatively regulate genes, viral nucleic acids, and repetitive elements at either the transcriptional or post-transcriptional level in a process that is referred to as RNA silencing. While RNA silencing has been extensively studied across the different phyla of the animal kingdom (e.g., mouse, fly, worm), similar studies in the plant kingdom have focused primarily on angiosperms, thus limiting evolutionary studies of RNA silencing in plants. Here we report on an unexpected phylogenetic difference in the size distribution of small RNAs among the vascular plants. By extracting total RNA from freshly growing shoot tissue, we conducted a survey of small RNAs in 24 vascular plant species. We find that conifers, which radiated from the other seed-bearing plants ∼260 million years ago, fail to produce significant amounts of 24-nucleotide (nt) RNAs that are known to guide DNA methylation and heterochromatin formation in angiosperms. Instead, they synthesize a diverse population of small RNAs that are exactly 21-nt long. This finding was confirmed by high-throughput sequencing of the small RNA sequences from a conifer, Pinus contorta. A conifer EST search revealed the presence of a novel Dicer-like (DCL) family, which may be responsible for the observed change in small RNA expression. No evidence for DCL3, an enzyme that matures 24-nt RNAs in angiosperms, was found. We hypothesize that the diverse class of 21-nt RNAs found in conifers may help to maintain organization of their unusually large genomes. PMID:18566193

  14. The Drosophila genome nexus: a population genomic resource of 623 Drosophila melanogaster genomes, including 197 from a single ancestral range population.

    PubMed

    Lack, Justin B; Cardeno, Charis M; Crepeau, Marc W; Taylor, William; Corbett-Detig, Russell B; Stevens, Kristian A; Langley, Charles H; Pool, John E

    2015-04-01

    Hundreds of wild-derived Drosophila melanogaster genomes have been published, but rigorous comparisons across data sets are precluded by differences in alignment methodology. The most common approach to reference-based genome assembly is a single round of alignment followed by quality filtering and variant detection. We evaluated variations and extensions of this approach and settled on an assembly strategy that utilizes two alignment programs and incorporates both substitutions and short indels to construct an updated reference for a second round of mapping prior to final variant detection. Utilizing this approach, we reassembled published D. melanogaster population genomic data sets and added unpublished genomes from several sub-Saharan populations. Most notably, we present aligned data from phase 3 of the Drosophila Population Genomics Project (DPGP3), which provides 197 genomes from a single ancestral range population of D. melanogaster (from Zambia). The large sample size, high genetic diversity, and potentially simpler demographic history of the DPGP3 sample will make this a highly valuable resource for fundamental population genetic research. The complete set of assemblies described here, termed the Drosophila Genome Nexus, presently comprises 623 consistently aligned genomes and is publicly available in multiple formats with supporting documentation and bioinformatic tools. This resource will greatly facilitate population genomic analysis in this model species by reducing the methodological differences between data sets. Copyright © 2015 by the Genetics Society of America.

  15. Comprehensive Resources for Tomato Functional Genomics Based on the Miniature Model Tomato Micro-Tom

    PubMed Central

    Matsukura, C; Aoki, K; Fukuda, N; Mizoguchi, T; Asamizu, E; Saito, T; Shibata, D; Ezura, H

    2008-01-01

    Tomato (Solanum lycopersicum L., Solanaceae) is an excellent model plant for genomic research of solanaceous plants, as well as for studying the development, ripening, and metabolism of fruit. In 2003, the International Solanaceae Project (SOL, www.sgn.cornell.edu ) was initiated by members from more than 30 countries, and the tomato genome-sequencing project is currently underway. Genome sequence of tomato obtained by this project will provide a firm foundation for forthcoming genomic studies such as the comparative analysis of genes conserved among the Solanaceae species and the elucidation of the functions of unknown tomato genes. To exploit the wealth of the genome sequence information, there is an urgent need for novel resources and analytical tools for tomato functional genomics. Here, we present an overview of the development of genetic and genomic resources of tomato in the last decade, with a special focus on the activities of Japan SOL and the National Bio-Resource Project in the development of functional genomic resources of a model cultivar, Micro-Tom. PMID:19506732

  16. Phenotypic and genomic analyses of a fast neutron mutant population resource in soybean

    USDA-ARS?s Scientific Manuscript database

    Mutagenized populations have become indispensable resources for introducing variation and studying gene function in plant genomics research. In this study, fast neutron (FN) radiation was used to induce deletion mutations in the soybean (Glycine max (L.) Merrill) genome. Approximately 120,000 soybea...

  17. Resources to increase genetics and genomics capacity of oncology nurses.

    PubMed

    Aiello, Lisa B

    2015-03-01

    Since the completion of the Human Genome Project (HGP) in 2003, the understanding of genetics and its influence on disease, particularly cancer, has increased dramatically. The initial focus after the completion of HGP was on identifying single-gene disorders, such as many hereditary cancer syndromes (e.g., BRCA1, BRCA2, HNPCC). As research continues, the major impact that genetics and genomics have across the healthcare continuum is only beginning to become clear.

  18. Megx.net: integrated database resource for marine ecological genomics

    PubMed Central

    Kottmann, Renzo; Kostadinov, Ivalyo; Duhaime, Melissa Beth; Buttigieg, Pier Luigi; Yilmaz, Pelin; Hankeln, Wolfgang; Waldmann, Jost; Glöckner, Frank Oliver

    2010-01-01

    Megx.net is a database and portal that provides integrated access to georeferenced marker genes, environment data and marine genome and metagenome projects for microbial ecological genomics. All data are stored in the Microbial Ecological Genomics DataBase (MegDB), which is subdivided to hold both sequence and habitat data and global environmental data layers. The extended system provides access to several hundreds of genomes and metagenomes from prokaryotes and phages, as well as over a million small and large subunit ribosomal RNA sequences. With the refined Genes Mapserver, all data can be interactively visualized on a world map and statistics describing environmental parameters can be calculated. Sequence entries have been curated to comply with the proposed minimal standards for genomes and metagenomes (MIGS/MIMS) of the Genomic Standards Consortium. Access to data is facilitated by Web Services. The updated megx.net portal offers microbial ecologists greatly enhanced database content, and new features and tools for data analysis, all of which are freely accessible from our webpage http://www.megx.net. PMID:19858098

  19. Snow catch by conifer crowns

    Treesearch

    Donald R. Satterlund; Harold F. Haupt

    1967-01-01

    Study of interception storage of snow by two species of sapling conifers in northern Idaho revealed that cumulative snow catch follows the classical law of autocatakinetic growth, or [equation - see PDF] where I, is interception storage, e is the interception storage capacity of the tree, e is the base of the natural logarithm, k is a constant expressing the rate of...

  20. Scleroderris Canker of Northern Conifers

    Treesearch

    Darroll D. Skilling; James T. O' Brien; James A. Bell

    1979-01-01

    Scleroderris canker, caused by the fungus Gremmeniella abietina-Scleroderris lagerbergii (Lagerb.) Morelet, has caused extensive mortality in conifer plantations and forest nurseries in the northeast and north central United States and eastern Canada. Two strains of the fungus are known in North America. The Lake States strain, present throughout northeastern North...

  1. Mistletoes of North American conifers

    Treesearch

    Brian W. Geils; Jose Cibrian-Tovar; Benjamin Moody

    2002-01-01

    Mistletoes of the families Loranthaceae and Viscaceae are the most important vascular plant parasites of conifers in Canada, the United States, and Mexico. Species of the genera Psittacanthus, Phoradendron, and Arceuthobium cause the greatest economic and ecological impacts. These shrubby, aerial parasites...

  2. Transcriptome profiling in conifers and the PiceaGenExpress database show patterns of diversification within gene families and interspecific conservation in vascular gene expression

    PubMed Central

    2012-01-01

    Background Conifers have very large genomes (13 to 30 Gigabases) that are mostly uncharacterized although extensive cDNA resources have recently become available. This report presents a global overview of transcriptome variation in a conifer tree and documents conservation and diversity of gene expression patterns among major vegetative tissues. Results An oligonucleotide microarray was developed from Picea glauca and P. sitchensis cDNA datasets. It represents 23,853 unique genes and was shown to be suitable for transcriptome profiling in several species. A comparison of secondary xylem and phelloderm tissues showed that preferential expression in these vascular tissues was highly conserved among Picea spp. RNA-Sequencing strongly confirmed tissue preferential expression and provided a robust validation of the microarray design. A small database of transcription profiles called PiceaGenExpress was developed from over 150 hybridizations spanning eight major tissue types. In total, transcripts were detected for 92% of the genes on the microarray, in at least one tissue. Non-annotated genes were predominantly expressed at low levels in fewer tissues than genes of known or predicted function. Diversity of expression within gene families may be rapidly assessed from PiceaGenExpress. In conifer trees, dehydrins and late embryogenesis abundant (LEA) osmotic regulation proteins occur in large gene families compared to angiosperms. Strong contrasts and low diversity was observed in the dehydrin family, while diverse patterns suggested a greater degree of diversification among LEAs. Conclusion Together, the oligonucleotide microarray and the PiceaGenExpress database represent the first resource of this kind for gymnosperm plants. The spruce transcriptome analysis reported here is expected to accelerate genetic studies in the large and important group comprised of conifer trees. PMID:22931377

  3. Cone size is related to branching architecture in conifers.

    PubMed

    Leslie, Andrew B; Beaulieu, Jeremy M; Crane, Peter R; Donoghue, Michael J

    2014-09-01

    The relationship between branch diameter and leaf size has been widely used to understand how vegetative resources are allocated in plants. Branching architecture influences reproductive allocation as well, but fewer studies have explored this relationship at broad phylogenetic or ecological scales. In this study, we tested whether pollen-producing and seed-producing cone size scales with branch diameter in conifers, a diverse and globally distributed lineage of nonflowering seed plants. Branch diameter and cone size were analyzed using multiple regression models and evolutionary models of trait evolution for a data set of 293 extant conifer species within an explicit phylogenetic framework. Branch diameter is a strong predictor of cone size across conifer species, particularly for pollen cones and dry seed cones. However, these relationships are complex in detail because leaf morphology and seed dispersal biology influence the specific ways in which they are expressed. The ubiquity and strength of these scaling relationships across conifers suggest that reproductive and vegetative morphologies are coupled in the group, and it is therefore difficult to disentangle the evolution of cone size from the evolution of branching architecture.

  4. Using an online genome resource to identify myostatin variation in U.S. sheep

    USDA-ARS?s Scientific Manuscript database

    We created a public, searchable DNA sequence resource for sheep that contained approximately 14x whole genome sequence of 96 rams. The animals represent 10 popular U.S. breeds and share minimal pedigree relationships, making the resource suitable for viewing gene variants in the user-friendly Integ...

  5. InsectBase: a resource for insect genomes and transcriptomes.

    PubMed

    Yin, Chuanlin; Shen, Gengyu; Guo, Dianhao; Wang, Shuping; Ma, Xingzhou; Xiao, Huamei; Liu, Jinding; Zhang, Zan; Liu, Ying; Zhang, Yiqun; Yu, Kaixiang; Huang, Shuiqing; Li, Fei

    2016-01-04

    The genomes and transcriptomes of hundreds of insects have been sequenced. However, insect community lacks an integrated, up-to-date collection of insect gene data. Here, we introduce the first release of InsectBase, available online at http://www.insect-genome.com. The database encompasses 138 insect genomes, 116 insect transcriptomes, 61 insect gene sets, 36 gene families of 60 insects, 7544 miRNAs of 69 insects, 96,925 piRNAs of Drosophila melanogaster and Chilo suppressalis, 2439 lncRNA of Nilaparvata lugens, 22,536 pathways of 78 insects, 678,881 untranslated regions (UTR) of 84 insects and 160,905 coding sequences (CDS) of 70 insects. This release contains over 12 million sequences and provides search functionality, a BLAST server, GBrowse, insect pathway construction, a Facebook-like network for the insect community (iFacebook), and phylogenetic analysis of selected genes.

  6. InsectBase: a resource for insect genomes and transcriptomes

    PubMed Central

    Yin, Chuanlin; Shen, Gengyu; Guo, Dianhao; Wang, Shuping; Ma, Xingzhou; Xiao, Huamei; Liu, Jinding; Zhang, Zan; Liu, Ying; Zhang, Yiqun; Yu, Kaixiang; Huang, Shuiqing; Li, Fei

    2016-01-01

    The genomes and transcriptomes of hundreds of insects have been sequenced. However, insect community lacks an integrated, up-to-date collection of insect gene data. Here, we introduce the first release of InsectBase, available online at http://www.insect-genome.com. The database encompasses 138 insect genomes, 116 insect transcriptomes, 61 insect gene sets, 36 gene families of 60 insects, 7544 miRNAs of 69 insects, 96 925 piRNAs of Drosophila melanogaster and Chilo suppressalis, 2439 lncRNA of Nilaparvata lugens, 22 536 pathways of 78 insects, 678 881 untranslated regions (UTR) of 84 insects and 160 905 coding sequences (CDS) of 70 insects. This release contains over 12 million sequences and provides search functionality, a BLAST server, GBrowse, insect pathway construction, a Facebook-like network for the insect community (iFacebook), and phylogenetic analysis of selected genes. PMID:26578584

  7. An Italian functional genomic resource for Medicago truncatula

    PubMed Central

    Porceddu, Andrea; Panara, Francesco; Calderini, Ornella; Molinari, Lorna; Taviani, Paola; Lanfaloni, Luisa; Scotti, Carla; Carelli, Maria; Scaramelli, Laura; Bruschi, Gianluca; Cosson, Viviane; Ratet, Pascal; de Larembergue, Henri; Duc, Gerard; Piano, Efisio; Arcioni, Sergio

    2008-01-01

    Background Medicago truncatula is a model species for legumes. Its functional genomics have been considerably boosted in recent years due to initiatives based both in Europe and US. Collections of mutants are becoming increasingly available and this will help unravel the genetic control of important traits for many species of legumes. Findings Our report is on the production of three complementary mutant collections of the model species Medicago truncatula produced in Italy in the frame of a national genomic initiative. Well established strategies were used: Tnt1 mutagenesis, TILLING and activation tagging. Both forward and reverse genetics screenings proved the efficiency of the mutagenesis approaches adopted, enabling the isolation of interesting mutants which are in course of characterization. We anticipate that the reported collections will be complementary to the recently established functional genomics tools developed for Medicago truncatula both in Europe and in the United States. PMID:19077311

  8. CyanoClust: comparative genome resources of cyanobacteria and plastids.

    PubMed

    Sasaki, Naobumi V; Sato, Naoki

    2010-01-01

    Cyanobacteria, which perform oxygen-evolving photosynthesis as do chloroplasts of plants and algae, are one of the best-studied prokaryotic phyla and one from which many representative genomes have been sequenced. Lack of a suitable comparative genomic database has been a problem in cyanobacterial genomics because many proteins involved in physiological functions such as photosynthesis and nitrogen fixation are not catalogued in commonly used databases, such as Clusters of Orthologous Proteins (COG). CyanoClust is a database of homolog groups in cyanobacteria and plastids that are produced by the program Gclust. We have developed a web-server system for the protein homology database featuring cyanobacteria and plastids. Database URL: http://cyanoclust.c.u-tokyo.ac.jp/.

  9. New resources inform study of genome size, content, and organization in nonavian reptiles

    PubMed Central

    Janes, Daniel E.; Organ, Christopher; Valenzuela, Nicole

    2008-01-01

    Genomic resources for studies of nonavian reptiles have recently improved and will reach a new level of access once the genomes of the painted turtle (Chrysemys picta) and the green anole (Anolis carolinensis) have been published. Eleven speakers gathered for a symposium on reptilian genomics and evolutionary genetics at the 2008 meeting of the Society for Integrative and Comparative Biology in San Antonio, Texas. Presentations described results of reptilian genetic studies concerning molecular evolution, chromosomal evolution, genomic architecture, population dynamics, endocrinology and endocrine disruption, and the evolution of developmental mechanisms. The presented studies took advantage of the recent generation of genetic and genomic tools and resources. Novel findings demonstrated the positive impact made by the improved availability of resources like genome annotations and bacterial artificial chromosomes (BACs). The symposium was timely and important because it provided a vehicle for the dissemination of novel findings that advance the field. Moreover, this meeting fostered the synergistic interaction of the participants as a group, which is anticipated to encourage the funding and creation of further resources such as additional BAC libraries and genomic projects. Novel data have already been collected and studies like those presented in this symposium promise to shape and improve our understanding of overall amniote evolution. Additional reptilian taxa such as the American alligator (Alligator mississippiensis), tuatara (Sphenodon punctatus), and garter snake (Thamnophis sirtalis) should be the foci of future genomic projects. We hope that the following articles in this volume will help promote these efforts by describing the conclusions and the potential that the improvement of genomic resources for nonavian reptiles can continue having in this important area of integrative and comparative biology. PMID:21669805

  10. Brassica database (BRAD) version 2.0: integrating and mining Brassicaceae species genomic resources.

    PubMed

    Wang, Xiaobo; Wu, Jian; Liang, Jianli; Cheng, Feng; Wang, Xiaowu

    2015-01-01

    The Brassica database (BRAD) was built initially to assist users apply Brassica rapa and Arabidopsis thaliana genomic data efficiently to their research. However, many Brassicaceae genomes have been sequenced and released after its construction. These genomes are rich resources for comparative genomics, gene annotation and functional evolutionary studies of Brassica crops. Therefore, we have updated BRAD to version 2.0 (V2.0). In BRAD V2.0, 11 more Brassicaceae genomes have been integrated into the database, namely those of Arabidopsis lyrata, Aethionema arabicum, Brassica oleracea, Brassica napus, Camelina sativa, Capsella rubella, Leavenworthia alabamica, Sisymbrium irio and three extremophiles Schrenkiella parvula, Thellungiella halophila and Thellungiella salsuginea. BRAD V2.0 provides plots of syntenic genomic fragments between pairs of Brassicaceae species, from the level of chromosomes to genomic blocks. The Generic Synteny Browser (GBrowse_syn), a module of the Genome Browser (GBrowse), is used to show syntenic relationships between multiple genomes. Search functions for retrieving syntenic and non-syntenic orthologs, as well as their annotation and sequences are also provided. Furthermore, genome and annotation information have been imported into GBrowse so that all functional elements can be visualized in one frame. We plan to continually update BRAD by integrating more Brassicaceae genomes into the database. Database URL: http://brassicadb.org/brad/. © The Author(s) 2015. Published by Oxford University Press.

  11. Brassica database (BRAD) version 2.0: integrating and mining Brassicaceae species genomic resources

    PubMed Central

    Wang, Xiaobo; Wu, Jian; Liang, Jianli; Cheng, Feng; Wang, Xiaowu

    2015-01-01

    The Brassica database (BRAD) was built initially to assist users apply Brassica rapa and Arabidopsis thaliana genomic data efficiently to their research. However, many Brassicaceae genomes have been sequenced and released after its construction. These genomes are rich resources for comparative genomics, gene annotation and functional evolutionary studies of Brassica crops. Therefore, we have updated BRAD to version 2.0 (V2.0). In BRAD V2.0, 11 more Brassicaceae genomes have been integrated into the database, namely those of Arabidopsis lyrata, Aethionema arabicum, Brassica oleracea, Brassica napus, Camelina sativa, Capsella rubella, Leavenworthia alabamica, Sisymbrium irio and three extremophiles Schrenkiella parvula, Thellungiella halophila and Thellungiella salsuginea. BRAD V2.0 provides plots of syntenic genomic fragments between pairs of Brassicaceae species, from the level of chromosomes to genomic blocks. The Generic Synteny Browser (GBrowse_syn), a module of the Genome Browser (GBrowse), is used to show syntenic relationships between multiple genomes. Search functions for retrieving syntenic and non-syntenic orthologs, as well as their annotation and sequences are also provided. Furthermore, genome and annotation information have been imported into GBrowse so that all functional elements can be visualized in one frame. We plan to continually update BRAD by integrating more Brassicaceae genomes into the database. Database URL: http://brassicadb.org/brad/ PMID:26589635

  12. A Populus EST resource for plant functional genomics

    PubMed Central

    Sterky, Fredrik; Bhalerao, Rupali R.; Unneberg, Per; Segerman, Bo; Nilsson, Peter; Brunner, Amy M.; Charbonnel-Campaa, Laurence; Lindvall, Jenny Jonsson; Tandre, Karolina; Strauss, Steven H.; Sundberg, Björn; Gustafsson, Petter; Uhlén, Mathias; Bhalerao, Rishikesh P.; Nilsson, Ove; Sandberg, Göran; Karlsson, Jan; Lundeberg, Joakim; Jansson, Stefan

    2004-01-01

    Trees present a life form of paramount importance for terrestrial ecosystems and human societies because of their ecological structure and physiological function and provision of energy and industrial materials. The genus Populus is the internationally accepted model for molecular tree biology. We have analyzed 102,019 Populus ESTs that clustered into 11,885 clusters and 12,759 singletons. We also provide >4,000 assembled full clone sequences to serve as a basis for the upcoming annotation of the Populus genome sequence. A public web-based EST database (populusdb) provides digital expression profiles for 18 tissues that comprise the majority of differentiated organs. The coding content of Populus and Arabidopsis genomes shows very high similarity, indicating that differences between these annual and perennial angiosperm life forms result primarily from differences in gene regulation. The high similarity between Populus and Arabidopsis will allow studies of Populus to directly benefit from the detailed functional genomic information generated for Arabidopsis, enabling detailed insights into tree development and adaptation. These data will also valuable for functional genomic efforts in Arabidopsis. PMID:15353603

  13. Population genomics of resource exploitation: insights from gene expression profiles of two Daphnia ecotypes fed alternate resources.

    PubMed

    Dudycha, Jeffry L; Brandon, Christopher S; Deitz, Kevin C

    2012-02-01

    Consumer-resource interactions are a central issue in evolutionary and community ecology because they play important roles in selection and population regulation. Most consumers encounter resource variation at multiple scales, and respond through phenotypic plasticity in the short term or evolutionary divergence in the long term. The key traits for these responses may influence resource acquisition, assimilation, and/or allocation. To identify relevant candidate genes, we experimentally assayed genome-wide gene expression in pond and lake Daphnia ecotypes exposed to alternate resource environments. One was a simple, high-quality laboratory diet, Ankistrodesmus falcatus. The other was the complex natural seston from a large lake. In temporary ponds, Daphnia generally experience high-quality, abundant resources, whereas lakes provide low-quality, seasonally shifting resources that are chronically limiting. For both ecotypes, we used replicate clones drawn from a number of separate populations. Fourteen genes were differentially regulated with respect to resources, including genes involved in gut processes, resource allocation, and activities with no obvious connection to resource exploitation. Three genes were differentially regulated in both ecotypes; the others may play a role in ecological divergence. Genes clearly linked to gut processes include two peritrophic matrix proteins, a Niemann-Pick type C2 gene, and a chymotrypsin. A pancreatic lipase, an epoxide hydrolase, a neuroparsin, and an UDP-dependent glucuronyltransferase are potentially involved in resource allocation through effects on energy processing and storage or hormone pathways. We performed quantitative rt-PCR for eight genes in independent samples of three clones of each of the two ecotypes. Though these largely confirmed observed differential regulation, some genes' expression was highly variable among clones. Our results demonstrate the value of matching the level of biological replication in

  14. Population genomics of resource exploitation: insights from gene expression profiles of two Daphnia ecotypes fed alternate resources

    PubMed Central

    Dudycha, Jeffry L; Brandon, Christopher S; Deitz, Kevin C

    2012-01-01

    Consumer–resource interactions are a central issue in evolutionary and community ecology because they play important roles in selection and population regulation. Most consumers encounter resource variation at multiple scales, and respond through phenotypic plasticity in the short term or evolutionary divergence in the long term. The key traits for these responses may influence resource acquisition, assimilation, and/or allocation. To identify relevant candidate genes, we experimentally assayed genome-wide gene expression in pond and lake Daphnia ecotypes exposed to alternate resource environments. One was a simple, high-quality laboratory diet, Ankistrodesmus falcatus. The other was the complex natural seston from a large lake. In temporary ponds, Daphnia generally experience high-quality, abundant resources, whereas lakes provide low-quality, seasonally shifting resources that are chronically limiting. For both ecotypes, we used replicate clones drawn from a number of separate populations. Fourteen genes were differentially regulated with respect to resources, including genes involved in gut processes, resource allocation, and activities with no obvious connection to resource exploitation. Three genes were differentially regulated in both ecotypes; the others may play a role in ecological divergence. Genes clearly linked to gut processes include two peritrophic matrix proteins, a Niemann–Pick type C2 gene, and a chymotrypsin. A pancreatic lipase, an epoxide hydrolase, a neuroparsin, and an UDP-dependent glucuronyltransferase are potentially involved in resource allocation through effects on energy processing and storage or hormone pathways. We performed quantitative rt-PCR for eight genes in independent samples of three clones of each of the two ecotypes. Though these largely confirmed observed differential regulation, some genes’ expression was highly variable among clones. Our results demonstrate the value of matching the level of biological replication

  15. The Integrated Microbial Genomes (IMG) System: An Expanding Comparative Analysis Resource

    SciTech Connect

    Markowitz, Victor M.; Chen, I-Min A.; Palaniappan, Krishna; Chu, Ken; Szeto, Ernest; Grechkin, Yuri; Ratner, Anna; Anderson, Iain; Lykidis, Athanasios; Mavromatis, Konstantinos; Ivanova, Natalia N.; Kyrpides, Nikos C.

    2009-09-13

    The integrated microbial genomes (IMG) system serves as a community resource for comparative analysis of publicly available genomes in a comprehensive integrated context. IMG contains both draft and complete microbial genomes integrated with other publicly available genomes from all three domains of life, together with a large number of plasmids and viruses. IMG provides tools and viewers for analyzing and reviewing the annotations of genes and genomes in a comparative context. Since its first release in 2005, IMG's data content and analytical capabilities have been constantly expanded through regular releases. Several companion IMG systems have been set up in order to serve domain specific needs, such as expert review of genome annotations. IMG is available at .

  16. Adventures in the enormous: a 1.8 million clone BAC library for the 21.7 Gb genome of loblolly pine

    Treesearch

    Zenaida V. Magbanua; Seval Ozkan; Benjamin D. Bartlett; Philippe Chouvarine; Christopher A. Saski; Aaron Liston; Richard C. Cronn; C. Dana Nelson; Daniel G. Peterson

    2011-01-01

    Loblolly pine (LP; Pinus taeda L.) is the most economically important tree in the U.S. and a cornerstone species in southeastern forests. However, genomics research on LP and other conifers has lagged behind studies on flowering plants due, in part, to the large size of conifer genomes. As a means to accelerate conifer genome research, we...

  17. The Pinus taeda genome is characterized by diverse and highly diverged repetitive sequences

    PubMed Central

    2010-01-01

    Background In today's age of genomic discovery, no attempt has been made to comprehensively sequence a gymnosperm genome. The largest genus in the coniferous family Pinaceae is Pinus, whose 110-120 species have extremely large genomes (c. 20-40 Gb, 2N = 24). The size and complexity of these genomes have prompted much speculation as to the feasibility of completing a conifer genome sequence. Conifer genomes are reputed to be highly repetitive, but there is little information available on the nature and identity of repetitive units in gymnosperms. The pines have extensive genetic resources, with approximately 329000 ESTs from eleven species and genetic maps in eight species, including a dense genetic map of the twelve linkage groups in Pinus taeda. Results We present here the Sanger sequence and annotation of ten P. taeda BAC clones and Genome Analyzer II whole genome shotgun (WGS) sequences representing 7.5% of the genome. Computational annotation of ten BACs predicts three putative protein-coding genes and at least fifteen likely pseudogenes in nearly one megabase of sequence. We found three conifer-specific LTR retroelements in the BACs, and tentatively identified at least 15 others based on evidence from the distantly related angiosperms. Alignment of WGS sequences to the BACs indicates that 80% of BAC sequences have similar copies (≥ 75% nucleotide identity) elsewhere in the genome, but only 23% have identical copies (99% identity). The three most common repetitive elements in the genome were identified and, when combined, represent less than 5% of the genome. Conclusions This study indicates that the majority of repeats in the P. taeda genome are 'novel' and will therefore require additional BAC or genomic sequencing for accurate characterization. The pine genome contains a very large number of diverged and probably defunct repetitive elements. This study also provides new evidence that sequencing a pine genome using a WGS approach is a feasible goal. PMID

  18. The hydraulic architecture of conifers

    Treesearch

    Uwe G Hacke; Barbara Lachenbruch; Jarmila Pittermann; Stefan Mayr; Jean-Christophe Domec; Paul J. Schulte

    2015-01-01

    Conifers survive in diverse and sometimes extreme environments (Fig. 2.1a–f). Piñon-juniper communities are found in semi-arid environments, receiving ca. 400 mm of yearly precipitation (Linton et al. 1998), which is less than half the average precipitations received by other coniferous tree species worldwide. Picea mariana and Larix laricina grow in boreal peatlands...

  19. Genetic Resources, Genome Mapping and Evolutionary Genomics of the Pig (Sus scrofa)

    PubMed Central

    Chen, Kefei; Baxter, Tara; Muir, William M.; Groenen, Martien A.; Schook, Lawrence B.

    2007-01-01

    The pig, a representative of the artiodactyla clade, is one of the first animals domesticated, and has become an important agriculture animal as one of the major human nutritional sources of animal based protein. The pig is also a valuable biomedical model organism for human health. The pig's importance to human health and nutrition is reflected in the decision to sequence its genome (3X). As an animal species with its wild ancestors present in the world, the pig provides a unique opportunity for tracing mammalian evolutionary history and defining signatures of selection resulting from both domestication and natural selection. Completion of the pig genome sequencing project will have significant impacts on both agriculture and human health. Following the pig whole genome sequence drafts, along with large-scale polymorphism data, it will be possible to conduct genome sweeps using association mapping, and identify signatures of selection. Here, we provide a description of the pig genome sequencing project and perspectives on utilizing genomic technologies to exploit pig genome evolution and the molecular basis for phenotypic traits for improving pig production and health. PMID:17384734

  20. e-Fungi: a data resource for comparative analysis of fungal genomes

    PubMed Central

    Hedeler, Cornelia; Wong, Han Min; Cornell, Michael J; Alam, Intikhab; Soanes, Darren M; Rattray, Magnus; Hubbard, Simon J; Talbot, Nicholas J; Oliver, Stephen G; Paton, Norman W

    2007-01-01

    Background The number of sequenced fungal genomes is ever increasing, with about 200 genomes already fully sequenced or in progress. Only a small percentage of those genomes have been comprehensively studied, for example using techniques from functional genomics. Comparative analysis has proven to be a useful strategy for enhancing our understanding of evolutionary biology and of the less well understood genomes. However, the data required for these analyses tends to be distributed in various heterogeneous data sources, making systematic comparative studies a cumbersome task. Furthermore, comparative analyses benefit from close integration of derived data sets that cluster genes or organisms in a way that eases the expression of requests that clarify points of similarity or difference between species. Description To support systematic comparative analyses of fungal genomes we have developed the e-Fungi database, which integrates a variety of data for more than 30 fungal genomes. Publicly available genome data, functional annotations, and pathway information has been integrated into a single data repository and complemented with results of comparative analyses, such as MCL and OrthoMCL cluster analysis, and predictions of signaling proteins and the sub-cellular localisation of proteins. To access the data, a library of analysis tasks is available through a web interface. The analysis tasks are motivated by recent comparative genomics studies, and aim to support the study of evolutionary biology as well as community efforts for improving the annotation of genomes. Web services for each query are also available, enabling the tasks to be incorporated into workflows. Conclusion The e-Fungi database provides fungal biologists with a resource for comparative studies of a large range of fungal genomes. Its analysis library supports the comparative study of genome data, functional annotation, and results of large scale analyses over all the genomes stored in the database

  1. Phenotype-Genotype Integrator (PheGenI): synthesizing genome-wide association study (GWAS) data with existing genomic resources.

    PubMed

    Ramos, Erin M; Hoffman, Douglas; Junkins, Heather A; Maglott, Donna; Phan, Lon; Sherry, Stephen T; Feolo, Mike; Hindorff, Lucia A

    2014-01-01

    Rapidly accumulating data from genome-wide association studies (GWASs) and other large-scale studies are most useful when synthesized with existing databases. To address this opportunity, we developed the Phenotype-Genotype Integrator (PheGenI), a user-friendly web interface that integrates various National Center for Biotechnology Information (NCBI) genomic databases with association data from the National Human Genome Research Institute GWAS Catalog and supports downloads of search results. Here, we describe the rationale for and development of this resource. Integrating over 66,000 association records with extensive single nucleotide polymorphism (SNP), gene, and expression quantitative trait loci data already available from the NCBI, PheGenI enables deeper investigation and interrogation of SNPs associated with a wide range of traits, facilitating the examination of the relationships between genetic variation and human diseases.

  2. De Novo Transcriptome Assembly and Characterization for the Widespread and Stress-Tolerant Conifer Platycladus orientalis.

    PubMed

    Hu, Xian-Ge; Liu, Hui; Jin, YuQing; Sun, Yan-Qiang; Li, Yue; Zhao, Wei; El-Kassaby, Yousry A; Wang, Xiao-Ru; Mao, Jian-Feng

    2016-01-01

    Platycladus orientalis, of the family Cupressaceae, is a widespread conifer throughout China and is extensively used for ecological reforestation, horticulture, and in medicine. Transcriptome assemblies are required for this ecologically important conifer for understanding genes underpinning adaptation and complex traits for breeding programs. To enrich the species' genomic resources, a de novo transcriptome sequencing was performed using Illumina paired-end sequencing. In total, 104,073,506 high quality sequence reads (approximately 10.3 Gbp) were obtained, which were assembled into 228,948 transcripts and 148,867 unigenes that were longer than 200 nt. Quality assessment using CEGMA showed that the transcriptomes obtained were mostly complete for highly conserved core eukaryotic genes. Based on similarity searches with known proteins, 62,938 (42.28% of all unigenes), 42,158 (28.32%), and 23,179 (15.57%) had homologs in the Nr, GO, and KOG databases, 25,625 (17.21%) unigenes were mapped to 322 pathways by BLASTX comparison against the KEGG database and 1,941 unigenes involved in environmental signaling and stress response were identified. We also identified 43 putative terpene synthase (TPS) functional genes loci and compared them with TPSs from other species. Additionally, 5,296 simple sequence repeats (SSRs) were identified in 4,715 unigenes, which were assigned to 142 motif types. This is the first report of a complete transcriptome analysis of P. orientalis. These resources provide a foundation for further studies of adaptation mechanisms and molecular-based breeding programs.

  3. Ensembl core software resources: storage and programmatic access for DNA sequence and genome annotation

    PubMed Central

    Kähäri, Andreas; Komorowska, Monika; Keenan, Stephen; Laird, Matthew; Longden, Ian; Proctor, Glenn; Searle, Steve; Staines, Daniel; Taylor, Kieron; Vullo, Alessandro; Yates, Andrew; Zerbino, Daniel

    2017-01-01

    Abstract The Ensembl software resources are a stable infrastructure to store, access and manipulate genome assemblies and their functional annotations. The Ensembl ‘Core’ database and Application Programming Interface (API) was our first major piece of software infrastructure and remains at the centre of all of our genome resources. Since its initial design more than fifteen years ago, the number of publicly available genomic, transcriptomic and proteomic datasets has grown enormously, accelerated by continuous advances in DNA-sequencing technology. Initially intended to provide annotation for the reference human genome, we have extended our framework to support the genomes of all species as well as richer assembly models. Cross-referenced links to other informatics resources facilitate searching our database with a variety of popular identifiers such as UniProt and RefSeq. Our comprehensive and robust framework storing a large diversity of genome annotations in one location serves as a platform for other groups to generate and maintain their own tailored annotation. We welcome reuse and contributions: our databases and APIs are publicly available, all of our source code is released with a permissive Apache v2.0 licence at http://github.com/Ensembl and we have an active developer mailing list (http://www.ensembl.org/info/about/contact/index.html). Database URL: http://www.ensembl.org PMID:28365736

  4. MBGD update 2010: toward a comprehensive resource for exploring microbial genome diversity.

    PubMed

    Uchiyama, Ikuo; Higuchi, Toshio; Kawai, Mikihiko

    2010-01-01

    The microbial genome database (MBGD) for comparative analysis is a platform for microbial comparative genomics based on automated ortholog group identification. A prominent feature of MBGD is that it allows users to create ortholog groups using a specified subgroup of organisms. The database is constantly updated and now contains almost 1000 genomes. To utilize the MBGD database as a comprehensive resource for investigating microbial genome diversity, we have developed the following advanced functionalities: (i) enhanced assignment of functional annotation, including external database links to each orthologous group, (ii) interface for choosing a set of genomes to compare based on phenotypic properties, (iii) the addition of more eukaryotic microbial genomes (fungi and protists) and some higher eukaryotes as references and (iv) enhancement of the MyMBGD mode, which allows users to add their own genomes to MBGD and now accepts raw genomic sequences without any annotation (in such a case, it runs a gene-finding procedure before identifying the orthologs). Some analysis functions, such as the function to find orthologs with similar phylogenetic patterns, have also been improved. MBGD is accessible at http://mbgd.genome.ad.jp/.

  5. CoryneBase: Corynebacterium Genomic Resources and Analysis Tools at Your Fingertips

    PubMed Central

    Tan, Mui Fern; Jakubovics, Nick S.; Wee, Wei Yee; Mutha, Naresh V. R.; Wong, Guat Jah; Ang, Mia Yang; Yazdi, Amir Hessam; Choo, Siew Woh

    2014-01-01

    Corynebacteria are used for a wide variety of industrial purposes but some species are associated with human diseases. With increasing number of corynebacterial genomes having been sequenced, comparative analysis of these strains may provide better understanding of their biology, phylogeny, virulence and taxonomy that may lead to the discoveries of beneficial industrial strains or contribute to better management of diseases. To facilitate the ongoing research of corynebacteria, a specialized central repository and analysis platform for the corynebacterial research community is needed to host the fast-growing amount of genomic data and facilitate the analysis of these data. Here we present CoryneBase, a genomic database for Corynebacterium with diverse functionality for the analysis of genomes aimed to provide: (1) annotated genome sequences of Corynebacterium where 165,918 coding sequences and 4,180 RNAs can be found in 27 species; (2) access to comprehensive Corynebacterium data through the use of advanced web technologies for interactive web interfaces; and (3) advanced bioinformatic analysis tools consisting of standard BLAST for homology search, VFDB BLAST for sequence homology search against the Virulence Factor Database (VFDB), Pairwise Genome Comparison (PGC) tool for comparative genomic analysis, and a newly designed Pathogenomics Profiling Tool (PathoProT) for comparative pathogenomic analysis. CoryneBase offers the access of a range of Corynebacterium genomic resources as well as analysis tools for comparative genomics and pathogenomics. It is publicly available at http://corynebacterium.um.edu.my/. PMID:24466021

  6. The Global Invertebrate Genomics Alliance (GIGA): Developing Community Resources to Study Diverse Invertebrate Genomes

    PubMed Central

    2014-01-01

    Over 95% of all metazoan (animal) species comprise the “invertebrates,” but very few genomes from these organisms have been sequenced. We have, therefore, formed a “Global Invertebrate Genomics Alliance” (GIGA). Our intent is to build a collaborative network of diverse scientists to tackle major challenges (e.g., species selection, sample collection and storage, sequence assembly, annotation, analytical tools) associated with genome/transcriptome sequencing across a large taxonomic spectrum. We aim to promote standards that will facilitate comparative approaches to invertebrate genomics and collaborations across the international scientific community. Candidate study taxa include species from Porifera, Ctenophora, Cnidaria, Placozoa, Mollusca, Arthropoda, Echinodermata, Annelida, Bryozoa, and Platyhelminthes, among others. GIGA will target 7000 noninsect/nonnematode species, with an emphasis on marine taxa because of the unrivaled phyletic diversity in the oceans. Priorities for selecting invertebrates for sequencing will include, but are not restricted to, their phylogenetic placement; relevance to organismal, ecological, and conservation research; and their importance to fisheries and human health. We highlight benefits of sequencing both whole genomes (DNA) and transcriptomes and also suggest policies for genomic-level data access and sharing based on transparency and inclusiveness. The GIGA Web site (http://giga.nova.edu) has been launched to facilitate this collaborative venture. PMID:24336862

  7. The Global Invertebrate Genomics Alliance (GIGA): developing community resources to study diverse invertebrate genomes.

    PubMed

    Bracken-Grissom, Heather; Collins, Allen G; Collins, Timothy; Crandall, Keith; Distel, Daniel; Dunn, Casey; Giribet, Gonzalo; Haddock, Steven; Knowlton, Nancy; Martindale, Mark; Medina, Mónica; Messing, Charles; O'Brien, Stephen J; Paulay, Gustav; Putnam, Nicolas; Ravasi, Timothy; Rouse, Greg W; Ryan, Joseph F; Schulze, Anja; Wörheide, Gert; Adamska, Maja; Bailly, Xavier; Breinholt, Jesse; Browne, William E; Diaz, M Christina; Evans, Nathaniel; Flot, Jean-François; Fogarty, Nicole; Johnston, Matthew; Kamel, Bishoy; Kawahara, Akito Y; Laberge, Tammy; Lavrov, Dennis; Michonneau, François; Moroz, Leonid L; Oakley, Todd; Osborne, Karen; Pomponi, Shirley A; Rhodes, Adelaide; Santos, Scott R; Satoh, Nori; Thacker, Robert W; Van de Peer, Yves; Voolstra, Christian R; Welch, David Mark; Winston, Judith; Zhou, Xin

    2014-01-01

    Over 95% of all metazoan (animal) species comprise the "invertebrates," but very few genomes from these organisms have been sequenced. We have, therefore, formed a "Global Invertebrate Genomics Alliance" (GIGA). Our intent is to build a collaborative network of diverse scientists to tackle major challenges (e.g., species selection, sample collection and storage, sequence assembly, annotation, analytical tools) associated with genome/transcriptome sequencing across a large taxonomic spectrum. We aim to promote standards that will facilitate comparative approaches to invertebrate genomics and collaborations across the international scientific community. Candidate study taxa include species from Porifera, Ctenophora, Cnidaria, Placozoa, Mollusca, Arthropoda, Echinodermata, Annelida, Bryozoa, and Platyhelminthes, among others. GIGA will target 7000 noninsect/nonnematode species, with an emphasis on marine taxa because of the unrivaled phyletic diversity in the oceans. Priorities for selecting invertebrates for sequencing will include, but are not restricted to, their phylogenetic placement; relevance to organismal, ecological, and conservation research; and their importance to fisheries and human health. We highlight benefits of sequencing both whole genomes (DNA) and transcriptomes and also suggest policies for genomic-level data access and sharing based on transparency and inclusiveness. The GIGA Web site (http://giga.nova.edu) has been launched to facilitate this collaborative venture.

  8. The draft genome of the pest tephritid fruit fly Bactrocera tryoni: resources for the genomic analysis of hybridising species.

    PubMed

    Gilchrist, Anthony Stuart; Shearman, Deborah C A; Frommer, Marianne; Raphael, Kathryn A; Deshpande, Nandan P; Wilkins, Marc R; Sherwin, William B; Sved, John A

    2014-12-20

    The tephritid fruit flies include a number of economically important pests of horticulture, with a large accumulated body of research on their biology and control. Amongst the Tephritidae, the genus Bactrocera, containing over 400 species, presents various species groups of potential utility for genetic studies of speciation, behaviour or pest control. In Australia, there exists a triad of closely-related, sympatric Bactrocera species which do not mate in the wild but which, despite distinct morphologies and behaviours, can be force-mated in the laboratory to produce fertile hybrid offspring. To exploit the opportunities offered by genomics, such as the efficient identification of genetic loci central to pest behaviour and to the earliest stages of speciation, investigators require genomic resources for future investigations. We produced a draft de novo genome assembly of Australia's major tephritid pest species, Bactrocera tryoni. The male genome (650-700 Mbp) includes approximately 150 Mb of interspersed repetitive DNA sequences and 60 Mb of satellite DNA. Assessment using conserved core eukaryotic sequences indicated 98% completeness. Over 16,000 MAKER-derived gene models showed a large degree of overlap with other Dipteran reference genomes. The sequence of the ribosomal RNA transcribed unit was also determined. Unscaffolded assemblies of B. neohumeralis and B. jarvisi were then produced; comparison with B. tryoni showed that the species are more closely related than any Drosophila species pair. The similarity of the genomes was exploited to identify 4924 potentially diagnostic indels between the species, all of which occur in non-coding regions. This first draft B. tryoni genome resembles other dipteran genomes in terms of size and putative coding sequences. For all three species included in this study, we have identified a comprehensive set of non-redundant repetitive sequences, including the ribosomal RNA unit, and have quantified the major satellite DNA

  9. Common-pool resources and population genomics in Iceland, Estonia, and Tonga.

    PubMed

    Barker, Jeffrey H

    2003-01-01

    This paper addresses the application of the ethical concept of trust and the legal and political concept of public trust to population genomics projects in Iceland, Estonia, and Tonga. Focusing on trust and public trust, the paper explores analogies between the genomics projects and the treatment of other common-pool resources, making use of the notion of trust as an ethical demand, derived from the works of Emmanuel Levinas and Knud Eljer Løgstrup. The paper discusses the degree to which the ethical demands for trust and public trust have been established and maintained in the three national population genomics projects.

  10. The National Microbial Pathogen Database Resource (NMPDR): a genomics platform based on subsystem annotation

    PubMed Central

    McNeil, Leslie Klis; Reich, Claudia; Aziz, Ramy K.; Bartels, Daniela; Cohoon, Matthew; Disz, Terry; Edwards, Robert A.; Gerdes, Svetlana; Hwang, Kaitlyn; Kubal, Michael; Margaryan, Gohar Rem; Meyer, Folker; Mihalo, William; Olsen, Gary J.; Olson, Robert; Osterman, Andrei; Paarmann, Daniel; Paczian, Tobias; Parrello, Bruce; Pusch, Gordon D.; Rodionov, Dmitry A.; Shi, Xinghua; Vassieva, Olga; Vonstein, Veronika; Zagnitko, Olga; Xia, Fangfang; Zinner, Jenifer; Overbeek, Ross; Stevens, Rick

    2007-01-01

    The National Microbial Pathogen Data Resource (NMPDR) () is a National Institute of Allergy and Infections Disease (NIAID)-funded Bioinformatics Resource Center that supports research in selected Category B pathogens. NMPDR contains the complete genomes of ∼50 strains of pathogenic bacteria that are the focus of our curators, as well as >400 other genomes that provide a broad context for comparative analysis across the three phylogenetic Domains. NMPDR integrates complete, public genomes with expertly curated biological subsystems to provide the most consistent genome annotations. Subsystems are sets of functional roles related by a biologically meaningful organizing principle, which are built over large collections of genomes; they provide researchers with consistent functional assignments in a biologically structured context. Investigators can browse subsystems and reactions to develop accurate reconstructions of the metabolic networks of any sequenced organism. NMPDR provides a comprehensive bioinformatics platform, with tools and viewers for genome analysis. Results of precomputed gene clustering analyses can be retrieved in tabular or graphic format with one-click tools. NMPDR tools include Signature Genes, which finds the set of genes in common or that differentiates two groups of organisms. Essentiality data collated from genome-wide studies have been curated. Drug target identification and high-throughput, in silico, compound screening are in development. PMID:17145713

  11. Draft Genome Sequence of Mentha longifolia and Development of Resources for Mint Cultivar Improvement.

    PubMed

    Vining, Kelly J; Johnson, Sean R; Ahkami, Amirhossein; Lange, Iris; Parrish, Amber N; Trapp, Susan C; Croteau, Rodney B; Straub, Shannon C K; Pandelova, Iovanna; Lange, B Markus

    2017-02-13

    The genus Mentha encompasses mint species cultivated for their essential oils, which are formulated into a vast array of consumer products. Desirable oil characteristics and resistance to the fungal disease Verticillium wilt are top priorities for the mint industry. However, cultivated mints have complex polyploid genomes and are sterile. Breeding efforts, therefore, require the development of genomic resources for fertile mint species. Here, we present draft de novo genome and plastome assemblies for a wilt-resistant South African accession of Mentha longifolia (L.) Huds., a diploid species ancestral to cultivated peppermint and spearmint. The 353 Mb genome contains 35 597 predicted protein-coding genes, including 292 disease resistance gene homologs, and nine genes determining essential oil characteristics. A genetic linkage map ordered 1397 genome scaffolds on 12 pseudochromosomes. More than two million simple sequence repeats were identified, which will facilitate molecular marker development. The M. longifolia genome is a valuable resource for both metabolic engineering and molecular breeding. This is exemplified by employing the genome sequence to clone and functionally characterize the promoters in a peppermint cultivar, and demonstrating the utility of a glandular trichome-specific promoter to increase expression of a biosynthetic gene, thereby modulating essential oil composition. Copyright © 2017 The Author. Published by Elsevier Inc. All rights reserved.

  12. A Protocol for Generating and Exchanging (Genome-Scale) Metabolic Resource Allocation Models.

    PubMed

    Reimers, Alexandra-M; Lindhorst, Henning; Waldherr, Steffen

    2017-09-06

    In this article, we present a protocol for generating a complete (genome-scale) metabolic resource allocation model, as well as a proposal for how to represent such models in the systems biology markup language (SBML). Such models are used to investigate enzyme levels and achievable growth rates in large-scale metabolic networks. Although the idea of metabolic resource allocation studies has been present in the field of systems biology for some years, no guidelines for generating such a model have been published up to now. This paper presents step-by-step instructions for building a (dynamic) resource allocation model, starting with prerequisites such as a genome-scale metabolic reconstruction, through building protein and noncatalytic biomass synthesis reactions and assigning turnover rates for each reaction. In addition, we explain how one can use SBML level 3 in combination with the flux balance constraints and our resource allocation modeling annotation to represent such models.

  13. Apollo: a community resource for genome annotation editing

    PubMed Central

    Ed, Lee; Nomi, Harris; Mark, Gibson; Raymond, Chetty; Suzanna, Lewis

    2009-01-01

    Summary: Apollo is a genome annotation-editing tool with an easy to use graphical interface. It is a component of the GMOD project, with ongoing development driven by the community. Recent additions to the software include support for the generic feature format version 3 (GFF3), continuous transcriptome data, a full Chado database interface, integration with remote services for on-the-fly BLAST and Primer BLAST analyses, graphical interfaces for configuring user preferences and full undo of all edit operations. Apollo's user community continues to grow, including its use as an educational tool for college and high-school students. Availability: Apollo is a Java application distributed under a free and open source license. Installers for Windows, Linux, Unix, Solaris and Mac OS X are available at http://apollo.berkeleybop.org, and the source code is available from the SourceForge CVS repository at http://gmod.cvs.sourceforge.net/gmod/apollo. Contact: elee@berkeleybop.org PMID:19439563

  14. A structurally based analytic model of growth and biomass dynamics in single species stands of conifers

    Treesearch

    Robin J. Tausch

    2015-01-01

    A theoretically based analytic model of plant growth in single species conifer communities based on the species fully occupying a site and fully using the site resources is introduced. Model derivations result in a single equation simultaneously describes changes over both, different site conditions (or resources available), and over time for each variable for each...

  15. Prospects and challenges for the conservation of farm animal genomic resources, 2015-2025.

    PubMed

    Bruford, Michael W; Ginja, Catarina; Hoffmann, Irene; Joost, Stéphane; Orozco-terWengel, Pablo; Alberto, Florian J; Amaral, Andreia J; Barbato, Mario; Biscarini, Filippo; Colli, Licia; Costa, Mafalda; Curik, Ino; Duruz, Solange; Ferenčaković, Maja; Fischer, Daniel; Fitak, Robert; Groeneveld, Linn F; Hall, Stephen J G; Hanotte, Olivier; Hassan, Faiz-Ul; Helsen, Philippe; Iacolina, Laura; Kantanen, Juha; Leempoel, Kevin; Lenstra, Johannes A; Ajmone-Marsan, Paolo; Masembe, Charles; Megens, Hendrik-Jan; Miele, Mara; Neuditschko, Markus; Nicolazzi, Ezequiel L; Pompanon, François; Roosen, Jutta; Sevane, Natalia; Smetko, Anamarija; Štambuk, Anamaria; Streeter, Ian; Stucki, Sylvie; Supakorn, China; Telo Da Gama, Luis; Tixier-Boichard, Michèle; Wegmann, Daniel; Zhan, Xiangjiang

    2015-01-01

    Livestock conservation practice is changing rapidly in light of policy developments, climate change and diversifying market demands. The last decade has seen a step change in technology and analytical approaches available to define, manage and conserve Farm Animal Genomic Resources (FAnGR). However, these rapid changes pose challenges for FAnGR conservation in terms of technological continuity, analytical capacity and integrative methodologies needed to fully exploit new, multidimensional data. The final conference of the ESF Genomic Resources program aimed to address these interdisciplinary problems in an attempt to contribute to the agenda for research and policy development directions during the coming decade. By 2020, according to the Convention on Biodiversity's Aichi Target 13, signatories should ensure that "…the genetic diversity of …farmed and domesticated animals and of wild relatives …is maintained, and strategies have been developed and implemented for minimizing genetic erosion and safeguarding their genetic diversity." However, the real extent of genetic erosion is very difficult to measure using current data. Therefore, this challenging target demands better coverage, understanding and utilization of genomic and environmental data, the development of optimized ways to integrate these data with social and other sciences and policy analysis to enable more flexible, evidence-based models to underpin FAnGR conservation. At the conference, we attempted to identify the most important problems for effective livestock genomic resource conservation during the next decade. Twenty priority questions were identified that could be broadly categorized into challenges related to methodology, analytical approaches, data management and conservation. It should be acknowledged here that while the focus of our meeting was predominantly around genetics, genomics and animal science, many of the practical challenges facing conservation of genomic resources are

  16. Prospects and challenges for the conservation of farm animal genomic resources, 2015-2025

    PubMed Central

    Bruford, Michael W.; Ginja, Catarina; Hoffmann, Irene; Joost, Stéphane; Orozco-terWengel, Pablo; Alberto, Florian J.; Amaral, Andreia J.; Barbato, Mario; Biscarini, Filippo; Colli, Licia; Costa, Mafalda; Curik, Ino; Duruz, Solange; Ferenčaković, Maja; Fischer, Daniel; Fitak, Robert; Groeneveld, Linn F.; Hall, Stephen J. G.; Hanotte, Olivier; Hassan, Faiz-ul; Helsen, Philippe; Iacolina, Laura; Kantanen, Juha; Leempoel, Kevin; Lenstra, Johannes A.; Ajmone-Marsan, Paolo; Masembe, Charles; Megens, Hendrik-Jan; Miele, Mara; Neuditschko, Markus; Nicolazzi, Ezequiel L.; Pompanon, François; Roosen, Jutta; Sevane, Natalia; Smetko, Anamarija; Štambuk, Anamaria; Streeter, Ian; Stucki, Sylvie; Supakorn, China; Telo Da Gama, Luis; Tixier-Boichard, Michèle; Wegmann, Daniel; Zhan, Xiangjiang

    2015-01-01

    Livestock conservation practice is changing rapidly in light of policy developments, climate change and diversifying market demands. The last decade has seen a step change in technology and analytical approaches available to define, manage and conserve Farm Animal Genomic Resources (FAnGR). However, these rapid changes pose challenges for FAnGR conservation in terms of technological continuity, analytical capacity and integrative methodologies needed to fully exploit new, multidimensional data. The final conference of the ESF Genomic Resources program aimed to address these interdisciplinary problems in an attempt to contribute to the agenda for research and policy development directions during the coming decade. By 2020, according to the Convention on Biodiversity's Aichi Target 13, signatories should ensure that “…the genetic diversity of …farmed and domesticated animals and of wild relatives …is maintained, and strategies have been developed and implemented for minimizing genetic erosion and safeguarding their genetic diversity.” However, the real extent of genetic erosion is very difficult to measure using current data. Therefore, this challenging target demands better coverage, understanding and utilization of genomic and environmental data, the development of optimized ways to integrate these data with social and other sciences and policy analysis to enable more flexible, evidence-based models to underpin FAnGR conservation. At the conference, we attempted to identify the most important problems for effective livestock genomic resource conservation during the next decade. Twenty priority questions were identified that could be broadly categorized into challenges related to methodology, analytical approaches, data management and conservation. It should be acknowledged here that while the focus of our meeting was predominantly around genetics, genomics and animal science, many of the practical challenges facing conservation of genomic resources are

  17. A whole-genome, radiation hybrid mapping resource of hexaploid wheat.

    PubMed

    Tiwari, Vijay K; Heesacker, Adam; Riera-Lizarazu, Oscar; Gunn, Hilary; Wang, Shichen; Wang, Yi; Gu, Young Q; Paux, Etienne; Koo, Dal-Hoe; Kumar, Ajay; Luo, Ming-Cheng; Lazo, Gerard; Zemetra, Robert; Akhunov, Eduard; Friebe, Bernd; Poland, Jesse; Gill, Bikram S; Kianian, Shahryar; Leonard, Jeffrey M

    2016-04-01

    Generating a contiguous, ordered reference sequence of a complex genome such as hexaploid wheat (2n = 6x = 42; approximately 17 GB) is a challenging task due to its large, highly repetitive, and allopolyploid genome. In wheat, ordering of whole-genome or hierarchical shotgun sequencing contigs is primarily based on recombination and comparative genomics-based approaches. However, comparative genomics approaches are limited to syntenic inference and recombination is suppressed within the pericentromeric regions of wheat chromosomes, thus, precise ordering of physical maps and sequenced contigs across the whole-genome using these approaches is nearly impossible. We developed a whole-genome radiation hybrid (WGRH) resource and tested it by genotyping a set of 115 randomly selected lines on a high-density single nucleotide polymorphism (SNP) array. At the whole-genome level, 26 299 SNP markers were mapped on the RH panel and provided an average mapping resolution of approximately 248 Kb/cR1500 with a total map length of 6866 cR1500 . The 7296 unique mapping bins provided a five- to eight-fold higher resolution than genetic maps used in similar studies. Most strikingly, the RH map had uniform bin resolution across the entire chromosome(s), including pericentromeric regions. Our research provides a valuable and low-cost resource for anchoring and ordering sequenced BAC and next generation sequencing (NGS) contigs. The WGRH developed for reference wheat line Chinese Spring (CS-WGRH), will be useful for anchoring and ordering sequenced BAC and NGS based contigs for assembling a high-quality, reference sequence of hexaploid wheat. Additionally, this study provides an excellent model for developing similar resources for other polyploid species. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.

  18. An improved protocol for intact chloroplasts and cpDNA isolation in conifers.

    PubMed

    Vieira, Leila do Nascimento; Faoro, Helisson; Fraga, Hugo Pacheco de Freitas; Rogalski, Marcelo; de Souza, Emanuel Maltempi; de Oliveira Pedrosa, Fábio; Nodari, Rubens Onofre; Guerra, Miguel Pedro

    2014-01-01

    Performing chloroplast DNA (cpDNA) isolation is considered a major challenge among different plant groups, especially conifers. Isolating chloroplasts in conifers by such conventional methods as sucrose gradient and high salt has not been successful. So far, plastid genome sequencing protocols for conifer species have been based mainly on long-range PCR, which is known to be time-consuming and difficult to implement. We developed a protocol for cpDNA isolation using three different conifer families: Araucaria angustifolia and Araucaria bidwilli (Araucariaceae), Podocarpus lambertii (Podocarpaceae) and Pinus patula (Pinaceae). The present protocol is based on high salt isolation buffer followed by saline Percoll gradient. Combining these two strategies allowed enhanced chloroplast isolation, along with decreased contamination caused by polysaccharides, polyphenols, proteins, and nuclear DNA in cpDNA. Microscopy images confirmed the presence of intact chloroplasts in high abundance. This method was applied to cpDNA isolation and subsequent sequencing by Illumina MiSeq (2×250 bp), using only 50 ng of cpDNA. Reference-guided chloroplast genome mapping showed that high average coverage was achieved for all evaluated species: 24.63 for A. angustifolia, 135.97 for A. bidwilli, 1196.10 for P. lambertii, and 64.68 for P. patula. Results show that this improved protocol is suitable for enhanced quality and yield of chloroplasts and cpDNA isolation from conifers, providing a useful tool for studies that require isolated chloroplasts and/or whole cpDNA sequences.

  19. Development of a D genome specific marker resource for diploid and hexaploid wheat.

    PubMed

    Wang, Yi; Drader, Thomas; Tiwari, Vijay K; Dong, Lingli; Kumar, Ajay; Huo, Naxin; Ghavami, Farhad; Iqbal, M Javed; Lazo, Gerard R; Leonard, Jeff; Gill, Bikram S; Kianian, Shahryar F; Luo, Ming-Cheng; Gu, Yong Q

    2015-08-28

    Mapping and map-based cloning of genes that control agriculturally and economically important traits remain great challenges for plants with complex highly repetitive genomes such as those within the grass tribe, Triticeae. Mapping limitations in the Triticeae are primarily due to low frequencies of polymorphic gene markers and poor genetic recombination in certain genetic regions. Although the abundance of repetitive sequence may pose common problems in genome analysis and sequence assembly of large and complex genomes, they provide repeat junction markers with random and unbiased distribution throughout chromosomes. Hence, development of a high-throughput mapping technology that combine both gene-based and repeat junction-based markers is needed to generate maps that have better coverage of the entire genome. In this study, the available genomics resource of the diploid Aegilop tauschii, the D genome donor of bread wheat, were used to develop genome specific markers that can be applied for mapping in modern hexaploid wheat. A NimbleGen array containing both gene-based and repeat junction probe sequences derived from Ae. tauschii was developed and used to map the Chinese Spring nullisomic-tetrasomic lines and deletion bin lines of the D genome chromosomes. Based on these mapping data, we have now anchored 5,171 repeat junction probes and 10,892 gene probes, corresponding to 5,070 gene markers, to the delineated deletion bins of the D genome. The order of the gene-based markers within the deletion bins of the Chinese Spring can be inferred based on their positions on the Ae. tauschii genetic map. Analysis of the probe sequences against the Chinese Spring chromosome sequence assembly database facilitated mapping of the NimbleGen probes to the sequence contigs and allowed assignment or ordering of these sequence contigs within the deletion bins. The accumulated length of anchored sequence contigs is about 155 Mb, representing ~ 3.2 % of the D genome. A specific

  20. The Naked Mole Rat Genome Resource: facilitating analyses of cancer and longevity-related adaptations

    PubMed Central

    Keane, Michael; Craig, Thomas; Alföldi, Jessica; Berlin, Aaron M.; Johnson, Jeremy; Seluanov, Andrei; Gorbunova, Vera; Di Palma, Federica; Lindblad-Toh, Kerstin; Church, George M.; de Magalhães, João Pedro

    2014-01-01

    Motivation: The naked mole rat (Heterocephalus glaber) is an exceptionally long-lived and cancer-resistant rodent native to East Africa. Although its genome was previously sequenced, here we report a new assembly sequenced by us with substantially higher N50 values for scaffolds and contigs. Results: We analyzed the annotation of this new improved assembly and identified candidate genomic adaptations which may have contributed to the evolution of the naked mole rat’s extraordinary traits, including in regions of p53, and the hyaluronan receptors CD44 and HMMR (RHAMM). Furthermore, we developed a freely available web portal, the Naked Mole Rat Genome Resource (http://www.naked-mole-rat.org), featuring the data and results of our analysis, to assist researchers interested in the genome and genes of the naked mole rat, and also to facilitate further studies on this fascinating species. Availability and implementation: The Naked Mole Rat Genome Resource is freely available online at http://www.naked-mole-rat.org. This resource is open source and the source code is available at https://github.com/maglab/naked-mole-rat-portal. Contact: jp@senescence.info PMID:25172923

  1. Annosus Root Rot in Eastern Conifers

    Treesearch

    Kathryn Robbins

    1984-01-01

    The fungus Heterobasidion annosum (Fr.) Bref. (= Fomes annosus (Fr.) Karst.) causes a root and butt rot of conifers in many temperate parts of the world. The decay, called annosus root rot, often kills infected conifers; infected trees that survive grow more slowly and are susceptible to windthrow and bark beetle attack.

  2. Annosus Root disease of Western Conifers (FIDL)

    Treesearch

    Craig L. Schmitt; John R. Parmeter; John T. Kliejunas

    2000-01-01

    Annosus root disease is found on all western conifer species but is of most concern on true firs, hemlocks, and pines. Incense cedar, coast redwood and sequoia are sometimes infected in California. Western juniper is infected throughout its range. Annosus is common and causes extensive decay in old-growth western and mountain hemlock stands. Many mixed conifer stands...

  3. AphanoDB: a genomic resource for Aphanomyces pathogens

    PubMed Central

    Madoui, Mohammed-Amine; Gaulin, Elodie; Mathé, Catherine; San Clemente, Hélène; Couloux, Arnaud; Wincker, Patrick; Dumas, Bernard

    2007-01-01

    Background The Oomycete genus Aphanomyces comprises devastating plant and animal pathogens. However, little is known about the molecular mechanisms underlying pathogenicity of Aphanomyces species. In this study, we report on the development of a public database called AphanoDB which is dedicated to Aphanomyces genomic data. As a first step, a large collection of Expressed Sequence Tags was obtained from the legume pathogen A. euteiches, which was then processed and collected into AphanoDB. Description Two cDNA libraries of A. euteiches were created: one from mycelium growing on synthetic medium and one from mycelium grown in contact to root tissues of the model legume Medicago truncatula. From these libraries, 18,684 expressed sequence tags were obtained and assembled into 7,977 unigenes which were compared to public databases for annotation. Queries on AphanoDB allow the users to retrieve information for each unigene including similarity to known protein sequences, protein domains and Gene Ontology classification. Statistical analysis of EST frequency from the two different growth conditions was also added to the database. Conclusion AphanoDB is a public database with a user-friendly web interface. The sequence report pages are the main web interface which provides all annotation details for each unigene. These interactive sequence report pages are easily available through text, BLAST, Gene Ontology and expression profile search utilities. AphanoDB is available from URL: . PMID:18096036

  4. Genome resources for climate-resilient cowpea, an essential crop for food security

    USDA-ARS?s Scientific Manuscript database

    Cowpea is a legume crop that is resilient in hot and drought-prone climates, and a primary source of protein in sub-Saharan Africa and other parts of the developing world. However, genome resources for cowpea have lagged behind those of most other major crop plants. Here we describe foundational g...

  5. Genome-scale resources for Thermoanaerobacterium saccharolyticum

    SciTech Connect

    Currie, Devin H.; Raman, Babu; Gowen, Christopher M.; Tschaplinski, Timothy J.; Land, Miriam L.; Brown, Steven D.; Covalla, Sean; Klingeman, Dawn Marie; Yang, Zamin Koo; Engle, Nancy L.; Johnson, Courtney M.; Rodriguez, Miguel A.; Shaw, A. Joe; Kenealy, William R.; Lynd, Lee R.; Fong, Stephen S.; Mielenz, Jonathan R.; Davison, Brian H.; Hogsett, David A.; Herring, Christopher D.

    2015-06-26

    Thermoanaerobacterium saccharolyticum is a hemicellulose-degrading thermophilic anaerobe that was previously engineered to produce ethanol at high yield. For this research, a major project was undertaken to develop this organism into an industrial biocatalyst, but the lack of genome information and resources were recognized early on as a key limitation.

  6. VectorBase: improvements to a bioinformatics resource for invertebrate vector genomics.

    PubMed

    Megy, Karine; Emrich, Scott J; Lawson, Daniel; Campbell, David; Dialynas, Emmanuel; Hughes, Daniel S T; Koscielny, Gautier; Louis, Christos; Maccallum, Robert M; Redmond, Seth N; Sheehan, Andrew; Topalis, Pantelis; Wilson, Derek

    2012-01-01

    VectorBase (http://www.vectorbase.org) is a NIAID-supported bioinformatics resource for invertebrate vectors of human pathogens. It hosts data for nine genomes: mosquitoes (three Anopheles gambiae genomes, Aedes aegypti and Culex quinquefasciatus), tick (Ixodes scapularis), body louse (Pediculus humanus), kissing bug (Rhodnius prolixus) and tsetse fly (Glossina morsitans). Hosted data range from genomic features and expression data to population genetics and ontologies. We describe improvements and integration of new data that expand our taxonomic coverage. Releases are bi-monthly and include the delivery of preliminary data for emerging genomes. Frequent updates of the genome browser provide VectorBase users with increasing options for visualizing their own high-throughput data. One major development is a new population biology resource for storing genomic variations, insecticide resistance data and their associated metadata. It takes advantage of improved ontologies and controlled vocabularies. Combined, these new features ensure timely release of multiple types of data in the public domain while helping overcome the bottlenecks of bioinformatics and annotation by engaging with our user community.

  7. The Agassiz's desert tortoise genome provides a resource for the conservation of a threatened species.

    PubMed

    Tollis, Marc; DeNardo, Dale F; Cornelius, John A; Dolby, Greer A; Edwards, Taylor; Henen, Brian T; Karl, Alice E; Murphy, Robert W; Kusumi, Kenro

    2017-01-01

    Agassiz's desert tortoise (Gopherus agassizii) is a long-lived species native to the Mojave Desert and is listed as threatened under the US Endangered Species Act. To aid conservation efforts for preserving the genetic diversity of this species, we generated a whole genome reference sequence with an annotation based on deep transcriptome sequences of adult skeletal muscle, lung, brain, and blood. The draft genome assembly for G. agassizii has a scaffold N50 length of 252 kbp and a total length of 2.4 Gbp. Genome annotation reveals 20,172 protein-coding genes in the G. agassizii assembly, and that gene structure is more similar to chicken than other turtles. We provide a series of comparative analyses demonstrating (1) that turtles are among the slowest-evolving genome-enabled reptiles, (2) amino acid changes in genes controlling desert tortoise traits such as shell development, longevity and osmoregulation, and (3) fixed variants across the Gopherus species complex in genes related to desert adaptations, including circadian rhythm and innate immune response. This G. agassizii genome reference and annotation is the first such resource for any tortoise, and will serve as a foundation for future analysis of the genetic basis of adaptations to the desert environment, allow for investigation into genomic factors affecting tortoise health, disease and longevity, and serve as a valuable resource for additional studies in this species complex.

  8. Final progress report, Construction of a genome-wide highly characterized clone resource for genome sequencing

    SciTech Connect

    Nierman, William C.

    2000-02-14

    At TIGR, the human Bacterial Artificial Chromosome (BAC) end sequencing and trimming were with an overall sequencing success rate of 65%. CalTech human BAC libraries A, B, C and D as well as Roswell Park Cancer Institute's library RPCI-11 were used. To date, we have generated >300,000 end sequences from >186,000 human BAC clones with an average read length {approx}460 bp for a total of 141 Mb covering {approx}4.7% of the genome. Over sixty percent of the clones have BAC end sequences (BESs) from both ends representing over five-fold coverage of the genome by the paired-end clones. The average phred Q20 length is {approx}400 bp. This high accuracy makes our BESs match the human finished sequences with an average identity of 99% and a match length of 450 bp, and a frequency of one match per 12.8 kb contig sequence. Our sample tracking has ensured a clone tracking accuracy of >90%, which gives researchers a high confidence in (1) retrieving the right clone from the BA C libraries based on the sequence matches; and (2) building a minimum tiling path of sequence-ready clones across the genome and genome assembly scaffolds.

  9. Ecophysiological controls of conifer distributions

    SciTech Connect

    Woodward, F.I.

    1995-07-01

    The boreal forest covers the most extensive worldwide area of conifer-dominated vegetation, with a total global area of about 12 million km{sup 2}. This large area is very species poor; in North America there are only nine widespread and dominant species of trees, of which six are conifers-Picea mariana, Picea glauca, Abies balsamea, Larix laricina, Pinus contorta, and Pinus banksiana. The remaining three angiosperms are Betula papyrifera, Populus tremuloides, and Populus balsamifera. In Fennoscandia and the former Soviet Union, 14 species dominate the boreal forest, 10 of which are conifers-Abies sibirica, Larix gmelinii, Larix sibirica, Larix sukaczewii, Picea abies, Picea ajanensis, Picea obovata, Pinus pumila, Pinus sibifica, and Pinus sylvestris. The dominant angiosperm trees are Betula pendula, Betula pubescens, Chosenia arbutifolia, and Populus tremula. Such species paucity detracts from realizing the remarkable capacity of these species to endure the harshest forest climates of the world. Both the short-term geological history and the current climate are major causes of the species paucity in the boreal forest. In general, the boreal forest has been present in its current distribution only since the Holocene era. In most cases, the dominant species of the boreal forest completed their postglacial expansion to their current distributions only over the past 2000 years. So the ecology of the forest is very young, in comparison with forests in warmer climates. It might be expected that over subsequent millennia, with no climatic change, there could be a slow influx of new species to the boreal zone; however, the extreme climatic, edaphic, and disturbance characteristics of the area are likely to set insurmountable limits on this influx of diversity.

  10. A diploid wheat TILLING resource for wheat functional genomics.

    PubMed

    Rawat, Nidhi; Sehgal, Sunish K; Joshi, Anupama; Rothe, Nolan; Wilson, Duane L; McGraw, Nathan; Vadlani, Praveen V; Li, Wanlong; Gill, Bikram S

    2012-11-07

    Triticum monococcum L., an A genome diploid einkorn wheat, was the first domesticated crop. As a diploid, it is attractive genetic model for the study of gene structure and function of wheat-specific traits. Diploid wheat is currently not amenable to reverse genetics approaches such as insertion mutagenesis and post-transcriptional gene silencing strategies. However, TILLING offers a powerful functional genetics approach for wheat gene analysis. We developed a TILLING population of 1,532 M2 families using EMS as a mutagen. A total of 67 mutants were obtained for the four genes studied. Waxy gene mutation frequencies are known to be 1/17.6 - 34.4 kb DNA in polyploid wheat TILLING populations. The T. monococcum diploid wheat TILLING population had a mutation frequency of 1/90 kb for the same gene. Lignin biosynthesis pathway genes- COMT1, HCT2, and 4CL1 had mutation frequencies of 1/86 kb, 1/92 kb and 1/100 kb, respectively. The overall mutation frequency of the diploid wheat TILLING population was 1/92 kb. The mutation frequency of a diploid wheat TILLING population was found to be higher than that reported for other diploid grasses. The rate, however, is lower than tetraploid and hexaploid wheat TILLING populations because of the higher tolerance of polyploids to mutations. Unlike polyploid wheat, most mutants in diploid wheat have a phenotype amenable to forward and reverse genetic analysis and establish diploid wheat as an attractive model to study gene function in wheat. We estimate that a TILLING population of 5, 520 will be needed to get a non-sense mutation for every wheat gene of interest with 95% probability.

  11. A diploid wheat TILLING resource for wheat functional genomics

    PubMed Central

    2012-01-01

    Background Triticum monococcum L., an A genome diploid einkorn wheat, was the first domesticated crop. As a diploid, it is attractive genetic model for the study of gene structure and function of wheat-specific traits. Diploid wheat is currently not amenable to reverse genetics approaches such as insertion mutagenesis and post-transcriptional gene silencing strategies. However, TILLING offers a powerful functional genetics approach for wheat gene analysis. Results We developed a TILLING population of 1,532 M2 families using EMS as a mutagen. A total of 67 mutants were obtained for the four genes studied. Waxy gene mutation frequencies are known to be 1/17.6 - 34.4 kb DNA in polyploid wheat TILLING populations. The T. monococcum diploid wheat TILLING population had a mutation frequency of 1/90 kb for the same gene. Lignin biosynthesis pathway genes- COMT1, HCT2, and 4CL1 had mutation frequencies of 1/86 kb, 1/92 kb and 1/100 kb, respectively. The overall mutation frequency of the diploid wheat TILLING population was 1/92 kb. Conclusion The mutation frequency of a diploid wheat TILLING population was found to be higher than that reported for other diploid grasses. The rate, however, is lower than tetraploid and hexaploid wheat TILLING populations because of the higher tolerance of polyploids to mutations. Unlike polyploid wheat, most mutants in diploid wheat have a phenotype amenable to forward and reverse genetic analysis and establish diploid wheat as an attractive model to study gene function in wheat. We estimate that a TILLING population of 5, 520 will be needed to get a non-sense mutation for every wheat gene of interest with 95% probability. PMID:23134614

  12. Bases and spaces: resources on the web for accessing the draft human genome - II - After publication of the draft

    PubMed Central

    Semple, Colin AM

    2001-01-01

    The volume of human genome sequence and the variety of web-based tools to access it continue to grow at an impressive rate, but a working knowledge of certain key resources can be sufficient to get the most from your genome. This article provides an update to Genome Biology 2000, 1(4):reviews2001.1-2001.5. PMID:11423014

  13. Characterization of whole genome radiation hybrid mapping resources for non-mammalian vertebrates.

    PubMed Central

    Kwok, C; Korn, R M; Davis, M E; Burt, D W; Critcher, R; McCarthy, L; Paw, B H; Zon, L I; Goodfellow, P N; Schmitt, K

    1998-01-01

    Radiation hybrid panels are already available for genome mapping in human and mouse. In this study we have used two model organisms (chicken and zebrafish) to show that hybrid panels that contain a full complement of the donor genome can be generated by fusion to hamster cells. The quality of the resulting hybrids has been assessed using PCR and FISH. We confirmed the utility of our panels by establishing the percentage of donor DNA present in the hybrids. Our hybrid resources will allow inexpensive gene mapping and we expect that this technology can be transferred to many other species. Such successes are providing the basis for a new era of mapping tools, in the form of whole genome radiation hybrid panels, and are opening new possibilities for systematic genome analysis in the animal genetics community. PMID:9671819

  14. AphidBase: A centralized bioinformatic resource for annotation of the pea aphid genome

    PubMed Central

    Legeai, Fabrice; Shigenobu, Shuji; Gauthier, Jean-Pierre; Colbourne, John; Rispe, Claude; Collin, Olivier; Richards, Stephen; Wilson, Alex C. C.; Tagu, Denis

    2015-01-01

    AphidBase is a centralized bioinformatic resource that was developed to facilitate community annotation of the pea aphid genome by the International Aphid Genomics Consortium (IAGC). The AphidBase Information System designed to organize and distribute genomic data and annotations for a large international community was constructed using open source software tools from the Generic Model Organism Database (GMOD). The system includes Apollo and GBrowse utilities as well as a wiki, blast search capabilities and a full text search engine. AphidBase strongly supported community cooperation and coordination in the curation of gene models during community annotation of the pea aphid genome. AphidBase can be accessed at http://www.aphidbase.com. PMID:20482635

  15. Searching the Mouse Genome Informatics (MGI) Resources for Information on Mouse Biology from Genotype to Phenotype.

    PubMed

    Shaw, David R

    2016-12-08

    The Mouse Genome Informatics (MGI) resource provides the research community with access to information on the genetics, genomics, and biology of the laboratory mouse. Core data in MGI include gene characterization and function, phenotype and disease model descriptions, DNA and protein sequence data, gene expression data, vertebrate homologies, SNPs, mapping data, and links to other bioinformatics databases. Semantic integration is supported through the use of standardized nomenclature, and through the use of controlled vocabularies such as the mouse Anatomical Dictionary, the Mammalian Phenotype Ontology, and the Gene Ontologies. MGI extracts and organizes data from primary literature. MGI data are shared with and widely displayed from other bioinformatics resources. The database is updated weekly with curated annotations, and regularly adds new datasets and features. This unit provides a guide to using the MGI bioinformatics resource. © 2016 by John Wiley & Sons, Inc. Copyright © 2016 John Wiley & Sons, Inc.

  16. Whole Genome Sequencing of Elite Rice Cultivars as a Comprehensive Information Resource for Marker Assisted Selection

    PubMed Central

    Duitama, Jorge; Silva, Alexander; Sanabria, Yamid; Cruz, Daniel Felipe; Quintero, Constanza; Ballen, Carolina; Lorieux, Mathias; Scheffler, Brian; Farmer, Andrew; Torres, Edgar; Oard, James; Tohme, Joe

    2015-01-01

    Current advances in sequencing technologies and bioinformatics revealed the genomic background of rice, a staple food for the poor people, and provided the basis to develop large genomic variation databases for thousands of cultivars. Proper analysis of this massive resource is expected to give novel insights into the structure, function, and evolution of the rice genome, and to aid the development of rice varieties through marker assisted selection or genomic selection. In this work we present sequencing and bioinformatics analyses of 104 rice varieties belonging to the major subspecies of Oryza sativa. We identified repetitive elements and recurrent copy number variation covering about 200 Mbp of the rice genome. Genotyping of over 18 million polymorphic locations within O. sativa allowed us to reconstruct the individual haplotype patterns shaping the genomic background of elite varieties used by farmers throughout the Americas. Based on a reconstruction of the alleles for the gene GBSSI, we could identify novel genetic markers for selection of varieties with high amylose content. We expect that both the analysis methods and the genomic information described here would be of great use for the rice research community and for other groups carrying on similar sequencing efforts in other crops. PMID:25923345

  17. Advances in ecological genomics in forest trees and applications to genetic resources conservation and breeding.

    PubMed

    Holliday, Jason A; Aitken, Sally N; Cooke, Janice E K; Fady, Bruno; González-Martínez, Santiago C; Heuertz, Myriam; Jaramillo-Correa, Juan-Pablo; Lexer, Christian; Staton, Margaret; Whetten, Ross W; Plomion, Christophe

    2017-02-01

    Forest trees are an unparalleled group of organisms in their combined ecological, economic and societal importance. With widespread distributions, predominantly random mating systems and large population sizes, most tree species harbour extensive genetic variation both within and among populations. At the same time, demographic processes associated with Pleistocene climate oscillations and land-use change have affected contemporary range-wide diversity and may impinge on the potential for future adaptation. Understanding how these adaptive and neutral processes have shaped the genomes of trees species is therefore central to their management and conservation. As for many other taxa, the advent of high-throughput sequencing methods is expected to yield an understanding of the interplay between the genome and environment at a level of detail and depth not possible only a few years ago. An international conference entitled 'Genomics and Forest Tree Genetics' was held in May 2016, in Arcachon (France), and brought together forest geneticists with a wide range of research interests to disseminate recent efforts that leverage contemporary genomic tools to probe the population, quantitative and evolutionary genomics of trees. An important goal of the conference was to discuss how such data can be applied to both genome-enabled breeding and the conservation of forest genetic resources under land use and climate change. Here, we report discoveries presented at the meeting and discuss how the ecological genomic toolkit can be used to address both basic and applied questions in tree biology. © 2016 John Wiley & Sons Ltd.

  18. Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells.

    PubMed

    Yang, Wanjuan; Soares, Jorge; Greninger, Patricia; Edelman, Elena J; Lightfoot, Howard; Forbes, Simon; Bindal, Nidhi; Beare, Dave; Smith, James A; Thompson, I Richard; Ramaswamy, Sridhar; Futreal, P Andrew; Haber, Daniel A; Stratton, Michael R; Benes, Cyril; McDermott, Ultan; Garnett, Mathew J

    2013-01-01

    Alterations in cancer genomes strongly influence clinical responses to treatment and in many instances are potent biomarkers for response to drugs. The Genomics of Drug Sensitivity in Cancer (GDSC) database (www.cancerRxgene.org) is the largest public resource for information on drug sensitivity in cancer cells and molecular markers of drug response. Data are freely available without restriction. GDSC currently contains drug sensitivity data for almost 75 000 experiments, describing response to 138 anticancer drugs across almost 700 cancer cell lines. To identify molecular markers of drug response, cell line drug sensitivity data are integrated with large genomic datasets obtained from the Catalogue of Somatic Mutations in Cancer database, including information on somatic mutations in cancer genes, gene amplification and deletion, tissue type and transcriptional data. Analysis of GDSC data is through a web portal focused on identifying molecular biomarkers of drug sensitivity based on queries of specific anticancer drugs or cancer genes. Graphical representations of the data are used throughout with links to related resources and all datasets are fully downloadable. GDSC provides a unique resource incorporating large drug sensitivity and genomic datasets to facilitate the discovery of new therapeutic biomarkers for cancer therapies.

  19. A new cancer genome anatomy project web resource for the community.

    PubMed

    Schaefer, C; Grouse, L; Buetow, K; Strausberg, R L

    2001-01-01

    The National Cancer Institute's Cancer Genome Anatomy Project (CGAP) is developing publicly accessible information, technology, and material resources that provide a platform for the interface of cancer research and genomics. CGAP's efforts have focused toward (1) building and annotating catalogues of genes expressed during cancer development, (2) identifying polymorphisms in those genes, and (3) developing resources for the molecular characterization of cancer-related chromosomal aberrations. To date, CGAP has produced more than 1,000,000 expressed sequence tags, approximately 3,300,000 serial analysis of gene expression tags, and identified more than 10,000 human gene-based single-nucleotide polymorphisms. To enhance access to these datasets by the research community, a new Cancer Genome Project web site (http://cgap.nci.nih.gov/) is being introduced. The web site includes genomic data for humans and mice, including transcript sequence, gene expression patterns, single-nucleotide polymorphisms, clone resources, and cytogenetic information. Descriptions of the methods and reagents used in deriving the CGAP datasets are also provided. An extensive suite of informatics tools facilitates queries and analysis of the CGAP data by the community. One of the newest features of the CGAP web site is an electronic version of the Mitelman Database of Chromosome Aberrations in Cancer.

  20. A genome resource to address mechanisms of developmental programming: determination of the fetal sheep heart transcriptome.

    PubMed

    Cox, Laura A; Glenn, Jeremy P; Spradling, Kimberly D; Nijland, Mark J; Garcia, Roy; Nathanielsz, Peter W; Ford, Stephen P

    2012-06-15

    The pregnant sheep has provided seminal insights into reproduction related to animal and human development (ovarian function, fertility, implantation, fetal growth, parturition and lactation). Fetal sheep physiology has been extensively studied since 1950, contributing significantly to the basis for our understanding of many aspects of fetal development and behaviour that remain in use in clinical practice today. Understanding mechanisms requires the combination of systems approaches uniquely available in fetal sheep with the power of genomic studies. Absence of the full range of sheep genomic resources has limited the full realization of the power of this model, impeding progress in emerging areas of pregnancy biology such as developmental programming. We have examined the expressed fetal sheep heart transcriptome using high-throughput sequencing technologies. In so doing we identified 36,737 novel transcripts and describe genes, gene variants and pathways relevant to fundamental developmental mechanisms. Genes with the highest expression levels and with novel exons in the fetal heart transcriptome are known to play central roles in muscle development. We show that high-throughput sequencing methods can generate extensive transcriptome information in the absence of an assembled and annotated genome for that species. The gene sequence data obtained provide a unique genomic resource for sheep specific genetic technology development and, combined with the polymorphism data, augment annotation and assembly of the sheep genome. In addition, identification and pathway analysis of novel fetal sheep heart transcriptome splice variants is a first step towards revealing mechanisms of genetic variation and gene environment interactions during fetal heart development.

  1. Genomic Resources for Gene Discovery, Functional Genome Annotation, and Evolutionary Studies of Maize and Its Close Relatives

    PubMed Central

    Wang, Chao; Shi, Xue; Liu, Lin; Li, Haiyan; Ammiraju, Jetty S.S.; Kudrna, David A.; Xiong, Wentao; Wang, Hao; Dai, Zhaozhao; Zheng, Yonglian; Lai, Jinsheng; Jin, Weiwei; Messing, Joachim; Bennetzen, Jeffrey L; Wing, Rod A.; Luo, Meizhong

    2013-01-01

    Maize is one of the most important food crops and a key model for genetics and developmental biology. A genetically anchored and high-quality draft genome sequence of maize inbred B73 has been obtained to serve as a reference sequence. To facilitate evolutionary studies in maize and its close relatives, much like the Oryza Map Alignment Project (OMAP) (www.OMAP.org) bacterial artificial chromosome (BAC) resource did for the rice community, we constructed BAC libraries for maize inbred lines Zheng58, Chang7-2, and Mo17 and maize wild relatives Zea mays ssp. parviglumis and Tripsacum dactyloides. Furthermore, to extend functional genomic studies to maize and sorghum, we also constructed binary BAC (BIBAC) libraries for the maize inbred B73 and the sorghum landrace Nengsi-1. The BAC/BIBAC vectors facilitate transfer of large intact DNA inserts from BAC clones to the BIBAC vector and functional complementation of large DNA fragments. These seven Zea Map Alignment Project (ZMAP) BAC/BIBAC libraries have average insert sizes ranging from 92 to 148 kb, organellar DNA from 0.17 to 2.3%, empty vector rates between 0.35 and 5.56%, and genome equivalents of 4.7- to 8.4-fold. The usefulness of the Parviglumis and Tripsacum BAC libraries was demonstrated by mapping clones to the reference genome. Novel genes and alleles present in these ZMAP libraries can now be used for functional complementation studies and positional or homology-based cloning of genes for translational genomics. PMID:24037269

  2. Genomic resources for gene discovery, functional genome annotation, and evolutionary studies of maize and its close relatives.

    PubMed

    Wang, Chao; Shi, Xue; Liu, Lin; Li, Haiyan; Ammiraju, Jetty S S; Kudrna, David A; Xiong, Wentao; Wang, Hao; Dai, Zhaozhao; Zheng, Yonglian; Lai, Jinsheng; Jin, Weiwei; Messing, Joachim; Bennetzen, Jeffrey L; Wing, Rod A; Luo, Meizhong

    2013-11-01

    Maize is one of the most important food crops and a key model for genetics and developmental biology. A genetically anchored and high-quality draft genome sequence of maize inbred B73 has been obtained to serve as a reference sequence. To facilitate evolutionary studies in maize and its close relatives, much like the Oryza Map Alignment Project (OMAP) (www.OMAP.org) bacterial artificial chromosome (BAC) resource did for the rice community, we constructed BAC libraries for maize inbred lines Zheng58, Chang7-2, and Mo17 and maize wild relatives Zea mays ssp. parviglumis and Tripsacum dactyloides. Furthermore, to extend functional genomic studies to maize and sorghum, we also constructed binary BAC (BIBAC) libraries for the maize inbred B73 and the sorghum landrace Nengsi-1. The BAC/BIBAC vectors facilitate transfer of large intact DNA inserts from BAC clones to the BIBAC vector and functional complementation of large DNA fragments. These seven Zea Map Alignment Project (ZMAP) BAC/BIBAC libraries have average insert sizes ranging from 92 to 148 kb, organellar DNA from 0.17 to 2.3%, empty vector rates between 0.35 and 5.56%, and genome equivalents of 4.7- to 8.4-fold. The usefulness of the Parviglumis and Tripsacum BAC libraries was demonstrated by mapping clones to the reference genome. Novel genes and alleles present in these ZMAP libraries can now be used for functional complementation studies and positional or homology-based cloning of genes for translational genomics.

  3. A Genomic Resource for the Development, Improvement, and Exploitation of Sorghum for Bioenergy

    PubMed Central

    Brenton, Zachary W.; Cooper, Elizabeth A.; Myers, Mathew T.; Boyles, Richard E.; Shakoor, Nadia; Zielinski, Kelsey J.; Rauh, Bradley L.; Bridges, William C.; Morris, Geoffrey P.; Kresovich, Stephen

    2016-01-01

    With high productivity and stress tolerance, numerous grass genera of the Andropogoneae have emerged as candidates for bioenergy production. To optimize these candidates, research examining the genetic architecture of yield, carbon partitioning, and composition is required to advance breeding objectives. Significant progress has been made developing genetic and genomic resources for Andropogoneae, and advances in comparative and computational genomics have enabled research examining the genetic basis of photosynthesis, carbon partitioning, composition, and sink strength. To provide a pivotal resource aimed at developing a comparative understanding of key bioenergy traits in the Andropogoneae, we have established and characterized an association panel of 390 racially, geographically, and phenotypically diverse Sorghum bicolor accessions with 232,303 genetic markers. Sorghum bicolor was selected because of its genomic simplicity, phenotypic diversity, significant genomic tools, and its agricultural productivity and resilience. We have demonstrated the value of sorghum as a functional model for candidate gene discovery for bioenergy Andropogoneae by performing genome-wide association analysis for two contrasting phenotypes representing key components of structural and non-structural carbohydrates. We identified potential genes, including a cellulase enzyme and a vacuolar transporter, associated with increased non-structural carbohydrates that could lead to bioenergy sorghum improvement. Although our analysis identified genes with potentially clear functions, other candidates did not have assigned functions, suggesting novel molecular mechanisms for carbon partitioning traits. These results, combined with our characterization of phenotypic and genetic diversity and the public accessibility of each accession and genomic data, demonstrate the value of this resource and provide a foundation for future improvement of sorghum and related grasses for bioenergy production

  4. A Genomic Resource for the Development, Improvement, and Exploitation of Sorghum for Bioenergy.

    PubMed

    Brenton, Zachary W; Cooper, Elizabeth A; Myers, Mathew T; Boyles, Richard E; Shakoor, Nadia; Zielinski, Kelsey J; Rauh, Bradley L; Bridges, William C; Morris, Geoffrey P; Kresovich, Stephen

    2016-09-01

    With high productivity and stress tolerance, numerous grass genera of the Andropogoneae have emerged as candidates for bioenergy production. To optimize these candidates, research examining the genetic architecture of yield, carbon partitioning, and composition is required to advance breeding objectives. Significant progress has been made developing genetic and genomic resources for Andropogoneae, and advances in comparative and computational genomics have enabled research examining the genetic basis of photosynthesis, carbon partitioning, composition, and sink strength. To provide a pivotal resource aimed at developing a comparative understanding of key bioenergy traits in the Andropogoneae, we have established and characterized an association panel of 390 racially, geographically, and phenotypically diverse Sorghum bicolor accessions with 232,303 genetic markers. Sorghum bicolor was selected because of its genomic simplicity, phenotypic diversity, significant genomic tools, and its agricultural productivity and resilience. We have demonstrated the value of sorghum as a functional model for candidate gene discovery for bioenergy Andropogoneae by performing genome-wide association analysis for two contrasting phenotypes representing key components of structural and non-structural carbohydrates. We identified potential genes, including a cellulase enzyme and a vacuolar transporter, associated with increased non-structural carbohydrates that could lead to bioenergy sorghum improvement. Although our analysis identified genes with potentially clear functions, other candidates did not have assigned functions, suggesting novel molecular mechanisms for carbon partitioning traits. These results, combined with our characterization of phenotypic and genetic diversity and the public accessibility of each accession and genomic data, demonstrate the value of this resource and provide a foundation for future improvement of sorghum and related grasses for bioenergy production.

  5. Evidence of intense chromosomal shuffling during conifer evolution.

    PubMed

    de Miguel, Marina; Bartholomé, Jérôme; Ehrenmann, François; Murat, Florent; Moriguchi, Yoshinari; Uchiyama, Kentaro; Ueno, Saneyoshi; Tsumura, Yoshihiko; Lagraulet, Hélène; de Maria, Nuria; Cabezas, José-Antonio; Cervera, Maria-Teresa; Gion, Jean Marc; Salse, Jérôme; Plomion, Christophe

    2015-09-23

    While recent advances have been gained on genome evolution in angiosperm lineages, virtually nothing is known about karyotype evolution in the other group of seed plants, the gymnosperms. Here we used high density gene-based linkage mapping to compare the karyotype structure of two families of conifers (the most abundant group of gymnosperms) separated around 290 million years ago: Pinaceae and Cupressaceae. We propose for the first time a model based on the fusion of 20 ancestral chromosomal blocks that may have shaped the modern karyotpes of Pinaceae (with n=12) and Cupressaceae (with n=11). The considerable difference in modern genome organization between these two lineages contrasts strongly with the remarkable level of synteny already reported within the Pinaceae. It also suggests a convergent evolutionary mechanism of chromosomal block shuffling that has shaped the genomes of the spermatophytes. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  6. Evidence of Intense Chromosomal Shuffling during Conifer Evolution

    PubMed Central

    de Miguel, Marina; Bartholomé, Jérôme; Ehrenmann, François; Murat, Florent; Moriguchi, Yoshinari; Uchiyama, Kentaro; Ueno, Saneyoshi; Tsumura, Yoshihiko; Lagraulet, Hélène; de Maria, Nuria; Cabezas, José-Antonio; Cervera, María-Teresa; Gion, Jean Marc; Salse, Jérôme; Plomion, Christophe

    2015-01-01

    Although recent advances have been gained on genome evolution in angiosperm lineages, virtually nothing is known about karyotype evolution in the other group of seed plants, the gymnosperms. Here, we used high-density gene-based linkage mapping to compare the karyotype structure of two families of conifers (the most abundant group of gymnosperms) separated around 290 Ma: Pinaceae and Cupressaceae. We propose for the first time a model based on the fusion of 20 ancestral chromosomal blocks that may have shaped the modern karyotpes of Pinaceae (with n = 12) and Cupressaceae (with n = 11). The considerable difference in modern genome organization between these two lineages contrasts strongly with the remarkable level of synteny already reported within the Pinaceae. It also suggests a convergent evolutionary mechanism of chromosomal block shuffling that has shaped the genomes of the spermatophytes. PMID:26400405

  7. Development of genomic resources for Nothofagus species using next-generation sequencing data.

    PubMed

    El Mujtar, V A; Gallo, L A; Lang, T; Garnier-Géré, P

    2014-11-01

    Using next-generation sequencing, we developed the first whole-genome resources for two hybridizing Nothofagus species of the Patagonian forests that crucially lack genomic data, despite their ecological and industrial value. A de novo assembly strategy combining base quality control and optimization of the putative chloroplast gene map yielded ~32,000 contigs from 43% of the reads produced. With 12.5% of assembled reads, we covered ~96% of the chloroplast genome and ~70% of the mitochondrial gene content, providing functional and structural annotations for 112 and 52 genes, respectively. Functional annotation was possible on 15% of the contigs, with ~1750 potentially novel nuclear genes identified for Nothofagus species. We estimated that the new resources (13.41 Mb in total) included ~4000 gene regions representing ~6.5% of the expected genic partition of the genome, the remaining contigs potentially being nongenic DNA. A high-quality single nucleotide polymorphisms resource was developed by comparing various filtering methods, and preliminary results indicate a strong conservation of cpDNA genomes in contrast to numerous exclusive nuclear polymorphisms in both species. Finally, we characterized 2274 potential simple sequence repeat (SSR) loci, designed primers for 769 of them and validated nine of 29 loci in 42 individuals per species. Nothofagus obliqua had more alleles (4.89) on average than N. nervosa (2.89), 8 SSRs were efficient to discriminate species, and three were successfully transferred in three other Nothofagus species. These resources will greatly help for future inferences of demographic, adaptive and hybridizing events in Nothofagus species, and for conserving and managing natural populations. © 2014 John Wiley & Sons Ltd.

  8. DFAST and DAGA: web-based integrated genome annotation tools and resources

    PubMed Central

    TANIZAWA, Yasuhiro; FUJISAWA, Takatomo; KAMINUMA, Eli; NAKAMURA, Yasukazu; ARITA, Masanori

    2016-01-01

    Quality assurance and correct taxonomic affiliation of data submitted to public sequence databases have been an everlasting problem. The DDBJ Fast Annotation and Submission Tool (DFAST) is a newly developed genome annotation pipeline with quality and taxonomy assessment tools. To enable annotation of ready-to-submit quality, we also constructed curated reference protein databases tailored for lactic acid bacteria. DFAST was developed so that all the procedures required for DDBJ submission could be done seamlessly online. The online workspace would be especially useful for users not familiar with bioinformatics skills. In addition, we have developed a genome repository, DFAST Archive of Genome Annotation (DAGA), which currently includes 1,421 genomes covering 179 species and 18 subspecies of two genera, Lactobacillus and Pediococcus, obtained from both DDBJ/ENA/GenBank and Sequence Read Archive (SRA). All the genomes deposited in DAGA were annotated consistently and assessed using DFAST. To assess the taxonomic position based on genomic sequence information, we used the average nucleotide identity (ANI), which showed high discriminative power to determine whether two given genomes belong to the same species. We corrected mislabeled or misidentified genomes in the public database and deposited the curated information in DAGA. The repository will improve the accessibility and reusability of genome resources for lactic acid bacteria. By exploiting the data deposited in DAGA, we found intraspecific subgroups in Lactobacillus gasseri and Lactobacillus jensenii, whose variation between subgroups is larger than the well-accepted ANI threshold of 95% to differentiate species. DFAST and DAGA are freely accessible at https://dfast.nig.ac.jp. PMID:27867804

  9. DFAST and DAGA: web-based integrated genome annotation tools and resources.

    PubMed

    Tanizawa, Yasuhiro; Fujisawa, Takatomo; Kaminuma, Eli; Nakamura, Yasukazu; Arita, Masanori

    2016-01-01

    Quality assurance and correct taxonomic affiliation of data submitted to public sequence databases have been an everlasting problem. The DDBJ Fast Annotation and Submission Tool (DFAST) is a newly developed genome annotation pipeline with quality and taxonomy assessment tools. To enable annotation of ready-to-submit quality, we also constructed curated reference protein databases tailored for lactic acid bacteria. DFAST was developed so that all the procedures required for DDBJ submission could be done seamlessly online. The online workspace would be especially useful for users not familiar with bioinformatics skills. In addition, we have developed a genome repository, DFAST Archive of Genome Annotation (DAGA), which currently includes 1,421 genomes covering 179 species and 18 subspecies of two genera, Lactobacillus and Pediococcus, obtained from both DDBJ/ENA/GenBank and Sequence Read Archive (SRA). All the genomes deposited in DAGA were annotated consistently and assessed using DFAST. To assess the taxonomic position based on genomic sequence information, we used the average nucleotide identity (ANI), which showed high discriminative power to determine whether two given genomes belong to the same species. We corrected mislabeled or misidentified genomes in the public database and deposited the curated information in DAGA. The repository will improve the accessibility and reusability of genome resources for lactic acid bacteria. By exploiting the data deposited in DAGA, we found intraspecific subgroups in Lactobacillus gasseri and Lactobacillus jensenii, whose variation between subgroups is larger than the well-accepted ANI threshold of 95% to differentiate species. DFAST and DAGA are freely accessible at https://dfast.nig.ac.jp.

  10. Resources for Genetic and Genomic Analysis of Emerging Pathogen Acinetobacter baumannii

    PubMed Central

    Ramage, Elizabeth; Weiss, Eli J.; Radey, Matthew; Hayden, Hillary S.; Held, Kiara G.; Huse, Holly K.; Zurawski, Daniel V.; Brittnacher, Mitchell J.; Manoil, Colin

    2015-01-01

    ABSTRACT Acinetobacter baumannii is a Gram-negative bacterial pathogen notorious for causing serious nosocomial infections that resist antibiotic therapy. Research to identify factors responsible for the pathogen's success has been limited by the resources available for genome-scale experimental studies. This report describes the development of several such resources for A. baumannii strain AB5075, a recently characterized wound isolate that is multidrug resistant and displays robust virulence in animal models. We report the completion and annotation of the genome sequence, the construction of a comprehensive ordered transposon mutant library, the extension of high-coverage transposon mutant pool sequencing (Tn-seq) to the strain, and the identification of the genes essential for growth on nutrient-rich agar. These resources should facilitate large-scale genetic analysis of virulence, resistance, and other clinically relevant traits that make A. baumannii a formidable public health threat. IMPORTANCE Acinetobacter baumannii is one of six bacterial pathogens primarily responsible for antibiotic-resistant infections that have become the scourge of health care facilities worldwide. Eliminating such infections requires a deeper understanding of the factors that enable the pathogen to persist in hospital environments, establish infections, and resist antibiotics. We present a set of resources that should accelerate genome-scale genetic characterization of these traits for a reference isolate of A. baumannii that is highly virulent and representative of current outbreak strains. PMID:25845845

  11. Genome resources for climate-resilient cowpea, an essential crop for food security.

    PubMed

    Muñoz-Amatriaín, María; Mirebrahim, Hamid; Xu, Pei; Wanamaker, Steve I; Luo, MingCheng; Alhakami, Hind; Alpert, Matthew; Atokple, Ibrahim; Batieno, Benoit J; Boukar, Ousmane; Bozdag, Serdar; Cisse, Ndiaga; Drabo, Issa; Ehlers, Jeffrey D; Farmer, Andrew; Fatokun, Christian; Gu, Yong Q; Guo, Yi-Ning; Huynh, Bao-Lam; Jackson, Scott A; Kusi, Francis; Lawley, Cynthia T; Lucas, Mitchell R; Ma, Yaqin; Timko, Michael P; Wu, Jiajie; You, Frank; Barkley, Noelle A; Roberts, Philip A; Lonardi, Stefano; Close, Timothy J

    2017-03-01

    Cowpea (Vigna unguiculata L. Walp.) is a legume crop that is resilient to hot and drought-prone climates, and a primary source of protein in sub-Saharan Africa and other parts of the developing world. However, genome resources for cowpea have lagged behind most other major crops. Here we describe foundational genome resources and their application to the analysis of germplasm currently in use in West African breeding programs. Resources developed from the African cultivar IT97K-499-35 include a whole-genome shotgun (WGS) assembly, a bacterial artificial chromosome (BAC) physical map, and assembled sequences from 4355 BACs. These resources and WGS sequences of an additional 36 diverse cowpea accessions supported the development of a genotyping assay for 51 128 SNPs, which was then applied to five bi-parental RIL populations to produce a consensus genetic map containing 37 372 SNPs. This genetic map enabled the anchoring of 100 Mb of WGS and 420 Mb of BAC sequences, an exploration of genetic diversity along each linkage group, and clarification of macrosynteny between cowpea and common bean. The SNP assay enabled a diversity analysis of materials from West African breeding programs. Two major subpopulations exist within those materials, one of which has significant parentage from South and East Africa and more diversity. There are genomic regions of high differentiation between subpopulations, one of which coincides with a cluster of nodulin genes. The new resources and knowledge help to define goals and accelerate the breeding of improved varieties to address food security issues related to limited-input small-holder farming and climate stress. © 2016 The Authors. The Plant Journal published by John Wiley & Sons Ltd and Society for Experimental Biology.

  12. Preterm Birth Genome Project (PGP) -- validation of resources for preterm birth genome-wide studies.

    PubMed

    Pennell, Craig E; Vadillo-Ortega, Felipe; Olson, David M; Ha, Eun-Hee; Williams, Scott; Frayling, Tim M; Dolan, Siobhan; Katz, Michael; Merialdi, Mario; Menon, Ramkumar

    2013-01-01

    We determined a series of quality control (QC) analyses to assess the usability of DNA collected and processed from different countries utilizing different DNA extraction techniques prior to genome-wide association studies (GWAS). The quality of DNA collected utilizing four different DNA extraction techniques and the impact of shipping DNA at different temperatures on array performance were evaluated. Fifteen maternal-fetal pairs were used from four countries. DNA was extracted using four approaches: whole blood, blood spots with whole genome amplification (WGA), saliva and buccal swab. Samples were sent to a genotyping facility, either on dry ice or at room temperature and genotyped using Affymetrix SNP array 6.0. QC measured included extraction techniques, effect of shipping temperatures, accuracy and Mendelian concordance. Significantly fewer (50 % ) single nucleotide polymorphisms (SNPs) passed QC metrics for buccal swab DNA (P < 0.0001) due to missing genotype data (P < 0.0001). Whole blood or saliva DNA had the highest call rates (99.2 0.4 % and 99.3 0.2 % , respectively) and Mendelian concordance. Shipment temperature had no effect. DNA from blood or saliva had the highest call rate accuracy, and buccal swabs had the lowest. DNA extracted from blood, saliva and blood spots were found suitable for GWAS in our study.

  13. Genomics Encyclopedia of Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB): a resource for microsymbiont genomes (2013 DOE JGI Genomics of Energy and Environment 8th Annual User Meeting)

    SciTech Connect

    Reeve, Wayne

    2013-03-01

    Wayne Reeve of Murdoch University on "Genomics Encyclopedia of Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB): a resource for microsymbiont genomes" at the 8th Annual Genomics of Energy & Environment Meeting on March 27, 2013 in Walnut Creek, Calif.

  14. YersiniaBase: a genomic resource and analysis platform for comparative analysis of Yersinia.

    PubMed

    Tan, Shi Yang; Dutta, Avirup; Jakubovics, Nicholas S; Ang, Mia Yang; Siow, Cheuk Chuen; Mutha, Naresh Vr; Heydari, Hamed; Wee, Wei Yee; Wong, Guat Jah; Choo, Siew Woh

    2015-01-16

    Yersinia is a Gram-negative bacteria that includes serious pathogens such as the Yersinia pestis, which causes plague, Yersinia pseudotuberculosis, Yersinia enterocolitica. The remaining species are generally considered non-pathogenic to humans, although there is evidence that at least some of these species can cause occasional infections using distinct mechanisms from the more pathogenic species. With the advances in sequencing technologies, many genomes of Yersinia have been sequenced. However, there is currently no specialized platform to hold the rapidly-growing Yersinia genomic data and to provide analysis tools particularly for comparative analyses, which are required to provide improved insights into their biology, evolution and pathogenicity. To facilitate the ongoing and future research of Yersinia, especially those generally considered non-pathogenic species, a well-defined repository and analysis platform is needed to hold the Yersinia genomic data and analysis tools for the Yersinia research community. Hence, we have developed the YersiniaBase, a robust and user-friendly Yersinia resource and analysis platform for the analysis of Yersinia genomic data. YersiniaBase has a total of twelve species and 232 genome sequences, of which the majority are Yersinia pestis. In order to smooth the process of searching genomic data in a large database, we implemented an Asynchronous JavaScript and XML (AJAX)-based real-time searching system in YersiniaBase. Besides incorporating existing tools, which include JavaScript-based genome browser (JBrowse) and Basic Local Alignment Search Tool (BLAST), YersiniaBase also has in-house developed tools: (1) Pairwise Genome Comparison tool (PGC) for comparing two user-selected genomes; (2) Pathogenomics Profiling Tool (PathoProT) for comparative pathogenomics analysis of Yersinia genomes; (3) YersiniaTree for constructing phylogenetic tree of Yersinia. We ran analyses based on the tools and genomic data in YersiniaBase and the

  15. Performance Ratio Based Resource Allocation Decision-Making in Genomic Medicine.

    PubMed

    Fragoulakis, Vasilios; Mitropoulou, Christina; Katelidou, Daphne; van Schaik, Ron H; Maniadakis, Nikolaos; Patrinos, George P

    2017-02-01

    In modern healthcare systems, the available resources may influence the morbidity, mortality, and-consequently-the level of healthcare provided in every country. This is of particular interest in developing countries where the resources are limited and must be spent wisely to address social justice and the right for equal access in healthcare services by all the citizens in economically viable terms. In this light, the current allocation is, in practice, inefficient and rests mostly on each country's individual political and historical context and, thus, does not always incorporate decision-making enabled by economic models. In this study, we present a new economic model, specifically for resource allocation for genomic medicine, based on performance ratio, with potential applications in diverse healthcare sectors, which are particularly appealing for developing countries and low-resource environments. The model proposes a new method for resource allocation taking into account (1) the size of innovation of a new technology, (2) the relative effectiveness in comparison with social preferences, and (3) the cost of the technology, which permits the measurement of effectiveness to be determined differently in the context of a specific disease and then to be expressed in a relative form using a common performance ratio. The present work expands on previous work for innovation in economic models pertaining to genomic medicine and supports translational science.

  16. Munich Information Center for Protein Sequences Plant Genome Resources. A Framework for Integrative and Comparative Analyses1[w

    PubMed Central

    Schoof, Heiko; Spannagl, Manuel; Yang, Li; Ernst, Rebecca; Gundlach, Heidrun; Haase, Dirk; Haberer, Georg; Mayer, Klaus F.X.

    2005-01-01

    With several plant genomes sequenced, the power of comparative genome analysis can now be applied. However, genome-scale cross-species analyses are limited by the effort for data integration. To develop an integrated cross-species plant genome resource, we maintain comprehensive databases for model plant genomes, including Arabidopsis (Arabidopsis thaliana), maize (Zea mays), Medicago truncatula, and rice (Oryza sativa). Integration of data and resources is emphasized, both in house as well as with external partners and databases. Manual curation and state-of-the-art bioinformatic analysis are combined to achieve quality data. Easy access to the data is provided through Web interfaces and visualization tools, bulk downloads, and Web services for application-level access. This allows a consistent view of the model plant genomes for comparative and evolutionary studies, the transfer of knowledge between species, and the integration with functional genomics data. PMID:16010004

  17. Mouse Genome Database (MGD)-2017: community knowledge resource for the laboratory mouse

    PubMed Central

    Blake, Judith A.; Eppig, Janan T.; Kadin, James A.; Richardson, Joel E.; Smith, Cynthia L.; Bult, Carol J.

    2017-01-01

    The Mouse Genome Database (MGD: http://www.informatics.jax.org) is the primary community data resource for the laboratory mouse. It provides a highly integrated and highly curated system offering a comprehensive view of current knowledge about mouse genes, genetic markers and genomic features as well as the associations of those features with sequence, phenotypes, functional and comparative information, and their relationships to human diseases. MGD continues to enhance access to these data, to extend the scope of data content and visualizations, and to provide infrastructure and user support that ensures effective and efficient use of MGD in the advancement of scientific knowledge. Here, we report on recent enhancements made to the resource and new features. PMID:27899570

  18. Utilizing Genomics to Study Entomopathogenicity in the Fungal Phylum Entomophthoromycota: A Review of Current Genetic Resources.

    PubMed

    De Fine Licht, H H; Hajek, A E; Eilenberg, J; Jensen, A B

    2016-01-01

    The order Entomophthorales, which formerly contained c.280 species, has recently been recognized as a separate phylum, Entomophthoromycota, consisting of three recognized classes and six families. Many genera in this group contain obligate insect-pathogenic species with narrow host ranges, capable of producing epizootics in natural insect populations. Available sequence information from the phylum Entomophthoromycota can be classified into three main categories: first, partial gene regions (exons+introns) used for phylogenetic inference; second, protein coding gene regions obtained using degenerate primers, expressed sequence tag methodology or de novo transcriptome sequencing with molecular function inferred by homology analysis; and third, primarily forthcoming whole-genome sequencing data sets. Here we summarize the current genetic resources for Entomophthoromycota and identify research areas that are likely to be significantly advanced from the availability of new whole-genome resources.

  19. SkateBase, an elasmobranch genome project and collection of molecular resources for chondrichthyan fishes

    PubMed Central

    Wyffels, Jennifer; L. King, Benjamin; Vincent, James; Chen, Chuming; Wu, Cathy H.; Polson, Shawn W.

    2014-01-01

    Chondrichthyan fishes are a diverse class of gnathostomes that provide a valuable perspective on fundamental characteristics shared by all jawed and limbed vertebrates. Studies of phylogeny, species diversity, population structure, conservation, and physiology are accelerated by genomic, transcriptomic and protein sequence data. These data are widely available for many sarcopterygii (coelacanth, lungfish and tetrapods) and actinoptergii (ray-finned fish including teleosts) taxa, but limited for chondrichthyan fishes.  In this study, we summarize available data for chondrichthyes and describe resources for one of the largest projects to characterize one of these fish, Leucoraja erinacea, the little skate.  SkateBase ( http://skatebase.org) serves as the skate genome project portal linking data, research tools, and teaching resources. PMID:25309735

  20. Genomic resources for a model in adaptation and speciation research: characterization of the Poecilia mexicana transcriptome

    PubMed Central

    2012-01-01

    Background Elucidating the genomic basis of adaptation and speciation is a major challenge in natural systems with large quantities of environmental and phenotypic data, mostly because of the scarcity of genomic resources for non-model organisms. The Atlantic molly (Poecilia mexicana, Poeciliidae) is a small livebearing fish that has been extensively studied for evolutionary ecology research, particularly because this species has repeatedly colonized extreme environments in the form of caves and toxic hydrogen sulfide containing springs. In such extreme environments, populations show strong patterns of adaptive trait divergence and the emergence of reproductive isolation. Here, we used RNA-sequencing to assemble and annotate the first transcriptome of P. mexicana to facilitate ecological genomics studies in the future and aid the identification of genes underlying adaptation and speciation in the system. Description We provide the first annotated reference transcriptome of P. mexicana. Our transcriptome shows high congruence with other published fish transcriptomes, including that of the guppy, medaka, zebrafish, and stickleback. Transcriptome annotation uncovered the presence of candidate genes relevant in the study of adaptation to extreme environments. We describe general and oxidative stress response genes as well as genes involved in pathways induced by hypoxia or involved in sulfide metabolism. To facilitate future comparative analyses, we also conducted quantitative comparisons between P. mexicana from different river drainages. 106,524 single nucleotide polymorphisms were detected in our dataset, including potential markers that are putatively fixed across drainages. Furthermore, specimens from different drainages exhibited some consistent differences in gene regulation. Conclusions Our study provides a valuable genomic resource to study the molecular underpinnings of adaptation to extreme environments in replicated sulfide spring and cave environments. In

  1. Optimizing de novo transcriptome assembly and extending genomic resources for striped catfish (Pangasianodon hypophthalmus).

    PubMed

    Thanh, Nguyen Minh; Jung, Hyungtaek; Lyons, Russell E; Njaci, Isaac; Yoon, Byoung-Ha; Chand, Vincent; Tuan, Nguyen Viet; Thu, Vo Thi Minh; Mather, Peter

    2015-10-01

    Striped catfish (Pangasianodon hypophthalmus) is a commercially important freshwater fish used in inland aquaculture in the Mekong Delta, Vietnam. The culture industry is facing a significant challenge however from saltwater intrusion into many low topographical coastal provinces across the Mekong Delta as a result of predicted climate change impacts. Developing genomic resources for this species can facilitate the production of improved culture lines that can withstand raised salinity conditions, and so we have applied high-throughput Ion Torrent sequencing of transcriptome libraries from six target osmoregulatory organs from striped catfish as a genomic resource for use in future selection strategies. We obtained 12,177,770 reads after trimming and processing with an average length of 97bp. De novo assemblies were generated using CLC Genomic Workbench, Trinity and Velvet/Oases with the best overall contig performance resulting from the CLC assembly. De novo assembly using CLC yielded 66,451 contigs with an average length of 478bp and N50 length of 506bp. A total of 37,969 contigs (57%) possessed significant similarity with proteins in the non-redundant database. Comparative analyses revealed that a significant number of contigs matched sequences reported in other teleost fishes, ranging in similarity from 45.2% with Atlantic cod to 52% with zebrafish. In addition, 28,879 simple sequence repeats (SSRs) and 55,721 single nucleotide polymorphisms (SNPs) were detected in the striped catfish transcriptome. The sequence collection generated in the current study represents the most comprehensive genomic resource for P. hypophthalmus available to date. Our results illustrate the utility of next-generation sequencing as an efficient tool for constructing a large genomic database for marker development in non-model species. Copyright © 2015 Elsevier B.V. All rights reserved.

  2. StreptoBase: An Oral Streptococcus mitis Group Genomic Resource and Analysis Platform.

    PubMed

    Zheng, Wenning; Tan, Tze King; Paterson, Ian C; Mutha, Naresh V R; Siow, Cheuk Chuen; Tan, Shi Yang; Old, Lesley A; Jakubovics, Nicholas S; Choo, Siew Woh

    2016-01-01

    The oral streptococci are spherical Gram-positive bacteria categorized under the phylum Firmicutes which are among the most common causative agents of bacterial infective endocarditis (IE) and are also important agents in septicaemia in neutropenic patients. The Streptococcus mitis group is comprised of 13 species including some of the most common human oral colonizers such as S. mitis, S. oralis, S. sanguinis and S. gordonii as well as species such as S. tigurinus, S. oligofermentans and S. australis that have only recently been classified and are poorly understood at present. We present StreptoBase, which provides a specialized free resource focusing on the genomic analyses of oral species from the mitis group. It currently hosts 104 S. mitis group genomes including 27 novel mitis group strains that we sequenced using the high throughput Illumina HiSeq technology platform, and provides a comprehensive set of genome sequences for analyses, particularly comparative analyses and visualization of both cross-species and cross-strain characteristics of S. mitis group bacteria. StreptoBase incorporates sophisticated in-house designed bioinformatics web tools such as Pairwise Genome Comparison (PGC) tool and Pathogenomic Profiling Tool (PathoProT), which facilitate comparative pathogenomics analysis of Streptococcus strains. Examples are provided to demonstrate how StreptoBase can be employed to compare genome structure of different S. mitis group bacteria and putative virulence genes profile across multiple streptococcal strains. In conclusion, StreptoBase offers access to a range of streptococci genomic resources as well as analysis tools and will be an invaluable platform to accelerate research in streptococci. Database URL: http://streptococcus.um.edu.my.

  3. StreptoBase: An Oral Streptococcus mitis Group Genomic Resource and Analysis Platform

    PubMed Central

    Zheng, Wenning; Paterson, Ian C.; Mutha, Naresh V. R.; Siow, Cheuk Chuen; Tan, Shi Yang; Old, Lesley A.; Jakubovics, Nicholas S.; Choo, Siew Woh

    2016-01-01

    The oral streptococci are spherical Gram-positive bacteria categorized under the phylum Firmicutes which are among the most common causative agents of bacterial infective endocarditis (IE) and are also important agents in septicaemia in neutropenic patients. The Streptococcus mitis group is comprised of 13 species including some of the most common human oral colonizers such as S. mitis, S. oralis, S. sanguinis and S. gordonii as well as species such as S. tigurinus, S. oligofermentans and S. australis that have only recently been classified and are poorly understood at present. We present StreptoBase, which provides a specialized free resource focusing on the genomic analyses of oral species from the mitis group. It currently hosts 104 S. mitis group genomes including 27 novel mitis group strains that we sequenced using the high throughput Illumina HiSeq technology platform, and provides a comprehensive set of genome sequences for analyses, particularly comparative analyses and visualization of both cross-species and cross-strain characteristics of S. mitis group bacteria. StreptoBase incorporates sophisticated in-house designed bioinformatics web tools such as Pairwise Genome Comparison (PGC) tool and Pathogenomic Profiling Tool (PathoProT), which facilitate comparative pathogenomics analysis of Streptococcus strains. Examples are provided to demonstrate how StreptoBase can be employed to compare genome structure of different S. mitis group bacteria and putative virulence genes profile across multiple streptococcal strains. In conclusion, StreptoBase offers access to a range of streptococci genomic resources as well as analysis tools and will be an invaluable platform to accelerate research in streptococci. Database URL: http://streptococcus.um.edu.my. PMID:27138013

  4. RSIADB, a collective resource for genome and transcriptome analyses in Rhizoctonia solani AG1 IA.

    PubMed

    Chen, Lei; Ai, Peng; Zhang, Jinfeng; Deng, Qiming; Wang, Shiquan; Li, Shuangcheng; Zhu, Jun; Li, Ping; Zheng, Aiping

    2016-01-01

    Rice [Oryza sativa (L.)] feeds more than half of the world's population. Rhizoctonia solaniis a major fungal pathogen of rice causing extreme crop losses in all rice-growing regions of the world. R. solani AG1 IA is a major cause of sheath blight in rice. In this study, we constructed a comprehensive and user-friendly web-based database, RSIADB, to analyse its draft genome and transcriptome. The database was built using the genome sequence (10,489 genes) and annotation information for R. solani AG1 IA. A total of six RNAseq samples of R. solani AG1 IA were also analysed, corresponding to 10, 18, 24, 32, 48 and 72 h after infection of rice leaves. The RSIADB database enables users to search, browse, and download gene sequences for R. solani AG1 IA, and mine the data using BLAST, Sequence Extractor, Browse and Construction Diagram tools that were integrated into the database. RSIADB is an important genomic resource for scientists working with R. solani AG1 IA and will assist researchers in analysing the annotated genome and transcriptome of this pathogen. This resource will facilitate studies on gene function, pathogenesis factors and secreted proteins, as well as provide an avenue for comparative analyses of genes expressed during different stages of infection. Database URL:http://genedenovoweb.ticp.net:81/rsia/index.php.

  5. Defense Mechanisms of Conifers 1

    PubMed Central

    Lewinsohn, Efraim; Gijzen, Mark; Savage, Thomas J.; Croteau, Rodney

    1991-01-01

    Cell-free extracts from Pinus ponderosa Lawson (ponderosa pine) and Pinus sylvestris L. (Scotch pine) wood exhibited high levels of monoterpene synthase (cyclase) activity, whereas bark extracts of these species contained no detectable activity, and they inhibited cyclase activity when added to extracts from wood, unless polyvinylpyrrolidone was included in the preparation. The molecular mass of the polyvinylpyrrolidone added was of little consequence; however, polyvinylpolypyrrolidone (a cross-linked insoluble form of the polymer) was ineffective in protecting enzyme activity. Based on these observations, methods were developed for the efficient extraction and assay of monoterpene cyclase activity from conifer stem (wood and bark) tissue. The level of monoterpene cyclase activity for a given conifer species was shown to correlate closely with the monoterpene content of the oleoresin and with the degree of anatomical complexity of the specialized resin-secreting structures. Cyclase activity and monoterpene content were lowest in the stems of species containing only isolated resin cells, such as western red cedar (Thuja plicata D. Don). Increasing levels of cyclase activity and oleoresin monoterpenes were observed in advancing from species with multicellular resin blisters (true firs [Abies]) to those with organized resin passages, such as western larch (Larix occidentalis Nutt.), Colorado blue spruce (Picea pungens Engelm.) and Douglas-fir (Pseudotsuga menziesii [Mirb.] Franco). The highest levels of cyclase activity and oleoresin monoterpenes were noted in Pinus species that contain the most highly developed resin duct systems. The relationship between biosynthetic capacity, as measured by cyclase activity, monoterpene content, and the degree of organization of the secretory structures for a given species, may reflect the total number of specialized resin-producing cells per unit mass of stem tissue. PMID:16668182

  6. Biomass in conifer plantations of northeastern Minnesota.

    Treesearch

    Lewis F. Ohmann

    1984-01-01

    Provides biomass (pounds/acre) estimates for vegetative strata and herb-low shrub species for 53 conifer plantations in Northeastern Minnesota. The estimates are analyzed by plantation age and silvicultural practices used to establish and release the plantations.

  7. Resources

    MedlinePlus

    ... Gastrointestinal disorders - resources Hearing impairment - resources Hearing or speech impairment - resources Heart disease - resources Hemophilia - resources Herpes - resources Incest - resources Incontinence - ...

  8. Physical mapping resources for large plant genomes: radiation hybrids for wheat D-genome progenitor Aegilops tauschii

    PubMed Central

    2012-01-01

    lines with an average deletion frequency of ~10% were identified for developing high density marker scaffolds of the D-genome. Conclusions The RH panel reported here is the first developed for any wild ancestor of a major cultivated plant species. The results provided insight into various aspects of RH mapping in plants, including the genetically effective cell number for wheat (for the first time) and the potential implementation of this technique in other plant species. This RH panel will be an invaluable resource for mapping gene based markers, developing a complete marker scaffold for the whole genome sequence assembly, fine mapping of markers and functional characterization of genes and gene networks present on the D-genome. PMID:23127207

  9. Physical mapping resources for large plant genomes: radiation hybrids for wheat D-genome progenitor Aegilops tauschii.

    PubMed

    Kumar, Ajay; Simons, Kristin; Iqbal, Muhammad J; de Jiménez, Monika Michalak; Bassi, Filippo M; Ghavami, Farhad; Al-Azzam, Omar; Drader, Thomas; Wang, Yi; Luo, Ming-Cheng; Gu, Yong Q; Denton, Anne; Lazo, Gerard R; Xu, Steven S; Dvorak, Jan; Kianian, Penny M A; Kianian, Shahryar F

    2012-11-05

    deletion frequency of ~10% were identified for developing high density marker scaffolds of the D-genome. The RH panel reported here is the first developed for any wild ancestor of a major cultivated plant species. The results provided insight into various aspects of RH mapping in plants, including the genetically effective cell number for wheat (for the first time) and the potential implementation of this technique in other plant species. This RH panel will be an invaluable resource for mapping gene based markers, developing a complete marker scaffold for the whole genome sequence assembly, fine mapping of markers and functional characterization of genes and gene networks present on the D-genome.

  10. Characteristics of snags containing excavated cavities in northern Arizona mixed-conifer and ponderosa pine forests

    Treesearch

    Joseph L. Ganey; Scott C. Vojta

    2004-01-01

    Snags provide an important resource for a rich assemblage of cavity-nesting birds in the southwestern United States. To expand our knowledge of snag use by cavity-nesting birds in this region, we documented characteristics of snags with and without excavated cavities in mixed-conifer and ponderosa pine (Pinus ponderosa Dougl. ex Laws) forest in north...

  11. Compatible management of red alder-conifer ecosystems in southeastern Alaska.

    Treesearch

    Mark S. Wipfli; Robert L. Deal; Paul E. Hennon; Adelaide C. Johnson; Richard T. Edwards; Toni L. De Santo; Takashi Gomi; Ewa H. Orlikowska; Mason D. Bryant; Mark E. Schultz; Christian LeSage; Ryan Kimbirauskus; David V. D' Amore

    2003-01-01

    Forest clearcutting has been the primary timber management practice in forests of southeastern Alaska since commercial timber harvesting began in the 1950s, and the dense, even-aged conifer stands that subsequently developed have broad and undesirable consequences for some nontimber resources-most notably, fish and wildlife. Because a few earlier reports suggested that...

  12. Structure, specificity, and evolution of insect guilds related to cones of conifers in Western Europe

    Treesearch

    Alain Roques

    1991-01-01

    Patchy and ephemeral resources, such as the cones of conifers, can be very useful in the study of plant-insect relationships. Studies of such relationships in forest entomology are typically complicated by the spatial and temporal characteristics of the host plants, which occur over vast areas and have lifespans of decades or even centuries. The reproductive structures...

  13. Density of large snags and logs in northern Arizona mixed-conifer and ponderosa pine forests

    Treesearch

    Joseph L. Ganey; Benjamin J. Bird; L. Scott Baggett; Jeffrey S. Jenness

    2015-01-01

    Large snags and logs provide important biological legacies and resources for native wildlife, yet data on populations of large snags and logs and factors influencing those populations are sparse. We monitored populations of large snags and logs in mixed-conifer and ponderosa pine (Pinus ponderosa) forests in northern Arizona from 1997 through 2012. We modeled density...

  14. In situ gene conservation of six conifers in western Washington and Oregon.

    Treesearch

    S.R. Lipow; K. Vance-Borland; J.B. St. Clair; J.A. Henderson; C. McCain

    2007-01-01

    A gap analysis was conducted to evaluate the extent to which genetic resources are conserved in situ in protected areas for six species of conifers in the Pacific Northwest. The gap analysis involved producing a geographic information system (GIS) detailing the location of protected areas and the distribution and abundance of tree species, as inferred from data on...

  15. Computational methods and resources for the interpretation of genomic variants in cancer

    PubMed Central

    2015-01-01

    The recent improvement of the high-throughput sequencing technologies is having a strong impact on the detection of genetic variations associated with cancer. Several institutions worldwide have been sequencing the whole exomes and or genomes of cancer patients in the thousands, thereby providing an invaluable collection of new somatic mutations in different cancer types. These initiatives promoted the development of methods and tools for the analysis of cancer genomes that are aimed at studying the relationship between genotype and phenotype in cancer. In this article we review the online resources and computational tools for the analysis of cancer genome. First, we describe the available repositories of cancer genome data. Next, we provide an overview of the methods for the detection of genetic variation and computational tools for the prioritization of cancer related genes and causative somatic variations. Finally, we discuss the future perspectives in cancer genomics focusing on the impact of computational methods and quantitative approaches for defining personalized strategies to improve the diagnosis and treatment of cancer. PMID:26111056

  16. WormBase: a multi-species resource for nematode biology and genomics

    PubMed Central

    Harris, Todd W.; Chen, Nansheng; Cunningham, Fiona; Tello-Ruiz, Marcela; Antoshechkin, Igor; Bastiani, Carol; Bieri, Tamberlyn; Blasiar, Darin; Bradnam, Keith; Chan, Juancarlos; Chen, Chao-Kung; Chen, Wen J.; Davis, Paul; Kenny, Eimear; Kishore, Ranjana; Lawson, Daniel; Lee, Raymond; Muller, Hans-Michael; Nakamura, Cecilia; Ozersky, Philip; Petcherski, Andrei; Rogers, Anthony; Sabo, Aniko; Schwarz, Erich M.; Van Auken, Kimberly; Wang, Qinghua; Durbin, Richard; Spieth, John; Sternberg, Paul W.; Stein, Lincoln D.

    2004-01-01

    WormBase (http://www.wormbase.org/) is the central data repository for information about Caenorhabditis elegans and related nematodes. As a model organism database, WormBase extends beyond the genomic sequence, integrating experimental results with extensively annotated views of the genome. The WormBase Consortium continues to expand the biological scope and utility of WormBase with the inclusion of large-scale genomic analyses, through active data and literature curation, through new analysis and visualization tools, and through refinement of the user interface. Over the past year, the nearly complete genomic sequence and comparative analyses of the closely related species Caenorhabditis briggsae have been integrated into WormBase, including gene predictions, ortholog assignments and a new synteny viewer to display the relationships between the two species. Extensive site-wide refinement of the user interface now provides quick access to the most frequently accessed resources and a consistent browsing experience across the site. Unified single-page views now provide complete summaries of commonly accessed entries like genes. These advances continue to increase the utility of WormBase for C.elegans researchers, as well as for those researchers exploring problems in functional and comparative genomics in the context of a powerful genetic system. PMID:14681445

  17. Computational methods and resources for the interpretation of genomic variants in cancer.

    PubMed

    Tian, Rui; Basu, Malay K; Capriotti, Emidio

    2015-01-01

    The recent improvement of the high-throughput sequencing technologies is having a strong impact on the detection of genetic variations associated with cancer. Several institutions worldwide have been sequencing the whole exomes and or genomes of cancer patients in the thousands, thereby providing an invaluable collection of new somatic mutations in different cancer types. These initiatives promoted the development of methods and tools for the analysis of cancer genomes that are aimed at studying the relationship between genotype and phenotype in cancer. In this article we review the online resources and computational tools for the analysis of cancer genome. First, we describe the available repositories of cancer genome data. Next, we provide an overview of the methods for the detection of genetic variation and computational tools for the prioritization of cancer related genes and causative somatic variations. Finally, we discuss the future perspectives in cancer genomics focusing on the impact of computational methods and quantitative approaches for defining personalized strategies to improve the diagnosis and treatment of cancer.

  18. flyDIVaS: A Comparative Genomics Resource for Drosophila Divergence and Selection

    PubMed Central

    Stanley, Craig E.; Kulathinal, Rob J.

    2016-01-01

    With arguably the best finished and expertly annotated genome assembly, Drosophila melanogaster is a formidable genetics model to study all aspects of biology. Nearly a decade ago, the 12 Drosophila genomes project expanded D. melanogaster’s breadth as a comparative model through the community-development of an unprecedented genus- and genome-wide comparative resource. However, since its inception, these datasets for evolutionary inference and biological discovery have become increasingly outdated, outmoded, and inaccessible. Here, we provide an updated and upgradable comparative genomics resource of Drosophila divergence and selection, flyDIVaS, based on the latest genomic assemblies, curated FlyBase annotations, and recent OrthoDB orthology calls. flyDIVaS is an online database containing D. melanogaster-centric orthologous gene sets, CDS and protein alignments, divergence statistics (% gaps, dN, dS, dN/dS), and codon-based tests of positive Darwinian selection. Out of 13,920 protein-coding D. melanogaster genes, ∼80% have one aligned ortholog in the closely related species, D. simulans, and ∼50% have 1–1 12-way alignments in the original 12 sequenced species that span over 80 million yr of divergence. Genes and their orthologs can be chosen from four different taxonomic datasets differing in phylogenetic depth and coverage density, and visualized via interactive alignments and phylogenetic trees. Users can also batch download entire comparative datasets. A functional survey finds conserved mitotic and neural genes, highly diverged immune and reproduction-related genes, more conspicuous signals of divergence across tissue-specific genes, and an enrichment of positive selection among highly diverged genes. flyDIVaS will be regularly updated and can be freely accessed at www.flydivas.info. We encourage researchers to regularly use this resource as a tool for biological inference and discovery, and in their classrooms to help train the next generation of

  19. Infrared radiation from hot cones on cool conifers attracts seed-feeding insects

    PubMed Central

    Takács, Stephen; Bottomley, Hannah; Andreller, Iisak; Zaradnik, Tracy; Schwarz, Joseph; Bennett, Robb; Strong, Ward; Gries, Gerhard

    2008-01-01

    Foraging animals use diverse cues to locate resources. Common foraging cues have visual, auditory, olfactory, tactile or gustatory characteristics. Here, we show a foraging herbivore using infrared (IR) radiation from living plants as a host-finding cue. We present data revealing that (i) conifer cones are warmer and emit more near-, mid- and long-range IR radiation than needles, (ii) cone-feeding western conifer seed bugs, Leptoglossus occidentalis (Hemiptera: Coreidae), possess IR receptive organs and orient towards experimental IR cues, and (iii) occlusion of the insects' IR receptors impairs IR perception. The conifers' cost of attracting cone-feeding insects may be offset by occasional mast seeding resulting in cone crops too large to be effectively exploited by herbivores. PMID:18945664

  20. NeisseriaBase: a specialised Neisseria genomic resource and analysis platform.

    PubMed

    Zheng, Wenning; Mutha, Naresh V R; Heydari, Hamed; Dutta, Avirup; Siow, Cheuk Chuen; Jakubovics, Nicholas S; Wee, Wei Yee; Tan, Shi Yang; Ang, Mia Yang; Wong, Guat Jah; Choo, Siew Woh

    2016-01-01

    Database (VFDB) specific homology searches, the VFDB BLAST is also incorporated into the database. In addition, NeisseriaBase is equipped with in-house designed tools such as the Pairwise Genome Comparison tool (PGC) for comparative genomic analysis and the Pathogenomics Profiling Tool (PathoProT) for the comparative pathogenomics analysis of Neisseria strains. Discussion. This user-friendly database not only provides access to a host of genomic resources on Neisseria but also enables high-quality comparative genome analysis, which is crucial for the expanding scientific community interested in Neisseria research. This database is freely available at http://neisseria.um.edu.my.

  1. NeisseriaBase: a specialised Neisseria genomic resource and analysis platform

    PubMed Central

    Zheng, Wenning; Mutha, Naresh V.R.; Heydari, Hamed; Dutta, Avirup; Siow, Cheuk Chuen; Jakubovics, Nicholas S.; Wee, Wei Yee; Tan, Shi Yang; Ang, Mia Yang; Wong, Guat Jah

    2016-01-01

    Database (VFDB) specific homology searches, the VFDB BLAST is also incorporated into the database. In addition, NeisseriaBase is equipped with in-house designed tools such as the Pairwise Genome Comparison tool (PGC) for comparative genomic analysis and the Pathogenomics Profiling Tool (PathoProT) for the comparative pathogenomics analysis of Neisseria strains. Discussion. This user-friendly database not only provides access to a host of genomic resources on Neisseria but also enables high-quality comparative genome analysis, which is crucial for the expanding scientific community interested in Neisseria research. This database is freely available at http://neisseria.um.edu.my. PMID:27017950

  2. A cricket Gene Index: a genomic resource for studying neurobiology, speciation, and molecular evolution

    PubMed Central

    Danley, Patrick D; Mullen, Sean P; Liu, Fenglong; Nene, Vishvanath; Quackenbush, John; Shaw, Kerry L

    2007-01-01

    Background As the developmental costs of genomic tools decline, genomic approaches to non-model systems are becoming more feasible. Many of these systems may lack advanced genetic tools but are extremely valuable models in other biological fields. Here we report the development of expressed sequence tags (EST's) in an orthopteroid insect, a model for the study of neurobiology, speciation, and evolution. Results We report the sequencing of 14,502 EST's from clones derived from a nerve cord cDNA library, and the subsequent construction of a Gene Index from these sequences, from the Hawaiian trigonidiine cricket Laupala kohalensis. The Gene Index contains 8607 unique sequences comprised of 2575 tentative consensus (TC) sequences and 6032 singletons. For each of the unique sequences, an attempt was made to assign a provisional annotation and to categorize its function using a Gene Ontology-based classification through a sequence-based comparison to known proteins. In addition, a set of unique 70 base pair oligomers that can be used for DNA microarrays was developed. All Gene Index information is posted at the DFCI Gene Indices web page Conclusion Orthopterans are models used to understand the neurophysiological basis of complex motor patterns such as flight and stridulation. The sequences presented in the cricket Gene Index will provide neurophysiologists with many genetic tools that have been largely absent in this field. The cricket Gene Index is one of only two gene indices to be developed in an evolutionary model system. Species within the genus Laupala have speciated recently, rapidly, and extensively. Therefore, the genes identified in the cricket Gene Index can be used to study the genomics of speciation. Furthermore, this gene index represents a significant EST resources for basal insects. As such, this resource is a valuable comparative tool for the understanding of invertebrate molecular evolution. The sequences presented here will provide much needed genomic

  3. The Culture Collection of Algae and Protozoa (CCAP): a biological resource for protistan genomics.

    PubMed

    Gachon, Claire M M; Day, John G; Campbell, Christine N; Pröschold, Thomas; Saxon, Rachel J; Küpper, Frithjof C

    2007-12-30

    CCAP, the largest European protistan culture collection, is based at the Scottish Association for Marine Science near Oban, Scotland (http://www.ccap.ac.uk). The Collection comprises more than 2700 strains in the public domain, of which 1050 are marine algae, 1300 freshwater algae, and 350 protozoa. The primary mission of CCAP is to maintain and distribute defined cultures and their associated information to its customers. It also has a support and advisory function on all aspects of protistan science. In addition, it is involved in the training of students and researchers in algal identification and culture techniques. In light of the increasing number of fully sequenced protists, the CCAP is striving to provide targeted services and support to workers involved in all aspects of genomic research. At present, the Collection holds several hundred strains of genomic model taxa including: Acanthamoeba, Cafeteria, Cercomonas, Chlamydomonas, Chlorella, Cyanophora, Dictyostelium, Dunaliella, Ectocarpus, Emiliania, Euglena, Micromonas, Naegleria, Nephroselmis, Paramecium, Pavlova, Phaeodactylum, Porphyra, Pseudendoclonium, Pylaiella, Rhodomonas, Scenedesmus, Staurastrum, Tetrahymena, Thalassiosira, Volvox and Zygnema. These strains provide a defined representation of natural variation within model organisms, an increasingly useful resource for post-genomics approaches. Our aim over the next 2-5 years is to add value to the Collection by increasing the number of genome model species, and by offering an integrated, up-to-date, easy-to-use resource that would provide curated information on our strain holdings. In collaboration with other major Biological Resource Centres worldwide, we intend to build a hub providing access to both protistan cultures and their associated bioinformatics data.

  4. A genome resource to address mechanisms of developmental programming: determination of the fetal sheep heart transcriptome

    PubMed Central

    Cox, Laura A; Glenn, Jeremy P; Spradling, Kimberly D; Nijland, Mark J; Garcia, Roy; Nathanielsz, Peter W; Ford, Stephen P

    2012-01-01

    The pregnant sheep has provided seminal insights into reproduction related to animal and human development (ovarian function, fertility, implantation, fetal growth, parturition and lactation). Fetal sheep physiology has been extensively studied since 1950, contributing significantly to the basis for our understanding of many aspects of fetal development and behaviour that remain in use in clinical practice today. Understanding mechanisms requires the combination of systems approaches uniquely available in fetal sheep with the power of genomic studies. Absence of the full range of sheep genomic resources has limited the full realization of the power of this model, impeding progress in emerging areas of pregnancy biology such as developmental programming. We have examined the expressed fetal sheep heart transcriptome using high-throughput sequencing technologies. In so doing we identified 36,737 novel transcripts and describe genes, gene variants and pathways relevant to fundamental developmental mechanisms. Genes with the highest expression levels and with novel exons in the fetal heart transcriptome are known to play central roles in muscle development. We show that high-throughput sequencing methods can generate extensive transcriptome information in the absence of an assembled and annotated genome for that species. The gene sequence data obtained provide a unique genomic resource for sheep specific genetic technology development and, combined with the polymorphism data, augment annotation and assembly of the sheep genome. In addition, identification and pathway analysis of novel fetal sheep heart transcriptome splice variants is a first step towards revealing mechanisms of genetic variation and gene environment interactions during fetal heart development. PMID:22508961

  5. Introducing National Center for Genome Resources (NCGR) Informatics (Seventh Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting 2012)

    SciTech Connect

    Crow, John

    2012-06-01

    John Crow from the National Center for Genome Resources discusses his organization's informatics at the 7th Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting held in June, 2012 in Santa Fe, NM.

  6. Introducing National Center for Genome Resources (NCGR) Informatics (Seventh Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting 2012)

    ScienceCinema

    Crow, John [National Center for Genome Resources

    2016-07-12

    John Crow from the National Center for Genome Resources discusses his organization's informatics at the 7th Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting held in June, 2012 in Santa Fe, NM.

  7. Putting the Pieces Together: Clinically Relevant Genetic and Genomic Resources for Hospitalists and Neonatologists.

    PubMed

    Miller, Rebecca; Khromykh, Alina; Babcock, Holly; Jenevein, Callie; Solomon, Benjamin D

    2017-02-01

    Genetic conditions are individually rare but are common in aggregate, and they often present in the neonatal and early pediatric periods. These conditions are often severe, can be difficult to diagnose and manage, and may heavily affect patients, families, health care systems, and society. Because of recent technological advances, the availability and uptake of genetic and genomic testing are increasing rapidly. However, there is a dearth of trained geneticists and genetic counselors to help guide and explain these conditions and relevant tests. To help hospitalists, neonatologists, and related practitioners navigate this complex and evolving field, we have compiled a list of free (mostly Web-based) resources relevant to the diagnosis and management of genetic conditions and related disorders. These resources, which we describe individually, can be useful for nongeneticist clinicians, and some also include material that can be used to explain concepts and conditions to patients or families. The resources presented are divided into the following categories (which overlap): general information, databases of genetic conditions, resources that can help generate differential diagnoses, databases of genetic testing laboratories (to help with logistics of ordering tests), information on newborn screening, and other resources. We also include a separate list of helpful textbooks and manuals. We conclude with 2 examples describing how some of these resources would be used by a pediatric hospitalist or neonatologist during the inpatient management of a child with a suspected genetic condition. Copyright © 2017 by the American Academy of Pediatrics.

  8. Whole-Genome Sequence of Mesorhizobium hungaricum sp. nov. Strain UASWS1009, a Potential Resource for Agricultural and Environmental Uses

    PubMed Central

    Crovadore, Julien; Cochard, Bastien; Calmin, Gautier; Chablais, Romain; Schulz, Torsten

    2016-01-01

    We report here the whole-genome shotgun sequences of the strain UASWS1009 of the species Mesorhizobium hungaricum sp. nov., which are different from any other known Mesorhizobium species. This is the first genome registered for this new species, which could be considered as a potential resource for agriculture and environmental uses. PMID:27738050

  9. Development of a high-throughput SNP resource to advance genomic, genetic and breeding research in carrot (Daucus carota L.)

    USDA-ARS?s Scientific Manuscript database

    The rapid advancement in high-throughput SNP genotyping technologies along with next generation sequencing (NGS) platforms has decreased the cost, improved the quality of large-scale genome surveys, and allowed specialty crops with limited genomic resources such as carrot (Daucus carota) to access t...

  10. Vitigene: A database for grape genomics and genetic resources delivery that benefits grape growers and scientific communities

    USDA-ARS?s Scientific Manuscript database

    A new grape genomic database was established for ‘Native American Grape Species’ as a genomic resource (http://vitigene.famu.edu:9082/eclient/ IDMLogon2.jsp). The new database hosts genetic information collected from disease tolerant/resistant grapevine endemic to North America, and is a valuable re...

  11. Decoding the massive genome of loblolly pine using haploid DNA and novel assembly strategies

    PubMed Central

    2014-01-01

    Background The size and complexity of conifer genomes has, until now, prevented full genome sequencing and assembly. The large research community and economic importance of loblolly pine, Pinus taeda L., made it an early candidate for reference sequence determination. Results We develop a novel strategy to sequence the genome of loblolly pine that combines unique aspects of pine reproductive biology and genome assembly methodology. We use a whole genome shotgun approach relying primarily on next generation sequence generated from a single haploid seed megagametophyte from a loblolly pine tree, 20-1010, that has been used in industrial forest tree breeding. The resulting sequence and assembly was used to generate a draft genome spanning 23.2 Gbp and containing 20.1 Gbp with an N50 scaffold size of 66.9 kbp, making it a significant improvement over available conifer genomes. The long scaffold lengths allow the annotation of 50,172 gene models with intron lengths averaging over 2.7 kbp and sometimes exceeding 100 kbp in length. Analysis of orthologous gene sets identifies gene families that may be unique to conifers. We further characterize and expand the existing repeat library based on the de novo analysis of the repetitive content, estimated to encompass 82% of the genome. Conclusions In addition to its value as a resource for researchers and breeders, the loblolly pine genome sequence and assembly reported here demonstrates a novel approach to sequencing the large and complex genomes of this important group of plants that can now be widely applied. PMID:24647006

  12. Soybean Knowledge Base (SoyKB): a Web Resource for Soybean Translational Genomics

    SciTech Connect

    Joshi, Trupti; Patil, Kapil; Fitzpatrick, Michael R.; Franklin, Levi D.; Yao, Qiuming; Cook, Jeffrey R.; Wang, Zhem; Libault, Marc; Brechenmacher, Laurent; Valliyodan, Babu; Wu, Xiaolei; Cheng, Jianlin; Stacey, Gary; Nguyen, Henry T.; Xu, Dong

    2012-01-17

    Background: Soybean Knowledge Base (SoyKB) is a comprehensive all-inclusive web resource for soybean translational genomics. SoyKB is designed to handle the management and integration of soybean genomics, transcriptomics, proteomics and metabolomics data along with annotation of gene function and biological pathway. It contains information on four entities, namely genes, microRNAs, metabolites and single nucleotide polymorphisms (SNPs). Methods: SoyKB has many useful tools such as Affymetrix probe ID search, gene family search, multiple gene/ metabolite search supporting co-expression analysis, and protein 3D structure viewer as well as download and upload capacity for experimental data and annotations. It has four tiers of registration, which control different levels of access to public and private data. It allows users of certain levels to share their expertise by adding comments to the data. It has a user-friendly web interface together with genome browser and pathway viewer, which display data in an intuitive manner to the soybean researchers, producers and consumers. Conclusions: SoyKB addresses the increasing need of the soybean research community to have a one-stop-shop functional and translational omics web resource for information retrieval and analysis in a user-friendly way. SoyKB can be publicly accessed at http://soykb.org/.

  13. Prokaryotic Virus Orthologous Groups (pVOGs): a resource for comparative genomics and protein family annotation

    PubMed Central

    Grazziotin, Ana Laura; Koonin, Eugene V.; Kristensen, David M.

    2017-01-01

    Viruses are the most abundant and diverse biological entities on earth, and while most of this diversity remains completely unexplored, advances in genome sequencing have provided unprecedented glimpses into the virosphere. The Prokaryotic Virus Orthologous Groups (pVOGs, formerly called Phage Orthologous Groups, POGs) resource has aided in this task over the past decade by using automated methods to keep pace with the rapid increase in genomic data. The uses of pVOGs include functional annotation of viral proteins, identification of genes and viruses in uncharacterized DNA samples, phylogenetic analysis, large-scale comparative genomics projects, and more. The pVOGs database represents a comprehensive set of orthologous gene families shared across multiple complete genomes of viruses that infect bacterial or archaeal hosts (viruses of eukaryotes will be added at a future date). The pVOGs are constructed within the Clusters of Orthologous Groups (COGs) framework that is widely used for orthology identification in prokaryotes. Since the previous release of the POGs, the size has tripled to nearly 3000 genomes and 300 000 proteins, and the number of conserved orthologous groups doubled to 9518. User-friendly webpages are available, including multiple sequence alignments and HMM profiles for each VOG. These changes provide major improvements to the pVOGs database, at a time of rapid advances in virus genomics. The pVOGs database is hosted jointly at the University of Iowa at http://dmk-brain.ecn.uiowa.edu/pVOGs and the NCBI at ftp://ftp.ncbi.nlm.nih.gov/pub/kristensen/pVOGs/home.html. PMID:27789703

  14. Prokaryotic Virus Orthologous Groups (pVOGs): a resource for comparative genomics and protein family annotation.

    PubMed

    Grazziotin, Ana Laura; Koonin, Eugene V; Kristensen, David M

    2017-01-04

    Viruses are the most abundant and diverse biological entities on earth, and while most of this diversity remains completely unexplored, advances in genome sequencing have provided unprecedented glimpses into the virosphere. The Prokaryotic Virus Orthologous Groups (pVOGs, formerly called Phage Orthologous Groups, POGs) resource has aided in this task over the past decade by using automated methods to keep pace with the rapid increase in genomic data. The uses of pVOGs include functional annotation of viral proteins, identification of genes and viruses in uncharacterized DNA samples, phylogenetic analysis, large-scale comparative genomics projects, and more. The pVOGs database represents a comprehensive set of orthologous gene families shared across multiple complete genomes of viruses that infect bacterial or archaeal hosts (viruses of eukaryotes will be added at a future date). The pVOGs are constructed within the Clusters of Orthologous Groups (COGs) framework that is widely used for orthology identification in prokaryotes. Since the previous release of the POGs, the size has tripled to nearly 3000 genomes and 300 000 proteins, and the number of conserved orthologous groups doubled to 9518. User-friendly webpages are available, including multiple sequence alignments and HMM profiles for each VOG. These changes provide major improvements to the pVOGs database, at a time of rapid advances in virus genomics. The pVOGs database is hosted jointly at the University of Iowa at http://dmk-brain.ecn.uiowa.edu/pVOGs and the NCBI at ftp://ftp.ncbi.nlm.nih.gov/pub/kristensen/pVOGs/home.html. Published by Oxford University Press on behalf of Nucleic Acids Research 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US.

  15. Gramene: A Resource for Comparative Analysis of Plants Genomes and Pathways.

    PubMed

    Tello-Ruiz, Marcela Karey; Stein, Joshua; Wei, Sharon; Youens-Clark, Ken; Jaiswal, Pankaj; Ware, Doreen

    2016-01-01

    Gramene is an integrated informatics resource for accessing, visualizing, and comparing plant genomes and biological pathways. Originally targeting grasses, Gramene has grown to host annotations for economically important and research model crops, including wheat, potato, tomato, banana, grape, poplar, and Chlamydomonas. Its strength derives from the application of a phylogenetic framework for genome comparison and the use of ontologies to integrate structural and functional annotation data. This chapter outlines system requirements for end users and database hosting, data types and basic navigation within Gramene, and provides examples of how to (1) view a phylogenetic tree for a family of transcription factors, (2) explore genetic variation in the orthologues of a gene with a known trait association, and (3) upload, visualize, and privately share end user data into a new genome browser track.Moreover, this is the first publication describing Gramene's new web interface-intended to provide a simplified portal to the most complete and up-to-date set of plant genome and pathway annotations.

  16. The SOL Genomics Network. A Comparative Resource for Solanaceae Biology and Beyond1

    PubMed Central

    Mueller, Lukas A.; Solow, Teri H.; Taylor, Nicolas; Skwarecki, Beth; Buels, Robert; Binns, John; Lin, Chenwei; Wright, Mark H.; Ahrens, Robert; Wang, Ying; Herbst, Evan V.; Keyder, Emil R.; Menda, Naama; Zamir, Dani; Tanksley, Steven D.

    2005-01-01

    The SOL Genomics Network (SGN; http://sgn.cornell.edu) is a rapidly evolving comparative resource for the plants of the Solanaceae family, which includes important crop and model plants such as potato (Solanum tuberosum), eggplant (Solanum melongena), pepper (Capsicum annuum), and tomato (Solanum lycopersicum). The aim of SGN is to relate these species to one another using a comparative genomics approach and to tie them to the other dicots through the fully sequenced genome of Arabidopsis (Arabidopsis thaliana). SGN currently houses map and marker data for Solanaceae species, a large expressed sequence tag collection with computationally derived unigene sets, an extensive database of phenotypic information for a mutagenized tomato population, and associated tools such as real-time quantitative trait loci. Recently, the International Solanaceae Project (SOL) was formed as an umbrella organization for Solanaceae research in over 30 countries to address important questions in plant biology. The first cornerstone of the SOL project is the sequencing of the entire euchromatic portion of the tomato genome. SGN is collaborating with other bioinformatics centers in building the bioinformatics infrastructure for the tomato sequencing project and implementing the bioinformatics strategy of the larger SOL project. The overarching goal of SGN is to make information available in an intuitive comparative format, thereby facilitating a systems approach to investigations into the basis of adaptation and phenotypic diversity in the Solanaceae family, other species in the Asterid clade such as coffee (Coffea arabica), Rubiaciae, and beyond. PMID:16010005

  17. TriTrypDB: a functional genomic resource for the Trypanosomatidae

    PubMed Central

    Aslett, Martin; Aurrecoechea, Cristina; Berriman, Matthew; Brestelli, John; Brunk, Brian P.; Carrington, Mark; Depledge, Daniel P.; Fischer, Steve; Gajria, Bindu; Gao, Xin; Gardner, Malcolm J.; Gingle, Alan; Grant, Greg; Harb, Omar S.; Heiges, Mark; Hertz-Fowler, Christiane; Houston, Robin; Innamorato, Frank; Iodice, John; Kissinger, Jessica C.; Kraemer, Eileen; Li, Wei; Logan, Flora J.; Miller, John A.; Mitra, Siddhartha; Myler, Peter J.; Nayak, Vishal; Pennington, Cary; Phan, Isabelle; Pinney, Deborah F.; Ramasamy, Gowthaman; Rogers, Matthew B.; Roos, David S.; Ross, Chris; Sivam, Dhileep; Smith, Deborah F.; Srinivasamoorthy, Ganesh; Stoeckert, Christian J.; Subramanian, Sandhya; Thibodeau, Ryan; Tivey, Adrian; Treatman, Charles; Velarde, Giles; Wang, Haiming

    2010-01-01

    TriTrypDB (http://tritrypdb.org) is an integrated database providing access to genome-scale datasets for kinetoplastid parasites, and supporting a variety of complex queries driven by research and development needs. TriTrypDB is a collaborative project, utilizing the GUS/WDK computational infrastructure developed by the Eukaryotic Pathogen Bioinformatics Resource Center (EuPathDB.org) to integrate genome annotation and analyses from GeneDB and elsewhere with a wide variety of functional genomics datasets made available by members of the global research community, often pre-publication. Currently, TriTrypDB integrates datasets from Leishmania braziliensis, L. infantum, L. major, L. tarentolae, Trypanosoma brucei and T. cruzi. Users may examine individual genes or chromosomal spans in their genomic context, including syntenic alignments with other kinetoplastid organisms. Data within TriTrypDB can be interrogated utilizing a sophisticated search strategy system that enables a user to construct complex queries combining multiple data types. All search strategies are stored, allowing future access and integrated searches. ‘User Comments’ may be added to any gene page, enhancing available annotation; such comments become immediately searchable via the text search, and are forwarded to curators for incorporation into the reference annotation when appropriate. PMID:19843604

  18. A reversible haploid mouse embryonic stem cell biobank resource for functional genomics.

    PubMed

    Elling, Ulrich; Wimmer, Reiner A; Leibbrandt, Andreas; Burkard, Thomas; Michlits, Georg; Leopoldi, Alexandra; Micheler, Thomas; Abdeen, Dana; Zhuk, Sergei; Aspalter, Irene M; Handl, Cornelia; Liebergesell, Julia; Hubmann, Maria; Husa, Anna-Maria; Kinzer, Manuela; Schuller, Nicole; Wetzel, Ellen; van de Loo, Nina; Martinez, Jorge Arturo Zepeda; Estoppey, David; Riedl, Ralph; Yang, Fengtang; Fu, Beiyuan; Dechat, Thomas; Ivics, Zoltán; Agu, Chukwuma A; Bell, Oliver; Blaas, Dieter; Gerhardt, Holger; Hoepfner, Dominic; Stark, Alexander; Penninger, Josef M

    2017-10-05

    The ability to directly uncover the contributions of genes to a given phenotype is fundamental for biology research. However, ostensibly homogeneous cell populations exhibit large clonal variance that can confound analyses and undermine reproducibility. Here we used genome-saturated mutagenesis to create a biobank of over 100,000 individual haploid mouse embryonic stem (mES) cell lines targeting 16,970 genes with genetically barcoded, conditional and reversible mutations. This Haplobank is, to our knowledge, the largest resource of hemi/homozygous mutant mES cells to date and is available to all researchers. Reversible mutagenesis overcomes clonal variance by permitting functional annotation of the genome directly in sister cells. We use the Haplobank in reverse genetic screens to investigate the temporal resolution of essential genes in mES cells, and to identify novel genes that control sprouting angiogenesis and lineage specification of blood vessels. Furthermore, a genome-wide forward screen with Haplobank identified PLA2G16 as a host factor that is required for cytotoxicity by rhinoviruses, which cause the common cold. Therefore, clones from the Haplobank combined with the use of reversible technologies enable high-throughput, reproducible, functional annotation of the genome.

  19. HmtDB, a genomic resource for mitochondrion-based human variability studies.

    PubMed

    Rubino, Francesco; Piredda, Roberta; Calabrese, Francesco Maria; Simone, Domenico; Lang, Martin; Calabrese, Claudia; Petruzzella, Vittoria; Tommaseo-Ponzetta, Mila; Gasparre, Giuseppe; Attimonelli, Marcella

    2012-01-01

    HmtDB (http://www.hmtdb.uniba.it:8080/hmdb) is a open resource created to support population genetics and mitochondrial disease studies. The database hosts human mitochondrial genome sequences annotated with population and variability data, the latter being estimated through the application of the SiteVar software based on site-specific nucleotide and amino acid variability calculations. The annotations are manually curated thus adding value to the quality of the information provided to the end-user. Classifier tools implemented in HmtDB allow the prediction of the haplogroup for any human mitochondrial genome currently stored in HmtDB or externally submitted de novo by an end-user. Haplogroup definition is based on the Phylotree system. End-users accessing HmtDB are hence allowed to (i) browse the database through the use of a multi-criterion 'query' system; (ii) analyze their own human mitochondrial sequences via the 'classify' tool (for complete genomes) or by downloading the 'fragment-classifier' tool (for partial sequences); (iii) download multi-alignments with reference genomes as well as variability data.

  20. HmtDB, a genomic resource for mitochondrion-based human variability studies

    PubMed Central

    Rubino, Francesco; Piredda, Roberta; Calabrese, Francesco Maria; Simone, Domenico; Lang, Martin; Calabrese, Claudia; Petruzzella, Vittoria; Tommaseo-Ponzetta, Mila; Gasparre, Giuseppe; Attimonelli, Marcella

    2012-01-01

    HmtDB (http://www.hmtdb.uniba.it:8080/hmdb) is a open resource created to support population genetics and mitochondrial disease studies. The database hosts human mitochondrial genome sequences annotated with population and variability data, the latter being estimated through the application of the SiteVar software based on site-specific nucleotide and amino acid variability calculations. The annotations are manually curated thus adding value to the quality of the information provided to the end-user. Classifier tools implemented in HmtDB allow the prediction of the haplogroup for any human mitochondrial genome currently stored in HmtDB or externally submitted de novo by an end-user. Haplogroup definition is based on the Phylotree system. End-users accessing HmtDB are hence allowed to (i) browse the database through the use of a multi-criterion ‘query’ system; (ii) analyze their own human mitochondrial sequences via the ‘classify’ tool (for complete genomes) or by downloading the ‘fragment-classifier’ tool (for partial sequences); (iii) download multi-alignments with reference genomes as well as variability data. PMID:22139932

  1. Fluorescence-based resource for semiautomated genomic analyses using microsatellite markers

    SciTech Connect

    Levitt, R.C.; Kiser, M.B.; Dragwa, C.

    1994-11-15

    To facilitate the practical application of highly-efficient semiautomated methods for general application in genomic analyses, the authors have developed a fluorescence-based microsatellite marker resource. Ninety highly polymorphic microsatellite markers were combined to provide a rapid, accurate, and highly efficient initial genome-wide screening system. These markers are spaced on average every 33 cM, with a mean heterozygosity of 81% (range 65-94%), covering 22 autosomes and the X and Y chromosomes. Less than 10% of the genome lies beyond 20 cM of the nearest marker. Since this genomic analysis system is fully compatible with automated fragment analyzers using simultaneous four-color fluorescence-based detection systems, the 5 groups of 18 markers can be detected concurrently. This multiplex detection provides a throughput of 1944 genotypes daily per instrument. This system will be highly beneficial in a number of clinical and research applications including linkage, cancer genetics, forensics, and cytogenetics. 16 refs., 1 fig., 2 tabs.

  2. MIPS Arabidopsis thaliana Database (MAtDB): an integrated biological knowledge resource for plant genomics.

    PubMed

    Schoof, Heiko; Ernst, Rebecca; Nazarov, Vladimir; Pfeifer, Lukas; Mewes, Hans-Werner; Mayer, Klaus F X

    2004-01-01

    Arabidopsis thaliana is the most widely studied model plant. Functional genomics is intensively underway in many laboratories worldwide. Beyond the basic annotation of the primary sequence data, the annotated genetic elements of Arabidopsis must be linked to diverse biological data and higher order information such as metabolic or regulatory pathways. The MIPS Arabidopsis thaliana database MAtDB aims to provide a comprehensive resource for Arabidopsis as a genome model that serves as a primary reference for research in plants and is suitable for transfer of knowledge to other plants, especially crops. The genome sequence as a common backbone serves as a scaffold for the integration of data, while, in a complementary effort, these data are enhanced through the application of state-of-the-art bioinformatics tools. This information is visualized on a genome-wide and a gene-by-gene basis with access both for web users and applications. This report updates the information given in a previous report and provides an outlook on further developments. The MAtDB web interface can be accessed at http://mips.gsf.de/proj/thal/db.

  3. Human Ageing Genomic Resources: Integrated databases and tools for the biology and genetics of ageing

    PubMed Central

    Tacutu, Robi; Craig, Thomas; Budovsky, Arie; Wuttke, Daniel; Lehmann, Gilad; Taranukha, Dmitri; Costa, Joana; Fraifeld, Vadim E.; de Magalhães, João Pedro

    2013-01-01

    The Human Ageing Genomic Resources (HAGR, http://genomics.senescence.info) is a freely available online collection of research databases and tools for the biology and genetics of ageing. HAGR features now several databases with high-quality manually curated data: (i) GenAge, a database of genes associated with ageing in humans and model organisms; (ii) AnAge, an extensive collection of longevity records and complementary traits for >4000 vertebrate species; and (iii) GenDR, a newly incorporated database, containing both gene mutations that interfere with dietary restriction-mediated lifespan extension and consistent gene expression changes induced by dietary restriction. Since its creation about 10 years ago, major efforts have been undertaken to maintain the quality of data in HAGR, while further continuing to develop, improve and extend it. This article briefly describes the content of HAGR and details the major updates since its previous publications, in terms of both structure and content. The completely redesigned interface, more intuitive and more integrative of HAGR resources, is also presented. Altogether, we hope that through its improvements, the current version of HAGR will continue to provide users with the most comprehensive and accessible resources available today in the field of biogerontology. PMID:23193293

  4. Human Ageing Genomic Resources: integrated databases and tools for the biology and genetics of ageing.

    PubMed

    Tacutu, Robi; Craig, Thomas; Budovsky, Arie; Wuttke, Daniel; Lehmann, Gilad; Taranukha, Dmitri; Costa, Joana; Fraifeld, Vadim E; de Magalhães, João Pedro

    2013-01-01

    The Human Ageing Genomic Resources (HAGR, http://genomics.senescence.info) is a freely available online collection of research databases and tools for the biology and genetics of ageing. HAGR features now several databases with high-quality manually curated data: (i) GenAge, a database of genes associated with ageing in humans and model organisms; (ii) AnAge, an extensive collection of longevity records and complementary traits for >4000 vertebrate species; and (iii) GenDR, a newly incorporated database, containing both gene mutations that interfere with dietary restriction-mediated lifespan extension and consistent gene expression changes induced by dietary restriction. Since its creation about 10 years ago, major efforts have been undertaken to maintain the quality of data in HAGR, while further continuing to develop, improve and extend it. This article briefly describes the content of HAGR and details the major updates since its previous publications, in terms of both structure and content. The completely redesigned interface, more intuitive and more integrative of HAGR resources, is also presented. Altogether, we hope that through its improvements, the current version of HAGR will continue to provide users with the most comprehensive and accessible resources available today in the field of biogerontology.

  5. Pearl millet genome sequence provides a resource to improve agronomic traits in arid environments.

    PubMed

    Varshney, Rajeev K; Shi, Chengcheng; Thudi, Mahendar; Mariac, Cedric; Wallace, Jason; Qi, Peng; Zhang, He; Zhao, Yusheng; Wang, Xiyin; Rathore, Abhishek; Srivastava, Rakesh K; Chitikineni, Annapurna; Fan, Guangyi; Bajaj, Prasad; Punnuri, Somashekhar; Gupta, S K; Wang, Hao; Jiang, Yong; Couderc, Marie; Katta, Mohan A V S K; Paudel, Dev R; Mungra, K D; Chen, Wenbin; Harris-Shultz, Karen R; Garg, Vanika; Desai, Neetin; Doddamani, Dadakhalandar; Kane, Ndjido Ardo; Conner, Joann A; Ghatak, Arindam; Chaturvedi, Palak; Subramaniam, Sabarinath; Yadav, Om Parkash; Berthouly-Salazar, Cécile; Hamidou, Falalou; Wang, Jianping; Liang, Xinming; Clotault, Jérémy; Upadhyaya, Hari D; Cubry, Philippe; Rhoné, Bénédicte; Gueye, Mame Codou; Sunkar, Ramanjulu; Dupuy, Christian; Sparvoli, Francesca; Cheng, Shifeng; Mahala, R S; Singh, Bharat; Yadav, Rattan S; Lyons, Eric; Datta, Swapan K; Hash, C Tom; Devos, Katrien M; Buckler, Edward; Bennetzen, Jeffrey L; Paterson, Andrew H; Ozias-Akins, Peggy; Grando, Stefania; Wang, Jun; Mohapatra, Trilochan; Weckwerth, Wolfram; Reif, Jochen C; Liu, Xin; Vigouroux, Yves; Xu, Xun

    2017-09-18

    Pearl millet [Cenchrus americanus (L.) Morrone] is a staple food for more than 90 million farmers in arid and semi-arid regions of sub-Saharan Africa, India and South Asia. We report the ∼1.79 Gb draft whole genome sequence of reference genotype Tift 23D2B1-P1-P5, which contains an estimated 38,579 genes. We highlight the substantial enrichment for wax biosynthesis genes, which may contribute to heat and drought tolerance in this crop. We resequenced and analyzed 994 pearl millet lines, enabling insights into population structure, genetic diversity and domestication. We use these resequencing data to establish marker trait associations for genomic selection, to define heterotic pools, and to predict hybrid performance. We believe that these resources should empower researchers and breeders to improve this important staple crop.

  6. Development of Genomic Resources for a thraustochytrid Pathogen and Investigation of Temperature Influences on Gene Expression

    PubMed Central

    Garcia-Vedrenne, Ana Elisa; Groner, Maya; Page-Karjian, Annie; Siegmund, Gregor-Fausto; Singhal, Sonia; Sziklay, Jamie; Roberts, Steven

    2013-01-01

    Understanding how environmental changes influence the pathogenicity and virulence of infectious agents is critical for predicting epidemiological patterns of disease. Thraustochytrids, part of the larger taxonomic class Labyrinthulomycetes, contain several highly pathogenic species, including the hard clam pathogen quahog parasite unknown (QPX). QPX has been associated with large-scale mortality events along the northeastern coast of North America. Growth and physiology of QPX is temperature-dependent, and changes in local temperature profiles influence pathogenicity. In this study we characterize the partial genome of QPX and examine the influence of temperature on gene expression. Genes involved in several biological processes are differentially expressed upon temperature change, including those associated with altered growth and metabolism and virulence. The genomic and transcriptomic resources developed in this study provide a foundation for better understanding virulence, pathogenicity and life history of thraustochytrid pathogens. PMID:24069279

  7. Draft genome sequence of chickpea (Cicer arietinum) provides a resource for trait improvement.

    PubMed

    Varshney, Rajeev K; Song, Chi; Saxena, Rachit K; Azam, Sarwar; Yu, Sheng; Sharpe, Andrew G; Cannon, Steven; Baek, Jongmin; Rosen, Benjamin D; Tar'an, Bunyamin; Millan, Teresa; Zhang, Xudong; Ramsay, Larissa D; Iwata, Aiko; Wang, Ying; Nelson, William; Farmer, Andrew D; Gaur, Pooran M; Soderlund, Carol; Penmetsa, R Varma; Xu, Chunyan; Bharti, Arvind K; He, Weiming; Winter, Peter; Zhao, Shancen; Hane, James K; Carrasquilla-Garcia, Noelia; Condie, Janet A; Upadhyaya, Hari D; Luo, Ming-Cheng; Thudi, Mahendar; Gowda, C L L; Singh, Narendra P; Lichtenzveig, Judith; Gali, Krishna K; Rubio, Josefa; Nadarajan, N; Dolezel, Jaroslav; Bansal, Kailash C; Xu, Xun; Edwards, David; Zhang, Gengyun; Kahl, Guenter; Gil, Juan; Singh, Karam B; Datta, Swapan K; Jackson, Scott A; Wang, Jun; Cook, Douglas R

    2013-03-01

    Chickpea (Cicer arietinum) is the second most widely grown legume crop after soybean, accounting for a substantial proportion of human dietary nitrogen intake and playing a crucial role in food security in developing countries. We report the ∼738-Mb draft whole genome shotgun sequence of CDC Frontier, a kabuli chickpea variety, which contains an estimated 28,269 genes. Resequencing and analysis of 90 cultivated and wild genotypes from ten countries identifies targets of both breeding-associated genetic sweeps and breeding-associated balancing selection. Candidate genes for disease resistance and agronomic traits are highlighted, including traits that distinguish the two main market classes of cultivated chickpea--desi and kabuli. These data comprise a resource for chickpea improvement through molecular breeding and provide insights into both genome diversity and domestication.

  8. A genome-wide resource for the analysis of protein localisation in Drosophila.

    PubMed

    Sarov, Mihail; Barz, Christiane; Jambor, Helena; Hein, Marco Y; Schmied, Christopher; Suchold, Dana; Stender, Bettina; Janosch, Stephan; K J, Vinay Vikas; Krishnan, R T; Krishnamoorthy, Aishwarya; Ferreira, Irene R S; Ejsmont, Radoslaw K; Finkl, Katja; Hasse, Susanne; Kämpfer, Philipp; Plewka, Nicole; Vinis, Elisabeth; Schloissnig, Siegfried; Knust, Elisabeth; Hartenstein, Volker; Mann, Matthias; Ramaswami, Mani; VijayRaghavan, K; Tomancak, Pavel; Schnorrer, Frank

    2016-02-20

    The Drosophila genome contains >13000 protein-coding genes, the majority of which remain poorly investigated. Important reasons include the lack of antibodies or reporter constructs to visualise these proteins. Here, we present a genome-wide fosmid library of 10000 GFP-tagged clones, comprising tagged genes and most of their regulatory information. For 880 tagged proteins, we created transgenic lines, and for a total of 207 lines, we assessed protein expression and localisation in ovaries, embryos, pupae or adults by stainings and live imaging approaches. Importantly, we visualised many proteins at endogenous expression levels and found a large fraction of them localising to subcellular compartments. By applying genetic complementation tests, we estimate that about two-thirds of the tagged proteins are functional. Moreover, these tagged proteins enable interaction proteomics from developing pupae and adult flies. Taken together, this resource will boost systematic analysis of protein expression and localisation in various cellular and developmental contexts.

  9. Defense Mechanisms of Conifers 1

    PubMed Central

    Lewinsohn, Efraim; Gijzen, Mark; Croteau, Rodney

    1991-01-01

    Levels of monoterpene cyclase activity were determined in extracts from wounded and unwounded saplings of 10 conifer species to assess whether oleoresin biosynthesis is induced by stem wounding. Species of Abies and Picea, with low to moderate levels of constitutive monoterpene cyclase activity, exhibited a five- to 15-fold increase in cyclase activity 7 days after wounding relative to unwounded controls. In contrast, species of genera such as Pinus, with high levels of constitutive cyclase activity, did not significantly respond to wounding by alteration in the level of cyclase activity. The highest fold increase in monoterpene cyclase activity was consistently observed in Abies grandis, and the time-course of induction of activity following stem wounding in this species demonstrated a threefold increase at 2 days relative to unwounded controls, rising to a maximum increase in the response at 9 days (greater than 10-fold) followed by an apparent decline. The wound response was localized, and both bark (phloem) and wood (xylem) tissues displayed increased cyclase activity at the wound site. The magnitude of the increase in cyclase activity was dependent on the severity of the wound. PMID:16668184

  10. Genomic resources for songbird research and their use in characterizing gene expression during brain development

    PubMed Central

    Li, XiaoChing; Wang, Xiu-Jie; Tannenhauser, Jonathan; Podell, Sheila; Mukherjee, Piali; Hertel, Moritz; Biane, Jeremy; Masuda, Shoko; Nottebohm, Fernando; Gaasterland, Terry

    2007-01-01

    Vocal learning and neuronal replacement have been studied extensively in songbirds, but until recently, few molecular and genomic tools for songbird research existed. Here we describe new molecular/genomic resources developed in our laboratory. We made cDNA libraries from zebra finch (Taeniopygia guttata) brains at different developmental stages. A total of 11,000 cDNA clones from these libraries, representing 5,866 unique gene transcripts, were randomly picked and sequenced from the 3′ ends. A web-based database was established for clone tracking, sequence analysis, and functional annotations. Our cDNA libraries were not normalized. Sequencing ESTs without normalization produced many developmental stage-specific sequences, yielding insights into patterns of gene expression at different stages of brain development. In particular, the cDNA library made from brains at posthatching day 30–50, corresponding to the period of rapid song system development and song learning, has the most diverse and richest set of genes expressed. We also identified five microRNAs whose sequences are highly conserved between zebra finch and other species. We printed cDNA microarrays and profiled gene expression in the high vocal center of both adult male zebra finches and canaries (Serinus canaria). Genes differentially expressed in the high vocal center were identified from the microarray hybridization results. Selected genes were validated by in situ hybridization. Networks among the regulated genes were also identified. These resources provide songbird biologists with tools for genome annotation, comparative genomics, and microarray gene expression analysis. PMID:17426146

  11. The chickpea genomic web resource: visualization and analysis of the desi-type Cicer arietinum nuclear genome for comparative exploration of legumes.

    PubMed

    Misra, Gopal; Priya, Piyush; Bandhiwal, Nitesh; Bareja, Neha; Jain, Mukesh; Bhatia, Sabhyata; Chattopadhyay, Debasis; Tyagi, Akhilesh K; Yadav, Gitanjali

    2014-12-18

    Availability of the draft nuclear genome sequences of small-seeded desi-type legume crop Cicer arietinum has provided an opportunity for investigating unique chickpea genomic features and evaluation of their biological significance. The increasing number of legume genome sequences also presents a challenge for developing reliable and information-driven bioinformatics applications suitable for comparative exploration of this important class of crop plants. The Chickpea Genomic Web Resource (CGWR) is an implementation of a suite of web-based applications dedicated to chickpea genome visualization and comparative analysis, based on next generation sequencing and assembly of Cicer arietinum desi-type genotype ICC4958. CGWR has been designed and configured for mapping, scanning and browsing the significant chickpea genomic features in view of the important existing and potential roles played by the various legume genome projects in mutant mapping and cloning. It also enables comparative informatics of ICC4958 DNA sequence analysis with other wild and cultivated genotypes of chickpea, various other leguminous species as well as several non-leguminous model plants, to enable investigations into evolutionary processes that shape legume genomes. CGWR is an online database offering a comprehensive visual and functional genomic analysis of the chickpea genome, along with customized maps and gene-clustering options. It is also the only plant based web resource supporting display and analysis of nucleosome positioning patterns in the genome. The usefulness of CGWR has been demonstrated with discoveries of biological significance made using this server. The CGWR is compatible with all available operating systems and browsers, and is available freely under the open source license at http://www.nipgr.res.in/CGWR/home.php.

  12. The Human Ageing Genomic Resources: online databases and tools for biogerontologists

    PubMed Central

    de Magalhães, João Pedro; Budovsky, Arie; Lehmann, Gilad; Costa, Joana; Li, Yang; Fraifeld, Vadim; Church, George M.

    2009-01-01

    Summary Ageing is a complex, challenging phenomenon that will require multiple, interdisciplinary approaches to unravel its puzzles. To assist basic research on ageing, we developed the Human Ageing Genomic Resources (HAGR). This work provides an overview of the databases and tools in HAGR and describes how the gerontology research community can employ them. Several recent changes and improvements to HAGR are also presented. The two centrepieces in HAGR are GenAge and AnAge. GenAge is a gene database featuring genes associated with ageing and longevity in model organisms, a curated database of genes potentially associated with human ageing, and a list of genes tested for their association with human longevity. A myriad of biological data and information is included for hundreds of genes, making GenAge a reference for research that reflects our current understanding of the genetic basis of ageing. GenAge can also serve as a platform for the systems biology of ageing, and tools for the visualization of protein-protein interactions are also included. AnAge is a database of ageing in animals, featuring over 4,000 species, primarily assembled as a resource for comparative and evolutionary studies of ageing. Longevity records, developmental and reproductive traits, taxonomic information, basic metabolic characteristics, and key observations related to ageing are included in AnAge. Software is also available to aid researchers in the form of Perl modules to automate numerous tasks and as an SPSS script to analyse demographic mortality data. The Human Ageing Genomic Resources are available online at http://genomics.senescence.info. PMID:18986374

  13. GeneSeer: A sage for gene names and genomic resources

    PubMed Central

    Olson, Andrew J; Tully, Tim; Sachidanandam, Ravi

    2005-01-01

    Background Independent identification of genes in different organisms and assays has led to a multitude of names for each gene. This balkanization makes it difficult to use gene names to locate genomic resources, homologs in other species and relevant publications. Methods We solve the naming problem by collecting data from a variety of sources and building a name-translation database. We have also built a table of homologs across several model organisms: H. sapiens, M. musculus, R. norvegicus, D. melanogaster, C. elegans, S. cerevisiae, S. pombe and A. thaliana. This allows GeneSeer to draw phylogenetic trees and identify the closest homologs. This, in turn, allows the use of names from one species to identify homologous genes in another species. A website is connected to the database to allow user-friendly access to our tools and external genomic resources using familiar gene names. Conclusion GeneSeer allows access to gene information through common names and can map sequences to names. GeneSeer also allows identification of homologs and paralogs for a given gene. A variety of genomic data such as sequences, SNPs, splice variants, expression patterns and others can be accessed through the GeneSeer interface. It is freely available over the web and can be incorporated in other tools through an http-based software interface described on the website. It is currently used as the search engine in the RNAi codex resource, which is a portal for short hairpin RNA (shRNA) gene-silencing constructs. PMID:16176584

  14. An Ancient Transkingdom Horizontal Transfer of Penelope-Like Retroelements from Arthropods to Conifers.

    PubMed

    Lin, Xuan; Faridi, Nurul; Casola, Claudio

    2016-05-02

    Comparative genomics analyses empowered by the wealth of sequenced genomes have revealed numerous instances of horizontal DNA transfers between distantly related species. In eukaryotes, repetitive DNA sequences known as transposable elements (TEs) are especially prone to move across species boundaries. Such horizontal transposon transfers, or HTTs, are relatively common within major eukaryotic kingdoms, including animals, plants, and fungi, while rarely occurring across these kingdoms. Here, we describe the first case of HTT from animals to plants, involving TEs known as Penelope-like elements, or PLEs, a group of retrotransposons closely related to eukaryotic telomerases. Using a combination of in situ hybridization on chromosomes, polymerase chain reaction experiments, and computational analyses we show that the predominant PLE lineage, EN(+)PLEs, is highly diversified in loblolly pine and other conifers, but appears to be absent in other gymnosperms. Phylogenetic analyses of both protein and DNA sequences reveal that conifers EN(+)PLEs, or Dryads, form a monophyletic group clustering within a clade of primarily arthropod elements. Additionally, no EN(+)PLEs were detected in 1,928 genome assemblies from 1,029 nonmetazoan and nonconifer genomes from 14 major eukaryotic lineages. These findings indicate that Dryads emerged following an ancient horizontal transfer of EN(+)PLEs from arthropods to a common ancestor of conifers approximately 340 Ma. This represents one of the oldest known interspecific transmissions of TEs, and the most conspicuous case of DNA transfer between animals and plants.

  15. An Ancient Transkingdom Horizontal Transfer of Penelope-Like Retroelements from Arthropods to Conifers

    PubMed Central

    Lin, Xuan; Faridi, Nurul; Casola, Claudio

    2016-01-01

    Comparative genomics analyses empowered by the wealth of sequenced genomes have revealed numerous instances of horizontal DNA transfers between distantly related species. In eukaryotes, repetitive DNA sequences known as transposable elements (TEs) are especially prone to move across species boundaries. Such horizontal transposon transfers, or HTTs, are relatively common within major eukaryotic kingdoms, including animals, plants, and fungi, while rarely occurring across these kingdoms. Here, we describe the first case of HTT from animals to plants, involving TEs known as Penelope-like elements, or PLEs, a group of retrotransposons closely related to eukaryotic telomerases. Using a combination of in situ hybridization on chromosomes, polymerase chain reaction experiments, and computational analyses we show that the predominant PLE lineage, EN(+)PLEs, is highly diversified in loblolly pine and other conifers, but appears to be absent in other gymnosperms. Phylogenetic analyses of both protein and DNA sequences reveal that conifers EN(+)PLEs, or Dryads, form a monophyletic group clustering within a clade of primarily arthropod elements. Additionally, no EN(+)PLEs were detected in 1,928 genome assemblies from 1,029 nonmetazoan and nonconifer genomes from 14 major eukaryotic lineages. These findings indicate that Dryads emerged following an ancient horizontal transfer of EN(+)PLEs from arthropods to a common ancestor of conifers approximately 340 Ma. This represents one of the oldest known interspecific transmissions of TEs, and the most conspicuous case of DNA transfer between animals and plants. PMID:27190138

  16. An overview of comparative modelling and resources dedicated to large-scale modelling of genome sequences.

    PubMed

    Lam, Su Datt; Das, Sayoni; Sillitoe, Ian; Orengo, Christine

    2017-08-01

    Computational modelling of proteins has been a major catalyst in structural biology. Bioinformatics groups have exploited the repositories of known structures to predict high-quality structural models with high efficiency at low cost. This article provides an overview of comparative modelling, reviews recent developments and describes resources dedicated to large-scale comparative modelling of genome sequences. The value of subclustering protein domain superfamilies to guide the template-selection process is investigated. Some recent cases in which structural modelling has aided experimental work to determine very large macromolecular complexes are also cited.

  17. An overview of comparative modelling and resources dedicated to large-scale modelling of genome sequences

    PubMed Central

    Lam, Su Datt; Das, Sayoni; Sillitoe, Ian; Orengo, Christine

    2017-01-01

    Computational modelling of proteins has been a major catalyst in structural biology. Bioinformatics groups have exploited the repositories of known structures to predict high-quality structural models with high efficiency at low cost. This article provides an overview of comparative modelling, reviews recent developments and describes resources dedicated to large-scale comparative modelling of genome sequences. The value of subclustering protein domain superfamilies to guide the template-selection process is investigated. Some recent cases in which structural modelling has aided experimental work to determine very large macromolecular complexes are also cited. PMID:28777078

  18. Genetic evaluation of the breeding population of a valuable reforestation conifer Platycladus orientalis (Cupressaceae)

    PubMed Central

    Jin, Yuqing; Ma, Yongpeng; Wang, Shun; Hu, Xian-Ge; Huang, Li-Sha; Li, Yue; Wang, Xiao-Ru; Mao, Jian-Feng

    2016-01-01

    Platycladus orientalis, a widespread conifer with long lifespan and significant adaptability. It is much used in reforestation in north China and commonly planted in central Asia. With the increasing demand for plantation forest in central to north China, breeding programs are progressively established for this species. Efficient use of breeding resources requires good understanding of the genetic value of the founder breeding materials. This study investigated the distribution of genetic variation in 192 elite trees collected for the breeding program for the central range of the species. We developed first set of 27 polymorphic EST-derived SSR loci for the species from transcriptome/genome data. After examination of amplification quality, 10 loci were used to evaluate the genetic variation in the breeding population. We found moderate genetic diversity (average He = 0.348) and low population differentiation (Fst = 0.011). Extensive admixture and no significant geographic population structure characterized this set of collections. Our analyses of the diversity and population structure are important steps toward a long-term sustainable deployment of the species and provide valuable genetic information for conservation and breeding applications. PMID:27721449

  19. Genetic evaluation of the breeding population of a valuable reforestation conifer Platycladus orientalis (Cupressaceae).

    PubMed

    Jin, Yuqing; Ma, Yongpeng; Wang, Shun; Hu, Xian-Ge; Huang, Li-Sha; Li, Yue; Wang, Xiao-Ru; Mao, Jian-Feng

    2016-10-10

    Platycladus orientalis, a widespread conifer with long lifespan and significant adaptability. It is much used in reforestation in north China and commonly planted in central Asia. With the increasing demand for plantation forest in central to north China, breeding programs are progressively established for this species. Efficient use of breeding resources requires good understanding of the genetic value of the founder breeding materials. This study investigated the distribution of genetic variation in 192 elite trees collected for the breeding program for the central range of the species. We developed first set of 27 polymorphic EST-derived SSR loci for the species from transcriptome/genome data. After examination of amplification quality, 10 loci were used to evaluate the genetic variation in the breeding population. We found moderate genetic diversity (average He = 0.348) and low population differentiation (Fst = 0.011). Extensive admixture and no significant geographic population structure characterized this set of collections. Our analyses of the diversity and population structure are important steps toward a long-term sustainable deployment of the species and provide valuable genetic information for conservation and breeding applications.

  20. Genetic evaluation of the breeding population of a valuable reforestation conifer Platycladus orientalis (Cupressaceae)

    NASA Astrophysics Data System (ADS)

    Jin, Yuqing; Ma, Yongpeng; Wang, Shun; Hu, Xian-Ge; Huang, Li-Sha; Li, Yue; Wang, Xiao-Ru; Mao, Jian-Feng

    2016-10-01

    Platycladus orientalis, a widespread conifer with long lifespan and significant adaptability. It is much used in reforestation in north China and commonly planted in central Asia. With the increasing demand for plantation forest in central to north China, breeding programs are progressively established for this species. Efficient use of breeding resources requires good understanding of the genetic value of the founder breeding materials. This study investigated the distribution of genetic variation in 192 elite trees collected for the breeding program for the central range of the species. We developed first set of 27 polymorphic EST-derived SSR loci for the species from transcriptome/genome data. After examination of amplification quality, 10 loci were used to evaluate the genetic variation in the breeding population. We found moderate genetic diversity (average He = 0.348) and low population differentiation (Fst = 0.011). Extensive admixture and no significant geographic population structure characterized this set of collections. Our analyses of the diversity and population structure are important steps toward a long-term sustainable deployment of the species and provide valuable genetic information for conservation and breeding applications.

  1. An Improved Protocol for Intact Chloroplasts and cpDNA Isolation in Conifers

    PubMed Central

    Vieira, Leila do Nascimento; Faoro, Helisson; Fraga, Hugo Pacheco de Freitas; Rogalski, Marcelo; de Souza, Emanuel Maltempi; de Oliveira Pedrosa, Fábio; Nodari, Rubens Onofre; Guerra, Miguel Pedro

    2014-01-01

    Background Performing chloroplast DNA (cpDNA) isolation is considered a major challenge among different plant groups, especially conifers. Isolating chloroplasts in conifers by such conventional methods as sucrose gradient and high salt has not been successful. So far, plastid genome sequencing protocols for conifer species have been based mainly on long-range PCR, which is known to be time-consuming and difficult to implement. Methodology/Principal Findings We developed a protocol for cpDNA isolation using three different conifer families: Araucaria angustifolia and Araucaria bidwilli (Araucariaceae), Podocarpus lambertii (Podocarpaceae) and Pinus patula (Pinaceae). The present protocol is based on high salt isolation buffer followed by saline Percoll gradient. Combining these two strategies allowed enhanced chloroplast isolation, along with decreased contamination caused by polysaccharides, polyphenols, proteins, and nuclear DNA in cpDNA. Microscopy images confirmed the presence of intact chloroplasts in high abundance. This method was applied to cpDNA isolation and subsequent sequencing by Illumina MiSeq (2×250 bp), using only 50 ng of cpDNA. Reference-guided chloroplast genome mapping showed that high average coverage was achieved for all evaluated species: 24.63 for A. angustifolia, 135.97 for A. bidwilli, 1196.10 for P. lambertii, and 64.68 for P. patula. Conclusion Results show that this improved protocol is suitable for enhanced quality and yield of chloroplasts and cpDNA isolation from conifers, providing a useful tool for studies that require isolated chloroplasts and/or whole cpDNA sequences. PMID:24392157

  2. Conifer-Derived Monoterpenes and Forest Walking

    PubMed Central

    Sumitomo, Kazuhiro; Akutsu, Hiroaki; Fukuyama, Syusei; Minoshima, Akiho; Kukita, Shin; Yamamura, Yuji; Sato, Yoshiaki; Hayasaka, Taiki; Osanai, Shinobu; Funakoshi, Hiroshi; Hasebe, Naoyuki; Nakamura, Masao

    2015-01-01

    Conifer and broadleaf trees emit volatile organic compounds in the summer. The major components of these emissions are volatile monoterpenes. Using solid phase microextraction fiber as the adsorbant, monoterpenes were successfully detected and identified in forest air samples. Gas chromatography/mass chromatogram of monoterpenes in the atmosphere of a conifer forest and that of serum from subjects who were walking in a forest were found to be similar each other. The amounts of α-pinene in the subjects became several folds higher after forest walking. The results indicate that monoterpenes in the atmosphere of conifer forests are transferred to and accumulate in subjects by inhalation while they are exposed to this type of environment. PMID:26819913

  3. Exploration of Genetic and Genomic Resources for Abiotic and Biotic Stress Tolerance in Pearl Millet

    PubMed Central

    Shivhare, Radha; Lata, Charu

    2017-01-01

    genes and assist in crop improvement programs through molecular breeding approaches. This review thus summarizes the exploration of pearl millet genetic and genomic resources for improving abiotic and biotic stress resistance and development of cultivars superior in stress tolerance. PMID:28167949

  4. Exploration of Genetic and Genomic Resources for Abiotic and Biotic Stress Tolerance in Pearl Millet.

    PubMed

    Shivhare, Radha; Lata, Charu

    2016-01-01

    genes and assist in crop improvement programs through molecular breeding approaches. This review thus summarizes the exploration of pearl millet genetic and genomic resources for improving abiotic and biotic stress resistance and development of cultivars superior in stress tolerance.

  5. Genomic Resources for Water Yam (Dioscorea alata L.): Analyses of EST-Sequences, De Novo Sequencing and GBS Libraries

    PubMed Central

    Saski, Christopher A.; Bhattacharjee, Ranjana; Scheffler, Brian E.; Asiedu, Robert

    2015-01-01

    The reducing cost and rapid progress in next-generation sequencing techniques coupled with high performance computational approaches have resulted in large-scale discovery of advanced genomic resources in several model and non-model plant species. Yam (Dioscorea spp.) is a major food and cash crop in many countries but research efforts have been limited to understand the genetics and generate genomic information for the crop. The availability of a large number of genomic resources including genome-wide molecular markers will accelerate the breeding efforts and application of genomic selection in yams. In the present study, several methods including expressed sequence tags (EST)-sequencing, de novo sequencing, and genotyping-by-sequencing (GBS) profiles on two yam (Dioscorea alata L.) genotypes (TDa 95/00328 and TDa 95-310) was performed to generate genomic resources for use in its improvement programs. This includes a comprehensive set of EST-SSRs, genomic SSRs, whole genome SNPs, and reduced representation SNPs. A total of 1,152 EST-SSRs were developed from >40,000 EST-sequences generated from the two genotypes. A set of 388 EST-SSRs were validated as polymorphic showing a polymorphism rate of 34% when tested on two diverse parents targeted for anthracnose disease. In addition, approximately 40X de novo whole genome sequence coverage was generated for each of the two genotypes, and a total of 18,584 and 15,952 genomic SSRs were identified for TDa 95/00328 and TDa 95-310, respectively. A custom made pipeline resulted in the selection of 573 genomic SSRs common across the two genotypes, of which only eight failed, 478 being polymorphic and 62 monomorphic indicating a polymorphic rate of 83.5%. Additionally, 288,505 high quality SNPs were also identified between these two genotypes. Genotyping by sequencing reads on these two genotypes also revealed 36,790 overlapping SNP positions that are distributed throughout the genome. Our efforts in using different approaches

  6. Genomic Resources for Water Yam (Dioscorea alata L.): Analyses of EST-Sequences, De Novo Sequencing and GBS Libraries.

    PubMed

    Saski, Christopher A; Bhattacharjee, Ranjana; Scheffler, Brian E; Asiedu, Robert

    2015-01-01

    The reducing cost and rapid progress in next-generation sequencing techniques coupled with high performance computational approaches have resulted in large-scale discovery of advanced genomic resources in several model and non-model plant species. Yam (Dioscorea spp.) is a major food and cash crop in many countries but research efforts have been limited to understand the genetics and generate genomic information for the crop. The availability of a large number of genomic resources including genome-wide molecular markers will accelerate the breeding efforts and application of genomic selection in yams. In the present study, several methods including expressed sequence tags (EST)-sequencing, de novo sequencing, and genotyping-by-sequencing (GBS) profiles on two yam (Dioscorea alata L.) genotypes (TDa 95/00328 and TDa 95-310) was performed to generate genomic resources for use in its improvement programs. This includes a comprehensive set of EST-SSRs, genomic SSRs, whole genome SNPs, and reduced representation SNPs. A total of 1,152 EST-SSRs were developed from >40,000 EST-sequences generated from the two genotypes. A set of 388 EST-SSRs were validated as polymorphic showing a polymorphism rate of 34% when tested on two diverse parents targeted for anthracnose disease. In addition, approximately 40X de novo whole genome sequence coverage was generated for each of the two genotypes, and a total of 18,584 and 15,952 genomic SSRs were identified for TDa 95/00328 and TDa 95-310, respectively. A custom made pipeline resulted in the selection of 573 genomic SSRs common across the two genotypes, of which only eight failed, 478 being polymorphic and 62 monomorphic indicating a polymorphic rate of 83.5%. Additionally, 288,505 high quality SNPs were also identified between these two genotypes. Genotyping by sequencing reads on these two genotypes also revealed 36,790 overlapping SNP positions that are distributed throughout the genome. Our efforts in using different approaches

  7. Development of Genomic Resources for Pacific Herring through Targeted Transcriptome Pyrosequencing

    PubMed Central

    Roberts, Steven B.; Hauser, Lorenz; Seeb, Lisa W.; Seeb, James E.

    2012-01-01

    Pacific herring (Clupea pallasii) support commercially and culturally important fisheries but have experienced significant additional pressure from a variety of anthropogenic and environmental sources. In order to provide genomic resources to facilitate organismal and population level research, high-throughput pyrosequencing (Roche 454) was carried out on transcriptome libraries from liver and testes samples taken in Prince William Sound, the Bering Sea, and the Gulf of Alaska. Over 40,000 contigs were identified with an average length of 728 bp. We describe an annotated transcriptome as well as a workflow for single nucleotide polymorphism (SNP) discovery and validation. A subset of 96 candidate SNPs chosen from 10,933 potential SNPs, were tested using a combination of Sanger sequencing and high-resolution melt-curve analysis. Five SNPs supported between-ocean-basin differentiation, while one SNP associated with immune function provided high differentiation between Prince William Sound and Kodiak Island within the Gulf of Alaska. These genomic resources provide a basis for environmental physiology studies and opportunities for marker development and subsequent population structure analysis. PMID:22383979

  8. The FunGenES database: a genomics resource for mouse embryonic stem cell differentiation.

    PubMed

    Schulz, Herbert; Kolde, Raivo; Adler, Priit; Aksoy, Irène; Anastassiadis, Konstantinos; Bader, Michael; Billon, Nathalie; Boeuf, Hélène; Bourillot, Pierre-Yves; Buchholz, Frank; Dani, Christian; Doss, Michael Xavier; Forrester, Lesley; Gitton, Murielle; Henrique, Domingos; Hescheler, Jürgen; Himmelbauer, Heinz; Hübner, Norbert; Karantzali, Efthimia; Kretsovali, Androniki; Lubitz, Sandra; Pradier, Laurent; Rai, Meena; Reimand, Jüri; Rolletschek, Alexandra; Sachinidis, Agapios; Savatier, Pierre; Stewart, Francis; Storm, Mike P; Trouillas, Marina; Vilo, Jaak; Welham, Melanie J; Winkler, Johannes; Wobus, Anna M; Hatzopoulos, Antonis K

    2009-09-03

    Embryonic stem (ES) cells have high self-renewal capacity and the potential to differentiate into a large variety of cell types. To investigate gene networks operating in pluripotent ES cells and their derivatives, the "Functional Genomics in Embryonic Stem Cells" consortium (FunGenES) has analyzed the transcriptome of mouse ES cells in eleven diverse settings representing sixty-seven experimental conditions. To better illustrate gene expression profiles in mouse ES cells, we have organized the results in an interactive database with a number of features and tools. Specifically, we have generated clusters of transcripts that behave the same way under the entire spectrum of the sixty-seven experimental conditions; we have assembled genes in groups according to their time of expression during successive days of ES cell differentiation; we have included expression profiles of specific gene classes such as transcription regulatory factors and Expressed Sequence Tags; transcripts have been arranged in "Expression Waves" and juxtaposed to genes with opposite or complementary expression patterns; we have designed search engines to display the expression profile of any transcript during ES cell differentiation; gene expression data have been organized in animated graphs of KEGG signaling and metabolic pathways; and finally, we have incorporated advanced functional annotations for individual genes or gene clusters of interest and links to microarray and genomic resources. The FunGenES database provides a comprehensive resource for studies into the biology of ES cells.

  9. The FunGenES Database: A Genomics Resource for Mouse Embryonic Stem Cell Differentiation

    PubMed Central

    Adler, Priit; Aksoy, Irène; Anastassiadis, Konstantinos; Bader, Michael; Billon, Nathalie; Boeuf, Hélène; Bourillot, Pierre-Yves; Buchholz, Frank; Dani, Christian; Doss, Michael Xavier; Forrester, Lesley; Gitton, Murielle; Henrique, Domingos; Hescheler, Jürgen; Himmelbauer, Heinz; Hübner, Norbert; Karantzali, Efthimia; Kretsovali, Androniki; Lubitz, Sandra; Pradier, Laurent; Rai, Meena; Reimand, Jüri; Rolletschek, Alexandra; Sachinidis, Agapios; Savatier, Pierre; Stewart, Francis; Storm, Mike P.; Trouillas, Marina; Vilo, Jaak; Welham, Melanie J.; Winkler, Johannes; Wobus, Anna M.; Hatzopoulos, Antonis K.

    2009-01-01

    Embryonic stem (ES) cells have high self-renewal capacity and the potential to differentiate into a large variety of cell types. To investigate gene networks operating in pluripotent ES cells and their derivatives, the “Functional Genomics in Embryonic Stem Cells” consortium (FunGenES) has analyzed the transcriptome of mouse ES cells in eleven diverse settings representing sixty-seven experimental conditions. To better illustrate gene expression profiles in mouse ES cells, we have organized the results in an interactive database with a number of features and tools. Specifically, we have generated clusters of transcripts that behave the same way under the entire spectrum of the sixty-seven experimental conditions; we have assembled genes in groups according to their time of expression during successive days of ES cell differentiation; we have included expression profiles of specific gene classes such as transcription regulatory factors and Expressed Sequence Tags; transcripts have been arranged in “Expression Waves” and juxtaposed to genes with opposite or complementary expression patterns; we have designed search engines to display the expression profile of any transcript during ES cell differentiation; gene expression data have been organized in animated graphs of KEGG signaling and metabolic pathways; and finally, we have incorporated advanced functional annotations for individual genes or gene clusters of interest and links to microarray and genomic resources. The FunGenES database provides a comprehensive resource for studies into the biology of ES cells. PMID:19727443

  10. Vitrification as a method for genome resource banking oocytes from the endangered Tasmanian devil (Sarcophilus harrisii).

    PubMed

    Czarny, N A; Rodger, J C

    2010-06-01

    Populations of Australia's largest terrestrial marsupial carnivore, the Tasmanian devil (Sarcophilus harrisii), are rapidly declining in the wild due to Tasmanian Devil Facial Tumour Disease (TDFTD). One tool which can reduce the loss of genetic diversity is genome resource banking. This study examines the application of an oocyte vitrification protocol, initially developed in a model marsupial carnivore, to the endangered Tasmanian devil. Ovarian tissue was transported to the laboratory on ice from Tasmania which took up to 48 h. Individual granulosa oocyte complexes (GOC) were isolated enzymatically and the viability of oocytes from primary GOC was assessed immediately following isolation or after exposure to cold shock, vitrification and thawing media without exposure to liquid nitrogen or the full vitrification and thawing process. There was no decline in oocyte viability following cold shock or exposure to the vitrification and thawing media. Following the full vitrification and thawing process there was a decline in oocyte viability (chi(2)=20.0, P<0.001) but approximately 70% of oocytes remained viable. This study provides further evidence that oocyte vitrification is a promising strategy for genome resource banking in carnivorous marsupials and suggests that it should be considered in conservation plans for the survival of the iconic Tasmanian devil. Copyright 2010 Elsevier Inc. All rights reserved.

  11. Enriching Genomic Resources and Marker Development from Transcript Sequences of Jatropha curcas for Microgravity Studies

    PubMed Central

    Tian, Wenlan; Paudel, Dev

    2017-01-01

    Jatropha (Jatropha curcas L.) is an economically important species with a great potential for biodiesel production. To enrich the jatropha genomic databases and resources for microgravity studies, we sequenced and annotated the transcriptome of jatropha and developed SSR and SNP markers from the transcriptome sequences. In total 1,714,433 raw reads with an average length of 441.2 nucleotides were generated. De novo assembling and clustering resulted in 115,611 uniquely assembled sequences (UASs) including 21,418 full-length cDNAs and 23,264 new jatropha transcript sequences. The whole set of UASs were fully annotated, out of which 59,903 (51.81%) were assigned with gene ontology (GO) term, 12,584 (10.88%) had orthologs in Eukaryotic Orthologous Groups (KOG), and 8,822 (7.63%) were mapped to 317 pathways in six different categories in Kyoto Encyclopedia of Genes and Genome (KEGG) database, and it contained 3,588 putative transcription factors. From the UASs, 9,798 SSRs were discovered with AG/CT as the most frequent (45.8%) SSR motif type. Further 38,693 SNPs were detected and 7,584 remained after filtering. This UAS set has enriched the current jatropha genomic databases and provided a large number of genetic markers, which can facilitate jatropha genetic improvement and many other genetic and biological studies. PMID:28154822

  12. Limited resources of genome sequencing in developing countries: Challenges and solutions.

    PubMed

    Helmy, Mohamed; Awad, Mohamed; Mosa, Kareem A

    2016-06-01

    The differences between countries in national income, growth, human development and many other factors are used to classify countries into developed and developing countries. There are several classification systems that use different sets of measures and criteria. The most common classifications are the United Nations (UN) and the World Bank (WB) systems. The UN classification system uses the UN Human Development Index (HDI), an indicator that uses statistic of life expectancy, education, and income per capita for countries' classification. While the WB system uses gross national income (GNI) per capita that is calculated using the World Bank Atlas method. According to the UN and WB classification systems, there are 151 and 134 developing countries, respectively, with 89% overlap between the two systems. Developing countries have limited human development, and limited expenditure in education and research, among several other limitations. The biggest challenge facing genomic researchers and clinicians is limited resources. As a result, genomic tools, specifically genome sequencing technologies, which are rapidly becoming indispensable, are not widely available. In this report, we explore the current status of sequencing technologies in developing countries, describe the associated challenges and emphasize potential solutions.

  13. Enzymes of Glyoxylate in Conifers 12

    PubMed Central

    Firenzuoli, A. M.; Vanni, P.; Mastronuzzi, E.; Zanobini, A.; Baccari, V.

    1968-01-01

    The high level of lipids in seeds of some species of conifers suggested that the glyoxylate cycle might have a role in conifer seed metabolism. Six species (Pinus pinea, Pinus pinaster, Pinus canariensis, Pinus strobus, Abies alba, and Cupressus sempervirens) were investigated for their lipid content and malate synthase and isocitrate lyase level. The fatty acid composition of the triglyceride fraction was also investigated. The correlation between lipid content of germinating seed with the presence of the cycle was confirmed. The enzymes of the glyoxylate cycle were not detected in Cupressus sempervirens where the lipid content is very low. PMID:16656892

  14. Exploiting Genomics Resources to Identify Candidate Genes Underlying Antioxidants Content in Tomato Fruit

    PubMed Central

    Calafiore, Roberta; Ruggieri, Valentino; Raiola, Assunta; Rigano, Maria M.; Sacco, Adriana; Hassan, Mohamed I.; Frusciante, Luigi; Barone, Amalia

    2016-01-01

    The tomato is a model species for fleshy fruit development and ripening, as well as for genomics studies of others Solanaceae. Many genetic and genomics resources, including databases for sequencing, transcriptomics and metabolomics data, have been developed and are today available. The purpose of the present work was to uncover new genes and/or alleles that determine ascorbic acid and carotenoids accumulation, by exploiting one Solanum pennellii introgression lines (IL7-3) harboring quantitative trait loci (QTL) that increase the content of these metabolites in the fruit. The higher ascorbic acid and carotenoids content in IL7-3 was confirmed at three fruit developmental stages. The tomato genome reference sequence and the recently released S. pennellii genome sequence were investigated to identify candidate genes (CGs) that might control ascorbic acid and carotenoids accumulation. First of all, a refinement of the wild region borders in the IL7-3 was achieved by analyzing CAPS markers designed in our laboratory. Afterward, six CGs associated to ascorbic acid and one with carotenoids metabolism were identified exploring the annotation and the Gene Ontology terms of genes included in the region. Variants between the sequence of the wild and the cultivated alleles of these genes were investigated for their functional relevance and their potential effects on the protein sequences were predicted. Transcriptional levels of CGs in the introgression region were extracted from RNA-Seq data available for the entire S. pennellii introgression lines collection and verified by Real-Time qPCR. Finally, seven IL7-3 sub-lines were genotyped using 28 species-specific markers and then were evaluated for metabolites content. These analyses evidenced a significant decrease in transcript abundance for one 9-cis-epoxycarotenoid dioxygenase and one L-ascorbate oxidase homolog, whose role in the accumulation of carotenoids and ascorbic acid is discussed. Comprehensively, the reported

  15. Specific and sensitive detection of the conifer pathogen Gremmeniella abietina by nested PCR

    PubMed Central

    Zeng, Qing-Yin; Hansson, Per; Wang, Xiao-Ru

    2005-01-01

    Background Gremmeniella abietina (Lagerb.) Morelet is an ascomycete fungus that causes stem canker and shoot dieback in many conifer species. The fungus is widespread and causes severe damage to forest plantations in Europe, North America and Asia. To facilitate early diagnosis and improve measures to control the spread of the disease, rapid, specific and sensitive detection methods for G. abietina in conifer hosts are needed. Results We designed two pairs of specific primers for G. abietina based on the 18S rDNA sequence variation pattern. These primers were validated against a wide range of fungi and 14 potential conifer hosts. Based on these specific primers, two nested PCR systems were developed. The first system employed universal fungal primers to enrich the fungal DNA targets in the first round, followed by a second round selective amplification of the pathogen. The other system employed G. abietina-specific primers in both PCR steps. Both approaches can detect the presence of G. abietina in composite samples with high sensitivity, as little as 7.5 fg G. abietina DNA in the host genomic background. Conclusion The methods described here are rapid and can be applied directly to a wide range of conifer species, without the need for fungal isolation and cultivation. Therefore, it represents a promising alternative to disease inspection in forest nurseries, plantations and quarantine control facilities. PMID:16280082

  16. AlliumMap-A comparative genomics resource for cultivated Allium vegetables

    PubMed Central

    2012-01-01

    Background Vegetables of the genus Allium are widely consumed but remain poorly understood genetically. Genetic mapping has been conducted in intraspecific crosses of onion (Allium cepa L.), A. fistulosum and interspecific crosses between A. roylei and these two species, but it has not been possible to access genetic maps and underlying data from these studies easily. Description An online comparative genomics database, AlliumMap, has been developed based on the GMOD CMap tool at http://alliumgenetics.org. It has been populated with curated data linking genetic maps with underlying markers and sequence data from multiple studies. It includes data from multiple onion mapping populations as well as the most closely related species A. roylei and A. fistulosum. Further onion EST-derived markers were evaluated in the A. cepa x A. roylei interspecific population, enabling merging of the AFLP-based maps. In addition, data concerning markers assigned in multiple studies to the Allium physical map using A. cepa-A. fistulosum alien monosomic addition lines have been compiled. The compiled data reveal extensive synteny between onion and A. fistulosum. Conclusions The database provides the first online resource providing genetic map and marker data from multiple Allium species and populations. The additional markers placed on the interspecific Allium map confirm the value of A. roylei as a valuable bridge between the genetics of onion and A. fistulosum and as a means to conduct efficient mapping of expressed sequence markers in Allium. The data presented suggest that comparative approaches will be valuable for genetic and genomic studies of onion and A. fistulosum. This online resource will provide a valuable means to integrate genetic and sequence-based explorations of Allium genomes. PMID:22559261

  17. Reptilian Transcriptomes v2.0: An Extensive Resource for Sauropsida Genomics and Transcriptomics

    PubMed Central

    Tzika, Athanasia C.; Ullate-Agote, Asier; Grbic, Djordje; Milinkovitch, Michel C.

    2015-01-01

    Despite the availability of deep-sequencing techniques, genomic and transcriptomic data remain unevenly distributed across phylogenetic groups. For example, reptiles are poorly represented in sequence databases, hindering functional evolutionary and developmental studies in these lineages substantially more diverse than mammals. In addition, different studies use different assembly and annotation protocols, inhibiting meaningful comparisons. Here, we present the “Reptilian Transcriptomes Database 2.0,” which provides extensive annotation of transcriptomes and genomes from species covering the major reptilian lineages. To this end, we sequenced normalized complementary DNA libraries of multiple adult tissues and various embryonic stages of the leopard gecko and the corn snake and gathered published reptilian sequence data sets from representatives of the four extant orders of reptiles: Squamata (snakes and lizards), the tuatara, crocodiles, and turtles. The LANE runner 2.0 software was implemented to annotate all assemblies within a single integrated pipeline. We show that this approach increases the annotation completeness of the assembled transcriptomes/genomes. We then built large concatenated protein alignments of single-copy genes and inferred phylogenetic trees that support the positions of turtles and the tuatara as sister groups of Archosauria and Squamata, respectively. The Reptilian Transcriptomes Database 2.0 resource will be updated to include selected new data sets as they become available, thus making it a reference for differential expression studies, comparative genomics and transcriptomics, linkage mapping, molecular ecology, and phylogenomic analyses involving reptiles. The database is available at www.reptilian-transcriptomes.org and can be enquired using a wwwblast server installed at the University of Geneva. PMID:26133641

  18. Reptilian Transcriptomes v2.0: An Extensive Resource for Sauropsida Genomics and Transcriptomics.

    PubMed

    Tzika, Athanasia C; Ullate-Agote, Asier; Grbic, Djordje; Milinkovitch, Michel C

    2015-07-01

    Despite the availability of deep-sequencing techniques, genomic and transcriptomic data remain unevenly distributed across phylogenetic groups. For example, reptiles are poorly represented in sequence databases, hindering functional evolutionary and developmental studies in these lineages substantially more diverse than mammals. In addition, different studies use different assembly and annotation protocols, inhibiting meaningful comparisons. Here, we present the "Reptilian Transcriptomes Database 2.0," which provides extensive annotation of transcriptomes and genomes from species covering the major reptilian lineages. To this end, we sequenced normalized complementary DNA libraries of multiple adult tissues and various embryonic stages of the leopard gecko and the corn snake and gathered published reptilian sequence data sets from representatives of the four extant orders of reptiles: Squamata (snakes and lizards), the tuatara, crocodiles, and turtles. The LANE runner 2.0 software was implemented to annotate all assemblies within a single integrated pipeline. We show that this approach increases the annotation completeness of the assembled transcriptomes/genomes. We then built large concatenated protein alignments of single-copy genes and inferred phylogenetic trees that support the positions of turtles and the tuatara as sister groups of Archosauria and Squamata, respectively. The Reptilian Transcriptomes Database 2.0 resource will be updated to include selected new data sets as they become available, thus making it a reference for differential expression studies, comparative genomics and transcriptomics, linkage mapping, molecular ecology, and phylogenomic analyses involving reptiles. The database is available at www.reptilian-transcriptomes.org and can be enquired using a wwwblast server installed at the University of Geneva. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  19. Nuclear DNA content affects the productivity of conifer forests by altering hydraulic architecture

    NASA Astrophysics Data System (ADS)

    Alday, Josu; Resco de Dios, Víctor

    2014-05-01

    Predictions of future global climate rely on feedbacks between terrestrial vegetation and the global carbon cycle, but the exact mechanisms underlying this relationship are still being discussed. One of the key knowledge gaps lies on the scaling of cellular processes to the ecosystem level. Here we examine whether an under-explored plant trait, inter-specific variation in the bulk amount of DNA in unreplicated somatic cells (2C DNA content), can explain inter-specific variation in the maximum productivity of conifer forests. We expected 2C DNA content to be negatively related to conifer productivity because: 1) it is positively correlated with cell volume (which, in turn, potentially affects structural features such as leaf mass area, a strong predictor of photosynthetic capacity); 2) it is positively correlated with stomatal size (with larger stomata leading to lower overall stomatal conductance and, by extension, lower CO2 uptake); and 3) larger genome sizes may reduce P availability in RNA (which has been hypothesized to slow growth). We present the results of regression and independent contrasts in different monospecific forests encompassing a 52º latitudinal gradient, each being dominated by 1 of 35 different conifer species. Contrary to expectations, we observed a positive correlation between genome size and maximum Gross Primary Productivity (R2 = 0.47) and also between genome size maximum tree height (R2 = 0.27). This correlation was apparently driven by the effects of genome size on stem hydraulics, since 2C DNA was positively correlated with wood density (R2 = 0.40) and also with resistance to cavitation (P50, R2 = 0.28). That is, increased genome sizes have a positive effect on the productivity of conifer forests by affecting the vascular tissues to increase their capacity for water transport. Our results shed a new light on the evolution of the vascular system of conifer forests and how they affect ecosystem productivity, and indicate the potential to

  20. MicroScope in 2017: an expanding and evolving integrated resource for community expertise of microbial genomes

    PubMed Central

    Vallenet, David; Calteau, Alexandra; Cruveiller, Stéphane; Gachet, Mathieu; Lajus, Aurélie; Josso, Adrien; Mercier, Jonathan; Renaux, Alexandre; Rollin, Johan; Rouy, Zoe; Roche, David; Scarpelli, Claude; Médigue, Claudine

    2017-01-01

    The annotation of genomes from NGS platforms needs to be automated and fully integrated. However, maintaining consistency and accuracy in genome annotation is a challenging problem because millions of protein database entries are not assigned reliable functions. This shortcoming limits the knowledge that can be extracted from genomes and metabolic models. Launched in 2005, the MicroScope platform (http://www.genoscope.cns.fr/agc/microscope) is an integrative resource that supports systematic and efficient revision of microbial genome annotation, data management and comparative analysis. Effective comparative analysis requires a consistent and complete view of biological data, and therefore, support for reviewing the quality of functional annotation is critical. MicroScope allows users to analyze microbial (meta)genomes together with post-genomic experiment results if any (i.e. transcriptomics, re-sequencing of evolved strains, mutant collections, phenotype data). It combines tools and graphical interfaces to analyze genomes and to perform the expert curation of gene functions in a comparative context. Starting with a short overview of the MicroScope system, this paper focuses on some major improvements of the Web interface, mainly for the submission of genomic data and on original tools and pipelines that have been developed and integrated in the platform: computation of pan-genomes and prediction of biosynthetic gene clusters. Today the resource contains data for more than 6000 microbial genomes, and among the 2700 personal accounts (65% of which are now from foreign countries), 14% of the users are performing expert annotations, on at least a weekly basis, contributing to improve the quality of microbial genome annotations. PMID:27899624

  1. MicroScope in 2017: an expanding and evolving integrated resource for community expertise of microbial genomes.

    PubMed

    Vallenet, David; Calteau, Alexandra; Cruveiller, Stéphane; Gachet, Mathieu; Lajus, Aurélie; Josso, Adrien; Mercier, Jonathan; Renaux, Alexandre; Rollin, Johan; Rouy, Zoe; Roche, David; Scarpelli, Claude; Médigue, Claudine

    2017-01-04

    The annotation of genomes from NGS platforms needs to be automated and fully integrated. However, maintaining consistency and accuracy in genome annotation is a challenging problem because millions of protein database entries are not assigned reliable functions. This shortcoming limits the knowledge that can be extracted from genomes and metabolic models. Launched in 2005, the MicroScope platform (http://www.genoscope.cns.fr/agc/microscope) is an integrative resource that supports systematic and efficient revision of microbial genome annotation, data management and comparative analysis. Effective comparative analysis requires a consistent and complete view of biological data, and therefore, support for reviewing the quality of functional annotation is critical. MicroScope allows users to analyze microbial (meta)genomes together with post-genomic experiment results if any (i.e. transcriptomics, re-sequencing of evolved strains, mutant collections, phenotype data). It combines tools and graphical interfaces to analyze genomes and to perform the expert curation of gene functions in a comparative context. Starting with a short overview of the MicroScope system, this paper focuses on some major improvements of the Web interface, mainly for the submission of genomic data and on original tools and pipelines that have been developed and integrated in the platform: computation of pan-genomes and prediction of biosynthetic gene clusters. Today the resource contains data for more than 6000 microbial genomes, and among the 2700 personal accounts (65% of which are now from foreign countries), 14% of the users are performing expert annotations, on at least a weekly basis, contributing to improve the quality of microbial genome annotations.

  2. Improved methods and resources for paramecium genomics: transcription units, gene annotation and gene expression.

    PubMed

    Arnaiz, Olivier; Van Dijk, Erwin; Bétermier, Mireille; Lhuillier-Akakpo, Maoussi; de Vanssay, Augustin; Duharcourt, Sandra; Sallet, Erika; Gouzy, Jérôme; Sperling, Linda

    2017-06-26

    The 15 sibling species of the Paramecium aurelia cryptic species complex emerged after a whole genome duplication that occurred tens of millions of years ago. Given extensive knowledge of the genetics and epigenetics of Paramecium acquired over the last century, this species complex offers a uniquely powerful system to investigate the consequences of whole genome duplication in a unicellular eukaryote as well as the genetic and epigenetic mechanisms that drive speciation. High quality Paramecium gene models are important for research using this system. The major aim of the work reported here was to build an improved gene annotation pipeline for the Paramecium lineage. We generated oriented RNA-Seq transcriptome data across the sexual process of autogamy for the model species Paramecium tetraurelia. We determined, for the first time in a ciliate, candidate P. tetraurelia transcription start sites using an adapted Cap-Seq protocol. We developed TrUC, multi-threaded Perl software that in conjunction with TopHat mapping of RNA-Seq data to a reference genome, predicts transcription units for the annotation pipeline. We used EuGene software to combine annotation evidence. The high quality gene structural annotations obtained for P. tetraurelia were used as evidence to improve published annotations for 3 other Paramecium species. The RNA-Seq data were also used for differential gene expression analysis, providing a gene expression atlas that is more sensitive than the previously established microarray resource. We have developed a gene annotation pipeline tailored for the compact genomes and tiny introns of Paramecium species. A novel component of this pipeline, TrUC, predicts transcription units using Cap-Seq and oriented RNA-Seq data. TrUC could prove useful beyond Paramecium, especially in the case of high gene density. Accurate predictions of 3' and 5' UTR will be particularly valuable for studies of gene expression (e.g. nucleosome positioning, identification of cis

  3. Montane conifer fuel dynamics, Yosemite National Park

    USGS Publications Warehouse

    van Wagtendonk, J.W.; Moore, P.E.

    1997-01-01

    Litter and woody fuel accumulation rates over 7 years for 7 montane Sierra Nevada conifer species, including giant sequoia, ponderosa pine, sugar pine, Jeffrey pine, incense-cedar and white fir. Data are from four sites per size class per species with four size classes each. Nonspatial, georeferenced.

  4. Silvics of North America: Volume 1. Conifers

    Treesearch

    Russell M Burns; Barbara H. Honkala; [Technical coordinators

    1990-01-01

    The silvical characteristics of about 200 forest tree species and varieties are described. Most are native to the 50 United States and Puerto Rico, but a few are introduced and naturalized. Information on habitat, life history, and genetics is given for 15 genera, 63 species, and 20 varieties of conifers and for 58 genera, 128 species, and 6 varieties of hardwoods....

  5. Black Stain Root Disease of Conifers (FIDL)

    Treesearch

    Paul F. Hessburg; Donald J. Goheen; Robert V. Bega

    1995-01-01

    The black stain fungus?Leptographium wageneri (Kendrick) Wingfield*?infects and kills several species of western conifers. The fungus colonizes water-conducting tissues of the host's roots, root collars, and lower stems, ultimately blocking the movement of water to foliage. Severely infected trees exhibit wilting symptoms characteristic of vascular wilt diseases...

  6. Indicators of cull in western Oregon conifers.

    Treesearch

    Paul E. Aho

    1982-01-01

    Descriptions and color photographs of important fungal sporophores (conks), other indicators of cull (wounds), and associated decays in western Oregon conifers are provided to aid timber markers, cruisers, and scalers in identifying them. Cull factors are given for the indicators by tree species.

  7. Genetics and the physiological ecology of conifers

    SciTech Connect

    Mitton, J.B.

    1995-07-01

    Natural selection acts on the diversity of genotypes, adapting populations to their specific environments and driving evolution in response to changes in climate. Genetically based differences in physiology and demography adapt species to alternate environments and produce, along with historical accidents, the present distribution of species. The sorting of conifer species by elevation is so marked that conifers help to define plant communities arranged in elevational bands in the Rocky Mountains. For these reasons, a genetic perspective is necessary to appreciate the evolution of ecophysiological patterns in the coniferous forests of the Rocky Mountains. The fascinating natural history and the economic importance of western conifers have stimulated numerous studies of their ecology, ecological genetics, and geographic variation. These studies yield some generalizations, and present some puzzling contradictions. This chapter focuses on the genetic variability associated with the physiological differences among genotypes in Rocky Mountain conifers. Variation among genotypes in survival, growth, and resistance to herbivores is used to illustrate genetically based differences in physiology, and to suggest the mechanistic studies needed to understand the relationships between genetic and physiological variation.

  8. Rapid analysis of abietanes in conifers

    Treesearch

    P.J. Kersten; B.J. Kopper; K.F. Raffa; B.L. Illman

    2006-01-01

    Diterpene resin acids are major constituents of conifer oleoresin and play important roles in tree defense against insects and microbial pathogens. The tricyclic C-20 carboxylic acids are generally classified into two groups, the abietanes and the pimaranes. The abietanes have conjugated double bonds and exhibit characteristic UV spectra. Here, we report the analysis...

  9. Diseases of Pacific Coast conifers. Agriculture handbook

    SciTech Connect

    Scharpf, R.F.

    1993-06-01

    The handbook provides basic information needed to identify the common diseases of Pacific Coast conifers. Hosts, distribution, disease cycles, and identifying characteristics are described for more than 150 diseases, including cankers, diebacks, galls, rusts, needle diseases, root diseases, mistletoes, and rots. Diseases in which abiotic factors are involved are also described. For some groups of diseases, a descriptive key to field identification is included.

  10. MicroScope--an integrated microbial resource for the curation and comparative analysis of genomic and metabolic data.

    PubMed

    Vallenet, David; Belda, Eugeni; Calteau, Alexandra; Cruveiller, Stéphane; Engelen, Stefan; Lajus, Aurélie; Le Fèvre, François; Longin, Cyrille; Mornico, Damien; Roche, David; Rouy, Zoé; Salvignol, Gregory; Scarpelli, Claude; Thil Smith, Adam Alexander; Weiman, Marion; Médigue, Claudine

    2013-01-01

    MicroScope is an integrated platform dedicated to both the methodical updating of microbial genome annotation and to comparative analysis. The resource provides data from completed and ongoing genome projects (automatic and expert annotations), together with data sources from post-genomic experiments (i.e. transcriptomics, mutant collections) allowing users to perfect and improve the understanding of gene functions. MicroScope (http://www.genoscope.cns.fr/agc/microscope) combines tools and graphical interfaces to analyse genomes and to perform the manual curation of gene annotations in a comparative context. Since its first publication in January 2006, the system (previously named MaGe for Magnifying Genomes) has been continuously extended both in terms of data content and analysis tools. The last update of MicroScope was published in 2009 in the Database journal. Today, the resource contains data for >1600 microbial genomes, of which ∼300 are manually curated and maintained by biologists (1200 personal accounts today). Expert annotations are continuously gathered in the MicroScope database (∼50 000 a year), contributing to the improvement of the quality of microbial genomes annotations. Improved data browsing and searching tools have been added, original tools useful in the context of expert annotation have been developed and integrated and the website has been significantly redesigned to be more user-friendly. Furthermore, in the context of the European project Microme (Framework Program 7 Collaborative Project), MicroScope is becoming a resource providing for the curation and analysis of both genomic and metabolic data. An increasing number of projects are related to the study of environmental bacterial (meta)genomes that are able to metabolize a large variety of chemical compounds that may be of high industrial interest.

  11. MicroScope—an integrated microbial resource for the curation and comparative analysis of genomic and metabolic data

    PubMed Central

    Vallenet, David; Belda, Eugeni; Calteau, Alexandra; Cruveiller, Stéphane; Engelen, Stefan; Lajus, Aurélie; Le Fèvre, François; Longin, Cyrille; Mornico, Damien; Roche, David; Rouy, Zoé; Salvignol, Gregory; Scarpelli, Claude; Thil Smith, Adam Alexander; Weiman, Marion; Médigue, Claudine

    2013-01-01

    MicroScope is an integrated platform dedicated to both the methodical updating of microbial genome annotation and to comparative analysis. The resource provides data from completed and ongoing genome projects (automatic and expert annotations), together with data sources from post-genomic experiments (i.e. transcriptomics, mutant collections) allowing users to perfect and improve the understanding of gene functions. MicroScope (http://www.genoscope.cns.fr/agc/microscope) combines tools and graphical interfaces to analyse genomes and to perform the manual curation of gene annotations in a comparative context. Since its first publication in January 2006, the system (previously named MaGe for Magnifying Genomes) has been continuously extended both in terms of data content and analysis tools. The last update of MicroScope was published in 2009 in the Database journal. Today, the resource contains data for >1600 microbial genomes, of which ∼300 are manually curated and maintained by biologists (1200 personal accounts today). Expert annotations are continuously gathered in the MicroScope database (∼50 000 a year), contributing to the improvement of the quality of microbial genomes annotations. Improved data browsing and searching tools have been added, original tools useful in the context of expert annotation have been developed and integrated and the website has been significantly redesigned to be more user-friendly. Furthermore, in the context of the European project Microme (Framework Program 7 Collaborative Project), MicroScope is becoming a resource providing for the curation and analysis of both genomic and metabolic data. An increasing number of projects are related to the study of environmental bacterial (meta)genomes that are able to metabolize a large variety of chemical compounds that may be of high industrial interest. PMID:23193269

  12. Open access resources for genome-wide association mapping in rice

    PubMed Central

    McCouch, Susan R.; Wright, Mark H.; Tung, Chih-Wei; Maron, Lyza G.; McNally, Kenneth L.; Fitzgerald, Melissa; Singh, Namrata; DeClerck, Genevieve; Agosto-Perez, Francisco; Korniliev, Pavel; Greenberg, Anthony J.; Naredo, Ma. Elizabeth B.; Mercado, Sheila Mae Q.; Harrington, Sandra E.; Shi, Yuxin; Branchini, Darcy A.; Kuser-Falcão, Paula R.; Leung, Hei; Ebana, Kowaru; Yano, Masahiro; Eizenga, Georgia; McClung, Anna; Mezey, Jason

    2016-01-01

    Increasing food production is essential to meet the demands of a growing human population, with its rising income levels and nutritional expectations. To address the demand, plant breeders seek new sources of genetic variation to enhance the productivity, sustainability and resilience of crop varieties. Here we launch a high-resolution, open-access research platform to facilitate genome-wide association mapping in rice, a staple food crop. The platform provides an immortal collection of diverse germplasm, a high-density single-nucleotide polymorphism data set tailored for gene discovery, well-documented analytical strategies, and a suite of bioinformatics resources to facilitate biological interpretation. Using grain length, we demonstrate the power and resolution of our new high-density rice array, the accompanying genotypic data set, and an expanded diversity panel for detecting major and minor effect QTLs and subpopulation-specific alleles, with immediate implications for rice improvement. PMID:26842267

  13. Genomic resources for Myzus persicae: EST sequencing, SNP identification, and microarray design

    PubMed Central

    Ramsey, John S; Wilson, Alex CC; de Vos, Martin; Sun, Qi; Tamborindeguy, Cecilia; Winfield, Agnese; Malloch, Gaynor; Smith, Dawn M; Fenton, Brian; Gray, Stewart M; Jander, Georg

    2007-01-01

    Background The green peach aphid, Myzus persicae (Sulzer), is a world-wide insect pest capable of infesting more than 40 plant families, including many crop species. However, despite the significant damage inflicted by M. persicae in agricultural systems through direct feeding damage and by its ability to transmit plant viruses, limited genomic information is available for this species. Results Sequencing of 16 M. persicae cDNA libraries generated 26,669 expressed sequence tags (ESTs). Aphids for library construction were raised on Arabidopsis thaliana, Nicotiana benthamiana, Brassica oleracea, B. napus, and Physalis floridana (with and without Potato leafroll virus infection). The M. persicae cDNA libraries include ones made from sexual and asexual whole aphids, guts, heads, and salivary glands. In silico comparison of cDNA libraries identified aphid genes with tissue-specific expression patterns, and gene expression that is induced by feeding on Nicotiana benthamiana. Furthermore, 2423 genes that are novel to science and potentially aphid-specific were identified. Comparison of cDNA data from three aphid lineages identified single nucleotide polymorphisms that can be used as genetic markers and, in some cases, may represent functional differences in the protein products. In particular, non-conservative amino acid substitutions in a highly expressed gut protease may be of adaptive significance for M. persicae feeding on different host plants. The Agilent eArray platform was used to design an M. persicae oligonucleotide microarray representing over 10,000 unique genes. Conclusion New genomic resources have been developed for M. persicae, an agriculturally important insect pest. These include previously unknown sequence data, a collection of expressed genes, molecular markers, and a DNA microarray that can be used to study aphid gene expression. These resources will help elucidate the adaptations that allow M. persicae to develop compatible interactions with its

  14. The Human Ageing Genomic Resources: online databases and tools for biogerontologists.

    PubMed

    de Magalhães, João Pedro; Budovsky, Arie; Lehmann, Gilad; Costa, Joana; Li, Yang; Fraifeld, Vadim; Church, George M

    2009-02-01

    Aging is a complex, challenging phenomenon that requires multiple, interdisciplinary approaches to unravel its puzzles. To assist basic research on aging, we developed the Human Ageing Genomic Resources (HAGR). This work provides an overview of the databases and tools in HAGR and describes how the gerontology research community can employ them. Several recent changes and improvements to HAGR are also presented. The two centrepieces in HAGR are GenAge and AnAge. GenAge is a gene database featuring genes associated with aging and longevity in model organisms, a curated database of genes potentially associated with human aging, and a list of genes tested for their association with human longevity. A myriad of biological data and information is included for hundreds of genes, making GenAge a reference for research that reflects our current understanding of the genetic basis of aging. GenAge can also serve as a platform for the systems biology of aging, and tools for the visualization of protein-protein interactions are also included. AnAge is a database of aging in animals, featuring over 4000 species, primarily assembled as a resource for comparative and evolutionary studies of aging. Longevity records, developmental and reproductive traits, taxonomic information, basic metabolic characteristics, and key observations related to aging are included in AnAge. Software is also available to aid researchers in the form of Perl modules to automate numerous tasks and as an SPSS script to analyse demographic mortality data. The HAGR are available online at http://genomics.senescence.info.

  15. A genome-wide resource for the analysis of protein localisation in Drosophila

    PubMed Central

    Sarov, Mihail; Barz, Christiane; Jambor, Helena; Hein, Marco Y; Schmied, Christopher; Suchold, Dana; Stender, Bettina; Janosch, Stephan; KJ, Vinay Vikas; Krishnan, RT; Krishnamoorthy, Aishwarya; Ferreira, Irene RS; Ejsmont, Radoslaw K; Finkl, Katja; Hasse, Susanne; Kämpfer, Philipp; Plewka, Nicole; Vinis, Elisabeth; Schloissnig, Siegfried; Knust, Elisabeth; Hartenstein, Volker; Mann, Matthias; Ramaswami, Mani; VijayRaghavan, K; Tomancak, Pavel; Schnorrer, Frank

    2016-01-01

    The Drosophila genome contains >13000 protein-coding genes, the majority of which remain poorly investigated. Important reasons include the lack of antibodies or reporter constructs to visualise these proteins. Here, we present a genome-wide fosmid library of 10000 GFP-tagged clones, comprising tagged genes and most of their regulatory information. For 880 tagged proteins, we created transgenic lines, and for a total of 207 lines, we assessed protein expression and localisation in ovaries, embryos, pupae or adults by stainings and live imaging approaches. Importantly, we visualised many proteins at endogenous expression levels and found a large fraction of them localising to subcellular compartments. By applying genetic complementation tests, we estimate that about two-thirds of the tagged proteins are functional. Moreover, these tagged proteins enable interaction proteomics from developing pupae and adult flies. Taken together, this resource will boost systematic analysis of protein expression and localisation in various cellular and developmental contexts. DOI: http://dx.doi.org/10.7554/eLife.12068.001 PMID:26896675

  16. Conifer defence against insects: microarray gene expression profiling of Sitka spruce (Picea sitchensis) induced by mechanical wounding or feeding by spruce budworms (Choristoneura occidentalis) or white pine weevils (Pissodes strobi) reveals large-scale changes of the host transcriptome.

    PubMed

    Ralph, Steven G; Yueh, Hesther; Friedmann, Michael; Aeschliman, Dana; Zeznik, Jeffrey A; Nelson, Colleen C; Butterfield, Yaron S N; Kirkpatrick, Robert; Liu, Jerry; Jones, Steven J M; Marra, Marco A; Douglas, Carl J; Ritland, Kermit; Bohlmann, Jörg

    2006-08-01

    Conifers are resistant to attack from a large number of potential herbivores or pathogens. Previous molecular and biochemical characterization of selected conifer defence systems support a model of multigenic, constitutive and induced defences that act on invading insects via physical, chemical, biochemical or ecological (multitrophic) mechanisms. However, the genomic foundation of the complex defence and resistance mechanisms of conifers is largely unknown. As part of a genomics strategy to characterize inducible defences and possible resistance mechanisms of conifers against insect herbivory, we developed a cDNA microarray building upon a new spruce (Picea spp.) expressed sequence tag resource. This first-generation spruce cDNA microarray contains 9720 cDNA elements representing c. 5500 unique genes. We used this array to monitor gene expression in Sitka spruce (Picea sitchensis) bark in response to herbivory by white pine weevils (Pissodes strobi, Curculionidae) or wounding, and in young shoot tips in response to western spruce budworm (Choristoneura occidentalis, Lepidopterae) feeding. Weevils are stem-boring insects that feed on phloem, while budworms are foliage feeding larvae that consume needles and young shoot tips. Both insect species and wounding treatment caused substantial changes of the host plant transcriptome detected in each case by differential gene expression of several thousand array elements at 1 or 2 d after the onset of treatment. Overall, there was considerable overlap among differentially expressed gene sets from these three stress treatments. Functional classification of the induced transcripts revealed genes with roles in general plant defence, octadecanoid and ethylene signalling, transport, secondary metabolism, and transcriptional regulation. Several genes involved in primary metabolic processes such as photosynthesis were down-regulated upon insect feeding or wounding, fitting with the concept of dynamic resource allocation in plant

  17. The Biofuel Feedstock Genomics Resource: a web-based portal and database to enable functional genomics of plant biofuel feedstock species.

    PubMed

    Childs, Kevin L; Konganti, Kranti; Buell, C Robin

    2012-01-01

    Major feedstock sources for future biofuel production are likely to be high biomass producing plant species such as poplar, pine, switchgrass, sorghum and maize. One active area of research in these species is genome-enabled improvement of lignocellulosic biofuel feedstock quality and yield. To facilitate genomic-based investigations in these species, we developed the Biofuel Feedstock Genomic Resource (BFGR), a database and web-portal that provides high-quality, uniform and integrated functional annotation of gene and transcript assembly sequences from species of interest to lignocellulosic biofuel feedstock researchers. The BFGR includes sequence data from 54 species and permits researchers to view, analyze and obtain annotation at the gene, transcript, protein and genome level. Annotation of biochemical pathways permits the identification of key genes and transcripts central to the improvement of lignocellulosic properties in these species. The integrated nature of the BFGR in terms of annotation methods, orthologous/paralogous relationships and linkage to seven species with complete genome sequences allows comparative analyses for biofuel feedstock species with limited sequence resources. Database URL: http://bfgr.plantbiology.msu.edu.

  18. Genomic and transcriptomic resources for assassin flies including the complete genome sequence of Proctacanthus coquilletti (Insecta: Diptera: Asilidae) and 16 representative transcriptomes.

    PubMed

    Dikow, Rebecca B; Frandsen, Paul B; Turcatel, Mauren; Dikow, Torsten

    2017-01-01

    A high-quality draft genome for Proctacanthus coquilletti (Insecta: Diptera: Asilidae) is presented along with transcriptomes for 16 Diptera species from five families: Asilidae, Apioceridae, Bombyliidae, Mydidae, and Tabanidae. Genome sequencing reveals that P. coquilletti has a genome size of approximately 210 Mbp and remarkably low heterozygosity (0.47%) and few repeats (15%). These characteristics helped produce a highly contiguous (N50 = 862 kbp) assembly, particularly given that only a single 2 × 250 bp PCR-free Illumina library was sequenced. A phylogenomic hypothesis is presented based on thousands of putative orthologs across the 16 transcriptomes. Phylogenetic relationships support the sister group relationship of Apioceridae + Mydidae to Asilidae. A time-calibrated phylogeny is also presented, with seven fossil calibration points, which suggests an older age of the split among Apioceridae, Asilidae, and Mydidae (158 mya) and Apioceridae and Mydidae (135 mya) than proposed in the AToL FlyTree project. Future studies will be able to take advantage of the resources presented here in order to produce large scale phylogenomic and evolutionary studies of assassin fly phylogeny, life histories, or venom. The bioinformatics tools and workflow presented here will be useful to others wishing to generate de novo genomic resources in species-rich taxa without a closely-related reference genome.

  19. Genomic and transcriptomic resources for assassin flies including the complete genome sequence of Proctacanthus coquilletti (Insecta: Diptera: Asilidae) and 16 representative transcriptomes

    PubMed Central

    Frandsen, Paul B.; Turcatel, Mauren

    2017-01-01

    A high-quality draft genome for Proctacanthus coquilletti (Insecta: Diptera: Asilidae) is presented along with transcriptomes for 16 Diptera species from five families: Asilidae, Apioceridae, Bombyliidae, Mydidae, and Tabanidae. Genome sequencing reveals that P. coquilletti has a genome size of approximately 210 Mbp and remarkably low heterozygosity (0.47%) and few repeats (15%). These characteristics helped produce a highly contiguous (N50 = 862 kbp) assembly, particularly given that only a single 2 × 250 bp PCR-free Illumina library was sequenced. A phylogenomic hypothesis is presented based on thousands of putative orthologs across the 16 transcriptomes. Phylogenetic relationships support the sister group relationship of Apioceridae + Mydidae to Asilidae. A time-calibrated phylogeny is also presented, with seven fossil calibration points, which suggests an older age of the split among Apioceridae, Asilidae, and Mydidae (158 mya) and Apioceridae and Mydidae (135 mya) than proposed in the AToL FlyTree project. Future studies will be able to take advantage of the resources presented here in order to produce large scale phylogenomic and evolutionary studies of assassin fly phylogeny, life histories, or venom. The bioinformatics tools and workflow presented here will be useful to others wishing to generate de novo genomic resources in species-rich taxa without a closely-related reference genome. PMID:28168115

  20. Novel genomic resources for a climate change sensitive mammal: characterization of the American pika transcriptome

    PubMed Central

    2013-01-01

    Background When faced with climate change, species must either shift their home range or adapt in situ in order to maintain optimal physiological balance with their environment. The American pika (Ochotona princeps) is a small alpine mammal with limited dispersal capacity and low tolerance for thermal stress. As a result, pikas have become an important system for examining biotic responses to changing climatic conditions. Previous research using amplified fragment length polymorphisms (AFLPs) has revealed evidence for environmental-mediated selection in O. princeps populations distributed along elevation gradients, yet the anonymity of AFLP loci and lack of available genomic resources precluded the identification of associated gene regions. Here, we harnessed next-generation sequencing technology in order to characterize the American pika transcriptome and identify a large suite of single nucleotide polymorphisms (SNPs), which can be used to elucidate elevation- and site-specific patterns of sequence variation. Results We constructed pooled cDNA libraries of O. princeps from high (1400m) and low (300m) elevation sites along a previously established transect in British Columbia. Transcriptome sequencing using the Roche 454 GS FLX titanium platform generated 780 million base pairs of data, which were assembled into 7,325 high coverage contigs. These contigs were used to identify 24,261 novel SNP loci. Using high resolution melt analysis, we developed 17 of these SNPs into genotyping assays, which were validated with independent DNA samples from British Columbia Canada and Oregon State USA. In addition, we detected haplotypes in the NADH dehydrogenase subunit 5 of the mitochondrial genome that were fixed and different among elevations, suggesting that this may be an informative target gene for studying the role of cellular respiration in local adaptation. We also identified contigs that were unique to each elevation, including a high elevation-specific contig that was

  1. Novel genomic resources for a climate change sensitive mammal: characterization of the American pika transcriptome.

    PubMed

    Lemay, Matthew A; Henry, Philippe; Lamb, Clayton T; Robson, Kelsey M; Russello, Michael A

    2013-05-10

    When faced with climate change, species must either shift their home range or adapt in situ in order to maintain optimal physiological balance with their environment. The American pika (Ochotona princeps) is a small alpine mammal with limited dispersal capacity and low tolerance for thermal stress. As a result, pikas have become an important system for examining biotic responses to changing climatic conditions. Previous research using amplified fragment length polymorphisms (AFLPs) has revealed evidence for environmental-mediated selection in O. princeps populations distributed along elevation gradients, yet the anonymity of AFLP loci and lack of available genomic resources precluded the identification of associated gene regions. Here, we harnessed next-generation sequencing technology in order to characterize the American pika transcriptome and identify a large suite of single nucleotide polymorphisms (SNPs), which can be used to elucidate elevation- and site-specific patterns of sequence variation. We constructed pooled cDNA libraries of O. princeps from high (1400 m) and low (300 m) elevation sites along a previously established transect in British Columbia. Transcriptome sequencing using the Roche 454 GS FLX titanium platform generated 780 million base pairs of data, which were assembled into 7,325 high coverage contigs. These contigs were used to identify 24,261 novel SNP loci. Using high resolution melt analysis, we developed 17 of these SNPs into genotyping assays, which were validated with independent DNA samples from British Columbia Canada and Oregon State USA. In addition, we detected haplotypes in the NADH dehydrogenase subunit 5 of the mitochondrial genome that were fixed and different among elevations, suggesting that this may be an informative target gene for studying the role of cellular respiration in local adaptation. We also identified contigs that were unique to each elevation, including a high elevation-specific contig that was a positive match

  2. AtPID: a genome-scale resource for genotype–phenotype associations in Arabidopsis

    PubMed Central

    Lv, Qi; Lan, Yiheng; Shi, Yan; Wang, Huan; Pan, Xia; Li, Peng; Shi, Tieliu

    2017-01-01

    AtPID (Arabidopsis thaliana Protein Interactome Database, available at http://www.megabionet.org/atpid) is an integrated database resource for protein interaction network and functional annotation. In the past few years, we collected 5564 mutants with significant morphological alterations and manually curated them to 167 plant ontology (PO) morphology categories. These single/multiple-gene mutants were indexed and linked to 3919 genes. After integrated these genotype–phenotype associations with the comprehensive protein interaction network in AtPID, we developed a Naïve Bayes method and predicted 4457 novel high confidence gene-PO pairs with 1369 genes as the complement. Along with the accumulated novel data for protein interaction and functional annotation, and the updated visualization toolkits, we present a genome-scale resource for genotype–phenotype associations for Arabidopsis in AtPID 5.0. In our updated website, all the new genotype–phenotype associations from mutants, protein network, and the protein annotation information can be vividly displayed in a comprehensive network view, which will greatly enhance plant protein function and genotype–phenotype association studies in a systematical way. PMID:27899679

  3. Trends in snag populations in drought-stressed mixed-conifer and ponderosa pine forests (1997-2007)

    Treesearch

    Joseph L. Ganey; Scott C. Vojta

    2012-01-01

    Snags provide important biological legacies, resources for numerous species of native wildlife, and contribute to decay dynamics and ecological processes in forested ecosystems. We monitored trends in snag populations from 1997 to 2007 in drought-stressed mixed-conifer and ponderosa pine (Pinus ponderosa Dougl. ex Laws) forests, northern Arizona. Median snag density...

  4. Science You Can Use Bulletin: Revisiting disturbance: A new guide for keeping dry mixed conifer forests healthy through fuel management

    Treesearch

    Sue Miller; Theresa Jain; Mike A. Battaglia; Han-Sup Han; Russell T. Graham; Christopher R. Keyes; Jeremy S. Fried; Jonathan E. Sandquist

    2014-01-01

    Planning for hazardous fuels reduction can be challenging, given that land managers must balance multiple resource objectives. To help managers with planning and implementing fuel treatments, the Rocky Mountain Research Station, with support from the Joint Fire Science Program, has published A Comprehensive Guide to Fuel Management Practices for Dry Mixed Conifer...

  5. Red alder-conifer stands in Alaska: An example of mixed species management to enhance structural and biological complexity

    Treesearch

    Robert Deal; Ewa Orlikowska; David D’Amore; Paul Hennon

    2017-01-01

    There is worldwide interest in managing forests to improve biodiversity, enhance ecosystem services and assure long-term sustainability of forest resources. An increasingly important goal of forest management is to increase stand diversity and improve wildlife and aquatic habitat. Well-planned silvicultural systems containing a mixture of broadleaf-conifer species have...

  6. Comparison of terrestrial invertebrate biomass and richness in young mixed red alder-conifer, young conifer, and old conifer stands of southeast Alaska.

    Treesearch

    Mark E. Schultz; Toni L. De Santo

    2008-01-01

    Coniferous stands that regenerate following clearcutting in southeast Alaska can be characterized by the amount of soil disturbance during logging. There are indications that red alder in mixed stands mitigates some of the negative effects of clearcutting. We compared invertebrate biomass in four stands each of (1) young conifers, (2) young mixed alder and conifer, and...

  7. Genomic resources for water yam (Dioscorea alata L.): analyses of EST-Sequences, De Novo sequencing and GBS libraries

    USDA-ARS?s Scientific Manuscript database

    The reducing cost and rapid progress in next-generation sequencing techniques coupled with high performance computational approaches have resulted in large-scale discovery of advanced genomic resources such as SSRs, SNPs and InDels in several model and non-model plant species. Yam (Dioscorea spp.) i...

  8. Large-scale reduction of the Bacillus subtilis genome: consequences for the transcriptional network, resource allocation, and metabolism.

    PubMed

    Reuß, Daniel R; Altenbuchner, Josef; Mäder, Ulrike; Rath, Hermann; Ischebeck, Till; Sappa, Praveen Kumar; Thürmer, Andrea; Guérin, Cyprien; Nicolas, Pierre; Steil, Leif; Zhu, Bingyao; Feussner, Ivo; Klumpp, Stefan; Daniel, Rolf; Commichau, Fabian M; Völker, Uwe; Stülke, Jörg

    2017-02-01

    Understanding cellular life requires a comprehensive knowledge of the essential cellular functions, the components involved, and their interactions. Minimized genomes are an important tool to gain this knowledge. We have constructed strains of the model bacterium, Bacillus subtilis, whose genomes have been reduced by ∼36%. These strains are fully viable, and their growth rates in complex medium are comparable to those of wild type strains. An in-depth multi-omics analysis of the genome reduced strains revealed how the deletions affect the transcription regulatory network of the cell, translation resource allocation, and metabolism. A comparison of gene counts and resource allocation demonstrates drastic differences in the two parameters, with 50% of the genes using as little as 10% of translation capacity, whereas the 6% essential genes require 57% of the translation resources. Taken together, the results are a valuable resource on gene dispensability in B. subtilis, and they suggest the roads to further genome reduction to approach the final aim of a minimal cell in which all functions are understood.

  9. The Agassiz’s desert tortoise genome provides a resource for the conservation of a threatened species

    PubMed Central

    Tollis, Marc; DeNardo, Dale F.; Cornelius, John A.; Dolby, Greer A.; Edwards, Taylor; Henen, Brian T.; Karl, Alice E.; Murphy, Robert W.

    2017-01-01

    Agassiz’s desert tortoise (Gopherus agassizii) is a long-lived species native to the Mojave Desert and is listed as threatened under the US Endangered Species Act. To aid conservation efforts for preserving the genetic diversity of this species, we generated a whole genome reference sequence with an annotation based on deep transcriptome sequences of adult skeletal muscle, lung, brain, and blood. The draft genome assembly for G. agassizii has a scaffold N50 length of 252 kbp and a total length of 2.4 Gbp. Genome annotation reveals 20,172 protein-coding genes in the G. agassizii assembly, and that gene structure is more similar to chicken than other turtles. We provide a series of comparative analyses demonstrating (1) that turtles are among the slowest-evolving genome-enabled reptiles, (2) amino acid changes in genes controlling desert tortoise traits such as shell development, longevity and osmoregulation, and (3) fixed variants across the Gopherus species complex in genes related to desert adaptations, including circadian rhythm and innate immune response. This G. agassizii genome reference and annotation is the first such resource for any tortoise, and will serve as a foundation for future analysis of the genetic basis of adaptations to the desert environment, allow for investigation into genomic factors affecting tortoise health, disease and longevity, and serve as a valuable resource for additional studies in this species complex. PMID:28562605

  10. Hygroscopic motions of fossil conifer cones

    PubMed Central

    Poppinga, Simon; Nestle, Nikolaus; Šandor, Andrea; Reible, Bruno; Masselter, Tom; Bruchmann, Bernd; Speck, Thomas

    2017-01-01

    Conifer cones represent natural, woody compliant structures which move their scales as passive responses to changes in environmental humidity. Here we report on water-driven opening and closing motions in coalified conifer cones from the Eemian Interglacial (approx. 126,000–113,000 years BP) and from the Middle Miocene (approx. 16.5 to 11.5 million years BP). These cones represent by far the oldest documented evidence of plant parts showing full functionality of such passive hydraulically actuated motion. The functional resilience of these structures is far beyond the biological purpose of seed dispersal and protection and is because of a low level of mineralization of the fossils. Our analysis emphasizes the functional-morphological integrity of these biological compliant mechanisms which, in addition to their biological fascination, are potentially also role models for resilient and maintenance-free biomimetic applications (e.g., adaptive and autonomously moving structures including passive hydraulic actuators). PMID:28074936

  11. Hygroscopic motions of fossil conifer cones.

    PubMed

    Poppinga, Simon; Nestle, Nikolaus; Šandor, Andrea; Reible, Bruno; Masselter, Tom; Bruchmann, Bernd; Speck, Thomas

    2017-01-11

    Conifer cones represent natural, woody compliant structures which move their scales as passive responses to changes in environmental humidity. Here we report on water-driven opening and closing motions in coalified conifer cones from the Eemian Interglacial (approx. 126,000-113,000 years BP) and from the Middle Miocene (approx. 16.5 to 11.5 million years BP). These cones represent by far the oldest documented evidence of plant parts showing full functionality of such passive hydraulically actuated motion. The functional resilience of these structures is far beyond the biological purpose of seed dispersal and protection and is because of a low level of mineralization of the fossils. Our analysis emphasizes the functional-morphological integrity of these biological compliant mechanisms which, in addition to their biological fascination, are potentially also role models for resilient and maintenance-free biomimetic applications (e.g., adaptive and autonomously moving structures including passive hydraulic actuators).

  12. Hygroscopic motions of fossil conifer cones

    NASA Astrophysics Data System (ADS)

    Poppinga, Simon; Nestle, Nikolaus; Šandor, Andrea; Reible, Bruno; Masselter, Tom; Bruchmann, Bernd; Speck, Thomas

    2017-01-01

    Conifer cones represent natural, woody compliant structures which move their scales as passive responses to changes in environmental humidity. Here we report on water-driven opening and closing motions in coalified conifer cones from the Eemian Interglacial (approx. 126,000-113,000 years BP) and from the Middle Miocene (approx. 16.5 to 11.5 million years BP). These cones represent by far the oldest documented evidence of plant parts showing full functionality of such passive hydraulically actuated motion. The functional resilience of these structures is far beyond the biological purpose of seed dispersal and protection and is because of a low level of mineralization of the fossils. Our analysis emphasizes the functional-morphological integrity of these biological compliant mechanisms which, in addition to their biological fascination, are potentially also role models for resilient and maintenance-free biomimetic applications (e.g., adaptive and autonomously moving structures including passive hydraulic actuators).

  13. Fuel bed characteristics of Sierra Nevada conifers

    USGS Publications Warehouse

    van Wagtendonk, J.W.; Benedict, J.M.; Sydoriak, W.M.

    1998-01-01

    A study of fuels in Sierra Nevada conifer forests showed that fuel bed depth and fuel bed weight significantly varied by tree species and developmental stage of the overstory. Specific values for depth and weight of woody, litter, and duff fuels are reported. There was a significant positive relationship between fuel bed depth and weight. Estimates of woody fuel weight using the planar intercept method were significantly related to sampled values. These relationships can be used to estimate fuel weights in the field.

  14. Sugar export limits size of conifer needles

    NASA Astrophysics Data System (ADS)

    Rademaker, Hanna; Zwieniecki, Maciej A.; Bohr, Tomas; Jensen, Kaare H.

    2017-04-01

    Plant leaf size varies by more than three orders of magnitude, from a few millimeters to over one meter. Conifer leaves, however, are relatively short and the majority of needles are no longer than 6 cm. The reason for the strong confinement of the trait-space is unknown. We show that sugars produced near the tip of long needles cannot be exported efficiently, because the pressure required to drive vascular flow would exceed the greatest available pressure (the osmotic pressure). This basic constraint leads to the formation of an inactive region of stagnant fluid near the needle tip, which does not contribute to sugar flow. Remarkably, we find that the size of the active part does not scale with needle length. We predict a single maximum needle size of 5 cm, in accord with data from 519 conifer species. This could help rationalize the recent observation that conifers have significantly smaller leaves than angiosperms, and provide a biophysical explanation for this intriguing difference between the two largest groups of plants.

  15. Resource base influences genome-wide DNA methylation levels in wild baboons (Papio cynocephalus).

    PubMed

    Lea, Amanda J; Altmann, Jeanne; Alberts, Susan C; Tung, Jenny

    2016-04-01

    Variation in resource availability commonly exerts strong effects on fitness-related traits in wild animals. However, we know little about the molecular mechanisms that mediate these effects, or about their persistence over time. To address these questions, we profiled genome-wide whole-blood DNA methylation levels in two sets of wild baboons: (i) 'wild-feeding' baboons that foraged naturally in a savanna environment and (ii) 'Lodge' baboons that had ready access to spatially concentrated human food scraps, resulting in high feeding efficiency and low daily travel distances. We identified 1014 sites (0.20% of sites tested) that were differentially methylated between wild-feeding and Lodge baboons, providing the first evidence that resource availability shapes the epigenome in a wild mammal. Differentially methylated sites tended to occur in contiguous stretches (i.e., in differentially methylated regions or DMRs), in promoters and enhancers, and near metabolism-related genes, supporting their functional importance in gene regulation. In agreement, reporter assay experiments confirmed that methylation at the largest identified DMR, located in the promoter of a key glycolysis-related gene, was sufficient to causally drive changes in gene expression. Intriguingly, all dispersing males carried a consistent epigenetic signature of their membership in a wild-feeding group, regardless of whether males dispersed into or out of this group as adults. Together, our findings support a role for DNA methylation in mediating ecological effects on phenotypic traits in the wild and emphasize the dynamic environmental sensitivity of DNA methylation levels across the life course.

  16. GenomicusPlants: a web resource to study genome evolution in flowering plants.

    PubMed

    Louis, Alexandra; Murat, Florent; Salse, Jérôme; Crollius, Hugues Roest

    2015-01-01

    Comparative genomics combined with phylogenetic reconstructions are powerful approaches to study the evolution of genes and genomes. However, the current rapid expansion of the volume of genomic information makes it increasingly difficult to interrogate, integrate and synthesize comparative genome data while taking into account the maximum breadth of information available. GenomicusPlants (http://www.genomicus.biologie.ens.fr/genomicus-plants) is an extension of the Genomicus webserver that addresses this issue by allowing users to explore flowering plant genomes in an intuitive way, across the broadest evolutionary scales. Extant genomes of 26 flowering plants can be analyzed, as well as 23 ancestral reconstructed genomes. Ancestral gene order provides a long-term chronological view of gene order evolution, greatly facilitating comparative genomics and evolutionary studies. Four main interfaces ('views') are available where: (i) PhyloView combines phylogenetic trees with comparisons of genomic loci across any number of genomes; (ii) AlignView projects loci of interest against all other genomes to visualize its topological conservation; (iii) MatrixView compares two genomes in a classical dotplot representation; and (iv) Karyoview visualizes chromosome karyotypes 'painted' with colours of another genome of interest. All four views are interconnected and benefit from many customizable features.

  17. GenomicusPlants: A Web Resource to Study Genome Evolution in Flowering Plants

    PubMed Central

    Louis, Alexandra; Murat, Florent; Salse, Jérôme; Roest Crollius, Hugues

    2015-01-01

    Comparative genomics combined with phylogenetic reconstructions are powerful approaches to study the evolution of genes and genomes. However, the current rapid expansion of the volume of genomic information makes it increasingly difficult to interrogate, integrate and synthesize comparative genome data while taking into account the maximum breadth of information available. GenomicusPlants (http://www.genomicus.biologie.ens.fr/genomicus-plants) is an extension of the Genomicus webserver that addresses this issue by allowing users to explore flowering plant genomes in an intuitive way, across the broadest evolutionary scales. Extant genomes of 26 flowering plants can be analyzed, as well as 23 ancestral reconstructed genomes. Ancestral gene order provides a long-term chronological view of gene order evolution, greatly facilitating comparative genomics and evolutionary studies. Four main interfaces (‘views’) are available where: (i) PhyloView combines phylogenetic trees with comparisons of genomic loci across any number of genomes; (ii) AlignView projects loci of interest against all other genomes to visualize its topological conservation; (iii) MatrixView compares two genomes in a classical dotplot representation; and (iv) Karyoview visualizes chromosome karyotypes ‘painted’ with colours of another genome of interest. All four views are interconnected and benefit from many customizable features. PMID:25432975

  18. DNA Barcoding the Native Flowering Plants and Conifers of Wales

    PubMed Central

    de Vere, Natasha; Rich, Tim C. G.; Ford, Col R.; Trinder, Sarah A.; Long, Charlotte; Moore, Chris W.; Satterthwaite, Danielle; Davies, Helena; Allainguillaume, Joel; Ronca, Sandra; Tatarinova, Tatiana; Garbett, Hannah; Walker, Kevin; Wilkinson, Mike J.

    2012-01-01

    We present the first national DNA barcode resource that covers the native flowering plants and conifers for the nation of Wales (1143 species). Using the plant DNA barcode markers rbcL and matK, we have assembled 97.7% coverage for rbcL, 90.2% for matK, and a dual-locus barcode for 89.7% of the native Welsh flora. We have sampled multiple individuals for each species, resulting in 3304 rbcL and 2419 matK sequences. The majority of our samples (85%) are from DNA extracted from herbarium specimens. Recoverability of DNA barcodes is lower using herbarium specimens, compared to freshly collected material, mostly due to lower amplification success, but this is balanced by the increased efficiency of sampling species that have already been collected, identified, and verified by taxonomic experts. The effectiveness of the DNA barcodes for identification (level of discrimination) is assessed using four approaches: the presence of a barcode gap (using pairwise and multiple alignments), formation of monophyletic groups using Neighbour-Joining trees, and sequence similarity in BLASTn searches. These approaches yield similar results, providing relative discrimination levels of 69.4 to 74.9% of all species and 98.6 to 99.8% of genera using both markers. Species discrimination can be further improved using spatially explicit sampling. Mean species discrimination using barcode gap analysis (with a multiple alignment) is 81.6% within 10×10 km squares and 93.3% for 2×2 km squares. Our database of DNA barcodes for Welsh native flowering plants and conifers represents the most complete coverage of any national flora, and offers a valuable platform for a wide range of applications that require accurate species identification. PMID:22701588

  19. DNA barcoding the native flowering plants and conifers of Wales.

    PubMed

    de Vere, Natasha; Rich, Tim C G; Ford, Col R; Trinder, Sarah A; Long, Charlotte; Moore, Chris W; Satterthwaite, Danielle; Davies, Helena; Allainguillaume, Joel; Ronca, Sandra; Tatarinova, Tatiana; Garbett, Hannah; Walker, Kevin; Wilkinson, Mike J

    2012-01-01

    We present the first national DNA barcode resource that covers the native flowering plants and conifers for the nation of Wales (1143 species). Using the plant DNA barcode markers rbcL and matK, we have assembled 97.7% coverage for rbcL, 90.2% for matK, and a dual-locus barcode for 89.7% of the native Welsh flora. We have sampled multiple individuals for each species, resulting in 3304 rbcL and 2419 matK sequences. The majority of our samples (85%) are from DNA extracted from herbarium specimens. Recoverability of DNA barcodes is lower using herbarium specimens, compared to freshly collected material, mostly due to lower amplification success, but this is balanced by the increased efficiency of sampling species that have already been collected, identified, and verified by taxonomic experts. The effectiveness of the DNA barcodes for identification (level of discrimination) is assessed using four approaches: the presence of a barcode gap (using pairwise and multiple alignments), formation of monophyletic groups using Neighbour-Joining trees, and sequence similarity in BLASTn searches. These approaches yield similar results, providing relative discrimination levels of 69.4 to 74.9% of all species and 98.6 to 99.8% of genera using both markers. Species discrimination can be further improved using spatially explicit sampling. Mean species discrimination using barcode gap analysis (with a multiple alignment) is 81.6% within 10×10 km squares and 93.3% for 2×2 km squares. Our database of DNA barcodes for Welsh native flowering plants and conifers represents the most complete coverage of any national flora, and offers a valuable platform for a wide range of applications that require accurate species identification.

  20. Computational tools and resources for prediction and analysis of gene regulatory regions in the chick genome

    PubMed Central

    Khan, Mohsin A. F.; Soto-Jimenez, Luz Mayela; Howe, Timothy; Streit, Andrea; Sosinsky, Alona; Stern, Claudio D.

    2013-01-01

    The discovery of cis-regulatory elements is a challenging problem in bioinformatics, owing to distal locations and context-specific roles of these elements in controlling gene regulation. Here we review the current bioinformatics methodologies and resources available for systematic discovery of cis-acting regulatory elements and conserved transcription factor binding sites in the chick genome. In addition, we propose and make available, a novel workflow using computational tools that integrate CTCF analysis to predict putative insulator elements, enhancer prediction and TFBS analysis. To demonstrate the usefulness of this computational workflow, we then use it to analyze the locus of the gene Sox2 whose developmental expression is known to be controlled by a complex array of cis-acting regulatory elements. The workflow accurately predicts most of the experimentally verified elements along with some that have not yet been discovered. A web version of the CTCF tool, together with instructions for using the workflow can be accessed from http://toolshed.g2.bx.psu.edu/view/mkhan1980/ctcf_analysis. For local installation of the tool, relevant Perl scripts and instructions are provided in the directory named “code” in the supplementary materials. PMID:23355428

  1. Whole genome sequencing resource identifies 18 new candidate genes for autism spectrum disorder.

    PubMed

    C Yuen, Ryan K; Merico, Daniele; Bookman, Matt; L Howe, Jennifer; Thiruvahindrapuram, Bhooma; Patel, Rohan V; Whitney, Joe; Deflaux, Nicole; Bingham, Jonathan; Wang, Zhuozhi; Pellecchia, Giovanna; Buchanan, Janet A; Walker, Susan; Marshall, Christian R; Uddin, Mohammed; Zarrei, Mehdi; Deneault, Eric; D'Abate, Lia; Chan, Ada J S; Koyanagi, Stephanie; Paton, Tara; Pereira, Sergio L; Hoang, Ny; Engchuan, Worrawat; Higginbotham, Edward J; Ho, Karen; Lamoureux, Sylvia; Li, Weili; MacDonald, Jeffrey R; Nalpathamkalam, Thomas; Sung, Wilson W L; Tsoi, Fiona J; Wei, John; Xu, Lizhen; Tasse, Anne-Marie; Kirby, Emily; Van Etten, William; Twigger, Simon; Roberts, Wendy; Drmic, Irene; Jilderda, Sanne; Modi, Bonnie MacKinnon; Kellam, Barbara; Szego, Michael; Cytrynbaum, Cheryl; Weksberg, Rosanna; Zwaigenbaum, Lonnie; Woodbury-Smith, Marc; Brian, Jessica; Senman, Lili; Iaboni, Alana; Doyle-Thomas, Krissy; Thompson, Ann; Chrysler, Christina; Leef, Jonathan; Savion-Lemieux, Tal; Smith, Isabel M; Liu, Xudong; Nicolson, Rob; Seifer, Vicki; Fedele, Angie; Cook, Edwin H; Dager, Stephen; Estes, Annette; Gallagher, Louise; Malow, Beth A; Parr, Jeremy R; Spence, Sarah J; Vorstman, Jacob; Frey, Brendan J; Robinson, James T; Strug, Lisa J; Fernandez, Bridget A; Elsabbagh, Mayada; Carter, Melissa T; Hallmayer, Joachim; Knoppers, Bartha M; Anagnostou, Evdokia; Szatmari, Peter; Ring, Robert H; Glazer, David; Pletcher, Mathew T; Scherer, Stephen W

    2017-04-01

    We are performing whole-genome sequencing of families with autism spectrum disorder (ASD) to build a resource (MSSNG) for subcategorizing the phenotypes and underlying genetic factors involved. Here we report sequencing of 5,205 samples from families with ASD, accompanied by clinical information, creating a database accessible on a cloud platform and through a controlled-access internet portal. We found an average of 73.8 de novo single nucleotide variants and 12.6 de novo insertions and deletions or copy number variations per ASD subject. We identified 18 new candidate ASD-risk genes and found that participants bearing mutations in susceptibility genes had significantly lower adaptive ability (P = 6 × 10(-4)). In 294 of 2,620 (11.2%) of ASD cases, a molecular basis could be determined and 7.2% of these carried copy number variations and/or chromosomal abnormalities, emphasizing the importance of detecting all forms of genetic variation as diagnostic and therapeutic targets in ASD.

  2. Computational tools and resources for prediction and analysis of gene regulatory regions in the chick genome.

    PubMed

    Khan, Mohsin A F; Soto-Jimenez, Luz Mayela; Howe, Timothy; Streit, Andrea; Sosinsky, Alona; Stern, Claudio D

    2013-05-01

    The discovery of cis-regulatory elements is a challenging problem in bioinformatics, owing to distal locations and context-specific roles of these elements in controlling gene regulation. Here we review the current bioinformatics methodologies and resources available for systematic discovery of cis-acting regulatory elements and conserved transcription factor binding sites in the chick genome. In addition, we propose and make available, a novel workflow using computational tools that integrate CTCF analysis to predict putative insulator elements, enhancer prediction, and TFBS analysis. To demonstrate the usefulness of this computational workflow, we then use it to analyze the locus of the gene Sox2 whose developmental expression is known to be controlled by a complex array of cis-acting regulatory elements. The workflow accurately predicts most of the experimentally verified elements along with some that have not yet been discovered. A web version of the CTCF tool, together with instructions for using the workflow can be accessed from http://toolshed.g2.bx.psu.edu/view/mkhan1980/ctcf_analysis. For local installation of the tool, relevant Perl scripts and instructions are provided in the directory named "code" in the supplementary materials.

  3. RegPrecise 3.0--a resource for genome-scale exploration of transcriptional regulation in bacteria.

    PubMed

    Novichkov, Pavel S; Kazakov, Alexey E; Ravcheev, Dmitry A; Leyn, Semen A; Kovaleva, Galina Y; Sutormin, Roman A; Kazanov, Marat D; Riehl, William; Arkin, Adam P; Dubchak, Inna; Rodionov, Dmitry A

    2013-11-01

    Genome-scale prediction of gene regulation and reconstruction of transcriptional regulatory networks in prokaryotes is one of the critical tasks of modern genomics. Bacteria from different taxonomic groups, whose lifestyles and natural environments are substantially different, possess highly diverged transcriptional regulatory networks. The comparative genomics approaches are useful for in silico reconstruction of bacterial regulons and networks operated by both transcription factors (TFs) and RNA regulatory elements (riboswitches). RegPrecise (http://regprecise.lbl.gov) is a web resource for collection, visualization and analysis of transcriptional regulons reconstructed by comparative genomics. We significantly expanded a reference collection of manually curated regulons we introduced earlier. RegPrecise 3.0 provides access to inferred regulatory interactions organized by phylogenetic, structural and functional properties. Taxonomy-specific collections include 781 TF regulogs inferred in more than 160 genomes representing 14 taxonomic groups of Bacteria. TF-specific collections include regulogs for a selected subset of 40 TFs reconstructed across more than 30 taxonomic lineages. Novel collections of regulons operated by RNA regulatory elements (riboswitches) include near 400 regulogs inferred in 24 bacterial lineages. RegPrecise 3.0 provides four classifications of the reference regulons implemented as controlled vocabularies: 55 TF protein families; 43 RNA motif families; ~150 biological processes or metabolic pathways; and ~200 effectors or environmental signals. Genome-wide visualization of regulatory networks and metabolic pathways covered by the reference regulons are available for all studied genomes. A separate section of RegPrecise 3.0 contains draft regulatory networks in 640 genomes obtained by an conservative propagation of the reference regulons to closely related genomes. RegPrecise 3.0 gives access to the transcriptional regulons reconstructed in

  4. Management strategies for bark beetles in conifer forests

    Treesearch

    Christopher Fettig; Jacek  Hilszczański

    2015-01-01

    Several species of bark beetles (Coleoptera: Curculionidae, Scolytinae) are capable of causing significant amounts of tree mortality in conifer forests throughout much of the world.  In most cases, these events are part of the ecology of conifer forests and positively influence many ecological processes, but the economic and social implications can be...

  5. The heterogeneous levels of linkage disequilibrium in white spruce genes and comparative analysis with other conifers

    PubMed Central

    Pavy, N; Namroud, M-C; Gagnon, F; Isabel, N; Bousquet, J

    2012-01-01

    In plants, knowledge about linkage disequilibrium (LD) is relevant for the design of efficient single-nucleotide polymorphism arrays in relation to their use in population and association genomics studies. Previous studies of conifer genes have shown LD to decay rapidly within gene limits, but exceptions have been reported. To evaluate the extent of heterogeneity of LD among conifer genes and its potential causes, we examined LD in 105 genes of white spruce (Picea glauca) by sequencing a panel of 48 haploid megagametophytes from natural populations and further compared it with LD in other conifer species. The average pairwise r2 value was 0.19 (s.d.=0.19), and LD dropped quickly with a half-decay being reached at a distance of 65 nucleotides between sites. However, LD was significantly heterogeneous among genes. A first group of 29 genes had stronger LD (mean r2=0.28), and a second group of 38 genes had weaker LD (mean r2=0.12). While a strong relationship was found with the recombination rate, there was no obvious relationship between LD and functional classification. The level of nucleotide diversity, which was highly heterogeneous across genes, was also not significantly correlated with LD. A search for selection signatures highlighted significant deviations from the standard neutral model, which could be mostly attributed to recent demographic changes. Little evidence was seen for hitchhiking and clear relationships with LD. When compared among conifer species, on average, levels of LD were similar in genes from white spruce, Norway spruce and Scots pine, whereas loblolly pine and Douglas fir genes exhibited a significantly higher LD. PMID:21897435

  6. The heterogeneous levels of linkage disequilibrium in white spruce genes and comparative analysis with other conifers.

    PubMed

    Pavy, N; Namroud, M-C; Gagnon, F; Isabel, N; Bousquet, J

    2012-03-01

    In plants, knowledge about linkage disequilibrium (LD) is relevant for the design of efficient single-nucleotide polymorphism arrays in relation to their use in population and association genomics studies. Previous studies of conifer genes have shown LD to decay rapidly within gene limits, but exceptions have been reported. To evaluate the extent of heterogeneity of LD among conifer genes and its potential causes, we examined LD in 105 genes of white spruce (Picea glauca) by sequencing a panel of 48 haploid megagametophytes from natural populations and further compared it with LD in other conifer species. The average pairwise r(2) value was 0.19 (s.d.=0.19), and LD dropped quickly with a half-decay being reached at a distance of 65 nucleotides between sites. However, LD was significantly heterogeneous among genes. A first group of 29 genes had stronger LD (mean r(2)=0.28), and a second group of 38 genes had weaker LD (mean r(2)=0.12). While a strong relationship was found with the recombination rate, there was no obvious relationship between LD and functional classification. The level of nucleotide diversity, which was highly heterogeneous across genes, was also not significantly correlated with LD. A search for selection signatures highlighted significant deviations from the standard neutral model, which could be mostly attributed to recent demographic changes. Little evidence was seen for hitchhiking and clear relationships with LD. When compared among conifer species, on average, levels of LD were similar in genes from white spruce, Norway spruce and Scots pine, whereas loblolly pine and Douglas fir genes exhibited a significantly higher LD.

  7. The Homeodomain Resource: a comprehensive collection of sequence, structure, interaction, genomic and functional information on the homeodomain protein family

    PubMed Central

    Moreland, R. Travis; Ryan, Joseph F.; Pan, Christopher; Baxevanis, Andreas D.

    2009-01-01

    The Homeodomain Resource is a curated collection of sequence, structure, interaction, genomic and functional information on the homeodomain family. The current version builds upon previous versions by the addition of new, complete sets of homeodomain sequences from fully sequenced genomes, the expansion of existing curated homeodomain information and the improvement of data accessibility through better search tools and more complete data integration. This release contains 1534 full-length homeodomain-containing sequences, 93 experimentally derived homeodomain structures, 101 homeodomain protein–protein interactions, 107 homeodomain DNA-binding sites and 206 homeodomain proteins implicated in human genetic disorders. Database URL: The Homeodomain Resource is freely available and can be accessed at http://research.nhgri.nih.gov/homeodomain/ PMID:20157477

  8. A large-scale zebrafish gene knockout resource for the genome-wide study of gene function.

    PubMed

    Varshney, Gaurav K; Lu, Jing; Gildea, Derek E; Huang, Haigen; Pei, Wuhong; Yang, Zhongan; Huang, Sunny C; Schoenfeld, David; Pho, Nam H; Casero, David; Hirase, Takashi; Mosbrook-Davis, Deborah; Zhang, Suiyuan; Jao, Li-En; Zhang, Bo; Woods, Ian G; Zimmerman, Steven; Schier, Alexander F; Wolfsberg, Tyra G; Pellegrini, Matteo; Burgess, Shawn M; Lin, Shuo

    2013-04-01

    With the completion of the zebrafish genome sequencing project, it becomes possible to analyze the function of zebrafish genes in a systematic way. The first step in such an analysis is to inactivate each protein-coding gene by targeted or random mutation. Here we describe a streamlined pipeline using proviral insertions coupled with high-throughput sequencing and mapping technologies to widely mutagenize genes in the zebrafish genome. We also report the first 6144 mutagenized and archived F1's predicted to carry up to 3776 mutations in annotated genes. Using in vitro fertilization, we have rescued and characterized ~0.5% of the predicted mutations, showing mutation efficacy and a variety of phenotypes relevant to both developmental processes and human genetic diseases. Mutagenized fish lines are being made freely available to the public through the Zebrafish International Resource Center. These fish lines establish an important milestone for zebrafish genetics research and should greatly facilitate systematic functional studies of the vertebrate genome.

  9. Carbon allocation and accumulation in conifers

    SciTech Connect

    Gower, S.T.; Isebrands, J.G.; Sheriff, D.W.

    1995-07-01

    Forests cover approximately 33% of the land surface of the earth, yet they are responsible for 65% of the annual carbon (C) accumulated by all terrestrial biomes. In general, total C content and net primary production rates are greater for forests than for other biomes, but C budgets differ greatly among forests. Despite several decades of research on forest C budgets, there is still an incomplete understanding of the factors controlling C allocation. Yet, if we are to understand how changing global events such as land use, climate change, atmospheric N deposition, ozone, and elevated atmospheric CO{sub 2} affect the global C budget, a mechanistic understanding of C assimilation, partitioning, and allocation is necessary. The objective of this chapter is to review the major factors that influence C allocation and accumulation in conifer trees and forests. In keeping with the theme of this book, we will focus primarily on evergreen conifers. However, even among evergreen conifers, leaf, canopy, and stand-level C and nutrient allocation patterns differ, often as a function of leaf development and longevity. The terminology related to C allocation literature is often inconsistent, confusing and inadequate for understanding and integrating past and current research. For example, terms often used synonymously to describe C flow or movement include translocation, transport, distribution, allocation, partitioning, apportionment, and biomass allocation. A common terminology is needed because different terms have different meanings to readers. In this paper we use C allocation, partitioning, and accumulation according to the definitions of Dickson and Isebrands (1993). Partitioning is the process of C flow into and among different chemical, storage, and transport pools. Allocation is the distribution of C to different plant parts within the plant (i.e., source to sink). Accumulation is the end product of the process of C allocation.

  10. Bluejay 1.0: genome browsing and comparison with rich customization provision and dynamic resource linking

    PubMed Central

    Soh, Jung; Gordon, Paul MK; Taschuk, Morgan L; Dong, Anguo; Ah-Seng, Andrew C; Turinsky, Andrei L; Sensen, Christoph W

    2008-01-01

    Background The Bluejay genome browser has been developed over several years to address the challenges posed by the ever increasing number of data types as well as the increasing volume of data in genome research. Beginning with a browser capable of rendering views of XML-based genomic information and providing scalable vector graphics output, we have now completed version 1.0 of the system with many additional features. Our development efforts were guided by our observation that biologists who use both gene expression profiling and comparative genomics gain functional insights above and beyond those provided by traditional per-gene analyses. Results Bluejay 1.0 is a genome viewer integrating genome annotation with: (i) gene expression information; and (ii) comparative analysis with an unlimited number of other genomes in the same view. This allows the biologist to see a gene not just in the context of its genome, but also its regulation and its evolution. Bluejay now has rich provision for personalization by users: (i) numerous display customization features; (ii) the availability of waypoints for marking multiple points of interest on a genome and subsequently utilizing them; and (iii) the ability to take user relevance feedback of annotated genes or textual items to offer personalized recommendations. Bluejay 1.0 also embeds the Seahawk browser for the Moby protocol, enabling users to seamlessly invoke hundreds of Web Services on genomic data of interest without any hard-coding. Conclusion Bluejay offers a unique set of customizable genome-browsing features, with the goal of allowing biologists to quickly focus on, analyze, compare, and retrieve related information on the parts of the genomic data they are most interested in. We expect these capabilities of Bluejay to benefit the many biologists who want to answer complex questions using the information available from completely sequenced genomes. PMID:18940007

  11. Conifer health classification for Colorado, 2008

    USGS Publications Warehouse

    Cole, Christopher J.; Noble, Suzanne M.; Blauer, Steven L.; Friesen, Beverly A.; Curry, Stacy E.; Bauer, Mark A.

    2010-01-01

    Colorado has undergone substantial changes in forests due to urbanization, wildfires, insect-caused tree mortality, and other human and environmental factors. The U.S. Geological Survey Rocky Mountain Geographic Science Center evaluated and developed a methodology for applying remotely-sensed imagery for assessing conifer health in Colorado. Two classes were identified for the purposes of this study: healthy and unhealthy (for example, an area the size of a 30- x 30-m pixel with 20 percent or greater visibly dead trees was defined as ?unhealthy?). Medium-resolution Landsat 5 Thematic Mapper imagery were collected. The normalized, reflectance-converted, cloud-filled Landsat scenes were merged to form a statewide image mosaic, and a Normalized Difference Vegetation Index (NDVI) and Renormalized Difference Infrared Index (RDII) were derived. A supervised maximum likelihood classification was done using the Landsat multispectral bands, the NDVI, the RDII, and 30-m U.S. Geological Survey National Elevation Dataset (NED). The classification was constrained to pixels identified in the updated landcover dataset as coniferous or mixed coniferous/deciduous vegetation. The statewide results were merged with a separate health assessment of Grand County, Colo., produced in late 2008. Sampling and validation was done by collecting field data and high-resolution imagery. The 86 percent overall classification accuracy attained in this study suggests that the data and methods used successfully characterized conifer conditions within Colorado. Although forest conditions for Lodgepole Pine (Pinus contorta) are easily characterized, classification uncertainty exists between healthy/unhealthy Ponderosa Pine (Pinus ponderosa), Pi?on (Pinus edulis), and Juniper (Juniperus sp.) vegetation. Some underestimation of conifer mortality in Summit County is likely, where recent (2008) cloud-free imagery was unavailable. These classification uncertainties are primarily due to the spatial and

  12. Leveraging genomic resources of model species for the assessment of diversity and phylogeny in wild and domesticated lentil.

    PubMed

    Alo, Fida; Furman, Bonnie J; Akhunov, Eduard; Dvorak, Jan; Gepts, Paul

    2011-01-01

    Advances in comparative genomics have provided significant opportunities for analysis of genetic diversity in species with limited genomic resources, such as the genus Lens. Medicago truncatula expressed sequence tags (ESTs) were aligned with the Arabidopsis thaliana genome sequence to identify conserved exon sequences and splice sites in the ESTs. Conserved primers (CPs) based on M. truncatula EST sequences flanking one or more introns were then designed. A total of 22% of the CPs produced polymerase chain reaction amplicons in lentil and were used to sequence amplicons in 175 wild and 133 domesticated lentil accessions. Analysis of the sequences confirmed that L. nigricans and L. ervoides are well-defined species at the DNA sequence level. Lens culinaris subsp. odemensis, L. culinaris subsp. tomentosus, and L. lamottei may constitute a single taxon pending verification with crossability experiments. Lens culinaris subsp. orientalis is the progenitor of domesticated lentil, L. culinaris subsp. culinaris (as proposed before), but a more specific area of origin can be suggested in southern Turkey. We were also able to detect the divergence, following domestication, of the domesticated gene pool into overlapping large-seeded (megasperma) and small-seeded (microsperma) groups. Lentil domestication led to a loss of genetic diversity of approximately 40%. The approach followed in this research has allowed us to rapidly exploit sequence information from model plant species for the study of genetic diversity of a crop such as lentil with limited genomic resources.

  13. PolyCat: A Resource for Genome Categorization of Sequencing Reads From Allopolyploid Organisms

    PubMed Central

    Page, Justin T.; Gingle, Alan R.; Udall, Joshua A.

    2013-01-01

    Read mapping is a fundamental part of next-generation genomic research but is complicated by genome duplication in many plants. Categorizing DNA sequence reads into their respective genomes enables current methods to analyze polyploid genomes as if they were diploid. We present PolyCat—a pipeline for mapping and categorizing all types of next-generation sequence data produced from allopolyploid organisms. PolyCat uses GSNAP’s single-nucleotide polymorphism (SNP)-tolerant mapping to minimize the mapping efficiency bias caused by SNPs between genomes. PolyCat then uses SNPs between genomes to categorize reads according to their respective genomes. Bisulfite-treated reads have a significant reduction in nucleotide complexity because nucleotide conversion events are confounded with transition substitutions. PolyCat includes special provisions to properly handle bisulfite-treated data. We demonstrate the functionality of PolyCat on allotetraploid cotton, Gossypium hirsutum, and create a functional SNP index for efficiently mapping sequence reads to the D-genome sequence of G. raimondii. PolyCat is appropriate for all allopolyploids and all types of next-generation genome analysis, including differential expression (RNA sequencing), differential methylation (bisulfite sequencing), differential DNA-protein binding (chromatin immunoprecipitation sequencing), and population diversity. PMID:23450226

  14. Draft genome sequence of Cicer reticulatum L., the wild progenitor of chickpea provides a resource for agronomic trait improvement

    PubMed Central

    Gupta, Sonal; Nawaz, Kashif; Parween, Sabiha; Roy, Riti; Sahu, Kamlesh; Kumar Pole, Anil; Khandal, Hitaishi; Srivastava, Rishi; Kumar Parida, Swarup

    2017-01-01

    Abstract Cicer reticulatum L. is the wild progenitor of the fourth most important legume crop chickpea (C. arietinum L.). We assembled short-read sequences into 416 Mb draft genome of C. reticulatum and anchored 78% (327 Mb) of this assembly to eight linkage groups. Genome annotation predicted 25,680 protein-coding genes covering more than 90% of predicted gene space. The genome assembly shared a substantial synteny and conservation of gene orders with the genome of the model legume Medicago truncatula. Resistance gene homologs of wild and domesticated chickpeas showed high sequence homology and conserved synteny. Comparison of gene sequences and nucleotide diversity using 66 wild and domesticated chickpea accessions suggested that the desi type chickpea was genetically closer to the wild species than the kabuli type. Comparative analyses predicted gene flow between the wild and the cultivated species during domestication. Molecular diversity and population genetic structure determination using 15,096 genome-wide single nucleotide polymorphisms revealed an admixed domestication pattern among cultivated (desi and kabuli) and wild chickpea accessions belonging to three population groups reflecting significant influence of parentage or geographical origin for their cultivar-specific population classification. The assembly and the polymorphic sequence resources presented here would facilitate the study of chickpea domestication and targeted use of wild Cicer germplasms for agronomic trait improvement in chickpea. PMID:27567261

  15. Draft genome sequence of Cicer reticulatum L., the wild progenitor of chickpea provides a resource for agronomic trait improvement.

    PubMed

    Gupta, Sonal; Nawaz, Kashif; Parween, Sabiha; Roy, Riti; Sahu, Kamlesh; Kumar Pole, Anil; Khandal, Hitaishi; Srivastava, Rishi; Kumar Parida, Swarup; Chattopadhyay, Debasis

    2017-02-01

    Cicer reticulatum L. is the wild progenitor of the fourth most important legume crop chickpea (C. arietinum L.). We assembled short-read sequences into 416 Mb draft genome of C. reticulatum and anchored 78% (327 Mb) of this assembly to eight linkage groups. Genome annotation predicted 25,680 protein-coding genes covering more than 90% of predicted gene space. The genome assembly shared a substantial synteny and conservation of gene orders with the genome of the model legume Medicago truncatula. Resistance gene homologs of wild and domesticated chickpeas showed high sequence homology and conserved synteny. Comparison of gene sequences and nucleotide diversity using 66 wild and domesticated chickpea accessions suggested that the desi type chickpea was genetically closer to the wild species than the kabuli type. Comparative analyses predicted gene flow between the wild and the cultivated species during domestication. Molecular diversity and population genetic structure determination using 15,096 genome-wide single nucleotide polymorphisms revealed an admixed domestication pattern among cultivated (desi and kabuli) and wild chickpea accessions belonging to three population groups reflecting significant influence of parentage or geographical origin for their cultivar-specific population classification. The assembly and the polymorphic sequence resources presented here would facilitate the study of chickpea domestication and targeted use of wild Cicer germplasms for agronomic trait improvement in chickpea. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  16. HelmCoP: an online resource for helminth functional genomics and drug and vaccine targets prioritization.

    PubMed

    Abubucker, Sahar; Martin, John; Taylor, Christina M; Mitreva, Makedonka

    2011-01-01

    A vast majority of the burden from neglected tropical diseases result from helminth infections (nematodes and platyhelminthes). Parasitic helminthes infect over 2 billion, exerting a high collective burden that rivals high-mortality conditions such as AIDS or malaria, and cause devastation to crops and livestock. The challenges to improve control of parasitic helminth infections are multi-fold and no single category of approaches will meet them all. New information such as helminth genomics, functional genomics and proteomics coupled with innovative bioinformatic approaches provide fundamental molecular information about these parasites, accelerating both basic research as well as development of effective diagnostics, vaccines and new drugs. To facilitate such studies we have developed an online resource, HelmCoP (Helminth Control and Prevention), built by integrating functional, structural and comparative genomic data from plant, animal and human helminthes, to enable researchers to develop strategies for drug, vaccine and pesticide prioritization, while also providing a useful comparative genomics platform. HelmCoP encompasses genomic data from several hosts, including model organisms, along with a comprehensive suite of structural and functional annotations, to assist in comparative analyses and to study host-parasite interactions. The HelmCoP interface, with a sophisticated query engine as a backbone, allows users to search for multi-factorial combinations of properties and serves readily accessible information that will assist in the identification of various genes of interest. HelmCoP is publicly available at: http://www.nematode.net/helmcop.html.

  17. Gondwanan conifer clones imperilled by bushfire

    PubMed Central

    Worth, James R. P.; Sakaguchi, Shota; Rann, Karl D.; Bowman, Clarence J. W.; Ito, Motomi; Jordan, Gregory J.; Bowman, David M. J. S.

    2016-01-01

    Global increases in fire frequency driven by anthropogenic greenhouse emissions and land use change could threaten unique and ancient species by creeping into long-term fire refugia. The perhumid and mountainous western half of Tasmania is a globally important refugium for palaeo-endemic, fire intolerant lineages, especially conifers. Reproductive strategy will be crucial to the resilience of these organisms under warmer, dryer and more fire prone climates. This study analysed clonal versus sexual reproduction in old growth plots dominated by the palaeo-endemic conifer Athrotaxis cupressoides (Cupressaceae), a species that lacks any traits to tolerate frequent landscape fire. Across most of the seven plots the amount of sexually derived individuals was lower than clonally derived with, on average, 60% of all stems belonging to the same multi-locus lineage (MLL) (i.e. were clonal). Some MLLs were large spanning over 10 s of metres and consisted of up to 62 stems. The high mortality after fire and the rarity of sexual regeneration means that the range of this fire-intolerant species is likely to contract under enhanced fire regimes and has a limited capacity to disperse via seed to available fire refugia in the landscape. PMID:27666536

  18. Plant hormones and ecophysiology of conifers

    SciTech Connect

    Davies, W.J.

    1995-07-01

    Over the past 30 years, there have been very substantial fluctuations in the interests of plant scientists in the involvement of plant growth regulators in the control of physiology, growth, and development of plants. In the years following the identification of the five major classes of growth regulators and identification of other groups of compounds of somewhat more restricted interest, an enormous number of papers reported the effects of hormones applied externally to a very wide range of plants. During this period, it became very fashionable to compare effects of hormones with the effects of the environment on developmental and physiological phenomena and to suggest a regulatory role for the hormone(s) in the processes under consideration. Ross et al. (1983) have published a very comprehensive survey of the effects of growth regulators applied externally to conifers, and even 10 years later, it is difficult to improve on what they have done. Nevertheless, in the light of recent changes in our understanding of how growth regulators may work, it is necessary to reexamine this field and ask what we really know about the involvement of growth regulators in the ecophysiology of conifers.

  19. FveGD: an online resource for diploid strawberry (fragaria vesca) genomics data

    USDA-ARS?s Scientific Manuscript database

    Fragaria vesca, a diploid strawberry species commonly known as the alpine or woodland strawberry, is a versatile experimental plant system that is an emerging model for the Rosaceae family. An ancestral F. vesca genome contributed to the genome of the octoploid dessert strawberry (F. xananassa) and...

  20. AmeriFlux US-Vcm Valles Caldera Mixed Conifer

    SciTech Connect

    Litvak, Marcy

    2016-01-01

    This is the AmeriFlux version of the carbon flux data for the site US-Vcm Valles Caldera Mixed Conifer. Site Description - The Valles Caldera Mixed Conifer site is located in the 1200 km2 Jemez River basin in north-central New Mexico. Common to elevations ranging from 3040 to 2740 m in the region, the mixed conifer stand, within the entirety of the tower footprint in all directions, provides an excellent setting for studying the seasonal interaction between snow and vegetation.

  1. RoBuST: an integrated genomics resource for the root and bulb crop families Apiaceae and Alliaceae

    PubMed Central

    2010-01-01

    Background Root and bulb vegetables (RBV) include carrots, celeriac (root celery), parsnips (Apiaceae), onions, garlic, and leek (Alliaceae)—food crops grown globally and consumed worldwide. Few data analysis platforms are currently available where data collection, annotation and integration initiatives are focused on RBV plant groups. Scientists working on RBV include breeders, geneticists, taxonomists, plant pathologists, and plant physiologists who use genomic data for a wide range of activities including the development of molecular genetic maps, delineation of taxonomic relationships, and investigation of molecular aspects of gene expression in biochemical pathways and disease responses. With genomic data coming from such diverse areas of plant science, availability of a community resource focused on these RBV data types would be of great interest to this scientific community. Description The RoBuST database has been developed to initiate a platform for collecting and organizing genomic information useful for RBV researchers. The current release of RoBuST contains genomics data for 294 Alliaceae and 816 Apiaceae plant species and has the following features: (1) comprehensive sequence annotations of 3663 genes 5959 RNAs, 22,723 ESTs and 11,438 regulatory sequence elements from Apiaceae and Alliaceae plant families; (2) graphical tools for visualization and analysis of sequence data; (3) access to traits, biosynthetic pathways, genetic linkage maps and molecular taxonomy data associated with Alliaceae and Apiaceae plants; and (4) comprehensive plant splice signal repository of 659,369 splice signals collected from 6015 plant species for comparative analysis of plant splicing patterns. Conclusions RoBuST, available at http://robust.genome.com, provides an integrated platform for researchers to effortlessly explore and analyze genomic data associated with root and bulb vegetables. PMID:20691054

  2. Aberrant Classopollis pollen reveals evidence for unreduced (2n) pollen in the conifer family Cheirolepidiaceae during the Triassic-Jurassic transition.

    PubMed

    Kürschner, Wolfram M; Batenburg, Sietske J; Mander, Luke

    2013-10-07

    Polyploidy (or whole-genome doubling) is a key mechanism for plant speciation leading to new evolutionary lineages. Several lines of evidence show that most species among flowering plants had polyploidy ancestry, but it is virtually unknown for conifers. Here, we study variability in pollen tetrad morphology and the size of the conifer pollen type Classopollis extracted from sediments of the Triassic-Jurassic transition, 200 Ma. Classopollis producing Cheirolepidiaceae were one of the most dominant and diverse groups of conifers during the Mesozoic. We show that aberrant pollen Classopollis tetrads, triads and dyads, and the large variation in pollen size indicates the presence of unreduced (2n) pollen, which is one of the main mechanisms in modern polyploid formation. Polyploid speciation may explain the high variability of growth forms and adaptation of these conifers to different environments and their resistance to extreme growth conditions. We suggest that polyploidy may have also reduced the extinction risk of these conifers during the End-Triassic biotic crisis.

  3. Aberrant Classopollis pollen reveals evidence for unreduced (2n) pollen in the conifer family Cheirolepidiaceae during the Triassic–Jurassic transition

    PubMed Central

    Kürschner, Wolfram M.; Batenburg, Sietske J.; Mander, Luke

    2013-01-01

    Polyploidy (or whole-genome doubling) is a key mechanism for plant speciation leading to new evolutionary lineages. Several lines of evidence show that most species among flowering plants had polyploidy ancestry, but it is virtually unknown for conifers. Here, we study variability in pollen tetrad morphology and the size of the conifer pollen type Classopollis extracted from sediments of the Triassic–Jurassic transition, 200 Ma. Classopollis producing Cheirolepidiaceae were one of the most dominant and diverse groups of conifers during the Mesozoic. We show that aberrant pollen Classopollis tetrads, triads and dyads, and the large variation in pollen size indicates the presence of unreduced (2n) pollen, which is one of the main mechanisms in modern polyploid formation. Polyploid speciation may explain the high variability of growth forms and adaptation of these conifers to different environments and their resistance to extreme growth conditions. We suggest that polyploidy may have also reduced the extinction risk of these conifers during the End-Triassic biotic crisis. PMID:23926159

  4. A resource of genome-wide single-nucleotide polymorphisms generated by RAD tag sequencing in the critically endangered European eel.

    PubMed

    Pujolar, J M; Jacobsen, M W; Frydenberg, J; Als, T D; Larsen, P F; Maes, G E; Zane, L; Jian, J B; Cheng, L; Hansen, M M

    2013-07-01

    Reduced representation genome sequencing such as restriction-site-associated DNA (RAD) sequencing is finding increased use to identify and genotype large numbers of single-nucleotide polymorphisms (SNPs) in model and nonmodel species. We generated a unique resource of novel SNP markers for the European eel using the RAD sequencing approach that was simultaneously identified and scored in a genome-wide scan of 30 individuals. Whereas genomic resources are increasingly becoming available for this species, including the recent release of a draft genome, no genome-wide set of SNP markers was available until now. The generated SNPs were widely distributed across the eel genome, aligning to 4779 different contigs and 19,703 different scaffolds. Significant variation was identified, with an average nucleotide diversity of 0.00529 across individuals. Results varied widely across the genome, ranging from 0.00048 to 0.00737 per locus. Based on the average nucleotide diversity across all loci, long-term effective population size was estimated to range between 132,000 and 1,320,000, which is much higher than previous estimates based on microsatellite loci. The generated SNP resource consisting of 82,425 loci and 376,918 associated SNPs provides a valuable tool for future population genetics and genomics studies and allows for targeting specific genes and particularly interesting regions of the eel genome.

  5. Creation of a minimal tiling path of genomic clones for Drosophila: provision of a common resource.

    PubMed

    Hollich, Volker; Johnson, Eric; Furlong, Eileen E; Beckmann, Boris; Carlson, Joseph; Celniker, Susan E; Hoheisel, Jörg D

    2004-08-01

    On the basis of shotgun subclone libraries used in the sequencing of the Drosophila melanogaster genome, a minimal tiling path of subclones across much of the genome was determined. About 320,000 shotgun clones for chromosomes X(12-20), 2R, 2L, 3R, and 4 were available from the Berkeley Drosophila Genome Project. The clone inserts have an average length of 3.4 kb and are amenable to standard PCR amplification. The resulting tiling path covers 86.2% of chromosome X(12-20), 86.2% of chromosomal arm 2R, 79.0% of 2L, 89.6% of 3R, and 80.5% of chromosome 4. In total, the 25,135 clones represent 76.7 Mb--equivalent to about 67% of the genome--and would be suitable for producing a microarray on a single slide.

  6. Evaluating the Clinical Validity of Gene-Disease Associations: An Evidence-Based Framework Developed by the Clinical Genome Resource.

    PubMed

    Strande, Natasha T; Riggs, Erin Rooney; Buchanan, Adam H; Ceyhan-Birsoy, Ozge; DiStefano, Marina; Dwight, Selina S; Goldstein, Jenny; Ghosh, Rajarshi; Seifert, Bryce A; Sneddon, Tam P; Wright, Matt W; Milko, Laura V; Cherry, J Michael; Giovanni, Monica A; Murray, Michael F; O'Daniel, Julianne M; Ramos, Erin M; Santani, Avni B; Scott, Alan F; Plon, Sharon E; Rehm, Heidi L; Martin, Christa L; Berg, Jonathan S

    2017-06-01

    With advances in genomic sequencing technology, the number of reported gene-disease relationships has rapidly expanded. However, the evidence supporting these claims varies widely, confounding accurate evaluation of genomic variation in a clinical setting. Despite the critical need to differentiate clinically valid relationships from less well-substantiated relationships, standard guidelines for such evaluation do not currently exist. The NIH-funded Clinical Genome Resource (ClinGen) has developed a framework to define and evaluate the clinical validity of gene-disease pairs across a variety of Mendelian disorders. In this manuscript we describe a proposed framework to evaluate relevant genetic and experimental evidence supporting or contradicting a gene-disease relationship and the subsequent validation of this framework using a set of representative gene-disease pairs. The framework provides a semiquantitative measurement for the strength of evidence of a gene-disease relationship that correlates to a qualitative classification: "Definitive," "Strong," "Moderate," "Limited," "No Reported Evidence," or "Conflicting Evidence." Within the ClinGen structure, classifications derived with this framework are reviewed and confirmed or adjusted based on clinical expertise of appropriate disease experts. Detailed guidance for utilizing this framework and access to the curation interface is available on our website. This evidence-based, systematic method to assess the strength of gene-disease relationships will facilitate more knowledgeable utilization of genomic variants in clinical and research settings. Copyright © 2017 American Society of Human Genetics. All rights reserved.

  7. Integrative Transcriptome, Genome and Quantitative Trait Loci Resources Identify Single Nucleotide Polymorphisms in Candidate Genes for Growth Traits in Turbot.

    PubMed

    Robledo, Diego; Fernández, Carlos; Hermida, Miguel; Sciara, Andrés; Álvarez-Dios, José Antonio; Cabaleiro, Santiago; Caamaño, Rubén; Martínez, Paulino; Bouza, Carmen

    2016-02-17

    Growth traits represent a main goal in aquaculture breeding programs and may be related to adaptive variation in wild fisheries. Integrating quantitative trait loci (QTL) mapping and next generation sequencing can greatly help to identify variation in candidate genes, which can result in marker-assisted selection and better genetic structure information. Turbot is a commercially important flatfish in Europe and China, with available genomic information on QTLs and genome mapping. Muscle and liver RNA-seq from 18 individuals was carried out to obtain gene sequences and markers functionally related to growth, resulting in a total of 20,447 genes and 85,344 single nucleotide polymorphisms (SNPs). Many growth-related genes and SNPs were identified and placed in the turbot genome and genetic map to explore their co-localization with growth-QTL markers. Forty-five SNPs on growth-related genes were selected based on QTL co-localization and relevant function for growth traits. Forty-three SNPs were technically feasible and validated in a wild Atlantic population, where 91% were polymorphic. The integration of functional and structural genomic resources in turbot provides a practical approach for QTL mining in this species. Validated SNPs represent a useful set of growth-related gene markers for future association, functional and population studies in this flatfish species.

  8. Integrative Transcriptome, Genome and Quantitative Trait Loci Resources Identify Single Nucleotide Polymorphisms in Candidate Genes for Growth Traits in Turbot

    PubMed Central

    Robledo, Diego; Fernández, Carlos; Hermida, Miguel; Sciara, Andrés; Álvarez-Dios, José Antonio; Cabaleiro, Santiago; Caamaño, Rubén; Martínez, Paulino; Bouza, Carmen

    2016-01-01

    Growth traits represent a main goal in aquaculture breeding programs and may be related to adaptive variation in wild fisheries. Integrating quantitative trait loci (QTL) mapping and next generation sequencing can greatly help to identify variation in candidate genes, which can result in marker-assisted selection and better genetic structure information. Turbot is a commercially important flatfish in Europe and China, with available genomic information on QTLs and genome mapping. Muscle and liver RNA-seq from 18 individuals was carried out to obtain gene sequences and markers functionally related to growth, resulting in a total of 20,447 genes and 85,344 single nucleotide polymorphisms (SNPs). Many growth-related genes and SNPs were identified and placed in the turbot genome and genetic map to explore their co-localization with growth-QTL markers. Forty-five SNPs on growth-related genes were selected based on QTL co-localization and relevant function for growth traits. Forty-three SNPs were technically feasible and validated in a wild Atlantic population, where 91% were polymorphic. The integration of functional and structural genomic resources in turbot provides a practical approach for QTL mining in this species. Validated SNPs represent a useful set of growth-related gene markers for future association, functional and population studies in this flatfish species. PMID:26901189

  9. The HuRef Browser: a web resource for individual human genomics

    PubMed Central

    Axelrod, Nelson; Lin, Yuan; Ng, Pauline C.; Stockwell, Timothy B.; Crabtree, Jonathan; Huang, Jiaqi; Kirkness, Ewen; Strausberg, Robert L.; Frazier, Marvin E.; Venter, J. Craig; Kravitz, Saul; Levy, Samuel

    2009-01-01

    The HuRef Genome Browser is a web application for the navigation and analysis of the previously published genome of a human individual, termed HuRef. The browser provides a comparative view between the NCBI human reference sequence and the HuRef assembly, and it enables the navigation of the HuRef genome in the context of HuRef, NCBI and Ensembl annotations. Single nucleotide polymorphisms, indels, inversions, structural and copy-number variations are shown in the context of existing functional annotations on either genome in the comparative view. Demonstrated here are some potential uses of the browser to enable a better understanding of individual human genetic variation. The browser provides full access to the underlying reads with sequence and quality information, the genome assembly and the evidence supporting the identification of DNA polymorphisms. The HuRef Browser is a unique and versatile tool for browsing genome assemblies and studying individual human sequence variation in a diploid context. The browser is available online at http://huref.jcvi.org. PMID:19036787

  10. The HuRef Browser: a web resource for individual human genomics.

    PubMed

    Axelrod, Nelson; Lin, Yuan; Ng, Pauline C; Stockwell, Timothy B; Crabtree, Jonathan; Huang, Jiaqi; Kirkness, Ewen; Strausberg, Robert L; Frazier, Marvin E; Venter, J Craig; Kravitz, Saul; Levy, Samuel

    2009-01-01

    The HuRef Genome Browser is a web application for the navigation and analysis of the previously published genome of a human individual, termed HuRef. The browser provides a comparative view between the NCBI human reference sequence and the HuRef assembly, and it enables the navigation of the HuRef genome in the context of HuRef, NCBI and Ensembl annotations. Single nucleotide polymorphisms, indels, inversions, structural and copy-number variations are shown in the context of existing functional annotations on either genome in the comparative view. Demonstrated here are some potential uses of the browser to enable a better understanding of individual human genetic variation. The browser provides full access to the underlying reads with sequence and quality information, the genome assembly and the evidence supporting the identification of DNA polymorphisms. The HuRef Browser is a unique and versatile tool for browsing genome assemblies and studying individual human sequence variation in a diploid context. The browser is available online at http://huref.jcvi.org.

  11. WormBase ParaSite - a comprehensive resource for helminth genomics.

    PubMed

    Howe, Kevin L; Bolt, Bruce J; Shafie, Myriam; Kersey, Paul; Berriman, Matthew

    2016-11-27

    The number of publicly available parasitic worm genome sequences has increased dramatically in the past three years, and research interest in helminth functional genomics is now quickly gathering pace in response to the foundation that has been laid by these collective efforts. A systematic approach to the organisation, curation, analysis and presentation of these data is clearly vital for maximising the utility of these data to researchers. We have developed a portal called WormBase ParaSite (http://parasite.wormbase.org) for interrogating helminth genomes on a large scale. Data from over 100 nematode and platyhelminth species are integrated, adding value by way of systematic and consistent functional annotation (e.g. protein domains and Gene Ontology terms), gene expression analysis (e.g. alignment of life-stage specific transcriptome data sets), and comparative analysis (e.g. orthologues and paralogues). We provide several ways of exploring the data, including genome browsers, genome and gene summary pages, text search, sequence search, a query wizard, bulk downloads, and programmatic interfaces. In this review, we provide an overview of the back-end infrastructure and analysis behind WormBase ParaSite, and the displays and tools available to users for interrogating helminth genomic data.

  12. Specific patterns of gene space organisation revealed in wheat by using the combination of barley and wheat genomic resources

    PubMed Central

    2010-01-01

    Background Because of its size, allohexaploid nature and high repeat content, the wheat genome has always been perceived as too complex for efficient molecular studies. We recently constructed the first physical map of a wheat chromosome (3B). However gene mapping is still laborious in wheat because of high redundancy between the three homoeologous genomes. In contrast, in the closely related diploid species, barley, numerous gene-based markers have been developed. This study aims at combining the unique genomic resources developed in wheat and barley to decipher the organisation of gene space on wheat chromosome 3B. Results Three dimensional pools of the minimal tiling path of wheat chromosome 3B physical map were hybridised to a barley Agilent 15K expression microarray. This led to the fine mapping of 738 barley orthologous genes on wheat chromosome 3B. In addition, comparative analyses revealed that 68% of the genes identified were syntenic between the wheat chromosome 3B and barley chromosome 3 H and 59% between wheat chromosome 3B and rice chromosome 1, together with some wheat-specific rearrangements. Finally, it indicated an increasing gradient of gene density from the centromere to the telomeres positively correlated with the number of genes clustered in islands on wheat chromosome 3B. Conclusion Our study shows that novel structural genomics resources now available in wheat and barley can be combined efficiently to overcome specific problems of genetic anchoring of physical contigs in wheat and to perform high-resolution comparative analyses with rice for deciphering the organisation of the wheat gene space. PMID:21167071

  13. Specific patterns of gene space organisation revealed in wheat by using the combination of barley and wheat genomic resources.

    PubMed

    Rustenholz, Camille; Hedley, Pete E; Morris, Jenny; Choulet, Frédéric; Feuillet, Catherine; Waugh, Robbie; Paux, Etienne

    2010-12-19

    Because of its size, allohexaploid nature and high repeat content, the wheat genome has always been perceived as too complex for efficient molecular studies. We recently constructed the first physical map of a wheat chromosome (3B). However gene mapping is still laborious in wheat because of high redundancy between the three homoeologous genomes. In contrast, in the closely related diploid species, barley, numerous gene-based markers have been developed. This study aims at combining the unique genomic resources developed in wheat and barley to decipher the organisation of gene space on wheat chromosome 3B. Three dimensional pools of the minimal tiling path of wheat chromosome 3B physical map were hybridised to a barley Agilent 15K expression microarray. This led to the fine mapping of 738 barley orthologous genes on wheat chromosome 3B. In addition, comparative analyses revealed that 68% of the genes identified were syntenic between the wheat chromosome 3B and barley chromosome 3 H and 59% between wheat chromosome 3B and rice chromosome 1, together with some wheat-specific rearrangements. Finally, it indicated an increasing gradient of gene density from the centromere to the telomeres positively correlated with the number of genes clustered in islands on wheat chromosome 3B. Our study shows that novel structural genomics resources now available in wheat and barley can be combined efficiently to overcome specific problems of genetic anchoring of physical contigs in wheat and to perform high-resolution comparative analyses with rice for deciphering the organisation of the wheat gene space.

  14. Leaf area dynamics of conifer forests

    SciTech Connect

    Margolis, H.; Oren, R.; Whitehead, D.; Kaufmann, M.R.

    1995-07-01

    Estimating the surface area of foliage supported by a coniferous forest canopy is critical for modeling its biological properties. Leaf area represents the surface area available for the interception of energy, the absorption of carbon dioxide, and the diffusion of water from the leaf to the atmosphere. The concept of leaf area is pertinent to the physiological and ecological dynamics of conifers at a wide range of spatial scales, from individual leaves to entire biomes. In fact, the leaf area of vegetation at a global level can be thought of as a carbon-absorbing, water-emitting membrane of variable thickness, which can have an important influence on the dynamics and chemistry of the Earth`s atmosphere over both the short and the long term. Unless otherwise specified, references to leaf area herein refer to projected leaf area, i.e., the vertical projection of needles placed on a flat plane. Total leaf surface area is generally from 2.0 to 3.14 times that of projected leaf area for conifers. It has recently been suggested that hemisurface leaf area, i.e., one-half of the total surface area of a leaf, a more useful basis for expressing leaf area than is projected area. This chapter is concerned with the dynamics of coniferous forest leaf area at different spatial and temporal scales. In the first part, we consider various hypotheses related to the control of leaf area development, ranging from simple allometric relations with tree size to more complex mechanistic models that consider the movement of water and nutrients to tree canopies. In the second part, we consider various aspects of leaf area dynamics at varying spatial and temporal scales, including responses to perturbation, seasonal dynamics, genetic variation in crown architecture, the responses to silvicultural treatments, the causes and consequences of senescence, and the direct measurement of coniferous leaf area at large spatial scales using remote sensing.

  15. Conifer Decline and Mortality in Siberia

    NASA Astrophysics Data System (ADS)

    Kharuk, V.; Im, S.; Ranson, K.

    2015-12-01

    "Dark needle conifer" (DNC: Abies sibirica, Pinus sibirica and Picea obovata) decline and mortality increase were documented in Russia during recent decades. Here we analyzed causes and scale of Siberian pine and fir mortality in Altai-Sayan and Baikal Lake Regions and West Siberian Plane based on in situdata and remote sensing (QuickBird, Landsat, GRACE). Geographically, mortality began on the margins of the DNC range (i.e., within the forest-steppe and conifer-broadleaf ecotones) and on terrain features with maximal water stress risk (narrow-shaped hilltops, convex steep south facing slopes, shallow well-drained soils). Within ridges, mortality occurred mainly along mountain passes, where stands faced drying winds. Regularly mortality was observed to decrease with elevation increase with the exception of Baikal Lake Mountains, where it was minimal near the lake shore and increased with elevation (up to about 1000 m a.s.l.). Siberian pine and fir mortality followed a drying trend with consecutive droughts since the 1980s. Dendrochronology analysis showed that mortality was correlated with vapor pressure deficit increase, drought index, soil moisture decrease and occurrence of late frosts. In Baikal region Siberian pine mortality correlated with Baikal watershed meteorological variables. An impact of previous year climate conditions on the current growth was found (r2 = 0.6). Thus, water-stressed trees became sensitive to bark beetles and fungi impact (including Polygraphus proximus and Heterobasidion annosum). At present, an increase in mortality is observed within the majority of DNC range. Results obtained also showed a primary role of water stress in that phenomenon with a secondary role of bark beetles and fungi attacks. In future climate with increased drought severity and frequency Siberian pine and fir will partly disappear from its current range, and will be substituted by drought-tolerant species (e.g., Pinus silvestris, Larix sibirica).

  16. Source-sink-storage relationships of conifers

    SciTech Connect

    Luxmoore, R.J.; Oren, R.; Sheriff, D.W.; Thomas, R.B.

    1995-07-01

    Irradiance, air temperature, saturation vapor pressure deficit, and soil temperature vary in association with Earth`s daily rotation, inducing significant hourly changes in the rates of plant physiological processes. These processes include carbon fixation in photosynthesis, sucrose translocation, and carbon utilization in growth, storage, and respiration. The sensitivity of these physiological processes to environmental factors such as temperature, soil water availability, and nutrient supply reveals differences that must be viewed as an interactive whole in order to comprehend whole-plant responses to the environment. Integrative frameworks for relationships between plant physiological processes are needed to provide syntheses of plant growth and development. Source-sink-storage relationships, addressed in this chapter, provide one framework for synthesis of whole-plant responses to external environmental variables. To address this issue, some examples of carbon assimilation and utilization responses of five conifer species to environmental factors from a range of field environments are first summarized. Next, the interactions between sources, sinks, and storages of carbon are examined at the leaf and tree scales, and finally, the review evaluates the proposition that processes involved with carbon utilization (sink activity) are more sensitive to the supply of water and nutrients (particularly nitrogen) than are the processes of carbon gain (source activity) and carbon storage. The terms {open_quotes}sink{close_quotes} and {open_quotes}source{close_quotes} refer to carbon utilization and carbon gain, respectively. The relative roles of stored carbon reserves and of current photosynthate in meeting sink demand are addressed. Discussions focus on source-sink-storage relationships within the diurnal, wetting-drying, and annual cycles of conifer growth and development, and some discussion of life cycle aspects is also presented.

  17. The Earliest Evidence of Holometabolan Insect Pupation in Conifer Wood

    PubMed Central

    Tapanila, Leif; Roberts, Eric M.

    2012-01-01

    Background The pre-Jurassic record of terrestrial wood borings is poorly resolved, despite body fossil evidence of insect diversification among xylophilic clades starting in the late Paleozoic. Detailed analysis of borings in petrified wood provides direct evidence of wood utilization by invertebrate animals, which typically comprises feeding behaviors. Methodology/Principal Findings We describe a U-shaped boring in petrified wood from the Late Triassic Chinle Formation of southern Utah that demonstrates a strong linkage between insect ontogeny and conifer wood resources. Xylokrypta durossi new ichnogenus and ichnospecies is a large excavation in wood that is backfilled with partially digested xylem, creating a secluded chamber. The tracemaker exited the chamber by way of a small vertical shaft. This sequence of behaviors is most consistent with the entrance of a larva followed by pupal quiescence and adult emergence — hallmarks of holometabolous insect ontogeny. Among the known body fossil record of Triassic insects, cupedid beetles (Coleoptera: Archostemata) are deemed the most plausible tracemakers of Xylokrypta, based on their body size and modern xylobiotic lifestyle. Conclusions/Significance This oldest record of pupation in fossil wood provides an alternative interpretation to borings once regarded as evidence for Triassic bees. Instead Xylokrypta suggests that early archostematan beetles were leaders in exploiting wood substrates well before modern clades of xylophages arose in the late Mesozoic. PMID:22355387

  18. A genome sequence resource for the aye-aye (Daubentonia madagascariensis), a nocturnal lemur from Madagascar.

    PubMed

    Perry, George H; Reeves, Darryl; Melsted, Páll; Ratan, Aakrosh; Miller, Webb; Michelini, Katelyn; Louis, Edward E; Pritchard, Jonathan K; Mason, Christopher E; Gilad, Yoav

    2012-01-01

    We present a high-coverage draft genome assembly of the aye-aye (Daubentonia madagascariensis), a highly unusual nocturnal primate from Madagascar. Our assembly totals ~3.0 billion bp (3.0 Gb), roughly the size of the human genome, comprised of ~2.6 million scaffolds (N50 scaffold size = 13,597 bp) based on short paired-end sequencing reads. We compared the aye-aye genome sequence data with four other published primate genomes (human, chimpanzee, orangutan, and rhesus macaque) as well as with the mouse and dog genomes as nonprimate outgroups. Unexpectedly, we observed strong evidence for a relatively slow substitution rate in the aye-aye lineage compared with these and other primates. In fact, the aye-aye branch length is estimated to be ~10% shorter than that of the human lineage, which is known for its low substitution rate. This finding may be explained, in part, by the protracted aye-aye life-history pattern, including late weaning and age of first reproduction relative to other lemurs. Additionally, the availability of this draft lemur genome sequence allowed us to polarize nucleotide and protein sequence changes to the ancestral primate lineage-a critical period in primate evolution, for which the relevant fossil record is sparse. Finally, we identified 293,800 high-confidence single nucleotide polymorphisms in the donor individual for our aye-aye genome sequence, a captive-born individual from two wild-born parents. The resulting heterozygosity estimate of 0.051% is the lowest of any primate studied to date, which is understandable considering the aye-aye's extensive home-range size and relatively low population densities. Yet this level of genetic diversity also suggests that conservation efforts benefiting this unusual species should be prioritized, especially in the face of the accelerating degradation and fragmentation of Madagascar's forests.

  19. A Genome Sequence Resource for the Aye-Aye (Daubentonia madagascariensis), a Nocturnal Lemur from Madagascar

    PubMed Central

    Perry, George H.; Reeves, Darryl; Melsted, Páll; Ratan, Aakrosh; Miller, Webb; Michelini, Katelyn; Louis, Edward E.; Pritchard, Jonathan K.; Mason, Christopher E.; Gilad, Yoav

    2012-01-01

    We present a high-coverage draft genome assembly of the aye-aye (Daubentonia madagascariensis), a highly unusual nocturnal primate from Madagascar. Our assembly totals ∼3.0 billion bp (3.0 Gb), roughly the size of the human genome, comprised of ∼2.6 million scaffolds (N50 scaffold size = 13,597 bp) based on short paired-end sequencing reads. We compared the aye-aye genome sequence data with four other published primate genomes (human, chimpanzee, orangutan, and rhesus macaque) as well as with the mouse and dog genomes as nonprimate outgroups. Unexpectedly, we observed strong evidence for a relatively slow substitution rate in the aye-aye lineage compared with these and other primates. In fact, the aye-aye branch length is estimated to be ∼10% shorter than that of the human lineage, which is known for its low substitution rate. This finding may be explained, in part, by the protracted aye-aye life-history pattern, including late weaning and age of first reproduction relative to other lemurs. Additionally, the availability of this draft lemur genome sequence allowed us to polarize nucleotide and protein sequence changes to the ancestral primate lineage—a critical period in primate evolution, for which the relevant fossil record is sparse. Finally, we identified 293,800 high-confidence single nucleotide polymorphisms in the donor individual for our aye-aye genome sequence, a captive-born individual from two wild-born parents. The resulting heterozygosity estimate of 0.051% is the lowest of any primate studied to date, which is understandable considering the aye-aye's extensive home-range size and relatively low population densities. Yet this level of genetic diversity also suggests that conservation efforts benefiting this unusual species should be prioritized, especially in the face of the accelerating degradation and fragmentation of Madagascar's forests. PMID:22155688

  20. Generation of an integrated Hieracium genomic and transcriptomic resource enables exploration of small RNA pathways during apomixis initiation.

    PubMed

    Rabiger, David S; Taylor, Jennifer M; Spriggs, Andrew; Hand, Melanie L; Henderson, Steven T; Johnson, Susan D; Oelkers, Karsten; Hrmova, Maria; Saito, Keisuke; Suzuki, Go; Mukai, Yasuhiko; Carroll, Bernard J; Koltunow, Anna M G

    2016-10-06

    Application of apomixis, or asexual seed formation, in crop breeding would allow rapid fixation of complex traits, economizing improved crop delivery. Identification of apomixis genes is confounded by the polyploid nature, high genome complexity and lack of genomic sequence integration with reproductive tissue transcriptomes in most apomicts. A genomic and transcriptomic resource was developed for Hieracium subgenus Pilosella (Asteraceae) which incorporates characterized sexual, apomictic and mutant apomict plants exhibiting reversion to sexual reproduction. Apomicts develop additional female gametogenic cells that suppress the sexual pathway in ovules. Disrupting small RNA pathways in sexual Arabidopsis also induces extra female gametogenic cells; therefore, the resource was used to examine if changes in small RNA pathways correlate with apomixis initiation. An initial characterization of small RNA pathway genes within Hieracium was undertaken, and ovary-expressed ARGONAUTE genes were identified and cloned. Comparisons of whole ovary transcriptomes from mutant apomicts, relative to the parental apomict, revealed that differentially expressed genes were enriched for processes involved in small RNA biogenesis and chromatin silencing. Small RNA profiles within mutant ovaries did not reveal large-scale alterations in composition or length distributions; however, a small number of differentially expressed, putative small RNA targets were identified. The established Hieracium resource represents a substantial contribution towards the investigation of early sexual and apomictic female gamete development, and the generation of new candidate genes and markers. Observed changes in small RNA targets and biogenesis pathways within sexual and apomictic ovaries will underlie future functional research into apomixis initiation in Hieracium.

  1. Construction of genome-wide physical BAC contigs using mapped cDNA as probes: Toward an integrated BAC library resource for genome sequencing and analysis. Annual report, July 1995--January 1997

    SciTech Connect

    Mitchell, S.C.; Bocskai, D.; Cao, Y.

    1997-12-31

    The goal of human genome project is to characterize and sequence entire genomes of human and several model organisms, thus providing complete sets of information on the entire structure of transcribed, regulatory and other functional regions for these organisms. In the past years, a number of useful genetic and physical markers on human and mouse genomes have been made available along with the advent of BAC library resources for these organisms. The advances in technology and resource development made it feasible to efficiently construct genome-wide physical BAC contigs for human and other genomes. Currently, over 30,000 mapped STSs and 27,000 mapped Unigenes are available for human genome mapping. ESTs and cDNAs are excellent resources for building contig maps for two reasons. Firstly, they exist in two alternative forms--as both sequence information for PCR primer pairs, and cDoreen genomic libraries efficiently for large number of DNA probes by combining over 100 cDNA probes in each hybridization. Second, the linkage and order of genes are rather conserved among human, mouse and other model organisms. Therefore, gene markers have advantages over random anonymous STSs in building maps for comparative genomic studies.

  2. Tall shrub layer biomass in conifer plantations in northeastern Minnesota.

    Treesearch

    Lewis F. Ohmann

    1982-01-01

    Provides estimates of biomass (pounds/acre) for tall shrub species in 53 conifer plantations in northeastern Minnesota. The estimates are analyzed by plantation age and silvicultural practices used to establish and release the plantations.

  3. Silviculture of mixed conifer forests in eastern Oregon and Washington.

    Treesearch

    K.W. Seidel; P.H. Cochran

    1981-01-01

    The silviculture of mixed conifer forests in eastern Oregon and Washington is described. Topics discussed include ecological setting, damaging agents, silviculture, and management. The relevant literature is presented, along with unpublished research, experience, and observations. Research needs are also proposed.

  4. Expression Divergence Is Correlated with Sequence Evolution but Not Positive Selection in Conifers.

    PubMed

    Hodgins, Kathryn A; Yeaman, Sam; Nurkowski, Kristin A; Rieseberg, Loren H; Aitken, Sally N

    2016-06-01

    The evolutionary and genomic determinants of sequence evolution in conifers are poorly understood, and previous studies have found only limited evidence for positive selection. Using RNAseq data, we compared gene expression profiles to patterns of divergence and polymorphism in 44 seedlings of lodgepole pine (Pinus contorta) and 39 seedlings of interior spruce (Picea glauca × engelmannii) to elucidate the evolutionary forces that shape their genomes and their plastic responses to abiotic stress. We found that rapidly diverging genes tend to have greater expression divergence, lower expression levels, reduced levels of synonymous site diversity, and longer proteins than slowly diverging genes. Similar patterns were identified for the untranslated regions, but with some exceptions. We found evidence that genes with low expression levels had a larger fraction of nearly neutral sites, suggesting a primary role for negative selection in determining the association between evolutionary rate and expression level. There was limited evidence for differences in the rate of positive selection among genes with divergent versus conserved expression profiles and some evidence supporting relaxed selection in genes diverging in expression between the species. Finally, we identified a small number of genes that showed evidence of site-specific positive selection using divergence data alone. However, estimates of the proportion of sites fixed by positive selection (α) were in the range of other plant species with large effective population sizes suggesting relatively high rates of adaptive divergence among conifers.

  5. Rat Genome Database: a unique resource for rat, human, and mouse quantitative trait locus data.

    PubMed

    Nigam, Rajni; Laulederkind, Stanley J F; Hayman, G Thomas; Smith, Jennifer R; Wang, Shur-Jen; Lowry, Timothy F; Petri, Victoria; De Pons, Jeff; Tutaj, Marek; Liu, Weisong; Jayaraman, Pushkala; Munzenmaier, Diane H; Worthey, Elizabeth A; Dwinell, Melinda R; Shimoyama, Mary; Jacob, Howard J

    2013-09-16

    The rat has been widely used as a disease model in a laboratory setting, resulting in an abundance of genetic and phenotype data from a wide variety of studies. These data can be found at the Rat Genome Database (RGD, http://rgd.mcw.edu/), which provides a platform for researchers interested in linking genomic variations to phenotypes. Quantitative trait loci (QTLs) form one of the earliest and core datasets, allowing researchers to identify loci harboring genes associated with disease. These QTLs are not only important for those using the rat to identify genes and regions associated with disease, but also for cross-organism analyses of syntenic regions on the mouse and the human genomes to identify potential regions for study in these organisms. Currently, RGD has data on >1,900 rat QTLs that include details about the methods and animals used to determine the respective QTL along with the genomic positions and markers that define the region. RGD also curates human QTLs (>1,900) and houses>4,000 mouse QTLs (imported from Mouse Genome Informatics). Multiple ontologies are used to standardize traits, phenotypes, diseases, and experimental methods to facilitate queries, analyses, and cross-organism comparisons. QTLs are visualized in tools such as GBrowse and GViewer, with additional tools for analysis of gene sets within QTL regions. The QTL data at RGD provide valuable information for the study of mapped phenotypes and identification of candidate genes for disease associations.

  6. Assembling the 20 Gb white spruce (Picea glauca) genome from whole-genome shotgun sequencing data.

    PubMed

    Birol, Inanc; Raymond, Anthony; Jackman, Shaun D; Pleasance, Stephen; Coope, Robin; Taylor, Greg A; Yuen, Macaire Man Saint; Keeling, Christopher I; Brand, Dana; Vandervalk, Benjamin P; Kirk, Heather; Pandoh, Pawan; Moore, Richard A; Zhao, Yongjun; Mungall, Andrew J; Jaquish, Barry; Yanchuk, Alvin; Ritland, Carol; Boyle, Brian; Bousquet, Jean; Ritland, Kermit; Mackay, John; Bohlmann, Jörg; Jones, Steven J M

    2013-06-15

    White spruce (Picea glauca) is a dominant conifer of the boreal forests of North America, and providing genomics resources for this commercially valuable tree will help improve forest management and conservation efforts. Sequencing and assembling the large and highly repetitive spruce genome though pushes the boundaries of the current technology. Here, we describe a whole-genome shotgun sequencing strategy using two Illumina sequencing platforms and an assembly approach using the ABySS software. We report a 20.8 giga base pairs draft genome in 4.9 million scaffolds, with a scaffold N50 of 20,356 bp. We demonstrate how recent improvements in the sequencing technology, especially increasing read lengths and paired end reads from longer fragments have a major impact on the assembly contiguity. We also note that scalable bioinformatics tools are instrumental in providing rapid draft assemblies. The Picea glauca genome sequencing and assembly data are available through NCBI (Accession#: ALWZ0100000000 PID: PRJNA83435). http://www.ncbi.nlm.nih.gov/bioproject/83435.

  7. Radiometric measurements of gap probability in conifer tree canopies

    NASA Technical Reports Server (NTRS)

    Albers, Bryan J.; Strahler, Alan H.; Li, Xiaowen; Liang, Shunlin; Clarke, Keith C.

    1990-01-01

    Measurements of gap probability were made for some moderate-sized, open-grown conifers of varying species. Results of the radiometric analysis show that the gap probability, which is taken as the mean of the binomial, fits well a negative exponential function of a path length. The conifer shadow, then, is an object of almost uniform darkness with some bright holes or gaps that are found near the shadow's edge and rapidly disappear toward the shadows center.

  8. Data set on the effects of conifer control and slash burning to soil carbon, total N, organic matter and extractable micro-nutrients

    USDA-ARS?s Scientific Manuscript database

    Conifer control in sagebrush steppe of the western United States causes various levels of site disturbance influencing vegetation recovery and resource availability. The data presented in this article are growing season availability of soil micronutrients and levels of total soil carbon, organic ma...

  9. Developing a Tissue Resource to Characterize the Genome of Pancreatic Cancer

    PubMed Central

    Voidonikolas, Georgios; Gingras, Marie-Claude; Hodges, Sally; McGuire, Amy L.; Chen, Changyi; Gibbs, Richard A.; Brunicardi, F. Charles; Fisher, William E.

    2010-01-01

    With recent advances in DNA sequencing technology, medicine is entering an era in which a personalized genomic approach to diagnosis and treatment of disease is now feasible. However, discovering the role of altered DNA sequences in various disease states will be a challenging task. The genomic approach offers great promise for diseases like pancreatic cancer in which the effect of current diagnostic and treatment modalities is disappointing. To facilitate the characterization of the genome of pancreatic cancer, high quality and well annotated tissue repositories are needed. This article summarizes basic principles guiding the creation of such a repository including sample processing and preservation techniques, sample size and composition, and collection of clinical data elements. PMID:19137368

  10. Comparative genomics reveals conservative evolution of the xylem transcriptome in vascular plants

    PubMed Central

    2010-01-01

    Background Wood is a valuable natural resource and a major carbon sink. Wood formation is an important developmental process in vascular plants which played a crucial role in plant evolution. Although genes involved in xylem formation have been investigated, the molecular mechanisms of xylem evolution are not well understood. We use comparative genomics to examine evolution of the xylem transcriptome to gain insights into xylem evolution. Results The xylem transcriptome is highly conserved in conifers, but considerably divergent in angiosperms. The functional domains of genes in the xylem transcriptome are moderately to highly conserved in vascular plants, suggesting the existence of a common ancestral xylem transcriptome. Compared to the total transcriptome derived from a range of tissues, the xylem transcriptome is relatively conserved in vascular plants. Of the xylem transcriptome, cell wall genes, ancestral xylem genes, known proteins and transcription factors are relatively more conserved in vascular plants. A total of 527 putative xylem orthologs were identified, which are unevenly distributed across the Arabidopsis chromosomes with eight hot spots observed. Phylogenetic analysis revealed that evolution of the xylem transcriptome has paralleled plant evolution. We also identified 274 conifer-specific xylem unigenes, all of which are of unknown function. These xylem orthologs and conifer-specific unigenes are likely to have played a crucial role in xylem evolution. Conclusions Conifers have highly conserved xylem transcriptomes, while angiosperm xylem transcriptomes are relatively diversified. Vascular plants share a common ancestral xylem transcriptome. The xylem transcriptomes of vascular plants are more conserved than the total transcriptomes. Evolution of the xylem transcriptome has largely followed the trend of plant evolution. PMID:20565927

  11. Comparative genomics reveals conservative evolution of the xylem transcriptome in vascular plants.

    PubMed

    Li, Xinguo; Wu, Harry X; Southerton, Simon G

    2010-06-21

    Wood is a valuable natural resource and a major carbon sink. Wood formation is an important developmental process in vascular plants which played a crucial role in plant evolution. Although genes involved in xylem formation have been investigated, the molecular mechanisms of xylem evolution are not well understood. We use comparative genomics to examine evolution of the xylem transcriptome to gain insights into xylem evolution. The xylem transcriptome is highly conserved in conifers, but considerably divergent in angiosperms. The functional domains of genes in the xylem transcriptome are moderately to highly conserved in vascular plants, suggesting the existence of a common ancestral xylem transcriptome. Compared to the total transcriptome derived from a range of tissues, the xylem transcriptome is relatively conserved in vascular plants. Of the xylem transcriptome, cell wall genes, ancestral xylem genes, known proteins and transcription factors are relatively more conserved in vascular plants. A total of 527 putative xylem orthologs were identified, which are unevenly distributed across the Arabidopsis chromosomes with eight hot spots observed. Phylogenetic analysis revealed that evolution of the xylem transcriptome has paralleled plant evolution. We also identified 274 conifer-specific xylem unigenes, all of which are of unknown function. These xylem orthologs and conifer-specific unigenes are likely to have played a crucial role in xylem evolution. Conifers have highly conserved xylem transcriptomes, while angiosperm xylem transcriptomes are relatively diversified. Vascular plants share a common ancestral xylem transcriptome. The xylem transcriptomes of vascular plants are more conserved than the total transcriptomes. Evolution of the xylem transcriptome has largely followed the trend of plant evolution.

  12. 290 metagenome-assembled genomes from the Mediterranean Sea: a resource for marine microbiology

    PubMed Central

    Sachdeva, Rohan; Graham, Elaina D.; Heidelberg, John F.

    2017-01-01

    The Tara Oceans Expedition has provided large, publicly-accessible microbial metagenomic datasets from a circumnavigation of the globe. Utilizing several size fractions from the samples originating in the Mediterranean Sea, we have used current assembly and binning techniques to reconstruct 290 putative draft metagenome-assembled bacterial and archaeal genomes, with an estimated completion of ≥50%, and an additional 2,786 bins, with estimated completion of 0–50%. We have submitted our results, including initial taxonomic and phylogenetic assignments, for the putative draft genomes to open-access repositories for the scientific community to use in ongoing research. PMID:28713657

  13. 290 metagenome-assembled genomes from the Mediterranean Sea: a resource for marine microbiology.

    PubMed

    Tully, Benjamin J; Sachdeva, Rohan; Graham, Elaina D; Heidelberg, John F

    2017-01-01

    The Tara Oceans Expedition has provided large, publicly-accessible microbial metagenomic datasets from a circumnavigation of the globe. Utilizing several size fractions from the samples originating in the Mediterranean Sea, we have used current assembly and binning techniques to reconstruct 290 putative draft metagenome-assembled bacterial and archaeal genomes, with an estimated completion of ≥50%, and an additional 2,786 bins, with estimated completion of 0-50%. We have submitted our results, including initial taxonomic and phylogenetic assignments, for the putative draft genomes to open-access repositories for the scientific community to use in ongoing research.

  14. The DNA Data Bank of Japan launches a new resource, the DDBJ Omics Archive of functional genomics experiments.

    PubMed

    Kodama, Yuichi; Mashima, Jun; Kaminuma, Eli; Gojobori, Takashi; Ogasawara, Osamu; Takagi, Toshihisa; Okubo, Kousaku; Nakamura, Yasukazu

    2012-01-01

    The DNA Data Bank of Japan (DDBJ; http://www.ddbj.nig.ac.jp) maintains and provides archival, retrieval and analytical resources for biological information. The central DDBJ resource consists of public, open-access nucleotide sequence databases including raw sequence reads, assembly information and functional annotation. Database content is exchanged with EBI and NCBI within the framework of the International Nucleotide Sequence Database Collaboration (INSDC). In 2011, DDBJ launched two new resources: the 'DDBJ Omics Archive' (DOR; http://trace.ddbj.nig.ac.jp/dor) and BioProject (http://trace.ddbj.nig.ac.jp/bioproject). DOR is an archival database of functional genomics data generated by microarray and highly parallel new generation sequencers. Data are exchanged between the ArrayExpress at EBI and DOR in the common MAGE-TAB format. BioProject provides an organizational framework to access metadata about research projects and the data from the projects that are deposited into different databases. In this article, we describe major changes and improvements introduced to the DDBJ services, and the launch of two new resources: DOR and BioProject.

  15. The MiST2 database: a comprehensive genomics resource on microbial signal transduction

    PubMed Central

    Ulrich, Luke E.; Zhulin, Igor B.

    2010-01-01

    The MiST2 database (http://mistdb.com) identifies and catalogs the repertoire of signal transduction proteins in microbial genomes. Signal transduction systems regulate the majority of cellular activities including the metabolism, development, host-recognition, biofilm production, virulence, and antibiotic resistance of human pathogens. Thus, knowledge of the proteins and interactions that comprise these communication networks is an essential component to furthering biomedical discovery. These are identified by searching protein sequences for specific domain profiles that implicate a protein in signal transduction. Compared to the previous version of the database, MiST2 contains a host of new features and improvements including the following: draft genomes; extracytoplasmic function (ECF) sigma factor protein identification; enhanced classification of signaling proteins; novel, high-quality domain models for identifying histidine kinases and response regulators; neighboring two-component genes; gene cart; better search capabilities; enhanced taxonomy browser; advanced genome browser; and a modern, biologist-friendly web interface. MiST2 currently contains 966 complete and 157 draft bacterial and archaeal genomes, which collectively contain more than 245 000 signal transduction proteins. The majority (66%) of these are one-component systems, followed by two-component proteins (26%), chemotaxis (6%), and finally ECF factors (2%). PMID:19900966

  16. Whole genome sequencing of elite rice cultivars as a comprehensive information resource for marker assisted selection

    USDA-ARS?s Scientific Manuscript database

    Current advances in sequencing technologies and bioinformatics allow to determine a nearly complete genomic background of rice, a staple food for the poor people. Consequently, comprehensive databases of variation among thousands of varieties is currently being assembled and released. Proper analysi...

  17. The porcine translational research database: A manually curated, genomics and proteomics-based research resource

    USDA-ARS?s Scientific Manuscript database

    The use of swine in biomedical research has increased dramatically in the last decade. Diverse genomic- and proteomic databases have been developed to facilitate research using human and rodent models. Current porcine gene databases, however, lack the robust annotation to study pig models that are...

  18. ZikaVR: An Integrated Zika Virus Resource for Genomics, Proteomics, Phylogenetic and Therapeutic Analysis.

    PubMed

    Gupta, Amit Kumar; Kaur, Karambir; Rajput, Akanksha; Dhanda, Sandeep Kumar; Sehgal, Manika; Khan, Md Shoaib; Monga, Isha; Dar, Showkat Ahmad; Singh, Sandeep; Nagpal, Gandharva; Usmani, Salman Sadullah; Thakur, Anamika; Kaur, Gazaldeep; Sharma, Shivangi; Bhardwaj, Aman; Qureshi, Abid; Raghava, Gajendra Pal Singh; Kumar, Manoj

    2016-09-16

    Current Zika virus (ZIKV) outbreaks that spread in several areas of Africa, Southeast Asia, and in pacific islands is declared as a global health emergency by World Health Organization (WHO). It causes Zika fever and illness ranging from severe autoimmune to neurological complications in humans. To facilitate research on this virus, we have developed an integrative multi-omics platform; ZikaVR (http://bioinfo.imtech.res.in/manojk/zikavr/), dedicated to the ZIKV genomic, proteomic and therapeutic knowledge. It comprises of whole genome sequences, their respective functional information regarding proteins, genes, and structural content. Additionally, it also delivers sophisticated analysis such as whole-genome alignments, conservation and variation, CpG islands, codon context, usage bias and phylogenetic inferences at whole genome and proteome level with user-friendly visual environment. Further, glycosylation sites and molecular diagnostic primers were also analyzed. Most importantly, we also proposed potential therapeutically imperative constituents namely vaccine epitopes, siRNAs, miRNAs, sgRNAs and repurposing drug candidates.

  19. Draft genome sequence of chickpea (Cicer arietinum) provides a resource for trait improvement

    USDA-ARS?s Scientific Manuscript database

    Chickpea (Cicer arietinum) is the world’s second most important grain legume crop, accounting for a significant proportion of human dietary protein and playing a critical role in food security in developing countries. We report the sequence of the ~738 Mb kabuli (CDC Frontier) chickpea genome, which...

  20. The MiST2 database: a comprehensive genomics resource on microbial signal transduction.

    PubMed

    Ulrich, Luke E; Zhulin, Igor B

    2010-01-01

    The MiST2 database (http://mistdb.com) identifies and catalogs the repertoire of signal transduction proteins in microbial genomes. Signal transduction systems regulate the majority of cellular activities including the metabolism, development, host-recognition, biofilm production, virulence, and antibiotic resistance of human pathogens. Thus, knowledge of the proteins and interactions that comprise these communication networks is an essential component to furthering biomedical discovery. These are identified by searching protein sequences for specific domain profiles that implicate a protein in signal transduction. Compared to the previous version of the database, MiST2 contains a host of new features and improvements including the following: draft genomes; extracytoplasmic function (ECF) sigma factor protein identification; enhanced classification of signaling proteins; novel, high-quality domain models for identifying histidine kinases and response regulators; neighboring two-component genes; gene cart; better search capabilities; enhanced taxonomy browser; advanced genome browser; and a modern, biologist-friendly web interface. MiST2 currently contains 966 complete and 157 draft bacterial and archaeal genomes, which collectively contain more than 245 000 signal transduction proteins. The majority (66%) of these are one-component systems, followed by two-component proteins (26%), chemotaxis (6%), and finally ECF factors (2%).

  1. Genome-Wide Discovery and Information Resource Development of DNA Polymorphisms in Cassava

    PubMed Central

    Yoshida, Takuhiro; Akiyama, Kenji; Ishitani, Manabu; Seki, Motoaki; Shinozaki, Kazuo

    2013-01-01

    Cassava (Manihot esculenta Crantz) is an important crop that provides food security and income generation in many tropical countries, and is known for its adaptability to various environmental conditions. Its draft genome sequence and many expressed sequence tags are now publicly available, allowing the development of cassava polymorphism information. Here, we describe the genome-wide discovery of cassava DNA polymorphisms. Using the alignment of predicted transcribed sequences from the cassava draft genome sequence and ESTs from GenBank, we discovered 10,546 single-nucleotide polymorphisms and 647 insertions and deletions. To facilitate molecular marker development for cassava, we designed 9,316 PCR primer pairs to amplify the genomic region around each DNA polymorphism. Of the discovered SNPs, 62.7% occurred in protein-coding regions. Disease-resistance genes were found to have a significantly higher ratio of nonsynonymous-to-synonymous substitutions. We identified 24 read-through (changes of a stop codon to a coding codon) and 38 premature stop (changes of a coding codon to a stop codon) single-nucleotide polymorphisms, and found that the 5 gene ontology terms in biological process were significantly different in genes with read-through single-nucleotide polymorphisms compared with all cassava genes. All data on the discovered DNA polymorphisms were organized into the Cassava Online Archive database, which is available at http://cassava.psc.riken.jp/. PMID:24040164

  2. Function-driven discovery of disease genes in zebrafish using an integrated genomics big data resource.

    PubMed

    Shim, Hongseok; Kim, Ji Hyun; Kim, Chan Yeong; Hwang, Sohyun; Kim, Hyojin; Yang, Sunmo; Lee, Ji Eun; Lee, Insuk

    2016-11-16

    Whole exome sequencing (WES) accelerates disease gene discovery using rare genetic variants, but further statistical and functional evidence is required to avoid false-discovery. To complement variant-driven disease gene discovery, here we present function-driven disease gene discovery in zebrafish (Danio rerio), a promising human disease model owing to its high anatomical and genomic similarity to humans. To facilitate zebrafish-based function-driven disease gene discovery, we developed a genome-scale co-functional network of zebrafish genes, DanioNet (www.inetbio.org/danionet), which was constructed by Bayesian integration of genomics big data. Rigorous statistical assessment confirmed the high prediction capacity of DanioNet for a wide variety of human diseases. To demonstrate the feasibility of the function-driven disease gene discovery using DanioNet, we predicted genes for ciliopathies and performed experimental validation for eight candidate genes. We also validated the existence of heterozygous rare variants in the candidate genes of individuals with ciliopathies yet not in controls derived from the UK10K consortium, suggesting that these variants are potentially involved in enhancing the risk of ciliopathies. These results showed that an integrated genomics big data for a model animal of diseases can expand our opportunity for harnessing WES data in disease gene discovery. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  3. ZikaVR: An Integrated Zika Virus Resource for Genomics, Proteomics, Phylogenetic and Therapeutic Analysis

    PubMed Central

    Gupta, Amit Kumar; Kaur, Karambir; Rajput, Akanksha; Dhanda, Sandeep Kumar; Sehgal, Manika; Khan, Md. Shoaib; Monga, Isha; Dar, Showkat Ahmad; Singh, Sandeep; Nagpal, Gandharva; Usmani, Salman Sadullah; Thakur, Anamika; Kaur, Gazaldeep; Sharma, Shivangi; Bhardwaj, Aman; Qureshi, Abid; Raghava, Gajendra Pal Singh; Kumar, Manoj

    2016-01-01

    Current Zika virus (ZIKV) outbreaks that spread in several areas of Africa, Southeast Asia, and in pacific islands is declared as a global health emergency by World Health Organization (WHO). It causes Zika fever and illness ranging from severe autoimmune to neurological complications in humans. To facilitate research on this virus, we have developed an integrative multi-omics platform; ZikaVR (http://bioinfo.imtech.res.in/manojk/zikavr/), dedicated to the ZIKV genomic, proteomic and therapeutic knowledge. It comprises of whole genome sequences, their respective functional information regarding proteins, genes, and structural content. Additionally, it also delivers sophisticated analysis such as whole-genome alignments, conservation and variation, CpG islands, codon context, usage bias and phylogenetic inferences at whole genome and proteome level with user-friendly visual environment. Further, glycosylation sites and molecular diagnostic primers were also analyzed. Most importantly, we also proposed potential therapeutically imperative constituents namely vaccine epitopes, siRNAs, miRNAs, sgRNAs and repurposing drug candidates. PMID:27633273

  4. Function-driven discovery of disease genes in zebrafish using an integrated genomics big data resource

    PubMed Central

    Shim, Hongseok; Kim, Ji Hyun; Kim, Chan Yeong; Hwang, Sohyun; Kim, Hyojin; Yang, Sunmo; Lee, Ji Eun; Lee, Insuk

    2016-01-01

    Whole exome sequencing (WES) accelerates disease gene discovery using rare genetic variants, but further statistical and functional evidence is required to avoid false-discovery. To complement variant-driven disease gene discovery, here we present function-driven disease gene discovery in zebrafish (Danio rerio), a promising human disease model owing to its high anatomical and genomic similarity to humans. To facilitate zebrafish-based function-driven disease gene discovery, we developed a genome-scale co-functional network of zebrafish genes, DanioNet (www.inetbio.org/danionet), which was constructed by Bayesian integration of genomics big data. Rigorous statistical assessment confirmed the high prediction capacity of DanioNet for a wide variety of human diseases. To demonstrate the feasibility of the function-driven disease gene discovery using DanioNet, we predicted genes for ciliopathies and performed experimental validation for eight candidate genes. We also validated the existence of heterozygous rare variants in the candidate genes of individuals with ciliopathies yet not in controls derived from the UK10K consortium, suggesting that these variants are potentially involved in enhancing the risk of ciliopathies. These results showed that an integrated genomics big data for a model animal of diseases can expand our opportunity for harnessing WES data in disease gene discovery. PMID:27903883

  5. Interspecific chromosome substitution lines as genetic resources for improvement, trait analysis and genomic inference

    USDA-ARS?s Scientific Manuscript database

    The genetic base that cotton breeders commonly use to improve Upland cultivars is very narrow. The AD-genome species G. barbadense, G. tomentosum, and G. mustelinum are part of the primary germplasm pool, too, and constitute genetic reservoirs of genes for resistance to abiotic stress, pests and pa...

  6. Black raspberry genomic and genetic resource development to enable cultivar improvement

    USDA-ARS?s Scientific Manuscript database

    This project incorporates use of phenotypic, genotypic and genomic data to advance and streamline identification of traits of economic interest and to develop molecular markers for marker assisted breeding of black raspberry (Rubus occidentalis L.). A lack of adapted, disease resistant cultivars has...

  7. Development of a D genome specific marker resource for diploid and hexaploid wheat

    USDA-ARS?s Scientific Manuscript database

    Mapping and map-based cloning of genes that control agriculturally and economically important traits remain great challenges for plants with complex highly repetitive genomes such as those of the grass tribe, Triticeae. Mapping limitations in the Triticeae are primarily due to low frequencies of po...

  8. Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop of resource-poor farmers.

    PubMed

    Varshney, Rajeev K; Chen, Wenbin; Li, Yupeng; Bharti, Arvind K; Saxena, Rachit K; Schlueter, Jessica A; Donoghue, Mark T A; Azam, Sarwar; Fan, Guangyi; Whaley, Adam M; Farmer, Andrew D; Sheridan, Jaime; Iwata, Aiko; Tuteja, Reetu; Penmetsa, R Varma; Wu, Wei; Upadhyaya, Hari D; Yang, Shiaw-Pyng; Shah, Trushar; Saxena, K B; Michael, Todd; McCombie, W Richard; Yang, Bicheng; Zhang, Gengyun; Yang, Huanming; Wang, Jun; Spillane, Charles; Cook, Douglas R; May, Gregory D; Xu, Xun; Jackson, Scott A

    2011-11-06

    Pigeonpea is an important legume food crop grown primarily by smallholder farmers in many semi-arid tropical regions of the world. We used the Illumina next-generation sequencing platform to generate 237.2 Gb of sequence, which along with Sanger-based bacterial artificial chromosome end sequences and a genetic map, we assembled into scaffolds representing 72.7% (605.78 Mb) of the 833.07 Mb pigeonpea genome. Genome analysis predicted 48,680 genes for pigeonpea and also showed the potential role that certain gene families, for example, drought tolerance-related genes, have played throughout the domestication of pigeonpea and the evolution of its ancestors. Although we found a few segmental duplication events, we did not observe the recent genome-wide duplication events observed in soybean. This reference genome sequence will facilitate the identification of the genetic basis of agronomically important traits, and accelerate the development of improved pigeonpea varieties that could improve food security in many developing countries.

  9. MICdb3.0: a comprehensive resource of microsatellite repeats from prokaryotic genomes.

    PubMed

    Mudunuri, Suresh B; Patnana, Sujan; Nagarajaram, Hampapathalu A

    2014-01-01

    The MICdb is a comprehensive relational database of perfect microsatellites extracted from completely sequenced and annotated genomes of bacteria and archaea. The current version MICdb3.0 is an updated and revised version of MICdb2.0. As compared with the previous version MICdb2.0, the current release is significantly improved in terms of much larger coverage of genomes, improved presentation of queried results, user-friendly administration module to manage Simple Sequence Repeat (SSR) data such as addition of new genomes, deletion of obsolete data, etc., and also removal of certain features deemed to be redundant. The new web-interface to the database called Microsatellite Analysis Server (MICAS) version 3.0 has been improved by the addition of powerful high-quality visualization tools to view the query results in the form of pie charts and bar graphs. All the query results and graphs can be exported in different formats so that the users can use them for further analysis. MICAS3.0 is also equipped with a unique genome comparison module using which users can do pair-wise comparison of genomes with regard to their microsatellite distribution. The advanced search module can be used to filter the repeats based on certain criteria such as filtering repeats of a particular motif/repeat size, extracting repeats of coding/non-coding regions, sort repeats, etc. The MICdb database has, therefore, been made portable to be administered by a person with the necessary administrative privileges. The MICdb3.0 database and analysis server can be accessed for free from www.cdfd.org.in/micas. Database URL: http://www.cdfd.org.in/micas.

  10. Microsatellite primers resource developed from the mapped sequence scaffolds of Nisqually-1 genome. Submitted to New Phytologist

    SciTech Connect

    Yin, Tongming; ZHANG, Dr. XINYE; Gunter, Lee E; Li, Shuxian; Wullschleger, Stan D; Huang, Prof. Minren; Tuskan, Gerald A

    2009-01-01

    In this study, 148 428 simple sequence repeat (SSR) primer pairs were designed from the unambiguously mapped sequence scaffolds of the Nisqually-1 genome. The physical position of the priming sites were identified along each of the 19 Populus chromosomes, and it was specified whether the priming sequences belong to intronic, intergenic, exonic or UTR regions. A subset of 150 SSR loci were amplified and a high amplification success rate (72%) was obtained in P. tremuloides, which belongs to a divergent subgenus of Populus relative to Nisqually-1. PCR reactions showed that the amplification success rate of exonic primer pairs was much higher than that of the intronic/intergenic primer pairs. Applying ANOVA and regression analyses to the flanking sequences of microsatellites, the repeat lengths, the GC contents of the repeats, the repeat motif numbers, the repeat motif length and the base composition of the repeat motif, it was determined that only the base composition of the repeat motif and the repeat motif length significantly affect the microsatellite variability in P. tremuloides samples. The SSR primer resource developed in this study provides a database for selecting highly transferable SSR markers with known physical position in the Populus genome and provides a comprehensive genetic tool to extend the genome sequence of Nisqually-1 to genetic studies in different Populus species.

  11. A large-scale zebrafish gene knockout resource for the genome-wide study of gene function

    PubMed Central

    Varshney, Gaurav K.; Lu, Jing; Gildea, Derek E.; Huang, Haigen; Pei, Wuhong; Yang, Zhongan; Huang, Sunny C.; Schoenfeld, David; Pho, Nam H.; Casero, David; Hirase, Takashi; Mosbrook-Davis, Deborah; Zhang, Suiyuan; Jao, Li-En; Zhang, Bo; Woods, Ian G.; Zimmerman, Steven; Schier, Alexander F.; Wolfsberg, Tyra G.; Pellegrini, Matteo; Burgess, Shawn M.; Lin, Shuo

    2013-01-01

    With the completion of the zebrafish genome sequencing project, it becomes possible to analyze the function of zebrafish genes in a systematic way. The first step in such an analysis is to inactivate each protein-coding gene by targeted or random mutation. Here we describe a streamlined pipeline using proviral insertions coupled with high-throughput sequencing and mapping technologies to widely mutagenize genes in the zebrafish genome. We also report the first 6144 mutagenized and archived F1's predicted to carry up to 3776 mutations in annotated genes. Using in vitro fertilization, we have rescued and characterized ∼0.5% of the predicted mutations, showing mutation efficacy and a variety of phenotypes relevant to both developmental processes and human genetic diseases. Mutagenized fish lines are being made freely available to the public through the Zebrafish International Resource Center. These fish lines establish an important milestone for zebrafish genetics research and should greatly facilitate systematic functional studies of the vertebrate genome. PMID:23382537

  12. A high-resolution map of the Nile tilapia genome: a resource for studying cichlids and other percomorphs

    PubMed Central

    2012-01-01

    Background The Nile tilapia (Oreochromis niloticus) is the second most farmed fish species worldwide. It is also an important model for studies of fish physiology, particularly because of its broad tolerance to an array of environments. It is a good model to study evolutionary mechanisms in vertebrates, because of its close relationship to haplochromine cichlids, which have undergone rapid speciation in East Africa. The existing genomic resources for Nile tilapia include a genetic map, BAC end sequences and ESTs, but comparative genome analysis and maps of quantitative trait loci (QTL) are still limited. Results We have constructed a high-resolution radiation hybrid (RH) panel for the Nile tilapia and genotyped 1358 markers consisting of 850 genes, 82 markers corresponding to BAC end sequences, 154 microsatellites and 272 single nucleotide polymorphisms (SNPs). From these, 1296 markers could be associated in 81 RH groups, while 62 were not linked. The total size of the RH map is 34,084 cR3500 and 937,310 kb. It covers 88% of the entire genome with an estimated inter-marker distance of 742 Kb. Mapping of microsatellites enabled integration to the genetic map. We have merged LG8 and LG24 into a single linkage group, and confirmed that LG16-LG21 are also merged. The orientation and association of RH groups to each chromosome and LG was confirmed by chromosomal in situ hybridizations (FISH) of 55 BACs. Fifty RH groups were localized on the 22 chromosomes while 31 remained small orphan groups. Synteny relationships were determined between Nile tilapia, stickleback, medaka and pufferfish. Conclusion The RH map and associated FISH map provide a valuable gene-ordered resource for gene mapping and QTL studies. All genetic linkage groups with their corresponding RH groups now have a corresponding chromosome which can be identified in the karyotype. Placement of conserved segments indicated that multiple inter-chromosomal rearrangements have occurred between Nile tilapia

  13. Development of genomic resources for the narrow-leafed lupin (Lupinus angustifolius): construction of a bacterial artificial chromosome (BAC) library and BAC-end sequencing

    PubMed Central

    2011-01-01

    Background Lupinus angustifolius L, also known as narrow-leafed lupin (NLL), is becoming an important grain legume crop that is valuable for sustainable farming and is becoming recognised as a potential human health food. Recent interest is being directed at NLL to improve grain production, disease and pest management and health benefits of the grain. However, studies have been hindered by a lack of extensive genomic resources for the species. Results A NLL BAC library was constructed consisting of 111,360 clones with an average insert size of 99.7 Kbp from cv Tanjil. The library has approximately 12 × genome coverage. Both ends of 9600 randomly selected BAC clones were sequenced to generate 13985 BAC end-sequences (BESs), covering approximately 1% of the NLL genome. These BESs permitted a preliminary characterisation of the NLL genome such as organisation and composition, with the BESs having approximately 39% G:C content, 16.6% repetitive DNA and 5.4% putative gene-encoding regions. From the BESs 9966 simple sequence repeat (SSR) motifs were identified and some of these are shown to be potential markers. Conclusions The NLL BAC library and BAC-end sequences are powerful resources for genetic and genomic research on lupin. These resources will provide a robust platform for future high-resolution mapping, map-based cloning, comparative genomics and assembly of whole-genome sequencing data for the species. PMID:22014081

  14. Causes and consequences of variation in conifer leaf life-span

    SciTech Connect

    Reich, P.B.; Koike, T.; Gower, S.T.; Schoettle, A.W.

    1995-07-01

    Species with mutually supporting traits, such as high N{sub mass}, SLA, and A{sub mass}, and short leaf life-span, tend to inhabit either generally resource-rich environments or spatial and/or temporal microhabitats that are resource-rich in otherwise more limited habitats (e.g., {open_quotes}precipitation{close_quotes} ephemerals in warm deserts or spring ephemerals in the understory of temperate deciduous forests). In contrast, species with long leaf life-span often support foliage with low SLA, N{sub mass}, and A{sub mass}, and often grow in low-temperature limited, dry, and/or nutrient-poor environments. The contrast between evergreen and deciduous species, and the implications that emerge from such comparisons, can be considered a paradigm of modern ecological theory. However, based on the results of Reich et al. (1992) and Gower et al. (1993), coniferous species with foliage that persists for 9-10 years are likely to assimilate and allocate carbon and nutrients differently than other evergreen conifers that retain foliage for 2-3 years. Thus, attempts to contrast ecophysiological or ecosystem characteristics of evergreen versus deciduous life forms may be misleading, and pronounced differences among evergreen conifers may be ignored. Clearly, the deciduous-evergreen contrast, although useful in several ways, should be viewed from the broader perspective of a gradient in leaf life-span.

  15. Colorectal cancer atlas: An integrative resource for genomic and proteomic annotations from colorectal cancer cell lines and tissues.

    PubMed

    Chisanga, David; Keerthikumar, Shivakumar; Pathan, Mohashin; Ariyaratne, Dinuka; Kalra, Hina; Boukouris, Stephanie; Mathew, Nidhi Abraham; Al Saffar, Haidar; Gangoda, Lahiru; Ang, Ching-Seng; Sieber, Oliver M; Mariadason, John M; Dasgupta, Ramanuj; Chilamkurti, Naveen; Mathivanan, Suresh

    2016-01-04

    In order to advance our understanding of colorectal cancer (CRC) development and progression, biomedical researchers have generated large amounts of OMICS data from CRC patient samples and representative cell lines. However, these data are deposited in various repositories or in supplementary tables. A database which integrates data from heterogeneous resources and enables analysis of the multidimensional data sets, specifically pertaining to CRC is currently lacking. Here, we have developed Colorectal Cancer Atlas (http://www.colonatlas.org), an integrated web-based resource that catalogues the genomic and proteomic annotations identified in CRC tissues and cell lines. The data catalogued to-date include sequence variations as well as quantitative and non-quantitative protein expression data. The database enables the analysis of these data in the context of signaling pathways, protein-protein interactions, Gene Ontology terms, protein domains and post-translational modifications. Currently, Colorectal Cancer Atlas contains data for >13 711 CRC tissues, >165 CRC cell lines, 62 251 protein identifications, >8.3 million MS/MS spectra, >18 410 genes with sequence variations (404 278 entries) and 351 pathways with sequence variants. Overall, Colorectal Cancer Atlas has been designed to serve as a central resource to facilitate research in CRC. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  16. The genetics of drought tolerance in conifers.

    PubMed

    Moran, Emily; Lauder, Jeffrey; Musser, Cameron; Stathos, Angela; Shu, Mengjun

    2017-09-12

    1 I. 1 II. 2 III. 4 IV. 5 V. 9 VI. 10 VII. 12 References 12 SUMMARY: As temperatures warm and precipitation patterns shift as a result of climate change, interest in the identification of tree genotypes that will thrive under more arid conditions has grown. In this review, we discuss the multiple definitions of 'drought tolerance' and the biological processes involved in drought responses. We describe the three major approaches taken in the study of genetic variation in drought responses, the advantages and shortcomings of each, and what each of these approaches has revealed about the genetic basis of adaptation to drought in conifers. Finally, we discuss how a greater knowledge of the genetics of drought tolerance may aid forest management, and provide recommendations for how future studies may overcome the limitations of past approaches. In particular, we urge a more direct focus on survival, growth and the traits that directly predict them (rather than on proxies, such as water use efficiency), combining research approaches with complementary strengths and weaknesses, and the inclusion of a wider range of taxa and life stages. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.

  17. Resources.

    ERIC Educational Resources Information Center

    Aviation/Space, 1980

    1980-01-01

    The resources listed different types of materials related to the aerospace science under specified categories: free materials and inexpensive, selected government publication, audiovisual (government, nongovernment), aviation books, and space books. The list includes the publisher's name and the price for each publication. (SK)

  18. Resources.

    ERIC Educational Resources Information Center

    Stewart, John; MacDonald, Ian

    1980-01-01

    Presents a guide to resources on television drama available to teachers for classroom use in television curriculum. Lists American and British television drama videorecordings of both series and individual presentations and offers a bibliography of "one-off" single fiction plays produced for British television. (JMF)

  19. Resources.

    ERIC Educational Resources Information Center

    Aviation/Space, 1980

    1980-01-01

    The resources listed different types of materials related to the aerospace science under specified categories: free materials and inexpensive, selected government publication, audiovisual (government, nongovernment), aviation books, and space books. The list includes the publisher's name and the price for each publication. (SK)

  20. Toward an Integrated BAC Library Resource for Genome Sequencing and Analysis

    SciTech Connect

    Simon, M. I.; Kim, U.-J.

    2002-02-26

    We developed a great deal of expertise in building large BAC libraries from a variety of DNA sources including humans, mice, corn, microorganisms, worms, and Arabidopsis. We greatly improved the technology for screening these libraries rapidly and for selecting appropriate BACs and mapping BACs to develop large overlapping contigs. We became involved in supplying BACs and BAC contigs to a variety of sequencing and mapping projects and we began to collaborate with Drs. Adams and Venter at TIGR and with Dr. Leroy Hood and his group at University of Washington to provide BACs for end sequencing and for mapping and sequencing of large fragments of chromosome 16. Together with Dr. Ian Dunham and his co-workers at the Sanger Center we completed the mapping and they completed the sequencing of the first human chromosome, chromosome 22. This was published in Nature in 1999 and our BAC contigs made a major contribution to this sequencing effort. Drs. Shizuya and Ding invented an automated highly accurate BAC mapping technique. We also developed long-term collaborations with Dr. Uli Weier at UCSF in the design of BAC probes for characterization of human tumors and specific chromosome deletions and breakpoints. Finally the contribution of our work to the human genome project has been recognized in the publication both by the international consortium and the NIH of a draft sequence of the human genome in Nature last year. Dr. Shizuya was acknowledged in the authorship of that landmark paper. Dr. Simon was also an author on the Venter/Adams Celera project sequencing the human genome that was published in Science last year.

  1. A New Resource for Cereal Genomics: 22K Barley GeneChip Comes of Age1

    PubMed Central

    Close, Timothy J.; Wanamaker, Steve I.; Caldo, Rico A.; Turner, Stacy M.; Ashlock, Daniel A.; Dickerson, Julie A.; Wing, Rod A.; Muehlbauer, Gary J.; Kleinhofs, Andris; Wise, Roger P.

    2004-01-01

    In recent years, access to complete genomic sequences, coupled with rapidly accumulating data related to RNA and protein expression patterns, has made it possible to determine comprehensively how genes contribute to complex phenotypes. However, for major crop plants, publicly available, standard platforms for parallel expression analysis have been limited. We report the conception and design of the new publicly available, 22K Barley1 GeneChip probe array, a model for plants without a fully sequenced genome. Array content was derived from worldwide contribution of 350,000 high-quality ESTs from 84 cDNA libraries, in addition to 1,145 barley (Hordeum vulgare) gene sequences from the National Center for Biotechnology Information nonredundant database. Conserved sequences expressed in seedlings of wheat (Triticum aestivum), oat (Avena strigosa), rice (Oryza sativa), sorghum (Sorghum bicolor), and maize (Zea mays) were identified that will be valuable in the design of arrays across grasses. To enhance the usability of the data, BarleyBase, a MIAME-compliant, MySQL relational database, serves as a public repository for raw and normalized expression data from the Barley1 GeneChip probe array. Interconnecting links with PlantGDB and Gramene allow BarleyBase users to perform gene predictions using the 21,439 non-redundant Barley1 exemplar sequences or cross-species comparison at the genome level, respectively. We expect that this first generation array will accelerate hypothesis generation and gene discovery in disease defense pathways, responses to abiotic stresses, development, and evolutionary diversity in monocot plants. PMID:15020760

  2. Mapped DNA probes from loblolly pine can be used for restriction fragment length polymorphism mapping in other conifers.

    PubMed

    Ahuja, M R; Devey, M E; Groover, A T; Jermstad, K D; Neale, D B

    1994-06-01

    A high-density genetic map based on restriction fragment length polymorphisms (RFLPs) is being constructed for loblolly pine (Pinus taeda L.). Consequently, a large number of DNA probes from loblolly pine are potentially available for use in other species. We have used some of these DNA probes to detect RFLPs in 12 conifers and an angiosperm. Thirty complementary DNA and two genomic DNA probes from loblolly pine were hybridized to Southern blots containing DNA from five species of Pinus (P. elliottii, P. lambertiana, P. radiata, P. sylvestris, and P. taeda), one species from each of four other genera of Pinaceae (Abies concolor, Larix laricina, Picea abies, and Pseudotsuga menziesii), one species from each of three other families of Coniferales [Sequoia sempervirens (Taxodiaceae), Torreya californica (Taxaceae) and Calocedrus decurrens (Cupressaceae)], and to one angiosperm species (Populus nigra). Results showed that mapped DNA probes from lobolly pine will cross-hybridize to genomic DNA of other species of Pinus and some other genera of the Pinaceae. Only a small proportion of the probes hybridized to genomic DNA from three other families of the Coniferales and the one angiosperm examined. This study demonstrates that mapped DNA probes from loblolly pine can be used to construct RFLP maps for related species, thus enabling the opportunity for comparative genome mapping in conifers.

  3. Hemisphere-scale differences in conifer evolutionary dynamics.

    PubMed

    Leslie, Andrew B; Beaulieu, Jeremy M; Rai, Hardeep S; Crane, Peter R; Donoghue, Michael J; Mathews, Sarah

    2012-10-02

    Fundamental differences in the distribution of oceans and landmasses in the Northern and Southern Hemispheres potentially impact patterns of biological diversity in the two areas. The evolutionary history of conifers provides an opportunity to explore these dynamics, because the majority of extant conifer species belong to lineages that have been broadly confined to the Northern or Southern Hemisphere during the Cenozoic. Incorporating genetic information with a critical review of fossil evidence, we developed an age-calibrated phylogeny sampling ∼80% of living conifer species. Most extant conifer species diverged recently during the Neogene within clades that generally were established during the later Mesozoic, but lineages that diversified mainly in the Southern Hemisphere show a significantly older distribution of divergence ages than their counterparts in the Northern Hemisphere. Our tree topology and divergence times also are best fit by diversification models in which Northern Hemisphere conifer lineages have higher rates of species turnover than Southern Hemisphere lineages. The abundance of recent divergences in northern clades may reflect complex patterns of migration and range shifts during climatic cycles over the later Neogene leading to elevated rates of speciation and extinction, whereas the scattered persistence of mild, wetter habitats in the Southern Hemisphere may have favored the survival of older lineages.

  4. Genome-wide Association Study of Porcine Hematological Parameters in a Large White × Minzhu F2 Resource Population

    PubMed Central

    Luo, Weizhen; Chen, Shaokang; Cheng, Duxue; Wang, Ligang; Li, Yong; Ma, Xiaojun; Song, Xin; Liu, Xin; Li, Wen; Liang, Jing; Yan, Hua; Zhao, Kebin; Wang, Chuduan; Wang, Lixian; Zhang, Longchao

    2012-01-01

    Hematological traits, which are important indicators of immune function in animals, have been commonly examined as biomarkers of disease and disease severity in humans and animals. Genome-wide significant quantitative trait loci (QTLs) provide important information for use in breeding programs of animals such as pigs. QTLs for hematological parameters (hematological traits) have been detected in pig chromosomes, although these are often mapped by linkage analysis to large intervals making identification of the underlying mutation problematic. Single nucleotide polymorphisms (SNPs) are the common form of genetic variation among individuals and are thought to account for the majority of inherited traits. In this study, a genome-wide association study (GWAS) was performed to detect regions of association with hematological traits in a three-generation resource population produced by intercrossing Large White boars and Minzhu sows during the period from 2007 to 2011. Illumina PorcineSNP60 BeadChip technology was used to genotype each animal and seven hematological parameters were measured (hematocrit (HCT), hemoglobin (HGB), mean corpuscular hemoglobin (MCH), mean corpuscular hemoglobin concentration (MCHC), mean corpuscular volume (MCV), red blood cell count (RBC) and red blood cell volume distribution width (RDW)). Data were analyzed in a three step Genome-wide Rapid Association using the Mixed Model and Regression-Genomic Control (GRAMMAR-GC) method. A total of 62 genome-wide significant and three chromosome-wide significant SNPs associated with hematological parameters were detected in this GWAS. Seven and five SNPs were associated with HCT and HGB, respectively. These SNPs were all located within the region of 34.6-36.5 Mb on SSC7. Four SNPs within the region of 43.7-47.0 Mb and fifty-five SNPs within the region of 42.2-73.8 Mb on SSC8 showed significant association with MCH and MCV, respectively. At chromosome-wide significant level, one SNP at 29.2 Mb on SSC1

  5. The Ty1-copia LTR retroelement family PARTC is highly conserved in conifers over 200 MY of evolution.

    PubMed

    Zuccolo, Andrea; Scofield, Douglas G; De Paoli, Emanuele; Morgante, Michele

    2015-08-15

    Long Terminal Repeat retroelements (LTR-RTs) are a major component of many plant genomes. Although well studied and described in angiosperms, their features and dynamics are poorly understood in gymnosperms. Representative complete copies of a Ty1-copia element isolate in Picea abies and named PARTC were identified in six other conifer species (Picea glauca, Pinus sylvestris, Pinus taeda, Abies sibirica, Taxus baccata and Juniperus communis) covering more than 200 million years of evolution. Here we characterized the structure of this element, assessed its abundance across conifers, studied the modes and timing of its amplification, and evaluated the degree of conservation of its extant copies at nucleotide level over distant species. We demonstrated that the element is ancient, abundant, widespread and its paralogous copies are present in the genera Picea, Pinus and Abies as an LTR-RT family. The amplification leading to the extant copies of PARTC occurred over long evolutionary times spanning 10s of MY and mostly took place after the speciation of the conifers analyzed. The level of conservation of PARTC is striking and may be explained by low substitution rates and limited removal mechanisms for LTR-RTs. These PARTC features and dynamics are representative of a more general scenario for LTR-RTs in gymnosperms quite different from that characterizing the vast majority of LTR-RT elements in angiosperms.

  6. ParameciumDB: a community resource that integrates the Paramecium tetraurelia genome sequence with genetic data.

    PubMed

    Arnaiz, Olivier; Cain, Scott; Cohen, Jean; Sperling, Linda

    2007-01-01

    ParameciumDB (http://paramecium.cgm.cnrs-gif.fr) is a new model organism database associated with the genome sequencing project of the unicellular eukaryote Paramecium tetraurelia. Built with the core components of the Generic Model Organism Database (GMOD) project, ParameciumDB currently contains the genome sequence and annotations, linked to available genetic data including the Gif Paramecium stock collection. It is thus possible to navigate between sequences and stocks via the genes and alleles. Phenotypes, of mutant strains and of knockdowns obtained by RNA interference, are captured using controlled vocabularies according to the Entity-Attribute-Value model. ParameciumDB currently supports browsing of phenotypes, alleles and stocks as well as querying of sequence features (genes, UniProt matches, InterPro domains, Gene Ontology terms) and of genetic data (phenotypes, stocks, RNA interference experiments). Forms allow submission of RNA interference data and some bioinformatics services are available. Future ParameciumDB development plans include coordination of human curation of the near 40 000 gene models by members of the research community.

  7. Wildlife species associated with non-coniferous vegetation in Pacific Northwest conifer forests: A review

    USGS Publications Warehouse

    Hagar, J.C.

    2007-01-01

    Non-coniferous vegetation, including herbs, shrubs, and broad-leaved trees, makes a vital contribution to ecosystem function and diversity in Pacific Northwest conifer forests. However, forest management has largely been indifferent or detrimental to shrubs and trees that have low commercial value, in spite of a paradigm shift towards more holistic management in recent decades. Forest management practices that are detrimental to broad-leaved trees and shrubs are likely to decrease habitat diversity for wildlife, but the number of species that may be affected has not previously been enumerated. I reviewed life history accounts for forest-dwelling vertebrate wildlife species and derived a list of 78 species in Oregon and Washington that are associated with non-coniferous vegetation. The diversity of direct and indirect food resources provided was the primary functional basis for associations of most species with non-coniferous vegetation. Thus, a diversity of herbs and broad-leaved trees and shrubs provides the foundation for food webs that contribute to diversity at multiple trophic levels in Pacific Northwest conifer forests. Given the number of species associated with non-coniferous vegetation in conifer-dominated forests, maintaining habitats that support diverse plant communities, particularly broad-leaved trees and shrubs, will be an important component of management strategies intended to foster biodiversity. Silvicultural practices such as modified planting densities, and pre-commercial and commercial thinning, can be used to control stand density in order to favor the development of understory herbs, shrubs, and a diversity of tree species within managed stands. Allowing shrubs and hardwood trees to develop and persist in early seral stands by curtailing vegetation control also would benefit many species associated with non-coniferous vegetation.

  8. Review of Current Information Regarding the Phytosanitary Risks of Phytophthora ramorum and North American Conifers

    Treesearch

    Brenda Callan; Shane Sela; Eric Allen

    2008-01-01

    On March 3, 2007 the North American Plant Protection Organization (NAPPO) sponsored a ?Risks to Conifers? discussion panel to review the state of scientific knowledge regarding Phytophthora ramorum Werres, De Cock & Man in 't Veld and conifers and the potential for the pathogen to be transported with conifer forest products moving in...

  9. Utilization of treated conifer wood chips by Pleurotus (Fr.) P. Karst. species for cultivating mushrooms

    Treesearch

    Suki C. Croan

    2003-01-01

    Mushroom-producing white-rot basidiomycetes can grow rapidly and produce heavy mycelial growth on treated conifer wastes with extractive-degrading fungi. This study evaluates the treatment of scaled-up conifer wood chips with Ophiostoma piliferum (Cartapip 97). Treated conifer chips were used as substrates for cultivating mushroom-producing basidiomycetes of various...

  10. Prediction of periodic basal area increment for young-growth mixed conifers in sierra Nevada

    Treesearch

    Leroy K. Dolph

    1988-01-01

    Mixed-conifer forests are the largest vegetation type in California, covering more than 13 million acres (Barbour and Major 1977). This type, the Sierra Nevada Mixed Conifer (Society of American Foresters Forest cover type 243, Tappeiner 1980) dominates mid-elevations of the Sierra Nevada's western slopes. The extent of the mixed-conifer type and the amount of...

  11. A density management diagram for even-aged Sierra Nevada mixed-conifer stands

    Treesearch

    James N. Long; John D. Shaw

    2012-01-01

    We have developed a density management diagram (DMD) for even-aged mixed-conifer stands in the Sierra Nevada Mountains using forest inventory and analysis (FIA) data. Analysis plots were drawn from FIA plots in California, southern Oregon, and western Nevada which included those conifer species associated with the mixed-conifer forest type. A total of 204 plots met the...

  12. Stand dynamics of mixed red alder-conifer forests of southeast Alaska.

    Treesearch

    Robert L. Deal; Paul E. Hennon; Ewa H. Orlikowska; David V. D' Amore

    2004-01-01

    Stand structure and dynamics were evaluated in mixed red alder (Alnus rubra Bong.) - conifer forests of southeast Alaska. We assessed stand development, tree density, total basal area, diameter distribution of live and dead trees, height distribution of live trees, and mean diameter of all and largest conifers in 40-year-old red alder - conifer...

  13. The Arabidopsis Information Resource: Making and Mining the ‘Gold Standard’ Annotated Reference Plant Genome

    PubMed Central

    Berardini, Tanya Z.; Reiser, Leonore; Li, Donghui; Mezheritsky, Yarik; Muller, Robert; Strait, Emily; Huala, Eva

    2015-01-01

    The Arabidopsis Information Resource (TAIR) is a continuously updated, online database of genetic and molecular biology data for the model plant Arabidopsis thaliana that provides a global research community with centralized access to data for over 30,000 Arabidopsis genes. TAIR’s biocurators systematically extract, organize, and interconnect experimental data from the literature along with computational predictions, community submissions, and high throughput datasets to present a high quality and comprehensive picture of Arabidopsis gene function. TAIR provides tools for data visualization and analysis, and enables ordering of seed and DNA stocks, protein chips and other experimental resources. TAIR actively engages with its users who contribute expertise and data that augments the work of the curatorial staff. TAIR’s focus in an extensive and evolving ecosystem of online resources for plant biology is on the critically important role of extracting experimentally-based research findings from the literature and making that information computationally accessible. In response to the loss of government grant funding, the TAIR team founded a nonprofit entity, Phoenix Bioinformatics, with the aim of developing sustainable funding models for biological databases, using TAIR as a test case. Phoenix has successfully transitioned TAIR to subscription-based funding while still keeping its data relatively open and accessible. PMID:26201819

  14. GeneSpeed Beta Cell: An Online Genomics Data Repository and Analysis Resource Tailored for the Islet Cell Biologist

    PubMed Central

    Quayum, Nayeem; Kutchma, Alecksandr; Sarkar, Suparna A.; Juhl, Kirstine; Gradwohl, Gerard; Mellitzer, Georg; Hutton, John C.; Jensen, Jan

    2008-01-01

    Objective. We here describe the development of a freely available online database resource, GeneSpeed Beta Cell, which has been created for the pancreatic islet and pancreatic developmental biology investigator community. Research Design and Methods. We have developed GeneSpeed Beta Cell as a separate component of the GeneSpeed database, providing a genomics-type data repository of pancreas and islet-relevant datasets interlinked with the domain-oriented GeneSpeed database. Results. GeneSpeed Beta Cell allows the query of multiple published and unpublished select genomics datasets in a simultaneous fashion (multiexperiment viewing) and is capable of defining intersection results from precomputed analysis of such datasets (multidimensional querying). Combined with the protein-domain categorization/assembly toolbox provided by the GeneSpeed database, the user is able to define spatial expression constraints of select gene lists in a relatively rigid fashion within the pancreatic expression space. We provide several demonstration case studies of relevance to islet cell biology and development of the pancreas that provide novel insight into islet biology. Conclusions. The combination of an exhaustive domain-based compilation of the transcriptome with gene array data of interest to the islet biologist affords novel methods for multidimensional querying between individual datasets in a rapid fashion, presently not available elsewhere. PMID:18795106

  15. Analysis of Multiple Genomic Sequence Alignments: A Web Resource, Online Tools, and Lessons Learned From Analysis of Mammalian SCL Loci

    PubMed Central

    Chapman, Michael A.; Donaldson, Ian J.; Gilbert, James; Grafham, Darren; Rogers, Jane; Green, Anthony R.; Göttgens, Berthold

    2004-01-01

    Comparative analysis of genomic sequences is becoming a standard technique for studying gene regulation. However, only a limited number of tools are currently available for the analysis of multiple genomic sequences. An extensive data set for the testing and training of such tools is provided by the SCL gene locus. Here we have expanded the data set to eight vertebrate species by sequencing the dog SCL locus and by annotating the dog and rat SCL loci. To provide a resource for the bioinformatics community, all SCL sequences and functional annotations, comprising a collation of the extensive experimental evidence pertaining to SCL regulation, have been made available via a Web server. A Web interface to new tools specifically designed for the display and analysis of multiple sequence alignments was also implemented. The unique SCL data set and new sequence comparison tools allowed us to perform a rigorous examination of the true benefits of multiple sequence comparisons. We demonstrate that multiple sequence alignments are, overall, superior to pairwise alignments for identification of mammalian regulatory regions. In the search for individual transcription factor binding sites, multiple alignments markedly increase the signal-to-noise ratio compared to pairwise alignments. PMID:14718377

  16. Biofortification of wheat grain with iron and zinc: integrating novel genomic resources and knowledge from model crops.

    PubMed

    Borrill, Philippa; Connorton, James M; Balk, Janneke; Miller, Anthony J; Sanders, Dale; Uauy, Cristobal

    2014-01-01

    Wheat, like many other staple cereals, contains low levels of the essential micronutrients iron and zinc. Up to two billion people worldwide suffer from iron and zinc deficiencies, particularly in regions with predominantly cereal-based diets. Although wheat flour is commonly fortified during processing, an attractive and more sustainable solution is biofortification, which requires developing new varieties of wheat with inherently higher iron and zinc content in their grains. Until now most studies aimed at increasing iron and zinc content in wheat grains have focused on discovering natural variation in progenitor or related species. However, recent developments in genomics and transformation have led to a step change in targeted research on wheat at a molecular level. We discuss promising approaches to improve iron and zinc content in wheat using knowledge gained in model grasses. We explore how the latest resources developed in wheat, including sequenced genomes and mutant populations, can be exploited for biofortification. We also highlight the key research and practical challenges that remain in improving iron and zinc content in wheat.

  17. Biofortification of wheat grain with iron and zinc: integrating novel genomic resources and knowledge from model crops

    PubMed Central

    Borrill, Philippa; Connorton, James M.; Balk, Janneke; Miller, Anthony J.; Sanders, Dale; Uauy, Cristobal

    2014-01-01

    Wheat, like many other staple cereals, contains low levels of the essential micronutrients iron and zinc. Up to two billion people worldwide suffer from iron and zinc deficiencies, particularly in regions with predominantly cereal-based diets. Although wheat flour is commonly fortified during processing, an attractive and more sustainable solution is biofortification, which requires developing new varieties of wheat with inherently higher iron and zinc content in their grains. Until now most studies aimed at increasing iron and zinc content in wheat grains have focused on discovering natural variation in progenitor or related species. However, recent developments in genomics and transformation have led to a step change in targeted research on wheat at a molecular level. We discuss promising approaches to improve iron and zinc content in wheat using knowledge gained in model grasses. We explore how the latest resources developed in wheat, including sequenced genomes and mutant populations, can be exploited for biofortification. We also highlight the key research and practical challenges that remain in improving iron and zinc content in wheat. PMID:24600464

  18. Genomic evidence that resource-based trade-offs limit host-range expansion in a seed beetle.

    PubMed

    Gompert, Zachariah; Messina, Frank J

    2016-06-01

    Trade-offs have often been invoked to explain the evolution of ecological specialization. Phytophagous insects have been especially well studied, but there has been little evidence that resource-based trade-offs contribute to the evolution of host specialization in this group. Here, we combine experimental evolution and partial genome resequencing of replicate seed beetle selection lines to test the trade-off hypothesis and measure the repeatability of evolution. Bayesian estimates of selection coefficients suggest that rapid adaptation to a poor host (lentil) was mediated by standing genetic variation at multiple genetic loci and involved many of the same variants in replicate lines. Sublines that were then switched back to the ancestral host (mung bean) showed a more gradual and variable (less repeatable) loss of adaptation to lentil. We were able to obtain estimates of variance effective population sizes from genome-wide differences in allele frequencies within and between lines. These estimates were relatively large, which suggests that the contribution of genetic drift to the loss of adaptation following reversion was small. Instead, we find that some alleles that were favored on lentil were selected against during reversion on mung bean, consistent with the genetic trade-off hypothesis.

  19. WormBase: a comprehensive data resource for Caenorhabditis biology and genomics

    PubMed Central

    Chen, Nansheng; Harris, Todd W.; Antoshechkin, Igor; Bastiani, Carol; Bieri, Tamberlyn; Blasiar, Darin; Bradnam, Keith; Canaran, Payan; Chan, Juancarlos; Chen, Chao-Kung; Chen, Wen J.; Cunningham, Fiona; Davis, Paul; Kenny, Eimear; Kishore, Ranjana; Lawson, Daniel; Lee, Raymond; Muller, Hans-Michael; Nakamura, Cecilia; Pai, Shraddha; Ozersky, Philip; Petcherski, Andrei; Rogers, Anthony; Sabo, Aniko; Schwarz, Erich M.; Van Auken, Kimberly; Wang, Qinghua; Durbin, Richard; Spieth, John; Sternberg, Paul W.; Stein, Lincoln D.

    2005-01-01

    WormBase (http://www.wormbase.org), the model organism database for information about Caenorhabditis elegans and related nematodes, continues to expand in breadth and depth. Over the past year, WormBase has added multiple large-scale datasets including SAGE, interactome, 3D protein structure datasets and NCBI KOGs. To accommodate this growth, the International WormBase Consortium has improved the user interface by adding new features to aid in navigation, visualization of large-scale datasets, advanced searching and data mining. Internally, we have restructured the database models to rationalize the representation of genes and to prepare the system to accept the genome sequences of three additional Caenorhabditis species over the coming year. PMID:15608221

  20. Gene Overexpression Resources in Cereals for Functional Genomics and Discovery of Useful Genes

    PubMed Central

    Abe, Kiyomi; Ichikawa, Hiroaki

    2016-01-01

    Identification and elucidation of functions of plant genes is valuable for both basic and applied research. In addition to natural variation in model plants, numerous loss-of-function resources have been produced by mutagenesis with chemicals, irradiation, or insertions of transposable elements or T-DNA. However, we may be unable to observe loss-of-function phenotypes for genes with functionally redundant homologs and for those essential for growth and development. To offset such disadvantages, gain-of-function transgenic resources have been exploited. Activation-tagged lines have been generated using obligatory overexpression of endogenous genes by random insertion of an enhancer. Recent progress in DNA sequencing technology and bioinformatics has enabled the preparation of genomewide collections of full-length cDNAs (fl-cDNAs) in some mod