Science.gov

Sample records for genomics developing tools

  1. Technology developments in biological tools for targeted genome surgery.

    PubMed

    Teimourian, Shahram; Abdollahzadeh, Rasoul

    2015-01-01

    Different biological tools for targeted genome engineering have recently appeared and these include tools like meganucleases, zinc-finger nucleases and newer technologies including TALENs and CRISPR/Cas systems. transcription activator-like effector nucleases (TALENs) have greatly improved genome editing efficiency by making site-specific DNA double-strand breaks. Several studies have shown the prominence of TALENs in comparison to the meganucleases and zinc-finger nucleases. The most important feature of TALENs that makes them suitable tools for targeted genome editing is the modularity of central repeat domains, meaning that they can be designed to recognize any desirable DNA sequence. In this review, we present a comprehensive and concise description of TALENs technology developments for targeted genome surgery with to the point description and comparison of other tools.

  2. Genomics and public health: development of Web-based training tools for increasing genomic awareness.

    PubMed

    Bodzin, Jennifer; Kardia, Sharon L R; Goldenberg, Aaron; Raup, Sarah F; Bach, Janice V; Citrin, Toby

    2005-04-01

    In 2001, the Centers for Disease Control and Prevention funded three Centers for Genomics and Public Health to develop training tools for increasing genomic awareness. Over the past three years, the centers, working together with the Centers for Disease Control and Prevention's Office of Genomics and Disease Prevention, have developed tools to increase awareness of the impact genomics will have on public health practice, to provide a foundation for understanding basic genomic advances, and to translate the relevance of that information to public health practitioners' own work. These training tools serve to communicate genomic advances and their potential for integration into public heath practice. This paper highlights two of these training tools: 1) Genomics for Public Health Practitioners: The Practical Application of Genomics in Public Health Practice, a Web-based introduction to genomics, and 2) Six Weeks to Genomic Awareness, an in-depth training module on public health genomics. This paper focuses on the processes and collaborative efforts by which these live presentations were developed and delivered as Web-based training sessions.

  3. Coral Reef Genomics: Developing tools for functional genomics ofcoral symbiosis

    SciTech Connect

    Schwarz, Jodi; Brokstein, Peter; Manohar, Chitra; Coffroth, MaryAlice; Szmant, Alina; Medina, Monica

    2005-03-01

    Symbioses between cnidarians and dinoflagellates in the genus Symbiodinium are widespread in the marine environment. The importance of this symbiosis to reef-building corals and reef nutrient and carbon cycles is well documented, but little is known about the mechanisms by which the partners establish and regulate the symbiosis. Because the dinoflagellate symbionts live inside the cells of their host coral, the interactions between the partners occur on cellular and molecular levels, as each partner alters the expression of genes and proteins to facilitate the partnership. These interactions can examined using high-throughput techniques that allow thousands of genes to be examined simultaneously. We are developing the groundwork so that we can use DNA microarray profiling to identify genes involved in the Montastraea faveolata and Acropora palmata symbioses. Here we report results from the initial steps in this microarray initiative, that is, the construction of cDNA libraries from 4 of 16 target stages, sequencing of 3450 cDNA clones to generate Expressed Sequenced Tags (ESTs), and annotation of the ESTs to identify candidate genes to include in the microarrays. An understanding of how the coral-dinoflagellate symbiosis is regulated will have implications for atmospheric and ocean sciences, conservation biology, the study and diagnosis of coral bleaching and disease, and comparative studies of animal-protest interactions.

  4. Development of peanut expessed sequence tag-based genomic resources and tools

    USDA-ARS?s Scientific Manuscript database

    U.S. Peanut Genome Initiative (PGI) has widely recognized the need for peanut genome tools and resources development for mitigating peanut allergens and food safety. Genomics such as Expressed Sequence Tag (EST), microarray technologies, and whole genome sequencing provides robotic tools for profili...

  5. GenomicTools: a computational platform for developing high-throughput analytics in genomics.

    PubMed

    Tsirigos, Aristotelis; Haiminen, Niina; Bilal, Erhan; Utro, Filippo

    2012-01-15

    Recent advances in sequencing technology have resulted in the dramatic increase of sequencing data, which, in turn, requires efficient management of computational resources, such as computing time, memory requirements as well as prototyping of computational pipelines. We present GenomicTools, a flexible computational platform, comprising both a command-line set of tools and a C++ API, for the analysis and manipulation of high-throughput sequencing data such as DNA-seq, RNA-seq, ChIP-seq and MethylC-seq. GenomicTools implements a variety of mathematical operations between sets of genomic regions thereby enabling the prototyping of computational pipelines that can address a wide spectrum of tasks ranging from pre-processing and quality control to meta-analyses. Additionally, the GenomicTools platform is designed to analyze large datasets of any size by minimizing memory requirements. In practical applications, where comparable, GenomicTools outperforms existing tools in terms of both time and memory usage. The GenomicTools platform (version 2.0.0) was implemented in C++. The source code, documentation, user manual, example datasets and scripts are available online at http://code.google.com/p/ibm-cbc-genomic-tools.

  6. Genomic tools for developing markers for postharvest disease resistance in Rosaceae fruit crops

    USDA-ARS?s Scientific Manuscript database

    A wealth of new plant genomic information and molecular tools have been developed over the past ten years and now the challenge is to learn how to apply this information to address critical production problems, such as disease resistance and abiotic stress tolerance. Malus sieversii, an apple speci...

  7. Recent advances in developing molecular tools for targeted genome engineering of mammalian cells.

    PubMed

    Lim, Kwang-il

    2015-01-01

    Various biological molecules naturally existing in diversified species including fungi, bacteria, and bacteriophage have functionalities for DNA binding and processing. The biological molecules have been recently actively engineered for use in customized genome editing of mammalian cells as the molecule-encoding DNA sequence information and the underlying mechanisms how the molecules work are unveiled. Excitingly, multiple novel methods based on the newly constructed artificial molecular tools have enabled modifications of specific endogenous genetic elements in the genome context at efficiencies that are much higher than that of the conventional homologous recombination based methods. This minireview introduces the most recently spotlighted molecular genome engineering tools with their key features and ongoing modifications for better performance. Such ongoing efforts have mainly focused on the removal of the inherent DNA sequence recognition rigidity from the original molecular platforms, the addition of newly tailored targeting functions into the engineered molecules, and the enhancement of their targeting specificity. Effective targeted genome engineering of mammalian cells will enable not only sophisticated genetic studies in the context of the genome, but also widely-applicable universal therapeutics based on the pinpointing and correction of the disease-causing genetic elements within the genome in the near future.

  8. Transcriptome sequencing and microarray development for the Manila clam, Ruditapes philippinarum: genomic tools for environmental monitoring.

    PubMed

    Milan, Massimo; Coppe, Alessandro; Reinhardt, Richard; Cancela, Leonor M; Leite, Ricardo B; Saavedra, Carlos; Ciofi, Claudio; Chelazzi, Guido; Patarnello, Tomaso; Bortoluzzi, Stefania; Bargelloni, Luca

    2011-05-12

    The Manila clam, Ruditapes philippinarum, is one of the major aquaculture species in the world and a potential sentinel organism for monitoring the status of marine ecosystems. However, genomic resources for R. philippinarum are still extremely limited. Global analysis of gene expression profiles is increasingly used to evaluate the biological effects of various environmental stressors on aquatic animals under either artificial conditions or in the wild. Here, we report on the development of a transcriptomic platform for global gene expression profiling in the Manila clam. A normalized cDNA library representing a mixture of adult tissues was sequenced using a ultra high-throughput sequencing technology (Roche 454). A database consisting of 32,606 unique transcripts was constructed, 9,747 (30%) of which could be annotated by similarity. An oligo-DNA microarray platform was designed and applied to profile gene expression of digestive gland and gills. Functional annotation of differentially expressed genes between different tissues was performed by enrichment analysis. Expression of Natural Antisense Transcripts (NAT) analysis was also performed and bi-directional transcription appears a common phenomenon in the R. philippinarum transcriptome. A preliminary study on clam samples collected in a highly polluted area of the Venice Lagoon demonstrated the applicability of genomic tools to environmental monitoring. The transcriptomic platform developed for the Manila clam confirmed the high level of reproducibility of current microarray technology. Next-generation sequencing provided a good representation of the clam transcriptome. Despite the known limitations in transcript annotation and sequence coverage for non model species, sufficient information was obtained to identify a large set of genes potentially involved in cellular response to environmental stress.

  9. Transcriptome sequencing and microarray development for the Manila clam, Ruditapes philippinarum: genomic tools for environmental monitoring

    PubMed Central

    2011-01-01

    Background The Manila clam, Ruditapes philippinarum, is one of the major aquaculture species in the world and a potential sentinel organism for monitoring the status of marine ecosystems. However, genomic resources for R. philippinarum are still extremely limited. Global analysis of gene expression profiles is increasingly used to evaluate the biological effects of various environmental stressors on aquatic animals under either artificial conditions or in the wild. Here, we report on the development of a transcriptomic platform for global gene expression profiling in the Manila clam. Results A normalized cDNA library representing a mixture of adult tissues was sequenced using a ultra high-throughput sequencing technology (Roche 454). A database consisting of 32,606 unique transcripts was constructed, 9,747 (30%) of which could be annotated by similarity. An oligo-DNA microarray platform was designed and applied to profile gene expression of digestive gland and gills. Functional annotation of differentially expressed genes between different tissues was performed by enrichment analysis. Expression of Natural Antisense Transcripts (NAT) analysis was also performed and bi-directional transcription appears a common phenomenon in the R. philippinarum transcriptome. A preliminary study on clam samples collected in a highly polluted area of the Venice Lagoon demonstrated the applicability of genomic tools to environmental monitoring. Conclusions The transcriptomic platform developed for the Manila clam confirmed the high level of reproducibility of current microarray technology. Next-generation sequencing provided a good representation of the clam transcriptome. Despite the known limitations in transcript annotation and sequence coverage for non model species, sufficient information was obtained to identify a large set of genes potentially involved in cellular response to environmental stress. PMID:21569398

  10. An Analysis of Adenovirus Genomes Using Whole Genome Software Tools

    PubMed Central

    Mahadevan, Padmanabhan

    2016-01-01

    The evolution of sequencing technology has lead to an enormous increase in the number of genomes that have been sequenced. This is especially true in the field of virus genomics. In order to extract meaningful biological information from these genomes, whole genome data mining software tools must be utilized. Hundreds of tools have been developed to analyze biological sequence data. However, only some of these tools are user-friendly to biologists. Several of these tools that have been successfully used to analyze adenovirus genomes are described here. These include Artemis, EMBOSS, pDRAW, zPicture, CoreGenes, GeneOrder, and PipMaker. These tools provide functionalities such as visualization, restriction enzyme analysis, alignment, and proteome comparisons that are extremely useful in the bioinformatics analysis of adenovirus genomes. PMID:28293072

  11. Integrative genomics--a basic and essential tool for the development of molecular medicine.

    PubMed

    Ostrowski, Jerzy

    2008-01-01

    Understanding the molecular mechanisms of disease requires the introduction of molecular diagnostics into medical practice. Current medicine employs only elements of molecular diagnostics, and usually on the scale of single genes. Medicine in the post-genomic era will utilize thousands of molecular markers associated with disease that are provided by high-throughput sequencing and functional genomic, proteomic and metabolomic studies. Such a spectrum of techniques will link clinical medicine based on molecularly oriented diagnostics with the prediction and prevention of disease. To achieve this task, large-scale and genome-wide biological and medical data must be combined with biostatistical analyses and bioinformatic modeling of biological systems. The collecting, cataloging and comparison of data from molecular studies and the subsequent development of conclusions create the fundamentals of systems biology. This highly complex analytical process reflects a new scientific paradigm called integrative genomics.

  12. Genoviz Software Development Kit: Java tool kit for building genomics visualization applications.

    PubMed

    Helt, Gregg A; Nicol, John W; Erwin, Ed; Blossom, Eric; Blanchard, Steven G; Chervitz, Stephen A; Harmon, Cyrus; Loraine, Ann E

    2009-08-25

    Visualization software can expose previously undiscovered patterns in genomic data and advance biological science. The Genoviz Software Development Kit (SDK) is an open source, Java-based framework designed for rapid assembly of visualization software applications for genomics. The Genoviz SDK framework provides a mechanism for incorporating adaptive, dynamic zooming into applications, a desirable feature of genome viewers. Visualization capabilities of the Genoviz SDK include automated layout of features along genetic or genomic axes; support for user interactions with graphical elements (Glyphs) in a map; a variety of Glyph sub-classes that promote experimentation with new ways of representing data in graphical formats; and support for adaptive, semantic zooming, whereby objects change their appearance depending on zoom level and zooming rate adapts to the current scale. Freely available demonstration and production quality applications, including the Integrated Genome Browser, illustrate Genoviz SDK capabilities. Separation between graphics components and genomic data models makes it easy for developers to add visualization capability to pre-existing applications or build new applications using third-party data models. Source code, documentation, sample applications, and tutorials are available at http://genoviz.sourceforge.net/.

  13. Development of Genomic and Genetic Tools for Foxtail Millet, and Use of These Tools in the Improvement of Biomass Production for Bioenergy Crops

    SciTech Connect

    Doust, Andrew, N.

    2011-11-11

    The overall aim of this research was to develop genomic and genetic tools in foxtail millet that will be useful in improving biomass production in bioenergy crops such as switchgrass, napier grass, and pearl millet. A variety of approaches have been implemented, and our lab has been primarily involved in genome analysis and quantitative genetic analysis. Our progress in these activities has been substantially helped by the genomic sequence of foxtail millet produced by the Joint Genome Institute (Bennetzen et al., in prep). In particular, the annotation and analysis of candidate genes for architecture, biomass production and flowering has led to new insights into the control of branching and flowering time, and has shown how closely related flowering time is to vegetative architectural development and biomass accumulation. The differences in genetic control identified at high and low density plantings have direct relevance to the breeding of bioenergy grasses that are tolerant of high planting densities. The developmental analyses have shown how plant architecture changes over time and may indicate which genes may best be manipulated at various times during development to obtain required biomass characteristics. This data contributes to the overall aim of significantly improving genetic and genomic tools in foxtail millet that can be directed to improvement of bioenergy grasses such as switchgrass, where it is important to maximize vegetative growth for greatest biomass production.

  14. Computational Tools for Genomic Studies in Plants.

    PubMed

    Martinez, Manuel

    2016-12-01

    In recent years, the genomic sequence of numerous plant species including the main crop species has been determined. Computational tools have been developed to deal with the issue of which plant has been sequenced and where is the sequence hosted. In this mini-review, the databases for genome projects, the databases created to host species/clade projects and the databases developed to perform plant comparative genomics are revised. Because of their importance in modern research, an in-depth analysis of the plant comparative genomics databases has been performed. This comparative analysis is focused in the common and specific computational tools developed to achieve the particular objectives of each database. Besides, emerging high-performance bioinformatics tools specific for plant research are commented. What kind of computational approaches should be implemented in next years to efficiently analyze plant genomes is discussed.

  15. Genome Exploitation and Bioinformatics Tools

    NASA Astrophysics Data System (ADS)

    de Jong, Anne; van Heel, Auke J.; Kuipers, Oscar P.

    Bioinformatic tools can greatly improve the efficiency of bacteriocin screening efforts by limiting the amount of strains. Different classes of bacteriocins can be detected in genomes by looking at different features. Finding small bacteriocins can be especially challenging due to low homology and because small open reading frames (ORFs) are often omitted from annotations. In this chapter, several bioinformatic tools/strategies to identify bacteriocins in genomes are discussed.

  16. Application of Genomic Tools in Plant Breeding

    PubMed Central

    Pérez-de-Castro, A.M.; Vilanova, S.; Cañizares, J.; Pascual, L.; Blanca, J.M.; Díez, M.J.; Prohens, J.; Picó, B.

    2012-01-01

    Plant breeding has been very successful in developing improved varieties using conventional tools and methodologies. Nowadays, the availability of genomic tools and resources is leading to a new revolution of plant breeding, as they facilitate the study of the genotype and its relationship with the phenotype, in particular for complex traits. Next Generation Sequencing (NGS) technologies are allowing the mass sequencing of genomes and transcriptomes, which is producing a vast array of genomic information. The analysis of NGS data by means of bioinformatics developments allows discovering new genes and regulatory sequences and their positions, and makes available large collections of molecular markers. Genome-wide expression studies provide breeders with an understanding of the molecular basis of complex traits. Genomic approaches include TILLING and EcoTILLING, which make possible to screen mutant and germplasm collections for allelic variants in target genes. Re-sequencing of genomes is very useful for the genome-wide discovery of markers amenable for high-throughput genotyping platforms, like SSRs and SNPs, or the construction of high density genetic maps. All these tools and resources facilitate studying the genetic diversity, which is important for germplasm management, enhancement and use. Also, they allow the identification of markers linked to genes and QTLs, using a diversity of techniques like bulked segregant analysis (BSA), fine genetic mapping, or association mapping. These new markers are used for marker assisted selection, including marker assisted backcross selection, ‘breeding by design’, or new strategies, like genomic selection. In conclusion, advances in genomics are providing breeders with new tools and methodologies that allow a great leap forward in plant breeding, including the ‘superdomestication’ of crops and the genetic dissection and breeding for complex traits. PMID:23115520

  17. Developing tools for the study of molluscan immunity: The sequencing of the genome of the eastern oyster, Crassostrea virginica.

    PubMed

    Gómez-Chiarri, Marta; Warren, Wesley C; Guo, Ximing; Proestou, Dina

    2015-09-01

    The eastern oyster, Crassostrea virginica, provides important ecological and economical services, making it the target of restoration projects and supporting a significant fishery/aquaculture industry with landings valued at more than $100 million in 2012 in the United States of America. Due to the impact of infectious diseases on wild, restored, and cultured populations, the eastern oyster has been the focus of studies on host-pathogen interactions and immunity, as well as the target of selective breeding efforts for disease resistant oyster lines. Despite these efforts, relatively little is known about the genetic basis of resistance to diseases or environmental stress, not only in eastern oyster, but also in other molluscan species of commercial interest worldwide. In order to develop tools and resources to assist in the elucidation of the genomic basis of traits of commercial, biological, and ecological interest in oysters, a team of genome and bioinformatics experts, in collaboration with the oyster research community, is sequencing, assembling, and annotating the first reference genome for the eastern oyster and producing an exhaustive transcriptome from a variety of oyster developmental stages and tissues in response to a diverse set of environmentally-relevant stimuli. These transcriptomes and reference genome for the eastern oyster, added to the already available genome and transcriptomes for the Pacific oyster (Crassostrea gigas) and other bivalve species, will be an essential resource for the discovery of candidate genes and markers associated with traits of commercial, biological, and ecologic importance in bivalve molluscs, including those related to host-pathogen interactions and immunity.

  18. Development and validation of a comprehensive genomic diagnostic tool for myeloid malignancies

    PubMed Central

    McKerrell, Thomas; Moreno, Thaidy; Ponstingl, Hannes; Bolli, Niccolo; Dias, João M. L.; Tischler, German; Colonna, Vincenza; Manasse, Bridget; Bench, Anthony; Bloxham, David; Herman, Bram; Fletcher, Danielle; Park, Naomi; Quail, Michael A.; Manes, Nicla; Hodkinson, Clare; Baxter, Joanna; Sierra, Jorge; Foukaneli, Theodora; Warren, Alan J.; Chi, Jianxiang; Costeas, Paul; Rad, Roland; Huntly, Brian; Grove, Carolyn; Ning, Zemin; Tyler-Smith, Chris; Varela, Ignacio; Scott, Mike; Nomdedeu, Josep; Mustonen, Ville

    2016-01-01

    The diagnosis of hematologic malignancies relies on multidisciplinary workflows involving morphology, flow cytometry, cytogenetic, and molecular genetic analyses. Advances in cancer genomics have identified numerous recurrent mutations with clear prognostic and/or therapeutic significance to different cancers. In myeloid malignancies, there is a clinical imperative to test for such mutations in mainstream diagnosis; however, progress toward this has been slow and piecemeal. Here we describe Karyogene, an integrated targeted resequencing/analytical platform that detects nucleotide substitutions, insertions/deletions, chromosomal translocations, copy number abnormalities, and zygosity changes in a single assay. We validate the approach against 62 acute myeloid leukemia, 50 myelodysplastic syndrome, and 40 blood DNA samples from individuals without evidence of clonal blood disorders. We demonstrate robust detection of sequence changes in 49 genes, including difficult-to-detect mutations such as FLT3 internal-tandem and mixed-lineage leukemia (MLL) partial-tandem duplications, and clinically significant chromosomal rearrangements including MLL translocations to known and unknown partners, identifying the novel fusion gene MLL-DIAPH2 in the process. Additionally, we identify most significant chromosomal gains and losses, and several copy neutral loss-of-heterozygosity mutations at a genome-wide level, including previously unreported changes such as homozygosity for DNMT3A R882 mutations. Karyogene represents a dependable genomic diagnosis platform for translational research and for the clinical management of myeloid malignancies, which can be readily adapted for use in other cancers. PMID:27121471

  19. From plant genomes to protein families: computational tools

    PubMed Central

    Martinez, Manuel

    2013-01-01

    The development of new high-throughput sequencing technologies has increased dramatically the number of successful genomic projects. Thus, draft genomic sequences of more than 60 plant species are currently available. Suitable bioinformatics tools are being developed to assemble, annotate and analyze the enormous number of sequences produced. In this context, specific plant comparative genomic databases are become powerful tools for gene family annotation in plant clades. In this mini-review, the current state-of-art of genomic projects is glossed. Besides, the computational tools developed to compare genomic data are compiled. PMID:24688740

  20. Skittle: a 2-dimensional genome visualization tool.

    PubMed

    Seaman, Josiah D; Sanford, John C

    2009-12-30

    It is increasingly evident that there are multiple and overlapping patterns within the genome, and that these patterns contain different types of information--regarding both genome function and genome history. In order to discover additional genomic patterns which may have biological significance, novel strategies are required. To partially address this need, we introduce a new data visualization tool entitled Skittle. This program first creates a 2-dimensional nucleotide display by assigning four colors to the four nucleotides, and then text-wraps to a user adjustable width. This nucleotide display is accompanied by a "repeat map" which comprehensively displays all local repeating units, based upon analysis of all possible local alignments. Skittle includes a smooth-zooming interface which allows the user to analyze genomic patterns at any scale.Skittle is especially useful in identifying and analyzing tandem repeats, including repeats not normally detectable by other methods. However, Skittle is also more generally useful for analysis of any genomic data, allowing users to correlate published annotations and observable visual patterns, and allowing for sequence and construct quality control. Preliminary observations using Skittle reveal intriguing genomic patterns not otherwise obvious, including structured variations inside tandem repeats. The striking visual patterns revealed by Skittle appear to be useful for hypothesis development, and have already led the authors to theorize that imperfect tandem repeats could act as information carriers, and may form tertiary structures within the interphase nucleus.

  1. Comparative genomics tools applied to bioterrorism defence.

    PubMed

    Slezak, Tom; Kuczmarski, Tom; Ott, Linda; Torres, Clinton; Medeiros, Dan; Smith, Jason; Truitt, Brian; Mulakken, Nisha; Lam, Marisa; Vitalis, Elizabeth; Zemla, Adam; Zhou, Carol Ecale; Gardner, Shea

    2003-06-01

    Rapid advances in the genomic sequencing of bacteria and viruses over the past few years have made it possible to consider sequencing the genomes of all pathogens that affect humans and the crops and livestock upon which our lives depend. Recent events make it imperative that full genome sequencing be accomplished as soon as possible for pathogens that could be used as weapons of mass destruction or disruption. This sequence information must be exploited to provide rapid and accurate diagnostics to identify pathogens and distinguish them from harmless near-neighbours and hoaxes. The Chem-Bio Non-Proliferation (CBNP) programme of the US Department of Energy (DOE) began a large-scale effort of pathogen detection in early 2000 when it was announced that the DOE would be providing bio-security at the 2002 Winter Olympic Games in Salt Lake City, Utah. Our team at the Lawrence Livermore National Lab (LLNL) was given the task of developing reliable and validated assays for a number of the most likely bioterrorist agents. The short timeline led us to devise a novel system that utilised whole-genome comparison methods to rapidly focus on parts of the pathogen genomes that had a high probability of being unique. Assays developed with this approach have been validated by the Centers for Disease Control (CDC). They were used at the 2002 Winter Olympics, have entered the public health system, and have been in continual use for non-publicised aspects of homeland defence since autumn 2001. Assays have been developed for all major threat list agents for which adequate genomic sequence is available, as well as for other pathogens requested by various government agencies. Collaborations with comparative genomics algorithm developers have enabled our LLNL team to make major advances in pathogen detection, since many of the existing tools simply did not scale well enough to be of practical use for this application. It is hoped that a discussion of a real-life practical application of

  2. Data management tools for genomic applications: A progress report

    SciTech Connect

    Markowitz, V.M.; Chen, I-Min A.

    1993-09-01

    We report in this paper on the development of data management tools that allow scientist to construct and manipulate genomic data bases in terms of application-specific objects and protocols. We are developing tools for specifying genomic database structures, as well as for entering, changing, maintaining, browsing and querying data in genomic data bases. These tools are based on the Object-protocol Model (OPM) developed by us and target commercial relational database management systems which are widely used in molecular biology laboratories. OPM allows scientists to interact with genomic databases in terms of their own frame or reference, namely genomic objects and protocols. Databases developed using the data management tools are easier to use, manage, and adapt.

  3. Strategies and tools for whole genome alignments

    SciTech Connect

    Couronne, Olivier; Poliakov, Alexander; Bray, Nicolas; Ishkhanov,Tigran; Ryaboy, Dmitriy; Rubin, Edward; Pachter, Lior; Dubchak, Inna

    2002-11-25

    The availability of the assembled mouse genome makespossible, for the first time, an alignment and comparison of two largevertebrate genomes. We have investigated different strategies ofalignment for the subsequent analysis of conservation of genomes that areeffective for different quality assemblies. These strategies were appliedto the comparison of the working draft of the human genome with the MouseGenome Sequencing Consortium assembly, as well as other intermediatemouse assemblies. Our methods are fast and the resulting alignmentsexhibit a high degree of sensitivity, covering more than 90 percent ofknown coding exons in the human genome. We have obtained such coveragewhile preserving specificity. With a view towards the end user, we havedeveloped a suite of tools and websites for automatically aligning, andsubsequently browsing and working with whole genome comparisons. Wedescribe the use of these tools to identify conserved non-coding regionsbetween the human and mouse genomes, some of which have not beenidentified by other methods.

  4. DEVELOPMENT OF GENOMIC AND GENETIC TOOLS FOR FOXTAIL MILLET, AND USE OF THESE TOOLS IN THE IMPROVEMENT OF BIOMASS PRODUCTION FOR BIOENERGY CROPS

    SciTech Connect

    Chen, Xinlu; Zale, Janice; Chen, Feng

    2013-01-22

    Foxtail millet (Setaria italica L.) is a warm-season, C4 annual crop commonly grown for grain and forage worldwide. It has a relatively short generation time, yet produces hundreds of seeds per inflorescence. The crop is inbred and it has a small-size genome (~500 Mb). These features make foxtail millet an attractive grass model, especially for bioenergy crops. While a number of genomic tools have been established for foxtail millet, including a fully sequenced genome and molecular markers, the objectives of this project were to develop a tissue culture system, determine the best explant(s) for tissue culture, optimize transient gene expression, and establish a stable transformation system for foxtail millet cultivar Yugu1. In optimizing a tissue culture medium for the induction of calli and somatic embryos from immature inflorescences and mature seed explants, Murashige and Skoog medium containing 2.5 mg l-1 2,4-dichlorophenoxyacetic acid and 0.6 mg l-1 6- benzylaminopurine was determined to be optimal for callus induction of foxtail millet. The efficiency of callus induction from explants of immature inflorescences was significantly higher at 76% compared to that of callus induction from mature seed explants at 68%. The calli induced from this medium were regenerated into plants at high frequency (~100%) using 0.2 mg l-1 kinetin in the regeneration media. For performing transient gene expression, immature embryos were first isolated from inflorescences. Transient expression of the GUS reporter gene in immature embryos was significantly increased after sonication, a vacuum treatment, centrifugation and the addition of L-cysteine and dithiothreitol, which led to the efficiency of transient expression at levels greater than 70% after Agrobacterium inoculation. Inoculation with Agrobacterium was also tested with germinated seeds. The radicals of germinated seeds were pierced with needles and dipped into Agrobacterium solution. This method achieved a 10% transient

  5. Bioinformatics tools for small genomes, such as hepatitis B virus.

    PubMed

    Bell, Trevor G; Kramvis, Anna

    2015-02-16

    DNA sequence analysis is undertaken in many biological research laboratories. The workflow consists of several steps involving the bioinformatic processing of biological data. We have developed a suite of web-based online bioinformatic tools to assist with processing, analysis and curation of DNA sequence data. Most of these tools are genome-agnostic, with two tools specifically designed for hepatitis B virus sequence data. Tools in the suite are able to process sequence data from Sanger sequencing, ultra-deep amplicon resequencing (pyrosequencing) and chromatograph (trace files), as appropriate. The tools are available online at no cost and are aimed at researchers without specialist technical computer knowledge. The tools can be accessed at http://hvdr.bioinf.wits.ac.za/SmallGenomeTools, and the source code is available online at https://github.com/DrTrevorBell/SmallGenomeTools.

  6. Bioinformatics Tools for Small Genomes, Such as Hepatitis B Virus

    PubMed Central

    Bell, Trevor G.; Kramvis, Anna

    2015-01-01

    DNA sequence analysis is undertaken in many biological research laboratories. The workflow consists of several steps involving the bioinformatic processing of biological data. We have developed a suite of web-based online bioinformatic tools to assist with processing, analysis and curation of DNA sequence data. Most of these tools are genome-agnostic, with two tools specifically designed for hepatitis B virus sequence data. Tools in the suite are able to process sequence data from Sanger sequencing, ultra-deep amplicon resequencing (pyrosequencing) and chromatograph (trace files), as appropriate. The tools are available online at no cost and are aimed at researchers without specialist technical computer knowledge. The tools can be accessed at http://hvdr.bioinf.wits.ac.za/SmallGenomeTools, and the source code is available online at https://github.com/DrTrevorBell/SmallGenomeTools. PMID:25690798

  7. DCODE.ORG Anthology of Comparative Genomic Tools

    SciTech Connect

    Loots, G G; Ovcharenko, I

    2005-01-11

    Comparative genomics provides the means to demarcate functional regions in anonymous DNA sequences. The successful application of this method to identifying novel genes is currently shifting to deciphering the noncoding encryption of gene regulation across genomes. To facilitate the use of comparative genomics to practical applications in genetics and genomics we have developed several analytical and visualization tools for the analysis of arbitrary sequences and whole genomes. These tools include two alignment tools: zPicture and Mulan; a phylogenetic shadowing tool: eShadow for identifying lineage- and species-specific functional elements; two evolutionary conserved transcription factor analysis tools: rVista and multiTF; a tool for extracting cis-regulatory modules governing the expression of co-regulated genes, CREME; and a dynamic portal to multiple vertebrate and invertebrate genome alignments, the ECR Browser. Here we briefly describe each one of these tools and provide specific examples on their practical applications. All the tools are publicly available at the http://www.dcode.org/ web site.

  8. Galaxy tools to study genome diversity

    PubMed Central

    2013-01-01

    Background Intra-species genetic variation can be used to investigate population structure, selection, and gene flow in non-model vertebrates; and due to the plummeting costs for genome sequencing, it is now possible for small labs to obtain full-genome variation data from their species of interest. However, those labs may not have easy access to, and familiarity with, computational tools to analyze those data. Results We have created a suite of tools for the Galaxy web server aimed at handling nucleotide and amino-acid polymorphisms discovered by full-genome sequencing of several individuals of the same species, or using a SNP genotyping microarray. In addition to providing user-friendly tools, a main goal is to make published analyses reproducible. While most of the examples discussed in this paper deal with nuclear-genome diversity in non-human vertebrates, we also illustrate the application of the tools to fungal genomes, human biomedical data, and mitochondrial sequences. Conclusions This project illustrates that a small group can design, implement, test, document, and distribute a Galaxy tool collection to meet the needs of a particular community of biologists. PMID:24377391

  9. Tools for sea urchin genomic analysis.

    PubMed

    Cameron, R Andrew

    2014-01-01

    The Sea Urchin Genome Project Web site, SpBase ( http://SpBase.org ), in association with a suite of publicly available sequence comparison tools provides a platform from which to analyze genes and genomic sequences of sea urchin. This information system is specifically designed to support laboratory bench studies in cell and molecular biology. In particular these tools and datasets have supported the description of the gene regulatory networks of the purple sea urchin S. purpuratus. This chapter details methods to undertake in the first steps to find genes and noncoding regulatory sequences for further analysis.

  10. PanGeT: Pan-genomics tool.

    PubMed

    Yuvaraj, Iyyappan; Sridhar, Jayavel; Michael, Daliah; Sekar, Kanagaraj

    2017-02-05

    A decade after the concept of Pan-genome was first introduced; research in this field has spread its tentacles to areas such as pathogenesis of diseases, bacterial evolutionary studies and drug resistance. Gene content-based differentiation of virulent and a virulent strains of bacteria and identification of pathogen specific genes is imperative to understand their physiology and gain insights into the mechanism of genome evolution. Subsequently, this will aid in identifying diagnostic targets and in developing and selecting vaccines. The root of pan-genomic studies, however, is to identify the core genes, dispensable genes and strain specific genes across the genomes belonging to a clade. To this end, we have developed a tool, "PanGeT - Pan-genomics Tool" to compute the 'pan-genome' based on comparisons at the genome as well as the proteome levels. This automated tool is implemented using LaTeX libraries for effective visualization of overall pan-genome through graphical plots. Links to retrieve sequence information and functional annotations have also been provided. PanGeT can be downloaded from http://pranag.physics.iisc.ernet.in/PanGeT/ or https://github.com/PanGeTv1/PanGeT.

  11. Bovine Genome Database: new tools for gleaning function from the Bos taurus genome.

    PubMed

    Elsik, Christine G; Unni, Deepak R; Diesh, Colin M; Tayal, Aditi; Emery, Marianne L; Nguyen, Hung N; Hagen, Darren E

    2016-01-04

    We report an update of the Bovine Genome Database (BGD) (http://BovineGenome.org). The goal of BGD is to support bovine genomics research by providing genome annotation and data mining tools. We have developed new genome and annotation browsers using JBrowse and WebApollo for two Bos taurus genome assemblies, the reference genome assembly (UMD3.1.1) and the alternate genome assembly (Btau_4.6.1). Annotation tools have been customized to highlight priority genes for annotation, and to aid annotators in selecting gene evidence tracks from 91 tissue specific RNAseq datasets. We have also developed BovineMine, based on the InterMine data warehousing system, to integrate the bovine genome, annotation, QTL, SNP and expression data with external sources of orthology, gene ontology, gene interaction and pathway information. BovineMine provides powerful query building tools, as well as customized query templates, and allows users to analyze and download genome-wide datasets. With BovineMine, bovine researchers can use orthology to leverage the curated gene pathways of model organisms, such as human, mouse and rat. BovineMine will be especially useful for gene ontology and pathway analyses in conjunction with GWAS and QTL studies. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  12. Genomic Tools in Groundnut Breeding Program: Status and Perspectives.

    PubMed

    Janila, P; Variath, Murali T; Pandey, Manish K; Desmae, Haile; Motagi, Babu N; Okori, Patrick; Manohar, Surendra S; Rathnakumar, A L; Radhakrishnan, T; Liao, Boshou; Varshney, Rajeev K

    2016-01-01

    Groundnut, a nutrient-rich food legume, is cultivated world over. It is valued for its good quality cooking oil, energy and protein rich food, and nutrient-rich fodder. Globally, groundnut improvement programs have developed varieties to meet the preferences of farmers, traders, processors, and consumers. Enhanced yield, tolerance to biotic and abiotic stresses and quality parameters have been the target traits. Spurt in genetic information of groundnut was facilitated by development of molecular markers, genetic, and physical maps, generation of expressed sequence tags (EST), discovery of genes, and identification of quantitative trait loci (QTL) for some important biotic and abiotic stresses and quality traits. The first groundnut variety developed using marker assisted breeding (MAB) was registered in 2003. Since then, USA, China, Japan, and India have begun to use genomic tools in routine groundnut improvement programs. Introgression lines that combine foliar fungal disease resistance and early maturity were developed using MAB. Establishment of marker-trait associations (MTA) paved way to integrate genomic tools in groundnut breeding for accelerated genetic gain. Genomic Selection (GS) tools are employed to improve drought tolerance and pod yield, governed by several minor effect QTLs. Draft genome sequence and low cost genotyping tools such as genotyping by sequencing (GBS) are expected to accelerate use of genomic tools to enhance genetic gains for target traits in groundnut.

  13. Genomic Tools in Groundnut Breeding Program: Status and Perspectives

    PubMed Central

    Janila, P.; Variath, Murali T.; Pandey, Manish K.; Desmae, Haile; Motagi, Babu N.; Okori, Patrick; Manohar, Surendra S.; Rathnakumar, A. L.; Radhakrishnan, T.; Liao, Boshou; Varshney, Rajeev K.

    2016-01-01

    Groundnut, a nutrient-rich food legume, is cultivated world over. It is valued for its good quality cooking oil, energy and protein rich food, and nutrient-rich fodder. Globally, groundnut improvement programs have developed varieties to meet the preferences of farmers, traders, processors, and consumers. Enhanced yield, tolerance to biotic and abiotic stresses and quality parameters have been the target traits. Spurt in genetic information of groundnut was facilitated by development of molecular markers, genetic, and physical maps, generation of expressed sequence tags (EST), discovery of genes, and identification of quantitative trait loci (QTL) for some important biotic and abiotic stresses and quality traits. The first groundnut variety developed using marker assisted breeding (MAB) was registered in 2003. Since then, USA, China, Japan, and India have begun to use genomic tools in routine groundnut improvement programs. Introgression lines that combine foliar fungal disease resistance and early maturity were developed using MAB. Establishment of marker-trait associations (MTA) paved way to integrate genomic tools in groundnut breeding for accelerated genetic gain. Genomic Selection (GS) tools are employed to improve drought tolerance and pod yield, governed by several minor effect QTLs. Draft genome sequence and low cost genotyping tools such as genotyping by sequencing (GBS) are expected to accelerate use of genomic tools to enhance genetic gains for target traits in groundnut. PMID:27014312

  14. The UCSC genome browser and associated tools.

    PubMed

    Kuhn, Robert M; Haussler, David; Kent, W James

    2013-03-01

    The UCSC Genome Browser (http://genome.ucsc.edu) is a graphical viewer for genomic data now in its 13th year. Since the early days of the Human Genome Project, it has presented an integrated view of genomic data of many kinds. Now home to assemblies for 58 organisms, the Browser presents visualization of annotations mapped to genomic coordinates. The ability to juxtapose annotations of many types facilitates inquiry-driven data mining. Gene predictions, mRNA alignments, epigenomic data from the ENCODE project, conservation scores from vertebrate whole-genome alignments and variation data may be viewed at any scale from a single base to an entire chromosome. The Browser also includes many other widely used tools, including BLAT, which is useful for alignments from high-throughput sequencing experiments. Private data uploaded as Custom Tracks and Data Hubs in many formats may be displayed alongside the rich compendium of precomputed data in the UCSC database. The Table Browser is a full-featured graphical interface, which allows querying, filtering and intersection of data tables. The Saved Session feature allows users to store and share customized views, enhancing the utility of the system for organizing multiple trains of thought. Binary Alignment/Map (BAM), Variant Call Format and the Personal Genome Single Nucleotide Polymorphisms (SNPs) data formats are useful for visualizing a large sequencing experiment (whole-genome or whole-exome), where the differences between the data set and the reference assembly may be displayed graphically. Support for high-throughput sequencing extends to compact, indexed data formats, such as BAM, bigBed and bigWig, allowing rapid visualization of large datasets from RNA-seq and ChIP-seq experiments via local hosting.

  15. VISTA - computational tools for comparative genomics

    SciTech Connect

    Frazer, Kelly A.; Pachter, Lior; Poliakov, Alexander; Rubin,Edward M.; Dubchak, Inna

    2004-01-01

    Comparison of DNA sequences from different species is a fundamental method for identifying functional elements in genomes. Here we describe the VISTA family of tools created to assist biologists in carrying out this task. Our first VISTA server at http://www-gsd.lbl.gov/VISTA/ was launched in the summer of 2000 and was designed to align long genomic sequences and visualize these alignments with associated functional annotations. Currently the VISTA site includes multiple comparative genomics tools and provides users with rich capabilities to browse pre-computed whole-genome alignments of large vertebrate genomes and other groups of organisms with VISTA Browser, submit their own sequences of interest to several VISTA servers for various types of comparative analysis, and obtain detailed comparative analysis results for a set of cardiovascular genes. We illustrate capabilities of the VISTA site by the analysis of a 180 kilobase (kb) interval on human chromosome 5 that encodes for the kinesin family member3A (KIF3A) protein.

  16. Bioinformatics tools for analysing viral genomic data.

    PubMed

    Orton, R J; Gu, Q; Hughes, J; Maabar, M; Modha, S; Vattipally, S B; Wilkie, G S; Davison, A J

    2016-04-01

    The field of viral genomics and bioinformatics is experiencing a strong resurgence due to high-throughput sequencing (HTS) technology, which enables the rapid and cost-effective sequencing and subsequent assembly of large numbers of viral genomes. In addition, the unprecedented power of HTS technologies has enabled the analysis of intra-host viral diversity and quasispecies dynamics in relation to important biological questions on viral transmission, vaccine resistance and host jumping. HTS also enables the rapid identification of both known and potentially new viruses from field and clinical samples, thus adding new tools to the fields of viral discovery and metagenomics. Bioinformatics has been central to the rise of HTS applications because new algorithms and software tools are continually needed to process and analyse the large, complex datasets generated in this rapidly evolving area. In this paper, the authors give a brief overview of the main bioinformatics tools available for viral genomic research, with a particular emphasis on HTS technologies and their main applications. They summarise the major steps in various HTS analyses, starting with quality control of raw reads and encompassing activities ranging from consensus and de novo genome assembly to variant calling and metagenomics, as well as RNA sequencing.

  17. BEACON: automated tool for Bacterial GEnome Annotation ComparisON.

    PubMed

    Kalkatawi, Manal; Alam, Intikhab; Bajic, Vladimir B

    2015-08-18

    Genome annotation is one way of summarizing the existing knowledge about genomic characteristics of an organism. There has been an increased interest during the last several decades in computer-based structural and functional genome annotation. Many methods for this purpose have been developed for eukaryotes and prokaryotes. Our study focuses on comparison of functional annotations of prokaryotic genomes. To the best of our knowledge there is no fully automated system for detailed comparison of functional genome annotations generated by different annotation methods (AMs). The presence of many AMs and development of new ones introduce needs to: a/ compare different annotations for a single genome, and b/ generate annotation by combining individual ones. To address these issues we developed an Automated Tool for Bacterial GEnome Annotation ComparisON (BEACON) that benefits both AM developers and annotation analysers. BEACON provides detailed comparison of gene function annotations of prokaryotic genomes obtained by different AMs and generates extended annotations through combination of individual ones. For the illustration of BEACON's utility, we provide a comparison analysis of multiple different annotations generated for four genomes and show on these examples that the extended annotation can increase the number of genes annotated by putative functions up to 27%, while the number of genes without any function assignment is reduced. We developed BEACON, a fast tool for an automated and a systematic comparison of different annotations of single genomes. The extended annotation assigns putative functions to many genes with unknown functions. BEACON is available under GNU General Public License version 3.0 and is accessible at: http://www.cbrc.kaust.edu.sa/BEACON/ .

  18. GenomePeek—an online tool for prokaryotic genome and metagenome analysis

    DOE PAGES

    McNair, Katelyn; Edwards, Robert A.

    2015-06-16

    As increases in prokaryotic sequencing take place, a method to quickly and accurately analyze this data is needed. Previous tools are mainly designed for metagenomic analysis and have limitations; such as long runtimes and significant false positive error rates. The online tool GenomePeek (edwards.sdsu.edu/GenomePeek) was developed to analyze both single genome and metagenome sequencing files, quickly and with low error rates. GenomePeek uses a sequence assembly approach where reads to a set of conserved genes are extracted, assembled and then aligned against the highly specific reference database. GenomePeek was found to be faster than traditional approaches while still keeping errormore » rates low, as well as offering unique data visualization options.« less

  19. GenomePeek—an online tool for prokaryotic genome and metagenome analysis

    SciTech Connect

    McNair, Katelyn; Edwards, Robert A.

    2015-06-16

    As increases in prokaryotic sequencing take place, a method to quickly and accurately analyze this data is needed. Previous tools are mainly designed for metagenomic analysis and have limitations; such as long runtimes and significant false positive error rates. The online tool GenomePeek (edwards.sdsu.edu/GenomePeek) was developed to analyze both single genome and metagenome sequencing files, quickly and with low error rates. GenomePeek uses a sequence assembly approach where reads to a set of conserved genes are extracted, assembled and then aligned against the highly specific reference database. GenomePeek was found to be faster than traditional approaches while still keeping error rates low, as well as offering unique data visualization options.

  20. GLANET: genomic loci annotation and enrichment tool.

    PubMed

    Otlu, Burçak; Firtina, Can; Keles, Sündüz; Tastan, Oznur

    2017-09-15

    Genomic studies identify genomic loci representing genetic variations, transcription factor (TF) occupancy, or histone modification through next generation sequencing (NGS) technologies. Interpreting these loci requires evaluating them with known genomic and epigenomic annotations. We present GLANET as a comprehensive annotation and enrichment analysis tool which implements a sampling-based enrichment test that accounts for GC content and/or mappability biases, jointly or separately. GLANET annotates and performs enrichment analysis on these loci with a rich library. We introduce and perform novel data-driven computational experiments for assessing the power and Type-I error of its enrichment procedure which show that GLANET has attained high statistical power and well-controlled Type-I error rate. As a key feature, users can easily extend its library with new gene sets and genomic intervals. Other key features include assessment of impact of single nucleotide variants (SNPs) on TF binding sites and regulation based pathway enrichment analysis. GLANET can be run using its GUI or on command line. GLANET's source code is available at https://github.com/burcakotlu/GLANET . Tutorials are provided at https://glanet.readthedocs.org . burcak@ceng.metu.edu.tr or oznur.tastan@cs.bilkent.edu.tr. Supplementary data are available at Bioinformatics online.

  1. Microarray Genomic Systems Development

    DTIC Science & Technology

    2008-06-01

    D Canada Contract Report DRDC Suffield CR 2009-145 June 2008 V. Lam, M. Crichton , T. Dickinson Laing, and D.C. Mah Canada West Biosciences Inc...Genomic Systems Development V. Lam, M. Crichton , T. Dickinson Laing, and D.C. Mah Canada West Biosciences Inc. Canada West Biosciences Inc. 5429... Crichton , M.; Dickinson Laing, T.; Mah, D.C.; DRDC Suffield CR 2009- 145; Defence R&D Canada – Suffield; June 2008. Introduction: Conventional

  2. Genome editing strategies: potential tools for eradicating HIV-1/AIDS

    PubMed Central

    Khalili, Kamel; Gordon, Jennifer; Cosentino, Laura; Hu, Wenhui

    2015-01-01

    Current therapy for controlling HIV-1 infection and preventing AIDS progression has profoundly decreased viral replication in cells susceptible to HIV-1 infection, but it does not eliminate the low level of viral replication in latently infected cells which contain integrated copies of HIV-1 proviral DNA. There is an urgent need for the development of HIV-1 genome eradication strategies that will lead to a permanent or “sterile” cure of HIV-1/AIDS. In the past few years, novel nuclease-initiated genome editing tools have been developing rapidly, including ZFNs, TALENs, and the CRISPR/Cas9 system. These surgical knives, which can excise any genome, provide a great opportunity to eradicate the HIV-1 genome by targeting highly conserved regions of the HIV-1 long terminal repeats or essential viral genes. Given the time consuming and costly engineering of target-specific ZFNs and TALENs, the RNA-guided endonuclease Cas9 technology has emerged as a simpler and more versatile technology to allow permanent removal of integrated HIV-1 proviral DNA in eukaryotic cells, and hopefully animal models or human patients. The major unmet challenges of this approach at present include inefficient nuclease gene delivery, potential off-target cleavage, and cell-specific genome targeting. Nanoparticle or lentivirus-mediated delivery of next generation Cas9 technologies including nickase or RNA-guided FokI nuclease (RFN) will further improve the potential for genome editing to become a promising approach for curing HIV-1/AIDS. PMID:25716921

  3. Unraveling the 3D genome: genomics tools for multi-scale exploration

    PubMed Central

    Risca, Viviana I.; Greenleaf, William J.

    2015-01-01

    A decade of rapid method development has begun to yield exciting insights into the three-dimensional architecture of the metazoan genome and the roles it may play in regulating transcription. We review here core methods and new tools in the modern genomicist’s toolbox at three length scales, ranging from single base pair to megabase scale chromosomal domains, and discuss the emerging picture of the 3D genome that these tools have revealed. Blind spots remain, especially at intermediate length scales spanning a few nucleosomes, but thanks in part to new technologies that permit targeted alteration of chromatin states and time-resolved studies, the next decade holds great promise for hypothesis-driven research into the mechanisms that drive genome architecture and transcriptional regulation. PMID:25887733

  4. Rapid SAW Sensor Development Tools

    NASA Technical Reports Server (NTRS)

    Wilson, William C.; Atkinson, Gary M.

    2007-01-01

    The lack of integrated design tools for Surface Acoustic Wave (SAW) devices has led us to develop tools for the design, modeling, analysis, and automatic layout generation of SAW devices. These tools enable rapid development of wireless SAW sensors. The tools developed have been designed to integrate into existing Electronic Design Automation (EDA) tools to take advantage of existing 3D modeling, and Finite Element Analysis (FEA). This paper presents the SAW design, modeling, analysis, and automated layout generation tools.

  5. Genomic-based-breeding tools for tropical maize improvement.

    PubMed

    Chakradhar, Thammineni; Hindu, Vemuri; Reddy, Palakolanu Sudhakar

    2017-09-05

    Maize has traditionally been the main staple diet in the Southern Asia and Sub-Saharan Africa and widely grown by millions of resource poor small scale farmers. Approximately, 35.4 million hectares are sown to tropical maize, constituting around 59% of the developing worlds. Tropical maize encounters tremendous challenges besides poor agro-climatic situations with average yields recorded <3 tones/hectare that is far less than the average of developed countries. On the contrary to poor yields, the demand for maize as food, feed, and fuel is continuously increasing in these regions. Heterosis breeding introduced in early 90 s improved maize yields significantly, but genetic gains is still a mirage, particularly for crop growing under marginal environments. Application of molecular markers has accelerated the pace of maize breeding to some extent. The availability of array of sequencing and genotyping technologies offers unrivalled service to improve precision in maize-breeding programs through modern approaches such as genomic selection, genome-wide association studies, bulk segregant analysis-based sequencing approaches, etc. Superior alleles underlying complex traits can easily be identified and introgressed efficiently using these sequence-based approaches. Integration of genomic tools and techniques with advanced genetic resources such as nested association mapping and backcross nested association mapping could certainly address the genetic issues in maize improvement programs in developing countries. Huge diversity in tropical maize and its inherent capacity for doubled haploid technology offers advantage to apply the next generation genomic tools for accelerating production in marginal environments of tropical and subtropical world. Precision in phenotyping is the key for success of any molecular-breeding approach. This article reviews genomic technologies and their application to improve agronomic traits in tropical maize breeding has been reviewed in

  6. A web-based genomic sequence database for the Streptomycetaceae: a tool for systematics and genome mining

    USDA-ARS?s Scientific Manuscript database

    The ARS Microbial Genome Sequence Database (http://199.133.98.43), a web-based database server, was established utilizing the BIGSdb (Bacterial Isolate Genomics Sequence Database) software package, developed at Oxford University, as a tool to manage multi-locus sequence data for the family Streptomy...

  7. Genomic tools to profile antibiotic mode of action.

    PubMed

    Cardona, Silvia T; Selin, Carrie; Gislason, April S

    2015-01-01

    The increasing emergence of antimicrobial multiresistant bacteria is of great concern to public health. While these bacteria are becoming an ever more prominent cause of nosocomial and community-acquired infections worldwide, the antibiotic discovery pipeline has been stalled in the last few years with very few efforts in the research and development of novel antibacterial therapies. Some of the root causes that have hampered current antibiotic drug development are the lack of understanding of the mode of action (MOA) of novel antibiotic molecules and the poor characterization of the bacterial physiological response to antibiotics that ultimately causes resistance. Here, we review how bacterial genetic tools can be applied at the genomic level with the goal of profiling resistance to antibiotics and elucidating antibiotic MOAs. Specifically, we highlight how chemical genomic detection of the MOA of novel antibiotic molecules and antibiotic profiling by next-generation sequencing are leveraging basic antibiotic research to unprecedented levels with great opportunities for knowledge translation.

  8. Genome editing strategies: potential tools for eradicating HIV-1/AIDS.

    PubMed

    Khalili, Kamel; Kaminski, Rafal; Gordon, Jennifer; Cosentino, Laura; Hu, Wenhui

    2015-06-01

    Current therapy for controlling human immunodeficiency virus (HIV-1) infection and preventing acquired immunodeficiency syndrome (AIDS) progression has profoundly decreased viral replication in cells susceptible to HIV-1 infection, but it does not eliminate the low level of viral replication in latently infected cells, which contain integrated copies of HIV-1 proviral DNA. There is an urgent need for the development of HIV-1 genome eradication strategies that will lead to a permanent or "sterile" cure of HIV-1/AIDS. In the past few years, novel nuclease-initiated genome editing tools have been developing rapidly, including zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs), and the CRISPR/Cas9 system. These surgical knives, which can excise any genome, provide a great opportunity to eradicate the HIV-1 genome by targeting highly conserved regions of the HIV-1 long terminal repeats or essential viral genes. Given the time consuming and costly engineering of target-specific ZFNs and TALENs, the RNA-guided endonuclease Cas9 technology has emerged as a simpler and more versatile technology to allow permanent removal of integrated HIV-1 proviral DNA in eukaryotic cells, and hopefully animal models or human patients. The major unmet challenges of this approach at present include inefficient nuclease gene delivery, potential off-target cleavage, and cell-specific genome targeting. Nanoparticle or lentivirus-mediated delivery of next generation Cas9 technologies including nickase or RNA-guided FokI nuclease (RFN) will further improve the potential for genome editing to become a promising approach for curing HIV-1/AIDS.

  9. GAViT: Genome Assembly Visualization Tool for Short Read Data

    SciTech Connect

    Syed, Aijazuddin; Shapiro, Harris; Tu, Hank; Pangilinan, Jasmyn; Trong, Stephan

    2008-03-14

    It is a challenging job for genome analysts to accurately debug, troubleshoot, and validate genome assembly results. Genome analysts rely on visualization tools to help validate and troubleshoot assembly results, including such problems as mis-assemblies, low-quality regions, and repeats. Short read data adds further complexity and makes it extremely challenging for the visualization tools to scale and to view all needed assembly information. As a result, there is a need for a visualization tool that can scale to display assembly data from the new sequencing technologies. We present Genome Assembly Visualization Tool (GAViT), a highly scalable and interactive assembly visualization tool developed at the DOE Joint Genome Institute (JGI).

  10. Cytoscape: the network visualization tool for GenomeSpace workflows

    PubMed Central

    Demchak, Barry; Hull, Tim; Reich, Michael; Liefeld, Ted; Smoot, Michael; Ideker, Trey; Mesirov, Jill P.

    2014-01-01

    Modern genomic analysis often requires workflows incorporating multiple best-of-breed tools. GenomeSpace is a web-based visual workbench that combines a selection of these tools with mechanisms that create data flows between them. One such tool is Cytoscape 3, a popular application that enables analysis and visualization of graph-oriented genomic networks. As Cytoscape runs on the desktop, and not in a web browser, integrating it into GenomeSpace required special care in creating a seamless user experience and enabling appropriate data flows. In this paper, we present the design and operation of the Cytoscape GenomeSpace app, which accomplishes this integration, thereby providing critical analysis and visualization functionality for GenomeSpace users. It has been downloaded over 850 times since the release of its first version in September, 2013. PMID:25165537

  11. CucCAP - Developing genomic resources for the cucurbit community

    USDA-ARS?s Scientific Manuscript database

    The U.S. cucurbit community has initiated a USDA-SCRI funded cucurbit genomics project, CucCAP: Leveraging applied genomics to increase disease resistance in cucurbit crops. Our primary objectives are: develop genomic and bioinformatic breeding tool kits for accelerated crop improvement across the...

  12. Endonucleases: new tools to edit the mouse genome.

    PubMed

    Wijshake, Tobias; Baker, Darren J; van de Sluis, Bart

    2014-10-01

    Mouse transgenesis has been instrumental in determining the function of genes in the pathophysiology of human diseases and modification of genes by homologous recombination in mouse embryonic stem cells remains a widely used technology. However, this approach harbors a number of disadvantages, as it is time-consuming and quite laborious. Over the last decade a number of new genome editing technologies have been developed, including zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs) and clustered regularly interspaced short palindromic repeats/CRISPR-associated (CRISPR/Cas). These systems are characterized by a designed DNA binding protein or RNA sequence fused or co-expressed with a non-specific endonuclease, respectively. The engineered DNA binding protein or RNA sequence guides the nuclease to a specific target sequence in the genome to induce a double strand break. The subsequent activation of the DNA repair machinery then enables the introduction of gene modifications at the target site, such as gene disruption, correction or insertion. Nuclease-mediated genome editing has numerous advantages over conventional gene targeting, including increased efficiency in gene editing, reduced generation time of mutant mice, and the ability to mutagenize multiple genes simultaneously. Although nuclease-driven modifications in the genome are a powerful tool to generate mutant mice, there are concerns about off-target cleavage, especially when using the CRISPR/Cas system. Here, we describe the basic principles of these new strategies in mouse genome manipulation, their inherent advantages, and their potential disadvantages compared to current technologies used to study gene function in mouse models. This article is part of a Special Issue entitled: From Genome to Function.

  13. Bovine Genome Database: integrated tools for genome annotation and discovery.

    PubMed

    Childers, Christopher P; Reese, Justin T; Sundaram, Jaideep P; Vile, Donald C; Dickens, C Michael; Childs, Kevin L; Salih, Hanni; Bennett, Anna K; Hagen, Darren E; Adelson, David L; Elsik, Christine G

    2011-01-01

    The Bovine Genome Database (BGD; http://BovineGenome.org) strives to improve annotation of the bovine genome and to integrate the genome sequence with other genomics data. BGD includes GBrowse genome browsers, the Apollo Annotation Editor, a quantitative trait loci (QTL) viewer, BLAST databases and gene pages. Genome browsers, available for both scaffold and chromosome coordinate systems, display the bovine Official Gene Set (OGS), RefSeq and Ensembl gene models, non-coding RNA, repeats, pseudogenes, single-nucleotide polymorphism, markers, QTL and alignments to complementary DNAs, ESTs and protein homologs. The Bovine QTL viewer is connected to the BGD Chromosome GBrowse, allowing for the identification of candidate genes underlying QTL. The Apollo Annotation Editor connects directly to the BGD Chado database to provide researchers with remote access to gene evidence in a graphical interface that allows editing and creating new gene models. Researchers may upload their annotations to the BGD server for review and integration into the subsequent release of the OGS. Gene pages display information for individual OGS gene models, including gene structure, transcript variants, functional descriptions, gene symbols, Gene Ontology terms, annotator comments and links to National Center for Biotechnology Information and Ensembl. Each gene page is linked to a wiki page to allow input from the research community.

  14. Genomic Tools in Pea Breeding Programs: Status and Perspectives

    PubMed Central

    Tayeh, Nadim; Aubert, Grégoire; Pilet-Nayel, Marie-Laure; Lejeune-Hénaut, Isabelle; Warkentin, Thomas D.; Burstin, Judith

    2015-01-01

    Pea (Pisum sativum L.) is an annual cool-season legume and one of the oldest domesticated crops. Dry pea seeds contain 22–25% protein, complex starch and fiber constituents, and a rich array of vitamins, minerals, and phytochemicals which make them a valuable source for human consumption and livestock feed. Dry pea ranks third to common bean and chickpea as the most widely grown pulse in the world with more than 11 million tons produced in 2013. Pea breeding has achieved great success since the time of Mendel's experiments in the mid-1800s. However, several traits still require significant improvement for better yield stability in a larger growing area. Key breeding objectives in pea include improving biotic and abiotic stress resistance and enhancing yield components and seed quality. Taking advantage of the diversity present in the pea genepool, many mapping populations have been constructed in the last decades and efforts have been deployed to identify loci involved in the control of target traits and further introgress them into elite breeding materials. Pea now benefits from next-generation sequencing and high-throughput genotyping technologies that are paving the way for genome-wide association studies and genomic selection approaches. This review covers the significant development and deployment of genomic tools for pea breeding in recent years. Future prospects are discussed especially in light of current progress toward deciphering the pea genome. PMID:26640470

  15. Elucidating the molecular responses of apple rootstock resistant to ARD pathogens: challenges and opportunities for development of genomics-assisted breeding tools.

    PubMed

    Zhu, Yanmin; Fazio, Gennaro; Mazzola, Mark

    2014-01-01

    may lead to different outcomes between rootstocks in response to invasion by necrotrophic pathogens. Elucidation of host defense mechanisms is critical in developing molecular tools for genomics-assisted breeding of resistant apple rootstocks. Due to their perennial nature, use of resistant rootstocks as a component for disease management might offer a durable and cost-effective benefit to tree performance than the standard practice of soil fumigation for control of ARD.

  16. Elucidating the molecular responses of apple rootstock resistant to ARD pathogens: challenges and opportunities for development of genomics-assisted breeding tools

    PubMed Central

    Zhu, Yanmin; Fazio, Gennaro; Mazzola, Mark

    2014-01-01

    may lead to different outcomes between rootstocks in response to invasion by necrotrophic pathogens. Elucidation of host defense mechanisms is critical in developing molecular tools for genomics-assisted breeding of resistant apple rootstocks. Due to their perennial nature, use of resistant rootstocks as a component for disease management might offer a durable and cost-effective benefit to tree performance than the standard practice of soil fumigation for control of ARD. PMID:26504547

  17. CRISPR/Cas9: an advanced tool for editing plant genomes.

    PubMed

    Samanta, Milan Kumar; Dey, Avishek; Gayen, Srimonta

    2016-10-01

    To meet current challenges in agriculture, genome editing using sequence-specific nucleases (SSNs) is a powerful tool for basic and applied plant biology research. Here, we describe the principle and application of available genome editing tools, including zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs) and the clustered regularly interspaced short palindromic repeat associated CRISPR/Cas9 system. Among these SSNs, CRISPR/Cas9 is the most recently characterized and rapidly developing genome editing technology, and has been successfully utilized in a wide variety of organisms. This review specifically illustrates the power of CRISPR/Cas9 as a tool for plant genome engineering, and describes the strengths and weaknesses of the CRISPR/Cas9 technology compared to two well-established genome editing tools, ZFNs and TALENs.

  18. FDA Bioinformatics Tool for Microbial Genomics Research on Molecular Characterization of Bacterial Foodborne Pathogens Using Microarrays

    USDA-ARS?s Scientific Manuscript database

    Background: Advances in microbial genomics and bioinformatics are offering greater insights into the emergence and spread of foodborne pathogens in outbreak scenarios. The Food and Drug Administration (FDA) has developed the genomics tool ArrayTrackTM, which provides extensive functionalities to man...

  19. Patenting genome research tools and the law.

    PubMed

    Eisenberg, Rebecca

    2003-01-01

    Patenting genes encoding therapeutic proteins was relatively uncontroversial in the early days of biotechnology. Controversy arose in the era of high-throughput DNA sequencing, when gene patents started to look less like patents on drugs and more like patents on scientific information. Evolving scientific and business strategies for exploiting genomic information raised concerns that patents might slow subsequent research. The trend towards stricter enforcement of the utility and disclosure requirements by the patent offices should help clarify the current confusion.

  20. First insights into the transcriptome and development of new genomic tools of a widespread circum-Mediterranean tree species, Pinus halepensis Mill.

    PubMed

    Pinosio, S; González-Martínez, S C; Bagnoli, F; Cattonaro, F; Grivet, D; Marroni, F; Lorenzo, Z; Pausas, J G; Verdú, M; Vendramin, G G

    2014-07-01

    Aleppo pine (Pinus halepensis Mill.) is a relevant conifer species for studying adaptive responses to drought and fire regimes in the Mediterranean region. In this study, we performed Illumina next-generation sequencing of two phenotypically divergent Aleppo pine accessions with the aims of (i) characterizing the transcriptome through Illumina RNA-Seq on trees phenotypically divergent for adaptive traits linked to fire adaptation and drought, (ii) performing a functional annotation of the assembled transcriptome, (iii) identifying genes with accelerated evolutionary rates, (iv) studying the expression levels of the annotated genes and (v) developing gene-based markers for population genomic and association genetic studies. The assembled transcriptome consisted of 48,629 contigs and covered about 54.6 Mbp. The comparison of Aleppo pine transcripts to Picea sitchensis protein-coding sequences resulted in the detection of 34,014 SNPs across species, with a Ka /Ks average value of 0.216, suggesting that the majority of the assembled genes are under negative selection. Several genes were differentially expressed across the two pine accessions with contrasted phenotypes, including a glutathione-s-transferase, a cellulose synthase and a cobra-like protein. A large number of new markers (3334 amplifiable SSRs and 28,236 SNPs) have been identified which should facilitate future population genomics and association genetics in this species. A 384-SNP Oligo Pool Assay for genotyping with the Illumina VeraCode technology has been designed which showed an high overall SNP conversion rate (76.6%). Our results showed that Illumina next-generation sequencing is a valuable technology to obtain an extensive overview on whole transcriptomes of nonmodel species with large genomes.

  1. Groundnut improvement: use of genetic and genomic tools

    PubMed Central

    Janila, Pasupuleti; Nigam, S. N.; Pandey, Manish K.; Nagesh, P.; Varshney, Rajeev K.

    2013-01-01

    Groundnut (Arachis hypogaea L.), a self-pollinated legume is an important crop cultivated in 24 million ha world over for extraction of edible oil and food uses. The kernels are rich in oil (48–50%) and protein (25–28%), and are source of several vitamins, minerals, antioxidants, biologically active polyphenols, flavonoids, and isoflavones. Improved varieties of groundnut with high yield potential were developed and released for cultivation world over. The improved varieties belong to different maturity durations and possess resistance to diseases, tolerance to drought, enhanced oil content, and improved quality traits for food uses. Conventional breeding procedures along with the tools for phenotyping were largely used in groundnut improvement programs. Mutations were used to induce variability and wide hybridization was attempted to tap variability from wild species. Low genetic variability has been a bottleneck for groundnut improvement. The vast potential of wild species, reservoir of new alleles remains under-utilized. Development of linkage maps of groundnut during the last decade was followed by identification of markers and quantitative trait loci for the target traits. Consequently, the last decade has witnessed the deployment of molecular breeding approaches to complement the ongoing groundnut improvement programs in USA, China, India, and Japan. The other potential advantages of molecular breeding are the feasibility to target multiple traits for improvement and provide tools to tap new alleles from wild species. The first groundnut variety developed through marker-assisted back-crossing is a root-knot nematode-resistant variety, NemaTAM in USA. The uptake of molecular breeding approaches in groundnut improvement programs by NARS partners in India and many African countries is slow or needs to be initiated in part due to inadequate infrastructure, high genotyping costs, and human capacities. Availability of draft genome sequence for diploid (AA and

  2. Pathway Tools version 19.0 update: software for pathway/genome informatics and systems biology.

    PubMed

    Karp, Peter D; Latendresse, Mario; Paley, Suzanne M; Krummenacker, Markus; Ong, Quang D; Billington, Richard; Kothari, Anamika; Weaver, Daniel; Lee, Thomas; Subhraveti, Pallavi; Spaulding, Aaron; Fulcher, Carol; Keseler, Ingrid M; Caspi, Ron

    2016-09-01

    Pathway Tools is a bioinformatics software environment with a broad set of capabilities. The software provides genome-informatics tools such as a genome browser, sequence alignments, a genome-variant analyzer and comparative-genomics operations. It offers metabolic-informatics tools, such as metabolic reconstruction, quantitative metabolic modeling, prediction of reaction atom mappings and metabolic route search. Pathway Tools also provides regulatory-informatics tools, such as the ability to represent and visualize a wide range of regulatory interactions. This article outlines the advances in Pathway Tools in the past 5 years. Major additions include components for metabolic modeling, metabolic route search, computation of atom mappings and estimation of compound Gibbs free energies of formation; addition of editors for signaling pathways, for genome sequences and for cellular architecture; storage of gene essentiality data and phenotype data; display of multiple alignments, and of signaling and electron-transport pathways; and development of Python and web-services application programming interfaces. Scientists around the world have created more than 9800 Pathway/Genome Databases by using Pathway Tools, many of which are curated databases for important model organisms. © The Author 2015. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.

  3. Pathway Tools version 19.0 update: software for pathway/genome informatics and systems biology

    PubMed Central

    Latendresse, Mario; Paley, Suzanne M.; Krummenacker, Markus; Ong, Quang D.; Billington, Richard; Kothari, Anamika; Weaver, Daniel; Lee, Thomas; Subhraveti, Pallavi; Spaulding, Aaron; Fulcher, Carol; Keseler, Ingrid M.; Caspi, Ron

    2016-01-01

    Pathway Tools is a bioinformatics software environment with a broad set of capabilities. The software provides genome-informatics tools such as a genome browser, sequence alignments, a genome-variant analyzer and comparative-genomics operations. It offers metabolic-informatics tools, such as metabolic reconstruction, quantitative metabolic modeling, prediction of reaction atom mappings and metabolic route search. Pathway Tools also provides regulatory-informatics tools, such as the ability to represent and visualize a wide range of regulatory interactions. This article outlines the advances in Pathway Tools in the past 5 years. Major additions include components for metabolic modeling, metabolic route search, computation of atom mappings and estimation of compound Gibbs free energies of formation; addition of editors for signaling pathways, for genome sequences and for cellular architecture; storage of gene essentiality data and phenotype data; display of multiple alignments, and of signaling and electron-transport pathways; and development of Python and web-services application programming interfaces. Scientists around the world have created more than 9800 Pathway/Genome Databases by using Pathway Tools, many of which are curated databases for important model organisms. PMID:26454094

  4. Development of genome viewer (Web Omics Viewer) for managing databases of cucumber genome

    NASA Astrophysics Data System (ADS)

    Wojcieszek, M.; RóŻ, P.; Pawełkowicz, M.; Nowak, R.; Przybecki, Z.

    Cucumber is an important plant in horticulture and science world. Sequencing projects of C. sativus genome enable new methodological aproaches in further investigation of this species. Accessibility is crucial to fully exploit obtained information about detail structure of genes, markers and other characteristic features such contigs, scaffolds and chromosomes. Genome viewer is one of tools providing plain and easy way for presenting genome data for users and for databases administration. Gbrowse - the main viewer has several very useful features but lacks in managing simplicity. Our group developed new genome browser Web Omics Viewer (WOV), keeping functionality but improving utilization and accessibility to cucumber genome data.

  5. Laser microdissection: A powerful tool for genomics at cell level.

    PubMed

    Bevilacqua, Claudia; Ducos, Bertrand

    2017-09-16

    Laser microdissection (LM) has become considerably democratized over the last fifteen years. Instruments have evolved to offer more powerful and efficient lasers as well as new options for sample collection and preparation. Technological evolutions have also focused on the post-microdissection analysis capabilities, opening up investigations in all disciplines of experimental and clinical biology, thanks to the advent of new high-throughput methods of genome analysis, including RNAseq and proteomics, now globally known as microgenomics, i.e. analysis of biomolecules at the cell level. In spite of the advances these rapidly developing methods have allowed, the workflow for sampling and collection by LM remains a critical step in insuring sample integrity in terms of histology (accurate cell identification) and biochemistry (reliable analyzes of biomolecules). In this review, we describe the sample processing as well as the strengths and limiting factors of LM applied to the specific selection of one or more cells of interest from a heterogeneous tissue. We will see how the latest developments in protocols and methods have made LM a powerful and sometimes essential tool for genomic and proteomic analyzes of tiny amounts of biomolecules extracted from few cells isolated from a complex tissue, in their physiological context, thus offering new opportunities for understanding fundamental physiological and/or patho-physiological processes. Copyright © 2017. Published by Elsevier Ltd.

  6. The Plant Ontology: A Tool for Plant Genomics.

    PubMed

    Cooper, Laurel; Jaiswal, Pankaj

    2016-01-01

    The use of controlled, structured vocabularies (ontologies) has become a critical tool for scientists in the post-genomic era of massive datasets. Adoption and integration of common vocabularies and annotation practices enables cross-species comparative analyses and increases data sharing and reusability. The Plant Ontology (PO; http://www.plantontology.org/ ) describes plant anatomy, morphology, and the stages of plant development, and offers a database of plant genomics annotations associated to the PO terms. The scope of the PO has grown from its original design covering only rice, maize, and Arabidopsis, and now includes terms to describe all green plants from angiosperms to green algae.This chapter introduces how the PO and other related ontologies are constructed and organized, including languages and software used for ontology development, and provides an overview of the key features. Detailed instructions illustrate how to search and browse the PO database and access the associated annotation data. Users are encouraged to provide input on the ontology through the online term request form and contribute datasets for integration in the PO database.

  7. GREAT: a web portal for Genome Regulatory Architecture Tools.

    PubMed

    Bouyioukos, Costas; Bucchini, François; Elati, Mohamed; Képès, François

    2016-07-08

    GREAT (Genome REgulatory Architecture Tools) is a novel web portal for tools designed to generate user-friendly and biologically useful analysis of genome architecture and regulation. The online tools of GREAT are freely accessible and compatible with essentially any operating system which runs a modern browser. GREAT is based on the analysis of genome layout -defined as the respective positioning of co-functional genes- and its relation with chromosome architecture and gene expression. GREAT tools allow users to systematically detect regular patterns along co-functional genomic features in an automatic way consisting of three individual steps and respective interactive visualizations. In addition to the complete analysis of regularities, GREAT tools enable the use of periodicity and position information for improving the prediction of transcription factor binding sites using a multi-view machine learning approach. The outcome of this integrative approach features a multivariate analysis of the interplay between the location of a gene and its regulatory sequence. GREAT results are plotted in web interactive graphs and are available for download either as individual plots, self-contained interactive pages or as machine readable tables for downstream analysis. The GREAT portal can be reached at the following URL https://absynth.issb.genopole.fr/GREAT and each individual GREAT tool is available for downloading. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  8. Integrative genomic analysis by interoperation of bioinformatics tools in GenomeSpace

    PubMed Central

    Thorvaldsdottir, Helga; Liefeld, Ted; Ocana, Marco; Borges-Rivera, Diego; Pochet, Nathalie; Robinson, James T.; Demchak, Barry; Hull, Tim; Ben-Artzi, Gil; Blankenberg, Daniel; Barber, Galt P.; Lee, Brian T.; Kuhn, Robert M.; Nekrutenko, Anton; Segal, Eran; Ideker, Trey; Reich, Michael; Regev, Aviv; Chang, Howard Y.; Mesirov, Jill P.

    2015-01-01

    Integrative analysis of multiple data types to address complex biomedical questions requires the use of multiple software tools in concert and remains an enormous challenge for most of the biomedical research community. Here we introduce GenomeSpace (http://www.genomespace.org), a cloud-based, cooperative community resource. Seeded as a collaboration of six of the most popular genomics analysis tools, GenomeSpace now supports the streamlined interaction of 20 bioinformatics tools and data resources. To facilitate the ability of non-programming users’ to leverage GenomeSpace in integrative analysis, it offers a growing set of ‘recipes’, short workflows involving a few tools and steps to guide investigators through high utility analysis tasks. PMID:26780094

  9. Improved criteria and comparative genomics tool provide new insights into grass paleogenomics.

    PubMed

    Salse, Jerome; Abrouk, Michael; Murat, Florent; Quraishi, Umar Masood; Feuillet, Catherine

    2009-11-01

    In the past decade, a number of bioinformatics tools have been developed to perform comparative genomics studies in plants and animals. However, most of the publicly available and user friendly tools lack common standards for the identification of robust orthologous relationships between genomes leading non-specialists to often over interpret the results of large scale comparative sequence analyses. Recently, we have established a number of improved parameters and tools to define significant relationships between genomes as a basis to develop paleogenomics studies in grasses. Here, we describe our approaches and propose the development of community-based standards that can be used in comparative genomic studies to (i) identify robust sets of orthologous gene pairs, (ii) derive complete sets of chromosome to chromosome relationships within and between genomes and (iii) model common paleo-ancestor genome structures. The rice and sorghum genome sequences are used to exemplify step-by-step a methodology that should allow users to perform accurate comparative genome analyses in their favourite species. Finally, we describe two applications for accurate gene annotation and synteny-based cloning of agronomically important traits.

  10. Public data and open source tools for multi-assay genomic investigation of disease

    PubMed Central

    Kannan, Lavanya; Ramos, Marcel; Re, Angela; El-Hachem, Nehme; Safikhani, Zhaleh; Gendoo, Deena M.A.; Davis, Sean; Gomez-Cabrero, David; Castelo, Robert; Hansen, Kasper D.; Carey, Vincent J.; Morgan, Martin; Culhane, Aedín C.; Haibe-Kains, Benjamin

    2016-01-01

    Molecular interrogation of a biological sample through DNA sequencing, RNA and microRNA profiling, proteomics and other assays, has the potential to provide a systems level approach to predicting treatment response and disease progression, and to developing precision therapies. Large publicly funded projects have generated extensive and freely available multi-assay data resources; however, bioinformatic and statistical methods for the analysis of such experiments are still nascent. We review multi-assay genomic data resources in the areas of clinical oncology, pharmacogenomics and other perturbation experiments, population genomics and regulatory genomics and other areas, and tools for data acquisition. Finally, we review bioinformatic tools that are explicitly geared toward integrative genomic data visualization and analysis. This review provides starting points for accessing publicly available data and tools to support development of needed integrative methods. PMID:26463000

  11. Analysis tools for the interplay between genome layout and regulation.

    PubMed

    Bouyioukos, Costas; Elati, Mohamed; Képès, François

    2016-06-06

    Genome layout and gene regulation appear to be interdependent. Understanding this interdependence is key to exploring the dynamic nature of chromosome conformation and to engineering functional genomes. Evidence for non-random genome layout, defined as the relative positioning of either co-functional or co-regulated genes, stems from two main approaches. Firstly, the analysis of contiguous genome segments across species, has highlighted the conservation of gene arrangement (synteny) along chromosomal regions. Secondly, the study of long-range interactions along a chromosome has emphasised regularities in the positioning of microbial genes that are co-regulated, co-expressed or evolutionarily correlated. While one-dimensional pattern analysis is a mature field, it is often powerless on biological datasets which tend to be incomplete, and partly incorrect. Moreover, there is a lack of comprehensive, user-friendly tools to systematically analyse, visualise, integrate and exploit regularities along genomes. Here we present the Genome REgulatory and Architecture Tools SCAN (GREAT:SCAN) software for the systematic study of the interplay between genome layout and gene expression regulation. SCAN is a collection of related and interconnected applications currently able to perform systematic analyses of genome regularities as well as to improve transcription factor binding sites (TFBS) and gene regulatory network predictions based on gene positional information. We demonstrate the capabilities of these tools by studying on one hand the regular patterns of genome layout in the major regulons of the bacterium Escherichia coli. On the other hand, we demonstrate the capabilities to improve TFBS prediction in microbes. Finally, we highlight, by visualisation of multivariate techniques, the interplay between position and sequence information for effective transcription regulation.

  12. Next generation tools for genomic data generation, distribution, and visualization.

    PubMed

    Nix, David A; Di Sera, Tonya L; Dalley, Brian K; Milash, Brett A; Cundick, Robert M; Quinn, Kevin S; Courdy, Samir J

    2010-09-09

    With the rapidly falling cost and availability of high throughput sequencing and microarray technologies, the bottleneck for effectively using genomic analysis in the laboratory and clinic is shifting to one of effectively managing, analyzing, and sharing genomic data. Here we present three open-source, platform independent, software tools for generating, analyzing, distributing, and visualizing genomic data. These include a next generation sequencing/microarray LIMS and analysis project center (GNomEx); an application for annotating and programmatically distributing genomic data using the community vetted DAS/2 data exchange protocol (GenoPub); and a standalone Java Swing application (GWrap) that makes cutting edge command line analysis tools available to those who prefer graphical user interfaces. Both GNomEx and GenoPub use the rich client Flex/Flash web browser interface to interact with Java classes and a relational database on a remote server. Both employ a public-private user-group security model enabling controlled distribution of patient and unpublished data alongside public resources. As such, they function as genomic data repositories that can be accessed manually or programmatically through DAS/2-enabled client applications such as the Integrated Genome Browser. These tools have gained wide use in our core facilities, research laboratories and clinics and are freely available for non-profit use. See http://sourceforge.net/projects/gnomex/, http://sourceforge.net/projects/genoviz/, and http://sourceforge.net/projects/useq.

  13. DHPC: a new tool to express genome structural features.

    PubMed

    Deng, Xuegong; Deng, Xuemei; Rayner, Simon; Liu, Xiangdong; Zhang, Qingling; Yang, Yupu; Li, Ning

    2008-05-01

    The DHPC (DNA Hilbert-Peano curve) is a new tool for visualizing large-scale genome sequences by mapping sequences into a two-dimensional square. It utilizes the space-filling function of Hilbert-Peano mapping. By applying a Gauss smoothing technique and a user-defined color function, a large-scale genome sequence can be mapped into a two-dimensional color image. In the calculated DHPCs, many genome characteristics are revealed. In this article we introduce the method and show how DHPCs may be used to identify regions of different base composition. The power of the method is demonstrated by presenting multiple examples such as repeating sequences, degree of base bias, regions of homogeneity and their boundaries, and mark of annotated segments. We also present several genome curves generated by DHPC to demonstrate how DHPC can be used to find previously unidentified sequence features in these genomes.

  14. Hawkeye: an interactive visual analytics tool for genome assemblies

    PubMed Central

    2007-01-01

    Genome sequencing remains an inexact science, and genome sequences can contain significant errors if they are not carefully examined. Hawkeye is our new visual analytics tool for genome assemblies, designed to aid in identifying and correcting assembly errors. Users can analyze all levels of an assembly along with summary statistics and assembly metrics, and are guided by a ranking component towards likely mis-assemblies. Hawkeye is freely available and released as part of the open source AMOS project http://amos.sourceforge.net/hawkeye. PMID:17349036

  15. Genome Calligrapher: A Web Tool for Refactoring Bacterial Genome Sequences for de Novo DNA Synthesis.

    PubMed

    Christen, Matthias; Deutsch, Samuel; Christen, Beat

    2015-08-21

    Recent advances in synthetic biology have resulted in an increasing demand for the de novo synthesis of large-scale DNA constructs. Any process improvement that enables fast and cost-effective streamlining of digitized genetic information into fabricable DNA sequences holds great promise to study, mine, and engineer genomes. Here, we present Genome Calligrapher, a computer-aided design web tool intended for whole genome refactoring of bacterial chromosomes for de novo DNA synthesis. By applying a neutral recoding algorithm, Genome Calligrapher optimizes GC content and removes obstructive DNA features known to interfere with the synthesis of double-stranded DNA and the higher order assembly into large DNA constructs. Subsequent bioinformatics analysis revealed that synthesis constraints are prevalent among bacterial genomes. However, a low level of codon replacement is sufficient for refactoring bacterial genomes into easy-to-synthesize DNA sequences. To test the algorithm, 168 kb of synthetic DNA comprising approximately 20 percent of the synthetic essential genome of the cell-cycle bacterium Caulobacter crescentus was streamlined and then ordered from a commercial supplier of low-cost de novo DNA synthesis. The successful assembly into eight 20 kb segments indicates that Genome Calligrapher algorithm can be efficiently used to refactor difficult-to-synthesize DNA. Genome Calligrapher is broadly applicable to recode biosynthetic pathways, DNA sequences, and whole bacterial genomes, thus offering new opportunities to use synthetic biology tools to explore the functionality of microbial diversity. The Genome Calligrapher web tool can be accessed at https://christenlab.ethz.ch/GenomeCalligrapher  .

  16. PSAT: A web tool to compare genomic neighborhoods of multiple prokaryotic genomes

    PubMed Central

    Fong, Christine; Rohmer, Laurence; Radey, Matthew; Wasnick, Michael; Brittnacher, Mitchell J

    2008-01-01

    Background The conservation of gene order among prokaryotic genomes can provide valuable insight into gene function, protein interactions, or events by which genomes have evolved. Although some tools are available for visualizing and comparing the order of genes between genomes of study, few support an efficient and organized analysis between large numbers of genomes. The Prokaryotic Sequence homology Analysis Tool (PSAT) is a web tool for comparing gene neighborhoods among multiple prokaryotic genomes. Results PSAT utilizes a database that is preloaded with gene annotation, BLAST hit results, and gene-clustering scores designed to help identify regions of conserved gene order. Researchers use the PSAT web interface to find a gene of interest in a reference genome and efficiently retrieve the sequence homologs found in other bacterial genomes. The tool generates a graphic of the genomic neighborhood surrounding the selected gene and the corresponding regions for its homologs in each comparison genome. Homologs in each region are color coded to assist users with analyzing gene order among various genomes. In contrast to common comparative analysis methods that filter sequence homolog data based on alignment score cutoffs, PSAT leverages gene context information for homologs, including those with weak alignment scores, enabling a more sensitive analysis. Features for constraining or ordering results are designed to help researchers browse results from large numbers of comparison genomes in an organized manner. PSAT has been demonstrated to be useful for helping to identify gene orthologs and potential functional gene clusters, and detecting genome modifications that may result in loss of function. Conclusion PSAT allows researchers to investigate the order of genes within local genomic neighborhoods of multiple genomes. A PSAT web server for public use is available for performing analyses on a growing set of reference genomes through any web browser with no client

  17. A Simulation Tool for the Study of Symmetric Inversions in Bacterial Genomes

    NASA Astrophysics Data System (ADS)

    Dias, Ulisses; Dias, Zanoni; Setubal, João C.

    We present the tool SIB that simulates genomic inversions in bacterial chromosomes. The tool simulates symmetric inversions but allows the appearance of nonsymmetric inversions by simulating small syntenic blocks frequently observed on bacterial genome comparisons. We evaluate SIB by comparing its results to real genome alignments. We develop measures that allow quantitative comparisons between real pairwise alignments (in terms of dotplots) and simulated ones. These measures allow an evaluation of SIB in terms of dendrograms. We evaluate SIB by comparing its results to whole chromosome alignments and maximum likelihood trees for three bacterial groups (the Pseudomonadaceae family and the Xanthomonas and Shewanella genera). We demonstrate an application of SIB by using it to evaluate the ancestral genome reconstruction tool MGR.

  18. Emerging Genomic Tools for Legume Breeding: Current Status and Future Prospects.

    PubMed

    Pandey, Manish K; Roorkiwal, Manish; Singh, Vikas K; Ramalingam, Abirami; Kudapa, Himabindu; Thudi, Mahendar; Chitikineni, Anu; Rathore, Abhishek; Varshney, Rajeev K

    2016-01-01

    Legumes play a vital role in ensuring global nutritional food security and improving soil quality through nitrogen fixation. Accelerated higher genetic gains is required to meet the demand of ever increasing global population. In recent years, speedy developments have been witnessed in legume genomics due to advancements in next-generation sequencing (NGS) and high-throughput genotyping technologies. Reference genome sequences for many legume crops have been reported in the last 5 years. The availability of the draft genome sequences and re-sequencing of elite genotypes for several important legume crops have made it possible to identify structural variations at large scale. Availability of large-scale genomic resources and low-cost and high-throughput genotyping technologies are enhancing the efficiency and resolution of genetic mapping and marker-trait association studies. Most importantly, deployment of molecular breeding approaches has resulted in development of improved lines in some legume crops such as chickpea and groundnut. In order to support genomics-driven crop improvement at a fast pace, the deployment of breeder-friendly genomics and decision support tools seems appear to be critical in breeding programs in developing countries. This review provides an overview of emerging genomics and informatics tools/approaches that will be the key driving force for accelerating genomics-assisted breeding and ultimately ensuring nutritional and food security in developing countries.

  19. Emerging Genomic Tools for Legume Breeding: Current Status and Future Prospects

    PubMed Central

    Pandey, Manish K.; Roorkiwal, Manish; Singh, Vikas K.; Ramalingam, Abirami; Kudapa, Himabindu; Thudi, Mahendar; Chitikineni, Anu; Rathore, Abhishek; Varshney, Rajeev K.

    2016-01-01

    Legumes play a vital role in ensuring global nutritional food security and improving soil quality through nitrogen fixation. Accelerated higher genetic gains is required to meet the demand of ever increasing global population. In recent years, speedy developments have been witnessed in legume genomics due to advancements in next-generation sequencing (NGS) and high-throughput genotyping technologies. Reference genome sequences for many legume crops have been reported in the last 5 years. The availability of the draft genome sequences and re-sequencing of elite genotypes for several important legume crops have made it possible to identify structural variations at large scale. Availability of large-scale genomic resources and low-cost and high-throughput genotyping technologies are enhancing the efficiency and resolution of genetic mapping and marker-trait association studies. Most importantly, deployment of molecular breeding approaches has resulted in development of improved lines in some legume crops such as chickpea and groundnut. In order to support genomics-driven crop improvement at a fast pace, the deployment of breeder-friendly genomics and decision support tools seems appear to be critical in breeding programs in developing countries. This review provides an overview of emerging genomics and informatics tools/approaches that will be the key driving force for accelerating genomics-assisted breeding and ultimately ensuring nutritional and food security in developing countries. PMID:27199998

  20. Comparing thousands of circular genomes using the CGView Comparison Tool

    PubMed Central

    2012-01-01

    Background Continued sequencing efforts coupled with advances in sequencing technology will lead to the completion of a vast number of small genomes. Whole-genome comparisons represent an important part of the analysis of any new genome sequence, as they can provide a better understanding of the biology and evolution of the source organism. Visualization of the results is important, as it allows information from a variety of sources to be integrated and interpreted. However, existing graphical comparison tools lack features needed for efficiently comparing a new genome to hundreds or thousands of existing sequences. Moreover, existing tools are limited in terms of the types of comparisons that can be performed, the extent to which the output can be customized, and the ease with which the entire process can be automated. Results The CGView Comparison Tool (CCT) is a package for visually comparing bacterial, plasmid, chloroplast, or mitochondrial sequences of interest to existing genomes or sequence collections. The comparisons are conducted using BLAST, and the BLAST results are presented in the form of graphical maps that can also show sequence features, gene and protein names, COG (Clusters of Orthologous Groups of proteins) category assignments, and sequence composition characteristics. CCT can generate maps in a variety of sizes, including 400 Megapixel maps suitable for posters. Comparisons can be conducted within a particular species or genus, or all available genomes can be used. The entire map creation process, from downloading sequences to redrawing zoomed maps, can be completed easily using scripts included with the CCT. User-defined features or analysis results can be included on maps, and maps can be extensively customized. To simplify program setup, a CCT virtual machine that includes all dependencies preinstalled is available. Detailed tutorials illustrating the use of CCT are included with the CCT documentation. Conclusion CCT can be used to visually

  1. Frageria vesca, a useful tool for Rosaceae Genomics

    USDA-ARS?s Scientific Manuscript database

    The diploid woodland strawberry, Fragaria vesca subsp. vesca forma semperflorens, has many characteristics that make it an ideal plant for strawberry genomics research. A brief review of some of the tools available for using this plant as a reference for studying gene function in strawberry and oth...

  2. MaGnET: Malaria Genome Exploration Tool

    PubMed Central

    Sharman, Joanna L.; Gerloff, Dietlind L.

    2013-01-01

    Summary: The Malaria Genome Exploration Tool (MaGnET) is a software tool enabling intuitive ‘exploration-style’ visualization of functional genomics data relating to the malaria parasite, Plasmodium falciparum. MaGnET provides innovative integrated graphic displays for different datasets, including genomic location of genes, mRNA expression data, protein–protein interactions and more. Any selection of genes to explore made by the user is easily carried over between the different viewers for different datasets, and can be changed interactively at any point (without returning to a search). Availability and Implementation: Free online use (Java Web Start) or download (Java application archive and MySQL database; requires local MySQL installation) at http://malariagenomeexplorer.org Contact: joanna.sharman@ed.ac.uk or dgerloff@ffame.org Supplementary information: Supplementary data are available at Bioinformatics online. PMID:23894142

  3. MaGnET: Malaria Genome Exploration Tool.

    PubMed

    Sharman, Joanna L; Gerloff, Dietlind L

    2013-09-15

    The Malaria Genome Exploration Tool (MaGnET) is a software tool enabling intuitive 'exploration-style' visualization of functional genomics data relating to the malaria parasite, Plasmodium falciparum. MaGnET provides innovative integrated graphic displays for different datasets, including genomic location of genes, mRNA expression data, protein-protein interactions and more. Any selection of genes to explore made by the user is easily carried over between the different viewers for different datasets, and can be changed interactively at any point (without returning to a search). Free online use (Java Web Start) or download (Java application archive and MySQL database; requires local MySQL installation) at http://malariagenomeexplorer.org joanna.sharman@ed.ac.uk or dgerloff@ffame.org Supplementary data are available at Bioinformatics online.

  4. Developing ICALL Tools Using GATE

    ERIC Educational Resources Information Center

    Wood, Peter

    2008-01-01

    This article discusses the use of the General Architecture for Text Engineering (GATE) as a tool for the development of ICALL and NLP applications. It outlines a paradigm shift in software development, which is mainly influenced by projects such as the Free Software Foundation. It looks at standards that have been proposed to facilitate the…

  5. Developing the Advising "Tool Kit."

    ERIC Educational Resources Information Center

    Kelly, James J.

    1988-01-01

    Developing an inventory consisting of each and every item that contributes to a total advising program is suggested. The point to be made by developing a campus "Advising Tool Kit" such as Penn State's is that advising and advisors are becoming increasingly integral to the entire educational mission of colleges and universities. (MLW)

  6. Developing ICALL Tools Using GATE

    ERIC Educational Resources Information Center

    Wood, Peter

    2008-01-01

    This article discusses the use of the General Architecture for Text Engineering (GATE) as a tool for the development of ICALL and NLP applications. It outlines a paradigm shift in software development, which is mainly influenced by projects such as the Free Software Foundation. It looks at standards that have been proposed to facilitate the…

  7. Applications of genomic tools to colorectal cancer therapeutics.

    PubMed

    Auman, J Todd; McLeod, Howard L

    2008-12-01

    Clinically and histopathologically similar colorectal cancers exhibit considerable variability in their responses to chemotherapeutics. The advent of genomic technologies has enabled the unbiased determination of changes in DNA and RNA, alterations that may be responsible for, or predictive of the variability in response to chemotherapy. This review highlights several advances made in applying genomic tools toward colorectal cancer therapeutics. Progress has been made using gene expression profiling to identify which colorectal cancer patients would benefit most from adjuvant chemotherapy. In addition, advances have been made in colorectal cancer pharmacogenomics by identifying gene expression patterns associated with sensitivity to specific chemotherapeutic agents. Lastly, the use of genome-wide mutation screening of individual tumor samples to identify the profiles of mutated genes is explored. Future research toward integrating genomic information with clinical and histopathological data is expected to lead to improved therapeutic management of colorectal cancer.

  8. GAPIT: genome association and prediction integrated tool

    USDA-ARS?s Scientific Manuscript database

    Advances in high throughput sequencing have improved the detection of genes underlying important traits as well as the prediction accuracy of disease risk and breeding value of crop or livestock. Software programs developed to perform statistical genetic analysis that support these activities should...

  9. Genomic Tools in Cowpea Breeding Programs: Status and Perspectives.

    PubMed

    Boukar, Ousmane; Fatokun, Christian A; Huynh, Bao-Lam; Roberts, Philip A; Close, Timothy J

    2016-01-01

    Cowpea is one of the most important grain legumes in sub-Saharan Africa (SSA). It provides strong support to the livelihood of small-scale farmers through its contributions to their nutritional security, income generation and soil fertility enhancement. Worldwide about 6.5 million metric tons of cowpea are produced annually on about 14.5 million hectares. The low productivity of cowpea is attributable to numerous abiotic and biotic constraints. The abiotic stress factors comprise drought, low soil fertility, and heat while biotic constraints include insects, diseases, parasitic weeds, and nematodes. Cowpea farmers also have limited access to quality seeds of improved varieties for planting. Some progress has been made through conventional breeding at international and national research institutions in the last three decades. Cowpea improvement could also benefit from modern breeding methods based on molecular genetic tools. A number of advances in cowpea genetic linkage maps, and quantitative trait loci associated with some desirable traits such as resistance to Striga, Macrophomina, Fusarium wilt, bacterial blight, root-knot nematodes, aphids, and foliar thrips have been reported. An improved consensus genetic linkage map has been developed and used to identify QTLs of additional traits. In order to take advantage of these developments single nucleotide polymorphism (SNP) genotyping is being streamlined to establish an efficient workflow supported by genotyping support service (GSS)-client interactions. About 1100 SNPs mapped on the cowpea genome were converted by LGC Genomics to KASP assays. Several cowpea breeding programs have been exploiting these resources to implement molecular breeding, especially for MARS and MABC, to accelerate cowpea variety improvement. The combination of conventional breeding and molecular breeding strategies, with workflow managed through the CGIAR breeding management system (BMS), promises an increase in the number of improved

  10. Genomic Tools in Cowpea Breeding Programs: Status and Perspectives

    PubMed Central

    Boukar, Ousmane; Fatokun, Christian A.; Huynh, Bao-Lam; Roberts, Philip A.; Close, Timothy J.

    2016-01-01

    Cowpea is one of the most important grain legumes in sub-Saharan Africa (SSA). It provides strong support to the livelihood of small-scale farmers through its contributions to their nutritional security, income generation and soil fertility enhancement. Worldwide about 6.5 million metric tons of cowpea are produced annually on about 14.5 million hectares. The low productivity of cowpea is attributable to numerous abiotic and biotic constraints. The abiotic stress factors comprise drought, low soil fertility, and heat while biotic constraints include insects, diseases, parasitic weeds, and nematodes. Cowpea farmers also have limited access to quality seeds of improved varieties for planting. Some progress has been made through conventional breeding at international and national research institutions in the last three decades. Cowpea improvement could also benefit from modern breeding methods based on molecular genetic tools. A number of advances in cowpea genetic linkage maps, and quantitative trait loci associated with some desirable traits such as resistance to Striga, Macrophomina, Fusarium wilt, bacterial blight, root-knot nematodes, aphids, and foliar thrips have been reported. An improved consensus genetic linkage map has been developed and used to identify QTLs of additional traits. In order to take advantage of these developments single nucleotide polymorphism (SNP) genotyping is being streamlined to establish an efficient workflow supported by genotyping support service (GSS)-client interactions. About 1100 SNPs mapped on the cowpea genome were converted by LGC Genomics to KASP assays. Several cowpea breeding programs have been exploiting these resources to implement molecular breeding, especially for MARS and MABC, to accelerate cowpea variety improvement. The combination of conventional breeding and molecular breeding strategies, with workflow managed through the CGIAR breeding management system (BMS), promises an increase in the number of improved

  11. Development of Bioinformatics Infrastructure for Genomics Research.

    PubMed

    Mulder, Nicola J; Adebiyi, Ezekiel; Adebiyi, Marion; Adeyemi, Seun; Ahmed, Azza; Ahmed, Rehab; Akanle, Bola; Alibi, Mohamed; Armstrong, Don L; Aron, Shaun; Ashano, Efejiro; Baichoo, Shakuntala; Benkahla, Alia; Brown, David K; Chimusa, Emile R; Fadlelmola, Faisal M; Falola, Dare; Fatumo, Segun; Ghedira, Kais; Ghouila, Amel; Hazelhurst, Scott; Isewon, Itunuoluwa; Jung, Segun; Kassim, Samar Kamal; Kayondo, Jonathan K; Mbiyavanga, Mamana; Meintjes, Ayton; Mohammed, Somia; Mosaku, Abayomi; Moussa, Ahmed; Muhammd, Mustafa; Mungloo-Dilmohamud, Zahra; Nashiru, Oyekanmi; Odia, Trust; Okafor, Adaobi; Oladipo, Olaleye; Osamor, Victor; Oyelade, Jellili; Sadki, Khalid; Salifu, Samson Pandam; Soyemi, Jumoke; Panji, Sumir; Radouani, Fouzia; Souiai, Oussama; Tastan Bishop, Özlem

    2017-06-01

    Although pockets of bioinformatics excellence have developed in Africa, generally, large-scale genomic data analysis has been limited by the availability of expertise and infrastructure. H3ABioNet, a pan-African bioinformatics network, was established to build capacity specifically to enable H3Africa (Human Heredity and Health in Africa) researchers to analyze their data in Africa. Since the inception of the H3Africa initiative, H3ABioNet's role has evolved in response to changing needs from the consortium and the African bioinformatics community. H3ABioNet set out to develop core bioinformatics infrastructure and capacity for genomics research in various aspects of data collection, transfer, storage, and analysis. Various resources have been developed to address genomic data management and analysis needs of H3Africa researchers and other scientific communities on the continent. NetMap was developed and used to build an accurate picture of network performance within Africa and between Africa and the rest of the world, and Globus Online has been rolled out to facilitate data transfer. A participant recruitment database was developed to monitor participant enrollment, and data is being harmonized through the use of ontologies and controlled vocabularies. The standardized metadata will be integrated to provide a search facility for H3Africa data and biospecimens. Because H3Africa projects are generating large-scale genomic data, facilities for analysis and interpretation are critical. H3ABioNet is implementing several data analysis platforms that provide a large range of bioinformatics tools or workflows, such as Galaxy, the Job Management System, and eBiokits. A set of reproducible, portable, and cloud-scalable pipelines to support the multiple H3Africa data types are also being developed and dockerized to enable execution on multiple computing infrastructures. In addition, new tools have been developed for analysis of the uniquely divergent African data and for

  12. Emerging Tools for Synthetic Genome Design

    PubMed Central

    Lee, Bo-Rahm; Cho, Suhyung; Song, Yoseb; Kim, Sun Chang; Cho, Byung-Kwan

    2013-01-01

    Synthetic biology is an emerging discipline for designing and synthesizing predictable, measurable, controllable, and transformable biological systems. These newly designed biological systems have great potential for the development of cheaper drugs, green fuels, biodegradable plastics, and targeted cancer therapies over the coming years. Fortunately, our ability to quickly and accurately engineer biological systems that behave predictably has been dramatically expanded by significant advances in DNA-sequencing, DNA-synthesis, and DNA-editing technologies. Here, we review emerging technologies and methodologies in the field of building designed biological systems, and we discuss their future perspectives. PMID:23708771

  13. Employability Skills Assessment Tool Development

    ERIC Educational Resources Information Center

    Rasul, Mohamad Sattar; Rauf, Rose Amnah Abd; Mansor, Azlin Norhaini; Puvanasvaran, A. P.

    2012-01-01

    Research nationally and internationally found that technical graduates are lacking in employability skills. As employability skills are crucial in outcome-based education, the main goal of this research is to develop an Employability Skill Assessment Tool to help students and lecturers produce competent graduates in employability skills needed by…

  14. BEDTools: the Swiss-army tool for genome feature analysis

    PubMed Central

    Quinlan, Aaron R.

    2014-01-01

    Technological advances have enabled the use of DNA sequencing as a flexible tool to characterize genetic variation and to measure the activity of diverse cellular phenomena such as gene isoform expression and transcription factor binding. Extracting biological insight from the experiments enabled by these advances demands the analysis of large, multi-dimensional datasets. This unit describes the use of the BEDTools toolkit for the exploration of high-throughput genomics datasets. I present several protocols for common genomic analyses and demonstrate how simple BEDTools operations may be combined to create bespoke pipelines addressing complex questions. PMID:25199790

  15. Virome genomics: a tool for defining the human virome.

    PubMed

    Wylie, Kristine M; Weinstock, George M; Storch, Gregory A

    2013-08-01

    High throughput, deep sequencing assays are powerful tools for gaining insights into virus-host interactions. Sequencing assays can discover novel viruses and describe the genomes of novel and known viruses. Genomic information can predict viral proteins that can be characterized, describe important genes in the host that control infections, and evaluate gene expression of viruses and hosts during infection. Sequencing can also describe variation and evolution of viruses during replication and transmission. This review recounts some of the major advances in the studies of virus-host interactions from the last two years, and discusses the uses of sequencing technologies relating to these studies.

  16. Databases and Web Tools for Cancer Genomics Study

    PubMed Central

    Yang, Yadong; Dong, Xunong; Xie, Bingbing; Ding, Nan; Chen, Juan; Li, Yongjun; Zhang, Qian; Qu, Hongzhu; Fang, Xiangdong

    2015-01-01

    Publicly-accessible resources have promoted the advance of scientific discovery. The era of genomics and big data has brought the need for collaboration and data sharing in order to make effective use of this new knowledge. Here, we describe the web resources for cancer genomics research and rate them on the basis of the diversity of cancer types, sample size, omics data comprehensiveness, and user experience. The resources reviewed include data repository and analysis tools; and we hope such introduction will promote the awareness and facilitate the usage of these resources in the cancer research community. PMID:25707591

  17. Databases and web tools for cancer genomics study.

    PubMed

    Yang, Yadong; Dong, Xunong; Xie, Bingbing; Ding, Nan; Chen, Juan; Li, Yongjun; Zhang, Qian; Qu, Hongzhu; Fang, Xiangdong

    2015-02-01

    Publicly-accessible resources have promoted the advance of scientific discovery. The era of genomics and big data has brought the need for collaboration and data sharing in order to make effective use of this new knowledge. Here, we describe the web resources for cancer genomics research and rate them on the basis of the diversity of cancer types, sample size, omics data comprehensiveness, and user experience. The resources reviewed include data repository and analysis tools; and we hope such introduction will promote the awareness and facilitate the usage of these resources in the cancer research community. Copyright © 2015 The Authors. Production and hosting by Elsevier Ltd.. All rights reserved.

  18. Genomic Tools in Pearl Millet Breeding for Drought Tolerance: Status and Prospects.

    PubMed

    Serba, Desalegn D; Yadav, Rattan S

    2016-01-01

    Pearl millet [Penisetum glaucum (L) R. Br.] is a hardy cereal crop grown in the arid and semiarid tropics where other cereals are likely to fail to produce economic yields due to drought and heat stresses. Adaptive evolution, a form of natural selection shaped the crop to grow and yield satisfactorily with limited moisture supply or under periodic water deficits in the soil. Drought tolerance is a complex polygenic trait that various morphological and physiological responses are controlled by 100s of genes and significantly influenced by the environment. The development of genomic tools will have enormous potential to improve the efficiency and precision of conventional breeding. The apparent independent domestication events, highly outcrossing nature and traditional cultivation in stressful environments maintained tremendous amount of polymorphism in pearl millet. This high polymorphism of the crop has been revealed by genome mapping that in turn stimulated the mapping and tagging of genomic regions controlling important traits such as drought tolerance. Mapping of a major QTL for terminal drought tolerance in independent populations envisaged the prospect for the development of molecular breeding in pearl millet. To accelerate genetic gains for drought tolerance targeted novel approaches such as establishment of marker-trait associations, genomic selection tools, genome sequence and genotyping-by-sequencing are still limited. Development and application of high throughput genomic tools need to be intensified to improve the breeding efficiency of pearl millet to minimize the impact of climate change on its production.

  19. Genomic Tools in Pearl Millet Breeding for Drought Tolerance: Status and Prospects

    PubMed Central

    Serba, Desalegn D.; Yadav, Rattan S.

    2016-01-01

    Pearl millet [Penisetum glaucum (L) R. Br.] is a hardy cereal crop grown in the arid and semiarid tropics where other cereals are likely to fail to produce economic yields due to drought and heat stresses. Adaptive evolution, a form of natural selection shaped the crop to grow and yield satisfactorily with limited moisture supply or under periodic water deficits in the soil. Drought tolerance is a complex polygenic trait that various morphological and physiological responses are controlled by 100s of genes and significantly influenced by the environment. The development of genomic tools will have enormous potential to improve the efficiency and precision of conventional breeding. The apparent independent domestication events, highly outcrossing nature and traditional cultivation in stressful environments maintained tremendous amount of polymorphism in pearl millet. This high polymorphism of the crop has been revealed by genome mapping that in turn stimulated the mapping and tagging of genomic regions controlling important traits such as drought tolerance. Mapping of a major QTL for terminal drought tolerance in independent populations envisaged the prospect for the development of molecular breeding in pearl millet. To accelerate genetic gains for drought tolerance targeted novel approaches such as establishment of marker-trait associations, genomic selection tools, genome sequence and genotyping-by-sequencing are still limited. Development and application of high throughput genomic tools need to be intensified to improve the breeding efficiency of pearl millet to minimize the impact of climate change on its production. PMID:27920783

  20. Biological Database of Images and Genomes: tools for community annotations linking image and genomic information

    PubMed Central

    Oberlin, Andrew T; Jurkovic, Dominika A; Balish, Mitchell F; Friedberg, Iddo

    2013-01-01

    Genomic data and biomedical imaging data are undergoing exponential growth. However, our understanding of the phenotype–genotype connection linking the two types of data is lagging behind. While there are many types of software that enable the manipulation and analysis of image data and genomic data as separate entities, there is no framework established for linking the two. We present a generic set of software tools, BioDIG, that allows linking of image data to genomic data. BioDIG tools can be applied to a wide range of research problems that require linking images to genomes. BioDIG features the following: rapid construction of web-based workbenches, community-based annotation, user management and web services. By using BioDIG to create websites, researchers and curators can rapidly annotate a large number of images with genomic information. Here we present the BioDIG software tools that include an image module, a genome module and a user management module. We also introduce a BioDIG-based website, MyDIG, which is being used to annotate images of mycoplasmas. Database URL: BioDIG website: http://biodig.org BioDIG source code repository: http://github.com/FriedbergLab/BioDIG The MyDIG database: http://mydig.biodig.org/ PMID:23550062

  1. DivStat: A User-Friendly Tool for Single Nucleotide Polymorphism Analysis of Genomic Diversity

    PubMed Central

    Soares, Inês; Moleirinho, Ana; Oliveira, Gonçalo N. P.; Amorim, António

    2015-01-01

    Recent developments have led to an enormous increase of publicly available large genomic data, including complete genomes. The 1000 Genomes Project was a major contributor, releasing the results of sequencing a large number of individual genomes, and allowing for a myriad of large scale studies on human genetic variation. However, the tools currently available are insufficient when the goal concerns some analyses of data sets encompassing more than hundreds of base pairs and when considering haplotype sequences of single nucleotide polymorphisms (SNPs). Here, we present a new and potent tool to deal with large data sets allowing the computation of a variety of summary statistics of population genetic data, increasing the speed of data analysis. PMID:25756185

  2. Bioinformatics tools and databases for whole genome sequence analysis of Mycobacterium tuberculosis.

    PubMed

    Faksri, Kiatichai; Tan, Jun Hao; Chaiprasert, Angkana; Teo, Yik-Ying; Ong, Rick Twee-Hee

    2016-11-01

    Tuberculosis (TB) is an infectious disease of global public health importance caused by Mycobacterium tuberculosis complex (MTC) in which M. tuberculosis (Mtb) is the major causative agent. Recent advancements in genomic technologies such as next generation sequencing have enabled high throughput cost-effective generation of whole genome sequence information from Mtb clinical isolates, providing new insights into the evolution, genomic diversity and transmission of the Mtb bacteria, including molecular mechanisms of antibiotic resistance. The large volume of sequencing data generated however necessitated effective and efficient management, storage, analysis and visualization of the data and results through development of novel and customized bioinformatics software tools and databases. In this review, we aim to provide a comprehensive survey of the current freely available bioinformatics software tools and publicly accessible databases for genomic analysis of Mtb for identifying disease transmission in molecular epidemiology and in rapid determination of the antibiotic profiles of clinical isolates for prompt and optimal patient treatment.

  3. Use of whole-genus genome sequence data to develop a multilocus sequence typing tool that accurately identifies Yersinia isolates to the species and subspecies levels.

    PubMed

    Hall, Miquette; Chattaway, Marie A; Reuter, Sandra; Savin, Cyril; Strauch, Eckhard; Carniel, Elisabeth; Connor, Thomas; Van Damme, Inge; Rajakaruna, Lakshani; Rajendram, Dunstan; Jenkins, Claire; Thomson, Nicholas R; McNally, Alan

    2015-01-01

    The genus Yersinia is a large and diverse bacterial genus consisting of human-pathogenic species, a fish-pathogenic species, and a large number of environmental species. Recently, the phylogenetic and population structure of the entire genus was elucidated through the genome sequence data of 241 strains encompassing every known species in the genus. Here we report the mining of this enormous data set to create a multilocus sequence typing-based scheme that can identify Yersinia strains to the species level to a level of resolution equal to that for whole-genome sequencing. Our assay is designed to be able to accurately subtype the important human-pathogenic species Yersinia enterocolitica to whole-genome resolution levels. We also report the validation of the scheme on 386 strains from reference laboratory collections across Europe. We propose that the scheme is an important molecular typing system to allow accurate and reproducible identification of Yersinia isolates to the species level, a process often inconsistent in nonspecialist laboratories. Additionally, our assay is the most phylogenetically informative typing scheme available for Y. enterocolitica. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  4. Use of Whole-Genus Genome Sequence Data To Develop a Multilocus Sequence Typing Tool That Accurately Identifies Yersinia Isolates to the Species and Subspecies Levels

    PubMed Central

    Hall, Miquette; Chattaway, Marie A.; Reuter, Sandra; Savin, Cyril; Strauch, Eckhard; Carniel, Elisabeth; Connor, Thomas; Van Damme, Inge; Rajakaruna, Lakshani; Rajendram, Dunstan; Jenkins, Claire; Thomson, Nicholas R.

    2014-01-01

    The genus Yersinia is a large and diverse bacterial genus consisting of human-pathogenic species, a fish-pathogenic species, and a large number of environmental species. Recently, the phylogenetic and population structure of the entire genus was elucidated through the genome sequence data of 241 strains encompassing every known species in the genus. Here we report the mining of this enormous data set to create a multilocus sequence typing-based scheme that can identify Yersinia strains to the species level to a level of resolution equal to that for whole-genome sequencing. Our assay is designed to be able to accurately subtype the important human-pathogenic species Yersinia enterocolitica to whole-genome resolution levels. We also report the validation of the scheme on 386 strains from reference laboratory collections across Europe. We propose that the scheme is an important molecular typing system to allow accurate and reproducible identification of Yersinia isolates to the species level, a process often inconsistent in nonspecialist laboratories. Additionally, our assay is the most phylogenetically informative typing scheme available for Y. enterocolitica. PMID:25339391

  5. Tetrahymena as a Unicellular Model Eukaryote: Genetic and Genomic Tools.

    PubMed

    Ruehle, Marisa D; Orias, Eduardo; Pearson, Chad G

    2016-06-01

    Tetrahymena thermophila is a ciliate model organism whose study has led to important discoveries and insights into both conserved and divergent biological processes. In this review, we describe the tools for the use of Tetrahymena as a model eukaryote, including an overview of its life cycle, orientation to its evolutionary roots, and methodological approaches to forward and reverse genetics. Recent genomic tools have expanded Tetrahymena's utility as a genetic model system. With the unique advantages that Tetrahymena provide, we argue that it will continue to be a model organism of choice. Copyright © 2016 by the Genetics Society of America.

  6. Tetrahymena as a Unicellular Model Eukaryote: Genetic and Genomic Tools

    PubMed Central

    Ruehle, Marisa D.; Orias, Eduardo; Pearson, Chad G.

    2016-01-01

    Tetrahymena thermophila is a ciliate model organism whose study has led to important discoveries and insights into both conserved and divergent biological processes. In this review, we describe the tools for the use of Tetrahymena as a model eukaryote, including an overview of its life cycle, orientation to its evolutionary roots, and methodological approaches to forward and reverse genetics. Recent genomic tools have expanded Tetrahymena’s utility as a genetic model system. With the unique advantages that Tetrahymena provide, we argue that it will continue to be a model organism of choice. PMID:27270699

  7. Cascade: an RNA-seq visualization tool for cancer genomics.

    PubMed

    Shifman, Aaron R; Johnson, Radia M; Wilhelm, Brian T

    2016-01-25

    Cancer genomics projects are producing ever-increasing amounts of rich and diverse data from patient samples. The ability to easily visualize this data in an integrated an intuitive way is currently limited by the current software available. As a result, users typically must use several different tools to view the different data types for their cohort, making it difficult to have a simple unified view of their data. Here we present Cascade, a novel web based tool for the intuitive 3D visualization of RNA-seq data from cancer genomics experiments. The Cascade viewer allows multiple data types (e.g. mutation, gene expression, alternative splicing frequency) to be simultaneously displayed, allowing a simplified view of the data in a way that is tuneable based on user specified parameters. The main webpage of Cascade provides a primary view of user data which is overlaid onto known biological pathways that are either predefined or added by users. A space-saving menu for data selection and parameter adjustment allows users to access an underlying MySQL database and customize the features presented in the main view. There is currently a pressing need for new software tools to allow researchers to easily explore large cancer genomics datasets and generate hypotheses. Cascade represents a simple yet intuitive interface for data visualization that is both scalable and customizable.

  8. Correcting Inconsistencies and Errors in Bacterial Genome Metadata Using an Automated Curation Tool in Excel (AutoCurE).

    PubMed

    Schmedes, Sarah E; King, Jonathan L; Budowle, Bruce

    2015-01-01

    Whole-genome data are invaluable for large-scale comparative genomic studies. Current sequencing technologies have made it feasible to sequence entire bacterial genomes with relative ease and time with a substantially reduced cost per nucleotide, hence cost per genome. More than 3,000 bacterial genomes have been sequenced and are available at the finished status. Publically available genomes can be readily downloaded; however, there are challenges to verify the specific supporting data contained within the download and to identify errors and inconsistencies that may be present within the organizational data content and metadata. AutoCurE, an automated tool for bacterial genome database curation in Excel, was developed to facilitate local database curation of supporting data that accompany downloaded genomes from the National Center for Biotechnology Information. AutoCurE provides an automated approach to curate local genomic databases by flagging inconsistencies or errors by comparing the downloaded supporting data to the genome reports to verify genome name, RefSeq accession numbers, the presence of archaea, BioProject/UIDs, and sequence file descriptions. Flags are generated for nine metadata fields if there are inconsistencies between the downloaded genomes and genomes reports and if erroneous or missing data are evident. AutoCurE is an easy-to-use tool for local database curation for large-scale genome data prior to downstream analyses.

  9. A two-plasmid inducible CRISPR/Cas9 genome editing tool for Clostridium acetobutylicum.

    PubMed

    Wasels, François; Jean-Marie, Jennifer; Collas, Florent; López-Contreras, Ana M; Lopes Ferreira, Nicolas

    2017-09-01

    CRISPR/Cas-based genetic engineering has revolutionised molecular biology in both eukaryotes and prokaryotes. Several tools dedicated to the genomic transformation of the Clostridium genus of Gram-positive bacteria have been described in the literature; however, the integration of large DNA fragments still remains relatively limited. In this study, a CRISPR/Cas9 genome editing tool using a two-plasmid strategy was developed for the solventogenic strain Clostridium acetobutylicum ATCC 824. Codon-optimised cas9 from Streptococcus pyogenes was placed under the control of an anhydrotetracycline-inducible promoter on one plasmid, while the gRNA expression cassettes and editing templates were located on a second plasmid. Through the sequential introduction of these vectors into the cell, we achieved highly accurate genome modifications, including nucleotide substitution, gene deletion and cassette insertion up to 3.6kb. To demonstrate its potential, this genome editing tool was used to generate a marker-free mutant of ATCC 824 that produced an isopropanol-butanol-ethanol mixture. Whole-genome sequencing confirmed that no off-target modifications were present in the mutants. Such a tool is a prerequisite for efficient metabolic engineering in this solventogenic strain and provides an alternative editing strategy that might be applicable to other Clostridium strains. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.

  10. The CRISPR/Cas Genome-Editing Tool: Application in Improvement of Crops

    PubMed Central

    Khatodia, Surender; Bhatotia, Kirti; Passricha, Nishat; Khurana, S. M. P.; Tuteja, Narendra

    2016-01-01

    The Clustered Regularly Interspaced Short Palindromic Repeats associated Cas9/sgRNA system is a novel targeted genome-editing technique derived from bacterial immune system. It is an inexpensive, easy, most user friendly and rapidly adopted genome editing tool transforming to revolutionary paradigm. This technique enables precise genomic modifications in many different organisms and tissues. Cas9 protein is an RNA guided endonuclease utilized for creating targeted double-stranded breaks with only a short RNA sequence to confer recognition of the target in animals and plants. Development of genetically edited (GE) crops similar to those developed by conventional or mutation breeding using this potential technique makes it a promising and extremely versatile tool for providing sustainable productive agriculture for better feeding of rapidly growing population in a changing climate. The emerging areas of research for the genome editing in plants include interrogating gene function, rewiring the regulatory signaling networks and sgRNA library for high-throughput loss-of-function screening. In this review, we have described the broad applicability of the Cas9 nuclease mediated targeted plant genome editing for development of designer crops. The regulatory uncertainty and social acceptance of plant breeding by Cas9 genome editing have also been described. With this powerful and innovative technique the designer GE non-GM plants could further advance climate resilient and sustainable agriculture in the future and maximizing yield by combating abiotic and biotic stresses. PMID:27148329

  11. BioViews: Java-Based Tools for Genomic Data Visualization

    PubMed Central

    Helt, Gregg A.; Lewis, Suzanna; Loraine, Ann E.; Rubin, Gerald M.

    1998-01-01

    Visualization tools for bioinformatics ideally should provide universal access to the most current data in an interactive and intuitive graphical user interface. Since the introduction of Java, a language designed for distributed programming over the Web, the technology now exists to build a genomic data visualization tool that meets these requirements. Using Java we have developed a prototype genome browser applet (BioViews) that incorporates a three-level graphical view of genomic data: a physical map, an annotated sequence map, and a DNA sequence display. Annotated biological features are displayed on the physical and sequence-based maps, and the different views are interconnected. The applet is linked to several databases and can retrieve features and display hyperlinked textual data on selected features. In addition to browsing genomic data, different types of analyses can be performed interactively and the results of these analyses visualized alongside prior annotations. Our genome browser is built on top of extensible, reusable graphic components specifically designed for bioinformatics. Other groups can (and do) reuse this work in various ways. Genome centers can reuse large parts of the genome browser with minor modifications, bioinformatics groups working on sequence analysis can reuse components to build front ends for analysis programs, and biology laboratories can reuse components to publish results as dynamic Web documents. PMID:9521932

  12. FEATnotator: A tool for integrated annotation of sequence features and variation, facilitating interpretation in genomics experiments.

    PubMed

    Podicheti, Ram; Mockaitis, Keithanne

    2015-06-01

    As approaches are sought for more efficient and democratized uses of non-model and expanded model genomics references, ease of integration of genomic feature datasets is especially desirable in multidisciplinary research communities. Valuable conclusions are often missed or slowed when researchers refer experimental results to a single reference sequence that lacks integrated pan-genomic and multi-experiment data in accessible formats. Association of genomic positional information, such as results from an expansive variety of next-generation sequencing experiments, with annotated reference features such as genes or predicted protein binding sites, provides the context essential for conclusions and ongoing research. When the experimental system includes polymorphic genomic inputs, rapid calculation of gene structural and protein translational effects of sequence variation from the reference can be invaluable. Here we present FEATnotator, a lightweight, fast and easy to use open source software program that integrates and reports overlap and proximity in genomic information from any user-defined datasets including those from next generation sequencing applications. We illustrate use of the tool by summarizing whole genome sequence variation of a widely used natural isolate of Arabidopsis thaliana in the context of gene models of the reference accession. Previous discovery of a protein coding deletion influencing root development is replicated rapidly. Appropriate even in investigations of a single gene or genic regions such as QTL, comprehensive reports provided by FEATnotator better prepare researchers for interpretation of their experimental results. The tool is available for download at http://featnotator.sourceforge.net.

  13. DFAST and DAGA: web-based integrated genome annotation tools and resources

    PubMed Central

    TANIZAWA, Yasuhiro; FUJISAWA, Takatomo; KAMINUMA, Eli; NAKAMURA, Yasukazu; ARITA, Masanori

    2016-01-01

    Quality assurance and correct taxonomic affiliation of data submitted to public sequence databases have been an everlasting problem. The DDBJ Fast Annotation and Submission Tool (DFAST) is a newly developed genome annotation pipeline with quality and taxonomy assessment tools. To enable annotation of ready-to-submit quality, we also constructed curated reference protein databases tailored for lactic acid bacteria. DFAST was developed so that all the procedures required for DDBJ submission could be done seamlessly online. The online workspace would be especially useful for users not familiar with bioinformatics skills. In addition, we have developed a genome repository, DFAST Archive of Genome Annotation (DAGA), which currently includes 1,421 genomes covering 179 species and 18 subspecies of two genera, Lactobacillus and Pediococcus, obtained from both DDBJ/ENA/GenBank and Sequence Read Archive (SRA). All the genomes deposited in DAGA were annotated consistently and assessed using DFAST. To assess the taxonomic position based on genomic sequence information, we used the average nucleotide identity (ANI), which showed high discriminative power to determine whether two given genomes belong to the same species. We corrected mislabeled or misidentified genomes in the public database and deposited the curated information in DAGA. The repository will improve the accessibility and reusability of genome resources for lactic acid bacteria. By exploiting the data deposited in DAGA, we found intraspecific subgroups in Lactobacillus gasseri and Lactobacillus jensenii, whose variation between subgroups is larger than the well-accepted ANI threshold of 95% to differentiate species. DFAST and DAGA are freely accessible at https://dfast.nig.ac.jp. PMID:27867804

  14. DFAST and DAGA: web-based integrated genome annotation tools and resources.

    PubMed

    Tanizawa, Yasuhiro; Fujisawa, Takatomo; Kaminuma, Eli; Nakamura, Yasukazu; Arita, Masanori

    2016-01-01

    Quality assurance and correct taxonomic affiliation of data submitted to public sequence databases have been an everlasting problem. The DDBJ Fast Annotation and Submission Tool (DFAST) is a newly developed genome annotation pipeline with quality and taxonomy assessment tools. To enable annotation of ready-to-submit quality, we also constructed curated reference protein databases tailored for lactic acid bacteria. DFAST was developed so that all the procedures required for DDBJ submission could be done seamlessly online. The online workspace would be especially useful for users not familiar with bioinformatics skills. In addition, we have developed a genome repository, DFAST Archive of Genome Annotation (DAGA), which currently includes 1,421 genomes covering 179 species and 18 subspecies of two genera, Lactobacillus and Pediococcus, obtained from both DDBJ/ENA/GenBank and Sequence Read Archive (SRA). All the genomes deposited in DAGA were annotated consistently and assessed using DFAST. To assess the taxonomic position based on genomic sequence information, we used the average nucleotide identity (ANI), which showed high discriminative power to determine whether two given genomes belong to the same species. We corrected mislabeled or misidentified genomes in the public database and deposited the curated information in DAGA. The repository will improve the accessibility and reusability of genome resources for lactic acid bacteria. By exploiting the data deposited in DAGA, we found intraspecific subgroups in Lactobacillus gasseri and Lactobacillus jensenii, whose variation between subgroups is larger than the well-accepted ANI threshold of 95% to differentiate species. DFAST and DAGA are freely accessible at https://dfast.nig.ac.jp.

  15. Accessing the SEED genome databases via Web services API: tools for programmers.

    PubMed

    Disz, Terry; Akhter, Sajia; Cuevas, Daniel; Olson, Robert; Overbeek, Ross; Vonstein, Veronika; Stevens, Rick; Edwards, Robert A

    2010-06-14

    The SEED integrates many publicly available genome sequences into a single resource. The database contains accurate and up-to-date annotations based on the subsystems concept that leverages clustering between genomes and other clues to accurately and efficiently annotate microbial genomes. The backend is used as the foundation for many genome annotation tools, such as the Rapid Annotation using Subsystems Technology (RAST) server for whole genome annotation, the metagenomics RAST server for random community genome annotations, and the annotation clearinghouse for exchanging annotations from different resources. In addition to a web user interface, the SEED also provides Web services based API for programmatic access to the data in the SEED, allowing the development of third-party tools and mash-ups. The currently exposed Web services encompass over forty different methods for accessing data related to microbial genome annotations. The Web services provide comprehensive access to the database back end, allowing any programmer access to the most consistent and accurate genome annotations available. The Web services are deployed using a platform independent service-oriented approach that allows the user to choose the most suitable programming platform for their application. Example code demonstrate that Web services can be used to access the SEED using common bioinformatics programming languages such as Perl, Python, and Java. We present a novel approach to access the SEED database. Using Web services, a robust API for access to genomics data is provided, without requiring large volume downloads all at once. The API ensures timely access to the most current datasets available, including the new genomes as soon as they come online.

  16. Accessing the SEED Genome Databases via Web Services API: Tools for Programmers

    PubMed Central

    2010-01-01

    Background The SEED integrates many publicly available genome sequences into a single resource. The database contains accurate and up-to-date annotations based on the subsystems concept that leverages clustering between genomes and other clues to accurately and efficiently annotate microbial genomes. The backend is used as the foundation for many genome annotation tools, such as the Rapid Annotation using Subsystems Technology (RAST) server for whole genome annotation, the metagenomics RAST server for random community genome annotations, and the annotation clearinghouse for exchanging annotations from different resources. In addition to a web user interface, the SEED also provides Web services based API for programmatic access to the data in the SEED, allowing the development of third-party tools and mash-ups. Results The currently exposed Web services encompass over forty different methods for accessing data related to microbial genome annotations. The Web services provide comprehensive access to the database back end, allowing any programmer access to the most consistent and accurate genome annotations available. The Web services are deployed using a platform independent service-oriented approach that allows the user to choose the most suitable programming platform for their application. Example code demonstrate that Web services can be used to access the SEED using common bioinformatics programming languages such as Perl, Python, and Java. Conclusions We present a novel approach to access the SEED database. Using Web services, a robust API for access to genomics data is provided, without requiring large volume downloads all at once. The API ensures timely access to the most current datasets available, including the new genomes as soon as they come online. PMID:20546611

  17. The SickKids Genome Clinic: developing and evaluating a pediatric model for individualized genomic medicine.

    PubMed

    Bowdin, S C; Hayeems, R Z; Monfared, N; Cohn, R D; Meyn, M S

    2016-01-01

    Our increasing knowledge of how genomic variants affect human health and the falling costs of whole-genome sequencing are driving the development of individualized genomic medicine. This new clinical paradigm uses knowledge of an individual's genomic variants to anticipate, diagnose and manage disease. While individualized genetic medicine offers the promise of transformative change in health care, it forces us to reconsider existing ethical, scientific and clinical paradigms. The potential benefits of pre-symptomatic identification of at-risk individuals, improved diagnostics, individualized therapy, accurate prognosis and avoidance of adverse drug reactions coexist with the potential risks of uninterpretable results, psychological harm, outmoded counseling models and increased health care costs. Here we review the challenges, opportunities and limits of integrating genomic analysis into pediatric clinical practice and describe a model for implementing individualized genomic medicine. Our multidisciplinary team of bioinformaticians, health economists, health services and policy researchers, ethicists, geneticists, genetic counselors and clinicians has designed a 'Genome Clinic' research project that addresses multiple challenges in pediatric genomic medicine--ranging from development of bioinformatics tools for the clinical assessment of genomic variants and the discovery of disease genes to health policy inquiries, assessment of clinical care models, patient preference and the ethics of consent. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  18. Intervene: a tool for intersection and visualization of multiple gene or genomic region sets.

    PubMed

    Khan, Aziz; Mathelier, Anthony

    2017-05-31

    A common task for scientists relies on comparing lists of genes or genomic regions derived from high-throughput sequencing experiments. While several tools exist to intersect and visualize sets of genes, similar tools dedicated to the visualization of genomic region sets are currently limited. To address this gap, we have developed the Intervene tool, which provides an easy and automated interface for the effective intersection and visualization of genomic region or list sets, thus facilitating their analysis and interpretation. Intervene contains three modules: venn to generate Venn diagrams of up to six sets, upset to generate UpSet plots of multiple sets, and pairwise to compute and visualize intersections of multiple sets as clustered heat maps. Intervene, and its interactive web ShinyApp companion, generate publication-quality figures for the interpretation of genomic region and list sets. Intervene and its web application companion provide an easy command line and an interactive web interface to compute intersections of multiple genomic and list sets. They have the capacity to plot intersections using easy-to-interpret visual approaches. Intervene is developed and designed to meet the needs of both computer scientists and biologists. The source code is freely available at https://bitbucket.org/CBGR/intervene , with the web application available at https://asntech.shinyapps.io/intervene .

  19. Molecular Tools for Exploring Polyploid Genomes in Plants

    PubMed Central

    Aversano, Riccardo; Ercolano, Maria Raffaella; Caruso, Immacolata; Fasano, Carlo; Rosellini, Daniele; Carputo, Domenico

    2012-01-01

    Polyploidy is a very common phenomenon in the plant kingdom, where even diploid species are often described as paleopolyploids. The polyploid condition may bring about several advantages compared to the diploid state. Polyploids often show phenotypes that are not present in their diploid progenitors or exceed the range of the contributing species. Some of these traits may play a role in heterosis or could favor adaptation to new ecological niches. Advances in genomics and sequencing technology may create unprecedented opportunities for discovering and monitoring the molecular effects of polyploidization. Through this review, we provide an overview of technologies and strategies that may allow an in-depth analysis of polyploid genomes. After introducing some basic aspects on the origin and genetics of polyploids, we highlight the main tools available for genome and gene expression analysis and summarize major findings. In the last part of this review, the implications of next generation sequencing are briefly discussed. The accumulation of knowledge on polyploid formation, maintenance, and divergence at whole-genome and subgenome levels will not only help plant biologists to understand how plants have evolved and diversified, but also assist plant breeders in designing new strategies for crop improvement. PMID:22949863

  20. [Genome Editing Tools and their Application in Experimental Ophthalmology].

    PubMed

    Yanik, M; Wende, W; Stieger, K

    2017-01-23

    New genome editing tools in molecular biology are revolutionising precise genome surgery and have greatly influenced experimental ophthalmology too. Aside from the commonly used nuclease-based platforms, such as the zinc-finger nucleases (ZFN) and transcription activator-like effector nucleases (TALEN), CRISPR/Cas systems, clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated (Cas) genes, perform very efficiently in site-specific DNA cleavage within living cells. DNA double strand breaks (DSB) are repaired through two different conserved repair pathways: NHEJ (non-homologous end joining) and HDR (homology directed repair). By using the correct DNA templates, these repair pathways can be used to knock out defective genes or to repair mutations. Genome editing technology lays the ground for new strategies in basic science, biotechnology, and biomedical science, as well as clinical studies with genome editing. Therapeutic gene editing strategies are now concentrating on diseases in the retina, due to the comparatively easy accessibility of the eye and with local application in vivo.

  1. Revolutions in Neuroscience: Tool Development

    PubMed Central

    Bickle, John

    2016-01-01

    Thomas Kuhn’s famous model of the components and dynamics of scientific revolutions is still dominant to this day across science, philosophy, and history. The guiding philosophical theme of this article is that, concerning actual revolutions in neuroscience over the past 60 years, Kuhn’s account is wrong. There have been revolutions, and new ones are brewing, but they do not turn on competing paradigms, anomalies, or the like. Instead, they turn exclusively on the development of new experimental tools. I adopt a metascientific approach and examine in detail the development of two recent neuroscience revolutions: the impact of engineered genetically mutated mammals in the search for causal mechanisms of “higher” cognitive functions; and the more recent impact of optogenetics and designer receptors exclusively activated by designer drugs (DREADDs). The two key metascientific concepts, I derive from these case studies are a revolutionary new tool’s motivating problem, and its initial and second-phase hook experiments. These concepts hardly exhaust a detailed metascience of tool development experiments in neuroscience, but they get that project off to a useful start and distinguish the subsequent account of neuroscience revolutions clearly from Kuhn’s famous model. I close with a brief remark about the general importance of molecular biology for a current philosophical understanding of science, as comparable to the place physics occupied when Kuhn formulated his famous theory of scientific revolutions. PMID:27013992

  2. GAPIT Version 2: An Enhanced Integrated Tool for Genomic Association and Prediction.

    PubMed

    Tang, You; Liu, Xiaolei; Wang, Jiabo; Li, Meng; Wang, Qishan; Tian, Feng; Su, Zhongbin; Pan, Yuchun; Liu, Di; Lipka, Alexander E; Buckler, Edward S; Zhang, Zhiwu

    2016-07-01

    Most human diseases and agriculturally important traits are complex. Dissecting their genetic architecture requires continued development of innovative and powerful statistical methods. Corresponding advances in computing tools are critical to efficiently use these statistical innovations and to enhance and accelerate biomedical and agricultural research and applications. The genome association and prediction integrated tool (GAPIT) was first released in 2012 and became widely used for genome-wide association studies (GWAS) and genomic prediction. The GAPIT implemented computationally efficient statistical methods, including the compressed mixed linear model (CMLM) and genomic prediction by using genomic best linear unbiased prediction (gBLUP). New state-of-the-art statistical methods have now been implemented in a new, enhanced version of GAPIT. These methods include factored spectrally transformed linear mixed models (FaST-LMM), enriched CMLM (ECMLM), FaST-LMM-Select, and settlement of mixed linear models under progressively exclusive relationship (SUPER). The genomic prediction methods implemented in this new release of the GAPIT include gBLUP based on CMLM, ECMLM, and SUPER. Additionally, the GAPIT was updated to improve its existing output display features and to add new data display and evaluation functions, including new graphing options and capabilities, phenotype simulation, power analysis, and cross-validation. These enhancements make the GAPIT a valuable resource for determining appropriate experimental designs and performing GWAS and genomic prediction. The enhanced R-based GAPIT software package uses state-of-the-art methods to conduct GWAS and genomic prediction. The GAPIT also provides new functions for developing experimental designs and creating publication-ready tabular summaries and graphs to improve the efficiency and application of genomic research.

  3. CRISPR-Cas9: Tool for Qualitative and Quantitative Plant Genome Editing

    PubMed Central

    Noman, Ali; Aqeel, Muhammad; He, Shuilin

    2016-01-01

    Recent developments in genome editing techniques have aroused substantial excitement among agricultural scientists. These techniques offer new opportunities for developing improved plant lines with addition of important traits or removal of undesirable traits. Increased adoption of genome editing has been geared by swiftly developing Clustered regularly interspaced short palindromic repeats (CRISPR). This is appearing as driving force for innovative utilization in diverse branches of plant biology. CRISPR-Cas9 mediated genome editing is being used for rapid, easy and efficient alteration of genes among diverse plant species. With approximate completion of conceptual work about CRISPR-Cas9, plant scientists are applying this genome editing tool for crop attributes enhancement. The capability of this system for performing targeted and efficient modifications in genome sequence as well as gene expression will certainly spur novel developments not only in model plants but in crop and ornamental plants as well. Additionally, due to non-involvement of foreign DNA, this technique may help alleviating regulatory issues associated with genetically modified plants. We expect that prevailing challenges in plant science like genomic region manipulation, crop specific vectors etc. will be addressed along with sustained growth of this genome editing tool. In this review, recent progress of CRISPR-Cas9 technology in plants has been summarized and discussed. We reviewed significance of CRISPR-Cas9 for specific and non-traditional aspects of plant life. It also covers strengths of this technique in comparison with other genome editing techniques, e.g., Zinc finger nucleases, Transcription activator-like effector nucleases and potential challenges in coming decades have been described. PMID:27917188

  4. CRISPR-Cas9: Tool for Qualitative and Quantitative Plant Genome Editing.

    PubMed

    Noman, Ali; Aqeel, Muhammad; He, Shuilin

    2016-01-01

    Recent developments in genome editing techniques have aroused substantial excitement among agricultural scientists. These techniques offer new opportunities for developing improved plant lines with addition of important traits or removal of undesirable traits. Increased adoption of genome editing has been geared by swiftly developing Clustered regularly interspaced short palindromic repeats (CRISPR). This is appearing as driving force for innovative utilization in diverse branches of plant biology. CRISPR-Cas9 mediated genome editing is being used for rapid, easy and efficient alteration of genes among diverse plant species. With approximate completion of conceptual work about CRISPR-Cas9, plant scientists are applying this genome editing tool for crop attributes enhancement. The capability of this system for performing targeted and efficient modifications in genome sequence as well as gene expression will certainly spur novel developments not only in model plants but in crop and ornamental plants as well. Additionally, due to non-involvement of foreign DNA, this technique may help alleviating regulatory issues associated with genetically modified plants. We expect that prevailing challenges in plant science like genomic region manipulation, crop specific vectors etc. will be addressed along with sustained growth of this genome editing tool. In this review, recent progress of CRISPR-Cas9 technology in plants has been summarized and discussed. We reviewed significance of CRISPR-Cas9 for specific and non-traditional aspects of plant life. It also covers strengths of this technique in comparison with other genome editing techniques, e.g., Zinc finger nucleases, Transcription activator-like effector nucleases and potential challenges in coming decades have been described.

  5. The Dockstore: enabling modular, community-focused sharing of Docker-based genomics tools and workflows

    PubMed Central

    O'Connor, Brian D.; Yuen, Denis; Chung, Vincent; Duncan, Andrew G.; Liu, Xiang Kun; Patricia, Janice; Paten, Benedict; Stein, Lincoln; Ferretti, Vincent

    2017-01-01

    As genomic datasets continue to grow, the feasibility of downloading data to a local organization and running analysis on a traditional compute environment is becoming increasingly problematic. Current large-scale projects, such as the ICGC PanCancer Analysis of Whole Genomes (PCAWG), the Data Platform for the U.S. Precision Medicine Initiative, and the NIH Big Data to Knowledge Center for Translational Genomics, are using cloud-based infrastructure to both host and perform analysis across large data sets. In PCAWG, over 5,800 whole human genomes were aligned and variant called across 14 cloud and HPC environments; the processed data was then made available on the cloud for further analysis and sharing. If run locally, an operation at this scale would have monopolized a typical academic data centre for many months, and would have presented major challenges for data storage and distribution. However, this scale is increasingly typical for genomics projects and necessitates a rethink of how analytical tools are packaged and moved to the data. For PCAWG, we embraced the use of highly portable Docker images for encapsulating and sharing complex alignment and variant calling workflows across highly variable environments. While successful, this endeavor revealed a limitation in Docker containers, namely the lack of a standardized way to describe and execute the tools encapsulated inside the container. As a result, we created the Dockstore ( https://dockstore.org), a project that brings together Docker images with standardized, machine-readable ways of describing and running the tools contained within. This service greatly improves the sharing and reuse of genomics tools and promotes interoperability with similar projects through emerging web service standards developed by the Global Alliance for Genomics and Health (GA4GH). PMID:28344774

  6. The Dockstore: enabling modular, community-focused sharing of Docker-based genomics tools and workflows.

    PubMed

    O'Connor, Brian D; Yuen, Denis; Chung, Vincent; Duncan, Andrew G; Liu, Xiang Kun; Patricia, Janice; Paten, Benedict; Stein, Lincoln; Ferretti, Vincent

    2017-01-01

    As genomic datasets continue to grow, the feasibility of downloading data to a local organization and running analysis on a traditional compute environment is becoming increasingly problematic. Current large-scale projects, such as the ICGC PanCancer Analysis of Whole Genomes (PCAWG), the Data Platform for the U.S. Precision Medicine Initiative, and the NIH Big Data to Knowledge Center for Translational Genomics, are using cloud-based infrastructure to both host and perform analysis across large data sets. In PCAWG, over 5,800 whole human genomes were aligned and variant called across 14 cloud and HPC environments; the processed data was then made available on the cloud for further analysis and sharing. If run locally, an operation at this scale would have monopolized a typical academic data centre for many months, and would have presented major challenges for data storage and distribution. However, this scale is increasingly typical for genomics projects and necessitates a rethink of how analytical tools are packaged and moved to the data. For PCAWG, we embraced the use of highly portable Docker images for encapsulating and sharing complex alignment and variant calling workflows across highly variable environments. While successful, this endeavor revealed a limitation in Docker containers, namely the lack of a standardized way to describe and execute the tools encapsulated inside the container. As a result, we created the Dockstore ( https://dockstore.org), a project that brings together Docker images with standardized, machine-readable ways of describing and running the tools contained within. This service greatly improves the sharing and reuse of genomics tools and promotes interoperability with similar projects through emerging web service standards developed by the Global Alliance for Genomics and Health (GA4GH).

  7. Transcription activator-like effector nucleases (TALENs): a highly efficient and versatile tool for genome editing.

    PubMed

    Sun, Ning; Zhao, Huimin

    2013-07-01

    Transcription activator-like effector (TALE) nucleases (TALENs) have recently emerged as a revolutionary genome editing tool in many different organisms and cell types. The site-specific chromosomal double-strand breaks introduced by TALENs significantly increase the efficiency of genomic modification. The modular nature of the TALE central repeat domains enables researchers to tailor DNA recognition specificity with ease and target essentially any desired DNA sequence. Here, we comprehensively review the development of TALEN technology in terms of scaffold optimization, DNA recognition, and repeat array assembly. In addition, we provide some perspectives on the future development of this technology.

  8. Single-cell analysis tools for drug discovery and development.

    PubMed

    Heath, James R; Ribas, Antoni; Mischel, Paul S

    2016-03-01

    The genetic, functional or compositional heterogeneity of healthy and diseased tissues presents major challenges in drug discovery and development. Such heterogeneity hinders the design of accurate disease models and can confound the interpretation of biomarker levels and of patient responses to specific therapies. The complex nature of virtually all tissues has motivated the development of tools for single-cell genomic, transcriptomic and multiplex proteomic analyses. Here, we review these tools and assess their advantages and limitations. Emerging applications of single cell analysis tools in drug discovery and development, particularly in the field of oncology, are discussed.

  9. A comparison of tools for the simulation of genomic next-generation sequencing data.

    PubMed

    Escalona, Merly; Rocha, Sara; Posada, David

    2016-08-01

    Computer simulation of genomic data has become increasingly popular for assessing and validating biological models or for gaining an understanding of specific data sets. Several computational tools for the simulation of next-generation sequencing (NGS) data have been developed in recent years, which could be used to compare existing and new NGS analytical pipelines. Here we review 23 of these tools, highlighting their distinct functionality, requirements and potential applications. We also provide a decision tree for the informed selection of an appropriate NGS simulation tool for the specific question at hand.

  10. A comparison of tools for the simulation of genomic next-generation sequencing data

    PubMed Central

    Escalona, Merly; Rocha, Sara; Posada, David

    2017-01-01

    Computer simulation of genomic data has become increasingly popular for assessing and validating biological models or to gain understanding about specific datasets. Multiple computational tools for the simulation of next-generation sequencing (NGS) data have been developed in recent years, which could be used to compare existing and new NGS analytical pipelines. Here we review 23 of these tools, highlighting their distinct functionality, requirements and potential applications. We also provide a decision tree for the informed selection of an appropriate NGS simulation tool for the specific question at hand. PMID:27320129

  11. Sugarcane genome sequencing by methylation filtration provides tools for genomic research in the genus Saccharum

    PubMed Central

    Grativol, Clícia; Regulski, Michael; Bertalan, Marcelo; McCombie, W. Richard; da Silva, Felipe Rodrigues; Neto, Adhemar Zerlotini; Vicentini, Renato; Farinelli, Laurent; Hemerly, Adriana Silva; Martienssen, Robert A.; Ferreira, Paulo Cavalcanti Gomes

    2015-01-01

    SUMMARY Many economically important crops have large and complex genomes, which hampers sequencing of their genome by standard methods such as WGS. Large tracts of methylated repeats occur at plant genomes interspersed by hypomethylated gene-rich regions. Gene enrichment strategies based on methylation profile offer an alternative to sequencing repetitive genomes. Here, we have applied methyl filtration (MF) with McrBC digestion to enrich for euchromatic regions of sugarcane genome. To verify the efficiency of MF and the assembly quality of sequences submitted to gene-enrichment strategy, we have compared assemblies using MF and unfiltered (UF) libraries. The MF allowed the achievement of a better assembly by filtering out 35% of the sugarcane genome and by producing 1.5 times more scaffolds and 1.7 times more assembled Mb compared to unfiltered scaffolds. The coverage of sorghum CDS by MF scaffolds was at least 36% higher than by UF scaffolds. Using MF technology, we increased by 134X the coverage of genic regions of the monoploid sugarcane genome. The MF reads assembled into scaffolds covering all genes at sugarcane BACs, 97.2% of sugarcane ESTs, 92.7% of sugarcane RNA-seq reads and 98.4% of sorghum protein sequences. Analysis of MF scaffolds encoding enzymes of the sucrose/starch pathway discovered 291 SNPs in the wild sugarcane species, S. spontaneum and S. officinarum. A large number of microRNA genes were also identified in the MF scaffolds. The information achieved by the MF dataset provides a valuable tool for genomic research in the genus Saccharum and improvement of sugarcane as a biofuel crop. PMID:24773339

  12. Sugarcane genome sequencing by methylation filtration provides tools for genomic research in the genus Saccharum.

    PubMed

    Grativol, Clícia; Regulski, Michael; Bertalan, Marcelo; McCombie, W Richard; da Silva, Felipe Rodrigues; Zerlotini Neto, Adhemar; Vicentini, Renato; Farinelli, Laurent; Hemerly, Adriana Silva; Martienssen, Robert A; Ferreira, Paulo Cavalcanti Gomes

    2014-07-01

    Many economically important crops have large and complex genomes that hamper their sequencing by standard methods such as whole genome shotgun (WGS). Large tracts of methylated repeats occur in plant genomes that are interspersed by hypomethylated gene-rich regions. Gene-enrichment strategies based on methylation profiles offer an alternative to sequencing repetitive genomes. Here, we have applied methyl filtration with McrBC endonuclease digestion to enrich for euchromatic regions in the sugarcane genome. To verify the efficiency of methylation filtration and the assembly quality of sequences submitted to gene-enrichment strategy, we have compared assemblies using methyl-filtered (MF) and unfiltered (UF) libraries. The use of methy filtration allowed a better assembly by filtering out 35% of the sugarcane genome and by producing 1.5× more scaffolds and 1.7× more assembled Mb in length compared with unfiltered dataset. The coverage of sorghum coding sequences (CDS) by MF scaffolds was at least 36% higher than by the use of UF scaffolds. Using MF technology, we increased by 134× the coverage of gene regions of the monoploid sugarcane genome. The MF reads assembled into scaffolds that covered all genes of the sugarcane bacterial artificial chromosomes (BACs), 97.2% of sugarcane expressed sequence tags (ESTs), 92.7% of sugarcane RNA-seq reads and 98.4% of sorghum protein sequences. Analysis of MF scaffolds from encoded enzymes of the sucrose/starch pathway discovered 291 single-nucleotide polymorphisms (SNPs) in the wild sugarcane species, S. spontaneum and S. officinarum. A large number of microRNA genes was also identified in the MF scaffolds. The information achieved by the MF dataset provides a valuable tool for genomic research in the genus Saccharum and for improvement of sugarcane as a biofuel crop. © 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.

  13. Primer: genomic and proteomic tools for the molecular dissection of disease.

    PubMed

    Walker, Erin J; Siminovitch, Katherine A

    2007-10-01

    Completion of the Human Genome Project has been rapidly followed by the emergence of high-throughput technologies that combine automation, miniaturization, and many other strategies and tools to enable systematic surveys of genome composition and gene expression. Of particular relevance to the prevention and management of disease are technologies such as high-throughput DNA genotyping, microarray-based gene-expression profiling, and mass spectrometry-facilitated protein profiling--platforms that collectively support the comprehensive analysis of DNA sequence variants across the genome and the global gene and protein expression changes that distinguish health from disease. Now used extensively in all facets of biomedical investigation, genomic and proteomic tools are already beginning to pinpoint molecular variants that influence risk and outcome in common diseases, and to thereby inform and direct development of novel molecular biomarkers and drug targets. As evidenced by recent advances in DNA sequencing methods, continued improvements in the scope, power, and cost efficiency of genomic and proteomic technologies should ensure their capacity to provide the scale and depth of knowledge required for translating genome sequence information into major medical impact.

  14. ThermoAlign: a genome-aware primer design tool for tiled amplicon resequencing

    PubMed Central

    Francis, Felix; Dumas, Michael D.; Wisser, Randall J.

    2017-01-01

    Isolating and sequencing specific regions in a genome is a cornerstone of molecular biology. This has been facilitated by computationally encoding the thermodynamics of DNA hybridization for automated design of hybridization and priming oligonucleotides. However, the repetitive composition of genomes challenges the identification of target-specific oligonucleotides, which limits genetics and genomics research on many species. Here, a tool called ThermoAlign was developed that ensures the design of target-specific primer pairs for DNA amplification. This is achieved by evaluating the thermodynamics of hybridization for full-length oligonucleotide-template alignments — thermoalignments — across the genome to identify primers predicted to bind specifically to the target site. For amplification-based resequencing of regions that cannot be amplified by a single primer pair, a directed graph analysis method is used to identify minimum amplicon tiling paths. Laboratory validation by standard and long-range polymerase chain reaction and amplicon resequencing with maize, one of the most repetitive genomes sequenced to date (≈85% repeat content), demonstrated the specificity-by-design functionality of ThermoAlign. ThermoAlign is released under an open source license and bundled in a dependency-free container for wide distribution. It is anticipated that this tool will facilitate multiple applications in genetics and genomics and be useful in the workflow of high-throughput targeted resequencing studies. PMID:28300202

  15. Genome-editing tools for stem cell biology

    PubMed Central

    Vasileva, E A; Shuvalov, O U; Garabadgiu, A V; Melino, G; Barlev, N A

    2015-01-01

    Human pluripotent stem cells provide a versatile platform for regenerative studies, drug testing and disease modeling. That the expression of only four transcription factors, Oct4, Klf4, Sox2 and c-Myc (OKSM), is sufficient for generation of induced pluripotent stem cells (iPSCs) from differentiated somatic cells has revolutionized the field and also highlighted the importance of OKSM as targets for genome editing. A number of novel genome-editing systems have been developed recently. In this review, we focus on successful applications of several such systems for generation of iPSCs. In particular, we discuss genome-editing systems based on zinc-finger fusion proteins (ZFs), transcription activator-like effectors (TALEs) and an RNA-guided DNA-specific nuclease, Cas9, derived from the bacterial defense system against viruses that utilizes clustered regularly interspaced short palindromic repeats (CRISPR). PMID:26203860

  16. Genome-editing tools for stem cell biology.

    PubMed

    Vasileva, E A; Shuvalov, O U; Garabadgiu, A V; Melino, G; Barlev, N A

    2015-07-23

    Human pluripotent stem cells provide a versatile platform for regenerative studies, drug testing and disease modeling. That the expression of only four transcription factors, Oct4, Klf4, Sox2 and c-Myc (OKSM), is sufficient for generation of induced pluripotent stem cells (iPSCs) from differentiated somatic cells has revolutionized the field and also highlighted the importance of OKSM as targets for genome editing. A number of novel genome-editing systems have been developed recently. In this review, we focus on successful applications of several such systems for generation of iPSCs. In particular, we discuss genome-editing systems based on zinc-finger fusion proteins (ZFs), transcription activator-like effectors (TALEs) and an RNA-guided DNA-specific nuclease, Cas9, derived from the bacterial defense system against viruses that utilizes clustered regularly interspaced short palindromic repeats (CRISPR).

  17. A survey of tools for variant analysis of next-generation genome sequencing data

    PubMed Central

    Pabinger, Stephan; Dander, Andreas; Fischer, Maria; Snajder, Rene; Sperk, Michael; Efremova, Mirjana; Krabichler, Birgit; Speicher, Michael R.; Zschocke, Johannes

    2014-01-01

    Recent advances in genome sequencing technologies provide unprecedented opportunities to characterize individual genomic landscapes and identify mutations relevant for diagnosis and therapy. Specifically, whole-exome sequencing using next-generation sequencing (NGS) technologies is gaining popularity in the human genetics community due to the moderate costs, manageable data amounts and straightforward interpretation of analysis results. While whole-exome and, in the near future, whole-genome sequencing are becoming commodities, data analysis still poses significant challenges and led to the development of a plethora of tools supporting specific parts of the analysis workflow or providing a complete solution. Here, we surveyed 205 tools for whole-genome/whole-exome sequencing data analysis supporting five distinct analytical steps: quality assessment, alignment, variant identification, variant annotation and visualization. We report an overview of the functionality, features and specific requirements of the individual tools. We then selected 32 programs for variant identification, variant annotation and visualization, which were subjected to hands-on evaluation using four data sets: one set of exome data from two patients with a rare disease for testing identification of germline mutations, two cancer data sets for testing variant callers for somatic mutations, copy number variations and structural variations, and one semi-synthetic data set for testing identification of copy number variations. Our comprehensive survey and evaluation of NGS tools provides a valuable guideline for human geneticists working on Mendelian disorders, complex diseases and cancers. PMID:23341494

  18. GEnomes Management Application (GEM.app): a new software tool for large-scale collaborative genome analysis.

    PubMed

    Gonzalez, Michael A; Lebrigio, Rafael F Acosta; Van Booven, Derek; Ulloa, Rick H; Powell, Eric; Speziani, Fiorella; Tekin, Mustafa; Schüle, Rebecca; Züchner, Stephan

    2013-06-01

    Novel genes are now identified at a rapid pace for many Mendelian disorders, and increasingly, for genetically complex phenotypes. However, new challenges have also become evident: (1) effectively managing larger exome and/or genome datasets, especially for smaller labs; (2) direct hands-on analysis and contextual interpretation of variant data in large genomic datasets; and (3) many small and medium-sized clinical and research-based investigative teams around the world are generating data that, if combined and shared, will significantly increase the opportunities for the entire community to identify new genes. To address these challenges, we have developed GEnomes Management Application (GEM.app), a software tool to annotate, manage, visualize, and analyze large genomic datasets (https://genomics.med.miami.edu/). GEM.app currently contains ∼1,600 whole exomes from 50 different phenotypes studied by 40 principal investigators from 15 different countries. The focus of GEM.app is on user-friendly analysis for nonbioinformaticians to make next-generation sequencing data directly accessible. Yet, GEM.app provides powerful and flexible filter options, including single family filtering, across family/phenotype queries, nested filtering, and evaluation of segregation in families. In addition, the system is fast, obtaining results within 4 sec across ∼1,200 exomes. We believe that this system will further enhance identification of genetic causes of human disease. © 2013 Wiley Periodicals, Inc.

  19. GEnomes Management Application (GEM.app): A new software tool for large-scale collaborative genome analysis

    PubMed Central

    Gonzalez, Michael A.; Acosta Lebrigio, Rafael F.; Van Booven, Derek; Ulloa, Rick H.; Powell, Eric; Speziani, Fiorella; Tekin, Mustafa; Schule, Rebecca; Zuchner, Stephan

    2015-01-01

    Novel genes are now identified at a rapid pace for many Mendelian disorders, and increasingly, for genetically complex phenotypes. However, new challenges have also become evident: (1) effectively managing larger exome and/or genome datasets, especially for smaller labs; (2) direct hands-on analysis and contextual interpretation of variant data in large genomic datasets; and (3) many small and medium-sized clinical and research-based investigative teams around the world are generating data that, if combined and shared, will significantly increase the opportunities for the entire community to identify new genes. To address these challenges we have developed GEnomes Management Application (GEM.app), a software tool to annotate, manage, visualize, and analyze large genomic datasets (https://genomics.med.miami.edu/). GEM.app currently contains ~1,600 whole exomes from 50 different phenotypes studied by 40 principal investigators from 15 different countries. The focus of GEM.app is on user-friendly analysis for non-bioinformaticians to make NGS data directly accessible. Yet, GEM.app provides powerful and flexible filter options, including single family filtering, across family/phenotype queries, nested filtering, and evaluation of segregation in families. In addition, the system is fast, obtaining results within 4 seconds across ~1,200 exomes. We believe that this system will further enhance identification of genetic causes of human disease. PMID:23463597

  20. A Tool for Multiple Targeted Genome Deletions that Is Precise, Scar-Free, and Suitable for Automation.

    PubMed

    Aubrey, Wayne; Riley, Michael C; Young, Michael; King, Ross D; Oliver, Stephen G; Clare, Amanda

    2015-01-01

    Many advances in synthetic biology require the removal of a large number of genomic elements from a genome. Most existing deletion methods leave behind markers, and as there are a limited number of markers, such methods can only be applied a fixed number of times. Deletion methods that recycle markers generally are either imprecise (remove untargeted sequences), or leave scar sequences which can cause genome instability and rearrangements. No existing marker recycling method is automation-friendly. We have developed a novel openly available deletion tool that consists of: 1) a method for deleting genomic elements that can be repeatedly used without limit, is precise, scar-free, and suitable for automation; and 2) software to design the method's primers. Our tool is sequence agnostic and could be used to delete large numbers of coding sequences, promoter regions, transcription factor binding sites, terminators, etc in a single genome. We have validated our tool on the deletion of non-essential open reading frames (ORFs) from S. cerevisiae. The tool is applicable to arbitrary genomes, and we provide primer sequences for the deletion of: 90% of the ORFs from the S. cerevisiae genome, 88% of the ORFs from S. pombe genome, and 85% of the ORFs from the L. lactis genome.

  1. A Tool for Multiple Targeted Genome Deletions that Is Precise, Scar-Free, and Suitable for Automation

    PubMed Central

    Aubrey, Wayne; Riley, Michael C.; Young, Michael; King, Ross D.; Oliver, Stephen G.; Clare, Amanda

    2015-01-01

    Many advances in synthetic biology require the removal of a large number of genomic elements from a genome. Most existing deletion methods leave behind markers, and as there are a limited number of markers, such methods can only be applied a fixed number of times. Deletion methods that recycle markers generally are either imprecise (remove untargeted sequences), or leave scar sequences which can cause genome instability and rearrangements. No existing marker recycling method is automation-friendly. We have developed a novel openly available deletion tool that consists of: 1) a method for deleting genomic elements that can be repeatedly used without limit, is precise, scar-free, and suitable for automation; and 2) software to design the method’s primers. Our tool is sequence agnostic and could be used to delete large numbers of coding sequences, promoter regions, transcription factor binding sites, terminators, etc in a single genome. We have validated our tool on the deletion of non-essential open reading frames (ORFs) from S. cerevisiae. The tool is applicable to arbitrary genomes, and we provide primer sequences for the deletion of: 90% of the ORFs from the S. cerevisiae genome, 88% of the ORFs from S. pombe genome, and 85% of the ORFs from the L. lactis genome. PMID:26630677

  2. Genome engineering and gene expression control for bacterial strain development.

    PubMed

    Song, Chan Woo; Lee, Joungmin; Lee, Sang Yup

    2015-01-01

    In recent years, a number of techniques and tools have been developed for genome engineering and gene expression control to achieve desired phenotypes of various bacteria. Here we review and discuss the recent advances in bacterial genome manipulation and gene expression control techniques, and their actual uses with accompanying examples. Genome engineering has been commonly performed based on homologous recombination. During such genome manipulation, the counterselection systems employing SacB or nucleases have mainly been used for the efficient selection of desired engineered strains. The recombineering technology enables simple and more rapid manipulation of the bacterial genome. The group II intron-mediated genome engineering technology is another option for some bacteria that are difficult to be engineered by homologous recombination. Due to the increasing demands on high-throughput screening of bacterial strains having the desired phenotypes, several multiplex genome engineering techniques have recently been developed and validated in some bacteria. Another approach to achieve desired bacterial phenotypes is the repression of target gene expression without the modification of genome sequences. This can be performed by expressing antisense RNA, small regulatory RNA, or CRISPR RNA to repress target gene expression at the transcriptional or translational level. All of these techniques allow efficient and rapid development and screening of bacterial strains having desired phenotypes, and more advanced techniques are expected to be seen. Copyright © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  3. Genome-Based Genetic Tool Development for Bacillus methanolicus: Theta- and Rolling Circle-Replicating Plasmids for Inducible Gene Expression and Application to Methanol-Based Cadaverine Production.

    PubMed

    Irla, Marta; Heggeset, Tonje M B; Nærdal, Ingemar; Paul, Lidia; Haugen, Tone; Le, Simone B; Brautaset, Trygve; Wendisch, Volker F

    2016-01-01

    Bacillus methanolicus is a thermophilic methylotroph able to overproduce amino acids from methanol, a substrate not used for human or animal nutrition. Based on our previous RNA-seq analysis a mannitol inducible promoter and a putative mannitol activator gene mtlR were identified. The mannitol inducible promoter was applied for controlled gene expression using fluorescent reporter proteins and a flow cytometry analysis, and improved by changing the -35 promoter region and by co-expression of the mtlR regulator gene. For independent complementary gene expression control, the heterologous xylose-inducible system from B. megaterium was employed and a two-plasmid gene expression system was developed. Four different replicons for expression vectors were compared with respect to their copy number and stability. As an application example, methanol-based production of cadaverine was shown to be improved from 11.3 to 17.5 g/L when a heterologous lysine decarboxylase gene cadA was expressed from a theta-replicating rather than a rolling-circle replicating vector. The current work on inducible promoter systems and compatible theta- or rolling circle-replicating vectors is an important extension of the poorly developed B. methanolicus genetic toolbox, valuable for genetic engineering and further exploration of this bacterium.

  4. Genome-Based Genetic Tool Development for Bacillus methanolicus: Theta- and Rolling Circle-Replicating Plasmids for Inducible Gene Expression and Application to Methanol-Based Cadaverine Production

    PubMed Central

    Irla, Marta; Heggeset, Tonje M. B.; Nærdal, Ingemar; Paul, Lidia; Haugen, Tone; Le, Simone B.; Brautaset, Trygve; Wendisch, Volker F.

    2016-01-01

    Bacillus methanolicus is a thermophilic methylotroph able to overproduce amino acids from methanol, a substrate not used for human or animal nutrition. Based on our previous RNA-seq analysis a mannitol inducible promoter and a putative mannitol activator gene mtlR were identified. The mannitol inducible promoter was applied for controlled gene expression using fluorescent reporter proteins and a flow cytometry analysis, and improved by changing the -35 promoter region and by co-expression of the mtlR regulator gene. For independent complementary gene expression control, the heterologous xylose-inducible system from B. megaterium was employed and a two-plasmid gene expression system was developed. Four different replicons for expression vectors were compared with respect to their copy number and stability. As an application example, methanol-based production of cadaverine was shown to be improved from 11.3 to 17.5 g/L when a heterologous lysine decarboxylase gene cadA was expressed from a theta-replicating rather than a rolling-circle replicating vector. The current work on inducible promoter systems and compatible theta- or rolling circle-replicating vectors is an important extension of the poorly developed B. methanolicus genetic toolbox, valuable for genetic engineering and further exploration of this bacterium. PMID:27713731

  5. OmniMapFree: A unified tool to visualise and explore sequenced genomes

    PubMed Central

    2011-01-01

    • Background Acquiring and exploring whole genome sequence information for a species under investigation is now a routine experimental approach. On most genome browsers, typically, only the DNA sequence, EST support, motif search results, and GO annotations are displayed. However, for many species, a growing volume of additional experimental information is available but this is rarely searchable within the landscape of the entire genome. • Results We have developed a generic software which permits users to view a single genome in entirety either within its chromosome or supercontig context within a single window. This software permits the genome to be displayed at any scales and with any features. Different data types and data sets are displayed onto the genome, which have been acquired from other types of studies including classical genetics, forward and reverse genetics, transcriptomics, proteomics and improved annotation from alternative sources. In each display, different types of information can be overlapped, then retrieved in the desired combinations and scales and used in follow up analyses. The displays generated are of publication quality. • Conclusions OmniMapFree provides a unified, versatile and easy-to-use software tool for studying a single genome in association with all the other datasets and data types available for the organism. PMID:22085540

  6. Microbial ecology in the age of genomics and metagenomics: concepts, tools, and recent advances.

    PubMed

    Xu, Jianping

    2006-06-01

    Microbial ecology examines the diversity and activity of micro-organisms in Earth's biosphere. In the last 20 years, the application of genomics tools have revolutionized microbial ecological studies and drastically expanded our view on the previously underappreciated microbial world. This review first introduces the basic concepts in microbial ecology and the main genomics methods that have been used to examine natural microbial populations and communities. In the ensuing three specific sections, the applications of the genomics in microbial ecological research are highlighted. The first describes the widespread application of multilocus sequence typing and representational difference analysis in studying genetic variation within microbial species. Such investigations have identified that migration, horizontal gene transfer and recombination are common in natural microbial populations and that microbial strains can be highly variable in genome size and gene content. The second section highlights and summarizes the use of four specific genomics methods (phylogenetic analysis of ribosomal RNA, DNA-DNA re-association kinetics, metagenomics, and micro-arrays) in analysing the diversity and potential activity of microbial populations and communities from a variety of terrestrial and aquatic environments. Such analyses have identified many unexpected phylogenetic lineages in viruses, bacteria, archaea, and microbial eukaryotes. Functional analyses of environmental DNA also revealed highly prevalent, but previously unknown, metabolic processes in natural microbial communities. In the third section, the ecological implications of sequenced microbial genomes are briefly discussed. Comparative analyses of prokaryotic genomic sequences suggest the importance of ecology in determining microbial genome size and gene content. The significant variability in genome size and gene content among strains and species of prokaryotes indicate the highly fluid nature of prokaryotic

  7. Building a model: developing genomic resources for common milkweed (Asclepias syriaca) with low coverage genome sequencing

    PubMed Central

    2011-01-01

    Background Milkweeds (Asclepias L.) have been extensively investigated in diverse areas of evolutionary biology and ecology; however, there are few genetic resources available to facilitate and compliment these studies. This study explored how low coverage genome sequencing of the common milkweed (Asclepias syriaca L.) could be useful in characterizing the genome of a plant without prior genomic information and for development of genomic resources as a step toward further developing A. syriaca as a model in ecology and evolution. Results A 0.5× genome of A. syriaca was produced using Illumina sequencing. A virtually complete chloroplast genome of 158,598 bp was assembled, revealing few repeats and loss of three genes: accD, clpP, and ycf1. A nearly complete rDNA cistron (18S-5.8S-26S; 7,541 bp) and 5S rDNA (120 bp) sequence were obtained. Assessment of polymorphism revealed that the rDNA cistron and 5S rDNA had 0.3% and 26.7% polymorphic sites, respectively. A partial mitochondrial genome sequence (130,764 bp), with identical gene content to tobacco, was also assembled. An initial characterization of repeat content indicated that Ty1/copia-like retroelements are the most common repeat type in the milkweed genome. At least one A. syriaca microread hit 88% of Catharanthus roseus (Apocynaceae) unigenes (median coverage of 0.29×) and 66% of single copy orthologs (COSII) in asterids (median coverage of 0.14×). From this partial characterization of the A. syriaca genome, markers for population genetics (microsatellites) and phylogenetics (low-copy nuclear genes) studies were developed. Conclusions The results highlight the promise of next generation sequencing for development of genomic resources for any organism. Low coverage genome sequencing allows characterization of the high copy fraction of the genome and exploration of the low copy fraction of the genome, which facilitate the development of molecular tools for further study of a target species and its relatives

  8. Building a model: developing genomic resources for common milkweed (Asclepias syriaca) with low coverage genome sequencing.

    PubMed

    Straub, Shannon C K; Fishbein, Mark; Livshultz, Tatyana; Foster, Zachary; Parks, Matthew; Weitemier, Kevin; Cronn, Richard C; Liston, Aaron

    2011-05-04

    Milkweeds (Asclepias L.) have been extensively investigated in diverse areas of evolutionary biology and ecology; however, there are few genetic resources available to facilitate and compliment these studies. This study explored how low coverage genome sequencing of the common milkweed (Asclepias syriaca L.) could be useful in characterizing the genome of a plant without prior genomic information and for development of genomic resources as a step toward further developing A. syriaca as a model in ecology and evolution. A 0.5× genome of A. syriaca was produced using Illumina sequencing. A virtually complete chloroplast genome of 158,598 bp was assembled, revealing few repeats and loss of three genes: accD, clpP, and ycf1. A nearly complete rDNA cistron (18S-5.8S-26S; 7,541 bp) and 5S rDNA (120 bp) sequence were obtained. Assessment of polymorphism revealed that the rDNA cistron and 5S rDNA had 0.3% and 26.7% polymorphic sites, respectively. A partial mitochondrial genome sequence (130,764 bp), with identical gene content to tobacco, was also assembled. An initial characterization of repeat content indicated that Ty1/copia-like retroelements are the most common repeat type in the milkweed genome. At least one A. syriaca microread hit 88% of Catharanthus roseus (Apocynaceae) unigenes (median coverage of 0.29×) and 66% of single copy orthologs (COSII) in asterids (median coverage of 0.14×). From this partial characterization of the A. syriaca genome, markers for population genetics (microsatellites) and phylogenetics (low-copy nuclear genes) studies were developed. The results highlight the promise of next generation sequencing for development of genomic resources for any organism. Low coverage genome sequencing allows characterization of the high copy fraction of the genome and exploration of the low copy fraction of the genome, which facilitate the development of molecular tools for further study of a target species and its relatives. This study represents a first

  9. Learning-Oriented Instructional Development Tools.

    ERIC Educational Resources Information Center

    Merrill, M. David

    1997-01-01

    Discusses design requirements, and advantages and disadvantages of the following learner-centered instructional development tools: information containers; authoring systems; templates, models, or widgets; learning-oriented instructional development tools; and adaptive learning-oriented systems. (AEF)

  10. Family history in public health practice: a genomic tool for disease prevention and health promotion.

    PubMed

    Valdez, Rodolfo; Yoon, Paula W; Qureshi, Nadeem; Green, Ridgely Fisk; Khoury, Muin J

    2010-01-01

    Family history is a risk factor for many chronic diseases, including cancer, cardiovascular disease, and diabetes. Professional guidelines usually include family history to assess health risk, initiate interventions, and motivate behavioral changes. The advantages of family history over other genomic tools include a lower cost, greater acceptability, and a reflection of shared genetic and environmental factors. However, the utility of family history in public health has been poorly explored. To establish family history as a public health tool, it needs to be evaluated within the ACCE framework (analytical validity; clinical validity; clinical utility; and ethical, legal, and social issues). Currently, private and public organizations are developing tools to collect standardized family histories of many diseases. Their goal is to create family history tools that have decision support capabilities and are compatible with electronic health records. These advances will help realize the potential of family history as a public health tool.

  11. Emerging Imaging and Genomic Tools for Developmental Systems Biology.

    PubMed

    Liu, Zhe; Keller, Philipp J

    2016-03-21

    Animal development is a complex and dynamic process orchestrated by exquisitely timed cell lineage commitment, divisions, migration, and morphological changes at the single-cell level. In the past decade, extensive genetic, stem cell, and genomic studies provided crucial insights into molecular underpinnings and the functional importance of genetic pathways governing various cellular differentiation processes. However, it is still largely unknown how the precise coordination of these pathways is achieved at the whole-organism level and how the highly regulated spatiotemporal choreography of development is established in turn. Here, we discuss the latest technological advances in imaging and single-cell genomics that hold great promise for advancing our understanding of this intricate process. We propose an integrated approach that combines such methods to quantitatively decipher in vivo cellular dynamic behaviors and their underlying molecular mechanisms at the systems level with single-cell, single-molecule resolution.

  12. ECR Browser: A Tool For Visualizing And Accessing Data From Comparisons Of Multiple Vertebrate Genomes

    SciTech Connect

    Loots, G G; Ovcharenko, I; Stubbs, L; Nobrega, M A

    2004-01-06

    The increasing number of vertebrate genomes being sequenced in draft or finished form provide a unique opportunity to study and decode the language of DNA sequence through comparative genome alignments. However, novel tools and strategies are required to accommodate this increasing volume of genomic information and to facilitate experimental annotation of genome function. Here we present the ECR Browser, a tool that provides an easy and dynamic access to whole genome alignments of human, mouse, rat and fish sequences. This web-based tool (http://ecrbrowser.dcode.org) provides the starting point for discovery of novel genes, identification of distant gene regulatory elements and prediction of transcription factor binding sites. The genome alignment portal of the ECR Browser also permits fast and automated alignment of any user-submitted sequence to the genome of choice. The interconnection of the ECR browser with other DNA sequence analysis tools creates a unique portal for studying and exploring vertebrate genomes.

  13. Single cell analytic tools for drug discovery and development

    PubMed Central

    Heath, James R.; Ribas, Antoni; Mischel, Paul S.

    2016-01-01

    The genetic, functional, or compositional heterogeneity of healthy and diseased tissues presents major challenges in drug discovery and development.1-3 In cancers, heterogeneity may be essential for tumor stability,4 but its precise role in tumor biology is poorly resolved. This challenges the design of accurate disease models for use in drug development, and can confound the interpretation of biomarker levels, and of patient responses to specific therapies. The complex nature of heterogeneous tissues has motivated the development of tools for single cell genomic, transcriptomic, and multiplex proteomic analysis. We review these tools, assess their advantages and limitations, and explore their potential applications in drug discovery and development. PMID:26669673

  14. Bioinformatics tools for genome mining of polyketide and non-ribosomal peptides.

    PubMed

    Boddy, Christopher N

    2014-02-01

    Microbial natural products have played a key role in the development of clinical agents in nearly all therapeutic areas. Recent advances in genome sequencing have revealed that there is an incredible wealth of new polyketide and non-ribosomal peptide natural product diversity to be mined from genetic data. The diversity and complexity of polyketide and non-ribosomal peptide biosynthesis has required the development of unique bioinformatics tools to identify, annotate, and predict the structures of these natural products from their biosynthetic gene clusters. This review highlights and evaluates web-based bioinformatics tools currently available to the natural product community for genome mining to discover new polyketides and non-ribosomal peptides.

  15. System analysis: Developing tools for the future

    SciTech Connect

    De Jong, K.; clever, J.; Draper, J.V.; Davies, B.; Lonks, A.

    1996-02-01

    This report introduces and evaluates system analysis tools that were developed, or are under development, for the Robotics Technology Development Program (RTDP). Additionally, it discusses system analysis work completed using these tools aimed at completing a system analysis of the retrieval of waste from underground storage tanks on the Hanford Reservation near Richland, Washington. The tools developed and evaluated include a mixture of commercially available tools adapted to RTDP requirements, and some tools developed in house. The tools that are included in this report include: a Process Diagramming Tool, a Cost Modeling Tool, an Amortization Modeling Tool, a graphical simulation linked to the Cost Modeling Tool, a decision assistance tool, and a system thinking tool. Additionally, the importance of performance testing to the RTDP and the results of such testing executed is discussed. Further, the results of the Tank Waste Retrieval (TWR) System Diagram, the TWR Operations Cost Model, and the TWR Amortization Model are presented, and the implication of the results are discussed. Finally, the RTDP system analysis tools are assessed and some recommendations are made regarding continuing development of the tools and process.

  16. CrossMap: a versatile tool for coordinate conversion between genome assemblies

    PubMed Central

    Zhao, Hao; Sun, Zhifu; Wang, Jing; Huang, Haojie; Kocher, Jean-Pierre; Wang, Liguo

    2014-01-01

    Motivation: Reference genome assemblies are subject to change and refinement from time to time. Generally, researchers need to convert the results that have been analyzed according to old assemblies to newer versions, or vice versa, to facilitate meta-analysis, direct comparison, data integration and visualization. Several useful conversion tools can convert genome interval files in browser extensible data or general feature format, but none have the functionality to convert files in sequence alignment map or BigWig format. This is a significant gap in computational genomics tools, as these formats are the ones most widely used for representing high-throughput sequencing data, such as RNA-seq, chromatin immunoprecipitation sequencing, DNA-seq, etc. Results: Here we developed CrossMap, a versatile and efficient tool for converting genome coordinates between assemblies. CrossMap supports most of the commonly used file formats, including BAM, sequence alignment map, Wiggle, BigWig, browser extensible data, general feature format, gene transfer format and variant call format. Availability and implementation: CrossMap is written in Python and C. Source code and a comprehensive user’s manual are freely available at: http://crossmap.sourceforge.net/. Contact: Kocher.JeanPierre@mayo.edu or wang.liguo@mayo.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:24351709

  17. Genomic Enzymology: Web Tools for Leveraging Protein Family Sequence-Function Space and Genome Context to Discover Novel Functions.

    PubMed

    Gerlt, John A

    2017-08-22

    The exponentially increasing number of protein and nucleic acid sequences provides opportunities to discover novel enzymes, metabolic pathways, and metabolites/natural products, thereby adding to our knowledge of biochemistry and biology. The challenge has evolved from generating sequence information to mining the databases to integrating and leveraging the available information, i.e., the availability of "genomic enzymology" web tools. Web tools that allow identification of biosynthetic gene clusters are widely used by the natural products/synthetic biology community, thereby facilitating the discovery of novel natural products and the enzymes responsible for their biosynthesis. However, many novel enzymes with interesting mechanisms participate in uncharacterized small-molecule metabolic pathways; their discovery and functional characterization also can be accomplished by leveraging information in protein and nucleic acid databases. This Perspective focuses on two genomic enzymology web tools that assist the discovery novel metabolic pathways: (1) Enzyme Function Initiative-Enzyme Similarity Tool (EFI-EST) for generating sequence similarity networks to visualize and analyze sequence-function space in protein families and (2) Enzyme Function Initiative-Genome Neighborhood Tool (EFI-GNT) for generating genome neighborhood networks to visualize and analyze the genome context in microbial and fungal genomes. Both tools have been adapted to other applications to facilitate target selection for enzyme discovery and functional characterization. As the natural products community has demonstrated, the enzymology community needs to embrace the essential role of web tools that allow the protein and genome sequence databases to be leveraged for novel insights into enzymological problems.

  18. The capsicum transcriptome DB: a "hot" tool for genomic research.

    PubMed

    Góngora-Castillo, Elsa; Fajardo-Jaime, Rubén; Fernández-Cortes, Araceli; Jofre-Garfias, Alba E; Lozoya-Gloria, Edmundo; Martínez, Octavio; Ochoa-Alejo, Neftalí; Rivera-Bustamante, Rafael

    2012-01-01

    Chili pepper (Capsicum annuum) is an economically important crop with no available public genome sequence. We describe a genomic resource to facilitate Capsicum annuum research. A collection of Expressed Sequence Tags (ESTs) derived from five C. annuum organs (root, stem, leaf, flower and fruit) were sequenced using the Sanger method and multiple leaf transcriptomes were deeply sampled using with GS-pyrosequencing. A hybrid assembly of 1,324,516 raw reads yielded 32,314 high quality contigs as validated by coverage and identity analysis with existing pepper sequences. Overall, 75.5% of the contigs had significant sequence similarity to entries in nucleic acid and protein databases; 23% of the sequences have not been previously reported for C. annuum and expand sequence resources for this species. A MySQL database and a user-friendly Web interface were constructed with search-tools that permit queries of the ESTs including sequence, functional annotation, Gene Ontology classification, metabolic pathways, and assembly information. The Capsicum Transcriptome DB is free available from http://www.bioingenios.ira.cinvestav.mx:81/Joomla/

  19. Modeling Tools and Ontology Development

    NASA Astrophysics Data System (ADS)

    Gaševic, Dragan; Djuric, Dragan; Devedžic, Vladan

    In this chapter, we give a few tutorials on how to use some of the currently available UML tools to create ontologies using the Ontology UML Profile. We also provide a brief overview of the Atlas Transformation Language (ATL) and its tooling support. Here we will show how ATL can be used for transforming ODM, and thus continue our discussion about transformations provided in Chap. 10.

  20. Flight Awareness Collaboration Tool Development

    NASA Technical Reports Server (NTRS)

    Mogford, Richard

    2016-01-01

    This is a PowerPoint presentation covering airline operations center (AOC) research. It reviews a dispatcher decision support tool called the Flight Awareness Collaboration Tool (FACT). FACT gathers information about winter weather onto one screen and includes predictive abilities. FACT should prove to be useful for airline dispatchers and airport personnel when they manage winter storms and their effect on air traffic. This material is very similar to other previously approved presentations.

  1. Human genome project: New tools for tomorrow's health research

    SciTech Connect

    Not Available

    1990-01-01

    The Human Genome Project is discussed within the context of the benefits that can be derived from human biological and biomedical research in the 21st century. Explanations are given about what chromosome mapping is and the different kinds that exist. Next, model organisms that were developed to better understand and interpret genetic information are discussed, as well as the database that the project will develop and the training opportunities that such a database will afford to emerging technologies. Finally, a review is given of the personal and social implications stemming from greater genetic knowledge.

  2. Mos1 transposition as a tool to engineer the Caenorhabditis elegans genome by homologous recombination.

    PubMed

    Robert, Valérie J P; Katic, Iskra; Bessereau, Jean-Louis

    2009-11-01

    Gene knockouts and knock-ins have emerged as powerful tools to study gene function in model organisms. The construction of such engineered alleles requires that homologous recombination between a transgenic fragment carrying the modifications desired in the genome and the locus to engineer occurs at high frequencies. Homologous recombination frequency is significantly increased in the vicinity of a DNA double-strand break. Based on this observation, a new generation of transgene-instructed genome engineering protocols was developed. Here, we present MosTIC (for "Mos1 excision-induced transgene-instructed gene conversion"), a new technique that provides a means to engineer the Caenorhabditis elegans genome. MosTIC is initiated by the mobilization of Mos1, a Drosophila transposon experimentally introduced in C. elegans. During MosTIC, a Mos1 insertion localized in the genomic region to engineer is mobilized after germline expression of the Mos transposase. Mos1 excision generates a DNA double-strand break, which is repaired by homologous recombination using a transgenic repair template. This results in the transfer of information from the transgene into the genome. Depending on the method used to trigger Mos1 excision, two alternative MosTIC protocols are available, which are presented here in detail. This technique can be used for a wide range of applications, such as structure-function analysis, protein localization and purification, genetic screens or generation of single copy transgenes at a defined locus in the genome.

  3. Transposon tools: worldwide landscape of intellectual property and technological developments.

    PubMed

    Palazzoli, Fabien; Testu, François-Xavier; Merly, Franck; Bigot, Yves

    2010-03-01

    DNA transposons are considered to be good candidates for developing tools for genome engineering, insertional mutagenesis and gene delivery for therapeutic purposes, as illustrated by the recent first clinical trial of a transposon. In this article we set out to highlight the interest of patent information, and to develop a strategy for the technological development of transposon tools, similar to what has been done in many other fields. We propose a patent landscape for transposon tools, including the changes in international patent applications, and review the leading inventors and applicants. We also provide an overview of the potential patent portfolio for the prokaryotic and eukaryotic transposons that are exploited by spin-off companies. Finally, we discuss the difficulties involved in tracing relevant state-of-the-art of articles and patent documents, based on the example of one of the most promising transposon systems, including all the impacts on the technological development of transposon tools.

  4. Tools for Accurate and Efficient Analysis of Complex Evolutionary Mechanisms in Microbial Genomes. Final Report

    SciTech Connect

    Nakhleh, Luay

    2014-03-12

    I proposed to develop computationally efficient tools for accurate detection and reconstruction of microbes' complex evolutionary mechanisms, thus enabling rapid and accurate annotation, analysis and understanding of their genomes. To achieve this goal, I proposed to address three aspects. (1) Mathematical modeling. A major challenge facing the accurate detection of HGT is that of distinguishing between these two events on the one hand and other events that have similar "effects." I proposed to develop a novel mathematical approach for distinguishing among these events. Further, I proposed to develop a set of novel optimization criteria for the evolutionary analysis of microbial genomes in the presence of these complex evolutionary events. (2) Algorithm design. In this aspect of the project, I proposed to develop an array of e cient and accurate algorithms for analyzing microbial genomes based on the formulated optimization criteria. Further, I proposed to test the viability of the criteria and the accuracy of the algorithms in an experimental setting using both synthetic as well as biological data. (3) Software development. I proposed the nal outcome to be a suite of software tools which implements the mathematical models as well as the algorithms developed.

  5. geneSurv: An interactive web-based tool for survival analysis in genomics research.

    PubMed

    Korkmaz, Selcuk; Goksuluk, Dincer; Zararsiz, Gokmen; Karahan, Sevilay

    2017-09-05

    Survival analysis methods are often used in cancer studies. It has been shown that the combination of clinical data with genomics increases the predictive performance of survival analysis methods. But, this leads to a high-dimensional data problem. Fortunately, new methods have been developed in the last decade to overcome this problem. However, there is a strong need for easily accessible, user-friendly and interactive tool to perform survival analysis in the presence of genomics data. We developed an open-source and freely available web-based tool for survival analysis methods that can deal with high-dimensional data. This tool includes classical methods, such as Kaplan-Meier, Cox proportional hazards regression, and advanced methods, such as penalized Cox regression and Random Survival Forests. It also offers an optimal cutoff determination method based on maximizing several test statistics. The tool has a simple and interactive interface, and it can handle high dimensional data through feature selection and ensemble methods. To dichotomize gene expressions, geneSurv can identify optimal cutoff points. Users can upload their microarray, RNA-Seq, chip-Seq, proteomics, metabolomics or clinical data as a nxp dimensional data matrix, where n refers to samples and p refers to genes. This tool is available free at www.biosoft.hacettepe.edu.tr/geneSurv. All source code is available at https://github.com/selcukorkmaz/geneSurv under the GPL-3 license. Copyright © 2017 Elsevier Ltd. All rights reserved.

  6. The hidden perils of read mapping as a quality assessment tool in genome sequencing

    PubMed Central

    Lehri, B.; Seddon, A. M.; Karlyshev, A. V.

    2017-01-01

    This article provides a comparative analysis of the various methods of genome sequencing focusing on verification of the assembly quality. The results of a comparative assessment of various de novo assembly tools, as well as sequencing technologies, are presented using a recently completed sequence of the genome of Lactobacillus fermentum 3872. In particular, quality of assemblies is assessed by using CLC Genomics Workbench read mapping and Optical mapping developed by OpGen. Over-extension of contigs without prior knowledge of contig location can lead to misassembled contigs, even when commonly used quality indicators such as read mapping suggest that a contig is well assembled. Precautions must also be undertaken when using long read sequencing technology, which may also lead to misassembled contigs. PMID:28225089

  7. FilooT: a visualization tool for exploring genomic data

    NASA Astrophysics Data System (ADS)

    Zeinaly, Mahshid; Soltangheis, Mina; Shaw, Chris D.

    2013-12-01

    In order to enhance analysis of synthetic health data of the IEEE VAST Challenge 2010, we introduce an interactive Visual Analytics tool called FilooT designed as a part of the Interactive Multi-genomic Analysis System (IMAS) project. In this paper, we describe different interactive views of FilooT: the Tabular View for exploring and comparing genetic sequences, the Matrix View for sorting sequences according to the values of different characteristics, the P-value View for finding the most important mutations across a family of sequences, the Graph View for finding related sequences and the Group View to group them for further investigation. We followed the Nested Process Model framework throughout the design process and the evaluation. To understand the tool's design capabilities for target domain analysts, we conducted a User Experience scenario-based study followed by an informal interview. The findings indicated how analysts employ each of the visualization and interaction designs in their Bioinformatics task-analysis process. The critical analysis of the results inspired design informing suggestions.

  8. Cpf1 Is A Versatile Tool for CRISPR Genome Editing Across Diverse Species of Cyanobacteria

    PubMed Central

    Ungerer, Justin; Pakrasi, Himadri B.

    2016-01-01

    Cyanobacteria are the ideal organisms for the production of a wide range of bioproducts as they can convert CO2 directly into the desired end product using solar energy. Unfortunately, the engineering of cyanobacteria to create efficient cell factories has been impaired by the cumbersome genetic tools that are currently available for these organisms; especially when trying to accumulate multiple modifications. We sought to construct an efficient and precise tool for generating numerous markerless modifications in cyanobacteria using CRISPR technology and the alternative nuclease, Cpf1. In this study we demonstrate rapid engineering of markerless knock-ins, knock-outs and point mutations in each of three model cyanobacteria; Synechococcus, Synechocystis and Anabaena. The markerless nature of cpf1 genome editing will allow for complex genome modification that was not possible with previously existing technology while facilitating the development of cyanobacteria as highly modified biofactories. PMID:28000776

  9. Cpf1 Is A Versatile Tool for CRISPR Genome Editing Across Diverse Species of Cyanobacteria.

    PubMed

    Ungerer, Justin; Pakrasi, Himadri B

    2016-12-21

    Cyanobacteria are the ideal organisms for the production of a wide range of bioproducts as they can convert CO2 directly into the desired end product using solar energy. Unfortunately, the engineering of cyanobacteria to create efficient cell factories has been impaired by the cumbersome genetic tools that are currently available for these organisms; especially when trying to accumulate multiple modifications. We sought to construct an efficient and precise tool for generating numerous markerless modifications in cyanobacteria using CRISPR technology and the alternative nuclease, Cpf1. In this study we demonstrate rapid engineering of markerless knock-ins, knock-outs and point mutations in each of three model cyanobacteria; Synechococcus, Synechocystis and Anabaena. The markerless nature of cpf1 genome editing will allow for complex genome modification that was not possible with previously existing technology while facilitating the development of cyanobacteria as highly modified biofactories.

  10. Proteogenomic Tools and Approaches to Explore Protein Coding Landscapes of Eukaryotic Genomes.

    PubMed

    Kumar, Dhirendra; Dash, Debasis

    2016-01-01

    Proteogenomic strategies aim to refine genome-wide annotations of protein coding features by using actual protein level observations. Most of the currently applied proteogenomic approaches include integrative analysis of multiple types of high-throughput omics data, e.g., genomics, transcriptomics, proteomics, etc. Recent efforts towards creating a human proteome map were primarily targeted to experimentally detect at least one protein product for each gene in the genome and extensively utilized proteogenomic approaches. The 14 year long wait to get a draft human proteome map, after completion of similar efforts to sequence the genome, explains the huge complexity and technical hurdles of such efforts. Further, the integrative analysis of large-scale multi-omics datasets inherent to these studies becomes a major bottleneck to their success. However, recent developments of various analysis tools and pipelines dedicated to proteogenomics reduce both the time and complexity of such analysis. Here, we summarize notable approaches, studies, software developments and their potential applications towards eukaryotic genome annotation and clinical proteogenomics.

  11. Sequencing Single Cell Microbial Genomes with Microfluidic Amplifications Tools (MICW - Metagenomics Informatics Challenges Workshop: 10K Genomes at a Time)

    ScienceCinema

    Quake, Steve [University of Stanford

    2016-07-12

    Stanford University's Steve Quake on "Sequencing Single Cell Microbial Genomes with Microfluidic Amplification Tools" at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.

  12. Sequencing Single Cell Microbial Genomes with Microfluidic Amplifications Tools (MICW - Metagenomics Informatics Challenges Workshop: 10K Genomes at a Time)

    SciTech Connect

    Quake, Steve

    2011-10-12

    Stanford University's Steve Quake on "Sequencing Single Cell Microbial Genomes with Microfluidic Amplification Tools" at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.

  13. Program Development Tools and Infrastructures

    SciTech Connect

    Schulz, M

    2012-03-12

    Exascale class machines will exhibit a new level of complexity: they will feature an unprecedented number of cores and threads, will most likely be heterogeneous and deeply hierarchical, and offer a range of new hardware techniques (such as speculative threading, transactional memory, programmable prefetching, and programmable accelerators), which all have to be utilized for an application to realize the full potential of the machine. Additionally, users will be faced with less memory per core, fixed total power budgets, and sharply reduced MTBFs. At the same time, it is expected that the complexity of applications will rise sharply for exascale systems, both to implement new science possible at exascale and to exploit the new hardware features necessary to achieve exascale performance. This is particularly true for many of the NNSA codes, which are large and often highly complex integrated simulation codes that push the limits of everything in the system including language features. To overcome these limitations and to enable users to reach exascale performance, users will expect a new generation of tools that address the bottlenecks of exascale machines, that work seamlessly with the (set of) programming models on the target machines, that scale with the machine, that provide automatic analysis capabilities, and that are flexible and modular enough to overcome the complexities and changing demands of the exascale architectures. Further, any tool must be robust enough to handle the complexity of large integrated codes while keeping the user's learning curve low. With the ASC program, in particular the CSSE (Computational Systems and Software Engineering) and CCE (Common Compute Environment) projects, we are working towards a new generation of tools that fulfill these requirements and that provide our users as well as the larger HPC community with the necessary tools, techniques, and methodologies required to make exascale performance a reality.

  14. Capitalizing on App Development Tools and Technologies

    ERIC Educational Resources Information Center

    Luterbach, Kenneth J.; Hubbell, Kenneth R.

    2015-01-01

    Instructional developers and others creating apps must choose from a wide variety of app development tools and technologies. Some app development tools have incorporated visual programming features, which enable some drag and drop coding and contextual programming. While those features help novices begin programming with greater ease, questions…

  15. Capitalizing on App Development Tools and Technologies

    ERIC Educational Resources Information Center

    Luterbach, Kenneth J.; Hubbell, Kenneth R.

    2015-01-01

    Instructional developers and others creating apps must choose from a wide variety of app development tools and technologies. Some app development tools have incorporated visual programming features, which enable some drag and drop coding and contextual programming. While those features help novices begin programming with greater ease, questions…

  16. Optical Whole-Genome Restriction Mapping as a Tool for Rapidly Distinguishing and Identifying Bacterial Contaminants in Clinical Samples

    PubMed Central

    Baldwin, James C.

    2015-01-01

    Introduction Optical restriction genome mapping is a technology in which a genome is linearized on a surface and digested with specific restriction enzymes, giving an arrangement of the genome with gaps whose order and size are unique for a given organism. Current applications of this technology include assisting with the correct scaffolding and ordering of genomes in conjunction with whole-genome sequencing, observation of genetic drift and evolution using comparative genomics and epidemiological monitoring of the spread of infections. Here, we investigated the suitability of genome mapping for use in clinical labs as a potential diagnostic tool. Materials and Methods Using whole genome mapping, we investigated the basic performance of the technology for identifying two bacteria of interest for food-safety (Lactobacilli spp. and Enterohemorrhagic Escherichia coli). We further evaluated the performance for identifying multiple organisms from both simple and complex mixtures. Results We were able to successfully generate optical restriction maps of four Lactobacillus species as well as a strain of Enterohemorrhagic Escherichia coli from within a mixed solution, each distinguished using a common compatible restriction enzyme. Finally, we demonstrated that optical restriction maps were successfully obtained and the correct organism identified within a clinical matrix. Conclusion With additional development, whole genome mapping may be a useful clinical tool for rapid invitro diagnostics. PMID:26435946

  17. Optical Whole-Genome Restriction Mapping as a Tool for Rapidly Distinguishing and Identifying Bacterial Contaminants in Clinical Samples.

    PubMed

    Chapleau, Richard R; Baldwin, James C

    2015-08-01

    Optical restriction genome mapping is a technology in which a genome is linearized on a surface and digested with specific restriction enzymes, giving an arrangement of the genome with gaps whose order and size are unique for a given organism. Current applications of this technology include assisting with the correct scaffolding and ordering of genomes in conjunction with whole-genome sequencing, observation of genetic drift and evolution using comparative genomics and epidemiological monitoring of the spread of infections. Here, we investigated the suitability of genome mapping for use in clinical labs as a potential diagnostic tool. Using whole genome mapping, we investigated the basic performance of the technology for identifying two bacteria of interest for food-safety (Lactobacilli spp. and Enterohemorrhagic Escherichia coli). We further evaluated the performance for identifying multiple organisms from both simple and complex mixtures. We were able to successfully generate optical restriction maps of four Lactobacillus species as well as a strain of Enterohemorrhagic Escherichia coli from within a mixed solution, each distinguished using a common compatible restriction enzyme. Finally, we demonstrated that optical restriction maps were successfully obtained and the correct organism identified within a clinical matrix. With additional development, whole genome mapping may be a useful clinical tool for rapid invitro diagnostics.

  18. Functional Genomics Tools for Haemonchus contortus and Lessons From Other Helminths.

    PubMed

    Britton, C; Roberts, B; Marks, N D

    2016-01-01

    The availability of genome and transcriptome data for parasitic nematodes, including Haemonchus contortus, has highlighted the need to develop functional genomics tools. Comparative genomic analysis, particularly using data from the free-living nematode Caenorhabditis elegans, can help predict gene function. Reliable approaches to study function directly in parasitic nematodes are currently lacking. However, gene knockdown by RNA interference (RNAi) is being successfully used in schistosome and planarian species to define gene functions. Lessons from these systems may be applied to improve RNAi in H. contortus. Previous studies in H. contortus and related nematodes demonstrated reliable RNAi-mediated silencing of some genes, but not others. Current data suggest that susceptibility to RNAi in these nematodes is limited to genes expressed in sites accessible to the environment, such as the gut, amphids and excretory cell. Therefore, RNAi is functional in H. contortus, but improvements are needed to develop this system as a functional genomics platform. Here, we summarize RNAi studies on H. contortus and discuss the optimization of RNA delivery and improvements to culture methods to enhance larval development, protein turnover and the induction of phenotypic effects in vitro. The transgenic delivery of RNA or dominant-negative gene constructs and the recently developed CRISPR/Cas genome-editing technique are considered as potential alternative approaches for gene knockout. This is a key time to devote greater effort in progressing from genome to function, to improve our understanding of the biology of Haemonchus and identify novel targets for parasite control. Copyright © 2016 Elsevier Ltd. All rights reserved.

  19. Molecular tools for functional genomics in filamentous fungi: recent advances and new strategies.

    PubMed

    Jiang, Dewei; Zhu, Wei; Wang, Yunchuan; Sun, Chang; Zhang, Ke-Qin; Yang, Jinkui

    2013-12-01

    Advances in genetic transformation techniques have made important contributions to molecular genetics. Various molecular tools and strategies have been developed for functional genomic analysis of filamentous fungi since the first DNA transformation was successfully achieved in Neurospora crassa in 1973. Increasing amounts of genomic data regarding filamentous fungi are continuously reported and large-scale functional studies have become common in a wide range of fungal species. In this review, various molecular tools used in filamentous fungi are compared and discussed, including methods for genetic transformation (e.g., protoplast transformation, electroporation, and microinjection), the construction of random mutant libraries (e.g., restriction enzyme mediated integration, transposon arrayed gene knockout, and Agrobacterium tumefaciens mediated transformation), and the analysis of gene function (e.g., RNA interference and transcription activator-like effector nucleases). We also focused on practical strategies that could enhance the efficiency of genetic manipulation in filamentous fungi, such as choosing a proper screening system and marker genes, assembling target-cassettes or vectors effectively, and transforming into strains that are deficient in the nonhomologous end joining pathway. In summary, we present an up-to-date review on the different molecular tools and latest strategies that have been successfully used in functional genomics in filamentous fungi.

  20. Ensembl Plants: Integrating Tools for Visualizing, Mining, and Analyzing Plant Genomic Data.

    PubMed

    Bolser, Dan M; Staines, Daniel M; Perry, Emily; Kersey, Paul J

    2017-01-01

    Ensembl Plants ( http://plants.ensembl.org ) is an integrative resource presenting genome-scale information for 39 sequenced plant species. Available data includes genome sequence, gene models, functional annotation, and polymorphic loci; for the latter, additional information including population structure, individual genotypes, linkage, and phenotype data is available for some species. Comparative data is also available, including genomic alignments and "gene trees," which show the inferred evolutionary history of each gene family represented in the resource. Access to the data is provided through a genome browser, which incorporates many specialist interfaces for different data types, through a variety of programmatic interfaces, and via a specialist data mining tool supporting rapid filtering and retrieval of bulk data. Genomic data from many non-plant species, including those of plant pathogens, pests, and pollinators, is also available via the same interfaces through other divisions of Ensembl.Ensembl Plants is updated 4-6 times a year and is developed in collaboration with our international partners in the Gramene ( http://www.gramene.org ) and transPLANT projects ( http://www.transplantdb.eu ).

  1. SyntenyTracker: a tool for defining homologous synteny blocks using radiation hybrid maps and whole-genome sequence.

    PubMed

    Donthu, Ravikiran; Lewin, Harris A; Larkin, Denis M

    2009-07-23

    The recent availability of genomic sequences and BAC libraries for a large number of mammals provides an excellent opportunity for identifying comparatively-anchored markers that are useful for creating high-resolution radiation-hybrid (RH) and BAC-based comparative maps. To use these maps for multispecies genome comparison and evolutionary inference, robust bioinformatic tools are required for the identification of chromosomal regions shared between genomes and to localize the positions of evolutionary breakpoints that are the signatures of chromosomal rearrangements. Here we report an automated tool for the identification of homologous synteny blocks (HSBs) between genomes that tolerates errors common in RH comparative maps and can be used for automated whole-genome analysis of chromosome rearrangements that occur during evolution. We developed an algorithm and software tool (SyntenyTracker) that can be used for automated definition of HSBs using pair-wise RH or gene-based comparative maps as input. To verify correct implementation of the underlying algorithm, SyntenyTracker was used to identify HSBs in the cattle and human genomes. Results demonstrated 96% agreement with HSBs defined manually using the same set of rules. A comparison of SyntenyTracker with the AutoGRAPH synteny tool was performed using identical datasets containing 14,380 genes with 1:1 orthology in human and mouse. Discrepancies between the results using the two tools and advantages of SyntenyTracker are reported. SyntenyTracker was shown to be an efficient and accurate automated tool for defining HSBs using datasets that may contain minor errors resulting from limitations in map construction methodologies. The utility of SyntenyTracker will become more important for comparative genomics as the number of mapped and sequenced genomes increases.

  2. Genetic simulation tools for post-genome wide association studies of complex diseases.

    PubMed

    Chen, Huann-Sheng; Hutter, Carolyn M; Mechanic, Leah E; Amos, Christopher I; Bafna, Vineet; Hauser, Elizabeth R; Hernandez, Ryan D; Li, Chun; Liberles, David A; McAllister, Kimberly; Moore, Jason H; Paltoo, Dina N; Papanicolaou, George J; Peng, Bo; Ritchie, Marylyn D; Rosenfeld, Gabriel; Witte, John S; Gillanders, Elizabeth M; Feuer, Eric J

    2015-01-01

    Genetic simulation programs are used to model data under specified assumptions to facilitate the understanding and study of complex genetic systems. Standardized data sets generated using genetic simulation are essential for the development and application of novel analytical tools in genetic epidemiology studies. With continuing advances in high-throughput genomic technologies and generation and analysis of larger, more complex data sets, there is a need for updating current approaches in genetic simulation modeling. To provide a forum to address current and emerging challenges in this area, the National Cancer Institute (NCI) sponsored a workshop, entitled "Genetic Simulation Tools for Post-Genome Wide Association Studies of Complex Diseases" at the National Institutes of Health (NIH) in Bethesda, Maryland on March 11-12, 2014. The goals of the workshop were to (1) identify opportunities, challenges, and resource needs for the development and application of genetic simulation models; (2) improve the integration of tools for modeling and analysis of simulated data; and (3) foster collaborations to facilitate development and applications of genetic simulation. During the course of the meeting, the group identified challenges and opportunities for the science of simulation, software and methods development, and collaboration. This paper summarizes key discussions at the meeting, and highlights important challenges and opportunities to advance the field of genetic simulation. © 2014 WILEY PERIODICALS, INC.

  3. Genetic Simulation Tools for Post-Genome Wide Association Studies of Complex Diseases

    PubMed Central

    Amos, Christopher I.; Bafna, Vineet; Hauser, Elizabeth R.; Hernandez, Ryan D.; Li, Chun; Liberles, David A.; McAllister, Kimberly; Moore, Jason H.; Paltoo, Dina N.; Papanicolaou, George J.; Peng, Bo; Ritchie, Marylyn D.; Rosenfeld, Gabriel; Witte, John S.

    2014-01-01

    Genetic simulation programs are used to model data under specified assumptions to facilitate the understanding and study of complex genetic systems. Standardized data sets generated using genetic simulation are essential for the development and application of novel analytical tools in genetic epidemiology studies. With continuing advances in high-throughput genomic technologies and generation and analysis of larger, more complex data sets, there is a need for updating current approaches in genetic simulation modeling. To provide a forum to address current and emerging challenges in this area, the National Cancer Institute (NCI) sponsored a workshop, entitled “Genetic Simulation Tools for Post-Genome Wide Association Studies of Complex Diseases” at the National Institutes of Health (NIH) in Bethesda, Maryland on March 11-12, 2014. The goals of the workshop were to: (i) identify opportunities, challenges and resource needs for the development and application of genetic simulation models; (ii) improve the integration of tools for modeling and analysis of simulated data; and (iii) foster collaborations to facilitate development and applications of genetic simulation. During the course of the meeting the group identified challenges and opportunities for the science of simulation, software and methods development, and collaboration. This paper summarizes key discussions at the meeting, and highlights important challenges and opportunities to advance the field of genetic simulation. PMID:25371374

  4. Developing molecular tools for Chlamydomonas reinhardtii

    NASA Astrophysics Data System (ADS)

    Noor-Mohammadi, Samaneh

    Microalgae have garnered increasing interest over the years for their ability to produce compounds ranging from biofuels to neutraceuticals. A main focus of researchers has been to use microalgae as a natural bioreactor for the production of valuable and complex compounds. Recombinant protein expression in the chloroplasts of green algae has recently become more routine; however, the heterologous expression of multiple proteins or complete biosynthetic pathways remains a significant challenge. To take full advantage of these organisms' natural abilities, sophisticated molecular tools are needed to be able to introduce and functionally express multiple gene biosynthetic pathways in its genome. To achieve the above objective, we have sought to establish a method to construct, integrate and express multigene operons in the chloroplast and nuclear genome of the model microalgae Chlamydomonas reinhardtii. Here we show that a modified DNA Assembler approach can be used to rapidly assemble multiple-gene biosynthetic pathways in yeast and then integrate these assembled pathways at a site-specific location in the chloroplast, or by random integration in the nuclear genome of C. reinhardtii. As a proof of concept, this method was used to successfully integrate and functionally express up to three reporter proteins (AphA6, AadA, and GFP) in the chloroplast of C. reinhardtii and up to three reporter proteins (Ble, AphVIII, and GFP) in its nuclear genome. An analysis of the relative gene expression of the engineered strains showed significant differences in the mRNA expression levels of the reporter genes and thus highlights the importance of proper promoter/untranslated-region selection when constructing a target pathway. In addition, this work focuses on expressing the cofactor regeneration enzyme phosphite dehydrogenase (PTDH) in the chloroplast and nuclear genomes of C. reinhardtii. The PTDH enzyme converts phosphite into phosphate and NAD(P)+ into NAD(P)H. The reduced

  5. Probing the impact of chromatin conformation on genome editing tools

    PubMed Central

    Chen, Xiaoyu; Rinsma, Marrit; Janssen, Josephine M.; Liu, Jin; Maggio, Ignazio; Gonçalves, Manuel A.F.V.

    2016-01-01

    Transcription activator-like effector nucleases (TALENs) and RNA-guided nucleases derived from clustered, regularly interspaced, short palindromic repeats (CRISPR)-Cas9 systems have become ubiquitous genome editing tools. Despite this, the impact that distinct high-order chromatin conformations have on these sequence-specific designer nucleases is, presently, ill-defined. The same applies to the relative performance of TALENs and CRISPR/Cas9 nucleases at isogenic target sequences subjected to different epigenetic modifications. Here, to address these gaps in our knowledge, we have implemented quantitative cellular systems based on genetic reporters in which the euchromatic and heterochromatic statuses of designer nuclease target sites are stringently controlled by small-molecule drug availability. By using these systems, we demonstrate that TALENs and CRISPR/Cas9 nucleases are both significantly affected by the high-order epigenetic context of their target sequences. In addition, this outcome could also be ascertained for S. pyogenes CRISPR/Cas9 complexes harbouring Cas9 variants whose DNA cleaving specificities are superior to that of the wild-type Cas9 protein. Thus, the herein investigated cellular models will serve as valuable functional readouts for screening and assessing the role of chromatin on designer nucleases based on different platforms or with different architectures or compositions. PMID:27280977

  6. Observation Tools for Professional Development

    ERIC Educational Resources Information Center

    Malu, Kathleen F.

    2015-01-01

    Professional development of teachers, including English language teachers, empowers them to change in ways that improve teaching and learning (Gall and Acheson 2011; Murray 2010). In their seminal research on staff development--professional development in today's terms--Joyce and Showers (2002) identify key factors that promote teacher change.…

  7. The integrated microbial genomes (IMG) system in 2007: datacontent and analysis tool extensions

    SciTech Connect

    Markowitz, Victor M.; Szeto, Ernest; Palaniappan, Krishna; Grechkin, Yuri; Chu, Ken; Chen, I-Min A.; Dubchak, Inna; Anderson, Iain; Lykidis, Athanasios; Mavromatis, Konstantinos; Ivanova, Natalia N.; Kyrpides, Nikos C.

    2007-08-01

    The Integrated Microbial Genomes (IMG) system is a data management, analysis and annotation platform for all publicly available genomes. IMG contains both draft and complete JGI microbial genomes integrated with all other publicly available genomes from all three domains of life, together with a large number of plasmids and viruses. IMG provides tools and viewers for analyzing and annotating genomes, genes and functions, individually or in a comparative context. Since its first release in 2005, IMG's data content and analytical capabilities have been constantly expanded through quarterly releases. IMG is provided by the DOE-Joint Genome Institute (JGI) and is available from http://img.jgi.doe.gov.

  8. Technicolour transgenics: imaging tools for functional genomics in the mouse.

    PubMed

    Hadjantonakis, Anna-Katerina; Dickinson, Mary E; Fraser, Scott E; Papaioannou, Virginia E

    2003-08-01

    Over the past decade, a battery of powerful tools that encompass forward and reverse genetic approaches have been developed to dissect the molecular and cellular processes that regulate development and disease. The advent of genetically-encoded fluorescent proteins that are expressed in wild type and mutant mice, together with advances in imaging technology, make it possible to study these biological processes in many dimensions. Importantly, these technologies allow direct visual access to complex events as they happen in their native environment, which provides greater insights into mammalian biology than ever before.

  9. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects.

    PubMed

    Holt, Carson; Yandell, Mark

    2011-12-22

    Second-generation sequencing technologies are precipitating major shifts with regards to what kinds of genomes are being sequenced and how they are annotated. While the first generation of genome projects focused on well-studied model organisms, many of today's projects involve exotic organisms whose genomes are largely terra incognita. This complicates their annotation, because unlike first-generation projects, there are no pre-existing 'gold-standard' gene-models with which to train gene-finders. Improvements in genome assembly and the wide availability of mRNA-seq data are also creating opportunities to update and re-annotate previously published genome annotations. Today's genome projects are thus in need of new genome annotation tools that can meet the challenges and opportunities presented by second-generation sequencing technologies. We present MAKER2, a genome annotation and data management tool designed for second-generation genome projects. MAKER2 is a multi-threaded, parallelized application that can process second-generation datasets of virtually any size. We show that MAKER2 can produce accurate annotations for novel genomes where training-data are limited, of low quality or even non-existent. MAKER2 also provides an easy means to use mRNA-seq data to improve annotation quality; and it can use these data to update legacy annotations, significantly improving their quality. We also show that MAKER2 can evaluate the quality of genome annotations, and identify and prioritize problematic annotations for manual review. MAKER2 is the first annotation engine specifically designed for second-generation genome projects. MAKER2 scales to datasets of any size, requires little in the way of training data, and can use mRNA-seq data to improve annotation quality. It can also update and manage legacy genome annotation datasets.

  10. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects

    PubMed Central

    2011-01-01

    Background Second-generation sequencing technologies are precipitating major shifts with regards to what kinds of genomes are being sequenced and how they are annotated. While the first generation of genome projects focused on well-studied model organisms, many of today's projects involve exotic organisms whose genomes are largely terra incognita. This complicates their annotation, because unlike first-generation projects, there are no pre-existing 'gold-standard' gene-models with which to train gene-finders. Improvements in genome assembly and the wide availability of mRNA-seq data are also creating opportunities to update and re-annotate previously published genome annotations. Today's genome projects are thus in need of new genome annotation tools that can meet the challenges and opportunities presented by second-generation sequencing technologies. Results We present MAKER2, a genome annotation and data management tool designed for second-generation genome projects. MAKER2 is a multi-threaded, parallelized application that can process second-generation datasets of virtually any size. We show that MAKER2 can produce accurate annotations for novel genomes where training-data are limited, of low quality or even non-existent. MAKER2 also provides an easy means to use mRNA-seq data to improve annotation quality; and it can use these data to update legacy annotations, significantly improving their quality. We also show that MAKER2 can evaluate the quality of genome annotations, and identify and prioritize problematic annotations for manual review. Conclusions MAKER2 is the first annotation engine specifically designed for second-generation genome projects. MAKER2 scales to datasets of any size, requires little in the way of training data, and can use mRNA-seq data to improve annotation quality. It can also update and manage legacy genome annotation datasets. PMID:22192575

  11. Use of Genomic Tools to Improve Cattle Health in the Context of Infectious Diseases

    PubMed Central

    Raszek, Mikolaj M.; Guan, Le L.; Plastow, Graham S.

    2016-01-01

    Although infectious diseases impose a heavy economic burden on the cattle industry, the etiology of many disorders that affect livestock is not fully elucidated, and effective countermeasures are often lacking. The main tools available until now have been vaccines, antibiotics and antiparasitic drugs. Although these have been very successful in some cases, the appearance of parasite and microbial resistance to these treatments is a cause of concern. Next-generation sequencing provides important opportunities to tackle problems associated with pathogenic illnesses. This review describes the rapid gains achieved to track disease progression, identify the pathogens involved, and map pathogen interactions with the host. Use of novel genomic tools subsequently aids in treatment development, as well as successful creation of breeding programs aimed toward less susceptible livestock. These may be important tools for mitigating the long term effects of combating infection and helping reduce the reliance on antibiotic treatment. PMID:27014337

  12. Development and characterization of genomic and expressed SSRs in citrus by genome-wide analysis.

    PubMed

    Liu, Sheng-Rui; Li, Wen-Yang; Long, Dang; Hu, Chun-Gen; Zhang, Jin-Zhi

    2013-01-01

    Microsatellites or simple sequence repeats (SSRs) are one of the most popular sources of genetic markers and play a significant role in plant genetics and breeding. In this study, we identified citrus SSRs in the genome of Clementine mandarin and analyzed their frequency and distribution in different genomic regions. A total of 80,708 SSRs were detected in the genome with an overall density of 268 SSRs/Mb. While di-nucleotide repeats were the most frequent microsatellites in genomic DNA sequence, tetra-nucleotides, which had more repeat units than any other SSR types, had the highest cumulative sequence length. We identified 6,834 transcripts as containing 8,989 SSRs in 33,929 Clementine mandarin transcripts, among which, tri-nucleotide motifs (36.0%) were the most common, followed by di-nucleotide (26.9%) and hexa-nucleotide motifs (15.1%). The motif AG (16.7%) was most abundant among these SSRs, while motifs AAG (6.6%), AAT (5.0%), and TAG (2.2%) were most common among tri-nucleotides. Functional categorization of transcripts containing SSRs revealed that 5,879 (86.0%) of such transcripts had homology with known proteins, GO and KEGG annotation revealed that transcripts containing SSRs were those implicated in diverse biological processes in plants, including binding, development, transcription, and protein degradation. When 27 genomic and 78 randomly selected SSRs were tested on Clementine mandarin, 95 SSRs revealed polymorphism. These 95 SSRs were further deployed on 18 genotypes of the three generas of Rutaceae for the genetic diversity assessment, genomic SSRs generally show low transferability in comparison to SSRs developed from expressed sequences. These transcript-markers identified in our study may provide a valuable genetic and genomic tool for further genetic research and varietal development in citrus, such as diversity study, QTL mapping, molecular breeding, comparative mapping and other genetic analyses.

  13. The Princeton Protein Orthology Database (P-POD): A Comparative Genomics Analysis Tool for Biologists

    PubMed Central

    Kang, Fan; Angiuoli, Samuel V.; White, Owen; Botstein, David; Dolinski, Kara

    2007-01-01

    Many biological databases that provide comparative genomics information and tools are now available on the internet. While certainly quite useful, to our knowledge none of the existing databases combine results from multiple comparative genomics methods with manually curated information from the literature. Here we describe the Princeton Protein Orthology Database (P-POD, http://ortholog.princeton.edu), a user-friendly database system that allows users to find and visualize the phylogenetic relationships among predicted orthologs (based on the OrthoMCL method) to a query gene from any of eight eukaryotic organisms, and to see the orthologs in a wider evolutionary context (based on the Jaccard clustering method). In addition to the phylogenetic information, the database contains experimental results manually collected from the literature that can be compared to the computational analyses, as well as links to relevant human disease and gene information via the OMIM, model organism, and sequence databases. Our aim is for the P-POD resource to be extremely useful to typical experimental biologists wanting to learn more about the evolutionary context of their favorite genes. P-POD is based on the commonly used Generic Model Organism Database (GMOD) schema and can be downloaded in its entirety for installation on one's own system. Thus, bioinformaticians and software developers may also find P-POD useful because they can use the P-POD database infrastructure when developing their own comparative genomics resources and database tools. PMID:17712414

  14. Recent Advances in Algal Genetic Tool Development

    SciTech Connect

    R. Dahlin, Lukas; T. Guarnieri, Michael

    2016-06-24

    The goal of achieving cost-effective biofuels and bioproducts derived from algal biomass will require improvements along the entire value chain, including identification of robust, high-productivity strains and development of advanced genetic tools. Though there have been modest advances in development of genetic systems for the model alga Chlamydomonas reinhardtii, progress in development of algal genetic tools, especially as applied to non-model algae, has generally lagged behind that of more commonly utilized laboratory and industrial microbes. This is in part due to the complex organellar structure of algae, including robust cell walls and intricate compartmentalization of target loci, as well as prevalent gene silencing mechanisms, which hinder facile utilization of conventional genetic engineering tools and methodologies. However, recent progress in global tool development has opened the door for implementation of strain-engineering strategies in industrially-relevant algal strains. Here, we review recent advances in algal genetic tool development and applications in eukaryotic microalgae.

  15. Recent Advances in Algal Genetic Tool Development

    SciTech Connect

    R. Dahlin, Lukas; T. Guarnieri, Michael

    2016-06-24

    The goal of achieving cost-effective biofuels and bioproducts derived from algal biomass will require improvements along the entire value chain, including identification of robust, high-productivity strains and development of advanced genetic tools. Though there have been modest advances in development of genetic systems for the model alga Chlamydomonas reinhardtii, progress in development of algal genetic tools, especially as applied to non-model algae, has generally lagged behind that of more commonly utilized laboratory and industrial microbes. This is in part due to the complex organellar structure of algae, including robust cell walls and intricate compartmentalization of target loci, as well as prevalent gene silencing mechanisms, which hinder facile utilization of conventional genetic engineering tools and methodologies. However, recent progress in global tool development has opened the door for implementation of strain-engineering strategies in industrially-relevant algal strains. Here, we review recent advances in algal genetic tool development and applications in eukaryotic microalgae.

  16. Genomic Enzymology: Web Tools for Leveraging Protein Family Sequence–Function Space and Genome Context to Discover Novel Functions

    PubMed Central

    2017-01-01

    The exponentially increasing number of protein and nucleic acid sequences provides opportunities to discover novel enzymes, metabolic pathways, and metabolites/natural products, thereby adding to our knowledge of biochemistry and biology. The challenge has evolved from generating sequence information to mining the databases to integrating and leveraging the available information, i.e., the availability of “genomic enzymology” web tools. Web tools that allow identification of biosynthetic gene clusters are widely used by the natural products/synthetic biology community, thereby facilitating the discovery of novel natural products and the enzymes responsible for their biosynthesis. However, many novel enzymes with interesting mechanisms participate in uncharacterized small-molecule metabolic pathways; their discovery and functional characterization also can be accomplished by leveraging information in protein and nucleic acid databases. This Perspective focuses on two genomic enzymology web tools that assist the discovery novel metabolic pathways: (1) Enzyme Function Initiative-Enzyme Similarity Tool (EFI-EST) for generating sequence similarity networks to visualize and analyze sequence–function space in protein families and (2) Enzyme Function Initiative-Genome Neighborhood Tool (EFI-GNT) for generating genome neighborhood networks to visualize and analyze the genome context in microbial and fungal genomes. Both tools have been adapted to other applications to facilitate target selection for enzyme discovery and functional characterization. As the natural products community has demonstrated, the enzymology community needs to embrace the essential role of web tools that allow the protein and genome sequence databases to be leveraged for novel insights into enzymological problems. PMID:28826221

  17. Genome ARTIST: a robust, high-accuracy aligner tool for mapping transposon insertions and self-insertions.

    PubMed

    Ecovoiu, Alexandru Al; Ghionoiu, Iulian Constantin; Ciuca, Andrei Mihai; Ratiu, Attila Cristian

    2016-01-01

    A critical topic of insertional mutagenesis experiments performed on model organisms is mapping the hits of artificial transposons (ATs) at nucleotide level accuracy. Mapping errors may occur when sequencing artifacts or mutations as single nucleotide polymorphisms (SNPs) and small indels are present very close to the junction between a genomic sequence and a transposon inverted repeat (TIR). Another particular item of insertional mutagenesis is mapping of the transposon self-insertions and, to our best knowledge, there is no publicly available mapping tool designed to analyze such molecular events. We developed Genome ARTIST, a pairwise gapped aligner tool which works out both issues by means of an original, robust mapping strategy. Genome ARTIST is not designed to use next-generation sequencing (NGS) data but to analyze ATs insertions obtained in small to medium-scale mutagenesis experiments. Genome ARTIST employs a heuristic approach to find DNA sequence similarities and harnesses a multi-step implementation of a Smith-Waterman adapted algorithm to compute the mapping alignments. The experience is enhanced by easily customizable parameters and a user-friendly interface that describes the genomic landscape surrounding the insertion. Genome ARTIST is functional with many genomes of bacteria and eukaryotes available in Ensembl and GenBank repositories. Our tool specifically harnesses the sequence annotation data provided by FlyBase for Drosophila melanogaster (the fruit fly), which enables mapping of insertions relative to various genomic features such as natural transposons. Genome ARTIST was tested against other alignment tools using relevant query sequences derived from the D. melanogaster and Mus musculus (mouse) genomes. Real and simulated query sequences were also comparatively inquired, revealing that Genome ARTIST is a very robust solution for mapping transposon insertions. Genome ARTIST is a stand-alone user-friendly application, designed for high

  18. Genome-Wide Approaches to Drosophila Heart Development

    PubMed Central

    Frasch, Manfred

    2016-01-01

    The development of the dorsal vessel in Drosophila is one of the first systems in which key mechanisms regulating cardiogenesis have been defined in great detail at the genetic and molecular level. Due to evolutionary conservation, these findings have also provided major inputs into studies of cardiogenesis in vertebrates. Many of the major components that control Drosophila cardiogenesis were discovered based on candidate gene approaches and their functions were defined by employing the outstanding genetic tools and molecular techniques available in this system. More recently, approaches have been taken that aim to interrogate the entire genome in order to identify novel components and describe genomic features that are pertinent to the regulation of heart development. Apart from classical forward genetic screens, the availability of the thoroughly annotated Drosophila genome sequence made new genome-wide approaches possible, which include the generation of massive numbers of RNA interference (RNAi) reagents that were used in forward genetic screens, as well as studies of the transcriptomes and proteomes of the developing heart under normal and experimentally manipulated conditions. Moreover, genome-wide chromatin immunoprecipitation experiments have been performed with the aim to define the full set of genomic binding sites of the major cardiogenic transcription factors, their relevant target genes, and a more complete picture of the regulatory network that drives cardiogenesis. This review will give an overview on these genome-wide approaches to Drosophila heart development and on computational analyses of the obtained information that ultimately aim to provide a description of this process at the systems level. PMID:27294102

  19. Genome Duplication: The Heartbeat of Developing Organisms

    PubMed Central

    DePamphilis, Melvin L.

    2016-01-01

    The mechanism that duplicates the nuclear genome during the trillions of cell divisions required to develop from zygote to adult is the same throughout the eukarya, but the mechanisms that determine where, when and how much nuclear genome duplication occur regulate development and differ among the eukarya. They allow organisms to change the rate of cell proliferation during development, to activate zygotic gene expression independently of DNA replication, and to restrict nuclear DNA replication to once per cell division. They allow specialized cells to exit their mitotic cell cycle and differentiate into polyploid cells, and in some cases, to amplify the number of copies of specific genes. It is genome duplication that drives evolution, by virtue of the errors that inevitably occur when the same process is repeated trillions of times. It is, unfortunately, the same errors that produce age-related genetic disorders such as cancer. PMID:26970621

  20. Genetic resources offer efficient tools for rice functional genomics research.

    PubMed

    Lo, Shuen-Fang; Fan, Ming-Jen; Hsing, Yue-Ie; Chen, Liang-Jwu; Chen, Shu; Wen, Ien-Chie; Liu, Yi-Lun; Chen, Ku-Ting; Jiang, Mirng-Jier; Lin, Ming-Kuang; Rao, Meng-Yen; Yu, Lin-Chih; Ho, Tuan-Hua David; Yu, Su-May

    2016-05-01

    Rice is an important crop and major model plant for monocot functional genomics studies. With the establishment of various genetic resources for rice genomics, the next challenge is to systematically assign functions to predicted genes in the rice genome. Compared with the robustness of genome sequencing and bioinformatics techniques, progress in understanding the function of rice genes has lagged, hampering the utilization of rice genes for cereal crop improvement. The use of transfer DNA (T-DNA) insertional mutagenesis offers the advantage of uniform distribution throughout the rice genome, but preferentially in gene-rich regions, resulting in direct gene knockout or activation of genes within 20-30 kb up- and downstream of the T-DNA insertion site and high gene tagging efficiency. Here, we summarize the recent progress in functional genomics using the T-DNA-tagged rice mutant population. We also discuss important features of T-DNA activation- and knockout-tagging and promoter-trapping of the rice genome in relation to mutant and candidate gene characterizations and how to more efficiently utilize rice mutant populations and datasets for high-throughput functional genomics and phenomics studies by forward and reverse genetics approaches. These studies may facilitate the translation of rice functional genomics research to improvements of rice and other cereal crops. © 2015 John Wiley & Sons Ltd.

  1. E-TALEN: a web tool to design TALENs for genome engineering.

    PubMed

    Heigwer, Florian; Kerr, Grainne; Walther, Nike; Glaeser, Kathrin; Pelz, Oliver; Breinig, Marco; Boutros, Michael

    2013-11-01

    Use of transcription activator-like effector nucleases (TALENs) is a promising new technique in the field of targeted genome engineering, editing and reverse genetics. Its applications span from introducing knockout mutations to endogenous tagging of proteins and targeted excision repair. Owing to this wide range of possible applications, there is a need for fast and user-friendly TALEN design tools. We developed E-TALEN (http://www.e-talen.org), a web-based tool to design TALENs for experiments of varying scale. E-TALEN enables the design of TALENs against a single target or a large number of target genes. We significantly extended previously published design concepts to consider genomic context and different applications. E-TALEN guides the user through an end-to-end design process of de novo TALEN pairs, which are specific to a certain sequence or genomic locus. Furthermore, E-TALEN offers a functionality to predict targeting and specificity for existing TALENs. Owing to the computational complexity of many of the steps in the design of TALENs, particular emphasis has been put on the implementation of fast yet accurate algorithms. We implemented a user-friendly interface, from the input parameters to the presentation of results. An additional feature of E-TALEN is the in-built sequence and annotation database available for many organisms, including human, mouse, zebrafish, Drosophila and Arabidopsis, which can be extended in the future.

  2. PAPA: a flexible tool for identifying pleiotropic pathways using genome-wide association study summaries.

    PubMed

    Wen, Yan; Wang, Wenyu; Guo, Xiong; Zhang, Feng

    2016-03-15

    : Pleiotropy is common in the genetic architectures of complex diseases. To the best of our knowledge, no analysis tool has been developed for identifying pleiotropic pathways using multiple genome-wide association study (GWAS) summaries by now. Here, we present PAPA, a flexible tool for pleiotropic pathway analysis utilizing GWAS summary results. The performance of PAPA was validated using publicly available GWAS summaries of body mass index and waist-hip ratio of the GIANT datasets. PAPA identified a set of pleiotropic pathways, which have been demonstrated to be involved in the development of obesity. PAPA program, document and illustrative example are available at http://sourceforge.net/projects/papav1/files/ : fzhxjtu@mail.xjtu.edu.cn Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  3. Epigenetic Mechanisms Underlie Genome Development

    ERIC Educational Resources Information Center

    Lamm, Ehud

    2013-01-01

    Technological and methodological advances, in particular next-generation sequencing and chromatin profiling, has led to a deluge of data on epigenetic mechanisms and processes. Epigenetic regulation in the brain is no exception. In this commentary, Ehud Lamm writes that extending existing frameworks for thinking about psychological development to…

  4. Development and Application of Camelid Molecular Cytogenetic Tools

    PubMed Central

    Avila, Felipe; Das, Pranab J.; Kutzler, Michelle; Owens, Elaine; Perelman, Polina; Rubes, Jiri; Hornak, Miroslav; Johnson, Warren E.

    2014-01-01

    Cytogenetic chromosome maps offer molecular tools for genome analysis and clinical cytogenetics and are of particular importance for species with difficult karyotypes, such as camelids (2n = 74). Building on the available human–camel zoo-fluorescence in situ hybridization (FISH) data, we developed the first cytogenetic map for the alpaca (Lama pacos, LPA) genome by isolating and identifying 151 alpaca bacterial artificial chromosome (BAC) clones corresponding to 44 specific genes. The genes were mapped by FISH to 31 alpaca autosomes and the sex chromosomes; 11 chromosomes had 2 markers, which were ordered by dual-color FISH. The STS gene mapped to Xpter/Ypter, demarcating the pseudoautosomal region, whereas no markers were assigned to chromosomes 14, 21, 22, 28, and 36. The chromosome-specific markers were applied in clinical cytogenetics to identify LPA20, the major histocompatibility complex (MHC)-carrying chromosome, as a part of an autosomal translocation in a sterile male llama (Lama glama, LGL; 2n = 73,XY). FISH with LPAX BACs and LPA36 paints, as well as comparative genomic hybridization, were also used to investigate the origin of the minute chromosome, an abnormally small LPA36 in infertile female alpacas. This collection of cytogenetically mapped markers represents a new tool for camelid clinical cytogenetics and has applications for the improvement of the alpaca genome map and sequence assembly. PMID:23109720

  5. MP3: a software tool for the prediction of pathogenic proteins in genomic and metagenomic data.

    PubMed

    Gupta, Ankit; Kapil, Rohan; Dhakan, Darshan B; Sharma, Vineet K

    2014-01-01

    The identification of virulent proteins in any de-novo sequenced genome is useful in estimating its pathogenic ability and understanding the mechanism of pathogenesis. Similarly, the identification of such proteins could be valuable in comparing the metagenome of healthy and diseased individuals and estimating the proportion of pathogenic species. However, the common challenge in both the above tasks is the identification of virulent proteins since a significant proportion of genomic and metagenomic proteins are novel and yet unannotated. The currently available tools which carry out the identification of virulent proteins provide limited accuracy and cannot be used on large datasets. Therefore, we have developed an MP3 standalone tool and web server for the prediction of pathogenic proteins in both genomic and metagenomic datasets. MP3 is developed using an integrated Support Vector Machine (SVM) and Hidden Markov Model (HMM) approach to carry out highly fast, sensitive and accurate prediction of pathogenic proteins. It displayed Sensitivity, Specificity, MCC and accuracy values of 92%, 100%, 0.92 and 96%, respectively, on blind dataset constructed using complete proteins. On the two metagenomic blind datasets (Blind A: 51-100 amino acids and Blind B: 30-50 amino acids), it displayed Sensitivity, Specificity, MCC and accuracy values of 82.39%, 97.86%, 0.80 and 89.32% for Blind A and 71.60%, 94.48%, 0.67 and 81.86% for Blind B, respectively. In addition, the performance of MP3 was validated on selected bacterial genomic and real metagenomic datasets. To our knowledge, MP3 is the only program that specializes in fast and accurate identification of partial pathogenic proteins predicted from short (100-150 bp) metagenomic reads and also performs exceptionally well on complete protein sequences. MP3 is publicly available at http://metagenomics.iiserb.ac.in/mp3/index.php.

  6. MP3: A Software Tool for the Prediction of Pathogenic Proteins in Genomic and Metagenomic Data

    PubMed Central

    Gupta, Ankit; Kapil, Rohan; Dhakan, Darshan B.; Sharma, Vineet K.

    2014-01-01

    The identification of virulent proteins in any de-novo sequenced genome is useful in estimating its pathogenic ability and understanding the mechanism of pathogenesis. Similarly, the identification of such proteins could be valuable in comparing the metagenome of healthy and diseased individuals and estimating the proportion of pathogenic species. However, the common challenge in both the above tasks is the identification of virulent proteins since a significant proportion of genomic and metagenomic proteins are novel and yet unannotated. The currently available tools which carry out the identification of virulent proteins provide limited accuracy and cannot be used on large datasets. Therefore, we have developed an MP3 standalone tool and web server for the prediction of pathogenic proteins in both genomic and metagenomic datasets. MP3 is developed using an integrated Support Vector Machine (SVM) and Hidden Markov Model (HMM) approach to carry out highly fast, sensitive and accurate prediction of pathogenic proteins. It displayed Sensitivity, Specificity, MCC and accuracy values of 92%, 100%, 0.92 and 96%, respectively, on blind dataset constructed using complete proteins. On the two metagenomic blind datasets (Blind A: 51–100 amino acids and Blind B: 30–50 amino acids), it displayed Sensitivity, Specificity, MCC and accuracy values of 82.39%, 97.86%, 0.80 and 89.32% for Blind A and 71.60%, 94.48%, 0.67 and 81.86% for Blind B, respectively. In addition, the performance of MP3 was validated on selected bacterial genomic and real metagenomic datasets. To our knowledge, MP3 is the only program that specializes in fast and accurate identification of partial pathogenic proteins predicted from short (100–150 bp) metagenomic reads and also performs exceptionally well on complete protein sequences. MP3 is publicly available at http://metagenomics.iiserb.ac.in/mp3/index.php. PMID:24736651

  7. CoryneBase: Corynebacterium Genomic Resources and Analysis Tools at Your Fingertips

    PubMed Central

    Tan, Mui Fern; Jakubovics, Nick S.; Wee, Wei Yee; Mutha, Naresh V. R.; Wong, Guat Jah; Ang, Mia Yang; Yazdi, Amir Hessam; Choo, Siew Woh

    2014-01-01

    Corynebacteria are used for a wide variety of industrial purposes but some species are associated with human diseases. With increasing number of corynebacterial genomes having been sequenced, comparative analysis of these strains may provide better understanding of their biology, phylogeny, virulence and taxonomy that may lead to the discoveries of beneficial industrial strains or contribute to better management of diseases. To facilitate the ongoing research of corynebacteria, a specialized central repository and analysis platform for the corynebacterial research community is needed to host the fast-growing amount of genomic data and facilitate the analysis of these data. Here we present CoryneBase, a genomic database for Corynebacterium with diverse functionality for the analysis of genomes aimed to provide: (1) annotated genome sequences of Corynebacterium where 165,918 coding sequences and 4,180 RNAs can be found in 27 species; (2) access to comprehensive Corynebacterium data through the use of advanced web technologies for interactive web interfaces; and (3) advanced bioinformatic analysis tools consisting of standard BLAST for homology search, VFDB BLAST for sequence homology search against the Virulence Factor Database (VFDB), Pairwise Genome Comparison (PGC) tool for comparative genomic analysis, and a newly designed Pathogenomics Profiling Tool (PathoProT) for comparative pathogenomic analysis. CoryneBase offers the access of a range of Corynebacterium genomic resources as well as analysis tools for comparative genomics and pathogenomics. It is publicly available at http://corynebacterium.um.edu.my/. PMID:24466021

  8. GACT: a Genome build and Allele definition Conversion Tool for SNP imputation and meta-analysis in genetic association studies.

    PubMed

    Sulovari, Arvis; Li, Dawei

    2014-07-19

    Genome-wide association studies (GWAS) have successfully identified genes associated with complex human diseases. Although much of the heritability remains unexplained, combining single nucleotide polymorphism (SNP) genotypes from multiple studies for meta-analysis will increase the statistical power to identify new disease-associated variants. Meta-analysis requires same allele definition (nomenclature) and genome build among individual studies. Similarly, imputation, commonly-used prior to meta-analysis, requires the same consistency. However, the genotypes from various GWAS are generated using different genotyping platforms, arrays or SNP-calling approaches, resulting in use of different genome builds and allele definitions. Incorrect assumptions of identical allele definition among combined GWAS lead to a large portion of discarded genotypes or incorrect association findings. There is no published tool that predicts and converts among all major allele definitions. In this study, we have developed a tool, GACT, which stands for Genome build and Allele definition Conversion Tool, that predicts and inter-converts between any of the common SNP allele definitions and between the major genome builds. In addition, we assessed several factors that may affect imputation quality, and our results indicated that inclusion of singletons in the reference had detrimental effects while ambiguous SNPs had no measurable effect. Unexpectedly, exclusion of genotypes with missing rate > 0.001 (40% of study SNPs) showed no significant decrease of imputation quality (even significantly higher when compared to the imputation with singletons in the reference), especially for rare SNPs. GACT is a new, powerful, and user-friendly tool with both command-line and interactive online versions that can accurately predict, and convert between any of the common allele definitions and between genome builds for genome-wide meta-analysis and imputation of genotypes from SNP-arrays or deep

  9. GACT: a Genome build and Allele definition Conversion Tool for SNP imputation and meta-analysis in genetic association studies

    PubMed Central

    2014-01-01

    Background Genome-wide association studies (GWAS) have successfully identified genes associated with complex human diseases. Although much of the heritability remains unexplained, combining single nucleotide polymorphism (SNP) genotypes from multiple studies for meta-analysis will increase the statistical power to identify new disease-associated variants. Meta-analysis requires same allele definition (nomenclature) and genome build among individual studies. Similarly, imputation, commonly-used prior to meta-analysis, requires the same consistency. However, the genotypes from various GWAS are generated using different genotyping platforms, arrays or SNP-calling approaches, resulting in use of different genome builds and allele definitions. Incorrect assumptions of identical allele definition among combined GWAS lead to a large portion of discarded genotypes or incorrect association findings. There is no published tool that predicts and converts among all major allele definitions. Results In this study, we have developed a tool, GACT, which stands for Genome build and Allele definition Conversion Tool, that predicts and inter-converts between any of the common SNP allele definitions and between the major genome builds. In addition, we assessed several factors that may affect imputation quality, and our results indicated that inclusion of singletons in the reference had detrimental effects while ambiguous SNPs had no measurable effect. Unexpectedly, exclusion of genotypes with missing rate > 0.001 (40% of study SNPs) showed no significant decrease of imputation quality (even significantly higher when compared to the imputation with singletons in the reference), especially for rare SNPs. Conclusion GACT is a new, powerful, and user-friendly tool with both command-line and interactive online versions that can accurately predict, and convert between any of the common allele definitions and between genome builds for genome-wide meta-analysis and imputation of

  10. Quality Assurance Project Plan Development Tool

    EPA Pesticide Factsheets

    This tool contains information designed to assist in developing a Quality Assurance (QA) Project Plan that meets EPA requirements for projects that involve surface or groundwater monitoring and/or the collection and analysis of water samples.

  11. SHARAD Radargram Analysis Tool Development in JMARS

    NASA Astrophysics Data System (ADS)

    Adler, J. B.; Anwar, S.; Dickenshied, S.; Carter, S.

    2016-09-01

    New tools are being developed in JMARS, a free GIS software, for SHARAD radargram viewing and analysis. These capabilities are useful for the polar science community, and for constraining the viability of ice resource deposits for human exploration.

  12. Genomic Landscape of Developing Male Germ Cells

    PubMed Central

    Lee, Tin-Lap; Pang, Alan Lap-Yin; Rennert, Owen M.; Chan, Wai-Yee

    2010-01-01

    Spermatogenesis is a highly orchestrated developmental process by which spermatogonia develop into mature spermatozoa. This process involves many testis- or male germ cell-specific gene products whose expressions are strictly regulated. In the past decade the advent of high-throughput gene expression analytical techniques has made functional genomic studies of this process, particularly in model animals such as mice and rats, feasible and practical. These studies have just begun to reveal the complexity of the genomic landscape of the developing male germ cells. Over 50% of the mouse and rat genome are expressed during testicular development. Among transcripts present in germ cells, 40% – 60% are uncharacterized. A number of genes, and consequently their associated biological pathways, are differentially expressed at different stages of spermatogenesis. Developing male germ cells present a rich repertoire of genetic processes. Tissue-specific as well as spermatogenesis stage-specific alternative splicing of genes exemplifies the complexity of genome expression. In addition to this layer of control, discoveries of abundant presence of antisense transcripts, expressed psuedogenes, non-coding RNAs (ncRNA) including long ncRNAs, microRNAs (miRNAs) and Piwi-interacting RNAs (piRNAs), and retrogenes all point to the presence of multiple layers of expression and functional regulation in male germ cells. It is anticipated that application of systems biology approaches will further our understanding of the regulatory mechanism of spermatogenesis.† PMID:19306351

  13. Mi-DISCOVERER: A bioinformatics tool for the detection of mi-RNA in human genome

    PubMed Central

    Arshad, Saadia; Mumtaz, Asia; Ahmad, Freed; Liaquat, Sadia; Nadeem, Shahid; Mehboob, Shahid; Afzal, Muhammad

    2010-01-01

    MicroRNAs (miRNAs) are 22 nucleotides non-coding RNAs that play pivotal regulatory roles in diverse organisms including the humans and are difficult to be identified due to lack of either sequence features or robust algorithms to efficiently identify. Therefore, we made a tool that is Mi-Discoverer for the detection of miRNAs in human genome. The tools used for the development of software are Microsoft Office Access 2003, the JDK version 1.6.0, BioJava version 1.0, and the NetBeans IDE version 6.0. All already made miRNAs softwares were web based; so the advantage of our project was to make a desktop facility to the user for sequence alignment search with already identified miRNAs of human genome present in the database. The user can also insert and update the newly discovered human miRNA in the database. Mi-Discoverer, a bioinformatics tool successfully identifies human miRNAs based on multiple sequence alignment searches. It's a non redundant database containing a large collection of publicly available human miRNAs. PMID:21364831

  14. Data Mining and Optimization Tools for Developing Engine Parameters Tools

    NASA Technical Reports Server (NTRS)

    Dhawan, Atam P.

    1998-01-01

    This project was awarded for understanding the problem and developing a plan for Data Mining tools for use in designing and implementing an Engine Condition Monitoring System. Tricia Erhardt and I studied the problem domain for developing an Engine Condition Monitoring system using the sparse and non-standardized datasets to be available through a consortium at NASA Lewis Research Center. We visited NASA three times to discuss additional issues related to dataset which was not made available to us. We discussed and developed a general framework of data mining and optimization tools to extract useful information from sparse and non-standard datasets. These discussions lead to the training of Tricia Erhardt to develop Genetic Algorithm based search programs which were written in C++ and used to demonstrate the capability of GA algorithm in searching an optimal solution in noisy, datasets. From the study and discussion with NASA LeRC personnel, we then prepared a proposal, which is being submitted to NASA for future work for the development of data mining algorithms for engine conditional monitoring. The proposed set of algorithm uses wavelet processing for creating multi-resolution pyramid of tile data for GA based multi-resolution optimal search.

  15. Data Mining and Optimization Tools for Developing Engine Parameters Tools

    NASA Technical Reports Server (NTRS)

    Dhawan, Atam P.

    1998-01-01

    This project was awarded for understanding the problem and developing a plan for Data Mining tools for use in designing and implementing an Engine Condition Monitoring System. Tricia Erhardt and I studied the problem domain for developing an Engine Condition Monitoring system using the sparse and non-standardized datasets to be available through a consortium at NASA Lewis Research Center. We visited NASA three times to discuss additional issues related to dataset which was not made available to us. We discussed and developed a general framework of data mining and optimization tools to extract useful information from sparse and non-standard datasets. These discussions lead to the training of Tricia Erhardt to develop Genetic Algorithm based search programs which were written in C++ and used to demonstrate the capability of GA algorithm in searching an optimal solution in noisy, datasets. From the study and discussion with NASA LeRC personnel, we then prepared a proposal, which is being submitted to NASA for future work for the development of data mining algorithms for engine conditional monitoring. The proposed set of algorithm uses wavelet processing for creating multi-resolution pyramid of tile data for GA based multi-resolution optimal search.

  16. Software development tools: A bibliography, appendix C.

    NASA Technical Reports Server (NTRS)

    Riddle, W. E.

    1980-01-01

    A bibliography containing approximately 200 citations on tools which help software developers perform some development task (such as text manipulation, testing, etc.), and which would not necessarily be found as part of a computing facility is given. The bibliography comes from a relatively random sampling of the literature and is not complete. But it is indicative of the nature and range of tools currently being prepared or currently available.

  17. Phytozome: a Tool for Green Plant Comparative Genomics

    DOE Data Explorer

    Phytozome is a joint project of the Department of Energy's Joint Genome Institute and the Center for Integrative Genomics to facilitate comparative genomic studies amongst green plants. Clusters of orthologous and paralogous genes that represent the modern descendents of ancestral gene sets are constructed at key phylogenetic nodes. These clusters allow easy access to clade specific orthology/paralogy relationships as well as clade specific genes and gene expansions. As of release v4.0, Phytozome provides access to nine sequenced and annotated green plant genomes, eight of which have been clustered into gene families at six evolutionarily significant nodes. Where possible, each gene has been annotated with PFAM, KOG, KEGG, and PANTHER assignments, and publicly available annotations from RefSeq, UniProt, TAIR, JGI are hyper-linked and searchable. [Copied from the Overview at http://www.phytozome.net/Phytozome_info.php

  18. The GLOBE 3D Genome Platform - towards a novel system-biological paper tool to integrate the huge complexity of genome organization and function.

    PubMed

    Knoch, Tobias A; Lesnussa, Michael; Kepper, Nick; Eussen, Hubert B; Grosveld, Frank G

    2009-01-01

    Genomes are tremendous co-evolutionary holistic systems for molecular storage, processing and fabrication of information. Their system-biological complexity remains, however, still largely mysterious, despite immense sequencing achievements and huge advances in the understanding of the general sequential, three-dimensional and regulatory organization. Here, we present the GLOBE 3D Genome Platform a completely novel grid based virtual "paper" tool and in fact the first system-biological genome browser integrating the holistic complexity of genomes in a single easy comprehensible platform: Based on a detailed study of biophysical and IT requirements, every architectural level from sequence to morphology of one or several genomes can be approached in a real and in a symbolic representation simultaneously and navigated by continuous scale-free zooming within a unique three-dimensional OpenGL and grid driven environment. In principle an unlimited number of multi-dimensional data sets can be visualized, customized in terms of arrangement, shape, colour, and texture etc. as well as accessed and annotated individually or in groups using internal or external data bases/facilities. Any information can be searched and correlated by importing or calculating simple relations in real-time using grid resources. A general correlation and application platform for more complex correlative analysis and a front-end for system-biological simulations both using again the huge capabilities of grid infrastructures is currently under development. Hence, the GLOBE 3D Genome Platform is an example of a grid based approach towards a virtual desktop for genomic work combining the three fundamental distributed resources: i) visual data representation, ii) data access and management, and iii) data analysis and creation. Thus, the GLOBE 3D Genome Platform is the novel system-biology oriented information system urgently needed to access, present, annotate, and to simulate the holistic genome

  19. G-IMEx: A comprehensive software tool for detection of microsatellites from genome sequences

    PubMed Central

    Mudunuri, Suresh B; Kumar, Pankaj; Rao, Allam Appa; Pallamsetty, S; Nagarajaram, H A

    2010-01-01

    Microsatellites are ubiquitous short tandem repeats found in all known genomes and are known to play a very important role in various studies and fields including DNA fingerprinting, paternity studies, evolutionary studies, virulence and adaptation of certain bacteria and viruses etc. Due to the sequencing of several genomes and the availability of enormous amounts of sequence data during the past few years, computational studies of microsatellites are of interest for many researchers. In this context, we developed a software tool called Imperfect Microsatellite Extractor (IMEx), to extract perfect, imperfect and compound microsatellites from genome sequences along with their complete statistics. Recently we developed a user-friendly graphical-interface using JAVA for IMEx to be used as a stand-alone software named G-IMEx. G-IMEx takes a nucleotide sequence as an input and the results are produced in both html and text formats. The Linux version of G-IMEx can be downloaded for free from http://www.cdfd.org.in/imex PMID:21364802

  20. Current genome editing tools in gene therapy: new approaches to treat cancer.

    PubMed

    Shuvalov, Oleg; Petukhov, Alexey; Daks, Alexandra; Fedorova, Olga; Ermakov, Alexander; Melino, Gerry; Barlev, Nickolai A

    2015-01-01

    Gene therapy suggests a promising approach to treat genetic diseases by applying genes as pharmaceuticals. Cancer is a complex disease, which strongly depends on a particular genetic make-up and hence can be treated with gene therapy. From about 2,000 clinical trials carried out so far, more than 60% were cancer targeted. Development of precise and effective gene therapy approaches is intimately connected with achievements in the molecular biology techniques. The field of gene therapy was recently revolutionized by the introduction of "programmable" nucleases, including ZFNs, TALENs, and CRISPR, which target specific genomic loci with high efficacy and precision. Furthermore, when combined with DNA transposons for the delivery purposes into cells, these programmable nucleases represent a promising alternative to the conventional viral-mediated gene delivery. In addition to "programmable" nucleases, a new class of TALE- and CRISPR-based "artificial transcription effectors" has been developed to mediate precise regulation of specific genes. In sum, these new molecular tools may be used in a wide plethora of gene therapy strategies. This review highlights the current status of novel genome editing tools and discusses their suitability and perspectives in respect to cancer gene therapy studies.

  1. Rainbow: a tool for large-scale whole-genome sequencing data analysis using cloud computing.

    PubMed

    Zhao, Shanrong; Prenger, Kurt; Smith, Lance; Messina, Thomas; Fan, Hongtao; Jaeger, Edward; Stephens, Susan

    2013-06-27

    Technical improvements have decreased sequencing costs and, as a result, the size and number of genomic datasets have increased rapidly. Because of the lower cost, large amounts of sequence data are now being produced by small to midsize research groups. Crossbow is a software tool that can detect single nucleotide polymorphisms (SNPs) in whole-genome sequencing (WGS) data from a single subject; however, Crossbow has a number of limitations when applied to multiple subjects from large-scale WGS projects. The data storage and CPU resources that are required for large-scale whole genome sequencing data analyses are too large for many core facilities and individual laboratories to provide. To help meet these challenges, we have developed Rainbow, a cloud-based software package that can assist in the automation of large-scale WGS data analyses. Here, we evaluated the performance of Rainbow by analyzing 44 different whole-genome-sequenced subjects. Rainbow has the capacity to process genomic data from more than 500 subjects in two weeks using cloud computing provided by the Amazon Web Service. The time includes the import and export of the data using Amazon Import/Export service. The average cost of processing a single sample in the cloud was less than 120 US dollars. Compared with Crossbow, the main improvements incorporated into Rainbow include the ability: (1) to handle BAM as well as FASTQ input files; (2) to split large sequence files for better load balance downstream; (3) to log the running metrics in data processing and monitoring multiple Amazon Elastic Compute Cloud (EC2) instances; and (4) to merge SOAPsnp outputs for multiple individuals into a single file to facilitate downstream genome-wide association studies. Rainbow is a scalable, cost-effective, and open-source tool for large-scale WGS data analysis. For human WGS data sequenced by either the Illumina HiSeq 2000 or HiSeq 2500 platforms, Rainbow can be used straight out of the box. Rainbow is available

  2. Rainbow: a tool for large-scale whole-genome sequencing data analysis using cloud computing

    PubMed Central

    2013-01-01

    Background Technical improvements have decreased sequencing costs and, as a result, the size and number of genomic datasets have increased rapidly. Because of the lower cost, large amounts of sequence data are now being produced by small to midsize research groups. Crossbow is a software tool that can detect single nucleotide polymorphisms (SNPs) in whole-genome sequencing (WGS) data from a single subject; however, Crossbow has a number of limitations when applied to multiple subjects from large-scale WGS projects. The data storage and CPU resources that are required for large-scale whole genome sequencing data analyses are too large for many core facilities and individual laboratories to provide. To help meet these challenges, we have developed Rainbow, a cloud-based software package that can assist in the automation of large-scale WGS data analyses. Results Here, we evaluated the performance of Rainbow by analyzing 44 different whole-genome-sequenced subjects. Rainbow has the capacity to process genomic data from more than 500 subjects in two weeks using cloud computing provided by the Amazon Web Service. The time includes the import and export of the data using Amazon Import/Export service. The average cost of processing a single sample in the cloud was less than 120 US dollars. Compared with Crossbow, the main improvements incorporated into Rainbow include the ability: (1) to handle BAM as well as FASTQ input files; (2) to split large sequence files for better load balance downstream; (3) to log the running metrics in data processing and monitoring multiple Amazon Elastic Compute Cloud (EC2) instances; and (4) to merge SOAPsnp outputs for multiple individuals into a single file to facilitate downstream genome-wide association studies. Conclusions Rainbow is a scalable, cost-effective, and open-source tool for large-scale WGS data analysis. For human WGS data sequenced by either the Illumina HiSeq 2000 or HiSeq 2500 platforms, Rainbow can be used straight out of

  3. Computational Tools to Accelerate Commercial Development

    SciTech Connect

    Miller, David C

    2013-01-01

    The goals of the work reported are: to develop new computational tools and models to enable industry to more rapidly develop and deploy new advanced energy technologies; to demonstrate the capabilities of the CCSI Toolset on non-proprietary case studies; and to deploy the CCSI Toolset to industry. Challenges of simulating carbon capture (and other) processes include: dealing with multiple scales (particle, device, and whole process scales); integration across scales; verification, validation, and uncertainty; and decision support. The tools cover: risk analysis and decision making; validated, high-fidelity CFD; high-resolution filtered sub-models; process design and optimization tools; advanced process control and dynamics; process models; basic data sub-models; and cross-cutting integration tools.

  4. Data Mining and Optimization Tools for Developing Engine Parameters Tools

    NASA Technical Reports Server (NTRS)

    Dhawan, Atam P.

    1998-01-01

    This project was awarded for understanding the problem and developing a plan for Data Mining tools for use in designing and implementing an Engine Condition Monitoring System. From the total budget of $5,000, Tricia and I studied the problem domain for developing ail Engine Condition Monitoring system using the sparse and non-standardized datasets to be available through a consortium at NASA Lewis Research Center. We visited NASA three times to discuss additional issues related to dataset which was not made available to us. We discussed and developed a general framework of data mining and optimization tools to extract useful information from sparse and non-standard datasets. These discussions lead to the training of Tricia Erhardt to develop Genetic Algorithm based search programs which were written in C++ and used to demonstrate the capability of GA algorithm in searching an optimal solution in noisy datasets. From the study and discussion with NASA LERC personnel, we then prepared a proposal, which is being submitted to NASA for future work for the development of data mining algorithms for engine conditional monitoring. The proposed set of algorithm uses wavelet processing for creating multi-resolution pyramid of the data for GA based multi-resolution optimal search. Wavelet processing is proposed to create a coarse resolution representation of data providing two advantages in GA based search: 1. We will have less data to begin with to make search sub-spaces. 2. It will have robustness against the noise because at every level of wavelet based decomposition, we will be decomposing the signal into low pass and high pass filters.

  5. Development of a transportation planning tool

    SciTech Connect

    Funkhouser, B.R.; Moyer, J.W.; Ballweg, E.L.

    1994-07-01

    This paper describes the application of simulation modeling and logistics techniques to the development of a planning tool for the Department of Energy (DOE). The focus of the Transportation Planning Model (TPM) tool is to aid DOE and Sandia analysts in the planning of future fleet sizes, driver and support personnel sizes, base site locations, and resource balancing among the base sites. The design approach is to develop a rapid modeling environment which will allow analysts to easily set up a shipment scenario and perform multiple ``what if`` evaluations. The TPM is being developed on personal computers using commercial off-the shelf (COTS) software tools under the WINDOWS{reg_sign} operating environment. Prototype development of the TPM has been completed.

  6. BACs as tools for the study of genomic imprinting.

    PubMed

    Tunster, S J; Van De Pette, M; John, R M

    2011-01-01

    Genomic imprinting in mammals results in the expression of genes from only one parental allele. Imprinting occurs as a consequence of epigenetic marks set down either in the father's or the mother's germ line and affects a very specific category of mammalian gene. A greater understanding of this distinctive phenomenon can be gained from studies using large genomic clones, called bacterial artificial chromosomes (BACs). Here, we review the important applications of BACs to imprinting research, covering physical mapping studies and the use of BACs as transgenes in mice to study gene expression patterns, to identify imprinting centres, and to isolate the consequences of altered gene dosage. We also highlight the significant and unique advantages that rapid BAC engineering brings to genomic imprinting research.

  7. Recent Advances in Algal Genetic Tool Development

    DOE PAGES

    R. Dahlin, Lukas; T. Guarnieri, Michael

    2016-06-24

    The goal of achieving cost-effective biofuels and bioproducts derived from algal biomass will require improvements along the entire value chain, including identification of robust, high-productivity strains and development of advanced genetic tools. Though there have been modest advances in development of genetic systems for the model alga Chlamydomonas reinhardtii, progress in development of algal genetic tools, especially as applied to non-model algae, has generally lagged behind that of more commonly utilized laboratory and industrial microbes. This is in part due to the complex organellar structure of algae, including robust cell walls and intricate compartmentalization of target loci, as well asmore » prevalent gene silencing mechanisms, which hinder facile utilization of conventional genetic engineering tools and methodologies. However, recent progress in global tool development has opened the door for implementation of strain-engineering strategies in industrially-relevant algal strains. Here, we review recent advances in algal genetic tool development and applications in eukaryotic microalgae.« less

  8. Development of genome- and transcriptome-derived microsatellites in related species of snapping shrimps with highly duplicated genomes.

    PubMed

    Gaynor, Kaitlyn M; Solomon, Joseph W; Siller, Stefanie; Jessell, Linnet; Duffy, J Emmett; Rubenstein, Dustin R

    2017-08-04

    Molecular markers are powerful tools for studying patterns of relatedness and parentage within populations and for making inferences about social evolution. However, the development of molecular markers for simultaneous study of multiple species presents challenges, particularly when species exhibit genome duplication or polyploidy. We developed microsatellite markers for Synalpheus shrimp, a genus in which species exhibit not only great variation in social organization, but also interspecific variation in genome size and partial genome duplication. From the four primary clades within Synalpheus, we identified microsatellites in the genomes of four species and in the consensus transcriptome of two species. Ultimately, we designed and tested primers for 143 microsatellite markers across 25 species. Although the majority of markers were disomic, many markers were polysomic for certain species. Surprisingly, we found no relationship between genome size and the number of polysomic markers. As expected, markers developed for a given species amplified better for closely related species than for more distant relatives. Finally, the markers developed from the transcriptome were more likely to work successfully and to be disomic than those developed from the genome, suggesting that consensus transcriptomes are likely to be conserved across species. Our findings suggest that the transcriptome, particularly consensus sequences from multiple species, can be a valuable source of molecular markers for taxa with complex, duplicated genomes. © 2017 John Wiley & Sons Ltd.

  9. The Developing Child: Tools for Monitoring.

    ERIC Educational Resources Information Center

    D'Agostino, Micheline

    1991-01-01

    This document examines the use of measurement tools to evaluate children's psychological development. The first part of the document discusses the identification and quantification of developmental landmarks and the reasons for evaluation. The second part reviews several evaluation instruments. The Ruth Griffiths mental development scales are used…

  10. The Developing Child: Tools for Monitoring.

    ERIC Educational Resources Information Center

    D'Agostino, Micheline

    1991-01-01

    This document examines the use of measurement tools to evaluate children's psychological development. The first part of the document discusses the identification and quantification of developmental landmarks and the reasons for evaluation. The second part reviews several evaluation instruments. The Ruth Griffiths mental development scales are used…

  11. Tools to Develop or Convert MOVES Inputs

    EPA Pesticide Factsheets

    The following tools are designed to help users develop inputs to MOVES and post-process the output. With the release of MOVES2014, EPA strongly encourages state and local agencies to develop local inputs based on MOVES fleet and activity categories.

  12. An FDA bioinformatics tool for microbial genomics research on molecular characterization of bacterial foodborne pathogens using microarrays.

    PubMed

    Fang, Hong; Xu, Joshua; Ding, Don; Jackson, Scott A; Patel, Isha R; Frye, Jonathan G; Zou, Wen; Nayak, Rajesh; Foley, Steven; Chen, James; Su, Zhenqiang; Ye, Yanbin; Turner, Steve; Harris, Steve; Zhou, Guangxu; Cerniglia, Carl; Tong, Weida

    2010-10-07

    Advances in microbial genomics and bioinformatics are offering greater insights into the emergence and spread of foodborne pathogens in outbreak scenarios. The Food and Drug Administration (FDA) has developed a genomics tool, ArrayTrack™, which provides extensive functionalities to manage, analyze, and interpret genomic data for mammalian species. ArrayTrack™ has been widely adopted by the research community and used for pharmacogenomics data review in the FDA's Voluntary Genomics Data Submission program. ArrayTrack™ has been extended to manage and analyze genomics data from bacterial pathogens of human, animal, and food origin. It was populated with bioinformatics data from public databases such as NCBI, Swiss-Prot, KEGG Pathway, and Gene Ontology to facilitate pathogen detection and characterization. ArrayTrack™'s data processing and visualization tools were enhanced with analysis capabilities designed specifically for microbial genomics including flag-based hierarchical clustering analysis (HCA), flag concordance heat maps, and mixed scatter plots. These specific functionalities were evaluated on data generated from a custom Affymetrix array (FDA-ECSG) previously developed within the FDA. The FDA-ECSG array represents 32 complete genomes of Escherichia coli and Shigella. The new functions were also used to analyze microarray data focusing on antimicrobial resistance genes from Salmonella isolates in a poultry production environment using a universal antimicrobial resistance microarray developed by the United States Department of Agriculture (USDA). The application of ArrayTrack™ to different microarray platforms demonstrates its utility in microbial genomics research, and thus will improve the capabilities of the FDA to rapidly identify foodborne bacteria and their genetic traits (e.g., antimicrobial resistance, virulence, etc.) during outbreak investigations. ArrayTrack™ is free to use and available to public, private, and academic researchers at

  13. MADAP, a flexible clustering tool for the interpretation of one-dimensional genome annotation data.

    PubMed

    Schmid, Christoph D; Sengstag, Thierry; Bucher, Philipp; Delorenzi, Mauro

    2007-07-01

    A recurring task in the analysis of mass genome annotation data from high-throughput technologies is the identification of peaks or clusters in a noisy signal profile. Examples of such applications are the definition of promoters on the basis of transcription start site profiles, the mapping of transcription factor binding sites based on ChIP-chip data and the identification of quantitative trait loci (QTL) from whole genome SNP profiles. Input to such an analysis is a set of genome coordinates associated with counts or intensities. The output consists of a discrete number of peaks with respective volumes, extensions and center positions. We have developed for this purpose a flexible one-dimensional clustering tool, called MADAP, which we make available as a web server and as standalone program. A set of parameters enables the user to customize the procedure to a specific problem. The web server, which returns results in textual and graphical form, is useful for small to medium-scale applications, as well as for evaluation and parameter tuning in view of large-scale applications, requiring a local installation. The program written in C++ can be freely downloaded from ftp://ftp.epd.unil.ch/pub/software/unix/madap. The MADAP web server can be accessed at http://www.isrec.isb-sib.ch/madap/.

  14. UniPrimer: A Web-Based Primer Design Tool for Comparative Analyses of Primate Genomes.

    PubMed

    Batnyam, Nomin; Lee, Jimin; Lee, Jungnam; Hong, Seung Bok; Oh, Sejong; Han, Kyudong

    2012-01-01

    Whole genome sequences of various primates have been released due to advanced DNA-sequencing technology. A combination of computational data mining and the polymerase chain reaction (PCR) assay to validate the data is an excellent method for conducting comparative genomics. Thus, designing primers for PCR is an essential procedure for a comparative analysis of primate genomes. Here, we developed and introduced UniPrimer for use in those studies. UniPrimer is a web-based tool that designs PCR- and DNA-sequencing primers. It compares the sequences from six different primates (human, chimpanzee, gorilla, orangutan, gibbon, and rhesus macaque) and designs primers on the conserved region across species. UniPrimer is linked to RepeatMasker, Primer3Plus, and OligoCalc softwares to produce primers with high accuracy and UCSC In-Silico PCR to confirm whether the designed primers work. To test the performance of UniPrimer, we designed primers on sample sequences using UniPrimer and manually designed primers for the same sequences. The comparison of the two processes showed that UniPrimer was more effective than manual work in terms of saving time and reducing errors.

  15. Use of genome-editing tools to treat sickle cell disease.

    PubMed

    Tasan, Ipek; Jain, Surbhi; Zhao, Huimin

    2016-09-01

    Recent advances in genome-editing techniques have made it possible to modify any desired DNA sequence by employing programmable nucleases. These next-generation genome-modifying tools are the ideal candidates for therapeutic applications, especially for the treatment of genetic disorders like sickle cell disease (SCD). SCD is an inheritable monogenic disorder which is caused by a point mutation in the β-globin gene. Substantial success has been achieved in the development of supportive therapeutic strategies for SCD, but unfortunately there is still a lack of long-term universal cure. The only existing curative treatment is based on allogeneic stem cell transplantation from healthy donors; however, this treatment is applicable to a limited number of patients only. Hence, a universally applicable therapy is highly desirable. In this review, we will discuss the three programmable nucleases that are commonly used for genome-editing purposes: zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs) and clustered regularly interspaced short palindromic repeats/CRISPR-associated protein 9 (CRISPR/Cas9). We will continue by exemplifying uses of these methods to correct the sickle cell mutation. Additionally, we will present induction of fetal globin expression as an alternative approach to cure sickle cell disease. We will conclude by comparing the three methods and explaining the concerns about their use in therapy.

  16. HGT-Finder: A New Tool for Horizontal Gene Transfer Finding and Application to Aspergillus genomes.

    PubMed

    Nguyen, Marcus; Ekstrom, Alex; Li, Xueqiong; Yin, Yanbin

    2015-10-09

    Horizontal gene transfer (HGT) is a fast-track mechanism that allows genetically unrelated organisms to exchange genes for rapid environmental adaptation. We developed a new phyletic distribution-based software, HGT-Finder, which implements a novel bioinformatics algorithm to calculate a horizontal transfer index and a probability value for each query gene. Applying this new tool to the Aspergillus fumigatus, Aspergillus flavus, and Aspergillus nidulans genomes, we found 273, 542, and 715 transferred genes (HTGs), respectively. HTGs have shorter length, higher guanine-cytosine (GC) content, and relaxed selection pressure. Metabolic process and secondary metabolism functions are significantly enriched in HTGs. Gene clustering analysis showed that 61%, 41% and 74% of HTGs in the three genomes form physically linked gene clusters (HTGCs). Overlapping manually curated, secondary metabolite gene clusters (SMGCs) with HTGCs found that 9 of the 33 A. fumigatus SMGCs and 31 of the 65 A. nidulans SMGCs share genes with HTGCs, and that HTGs are significantly enriched in SMGCs. Our genome-wide analysis thus presented very strong evidence to support the hypothesis that HGT has played a very critical role in the evolution of SMGCs. The program is freely available at http://cys.bios.niu.edu/HGTFinder/ HGTFinder.tar.gz.

  17. Whole genome mapping as a fast-track tool to assess genomic stability of sequenced Staphylococcus aureus strains.

    PubMed

    Sabirova, Julia S; Xavier, Basil Britto; Ieven, Margareta; Goossens, Herman; Malhotra-Kumar, Surbhi

    2014-10-08

    Whole genome (optical) mapping (WGM), a state-of-the-art mapping technology based on the generation of high resolution restriction maps, has so far been used for typing clinical outbreak strains and for mapping de novo sequence contigs in genome sequencing projects. We employed WGM to assess the genomic stability of previously sequenced Staphylococcus aureus strains that are commonly used in laboratories as reference standards. S. aureus strains (n = 12) were mapped on the Argus™ Optical Mapping System (Opgen Inc, Gaithersburg, USA). Assembly of NcoI-restricted DNA molecules, visualization, and editing of whole genome maps was performed employing MapManager and MapSolver softwares (Opgen Inc). In silico whole genome NcoI-restricted maps were also generated from available sequence data, and compared to the laboratory-generated maps. Strains showing differences between the two maps were resequenced using Nextera XT DNA Sample Preparation Kit and Miseq Reagent Kit V2 (MiSeq, Illumina) and de novo assembled into sequence contigs using the Velvet assembly tool. Sequence data were correlated with corresponding whole genome maps to perform contig mapping and genome assembly using MapSolver. Of the twelve strains tested, one (USA300_FPR3757) showed a 19-kbp deletion on WGM compared to its in silico generated map and reference sequence data. Resequencing of the USA300_FPR3757 identified the deleted fragment to be a 13 kbp-long integrative conjugative element ICE6013. Frequent subculturing and inter-laboratory transfers can induce genomic and therefore, phenotypic changes that could compromise the utility of standard reference strains. WGM can thus be used as a rapid genome screening method to identify genomic rearrangements whose size and type can be confirmed by sequencing.

  18. GMATA: An Integrated Software Package for Genome-Scale SSR Mining, Marker Development and Viewing.

    PubMed

    Wang, Xuewen; Wang, Le

    2016-01-01

    Simple sequence repeats (SSRs), also referred to as microsatellites, are highly variable tandem DNAs that are widely used as genetic markers. The increasing availability of whole-genome and transcript sequences provides information resources for SSR marker development. However, efficient software is required to efficiently identify and display SSR information along with other gene features at a genome scale. We developed novel software package Genome-wide Microsatellite Analyzing Tool Package (GMATA) integrating SSR mining, statistical analysis and plotting, marker design, polymorphism screening and marker transferability, and enabled simultaneously display SSR markers with other genome features. GMATA applies novel strategies for SSR analysis and primer design in large genomes, which allows GMATA to perform faster calculation and provides more accurate results than existing tools. Our package is also capable of processing DNA sequences of any size on a standard computer. GMATA is user friendly, only requires mouse clicks or types inputs on the command line, and is executable in multiple computing platforms. We demonstrated the application of GMATA in plants genomes and reveal a novel distribution pattern of SSRs in 15 grass genomes. The most abundant motifs are dimer GA/TC, the A/T monomer and the GCG/CGC trimer, rather than the rich G/C content in DNA sequence. We also revealed that SSR count is a linear to the chromosome length in fully assembled grass genomes. GMATA represents a powerful application tool that facilitates genomic sequence analyses. GAMTA is freely available at http://sourceforge.net/projects/gmata/?source=navbar.

  19. CSTminer: a web tool for the identification of coding and noncoding conserved sequence tags through cross-species genome comparison.

    PubMed

    Castrignanò, Tiziana; Canali, Alessandro; Grillo, Giorgio; Liuni, Sabino; Mignone, Flavio; Pesole, Graziano

    2004-07-01

    The identification and characterization of genome tracts that are highly conserved across species during evolution may contribute significantly to the functional annotation of whole-genome sequences. Indeed, such sequences are likely to correspond to known or unknown coding exons or regulatory motifs. Here, we present a web server implementing a previously developed algorithm that, by comparing user-submitted genome sequences, is able to identify statistically significant conserved blocks and assess their coding or noncoding nature through the measure of a coding potential score. The web tool, available at http://www.caspur.it/CSTminer/, is dynamically interconnected with the Ensembl genome resources and produces a graphical output showing a map of detected conserved sequences and annotated gene features.

  20. Population genomics as a new tool for wildlife management.

    PubMed

    Gompert, Zachariah

    2012-04-01

    Admixture and introgression have varied effects on population viability and fitness. Admixture might be an important source of new alleles, particularly for small, geographically isolated populations. However, admixture might also cause outbreeding depression if populations are adapted to different ecological or climatic conditions. Because of the emerging use of translocation and admixture as a conservation and wildlife management strategy to reduce genetic load (termed genetic rescue), the possible effects of admixture have practical consequences (Bouzat et al. 2009; Hedrick & Fredrickson 2010). Importantly, genetic load and local adaptation are properties of individual loci and epistatic interactions among loci rather than properties of genomes. Likewise, the outcome and consequences of genetic rescue depend on the fitness effects of individual introduced alleles. In this issue of Molecular Ecology, Miller et al. (2012) use model-based, population genomic analyses to document locus-specific effects of a recent genetic rescue in the bighorn sheep population within the National Bison Range wildlife refuge (NBR; Montana, USA). They find a subset of introduced alleles associated with increased fitness in NBR bighorn sheep, some of which experienced accelerated introgression following their introduction. These loci mark regions of the genome that could constitute the genetic basis of the successful NBR bighorn sheep genetic rescue. Although population genomic analyses are frequently used to study local adaptation and selection (e.g. Hohenlohe et al. 2010; Lawniczak et al. 2010), this study constitutes a novel application of this analytical framework for wildlife management. Moreover, the detailed demographic data available for the NBR bighorn sheep population provide a rare and powerful source of information and allow more robust population genomic inference than is often possible.

  1. DECIDE: a Decision Support Tool to Facilitate Parents' Choices Regarding Genome-Wide Sequencing.

    PubMed

    Birch, Patricia; Adam, S; Bansback, N; Coe, R R; Hicklin, J; Lehman, A; Li, K C; Friedman, J M

    2016-12-01

    We describe the rationale, development, and usability testing for an integrated e-learning tool and decision aid for parents facing decisions about genome-wide sequencing (GWS) for their children with a suspected genetic condition. The online tool, DECIDE, is designed to provide decision-support and to promote high quality decisions about undergoing GWS with or without return of optional incidental finding results. DECIDE works by integrating educational material with decision aids. Users may tailor their learning by controlling both the amount of information and its format - text and diagrams and/or short videos. The decision aid guides users to weigh the importance of various relevant factors in their own lives and circumstances. After considering the pros and cons of GWS and return of incidental findings, DECIDE summarizes the user's responses and apparent preferred choices. In a usability study of 16 parents who had already chosen GWS after conventional genetic counselling, all participants found DECIDE to be helpful. Many would have been satisfied to use it alone to guide their GWS decisions, but most would prefer to have the option of consulting a health care professional as well to aid their decision. Further testing is necessary to establish the effectiveness of using DECIDE as an adjunct to or instead of conventional pre-test genetic counselling for clinical genome-wide sequencing.

  2. GReEn: a tool for efficient compression of genome resequencing data.

    PubMed

    Pinho, Armando J; Pratas, Diogo; Garcia, Sara P

    2012-02-01

    Research in the genomic sciences is confronted with the volume of sequencing and resequencing data increasing at a higher pace than that of data storage and communication resources, shifting a significant part of research budgets from the sequencing component of a project to the computational one. Hence, being able to efficiently store sequencing and resequencing data is a problem of paramount importance. In this article, we describe GReEn (Genome Resequencing Encoding), a tool for compressing genome resequencing data using a reference genome sequence. It overcomes some drawbacks of the recently proposed tool GRS, namely, the possibility of compressing sequences that cannot be handled by GRS, faster running times and compression gains of over 100-fold for some sequences. This tool is freely available for non-commercial use at ftp://ftp.ieeta.pt/~ap/codecs/GReEn1.tar.gz.

  3. TALEN and CRISPR/Cas Genome Editing Systems: Tools of Discovery

    PubMed Central

    Nemudryi, A. A.; Valetdinova, K. R.; Medvedev, S. P.; Zakian, S. M.

    2014-01-01

    Precise studies of plant, animal and human genomes enable remarkable opportunities of obtained data application in biotechnology and medicine. However, knowing nucleotide sequences isn’t enough for understanding of particular genomic elements functional relationship and their role in phenotype formation and disease pathogenesis. In post-genomic era methods allowing genomic DNA sequences manipulation, visualization and regulation of gene expression are rapidly evolving. Though, there are few methods, that meet high standards of efficiency, safety and accessibility for a wide range of researchers. In 2011 and 2013 novel methods of genome editing appeared – this are TALEN (Transcription Activator-Like Effector Nucleases) and CRISPR (Clustered Regulatory Interspaced Short Palindromic Repeats)/Cas9 systems. Although TALEN and CRISPR/Cas9 appeared recently, these systems have proved to be effective and reliable tools for genome engineering. Here we generally review application of these systems for genome editing in conventional model objects of current biology, functional genome screening, cell-based human hereditary disease modeling, epigenome studies and visualization of cellular processes. Additionally, we review general strategies for designing TALEN and CRISPR/Cas9 and analyzing their activity. We also discuss some obstacles researcher can face using these genome editing tools. PMID:25349712

  4. The Human Ageing Genomic Resources: online databases and tools for biogerontologists

    PubMed Central

    de Magalhães, João Pedro; Budovsky, Arie; Lehmann, Gilad; Costa, Joana; Li, Yang; Fraifeld, Vadim; Church, George M.

    2009-01-01

    Summary Ageing is a complex, challenging phenomenon that will require multiple, interdisciplinary approaches to unravel its puzzles. To assist basic research on ageing, we developed the Human Ageing Genomic Resources (HAGR). This work provides an overview of the databases and tools in HAGR and describes how the gerontology research community can employ them. Several recent changes and improvements to HAGR are also presented. The two centrepieces in HAGR are GenAge and AnAge. GenAge is a gene database featuring genes associated with ageing and longevity in model organisms, a curated database of genes potentially associated with human ageing, and a list of genes tested for their association with human longevity. A myriad of biological data and information is included for hundreds of genes, making GenAge a reference for research that reflects our current understanding of the genetic basis of ageing. GenAge can also serve as a platform for the systems biology of ageing, and tools for the visualization of protein-protein interactions are also included. AnAge is a database of ageing in animals, featuring over 4,000 species, primarily assembled as a resource for comparative and evolutionary studies of ageing. Longevity records, developmental and reproductive traits, taxonomic information, basic metabolic characteristics, and key observations related to ageing are included in AnAge. Software is also available to aid researchers in the form of Perl modules to automate numerous tasks and as an SPSS script to analyse demographic mortality data. The Human Ageing Genomic Resources are available online at http://genomics.senescence.info. PMID:18986374

  5. Novel tools for an old lineage: Population genomics for cycads.

    PubMed

    Cibrián-Jaramillo, Angelica; Marler, Thomas E

    2011-07-01

    With a ca. 300 million year-old evolutionary history, cycads are often perceived as "living fossils," relicts of their previously widespread dominance. Patterns of genetic variation for a member of the most basal cycad genus, Cycas micronesica, support the notion that cycads are a dynamic group with ongoing diversification. Herein we hypothesize that cycad's hefty genomes enable rapid adaptive change and facilitate specific beneficial interactions with varying assemblages of symbionts. Characterizing population-level genomic patterns of cycads and their symbionts, pollinators in particular, will enlighten our understanding of these mechanisms and of adaptive variation that underlies cycad evolution. In light of rapid climate and landscape change, cycads are a beacon for understanding the ecological processes that ultimately enable species long-term survival.

  6. Genomics tools for the unraveling of chromosome architecture

    PubMed Central

    van Steensel, Bas; Dekker, Job

    2010-01-01

    The spatial organization of chromosomes inside the cell nucleus is still poorly understood. This organization is guided by intra- and interchromosomal contacts and by interactions of specific chromosomal loci with relatively fixed nuclear “landmarks” such as the nuclear envelope and the nucleolus. New molecular genome-wide mapping techniques have begun to uncover both types of molecular interactions, providing insights into the fundamental principles of interphase chromosome folding. PMID:20944601

  7. Mutagenesis as a Tool in Plant Genetics, Functional Genomics, and Breeding

    PubMed Central

    Sikora, Per; Chawade, Aakash; Larsson, Mikael; Olsson, Johanna; Olsson, Olof

    2011-01-01

    Plant mutagenesis is rapidly coming of age in the aftermath of recent developments in high-resolution molecular and biochemical techniques. By combining the high variation of mutagenised populations with novel screening methods, traits that are almost impossible to identify by conventional breeding are now being developed and characterised at the molecular level. This paper provides a comprehensive overview of the various techniques and workflows available to researchers today in the field of molecular breeding, and how these tools complement the ones already used in traditional breeding. Both genetic (Targeting Induced Local Lesions in Genomes; TILLING) and phenotypic screens are evaluated. Finally, different ways of bridging the gap between genotype and phenotype are discussed. PMID:22315587

  8. Inducible CRISPR genome-editing tool: classifications and future trends.

    PubMed

    Dai, Xiaofeng; Chen, Xiao; Fang, Qiuwu; Li, Jia; Bai, Zhonghu

    2017-09-22

    The discovery of CRISPR-Cas9/dCas9 system has reinforced our ability and revolutionized our history in genome engineering. While Cas9 and dCas9 are programed to modulate gene expression by introducing DNA breaks, blocking transcription factor recruitment or dragging functional groups towards the targeted sites, sgRNAs determine the genomic loci where the modulation occurs. The off-target problem, due to limited sgRNA specificity and genome complexity of many species, has posed concerns for the wide application of this revolutionary technique. To solve this problem and, more importantly, gain power over gene functionality and cell fate control, inducible strategies have been continuously evolved to offer tailored solutions to address specific biological questions. By reviewing recent advances in inducible CRISPR system design and critical elements potentially adding values to such systems, we classify current approaches in this domain into four mechanically distinct categories, namely, "split system", "allosteric system", "combinatorial system", and "transient delivery system", discuss the pros and cons of each system, and point out the under-explored areas and future directions, with the aim of enriching our toolbox of delicate life engineering.

  9. Whole genome scanning as a cytogenetic tool in hematologic malignancies

    PubMed Central

    Mufti, Ghulam J.

    2008-01-01

    Over the years, methods of cytogenetic analysis evolved and became part of routine laboratory testing, providing valuable diagnostic and prognostic information in hematologic disorders. Karyotypic aberrations contribute to the understanding of the molecular pathogenesis of disease and thereby to rational application of therapeutic modalities. Most of the progress in this field stems from the application of metaphase cytogenetics (MC), but recently, novel molecular technologies have been introduced that complement MC and overcome many of the limitations of traditional cytogenetics, including a need for cell culture. Whole genome scanning using comparative genomic hybridization and single nucleotide polymorphism arrays (CGH-A; SNP-A) can be used for analysis of somatic or clonal unbalanced chromosomal defects. In SNP-A, the combination of copy number detection and genotyping enables diagnosis of copy-neutral loss of heterozygosity, a lesion that cannot be detected using MC but may have important pathogenetic implications. Overall, whole genome scanning arrays, despite the drawback of an inability to detect balanced translocations, allow for discovery of chromosomal defects in a higher proportion of patients with hematologic malignancies. Newly detected chromosomal aberrations, including somatic uniparental disomy, may lead to more precise prognostic schemes in many diseases. PMID:18505780

  10. Electric fish genomics: Progress, prospects, and new tools for neuroethology.

    PubMed

    Pitchers, William R; Constantinou, Savvas J; Losilla, Mauricio; Gallant, Jason R

    2016-10-01

    Electric fish have served as a model system in biology since the 18th century, providing deep insight into the nature of bioelectrogenesis, the molecular structure of the synapse, and brain circuitry underlying complex behavior. Neuroethologists have collected extensive phenotypic data that span biological levels of analysis from molecules to ecosystems. This phenotypic data, together with genomic resources obtained over the past decades, have motivated new and exciting hypotheses that position the weakly electric fish model to address fundamental 21(st) century biological questions. This review article considers the molecular data collected for weakly electric fish over the past three decades, and the insights that data of this nature has motivated. For readers relatively new to molecular genetics techniques, we also provide a table of terminology aimed at clarifying the numerous acronyms and techniques that accompany this field. Next, we pose a research agenda for expanding genomic resources for electric fish research over the next 10years. We conclude by considering some of the exciting research prospects for neuroethology that electric fish genomics may offer over the coming decades, if the electric fish community is successful in these endeavors. Copyright © 2016 Elsevier Ltd. All rights reserved.

  11. A genomic regulatory network for development

    NASA Technical Reports Server (NTRS)

    Davidson, Eric H.; Rast, Jonathan P.; Oliveri, Paola; Ransick, Andrew; Calestani, Cristina; Yuh, Chiou-Hwa; Minokawa, Takuya; Amore, Gabriele; Hinman, Veronica; Arenas-Mena, Cesar; Otim, Ochan; Brown, C. Titus; Livi, Carolina B.; Lee, Pei Yun; Revilla, Roger; Rust, Alistair G.; Pan, Zheng jun; Schilstra, Maria J.; Clarke, Peter J C.; Arnone, Maria I.; Rowen, Lee; Cameron, R. Andrew; McClay, David R.; Hood, Leroy; Bolouri, Hamid

    2002-01-01

    Development of the body plan is controlled by large networks of regulatory genes. A gene regulatory network that controls the specification of endoderm and mesoderm in the sea urchin embryo is summarized here. The network was derived from large-scale perturbation analyses, in combination with computational methodologies, genomic data, cis-regulatory analysis, and molecular embryology. The network contains over 40 genes at present, and each node can be directly verified at the DNA sequence level by cis-regulatory analysis. Its architecture reveals specific and general aspects of development, such as how given cells generate their ordained fates in the embryo and why the process moves inexorably forward in developmental time.

  12. A genomic regulatory network for development

    NASA Technical Reports Server (NTRS)

    Davidson, Eric H.; Rast, Jonathan P.; Oliveri, Paola; Ransick, Andrew; Calestani, Cristina; Yuh, Chiou-Hwa; Minokawa, Takuya; Amore, Gabriele; Hinman, Veronica; Arenas-Mena, Cesar; hide

    2002-01-01

    Development of the body plan is controlled by large networks of regulatory genes. A gene regulatory network that controls the specification of endoderm and mesoderm in the sea urchin embryo is summarized here. The network was derived from large-scale perturbation analyses, in combination with computational methodologies, genomic data, cis-regulatory analysis, and molecular embryology. The network contains over 40 genes at present, and each node can be directly verified at the DNA sequence level by cis-regulatory analysis. Its architecture reveals specific and general aspects of development, such as how given cells generate their ordained fates in the embryo and why the process moves inexorably forward in developmental time.

  13. Development of tools for addressing device interoperability

    NASA Astrophysics Data System (ADS)

    Kalogeras, A. P.; Papadopoulos, George D.; Koubias, Stavros

    1997-01-01

    The great diversity of the devices that need to be interconnected in the industrial environment creates the need for interoperability. Communication interoperability is dealt with by using the Manufacturing Message Specification ISO 9506 standard in the application layer of industrial networks, which provides an abstract representation of the external characteristics of real manufacturing devices. In addition to the communication interoperability, the need for device interoperability has led to the definition of the user layer: an eighth layer in the OSI-RM for fieldbuses. The ISA/IEC SP50 emerging standard on fieldbuses defines the characteristics of the user layer functionality and the required objects. This paper presents an integrated solution for problems associated with industrial communications and control being developed at the University of Patras, Greece. Specific object oriented tools are being developed for the user layer. The function block builder tool allows the definition and creation of function block objects according to the emerging fieldbus standard. The graphical editing tool provides an integrated environment for the creation of logical node objects by means of interconnecting function block objects. The above tools are part of an integrated environment consisting of the application layer and the network management software. Furthermore, the graphical editing tool provides the capability of run-time monitoring as well as of dynamic reconfiguration of the industrial application. FInally, the complete systems has been installed in industrial test-bed applications.

  14. Update on SLD Engineering Tools Development

    NASA Technical Reports Server (NTRS)

    Miller, Dean R.; Potapczuk, Mark G.; Bond, Thomas H.

    2004-01-01

    The airworthiness authorities (FAA, JAA, Transport Canada) will be releasing a draft rule in the 2006 timeframe concerning the operation of aircraft in a Supercooled Large Droplet (SLD) environment aloft. The draft rule will require aircraft manufacturers to demonstrate that their aircraft can operate safely in an SLD environment for a period of time to facilitate a safe exit from the condition. It is anticipated that aircraft manufacturers will require a capability to demonstrate compliance with this rule via experimental means (icing tunnels or tankers) and by analytical means (ice prediction codes). Since existing icing research facilities and analytical codes were not developed to account for SLD conditions, current engineering tools are not adequate to support compliance activities in SLD conditions. Therefore, existing capabilities need to be augmented to include SLD conditions. In response to this need, NASA and its partners conceived a strategy or Roadmap for developing experimental and analytical SLD simulation tools. Following review and refinement by the airworthiness authorities and other international research partners, this technical strategy has been crystallized into a project plan to guide the SLD Engineering Tool Development effort. This paper will provide a brief overview of the latest version of the project plan and technical rationale, and provide a status of selected SLD Engineering Tool Development research tasks which are currently underway.

  15. Distributed Curriculum Development Environments: Techniques and Tools.

    ERIC Educational Resources Information Center

    Patrick, Eric

    1996-01-01

    Provides a background of the technology underlying the World Wide Web, a description of possible uses of the Web for training software, and a proposed method of curriculum development using existing tools based on experiences with training for ROTC (Reserve Officers Training Corps) and Maritime Academies using HyperText Markup Language (HTML).…

  16. Cost-effective cloud computing: a case study using the comparative genomics tool, roundup.

    PubMed

    Kudtarkar, Parul; Deluca, Todd F; Fusaro, Vincent A; Tonellato, Peter J; Wall, Dennis P

    2010-12-22

    Comparative genomics resources, such as ortholog detection tools and repositories are rapidly increasing in scale and complexity. Cloud computing is an emerging technological paradigm that enables researchers to dynamically build a dedicated virtual cluster and may represent a valuable alternative for large computational tools in bioinformatics. In the present manuscript, we optimize the computation of a large-scale comparative genomics resource-Roundup-using cloud computing, describe the proper operating principles required to achieve computational efficiency on the cloud, and detail important procedures for improving cost-effectiveness to ensure maximal computation at minimal costs. Utilizing the comparative genomics tool, Roundup, as a case study, we computed orthologs among 902 fully sequenced genomes on Amazon's Elastic Compute Cloud. For managing the ortholog processes, we designed a strategy to deploy the web service, Elastic MapReduce, and maximize the use of the cloud while simultaneously minimizing costs. Specifically, we created a model to estimate cloud runtime based on the size and complexity of the genomes being compared that determines in advance the optimal order of the jobs to be submitted. We computed orthologous relationships for 245,323 genome-to-genome comparisons on Amazon's computing cloud, a computation that required just over 200 hours and cost $8,000 USD, at least 40% less than expected under a strategy in which genome comparisons were submitted to the cloud randomly with respect to runtime. Our cost savings projections were based on a model that not only demonstrates the optimal strategy for deploying RSD to the cloud, but also finds the optimal cluster size to minimize waste and maximize usage. Our cost-reduction model is readily adaptable for other comparative genomics tools and potentially of significant benefit to labs seeking to take advantage of the cloud as an alternative to local computing infrastructure.

  17. Cost-Effective Cloud Computing: A Case Study Using the Comparative Genomics Tool, Roundup

    PubMed Central

    Kudtarkar, Parul; DeLuca, Todd F.; Fusaro, Vincent A.; Tonellato, Peter J.; Wall, Dennis P.

    2010-01-01

    Background Comparative genomics resources, such as ortholog detection tools and repositories are rapidly increasing in scale and complexity. Cloud computing is an emerging technological paradigm that enables researchers to dynamically build a dedicated virtual cluster and may represent a valuable alternative for large computational tools in bioinformatics. In the present manuscript, we optimize the computation of a large-scale comparative genomics resource—Roundup—using cloud computing, describe the proper operating principles required to achieve computational efficiency on the cloud, and detail important procedures for improving cost-effectiveness to ensure maximal computation at minimal costs. Methods Utilizing the comparative genomics tool, Roundup, as a case study, we computed orthologs among 902 fully sequenced genomes on Amazon’s Elastic Compute Cloud. For managing the ortholog processes, we designed a strategy to deploy the web service, Elastic MapReduce, and maximize the use of the cloud while simultaneously minimizing costs. Specifically, we created a model to estimate cloud runtime based on the size and complexity of the genomes being compared that determines in advance the optimal order of the jobs to be submitted. Results We computed orthologous relationships for 245,323 genome-to-genome comparisons on Amazon’s computing cloud, a computation that required just over 200 hours and cost $8,000 USD, at least 40% less than expected under a strategy in which genome comparisons were submitted to the cloud randomly with respect to runtime. Our cost savings projections were based on a model that not only demonstrates the optimal strategy for deploying RSD to the cloud, but also finds the optimal cluster size to minimize waste and maximize usage. Our cost-reduction model is readily adaptable for other comparative genomics tools and potentially of significant benefit to labs seeking to take advantage of the cloud as an alternative to local computing

  18. CAGO: a software tool for dynamic visual comparison and correlation measurement of genome organization.

    PubMed

    Chang, Yi-Feng; Chang, Chuan-Hsiung

    2011-01-01

    CAGO (Comparative Analysis of Genome Organization) is developed to address two critical shortcomings of conventional genome atlas plotters: lack of dynamic exploratory functions and absence of signal analysis for genomic properties. With dynamic exploratory functions, users can directly manipulate chromosome tracks of a genome atlas and intuitively identify distinct genomic signals by visual comparison. Signal analysis of genomic properties can further detect inconspicuous patterns from noisy genomic properties and calculate correlations between genomic properties across various genomes. To implement dynamic exploratory functions, CAGO presents each genome atlas in Scalable Vector Graphics (SVG) format and allows users to interact with it using a SVG viewer through JavaScript. Signal analysis functions are implemented using R statistical software and a discrete wavelet transformation package waveslim. CAGO is not only a plotter for generating complex genome atlases, but also a platform for exploring genome atlases with dynamic exploratory functions for visual comparison and with signal analysis for comparing genomic properties across multiple organisms. The web-based application of CAGO, its source code, user guides, video demos, and live examples are publicly available and can be accessed at http://cbs.ym.edu.tw/cago.

  19. OntoMate: a text-mining tool aiding curation at the Rat Genome Database

    PubMed Central

    Liu, Weisong; Laulederkind, Stanley J. F.; Hayman, G. Thomas; Wang, Shur-Jen; Nigam, Rajni; Smith, Jennifer R.; De Pons, Jeff; Dwinell, Melinda R.; Shimoyama, Mary

    2015-01-01

    The Rat Genome Database (RGD) is the premier repository of rat genomic, genetic and physiologic data. Converting data from free text in the scientific literature to a structured format is one of the main tasks of all model organism databases. RGD spends considerable effort manually curating gene, Quantitative Trait Locus (QTL) and strain information. The rapidly growing volume of biomedical literature and the active research in the biological natural language processing (bioNLP) community have given RGD the impetus to adopt text-mining tools to improve curation efficiency. Recently, RGD has initiated a project to use OntoMate, an ontology-driven, concept-based literature search engine developed at RGD, as a replacement for the PubMed (http://www.ncbi.nlm.nih.gov/pubmed) search engine in the gene curation workflow. OntoMate tags abstracts with gene names, gene mutations, organism name and most of the 16 ontologies/vocabularies used at RGD. All terms/ entities tagged to an abstract are listed with the abstract in the search results. All listed terms are linked both to data entry boxes and a term browser in the curation tool. OntoMate also provides user-activated filters for species, date and other parameters relevant to the literature search. Using the system for literature search and import has streamlined the process compared to using PubMed. The system was built with a scalable and open architecture, including features specifically designed to accelerate the RGD gene curation process. With the use of bioNLP tools, RGD has added more automation to its curation workflow. Database URL: http://rgd.mcw.edu PMID:25619558

  20. OntoMate: a text-mining tool aiding curation at the Rat Genome Database.

    PubMed

    Liu, Weisong; Laulederkind, Stanley J F; Hayman, G Thomas; Wang, Shur-Jen; Nigam, Rajni; Smith, Jennifer R; De Pons, Jeff; Dwinell, Melinda R; Shimoyama, Mary

    2015-01-01

    The Rat Genome Database (RGD) is the premier repository of rat genomic, genetic and physiologic data. Converting data from free text in the scientific literature to a structured format is one of the main tasks of all model organism databases. RGD spends considerable effort manually curating gene, Quantitative Trait Locus (QTL) and strain information. The rapidly growing volume of biomedical literature and the active research in the biological natural language processing (bioNLP) community have given RGD the impetus to adopt text-mining tools to improve curation efficiency. Recently, RGD has initiated a project to use OntoMate, an ontology-driven, concept-based literature search engine developed at RGD, as a replacement for the PubMed (http://www.ncbi.nlm.nih.gov/pubmed) search engine in the gene curation workflow. OntoMate tags abstracts with gene names, gene mutations, organism name and most of the 16 ontologies/vocabularies used at RGD. All terms/ entities tagged to an abstract are listed with the abstract in the search results. All listed terms are linked both to data entry boxes and a term browser in the curation tool. OntoMate also provides user-activated filters for species, date and other parameters relevant to the literature search. Using the system for literature search and import has streamlined the process compared to using PubMed. The system was built with a scalable and open architecture, including features specifically designed to accelerate the RGD gene curation process. With the use of bioNLP tools, RGD has added more automation to its curation workflow. Database URL: http://rgd.mcw.edu. © The Author(s) 2015. Published by Oxford University Press.

  1. New bioinformatic tool for quick identification of functionally relevant endogenous retroviral inserts in human genome.

    PubMed

    Garazha, Andrew; Ivanova, Alena; Suntsova, Maria; Malakhova, Galina; Roumiantsev, Sergey; Zhavoronkov, Alex; Buzdin, Anton

    2015-01-01

    Endogenous retroviruses (ERVs) and LTR retrotransposons (LRs) occupy ∼8% of human genome. Deep sequencing technologies provide clues to understanding of functional relevance of individual ERVs/LRs by enabling direct identification of transcription factor binding sites (TFBS) and other landmarks of functional genomic elements. Here, we performed the genome-wide identification of human ERVs/LRs containing TFBS according to the ENCODE project. We created the first interactive ERV/LRs database that groups the individual inserts according to their familial nomenclature, number of mapped TFBS and divergence from their consensus sequence. Information on any particular element can be easily extracted by the user. We also created a genome browser tool, which enables quick mapping of any ERV/LR insert according to genomic coordinates, known human genes and TFBS. These tools can be used to easily explore functionally relevant individual ERV/LRs, and for studying their impact on the regulation of human genes. Overall, we identified ∼110,000 ERV/LR genomic elements having TFBS. We propose a hypothesis of "domestication" of ERV/LR TFBS by the genome milieu including subsequent stages of initial epigenetic repression, partial functional release, and further mutation-driven reshaping of TFBS in tight coevolution with the enclosing genomic loci.

  2. Kapteyn Package: Tools for developing astronomical applications

    NASA Astrophysics Data System (ADS)

    Terlouw, J. P.; Vogelaar, M. G. R.

    2016-11-01

    The Kapteyn Package provides tools for the development of astronomical applications with Python. It handles spatial and spectral coordinates, WCS projections and transformations between different sky systems; spectral translations (e.g., between frequencies and velocities) and mixed coordinates are also supported. Kapteyn offers versatile tools for writing small and dedicated applications for the inspection of FITS headers, the extraction and display of (FITS) data, interactive inspection of this data (color editing) and for the creation of plots with world coordinate information. It includes utilities for use with matplotlib such as obtaining coordinate information from plots, interactively modifiable colormaps and timer events (module mplutil); tools for parsing and interpreting coordinate information entered by the user (module positions); a function to search for gaussian components in a profile (module profiles); and a class for non-linear least squares fitting (module kmpfit).

  3. H1640 caster tool development report

    SciTech Connect

    Brown, L.A.

    1997-12-01

    This report describes the development and certification of the H1640 caster tool. This tool is used to rotate swivel caster wheels 90 degrees on bomb hand trucks or shipping containers. The B83 is a heavy bomb system and weighs close to 5,600 pounds for a two-high stack configuration. High castering moments (handle length times the force exerted on handle) are required to caster a wheel for a two-high stack of B83s. The H1640 is available to the DoD (Air Force) through the Special Equipment List (SEL) for the B83 as a replacement for the H631 and H1216 caster tools.

  4. Tool for rapid annotation of microbial SNPs (TRAMS): a simple program for rapid annotation of genomic variation in prokaryotes.

    PubMed

    Reumerman, Richard A; Tucker, Nicholas P; Herron, Paul R; Hoskisson, Paul A; Sangal, Vartul

    2013-09-01

    Next generation sequencing (NGS) has been widely used to study genomic variation in a variety of prokaryotes. Single nucleotide polymorphisms (SNPs) resulting from genomic comparisons need to be annotated for their functional impact on the coding sequences. We have developed a program, TRAMS, for functional annotation of genomic SNPs which is available to download as a single file executable for WINDOWS users with limited computational experience and as a Python script for Mac OS and Linux users. TRAMS needs a tab delimited text file containing SNP locations, reference nucleotide and SNPs in variant strains along with a reference genome sequence in GenBank or EMBL format. SNPs are annotated as synonymous, nonsynonymous or nonsense. Nonsynonymous SNPs in start and stop codons are separated as non-start and non-stop SNPs, respectively. SNPs in multiple overlapping features are annotated separately for each feature and multiple nucleotide polymorphisms within a codon are combined before annotation. We have also developed a workflow for Galaxy, a highly used tool for analysing NGS data, to map short reads to a reference genome and extract and annotate the SNPs. TRAMS is a simple program for rapid and accurate annotation of SNPs that will be very useful for microbiologists in analysing genomic diversity in microbial populations.

  5. Genomics: Tool to predict and prevent male infertility.

    PubMed

    Halder, Ashutosh; Kumar, Prashant; Jain, Manish; Kalsi, Amanpreet Kaur

    2017-06-01

    A large number of human diseases arise as a result of genetic abnormalities. With the advent of improved molecular biology techniques, the genetic etiology of male infertility is increasing. The common genetic factors responsible for male infertility are chromosomal abnormalities, Yq microdeletion and cystic fibrosis. These are responsible for approximately 30 percent cases of male infertility. About 40 percent cases of male infertility are categorized as idiopathic. These cases may be associated with genetic and genomic abnormalities. During last few years more and more genes are implicated in male infertility leading to decline in prevalence of idiopathic etiology. In this review we will summarize up to date published works on genetic etiologies of male infertility including our own works. We also briefly describe reproductive technologies used to overcome male infertility, dangers of transmitting genetic disorders to offspring and ways to prevent transmission of genetic disorders during assisted reproduction. At the end we will provide our points on how genomic information can be utilized for prediction and prevention of male infertility in coming years.

  6. Iterative ACORN as a high throughput tool in structural genomics.

    PubMed

    Selvanayagam, S; Velmurugan, D; Yamane, T

    2006-08-01

    High throughput macromolecular structure determination is very essential in structural genomics as the available number of sequence information far exceeds the number of available 3D structures. ACORN, a freely available resource in the CCP4 suite of programs is a comprehensive and efficient program for phasing in the determination of protein structures, when atomic resolution data are available. ACORN with the automatic model-building program ARP/wARP and refinement program REFMAC is a suitable combination for the high throughput structural genomics. ACORN can also be run with secondary structural elements like helices and sheets as inputs with high resolution data. In situations, where ACORN phasing is not sufficient for building the protein model, the fragments (incomplete model/dummy atoms) can again be used as a starting input. Iterative ACORN is proved to work efficiently in the subsequent model building stages in congerin (PDB-ID: lis3) and catalase (PDB-ID: 1gwe) for which models are available.

  7. Ferret: a user-friendly Java tool to extract data from the 1000 Genomes Project.

    PubMed

    Limou, Sophie; Taverner, Andrew M; Winkler, Cheryl A

    2016-07-15

    The 1000 Genomes (1KG) Project provides a near-comprehensive resource on human genetic variation in worldwide reference populations. 1KG variants can be accessed through a browser and through the raw and annotated data that are regularly released on an ftp server. We developed Ferret, a user-friendly Java tool, to easily extract genetic variation information from these large and complex data files. From a locus, gene(s) or SNP(s) of interest, Ferret retrieves genotype data for 1KG SNPs and indels, and computes allelic frequencies for 1KG populations and optionally, for the Exome Sequencing Project populations. By converting the 1KG data into files that can be imported into popular pre-existing tools (e.g. PLINK and HaploView), Ferret offers a straightforward way, even for non-bioinformatics specialists, to manipulate, explore and merge 1KG data with the user's dataset, as well as visualize linkage disequilibrium pattern, infer haplotypes and design tagSNPs. Ferret tool and source code are publicly available at http://limousophie35.github.io/Ferret/ ferret@nih.gov Supplementary data are available at Bioinformatics online. Published by Oxford University Press 2016. This work is written by US Government employees and is in the public domain in the US.

  8. Kvik: three-tier data exploration tools for flexible analysis of genomic data in epidemiological studies

    PubMed Central

    Fjukstad, Bjørn; Standahl Olsen, Karina; Jareid, Mie; Lund, Eiliv; Bongo, Lars Ailo

    2015-01-01

    Kvik is an open-source framework that we developed for explorative analysis of functional genomics data from large epidemiological studies. Creating such studies requires a significant amount of time and resources. It is therefore usual to reuse the data from one study for several research projects. Often each project requires implementing new analysis code, integration with specific knowledge bases, and specific visualizations. Although existing data exploration tools are available for single study data exploration, no tool provides all the required functionality for multistudy data exploration. We have therefore used the Kvik framework to develop Kvik Pathways, an application for exploring gene expression data in the context of biological pathways. We have used Kvik Pathways to explore data from both a cross-sectional study design and a case-control study within the Norwegian Women and Cancer (NOWAC) cohort. Kvik Pathways follows the three-tier architecture in web applications using a powerful back-end for statistical analyses and retrieval of metadata.In this note, we describe how we used the Kvik framework to develop the Kvik Pathways application. Kvik Pathways was used by our team of epidemiologists toexplore gene expression data from healthy women with high and low plasma ratios of essential fatty acids. PMID:26425340

  9. Kvik: three-tier data exploration tools for flexible analysis of genomic data in epidemiological studies.

    PubMed

    Fjukstad, Bjørn; Standahl Olsen, Karina; Jareid, Mie; Lund, Eiliv; Bongo, Lars Ailo

    2015-01-01

    Kvik is an open-source framework that we developed for explorative analysis of functional genomics data from large epidemiological studies. Creating such studies requires a significant amount of time and resources. It is therefore usual to reuse the data from one study for several research projects. Often each project requires implementing new analysis code, integration with specific knowledge bases, and specific visualizations. Although existing data exploration tools are available for single study data exploration, no tool provides all the required functionality for multistudy data exploration. We have therefore used the Kvik framework to develop Kvik Pathways, an application for exploring gene expression data in the context of biological pathways. We have used Kvik Pathways to explore data from both a cross-sectional study design and a case-control study within the Norwegian Women and Cancer (NOWAC) cohort. Kvik Pathways follows the three-tier architecture in web applications using a powerful back-end for statistical analyses and retrieval of metadata.In this note, we describe how we used the Kvik framework to develop the Kvik Pathways application. Kvik Pathways was used by our team of epidemiologists toexplore gene expression data from healthy women with high and low plasma ratios of essential fatty acids.

  10. CodaChrome: a tool for the visualization of proteome conservation across all fully sequenced bacterial genomes

    PubMed Central

    2014-01-01

    Background The relationships between bacterial genomes are complicated by rampant horizontal gene transfer, varied selection pressures, acquisition of new genes, loss of genes, and divergence of genes, even in closely related lineages. As more and more bacterial genomes are sequenced, organizing and interpreting the incredible amount of relational information that connects them becomes increasingly difficult. Results We have developed CodaChrome (http://www.sourceforge.com/p/codachrome), a one-versus-all proteome comparison tool that allows the user to visually investigate the relationship between a bacterial proteome of interest and the proteomes encoded by every other bacterial genome recorded in GenBank in a massive interactive heat map. This tool has allowed us to rapidly identify the most highly conserved proteins encoded in the bacterial pan-genome, fast-clock genes useful for subtyping of bacterial species, the evolutionary history of an indel in the Sphingobium lineage, and an example of horizontal gene transfer from a member of the genus Enterococcus to a recent ancestor of Helicobacter pylori. Conclusion CodaChrome is a user-friendly and powerful tool for simultaneously visualizing relationships between thousands of proteomes. PMID:24460813

  11. webMGR: an online tool for the multiple genome rearrangement problem.

    PubMed

    Lin, Chi Ho; Zhao, Hao; Lowcay, Sean Harry; Shahab, Atif; Bourque, Guillaume

    2010-02-01

    The algorithm MGR enables the reconstruction of rearrangement phylogenies based on gene or synteny block order in multiple genomes. Although MGR has been successfully applied to study the evolution of different sets of species, its utilization has been hampered by the prohibitive running time for some applications. In the current work, we have designed new heuristics that significantly speed up the tool without compromising its accuracy. Moreover, we have developed a web server (webMGR) that includes elaborate web output to facilitate navigation through the results. webMGR can be accessed via http://www.gis.a-star.edu.sg/~bourque. The source code of the improved standalone version of MGR is also freely available from the web site. Supplementary data are available at Bioinformatics online.

  12. Gnome View: A tool for visual representation of human genome data

    SciTech Connect

    Pelkey, J.E.; Thomas, G.S.; Thurman, D.A.; Lortz, V.B.; Douthart, R.J.

    1993-02-01

    GnomeView is a tool for exploring data generated by the Human Gemone Project. GnomeView provides both graphical and textural styles of data presentation: employs an intuitive window-based graphical query interface: and integrates its underlying genome databases in such a way that the user can navigate smoothly across databases and between different levels of data. This paper describes GnomeView and discusses how it addresses various genome informatics issues.

  13. Gnome View: A tool for visual representation of human genome data

    SciTech Connect

    Pelkey, J.E.; Thomas, G.S.; Thurman, D.A.; Lortz, V.B.; Douthart, R.J.

    1993-02-01

    GnomeView is a tool for exploring data generated by the Human Gemone Project. GnomeView provides both graphical and textural styles of data presentation: employs an intuitive window-based graphical query interface: and integrates its underlying genome databases in such a way that the user can navigate smoothly across databases and between different levels of data. This paper describes GnomeView and discusses how it addresses various genome informatics issues.

  14. CGUG: in silico proteome and genome parsing tool for the determination of "core" and unique genes in the analysis of genomes up to ca. 1.9 Mb

    PubMed Central

    Mahadevan, Padmanabhan; King, John F; Seto, Donald

    2009-01-01

    Background Viruses and small-genome bacteria (~2 megabases and smaller) comprise a considerable population in the biosphere and are of interest to many researchers. These genomes are now sequenced at an unprecedented rate and require complementary computational tools to analyze. "CoreGenesUniqueGenes" (CGUG) is an in silico genome data mining tool that determines a "core" set of genes from two to five organisms with genomes in this size range. Core and unique genes may reflect similar niches and needs, and may be used in classifying organisms. Findings CGUG is available at as a web-based on-the-fly tool that performs iterative BLASTP analyses using a reference genome and up to four query genomes to provide a table of genes common to these genomes. The result is an in silico display of genomes and their proteomes, allowing for further analysis. CGUG can be used for "genome annotation by homology", as demonstrated with Chlamydophila and Francisella genomes. Conclusion CGUG is used to reanalyze the ICTV-based classifications of bacteriophages, to reconfirm long-standing relationships and to explore new classifications. These genomes have been problematic in the past, due largely to horizontal gene transfers. CGUG is validated as a tool for reannotating small genome bacteria using more up-to-date annotations by similarity or homology. These serve as an entry point for wet-bench experiments to confirm the functions of these "hypothetical" and "unknown" proteins. PMID:19706165

  15. Development of a HT seismic downhole tool.

    SciTech Connect

    Maldonado, Frank P.; Greving, Jeffrey J.; Henfling, Joseph Anthony; Chavira, David J.; Uhl, James Eugene; Polsky, Yarom

    2009-06-01

    Enhanced Geothermal Systems (EGS) require the stimulation of the drilled well, likely through hydraulic fracturing. Whether fracturing of the rock occurs by shear destabilization of natural fractures or by extensional failure of weaker zones, control of the fracture process will be required to create the flow paths necessary for effective heat mining. As such, microseismic monitoring provides one method for real-time mapping of the fractures created during the hydraulic fracturing process. This monitoring is necessary to help assess stimulation effectiveness and provide the information necessary to properly create the reservoir. In addition, reservoir monitoring of the microseismic activity can provide information on reservoir performance and evolution over time. To our knowledge, no seismic tool exists that will operate above 125 C for the long monitoring durations that may be necessary. Replacing failed tools is costly and introduces potential errors such as depth variance, etc. Sandia has designed a high temperature seismic tool for long-term deployment in geothermal applications. It is capable of detecting microseismic events and operating continuously at temperatures up to 240 C. This project includes the design and fabrication of two High Temperature (HT) seismic tools that will have the capability to operate in both temporary and long-term monitoring modes. To ensure the developed tool meets industry requirements for high sampling rates (>2ksps) and high resolution (24-bit Analog-to-Digital Converter) two electronic designs will be implemented. One electronic design will utilize newly developed 200 C electronic components. The other design will use qualified Silicon-on-Insulator (SOI) devices and will have a continuous operating temperature of 240 C.

  16. Development of wavelet analysis tools for turbulence

    NASA Technical Reports Server (NTRS)

    Bertelrud, A.; Erlebacher, G.; Dussouillez, PH.; Liandrat, M. P.; Liandrat, J.; Bailly, F. Moret; Tchamitchian, PH.

    1992-01-01

    Presented here is the general framework and the initial results of a joint effort to derive novel research tools and easy to use software to analyze and model turbulence and transition. Given here is a brief review of the issues, a summary of some basic properties of wavelets, and preliminary results. Technical aspects of the implementation, the physical conclusions reached at this time, and current developments are discussed.

  17. Human Ageing Genomic Resources: Integrated databases and tools for the biology and genetics of ageing

    PubMed Central

    Tacutu, Robi; Craig, Thomas; Budovsky, Arie; Wuttke, Daniel; Lehmann, Gilad; Taranukha, Dmitri; Costa, Joana; Fraifeld, Vadim E.; de Magalhães, João Pedro

    2013-01-01

    The Human Ageing Genomic Resources (HAGR, http://genomics.senescence.info) is a freely available online collection of research databases and tools for the biology and genetics of ageing. HAGR features now several databases with high-quality manually curated data: (i) GenAge, a database of genes associated with ageing in humans and model organisms; (ii) AnAge, an extensive collection of longevity records and complementary traits for >4000 vertebrate species; and (iii) GenDR, a newly incorporated database, containing both gene mutations that interfere with dietary restriction-mediated lifespan extension and consistent gene expression changes induced by dietary restriction. Since its creation about 10 years ago, major efforts have been undertaken to maintain the quality of data in HAGR, while further continuing to develop, improve and extend it. This article briefly describes the content of HAGR and details the major updates since its previous publications, in terms of both structure and content. The completely redesigned interface, more intuitive and more integrative of HAGR resources, is also presented. Altogether, we hope that through its improvements, the current version of HAGR will continue to provide users with the most comprehensive and accessible resources available today in the field of biogerontology. PMID:23193293

  18. Human Ageing Genomic Resources: integrated databases and tools for the biology and genetics of ageing.

    PubMed

    Tacutu, Robi; Craig, Thomas; Budovsky, Arie; Wuttke, Daniel; Lehmann, Gilad; Taranukha, Dmitri; Costa, Joana; Fraifeld, Vadim E; de Magalhães, João Pedro

    2013-01-01

    The Human Ageing Genomic Resources (HAGR, http://genomics.senescence.info) is a freely available online collection of research databases and tools for the biology and genetics of ageing. HAGR features now several databases with high-quality manually curated data: (i) GenAge, a database of genes associated with ageing in humans and model organisms; (ii) AnAge, an extensive collection of longevity records and complementary traits for >4000 vertebrate species; and (iii) GenDR, a newly incorporated database, containing both gene mutations that interfere with dietary restriction-mediated lifespan extension and consistent gene expression changes induced by dietary restriction. Since its creation about 10 years ago, major efforts have been undertaken to maintain the quality of data in HAGR, while further continuing to develop, improve and extend it. This article briefly describes the content of HAGR and details the major updates since its previous publications, in terms of both structure and content. The completely redesigned interface, more intuitive and more integrative of HAGR resources, is also presented. Altogether, we hope that through its improvements, the current version of HAGR will continue to provide users with the most comprehensive and accessible resources available today in the field of biogerontology.

  19. Computational and molecular tools for scalable rAAV-mediated genome editing

    PubMed Central

    Stoimenov, Ivaylo; Ali, Muhammad Akhtar; Pandzic, Tatjana; Sjöblom, Tobias

    2015-01-01

    The rapid discovery of potential driver mutations through large-scale mutational analyses of human cancers generates a need to characterize their cellular phenotypes. Among the techniques for genome editing, recombinant adeno-associated virus (rAAV)-mediated gene targeting is suited for knock-in of single nucleotide substitutions and to a lesser degree for gene knock-outs. However, the generation of gene targeting constructs and the targeting process is time-consuming and labor-intense. To facilitate rAAV-mediated gene targeting, we developed the first software and complementary automation-friendly vector tools to generate optimized targeting constructs for editing human protein encoding genes. By computational approaches, rAAV constructs for editing ∼71% of bases in protein-coding exons were designed. Similarly, ∼81% of genes were predicted to be targetable by rAAV-mediated knock-out. A Gateway-based cloning system for facile generation of rAAV constructs suitable for robotic automation was developed and used in successful generation of targeting constructs. Together, these tools enable automated rAAV targeting construct design, generation as well as enrichment and expansion of targeted cells with desired integrations. PMID:25488813

  20. Insights into neural crest development and evolution from genomic analysis

    PubMed Central

    Simões-Costa, Marcos; Bronner, Marianne E.

    2013-01-01

    The neural crest is an excellent model system for the study of cell type diversification during embryonic development due to its multipotency, motility, and ability to form a broad array of derivatives ranging from neurons and glia, to cartilage, bone, and melanocytes. As a uniquely vertebrate cell population, it also offers important clues regarding vertebrate origins. In the past 30 yr, introduction of recombinant DNA technology has facilitated the dissection of the genetic program controlling neural crest development and has provided important insights into gene regulatory mechanisms underlying cell migration and differentiation. More recently, new genomic approaches have provided a platform and tools that are changing the depth and breadth of our understanding of neural crest development at a “systems” level. Such advances provide an insightful view of the regulatory landscape of neural crest cells and offer a new perspective on developmental as well as stem cell and cancer biology. PMID:23817048

  1. Insights into neural crest development and evolution from genomic analysis.

    PubMed

    Simões-Costa, Marcos; Bronner, Marianne E

    2013-07-01

    The neural crest is an excellent model system for the study of cell type diversification during embryonic development due to its multipotency, motility, and ability to form a broad array of derivatives ranging from neurons and glia, to cartilage, bone, and melanocytes. As a uniquely vertebrate cell population, it also offers important clues regarding vertebrate origins. In the past 30 yr, introduction of recombinant DNA technology has facilitated the dissection of the genetic program controlling neural crest development and has provided important insights into gene regulatory mechanisms underlying cell migration and differentiation. More recently, new genomic approaches have provided a platform and tools that are changing the depth and breadth of our understanding of neural crest development at a "systems" level. Such advances provide an insightful view of the regulatory landscape of neural crest cells and offer a new perspective on developmental as well as stem cell and cancer biology.

  2. Applying Genomic and Bioinformatic Resources to Human Adenovirus Genomes for Use in Vaccine Development and for Applications in Vector Development for Gene Delivery

    PubMed Central

    Seto, Jason; Walsh, Michael P.; Mahadevan, Padmanabhan; Zhang, Qiwei; Seto, Donald

    2010-01-01

    Technological advances and increasingly cost-effect methodologies in DNA sequencing and computational analysis are providing genome and proteome data for human adenovirus research. Applying these tools, data and derived knowledge to the development of vaccines against these pathogens will provide effective prophylactics. The same data and approaches can be applied to vector development for gene delivery in gene therapy and vaccine delivery protocols. Examination of several field strain genomes and their analyses provide examples of data that are available using these approaches. An example of the development of HAdV-B3 both as a vaccine and also as a vector is presented. PMID:21994597

  3. GORpipe: a query tool for working with sequence data based on a Genomic Ordered Relational (GOR) architecture.

    PubMed

    Guðbjartsson, Hákon; Georgsson, Guðmundur Fr; Guðjónsson, Sigurjón A; Valdimarsson, Ragnar Þór; Sigurðsson, Jóhann H; Stefánsson, Sigmar K; Másson, Gísli; Magnússon, Gísli; Pálmason, Vilmundur; Stefánsson, Kári

    2016-10-15

    Our aim was to create a general-purpose relational data format and analysis tools to provide an efficient and coherent framework for working with large volumes of DNA sequence data. For this purpose we developed the GORpipe software system. It is based on a genomic ordered architecture and uses a declarative query language that combines features from SQL and shell pipe syntax in a novel manner. The system can for instance be used to annotate sequence variants, find genomic spatial overlap between various types of genomic features, filter and aggregate them in various ways. The GORpipe software is freely available for non-commercial academic usage and can be downloaded from www.nextcode.com/gorpipe CONTACT: hakon@wuxinextcode.comSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  4. GORpipe: a query tool for working with sequence data based on a Genomic Ordered Relational (GOR) architecture

    PubMed Central

    Guðbjartsson, Hákon; Georgsson, Guðmundur Fr.; Guðjónsson, Sigurjón A.; Valdimarsson, Ragnar þór; Sigurðsson, Jóhann H.; Stefánsson, Sigmar K.; Másson, Gísli; Magnússon, Gísli; Pálmason, Vilmundur; Stefánsson, Kári

    2016-01-01

    Motivation: Our aim was to create a general-purpose relational data format and analysis tools to provide an efficient and coherent framework for working with large volumes of DNA sequence data. Results: For this purpose we developed the GORpipe software system. It is based on a genomic ordered architecture and uses a declarative query language that combines features from SQL and shell pipe syntax in a novel manner. The system can for instance be used to annotate sequence variants, find genomic spatial overlap between various types of genomic features, filter and aggregate them in various ways. Availability and Implementation: The GORpipe software is freely available for non-commercial academic usage and can be downloaded from www.nextcode.com/gorpipe. Contact: hakon@wuxinextcode.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27339714

  5. Fernald Silo Remote Retrieval Tool Development

    SciTech Connect

    Varma, V.K.

    2004-05-18

    A long-reach tool was developed to remove discrete objects from the silos at the Fernald Environmental Management Project in Ohio. If they are not removed, these objects can potentially cause problems during the retrieval and transfer of waste from the silos. Most of the objects are on top of the Bentogrout cap inside the silos at or near the primary opening into the tank and will therefore require only vertical lifting. The objects are located about 20 ft from the top of the silo. Although most of the objects can be retrieved from 20 ft, the long-reach tool was designed to for a reach up to 40 ft in case objects roll towards the walls of the tank or need to be removed during heel retrieval operations. This report provides a detailed description of the tool that was developed, tested, and demonstrated at the Tanks Technology Cold Test Facility at Oak Ridge National Laboratory. Scaffolding was erected over two experimental cells to simulate the 40-ft maximum working depth anticipated in the silos at Fernald. Plastic bottles and plastic sheeting simulated the debris that could be encountered during waste retrieval operations.

  6. SE Requirements Development Tool User Guide

    SciTech Connect

    Benson, Faith Ann

    2016-05-13

    The LANL Systems Engineering Requirements Development Tool (SERDT) is a data collection tool created in InfoPath for use with the Los Alamos National Laboratory’s (LANL) SharePoint sites. Projects can fail if a clear definition of the final product requirements is not performed. For projects to be successful requirements must be defined early in the project and those requirements must be tracked during execution of the project to ensure the goals of the project are met. Therefore, the focus of this tool is requirements definition. The content of this form is based on International Council on Systems Engineering (INCOSE) and Department of Defense (DoD) process standards and allows for single or collaborative input. The “Scoping” section is where project information is entered by the project team prior to requirements development, and includes definitions and examples to assist the user in completing the forms. The data entered will be used to define the requirements and once the form is filled out, a “Requirements List” is automatically generated and a Word document is created and saved to a SharePoint document library. SharePoint also includes the ability to download the requirements data defined in the InfoPath from into an Excel spreadsheet. This User Guide will assist you in navigating through the data entry process.

  7. Genome-wide copy number analysis using copy number inferring tool (CNIT) and DNA pooling.

    PubMed

    Lin, Chien-hsing; Huang, Mei-chu; Li, Ling-hui; Wu, Jer-yuarn; Chen, Yuan-tsong; Fann, Cathy S J

    2008-08-01

    Copy number variation (CNV) has become an important genomic structure element in the human population, and some CNVs are related to specific traits and diseases. Moreover, analysis of human genomes has been potentiated by the use of high-resolution microarrays that assess single nucleotide polymorphisms (SNPs). Although many programs have been designed to analyze data from Affymetrix SNP microarrays, they all have high false-positive rates (FPRs) in copy number (CN) analyses. Copy number analysis tool (CNAT) 4.0 is a recently developed program that offers improved CN estimation, but small amplifications and deletions are lost when using the smoothing procedure. Here, we propose a copy number inferring tool (CNIT) algorithm for the 100K SNP microarray to investigate CNVs at 29.6-kb resolution. CNIT estimated SNP allelic and total CN with reliable P values based on intensity data. In addition, the hidden Markov model (HMM) method was applied to predict regions having altered CN by considering contiguous SNPs. Based on a CN analysis of 23 unrelated Taiwanese and 30 HapMap Centre d'Etude du Polymorphisme Humain (CEPH) trios, CNIT showed higher accuracy and power than other programs. The FPRs and false-negative rates (FNRs) of CNIT were 0.1% and 0.16%, respectively. CNIT also showed better sensitivity for detecting small amplifications and deletions. Furthermore, DNA pooling of 10 and 30 normal unrelated individuals were applied to the 100K SNP microarray, respectively, and 12 common CN-variable regions were identified, suggesting that DNA pooling can be applied to discover common CNVs.

  8. The Human Ageing Genomic Resources: online databases and tools for biogerontologists.

    PubMed

    de Magalhães, João Pedro; Budovsky, Arie; Lehmann, Gilad; Costa, Joana; Li, Yang; Fraifeld, Vadim; Church, George M

    2009-02-01

    Aging is a complex, challenging phenomenon that requires multiple, interdisciplinary approaches to unravel its puzzles. To assist basic research on aging, we developed the Human Ageing Genomic Resources (HAGR). This work provides an overview of the databases and tools in HAGR and describes how the gerontology research community can employ them. Several recent changes and improvements to HAGR are also presented. The two centrepieces in HAGR are GenAge and AnAge. GenAge is a gene database featuring genes associated with aging and longevity in model organisms, a curated database of genes potentially associated with human aging, and a list of genes tested for their association with human longevity. A myriad of biological data and information is included for hundreds of genes, making GenAge a reference for research that reflects our current understanding of the genetic basis of aging. GenAge can also serve as a platform for the systems biology of aging, and tools for the visualization of protein-protein interactions are also included. AnAge is a database of aging in animals, featuring over 4000 species, primarily assembled as a resource for comparative and evolutionary studies of aging. Longevity records, developmental and reproductive traits, taxonomic information, basic metabolic characteristics, and key observations related to aging are included in AnAge. Software is also available to aid researchers in the form of Perl modules to automate numerous tasks and as an SPSS script to analyse demographic mortality data. The HAGR are available online at http://genomics.senescence.info.

  9. Mapping by sequencing the Pneumocystis genome using the ordering DNA sequences V3 tool.

    PubMed Central

    Xu, Zheng; Lance, Britton; Vargas, Claudia; Arpinar, Budak; Bhandarkar, Suchendra; Kraemer, Eileen; Kochut, Krys J; Miller, John A; Wagner, Jeff R; Weise, Michael J; Wunderlich, John K; Stringer, James; Smulian, George; Cushion, Melanie T; Arnold, Jonathan

    2003-01-01

    A bioinformatics tool called ODS3 has been created for mapping by sequencing. The tool allows the creation of integrated genomic maps from genetic, physical mapping, and sequencing data and permits an integrated genome map to be stored, retrieved, viewed, and queried in a stand-alone capacity, in a client/server relationship with the Fungal Genome Database (FGDB), and as a web-browsing tool for the FGDB. In that ODS3 is programmed in Java, the tool promotes platform independence and supports export of integrated genome-mapping data in the extensible markup language (XML) for data interchange with other genome information systems. The tool ODS3 is used to create an initial integrated genome map of the AIDS-related fungal pathogen, Pneumocystis carinii. Contig dynamics would indicate that this physical map is approximately 50% complete with approximately 200 contigs. A total of 10 putative multigene families were found. Two of these putative families were previously characterized in P. carinii, namely the major surface glycoproteins (MSGs) and HSP70 proteins; three of these putative families (not previously characterized in P. carinii) were found to be similar to families encoding the HSP60 in Schizosaccharomyces pombe, the heat-shock psi protein in S. pombe, and the RNA synthetase family (i.e., MES1) in Saccharomyces cerevisiae. Physical mapping data are consistent with the 16S, 5.8S, and 26S rDNA genes being single copy in P. carinii. No other fungus outside this genus is known to have the rDNA genes in single copy. PMID:12702676

  10. Developing Decontamination Tools and Approaches to ...

    EPA Pesticide Factsheets

    Developing Decontamination Tools and Approaches to Address Indoor Pesticide Contamination from Improper Bed Bug Treatments The National Exposure Research Laboratory (NERL) Human Exposure and Atmospheric Sciences Division (HEASD) conducts research in support of EPA mission to protect human health and the environment. HEASD research program supports Goal 1 (Clean Air) and Goal 4 (Healthy People) of EPA strategic plan. More specifically, our division conducts research to characterize the movement of pollutants from the source to contact with humans. Our multidisciplinary research program produces Methods, Measurements, and Models to identify relationships between and characterize processes that link source emissions, environmental concentrations, human exposures, and target-tissue dose. The impact of these tools is improved regulatory programs and policies for EPA.

  11. Process-Based Quality (PBQ) Tools Development

    SciTech Connect

    Cummins, J.L.

    2001-12-03

    The objective of this effort is to benchmark the development of process-based quality tools for application in CAD (computer-aided design) model-based applications. The processes of interest are design, manufacturing, and quality process applications. A study was commissioned addressing the impact, current technologies, and known problem areas in application of 3D MCAD (3-dimensional mechanical computer-aided design) models and model integrity on downstream manufacturing and quality processes. The downstream manufacturing and product quality processes are profoundly influenced and dependent on model quality and modeling process integrity. The goal is to illustrate and expedite the modeling and downstream model-based technologies for available or conceptual methods and tools to achieve maximum economic advantage and advance process-based quality concepts.

  12. ParameciumDB in 2011: new tools and new data for functional and comparative genomics of the model ciliate Paramecium tetraurelia.

    PubMed

    Arnaiz, Olivier; Sperling, Linda

    2011-01-01

    ParameciumDB is a community model organism database built with the GMOD toolkit to integrate the genome and biology of the ciliate Paramecium tetraurelia. Over the last four years, post-genomic data from proteome and transcriptome studies has been incorporated along with predicted orthologs in 33 species, annotations from the community and publications from the scientific literature. Available tools include BioMart for complex queries, GBrowse2 for genome browsing, the Apollo genome editor for expert curation of gene models, a Blast server, a motif finder, and a wiki for protocols, nomenclature guidelines and other documentation. In-house tools have been developed for ontology browsing and evaluation of off-target RNAi matches. Now ready for next-generation deep sequencing data and the genomes of other Paramecium species, this open-access resource is available at http://paramecium.cgm.cnrs-gif.fr.

  13. Spatial Genome Organization and Its Emerging Role as a Potential Diagnosis Tool

    PubMed Central

    Meaburn, Karen J.

    2016-01-01

    In eukaryotic cells the genome is highly spatially organized. Functional relevance of higher order genome organization is implied by the fact that specific genes, and even whole chromosomes, alter spatial position in concert with functional changes within the nucleus, for example with modifications to chromatin or transcription. The exact molecular pathways that regulate spatial genome organization and the full implication to the cell of such an organization remain to be determined. However, there is a growing realization that the spatial organization of the genome can be used as a marker of disease. While global genome organization patterns remain largely conserved in disease, some genes and chromosomes occupy distinct nuclear positions in diseased cells compared to their normal counterparts, with the patterns of reorganization differing between diseases. Importantly, mapping the spatial positioning patterns of specific genomic loci can distinguish cancerous tissue from benign with high accuracy. Genome positioning is an attractive novel biomarker since additional quantitative biomarkers are urgently required in many cancer types. Current diagnostic techniques are often subjective and generally lack the ability to identify aggressive cancer from indolent, which can lead to over- or under-treatment of patients. Proof-of-principle for the use of genome positioning as a diagnostic tool has been provided based on small scale retrospective studies. Future large-scale studies are required to assess the feasibility of bringing spatial genome organization-based diagnostics to the clinical setting and to determine if the positioning patterns of specific loci can be useful biomarkers for cancer prognosis. Since spatial reorganization of the genome has been identified in multiple human diseases, it is likely that spatial genome positioning patterns as a diagnostic biomarker may be applied to many diseases. PMID:27507988

  14. Genetic and genomic tools for the marine annelid Platynereis dumerilii.

    PubMed

    Zantke, Juliane; Bannister, Stephanie; Rajan, Vinoth Babu Veedin; Raible, Florian; Tessmar-Raible, Kristin

    2014-05-01

    The bristle worm Platynereis dumerilii displays many interesting biological characteristics. These include its reproductive timing, which is synchronized to the moon phase, its regenerative capacity that is hormonally controlled, and a slow rate of evolution, which permits analyses of ancestral genes and cell types. As a marine annelid, Platynereis is also representative of the marine ecosystem, as well as one of the three large animal subphyla, the Lophotrochozoa. Here, we provide an overview of the molecular resources, functional techniques, and behavioral assays that have recently been established for the bristle worm. This combination of tools now places Platynereis in an excellent position to advance research at the frontiers of neurobiology, chronobiology, evo-devo, and marine biology.

  15. Genetic and Genomic Tools for the Marine Annelid Platynereis dumerilii

    PubMed Central

    Zantke, Juliane; Bannister, Stephanie; Rajan, Vinoth Babu Veedin; Raible, Florian; Tessmar-Raible, Kristin

    2014-01-01

    The bristle worm Platynereis dumerilii displays many interesting biological characteristics. These include its reproductive timing, which is synchronized to the moon phase, its regenerative capacity that is hormonally controlled, and a slow rate of evolution, which permits analyses of ancestral genes and cell types. As a marine annelid, Platynereis is also representative of the marine ecosystem, as well as one of the three large animal subphyla, the Lophotrochozoa. Here, we provide an overview of the molecular resources, functional techniques, and behavioral assays that have recently been established for the bristle worm. This combination of tools now places Platynereis in an excellent position to advance research at the frontiers of neurobiology, chronobiology, evo-devo, and marine biology. PMID:24807110

  16. Excisable cassettes: new tools for functional analysis of Streptomyces genomes.

    PubMed

    Raynal, Alain; Karray, Fatma; Tuphile, Karine; Darbon-Rongère, Emmanuelle; Pernodet, Jean-Luc

    2006-07-01

    The functional analysis of microbial genomes often requires gene inactivation. We constructed a set of cassettes consisting of single antibiotic resistance genes flanked by the attL and attR sites resulting from site-specific integration of the Streptomyces pSAM2 element. These cassettes can easily be used to inactivate genes by in-frame deletion in Streptomyces by a three-step strategy. In the first step, in Escherichia coli, the cassette is inserted into a cloned copy of the gene to be inactivated. In the second step, the gene is replaced by homologous recombination in Streptomyces, allowing substitution of the wild-type target gene with its inactivated counterpart. In the third step, the cassette can be removed by expression of the pSAM2 genes xis and int. The resulting strains are marker-free and contain an "attB-like" sequence of 33, 34, or 35 bp with no stop codon if the cassette is correctly chosen. Thus, a gene can be disrupted by creating an in-frame deletion, avoiding polar effects if downstream genes are cotranscribed with the target gene. A set of cassettes was constructed to contain a hygromycin or gentamicin resistance gene flanked by the attL and attR sites. The initial constructions carrying convenient cloning sites allow the insertion of any other marker gene. We tested insertion and excision by inserting a cassette into orf3, the third gene of an operon involved in spiramycin biosynthesis. We verified that the cassette exerted a polar effect on the transcription of downstream genes but that, after excision, complementation with orf3 alone restored spiramycin production.

  17. Excisable Cassettes: New Tools for Functional Analysis of Streptomyces Genomes

    PubMed Central

    Raynal, Alain; Karray, Fatma; Tuphile, Karine; Darbon-Rongère, Emmanuelle; Pernodet, Jean-Luc

    2006-01-01

    The functional analysis of microbial genomes often requires gene inactivation. We constructed a set of cassettes consisting of single antibiotic resistance genes flanked by the attL and attR sites resulting from site-specific integration of the Streptomyces pSAM2 element. These cassettes can easily be used to inactivate genes by in-frame deletion in Streptomyces by a three-step strategy. In the first step, in Escherichia coli, the cassette is inserted into a cloned copy of the gene to be inactivated. In the second step, the gene is replaced by homologous recombination in Streptomyces, allowing substitution of the wild-type target gene with its inactivated counterpart. In the third step, the cassette can be removed by expression of the pSAM2 genes xis and int. The resulting strains are marker-free and contain an “attB-like” sequence of 33, 34, or 35 bp with no stop codon if the cassette is correctly chosen. Thus, a gene can be disrupted by creating an in-frame deletion, avoiding polar effects if downstream genes are cotranscribed with the target gene. A set of cassettes was constructed to contain a hygromycin or gentamicin resistance gene flanked by the attL and attR sites. The initial constructions carrying convenient cloning sites allow the insertion of any other marker gene. We tested insertion and excision by inserting a cassette into orf3, the third gene of an operon involved in spiramycin biosynthesis. We verified that the cassette exerted a polar effect on the transcription of downstream genes but that, after excision, complementation with orf3 alone restored spiramycin production. PMID:16820478

  18. gSearch: a fast and flexible general search tool for whole-genome sequencing.

    PubMed

    Song, Taemin; Hwang, Kyu-Baek; Hsing, Michael; Lee, Kyungjoon; Bohn, Justin; Kong, Sek Won

    2012-08-15

    Various processes such as annotation and filtering of variants or comparison of variants in different genomes are required in whole-genome or exome analysis pipelines. However, processing different databases and searching among millions of genomic loci is not trivial. gSearch compares sequence variants in the Genome Variation Format (GVF) or Variant Call Format (VCF) with a pre-compiled annotation or with variants in other genomes. Its search algorithms are subsequently optimized and implemented in a multi-threaded manner. The proposed method is not a stand-alone annotation tool with its own reference databases. Rather, it is a search utility that readily accepts public or user-prepared reference files in various formats including GVF, Generic Feature Format version 3 (GFF3), Gene Transfer Format (GTF), VCF and Browser Extensible Data (BED) format. Compared to existing tools such as ANNOVAR, gSearch runs more than 10 times faster. For example, it is capable of annotating 52.8 million variants with allele frequencies in 6 min. gSearch is available at http://ml.ssu.ac.kr/gSearch. It can be used as an independent search tool or can easily be integrated to existing pipelines through various programming environments such as Perl, Ruby and Python.

  19. Seed development and genomic imprinting in plants.

    PubMed

    Köhler, Claudia; Grossniklaus, Ueli

    2005-01-01

    Genomic imprinting refers to an epigenetic phenomenon where the activity of an allele depends on its parental origin. Imprinting at individual genes has only been described in mammals and seed plants. We will discuss the role imprinted genes play in seed development and compare the situation in plants with that in mammals. Interestingly, many imprinted genes appear to control cell proliferation and growth in both groups of organisms although imprinting in plants may also be involved in the cellular differentiation of the two pairs of gametes involved in double fertilization. DNA methylation plays some role in the control of parent-of-origin-specific expression in both mammals and plants. Thus, although imprinting evolved independently in mammals and plants, there are striking similarities at the phenotypic and possibly also mechanistic level.

  20. Genomic tools to improve progress and preserve variation for future generations

    USDA-ARS?s Scientific Manuscript database

    Use of genomic tools has greatly decreased generation intervals and increased genetic progress in dairy cattle, but faster selection cycles can also increase rates of inbreeding per unit of time. Average pedigree inbreeding of Holstein cows increased from 4.6% in 2000 to 5.6% in 2009 to 6.6% in 201...

  1. Applying Genomic and Genetic Tools to Understand and Mitigate Damage from Exposure to Toxins

    DTIC Science & Technology

    2011-10-01

    Understand and Mitigate Damage from Exposure to Toxins PRINCIPAL INVESTIGATOR: Richard Myers, Ph.D...2010 – 22 September 2011 4. TITLE AND SUBTITLE 5a. CONTRACT NUMBER Applying Genomic and Genetic Tools to Understand and Mitigate Damage from...dose-response to PB in larval and adult zebrafish at the biochemical and transcriptomic levels, validated a methods of treatment for inducing stress

  2. Developing a mapping tool for tablets

    NASA Astrophysics Data System (ADS)

    Vaughan, Alan; Collins, Nathan; Krus, Mike

    2014-05-01

    Digital field mapping offers significant benefits when compared with traditional paper mapping techniques in that it provides closer integration with downstream geological modelling and analysis. It also provides the mapper with the ability to rapidly integrate new data with existing databases without the potential degradation caused by repeated manual transcription of numeric, graphical and meta-data. In order to achieve these benefits, a number of PC-based digital mapping tools are available which have been developed for specific communities, eg the BGS•SIGMA project, Midland Valley's FieldMove®, and a range of solutions based on ArcGIS® software, which can be combined with either traditional or digital orientation and data collection tools. However, with the now widespread availability of inexpensive tablets and smart phones, a user led demand for a fully integrated tablet mapping tool has arisen. This poster describes the development of a tablet-based mapping environment specifically designed for geologists. The challenge was to deliver a system that would feel sufficiently close to the flexibility of paper-based geological mapping while being implemented on a consumer communication and entertainment device. The first release of a tablet-based geological mapping system from this project is illustrated and will be shown as implemented on an iPad during the poster session. Midland Valley is pioneering tablet-based mapping and, along with its industrial and academic partners, will be using the application in field based projects throughout this year and will be integrating feedback in further developments of this technology.

  3. EPICS simulation tools for control system development

    SciTech Connect

    Wright, R.M.; Kerstiens, D.M.; Vaughn, G.D.; Weiss, R.E.

    1994-09-01

    When developing control system software there are many times when the ability to simulate the response of the instrumentation can be very useful. Examples are: (i) when the operator interface is being designed and the users want an idea of what the finished system might took like; (ii) when the interface hardware is not yet available; (iii) when the reaction of the control system to an error condition must be tested, but the actual occurrence of such an error would cause undesirable side effects; (iv) when operators are being trained to use the system; (v) when an improvement or bug fix needs to be tested, but the running system cannot be shut down for long. The Experimental Physics and Industrial Control System (EPICS) provides tools for building simple simulations and interfacing to more complex simulations of accelerator hardware. At the lowest level an individual data channel can be switched to take its input from either a simulated data location or from the actual hardware. At a slightly higher level, sequences can be run on the real-time interface processor so that output to the hardware is intercepted and an appropriate substitute value is provided for the corresponding read-back records. At a still higher level a program can use the Channel Access software bus facility of EPICS to control some global aspect of an accelerator or can interface to an external accelerator simulation instead of the actual accelerator. The goal of testing control system software using simulated hardware is to minimize the changes required in shifting between the simulated system and the real system. The degree of success of the EPICS tools in meeting the minimum change goal will be addressed with suggestions for improvements. The implementation of simulated responses using EPICS tools will be discussed and examples of experience using the EPICS tools to create and interface to simulations will be given.

  4. Development of a lightning activity nowcasting tool

    NASA Astrophysics Data System (ADS)

    Karagiannidis, Athanassios; Lagouvardos, Kostas; Kotroni, Vassiliki

    2015-04-01

    Electrical phenomena inside thunderstorm clouds are a significant threat to numerous activities. Summertime convective activity is usually associated to local thermal instability, which is hard to predict using numerical weather prediction models. Despite their relatively small areal extend, these thunderstorms can be violent, resulting to infrastructure damage and loss of life. In the frame of TALOS project the National Observatory of Athens has developed a lightning activity nowcasting tool. This tool uses as sole inputs: (i) real time infrared Meteosat Second Generation (MSG) imagery and (ii) real time flashes provided by the VLF lightning detection system ZEUS, which is operated by the National Observatory of Athens. The MSG SEVIRI 10.8 and 6.2μm channels data are utilized to produce 3 Interest Fields (IFs). These fields are the TB10.8 brightness temperature (indicative of the cloud top glaciation), the TB6.2-TB10.8 difference (indicative of the cloud depth) and the TB10.8 15 minute trend, which will be referenced as "TB10.8trend" (indicative of the cloud growth rate). The latter is defined as the difference between two successive 15 minutes images of the TB10.8. When a predefined threshold value is surpassed, the delimited area is considered to be favorable for lightning activity. A statistical procedure is employed to identify the optimum threshold values for the three IFs, based on the performance of each one. The assessment of their efficiency showed that these three IFs can be used independently as predictors of lightning activity. However, in an effort to improve the tool's efficiency a combined estimation is performed. When all three IFs agree that lightning activity is expected over an area, then a Warning Level 3 (WL3) is issued. When 2 or 1 IFs indicate upcoming activity then a WL2 or WL1 is issued. The assessment of the efficiency of the combined IF tool showed that the combined estimation is more skillful than the individual IFs estimations. In a

  5. Development of a language identification tool.

    PubMed

    Davies, Charlotte; Milligan, Frank

    Service improvement is an important aspect of healthcare practice. Practitioners need to identify improvements in processes to free up time and resources for patient care. The obligation to do this falls to all staff, from students to chief executives. The project described in this article was led by a nursing student at the University of Bedfordshire, Luton, as part of the final-year assessment requirement of the pre-registration nursing degree programme. The project involved the development of a language identification tool to address communication barriers in the emergency department.

  6. CRISPR-Cas: From the Bacterial Adaptive Immune System to a Versatile Tool for Genome Engineering.

    PubMed

    Kirchner, Marion; Schneider, Sabine

    2015-11-09

    The field of biology has been revolutionized by the recent advancement of an adaptive bacterial immune system as a universal genome engineering tool. Bacteria and archaea use repetitive genomic elements termed clustered regularly interspaced short palindromic repeats (CRISPR) in combination with an RNA-guided nuclease (CRISPR-associated nuclease: Cas) to target and destroy invading DNA. By choosing the appropriate sequence of the guide RNA, this two-component system can be used to efficiently modify, target, and edit genomic loci of interest in plants, insects, fungi, mammalian cells, and whole organisms. This has opened up new frontiers in genome engineering, including the potential to treat or cure human genetic disorders. Now the potential risks as well as the ethical, social, and legal implications of this powerful new technique move into the limelight.

  7. Accelerating Genome Editing in CHO Cells Using CRISPR Cas9 and CRISPy, a Web-Based Target Finding Tool

    PubMed Central

    Ronda, Carlotta; Pedersen, Lasse Ebdrup; Hansen, Henning Gram; Kallehauge, Thomas Beuchert; Betenbaugh, Michael J; Nielsen, Alex Toftgaard; Kildegaard, Helene Faustrup

    2014-01-01

    Chinese hamster ovary (CHO) cells are widely used in the biopharmaceutical industry as a host for the production of complex pharmaceutical proteins. Thus genome engineering of CHO cells for improved product quality and yield is of great interest. Here, we demonstrate for the first time the efficacy of the CRISPR Cas9 technology in CHO cells by generating site-specific gene disruptions in COSMC and FUT8, both of which encode proteins involved in glycosylation. The tested single guide RNAs (sgRNAs) created an indel frequency up to 47.3% in COSMC, while an indel frequency up to 99.7% in FUT8 was achieved by applying lectin selection. All eight sgRNAs examined in this study resulted in relatively high indel frequencies, demonstrating that the Cas9 system is a robust and efficient genome-editing methodology in CHO cells. Deep sequencing revealed that 85% of the indels created by Cas9 resulted in frameshift mutations at the target sites, with a strong preference for single base indels. Finally, we have developed a user-friendly bioinformatics tool, named “CRISPy” for rapid identification of sgRNA target sequences in the CHO-K1 genome. The CRISPy tool identified 1,970,449 CRISPR targets divided into 27,553 genes and lists the number of off-target sites in the genome. In conclusion, the proven functionality of Cas9 to edit CHO genomes combined with our CRISPy database have the potential to accelerate genome editing and synthetic biology efforts in CHO cells. Biotechnol. Bioeng. 2014; 111: 1604–1616. © 2014 The Authors. Biotechnology and Bioengineering Published by Wiley Periodicals, Inc. PMID:24827782

  8. Accelerating genome editing in CHO cells using CRISPR Cas9 and CRISPy, a web-based target finding tool.

    PubMed

    Ronda, Carlotta; Pedersen, Lasse Ebdrup; Hansen, Henning Gram; Kallehauge, Thomas Beuchert; Betenbaugh, Michael J; Nielsen, Alex Toftgaard; Kildegaard, Helene Faustrup

    2014-08-01

    Chinese hamster ovary (CHO) cells are widely used in the biopharmaceutical industry as a host for the production of complex pharmaceutical proteins. Thus genome engineering of CHO cells for improved product quality and yield is of great interest. Here, we demonstrate for the first time the efficacy of the CRISPR Cas9 technology in CHO cells by generating site-specific gene disruptions in COSMC and FUT8, both of which encode proteins involved in glycosylation. The tested single guide RNAs (sgRNAs) created an indel frequency up to 47.3% in COSMC, while an indel frequency up to 99.7% in FUT8 was achieved by applying lectin selection. All eight sgRNAs examined in this study resulted in relatively high indel frequencies, demonstrating that the Cas9 system is a robust and efficient genome-editing methodology in CHO cells. Deep sequencing revealed that 85% of the indels created by Cas9 resulted in frameshift mutations at the target sites, with a strong preference for single base indels. Finally, we have developed a user-friendly bioinformatics tool, named "CRISPy" for rapid identification of sgRNA target sequences in the CHO-K1 genome. The CRISPy tool identified 1,970,449 CRISPR targets divided into 27,553 genes and lists the number of off-target sites in the genome. In conclusion, the proven functionality of Cas9 to edit CHO genomes combined with our CRISPy database have the potential to accelerate genome editing and synthetic biology efforts in CHO cells.

  9. Decoding the genome with an integrative analysis tool: combinatorial CRM Decoder.

    PubMed

    Kang, Keunsoo; Kim, Joomyeong; Chung, Jae Hoon; Lee, Daeyoup

    2011-09-01

    The identification of genome-wide cis-regulatory modules (CRMs) and characterization of their associated epigenetic features are fundamental steps toward the understanding of gene regulatory networks. Although integrative analysis of available genome-wide information can provide new biological insights, the lack of novel methodologies has become a major bottleneck. Here, we present a comprehensive analysis tool called combinatorial CRM decoder (CCD), which utilizes the publicly available information to identify and characterize genome-wide CRMs in a species of interest. CCD first defines a set of the epigenetic features which is significantly associated with a set of known CRMs as a code called 'trace code', and subsequently uses the trace code to pinpoint putative CRMs throughout the genome. Using 61 genome-wide data sets obtained from 17 independent mouse studies, CCD successfully catalogued ∼12 600 CRMs (five distinct classes) including polycomb repressive complex 2 target sites as well as imprinting control regions. Interestingly, we discovered that ∼4% of the identified CRMs belong to at least two different classes named 'multi-functional CRM', suggesting their functional importance for regulating spatiotemporal gene expression. From these examples, we show that CCD can be applied to any potential genome-wide datasets and therefore will shed light on unveiling genome-wide CRMs in various species.

  10. Cryogenic Propellant Feed System Analytical Tool Development

    NASA Technical Reports Server (NTRS)

    Lusby, Brian S.; Miranda, Bruno M.; Collins, Jacob A.

    2011-01-01

    The Propulsion Systems Branch at NASA s Lyndon B. Johnson Space Center (JSC) has developed a parametric analytical tool to address the need to rapidly predict heat leak into propellant distribution lines based on insulation type, installation technique, line supports, penetrations, and instrumentation. The Propellant Feed System Analytical Tool (PFSAT) will also determine the optimum orifice diameter for an optional thermodynamic vent system (TVS) to counteract heat leak into the feed line and ensure temperature constraints at the end of the feed line are met. PFSAT was developed primarily using Fortran 90 code because of its number crunching power and the capability to directly access real fluid property subroutines in the Reference Fluid Thermodynamic and Transport Properties (REFPROP) Database developed by NIST. A Microsoft Excel front end user interface was implemented to provide convenient portability of PFSAT among a wide variety of potential users and its ability to utilize a user-friendly graphical user interface (GUI) developed in Visual Basic for Applications (VBA). The focus of PFSAT is on-orbit reaction control systems and orbital maneuvering systems, but it may be used to predict heat leak into ground-based transfer lines as well. PFSAT is expected to be used for rapid initial design of cryogenic propellant distribution lines and thermodynamic vent systems. Once validated, PFSAT will support concept trades for a variety of cryogenic fluid transfer systems on spacecraft, including planetary landers, transfer vehicles, and propellant depots, as well as surface-based transfer systems. The details of the development of PFSAT, its user interface, and the program structure will be presented.

  11. Recent updates and developments to plant genome size databases.

    PubMed

    Garcia, Sònia; Leitch, Ilia J; Anadon-Rosell, Alba; Canela, Miguel Á; Gálvez, Francisco; Garnatje, Teresa; Gras, Airy; Hidalgo, Oriane; Johnston, Emmeline; Mas de Xaxars, Gemma; Pellicer, Jaume; Siljak-Yakovlev, Sonja; Vallès, Joan; Vitales, Daniel; Bennett, Michael D

    2014-01-01

    Two plant genome size databases have been recently updated and/or extended: the Plant DNA C-values database (http://data.kew.org/cvalues), and GSAD, the Genome Size in Asteraceae database (http://www.asteraceaegenomesize.com). While the first provides information on nuclear DNA contents across land plants and some algal groups, the second is focused on one of the largest and most economically important angiosperm families, Asteraceae. Genome size data have numerous applications: they can be used in comparative studies on genome evolution, or as a tool to appraise the cost of whole-genome sequencing programs. The growing interest in genome size and increasing rate of data accumulation has necessitated the continued update of these databases. Currently, the Plant DNA C-values database (Release 6.0, Dec. 2012) contains data for 8510 species, while GSAD has 1219 species (Release 2.0, June 2013), representing increases of 17 and 51%, respectively, in the number of species with genome size data, compared with previous releases. Here we provide overviews of the most recent releases of each database, and outline new features of GSAD. The latter include (i) a tool to visually compare genome size data between species, (ii) the option to export data and (iii) a webpage containing information about flow cytometry protocols.

  12. Recent updates and developments to plant genome size databases

    PubMed Central

    Garcia, Sònia; Leitch, Ilia J.; Anadon-Rosell, Alba; Canela, Miguel Á.; Gálvez, Francisco; Garnatje, Teresa; Gras, Airy; Hidalgo, Oriane; Johnston, Emmeline; Mas de Xaxars, Gemma; Pellicer, Jaume; Siljak-Yakovlev, Sonja; Vallès, Joan; Vitales, Daniel; Bennett, Michael D.

    2014-01-01

    Two plant genome size databases have been recently updated and/or extended: the Plant DNA C-values database (http://data.kew.org/cvalues), and GSAD, the Genome Size in Asteraceae database (http://www.asteraceaegenomesize.com). While the first provides information on nuclear DNA contents across land plants and some algal groups, the second is focused on one of the largest and most economically important angiosperm families, Asteraceae. Genome size data have numerous applications: they can be used in comparative studies on genome evolution, or as a tool to appraise the cost of whole-genome sequencing programs. The growing interest in genome size and increasing rate of data accumulation has necessitated the continued update of these databases. Currently, the Plant DNA C-values database (Release 6.0, Dec. 2012) contains data for 8510 species, while GSAD has 1219 species (Release 2.0, June 2013), representing increases of 17 and 51%, respectively, in the number of species with genome size data, compared with previous releases. Here we provide overviews of the most recent releases of each database, and outline new features of GSAD. The latter include (i) a tool to visually compare genome size data between species, (ii) the option to export data and (iii) a webpage containing information about flow cytometry protocols. PMID:24288377

  13. Bootstrap, Bayesian probability and maximum likelihood mapping: exploring new tools for comparative genome analyses

    PubMed Central

    Zhaxybayeva, Olga; Gogarten, J Peter

    2002-01-01

    Background Horizontal gene transfer (HGT) played an important role in shaping microbial genomes. In addition to genes under sporadic selection, HGT also affects housekeeping genes and those involved in information processing, even ribosomal RNA encoding genes. Here we describe tools that provide an assessment and graphic illustration of the mosaic nature of microbial genomes. Results We adapted the Maximum Likelihood (ML) mapping to the analyses of all detected quartets of orthologous genes found in four genomes. We have automated the assembly and analyses of these quartets of orthologs given the selection of four genomes. We compared the ML-mapping approach to more rigorous Bayesian probability and Bootstrap mapping techniques. The latter two approaches appear to be more conservative than the ML-mapping approach, but qualitatively all three approaches give equivalent results. All three tools were tested on mitochondrial genomes, which presumably were inherited as a single linkage group. Conclusions In some instances of interphylum relationships we find nearly equal numbers of quartets strongly supporting the three possible topologies. In contrast, our analyses of genome quartets containing the cyanobacterium Synechocystis sp. indicate that a large part of the cyanobacterial genome is related to that of low GC Gram positives. Other groups that had been suggested as sister groups to the cyanobacteria contain many fewer genes that group with the Synechocystis orthologs. Interdomain comparisons of genome quartets containing the archaeon Halobacterium sp. revealed that Halobacterium sp. shares more genes with Bacteria that live in the same environment than with Bacteria that are more closely related based on rRNA phylogeny . Many of these genes encode proteins involved in substrate transport and metabolism and in information storage and processing. The performed analyses demonstrate that relationships among prokaryotes cannot be accurately depicted by or inferred from

  14. From the genome to the phenome: tools to understand the basic biology of Plasmodium falciparum.

    PubMed

    Webster, Wesley A J; McFadden, Geoffrey I

    2014-01-01

    Malaria plagues one out of every 30 humans and contributes to almost a million deaths, and the problem could worsen. Our current therapeutic options are compromised by emerging resistance by the parasite to our front line drugs. It is thus imperative to better understand the basic biology of the parasite and develop novel drugs to stem this disease. The most facile approach to analyse a gene's function is to remove it from the genome or inhibit its activity. Although genetic manipulation of the human malaria parasite Plasmodium falciparum is a relatively standard procedure, there is no optimal method to perturb genes essential to the intraerythrocytic development cycle--the part of the life cycle that produces the clinical manifestation of malaria. This is a severe impediment to progress because the phenotype we wish to study is exactly the one that is so elusive. In the absence of any utilitarian way to conditionally delete essential genes, we are prevented from investigating the parasite's most vulnerable points. This review aims to focus on the development of tools identifying essential genes of P. falciparum and our ability to elicit phenotypic mutation. © 2014 The Author(s) Journal of Eukaryotic Microbiology © 2014 International Society of Protistologists.

  15. Genetics and genomics of flower initiation and development in roses

    PubMed Central

    Bendahmane, Mohammed

    2013-01-01

    Roses hold high symbolic value and great cultural importance in different societies throughout human history. They are widely used as garden ornamental plants, as cut flowers, and for the production of essential oils for the perfume and cosmetic industries. Domestication of roses has a long and complex history, and the rose species have been hybridized across vast geographic areas such as Europe, Asia, and the Middle East. The domestication processes selected several flower characters affecting floral quality, such as recurrent flowering, double flowers, petal colours, and fragrance. The molecular and genetic events that determine some of these flower characters cannot be studied using model species such as Arabidopsis thaliana, or at least only in a limited manner. In this review, we comment on the recent development of genetic, genomic, and transcriptomic tools for roses, and then focus on recent advances that have helped unravel the molecular mechanisms underlying several rose floral traits. PMID:23364936

  16. Genetics and genomics of flower initiation and development in roses.

    PubMed

    Bendahmane, Mohammed; Dubois, Annick; Raymond, Olivier; Bris, Manuel Le

    2013-02-01

    Roses hold high symbolic value and great cultural importance in different societies throughout human history. They are widely used as garden ornamental plants, as cut flowers, and for the production of essential oils for the perfume and cosmetic industries. Domestication of roses has a long and complex history, and the rose species have been hybridized across vast geographic areas such as Europe, Asia, and the Middle East. The domestication processes selected several flower characters affecting floral quality, such as recurrent flowering, double flowers, petal colours, and fragrance. The molecular and genetic events that determine some of these flower characters cannot be studied using model species such as Arabidopsis thaliana, or at least only in a limited manner. In this review, we comment on the recent development of genetic, genomic, and transcriptomic tools for roses, and then focus on recent advances that have helped unravel the molecular mechanisms underlying several rose floral traits.

  17. Suite of tools for statistical N-gram language modeling for pattern mining in whole genome sequences.

    PubMed

    Ganapathiraju, Madhavi K; Mitchell, Asia D; Thahir, Mohamed; Motwani, Kamiya; Ananthasubramanian, Seshan

    2012-12-01

    Genome sequences contain a number of patterns that have biomedical significance. Repetitive sequences of various kinds are a primary component of most of the genomic sequence patterns. We extended the suffix-array based Biological Language Modeling Toolkit to compute n-gram frequencies as well as n-gram language-model based perplexity in windows over the whole genome sequence to find biologically relevant patterns. We present the suite of tools and their application for analysis on whole human genome sequence.

  18. Developing a biomonitoring tool for fine sediment

    NASA Astrophysics Data System (ADS)

    Turley, Matt; Bilotta, Gary; Brazier, Richard; Extence, Chris

    2014-05-01

    Sediment is an essential component of freshwater ecosystems; however anthropogenic activities can lead to elevated sediment delivery which can impact on the physical, chemical and biological characteristics of these ecosystems. Ultimately, this can result in a loss of ecosystem services worth more than 1.7 trillion per annum. As such it is important that sediment, which is one of the most commonly attributed causes of water quality impairment globally, is managed in order to minimise these impacts. The current EU environmental quality standard for sediment (monitored in the form of suspended solids) is 25 mg L-1 for all environments. It is widely recognised that this standard is unsuitable and not ecologically relevant. Furthermore, it requires a substantial resource investment to monitor sediment in this form as part of national and international water resource legislation. In recognition of this the development of sediment-specific biomonitoring tools is receiving increasing attention. The Proportion of Sediment-Sensitive Invertebrates (PSI) index is one such tool that is designed to indicate levels of fine sediment (

  19. GMATA: An Integrated Software Package for Genome-Scale SSR Mining, Marker Development and Viewing

    PubMed Central

    Wang, Xuewen; Wang, Le

    2016-01-01

    Simple sequence repeats (SSRs), also referred to as microsatellites, are highly variable tandem DNAs that are widely used as genetic markers. The increasing availability of whole-genome and transcript sequences provides information resources for SSR marker development. However, efficient software is required to efficiently identify and display SSR information along with other gene features at a genome scale. We developed novel software package Genome-wide Microsatellite Analyzing Tool Package (GMATA) integrating SSR mining, statistical analysis and plotting, marker design, polymorphism screening and marker transferability, and enabled simultaneously display SSR markers with other genome features. GMATA applies novel strategies for SSR analysis and primer design in large genomes, which allows GMATA to perform faster calculation and provides more accurate results than existing tools. Our package is also capable of processing DNA sequences of any size on a standard computer. GMATA is user friendly, only requires mouse clicks or types inputs on the command line, and is executable in multiple computing platforms. We demonstrated the application of GMATA in plants genomes and reveal a novel distribution pattern of SSRs in 15 grass genomes. The most abundant motifs are dimer GA/TC, the A/T monomer and the GCG/CGC trimer, rather than the rich G/C content in DNA sequence. We also revealed that SSR count is a linear to the chromosome length in fully assembled grass genomes. GMATA represents a powerful application tool that facilitates genomic sequence analyses. GAMTA is freely available at http://sourceforge.net/projects/gmata/?source=navbar. PMID:27679641

  20. Getting trichy: tools and approaches to interrogating Trichomonas vaginalis in a post-genome world

    PubMed Central

    Conrad, Melissa D.; Bradic, Martina; Warring, Sally D.; Gorman, Andrew W.; Carlton, Jane M.

    2012-01-01

    Trichomonas vaginalis is a parasite of the urogenital tract in men and women, with a worldwide presence and significant implications for global public health. T. vaginalis research entered the age of genomics with the publication of the first genome sequence in 2007, yet subsequent utilization of other ‘omics’ technologies and methods has been slow. Here, we review some of the tools and approaches available to interrogate T. vaginalis biology, with an emphasis on recent advances and current limitations, and draw attention to areas where further efforts are needed to effectively examine the complex and intriguing biology of the parasite. PMID:23219217

  1. Cas-analyzer: an online tool for assessing genome editing results using NGS data.

    PubMed

    Park, Jeongbin; Lim, Kayeong; Kim, Jin-Soo; Bae, Sangsu

    2017-01-15

    Genome editing with programmable nucleases has been widely adopted in research and medicine. Next generation sequencing (NGS) platforms are now widely used for measuring the frequencies of mutations induced by CRISPR-Cas9 and other programmable nucleases. Here, we present an online tool, Cas-Analyzer, a JavaScript-based implementation for NGS data analysis. Because Cas-Analyzer is completely used at a client-side web browser on-the-fly, there is no need to upload very large NGS datasets to a server, a time-consuming step in genome editing analysis. Currently, Cas-Analyzer supports various programmable nucleases, including single nucleases and paired nucleases.

  2. Developing academic-practice partnerships to enhance the integration of genomics into public health.

    PubMed

    Raup, Sarah F; Oehlke, Kristin Peterson; Edwards, Karen L

    2005-04-01

    The sequencing of the human genome has provided tools to gain a better understanding of the role of genes and their interaction with environmental factors in the development of disease. However, much work remains in translating discoveries into new opportunities for disease prevention and health promotion. Both public health academia and practice have important roles to play in bridging the gap between the growth in knowledge stemming from the Human Genome Project and its application in public health. Recognizing this, the Centers for Disease Control and Prevention, through the Association of Schools of Public Health, established Centers for Genomics and Public Health at three schools of public health in 2001: the University of Michigan, the University of North Carolina, and the University of Washington. This paper describes the experience of the University of Washington Center for Genomics and Public Health in forging partnerships with public health practitioners to translate genomic advances into public health practice.

  3. Genome editing: intellectual property and product development in plant biotechnology.

    PubMed

    Schinkel, Helga; Schillberg, Stefan

    2016-07-01

    Genome editing is a revolutionary technology in molecular biology. While scientists are fascinated with the unlimited possibilities provided by directed and controlled changes in DNA in eukaryotes and have eagerly adopted such tools for their own experiments, an understanding of the intellectual property (IP) implications involved in bringing genome editing-derived products to market is often lacking. Due to the ingenuity of genome editing, the time between new product conception and its actual existence can be relatively short; therefore knowledge about IP of the various genome editing methods is relevant. This point must be regarded in a national framework as patents are instituted nationally. Therefore, when designing scientific work that could lead to a product, it is worthwhile to consider the different methods used for genome editing not only for their scientific merits but also for their compatibility with a speedy and reliable launch into the desired market.

  4. Development of the SOFIA Image Processing Tool

    NASA Technical Reports Server (NTRS)

    Adams, Alexander N.

    2011-01-01

    The Stratospheric Observatory for Infrared Astronomy (SOFIA) is a Boeing 747SP carrying a 2.5 meter infrared telescope capable of operating between at altitudes of between twelve and fourteen kilometers, which is above more than 99 percent of the water vapor in the atmosphere. The ability to make observations above most water vapor coupled with the ability to make observations from anywhere, anytime, make SOFIA one of the world s premiere infrared observatories. SOFIA uses three visible light CCD imagers to assist in pointing the telescope. The data from these imagers is stored in archive files as is housekeeping data, which contains information such as boresight and area of interest locations. A tool that could both extract and process data from the archive files was developed.

  5. Solar Array Verification Analysis Tool (SAVANT) Developed

    NASA Technical Reports Server (NTRS)

    Bailey, Sheila G.; Long, KIenwyn J.; Curtis, Henry B.; Gardner, Barbara; Davis, Victoria; Messenger, Scott; Walters, Robert

    1999-01-01

    Modeling solar cell performance for a specific radiation environment to obtain the end-of-life photovoltaic array performance has become both increasingly important and, with the rapid advent of new types of cell technology, more difficult. For large constellations of satellites, a few percent difference in the lifetime prediction can have an enormous economic impact. The tool described here automates the assessment of solar array on-orbit end-of-life performance and assists in the development and design of ground test protocols for different solar cell designs. Once established, these protocols can be used to calculate on-orbit end-of-life performance from ground test results. The Solar Array Verification Analysis Tool (SAVANT) utilizes the radiation environment from the Environment Work Bench (EWB) model developed by the NASA Lewis Research Center s Photovoltaic and Space Environmental Effects Branch in conjunction with Maxwell Technologies. It then modifies and combines this information with the displacement damage model proposed by Summers et al. (ref. 1) of the Naval Research Laboratory to determine solar cell performance during the course of a given mission. The resulting predictions can then be compared with flight data. The Environment WorkBench (ref. 2) uses the NASA AE8 (electron) and AP8 (proton) models of the radiation belts to calculate the trapped radiation flux. These fluxes are integrated over the defined spacecraft orbit for the duration of the mission to obtain the total omnidirectional fluence spectra. Components such as the solar cell coverglass, adhesive, and antireflective coatings can slow and attenuate the particle fluence reaching the solar cell. In SAVANT, a continuous slowing down approximation is used to model this effect.

  6. Whole Genome Sequence Analysis Using JSpecies Tool Establishes Clonal Relationships between Listeria monocytogenes Strains from Epidemiologically Unrelated Listeriosis Outbreaks

    DOE PAGES

    Burall, Laurel S.; Grim, Christopher J.; Mammel, Mark K.; ...

    2016-03-07

    In an effort to build a comprehensive genomic approach to food safety challenges, the FDA has implemented a whole genome sequencing effort, GenomeTrakr, which involves the sequencing and analysis of genomes of foodborne pathogens. As a part of this effort, we routinely sequence whole genomes of Listeria monocytogenes (Lm) isolates associated with human listeriosis outbreaks, as well as those isolated through other sources. To rapidly establish genetic relatedness of these genomes, we evaluated tetranucleotide frequency analysis via the JSpecies program to provide a cursory analysis of strain relatedness. The JSpecies tetranucleotide (tetra) analysis plots standardized (z-score) tetramer word frequencies ofmore » two strains against each other and uses linear regression analysis to determine similarity (r2). This tool was able to validate the close relationships between outbreak related strains from four different outbreaks. Included in this study was the analysis of Lm strains isolated during the recent caramel apple outbreak and stone fruit incident in 2014. We identified that many of the isolates from these two outbreaks shared a common 4b variant (4bV) serotype, also designated as IVb-v1, using a qPCR protocol developed in our laboratory. The 4bV serotype is characterized by the presence of a 6.3 Kb DNA segment normally found in serotype 1/2a, 3a, 1/2c and 3c strains but not in serotype 4b or 1/2b strains. We decided to compare these strains at a genomic level using the JSpecies Tetra tool. Specifically, we compared several 4bV and 4b isolates and identified a high level of similarity between the stone fruit and apple 4bV strains, but not the 4b strains co-identified in the caramel apple outbreak or other 4b or 4bV strains in our collection. This finding was further substantiated by a SNP-based analysis. Additionally, we were able to identify close relatedness between isolates from clinical cases from 1993–1994 and a single case from 2011 as well as links

  7. Whole Genome Sequence Analysis Using JSpecies Tool Establishes Clonal Relationships between Listeria monocytogenes Strains from Epidemiologically Unrelated Listeriosis Outbreaks

    PubMed Central

    Burall, Laurel S.; Grim, Christopher J.; Mammel, Mark K.; Datta, Atin R.

    2016-01-01

    In an effort to build a comprehensive genomic approach to food safety challenges, the FDA has implemented a whole genome sequencing effort, GenomeTrakr, which involves the sequencing and analysis of genomes of foodborne pathogens. As a part of this effort, we routinely sequence whole genomes of Listeria monocytogenes (Lm) isolates associated with human listeriosis outbreaks, as well as those isolated through other sources. To rapidly establish genetic relatedness of these genomes, we evaluated tetranucleotide frequency analysis via the JSpecies program to provide a cursory analysis of strain relatedness. The JSpecies tetranucleotide (tetra) analysis plots standardized (z-score) tetramer word frequencies of two strains against each other and uses linear regression analysis to determine similarity (r2). This tool was able to validate the close relationships between outbreak related strains from four different outbreaks. Included in this study was the analysis of Lm strains isolated during the recent caramel apple outbreak and stone fruit incident in 2014. We identified that many of the isolates from these two outbreaks shared a common 4b variant (4bV) serotype, also designated as IVb-v1, using a qPCR protocol developed in our laboratory. The 4bV serotype is characterized by the presence of a 6.3 Kb DNA segment normally found in serotype 1/2a, 3a, 1/2c and 3c strains but not in serotype 4b or 1/2b strains. We decided to compare these strains at a genomic level using the JSpecies Tetra tool. Specifically, we compared several 4bV and 4b isolates and identified a high level of similarity between the stone fruit and apple 4bV strains, but not the 4b strains co-identified in the caramel apple outbreak or other 4b or 4bV strains in our collection. This finding was further substantiated by a SNP-based analysis. Additionally, we were able to identify close relatedness between isolates from clinical cases from 1993–1994 and a single case from 2011 as well as links between

  8. Whole Genome Sequence Analysis Using JSpecies Tool Establishes Clonal Relationships between Listeria monocytogenes Strains from Epidemiologically Unrelated Listeriosis Outbreaks.

    PubMed

    Burall, Laurel S; Grim, Christopher J; Mammel, Mark K; Datta, Atin R

    2016-01-01

    In an effort to build a comprehensive genomic approach to food safety challenges, the FDA has implemented a whole genome sequencing effort, GenomeTrakr, which involves the sequencing and analysis of genomes of foodborne pathogens. As a part of this effort, we routinely sequence whole genomes of Listeria monocytogenes (Lm) isolates associated with human listeriosis outbreaks, as well as those isolated through other sources. To rapidly establish genetic relatedness of these genomes, we evaluated tetranucleotide frequency analysis via the JSpecies program to provide a cursory analysis of strain relatedness. The JSpecies tetranucleotide (tetra) analysis plots standardized (z-score) tetramer word frequencies of two strains against each other and uses linear regression analysis to determine similarity (r2). This tool was able to validate the close relationships between outbreak related strains from four different outbreaks. Included in this study was the analysis of Lm strains isolated during the recent caramel apple outbreak and stone fruit incident in 2014. We identified that many of the isolates from these two outbreaks shared a common 4b variant (4bV) serotype, also designated as IVb-v1, using a qPCR protocol developed in our laboratory. The 4bV serotype is characterized by the presence of a 6.3 Kb DNA segment normally found in serotype 1/2a, 3a, 1/2c and 3c strains but not in serotype 4b or 1/2b strains. We decided to compare these strains at a genomic level using the JSpecies Tetra tool. Specifically, we compared several 4bV and 4b isolates and identified a high level of similarity between the stone fruit and apple 4bV strains, but not the 4b strains co-identified in the caramel apple outbreak or other 4b or 4bV strains in our collection. This finding was further substantiated by a SNP-based analysis. Additionally, we were able to identify close relatedness between isolates from clinical cases from 1993-1994 and a single case from 2011 as well as links between two

  9. SearchSmallRNA: a graphical interface tool for the assemblage of viral genomes using small RNA libraries data.

    PubMed

    de Andrade, Roberto R S; Vaslin, Maite F S

    2014-03-07

    Next-generation parallel sequencing (NGS) allows the identification of viral pathogens by sequencing the small RNAs of infected hosts. Thus, viral genomes may be assembled from host immune response products without prior virus enrichment, amplification or purification. However, mapping of the vast information obtained presents a bioinformatics challenge. In order to by pass the need of line command and basic bioinformatics knowledge, we develop a mapping software with a graphical interface to the assemblage of viral genomes from small RNA dataset obtained by NGS. SearchSmallRNA was developed in JAVA language version 7 using NetBeans IDE 7.1 software. The program also allows the analysis of the viral small interfering RNAs (vsRNAs) profile; providing an overview of the size distribution and other features of the vsRNAs produced in infected cells. The program performs comparisons between each read sequenced present in a library and a chosen reference genome. Reads showing Hamming distances smaller or equal to an allowed mismatched will be selected as positives and used to the assemblage of a long nucleotide genome sequence. In order to validate the software, distinct analysis using NGS dataset obtained from HIV and two plant viruses were used to reconstruct viral whole genomes. SearchSmallRNA program was able to reconstructed viral genomes using NGS of small RNA dataset with high degree of reliability so it will be a valuable tool for viruses sequencing and discovery. It is accessible and free to all research communities and has the advantage to have an easy-to-use graphical interface. SearchSmallRNA was written in Java and is freely available at http://www.microbiologia.ufrj.br/ssrna/.

  10. SearchSmallRNA: a graphical interface tool for the assemblage of viral genomes using small RNA libraries data

    PubMed Central

    2014-01-01

    Background Next-generation parallel sequencing (NGS) allows the identification of viral pathogens by sequencing the small RNAs of infected hosts. Thus, viral genomes may be assembled from host immune response products without prior virus enrichment, amplification or purification. However, mapping of the vast information obtained presents a bioinformatics challenge. Methods In order to by pass the need of line command and basic bioinformatics knowledge, we develop a mapping software with a graphical interface to the assemblage of viral genomes from small RNA dataset obtained by NGS. SearchSmallRNA was developed in JAVA language version 7 using NetBeans IDE 7.1 software. The program also allows the analysis of the viral small interfering RNAs (vsRNAs) profile; providing an overview of the size distribution and other features of the vsRNAs produced in infected cells. Results The program performs comparisons between each read sequenced present in a library and a chosen reference genome. Reads showing Hamming distances smaller or equal to an allowed mismatched will be selected as positives and used to the assemblage of a long nucleotide genome sequence. In order to validate the software, distinct analysis using NGS dataset obtained from HIV and two plant viruses were used to reconstruct viral whole genomes. Conclusions SearchSmallRNA program was able to reconstructed viral genomes using NGS of small RNA dataset with high degree of reliability so it will be a valuable tool for viruses sequencing and discovery. It is accessible and free to all research communities and has the advantage to have an easy-to-use graphical interface. Availability and implementation SearchSmallRNA was written in Java and is freely available at http://www.microbiologia.ufrj.br/ssrna/. PMID:24607237

  11. Genome-wide SNP detection, validation, and development of an 8K SNP array for apple

    USDA-ARS?s Scientific Manuscript database

    As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide...

  12. GCORE-sib: An efficient gene-gene interaction tool for genome-wide association studies based on discordant sib pairs.

    PubMed

    Sung, Pei-Yuan; Wang, Yi-Ting; Hsiung, Chao A; Chung, Ren-Hua

    2016-07-08

    A computationally efficient tool is required for a genome-wide gene-gene interaction analysis that tests an extremely large number of single-nucleotide polymorphism (SNP) interaction pairs in genome-wide association studies (GWAS). Current tools for GWAS interaction analysis are mainly developed for unrelated case-control samples. Relatively fewer tools for interaction analysis are available for complex disease studies with family-based design, and these tools tend to be computationally expensive. We developed a fast gene-gene interaction test, GCORE-sib, for discordant sib pairs and implemented the test into an efficient tool. We used simulations to demonstrate that the GCORE-sib has correct type I error rates and has comparable power to that of the regression-based interaction test. We also showed that the GCORE-sib can run more than 10 times faster than the regression-based test. Finally, the GCORE-sib was applied to a GWAS dataset with approximately 2,000 discordant sib pairs, and the GCORE-sib finished testing 19,368,078,382 pairs of SNPs within 6 days. An efficient gene-gene interaction tool for discordant sib pairs was developed. It will be very useful for genome-wide gene-gene interaction analysis in GWAS using discordant sib pairs. The tool can be downloaded for free at http://gcore-sib.sourceforge.net .

  13. MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities

    DOE PAGES

    Kang, Dongwan D.; Froula, Jeff; Egan, Rob; ...

    2015-01-01

    Grouping large genomic fragments assembled from shotgun metagenomic sequences to deconvolute complex microbial communities, or metagenome binning, enables the study of individual organisms and their interactions. Because of the complex nature of these communities, existing metagenome binning methods often miss a large number of microbial species. In addition, most of the tools are not scalable to large datasets. Here we introduce automated software called MetaBAT that integrates empirical probabilistic distances of genome abundance and tetranucleotide frequency for accurate metagenome binning. MetaBAT outperforms alternative methods in accuracy and computational efficiency on both synthetic and real metagenome datasets. Lastly, it automatically formsmore » hundreds of high quality genome bins on a very large assembly consisting millions of contigs in a matter of hours on a single node. MetaBAT is open source software and available at https://bitbucket.org/berkeleylab/metabat.« less

  14. MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities

    SciTech Connect

    Kang, Dongwan D.; Froula, Jeff; Egan, Rob; Wang, Zhong

    2015-01-01

    Grouping large genomic fragments assembled from shotgun metagenomic sequences to deconvolute complex microbial communities, or metagenome binning, enables the study of individual organisms and their interactions. Because of the complex nature of these communities, existing metagenome binning methods often miss a large number of microbial species. In addition, most of the tools are not scalable to large datasets. Here we introduce automated software called MetaBAT that integrates empirical probabilistic distances of genome abundance and tetranucleotide frequency for accurate metagenome binning. MetaBAT outperforms alternative methods in accuracy and computational efficiency on both synthetic and real metagenome datasets. Lastly, it automatically forms hundreds of high quality genome bins on a very large assembly consisting millions of contigs in a matter of hours on a single node. MetaBAT is open source software and available at https://bitbucket.org/berkeleylab/metabat.

  15. Genome-based approaches to develop vaccines against bacterial pathogens.

    PubMed

    Serruto, Davide; Serino, Laura; Masignani, Vega; Pizza, Mariagrazia

    2009-05-26

    Bacterial infectious diseases remain the single most important threat to health worldwide. Although conventional vaccinology approaches were successful in conferring protection against several diseases, they failed to provide efficacious solutions against many others. The advent of whole-genome sequencing changed the way to think about vaccine development, enabling the targeting of possible vaccine candidates starting from the genomic information of a single bacterial isolate, with a process named reverse vaccinology. As the genomic era progressed, reverse vaccinology has evolved with a pan-genome approach and multi-strain genome analysis became fundamental for the design of universal vaccines. This review describes the applications of genome-based approaches in the development of new vaccines against bacterial pathogens.

  16. Immersive virtual environment technology: a promising tool for future social and behavioral genomics research and practice.

    PubMed

    Persky, Susan; McBride, Colleen M

    2009-12-01

    Social and behavioral research needs to get started now if scientists are to direct genomic discoveries to address pressing public health problems. Advancing social and behavioral science will require innovative and rigorous communication methodologies that move researchers beyond reliance on traditional tools and their inherent limitations. One such emerging research tool is immersive virtual environment technology (virtual reality), a methodology that gives researchers the ability to maintain high experimental control and mundane realism of scenarios; portray and manipulate complex, abstract objects and concepts; and implement innovative implicit behavioral measurement. This report suggests the role that immersive virtual environment technology can play in furthering future research in genomics-related education, decision making, test intentions, behavior change, and health-care provider behaviors. Practical implementation and challenges are also discussed.

  17. Immersive Virtual Environment Technology: A Promising Tool for Future Social and Behavioral Genomics Research and Practice

    PubMed Central

    Persky, Susan; McBride, Colleen M.

    2009-01-01

    Social and behavioral research needs to get started now if we are to direct genomic discoveries to address pressing public health problems. Advancing social and behavioral science will require innovative and rigorous communication methodologies that move us beyond reliance on traditional tools and their inherent limitations. One such emerging research tool is immersive virtual environment technology (aka: virtual reality), a methodology that gives researchers the ability to maintain high experimental control and mundane realism of scenarios, portray and manipulate complex, abstract objects and concepts, and implement innovative implicit behavioral measurement. This report suggests the role that immersive virtual environment technology can play in furthering future research in genomics-related: education, decision-making, test intentions, behavior change, and healthcare provider behaviors. Practical implementation and challenges are also discussed. PMID:20183376

  18. AnnTools: a comprehensive and versatile annotation toolkit for genomic variants

    PubMed Central

    Makarov, Vladimir; O'Grady, Tina; Cai, Guiqing; Lihm, Jayon; Buxbaum, Joseph D.; Yoon, Seungtai

    2012-01-01

    Summary: AnnTools is a versatile bioinformatics application designed for comprehensive annotation of a full spectrum of human genome variation: novel and known single-nucleotide substitutions (SNP/SNV), short insertions/deletions (INDEL) and structural variants/copy number variation (SV/CNV). The variants are interpreted by interrogating data compiled from 15 constantly updated sources. In addition to detailed functional characterization of the coding variants, AnnTools searches for overlaps with regulatory elements, disease/trait associated loci, known segmental duplications and artifact prone regions, thereby offering an integrated and comprehensive analysis of genomic data. The tool conveniently accepts user-provided tracks for custom annotation and offers flexibility in input data formats. The output is generated in the universal Variant Call Format. High annotation speed makes AnnTools suitable for high-throughput sequencing facilities, while a low-memory footprint and modest CPU requirements allow it to operate on a personal computer. The application is freely available for public use; the package includes installation scripts and a set of helper tools. Availability: http://anntools.sourceforge.net/ Contact: vladimir.makarov@mssm.edu; chris.yoon@mssm.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:22257670

  19. genipe: an automated genome-wide imputation pipeline with automatic reporting and statistical tools.

    PubMed

    Lemieux Perreault, Louis-Philippe; Legault, Marc-André; Asselin, Géraldine; Dubé, Marie-Pierre

    2016-12-01

    Genotype imputation is now commonly performed following genome-wide genotyping experiments. Imputation increases the density of analyzed genotypes in the dataset, enabling fine-mapping across the genome. However, the process of imputation using the most recent publicly available reference datasets can require considerable computation power and the management of hundreds of large intermediate files. We have developed genipe, a complete genome-wide imputation pipeline which includes automatic reporting, imputed data indexing and management, and a suite of statistical tests for imputed data commonly used in genetic epidemiology (Sequence Kernel Association Test, Cox proportional hazards for survival analysis, and linear mixed models for repeated measurements in longitudinal studies).

  20. EggLib: processing, analysis and simulation tools for population genetics and genomics

    PubMed Central

    2012-01-01

    Background With the considerable growth of available nucleotide sequence data over the last decade, integrated and flexible analytical tools have become a necessity. In particular, in the field of population genetics, there is a strong need for automated and reliable procedures to conduct repeatable and rapid polymorphism analyses, coalescent simulations, data manipulation and estimation of demographic parameters under a variety of scenarios. Results In this context, we present EggLib (Evolutionary Genetics and Genomics Library), a flexible and powerful C++/Python software package providing efficient and easy to use computational tools for sequence data management and extensive population genetic analyses on nucleotide sequence data. EggLib is a multifaceted project involving several integrated modules: an underlying computationally efficient C++ library (which can be used independently in pure C++ applications); two C++ programs; a Python package providing, among other features, a high level Python interface to the C++ library; and the egglib script which provides direct access to pre-programmed Python applications. Conclusions EggLib has been designed aiming to be both efficient and easy to use. A wide array of methods are implemented, including file format conversion, sequence alignment edition, coalescent simulations, neutrality tests and estimation of demographic parameters by Approximate Bayesian Computation (ABC). Classes implementing different demographic scenarios for ABC analyses can easily be developed by the user and included to the package. EggLib source code is distributed freely under the GNU General Public License (GPL) from its website http://egglib.sourceforge.net/ where a full documentation and a manual can also be found and downloaded. PMID:22494792

  1. EggLib: processing, analysis and simulation tools for population genetics and genomics.

    PubMed

    De Mita, Stéphane; Siol, Mathieu

    2012-04-11

    With the considerable growth of available nucleotide sequence data over the last decade, integrated and flexible analytical tools have become a necessity. In particular, in the field of population genetics, there is a strong need for automated and reliable procedures to conduct repeatable and rapid polymorphism analyses, coalescent simulations, data manipulation and estimation of demographic parameters under a variety of scenarios. In this context, we present EggLib (Evolutionary Genetics and Genomics Library), a flexible and powerful C++/Python software package providing efficient and easy to use computational tools for sequence data management and extensive population genetic analyses on nucleotide sequence data. EggLib is a multifaceted project involving several integrated modules: an underlying computationally efficient C++ library (which can be used independently in pure C++ applications); two C++ programs; a Python package providing, among other features, a high level Python interface to the C++ library; and the egglib script which provides direct access to pre-programmed Python applications. EggLib has been designed aiming to be both efficient and easy to use. A wide array of methods are implemented, including file format conversion, sequence alignment edition, coalescent simulations, neutrality tests and estimation of demographic parameters by Approximate Bayesian Computation (ABC). Classes implementing different demographic scenarios for ABC analyses can easily be developed by the user and included to the package. EggLib source code is distributed freely under the GNU General Public License (GPL) from its website http://egglib.sourceforge.net/ where a full documentation and a manual can also be found and downloaded.

  2. OCPAT: an online codon-preserved alignment tool for evolutionary genomic analysis of protein coding sequences

    PubMed Central

    Liu, Guozhen; Uddin, Monica; Islam, Munirul; Goodman, Morris; Grossman, Lawrence I; Romero, Roberto; Wildman, Derek E

    2007-01-01

    Background Rapidly accumulating genome sequence data from multiple species offer powerful opportunities for the detection of DNA sequence evolution. Phylogenetic tree construction and codon-based tests for natural selection are the prevailing tools used to detect functionally important evolutionary change in protein coding sequences. These analyses often require multiple DNA sequence alignments that maintain the correct reading frame for each collection of putative orthologous sequences. Since this feature is not available in most alignment tools, codon reading frames often must be checked manually before evolutionary analyses can commence. Results Here we report an online codon-preserved alignment tool (OCPAT) that generates multiple sequence alignments automatically from the coding sequences of any list of human gene IDs and their putative orthologs from genomes of other vertebrate tetrapods. OCPAT is programmed to extract putative orthologous genes from genomes and to align the orthologs with the reading frame maintained in all species. OCPAT also optimizes the alignment by trimming the most variable alignment regions at the 5' and 3' ends of each gene. The resulting output of alignments is returned in several formats, which facilitates further molecular evolutionary analyses by appropriate available software. Alignments are generally robust and reliable, retaining the correct reading frame. The tool can serve as the first step for comparative genomic analyses of protein-coding gene sequences including phylogenetic tree reconstruction and detection of natural selection. We aligned 20,658 human RefSeq mRNAs using OCPAT. Most alignments are missing sequence(s) from at least one species; however, functional annotation clustering of the ~1700 transcripts that were alignable to all species shows that genes involved in multi-subunit protein complexes are highly conserved. Conclusion The OCPAT program facilitates large-scale evolutionary and phylogenetic analyses of

  3. Complete genome sequence of cyanobacterium Fischerella sp. NIES-3754, providing thermoresistant optogenetic tools.

    PubMed

    Hirose, Yuu; Fujisawa, Takatomo; Ohtsubo, Yoshiyuki; Katayama, Mitsunori; Misawa, Naomi; Wakazuki, Sachiko; Shimura, Yohei; Nakamura, Yasukazu; Kawachi, Masanobu; Yoshikawa, Hirofumi; Eki, Toshihiko; Kanesaki, Yu

    2016-02-20

    Cyanobacterial phytochrome-class photosensors are recently emerging optogenetic tools. We isolated Fischerella sp. strain NIES-3754 from hotspring at Suwa-shrine, Suwa, Nagano, Japan. We determined complete genome sequence of the NIES-3754 strain, which is composed of one chromosome and two putative replicons (total 5,826,863bp containing no gaps). We identified photosensor genes of 5 phytochromes and 9 cyanobacteriochromes, which will facilitate optogenetics of thermophile.

  4. RareVariantVis: new tool for visualization of causative variants in rare monogenic disorders using whole genome sequencing data.

    PubMed

    Stokowy, Tomasz; Garbulowski, Mateusz; Fiskerstrand, Torunn; Holdhus, Rita; Labun, Kornel; Sztromwasser, Pawel; Gilissen, Christian; Hoischen, Alexander; Houge, Gunnar; Petersen, Kjell; Jonassen, Inge; Steen, Vidar M

    2016-10-01

    The search for causative genetic variants in rare diseases of presumed monogenic inheritance has been boosted by the implementation of whole exome (WES) and whole genome (WGS) sequencing. In many cases, WGS seems to be superior to WES, but the analysis and visualization of the vast amounts of data is demanding. To aid this challenge, we have developed a new tool-RareVariantVis-for analysis of genome sequence data (including non-coding regions) for both germ line and somatic variants. It visualizes variants along their respective chromosomes, providing information about exact chromosomal position, zygosity and frequency, with point-and-click information regarding dbSNP IDs, gene association and variant inheritance. Rare variants as well as de novo variants can be flagged in different colors. We show the performance of the RareVariantVis tool in the Genome in a Bottle WGS data set. https://www.bioconductor.org/packages/3.3/bioc/html/RareVariantVis.html tomasz.stokowy@k2.uib.no Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  5. Novel Genome-Editing Tools to Model and Correct Primary Immunodeficiencies

    PubMed Central

    Ott de Bruin, Lisa M.; Volpi, Stefano; Musunuru, Kiran

    2015-01-01

    Severe combined immunodeficiency (SCID) and other severe non-SCID primary immunodeficiencies (non-SCID PID) can be treated by allogeneic hematopoietic stem cell (HSC) transplantation, but when histocompatibility leukocyte antigen-matched donors are lacking, this can be a high-risk procedure. Correcting the patient’s own HSCs with gene therapy offers an attractive alternative. Gene therapies currently being used in clinical settings insert a functional copy of the entire gene by means of a viral vector. With this treatment, severe complications may result due to integration within oncogenes. A promising alternative is the use of endonucleases such as ZFNs, TALENs, and CRISPR/Cas9 to introduce a double-stranded break in the DNA and thus induce homology-directed repair. With these genome-editing tools a correct copy can be inserted in a precisely targeted “safe harbor.” They can also be used to correct pathogenic mutations in situ and to develop cellular or animal models needed to study the pathogenic effects of specific genetic defects found in immunodeficient patients. This review discusses the advantages and disadvantages of these endonucleases in gene correction and modeling with an emphasis on CRISPR/Cas9, which offers the most promise due to its efficacy and versatility. PMID:26052330

  6. After genomics, what proteomics tools could help us understand the antimicrobial resistance of Escherichia coli?

    PubMed

    Radhouani, Hajer; Pinto, Luís; Poeta, Patrícia; Igrejas, Gilberto

    2012-06-06

    Proteomic approaches have been considerably improved during the past decade and have been used to investigate the differences in protein expression profiles of cells grown under a broad spectrum of growth conditions and with different stress factors including antibiotics. In Europe, the most significant disease threat remains the presence of microorganisms that have become resistant to antimicrobials and so it is important that different scientific tools are combined to achieve the largest amount of knowledge in this area of expertise. The emergence and spread of the antibiotic-resistant Gram-negative pathogens, such as Escherichia coli, can lead to serious problem public health in humans. E. coli, a very well described prokaryote, has served as a model organism for several biological and biotechnological studies increasingly so since the completion of the E. coli genome-sequencing project. The purpose of this review is to present an overview of the different proteomic approaches to antimicrobial-resistant E. coli that will be helpful to obtain a better knowledge of the antibiotic-resistant mechanism(s). This can also aid to understand the molecular determinants involved with pathogenesis, which is essential for the development of effective strategies to combat infection and to reveal new therapeutic targets. This article is part of a Special Issue entitled: Proteomics: The clinical link. Copyright © 2011 Elsevier B.V. All rights reserved.

  7. Dissecting diabetes/metabolic disease mechanisms using pluripotent stem cells and genome editing tools

    PubMed Central

    Teo, Adrian Kee Keong; Gupta, Manoj K.; Doria, Alessandro; Kulkarni, Rohit N.

    2015-01-01

    Background Diabetes and metabolic syndromes are chronic, devastating diseases with increasing prevalence. Human pluripotent stem cells are gaining popularity in their usage for human in vitro disease modeling. With recent rapid advances in genome editing tools, these cells can now be genetically manipulated with relative ease to study how genes and gene variants contribute to diabetes and metabolic syndromes. Scope of review We highlight the diabetes and metabolic genes and gene variants, which could potentially be studied, using two powerful technologies – human pluripotent stem cells (hPSCs) and genome editing tools – to aid the elucidation of yet elusive mechanisms underlying these complex diseases. Major conclusions hPSCs and the advancing genome editing tools appear to be a timely and potent combination for probing molecular mechanism(s) underlying diseases such as diabetes and metabolic syndromes. The knowledge gained from these hiPSC-based disease modeling studies can potentially be translated into the clinics by guiding clinicians on the appropriate type of medication to use for each condition based on the mechanism of action of the disease. PMID:26413465

  8. LEGER: knowledge database and visualization tool for comparative genomics of pathogenic and non-pathogenic Listeria species

    PubMed Central

    Dieterich, Guido; Kärst, Uwe; Fischer, Elmar; Wehland, Jürgen; Jänsch, Lothar

    2006-01-01

    Listeria species are ubiquitous in the environment and often contaminate foods because they grow under conditions used for food preservation. Listeria monocytogenes, the human and animal pathogen, causes Listeriosis, an infection with a high mortality rate in risk groups such as immune-compromised individuals. Furthermore, L.monocytogenes is a model organism for the study of intracellular bacterial pathogens. The publication of its genome sequence and that of the non-pathogenic species Listeria innocua initiated numerous comparative studies and efforts to sequence all species comprising the genus. The Proteome database LEGER () was developed to support functional genome analyses by combining information obtained by applying bioinformatics methods and from public databases to improve the original annotations. LEGER offers three unique key features: (i) it is the first comprehensive information system focusing on the functional assignment of genes and proteins; (ii) integrated visualization tools, KEGG pathway and Genome Viewer, alleviate the functional exploration of complex data; and (iii) LEGER presents results of systematic post-genome studies, thus facilitating analyses combining computational and experimental results. Moreover, LEGER provides an unpublished membrane proteome analysis of L.innocua and in total visualizes experimentally validated information about the subcellular localizations of 789 different listerial proteins. PMID:16381897

  9. DNA-binding protein prediction using plant specific support vector machines: validation and application of a new genome annotation tool.

    PubMed

    Motion, Graham B; Howden, Andrew J M; Huitema, Edgar; Jones, Susan

    2015-12-15

    There are currently 151 plants with draft genomes available but levels of functional annotation for putative protein products are low. Therefore, accurate computational predictions are essential to annotate genomes in the first instance, and to provide focus for the more costly and time consuming functional assays that follow. DNA-binding proteins are an important class of proteins that require annotation, but current computational methods are not applicable for genome wide predictions in plant species. Here, we explore the use of species and lineage specific models for the prediction of DNA-binding proteins in plants. We show that a species specific support vector machine model based on Arabidopsis sequence data is more accurate (accuracy 81%) than a generic model (74%), and based on this we develop a plant specific model for predicting DNA-binding proteins. We apply this model to the tomato proteome and demonstrate its ability to perform accurate high-throughput prediction of DNA-binding proteins. In doing so, we have annotated 36 currently uncharacterised proteins by assigning a putative DNA-binding function. Our model is publically available and we propose it be used in combination with existing tools to help increase annotation levels of DNA-binding proteins encoded in plant genomes.

  10. The use of retroviruses as pharmaceutical tools for target discovery and validation in the field of functional genomics.

    PubMed

    Lorens, J B; Sousa, C; Bennett, M K; Molineaux, S M; Payan, D G

    2001-12-01

    Retrovirally mediated functional genomics enables identification of physiologically relevant cellular therapeutic targets. Unique properties of retroviruses make them ideal tools for the introduction of large and diverse libraries of potential genetic effectors to a variety of cell types. The identification and recovery of intracellular library elements responsible for altered disease responses establishes a direct basis for pharmaceutical development. Recent innovations in retroviral infection efficiency and expression control have broadened application of the methodology to include libraries of mutagenized cDNAs, peptides and ribozyme genetic effectors.

  11. Stage Separation CFD Tool Development and Evaluation

    NASA Technical Reports Server (NTRS)

    Droege, Alan; Gomez, Reynaldo; Wang, Ten-See

    2002-01-01

    This viewgraph presentation evaluates CFD (Computational Fluid Dynamics) tools for solving stage separation problems. The demonstration and validation of the tools is for a second generation RLV (Reusable Launch Vehicle) stage separation. The flow solvers are: Cart3D; Overflow/Overflow-D; Unic.

  12. Developments in applications of PCD tooling

    NASA Astrophysics Data System (ADS)

    Heath, Peter J.

    1991-06-01

    The performance of tooling based on polycrystalline diamond (PCD) has been quantified in the machining of two types of aluminum-based metal matrix composites (MMCs), where the reinforcing phase was either silicon carbide particles or alumina fibers. The machinability of these MMCs was compared with that of a hypereutectic aluminum 20 percent silicon alloy when using both cemented-carbide and PCD cutting edges. It is found that PCD is a most suitable tool material for the machining of aluminum-silicon carbide MMCs, and that cutting speeds should be in the 50-150 m/min range. Feed rates and depths of cut should be adjusted to suit the component being machined and the surface finish required. At low cutting speeds, cemented carbide cutting tools can be used on aluminum-silicon carbide MMCs, but the tool life is very low and the amount of subsurface damage produced is greater than that produced by PCD tools.

  13. Barley whole exome capture: a tool for genomic research in the genus Hordeum and beyond

    PubMed Central

    Mascher, Martin; Richmond, Todd A; Gerhardt, Daniel J; Himmelbach, Axel; Clissold, Leah; Sampath, Dharanya; Ayling, Sarah; Steuernagel, Burkhard; Pfeifer, Matthias; D'Ascenzo, Mark; Akhunov, Eduard D; Hedley, Pete E; Gonzales, Ana M; Morrell, Peter L; Kilian, Benjamin; Blattner, Frank R; Scholz, Uwe; Mayer, Klaus FX; Flavell, Andrew J; Muehlbauer, Gary J; Waugh, Robbie; Jeddeloh, Jeffrey A; Stein, Nils

    2013-01-01

    Advanced resources for genome-assisted research in barley (Hordeum vulgare) including a whole-genome shotgun assembly and an integrated physical map have recently become available. These have made possible studies that aim to assess genetic diversity or to isolate single genes by whole-genome resequencing and in silico variant detection. However such an approach remains expensive given the 5 Gb size of the barley genome. Targeted sequencing of the mRNA-coding exome reduces barley genomic complexity more than 50-fold, thus dramatically reducing this heavy sequencing and analysis load. We have developed and employed an in-solution hybridization-based sequence capture platform to selectively enrich for a 61.6 megabase coding sequence target that includes predicted genes from the genome assembly of the cultivar Morex as well as publicly available full-length cDNAs and de novo assembled RNA-Seq consensus sequence contigs. The platform provides a highly specific capture with substantial and reproducible enrichment of targeted exons, both for cultivated barley and related species. We show that this exome capture platform provides a clear path towards a broader and deeper understanding of the natural variation residing in the mRNA-coding part of the barley genome and will thus constitute a valuable resource for applications such as mapping-by-sequencing and genetic diversity analyzes. PMID:23889683

  14. Barley whole exome capture: a tool for genomic research in the genus Hordeum and beyond.

    PubMed

    Mascher, Martin; Richmond, Todd A; Gerhardt, Daniel J; Himmelbach, Axel; Clissold, Leah; Sampath, Dharanya; Ayling, Sarah; Steuernagel, Burkhard; Pfeifer, Matthias; D'Ascenzo, Mark; Akhunov, Eduard D; Hedley, Pete E; Gonzales, Ana M; Morrell, Peter L; Kilian, Benjamin; Blattner, Frank R; Scholz, Uwe; Mayer, Klaus F X; Flavell, Andrew J; Muehlbauer, Gary J; Waugh, Robbie; Jeddeloh, Jeffrey A; Stein, Nils

    2013-11-01

    Advanced resources for genome-assisted research in barley (Hordeum vulgare) including a whole-genome shotgun assembly and an integrated physical map have recently become available. These have made possible studies that aim to assess genetic diversity or to isolate single genes by whole-genome resequencing and in silico variant detection. However such an approach remains expensive given the 5 Gb size of the barley genome. Targeted sequencing of the mRNA-coding exome reduces barley genomic complexity more than 50-fold, thus dramatically reducing this heavy sequencing and analysis load. We have developed and employed an in-solution hybridization-based sequence capture platform to selectively enrich for a 61.6 megabase coding sequence target that includes predicted genes from the genome assembly of the cultivar Morex as well as publicly available full-length cDNAs and de novo assembled RNA-Seq consensus sequence contigs. The platform provides a highly specific capture with substantial and reproducible enrichment of targeted exons, both for cultivated barley and related species. We show that this exome capture platform provides a clear path towards a broader and deeper understanding of the natural variation residing in the mRNA-coding part of the barley genome and will thus constitute a valuable resource for applications such as mapping-by-sequencing and genetic diversity analyzes.

  15. Development of a Raman lidar simulation tool

    NASA Technical Reports Server (NTRS)

    Grasso, R. J.; Hummel, J. R.

    1992-01-01

    Raman Lidar is a useful and powerful tool for remote probing of the atmosphere. With Raman Lidars, one can accurately determine the identity and concentration of a particular molecular specie present in the atmosphere. We present the results from a program to develop a simulation capability of Raman Lidar systems for the remote detection of atmospheric gases and/or air polluting hydrocarbons. Our model, which integrates remote Raman spectroscopy with SPARTA's BACKSCAT atmospheric lidar simulation package, permits accurate determination of the performance of a Raman Lidar system. The accuracy with which our model operates is due to the accurate calculation, at any given excitation wavelength, of the differential scattering cross section for the molecular specie under investigation. We show excellent correlation of our calculated cross section data with experimental data from the published literature. In addition, the use of our BACKSCAT package, which provides a user friendly environment to define the operating conditions, provides an accurate calculation of the atmospheric extinction at both the excitation and Raman shifted wavelengths. Our code can be used to accurately predict the performance of a Raman Lidar system, the concentration and identification of a specie in the atmosphere, or the feasibility of making Raman measurements.

  16. The integrated development of tools and environments for electronic design

    SciTech Connect

    Wagner, F.R.

    1996-12-31

    This paper discusses the relationships between the development of design tools and design environments for electronic circuits and systems. It is shown that the development of both the tools and the environment must be performed on an integrated way and depend on a design process to be supported. A methodology for the development of integrated and open design environments is shown. It considers different tool integration approaches and the selection of either environment-independent or dependent design tools.

  17. Development of a brachytherapy audit checklist tool.

    PubMed

    Prisciandaro, Joann; Hadley, Scott; Jolly, Shruti; Lee, Choonik; Roberson, Peter; Roberts, Donald; Ritter, Timothy

    2015-01-01

    To develop a brachytherapy audit checklist that could be used to prepare for Nuclear Regulatory Commission or agreement state inspections, to aid in readiness for a practice accreditation visit, or to be used as an annual internal audit tool. Six board-certified medical physicists and one radiation oncologist conducted a thorough review of brachytherapy-related literature and practice guidelines published by professional organizations and federal regulations. The team members worked at two facilities that are part of a large, academic health care center. Checklist items were given a score based on their judged importance. Four clinical sites performed an audit of their program using the checklist. The sites were asked to score each item based on a defined severity scale for their noncompliance, and final audit scores were tallied by summing the products of importance score and severity score for each item. The final audit checklist, which is available online, contains 83 items. The audit scores from the beta sites ranged from 17 to 71 (out of 690) and identified a total of 7-16 noncompliance items. The total time to conduct the audit ranged from 1.5 to 5 hours. A comprehensive audit checklist was developed which can be implemented by any facility that wishes to perform a program audit in support of their own brachytherapy program. The checklist is designed to allow users to identify areas of noncompliance and to prioritize how these items are addressed to minimize deviations from nationally-recognized standards. Copyright © 2015 American Brachytherapy Society. All rights reserved.

  18. Community annotation and bioinformatics workforce development in concert--Little Skate Genome Annotation Workshops and Jamborees.

    PubMed

    Wang, Qinghua; Arighi, Cecilia N; King, Benjamin L; Polson, Shawn W; Vincent, James; Chen, Chuming; Huang, Hongzhan; Kingham, Brewster F; Page, Shallee T; Rendino, Marc Farnum; Thomas, William Kelley; Udwary, Daniel W; Wu, Cathy H

    2012-01-01

    Recent advances in high-throughput DNA sequencing technologies have equipped biologists with a powerful new set of tools for advancing research goals. The resulting flood of sequence data has made it critically important to train the next generation of scientists to handle the inherent bioinformatic challenges. The North East Bioinformatics Collaborative (NEBC) is undertaking the genome sequencing and annotation of the little skate (Leucoraja erinacea) to promote advancement of bioinformatics infrastructure in our region, with an emphasis on practical education to create a critical mass of informatically savvy life scientists. In support of the Little Skate Genome Project, the NEBC members have developed several annotation workshops and jamborees to provide training in genome sequencing, annotation and analysis. Acting as a nexus for both curation activities and dissemination of project data, a project web portal, SkateBase (http://skatebase.org) has been developed. As a case study to illustrate effective coupling of community annotation with workforce development, we report the results of the Mitochondrial Genome Annotation Jamborees organized to annotate the first completely assembled element of the Little Skate Genome Project, as a culminating experience for participants from our three prior annotation workshops. We are applying the physical/virtual infrastructure and lessons learned from these activities to enhance and streamline the genome annotation workflow, as we look toward our continuing efforts for larger-scale functional and structural community annotation of the L. erinacea genome.

  19. Meta-tools for software development and knowledge acquisition

    NASA Technical Reports Server (NTRS)

    Eriksson, Henrik; Musen, Mark A.

    1992-01-01

    The effectiveness of tools that provide support for software development is highly dependent on the match between the tools and their task. Knowledge-acquisition (KA) tools constitute a class of development tools targeted at knowledge-based systems. Generally, KA tools that are custom-tailored for particular application domains are more effective than are general KA tools that cover a large class of domains. The high cost of custom-tailoring KA tools manually has encouraged researchers to develop meta-tools for KA tools. Current research issues in meta-tools for knowledge acquisition are the specification styles, or meta-views, for target KA tools used, and the relationships between the specification entered in the meta-tool and other specifications for the target program under development. We examine different types of meta-views and meta-tools. Our current project is to provide meta-tools that produce KA tools from multiple specification sources--for instance, from a task analysis of the target application.

  20. Animal QTLdb: an improved database tool for livestock animal QTL/association data dissemination in the post-genome era.

    PubMed

    Hu, Zhi-Liang; Park, Carissa A; Wu, Xiao-Lin; Reecy, James M

    2013-01-01

    The Animal QTL database (QTLdb; http://www.animalgenome.org/QTLdb) is designed to house all publicly available QTL and single-nucleotide polymorphism/gene association data on livestock animal species. An earlier version was published in the Nucleic Acids Research Database issue in 2007. Since then, we have continued our efforts to develop new and improved database tools to allow more data types, parameters and functions. Our efforts have transformed the Animal QTLdb into a tool that actively serves the research community as a quality data repository and more importantly, a provider of easily accessible tools and functions to disseminate QTL and gene association information. The QTLdb has been heavily used by the livestock genomics community since its first public release in 2004. To date, there are 5920 cattle, 3442 chicken, 7451 pigs, 753 sheep and 88 rainbow trout data points in the database, and at least 290 publications that cite use of the database. The rapid advancement in genomic studies of cattle, chicken, pigs, sheep and other livestock animals has presented us with challenges, as well as opportunities for the QTLdb to meet the evolving needs of the research community. Here, we report our progress over the recent years and highlight new functions and services available to the general public.

  1. RotaC: a web-based tool for the complete genome classification of group A rotaviruses.

    PubMed

    Maes, Piet; Matthijnssens, Jelle; Rahman, Mustafizur; Van Ranst, Marc

    2009-11-23

    Group A rotaviruses are the most common cause of severe diarrhea in infants and children worldwide and continue to have a major global impact on childhood morbidity and mortality. In recent years, considerable research efforts have been devoted to the development of two new live, orally administered vaccines. Although both vaccines have proven to confer a good protection against severe rotavirus gastroenteritis, these vaccines will have to be screened and may have to be updated regularly to reflect temporal and spatial genotype fluctuations. In this matter, the genetic characterization of circulating and new emerging rotavirus strains will need to be compulsory and accurate. An extended classification system for rotaviruses in which all the 11 genomic RNA segments are used, has been proposed recently. The use of this classification system will help to elucidate the role of gene reassortments in the generation of genetic diversity, host range restriction, co-segregation of certain gene segments, and in adaptation to a new host species. Here we present a web-based tool that can be used for fast rotavirus genotype differentiation of all 11 group A rotavirus gene segments according to the new guidelines proposed by the Rotavirus Classification Working Group (RCWG). With the increasing sequencing efforts that are being conducted around the world to unravel complete rotavirus genomes of human and animal origin, this tool will be of great help to analyze and correctly classify the large amount of new data. The web-based tool is freely available at http://rotac.regatools.be.

  2. Developing genomic resources for the apiaceae

    USDA-ARS?s Scientific Manuscript database

    The Apiaceae family includes carrot, celery, cilantro, dill, fennel and numerous other spice and medicinal crops. Carrot is the most economically important member of the Apiaceae with an annual value of $600 M in the United States alone. There are few genomic resources for carrot or other Apiaceae, ...

  3. Novel genomic tools for specific and real-time detection of biothreat and frequently encountered foodborne pathogens.

    PubMed

    Woubit, Abdela; Yehualaeshet, Teshome; Habtemariam, Tsegaye; Samuel, Temesgen

    2012-04-01

    The bacterial genera Escherichia, Salmonella, Shigella, Vibrio, Yersinia, and Francisella include important food safety and biothreat agents. By extensive mining of the whole genome and protein databases of diverse, closely and distantly related bacterial species and strains, we have identified novel genome regions, which we utilized to develop a rapid detection platform for these pathogens. The specific genomic targets we have identified to design the primers in Francisella tularensis subsp. tularensis, F. tularensis subsp. novicida, Shigella dysenteriae, Salmonella enterica serovar Typhimurium, Vibrio cholerae, Yersinia pestis, and Yersinia pseudotuberculosis contained either known genes or putative proteins. Primer sets were designed from the target regions for use in real-time PCR assays to detect specific biothreat pathogens at species or strain levels. The primer sets were first tested by in silico PCR against whole-genome sequences of different species, subspecies, or strains and then by in vitro PCR against genomic DNA preparations from 23 strains representing six biothreat agents (Escherichia coli O157:H7 strain EDL 933, Shigella dysenteriae, S. enterica serovar Typhi, F. tularensis subsp. tularensis, V. cholerae, and Y. pestis) and six foodborne pathogens (Salmonella Typhimurium, Salmonella Saintpaul, Shigella sonnei, F. tularensis subsp. novicida, Vibrio parahaemolyticus, and Y. pseudotuberculosis). Each pathogen was specifically identifiable at the genus and species levels. Sensitivity assays performed with purified DNA showed the lowest detection limit of 128 fg of DNA/μl for F. tularensis subsp. tularensis. A preliminary test to detect Shigella organisms in a milk matrix also enabled the detection of 6 to 60 CFU/ml. These new tools could ultimately be used to develop platforms to simultaneously detect these pathogens.

  4. The Global Invertebrate Genomics Alliance (GIGA): Developing Community Resources to Study Diverse Invertebrate Genomes

    PubMed Central

    2014-01-01

    Over 95% of all metazoan (animal) species comprise the “invertebrates,” but very few genomes from these organisms have been sequenced. We have, therefore, formed a “Global Invertebrate Genomics Alliance” (GIGA). Our intent is to build a collaborative network of diverse scientists to tackle major challenges (e.g., species selection, sample collection and storage, sequence assembly, annotation, analytical tools) associated with genome/transcriptome sequencing across a large taxonomic spectrum. We aim to promote standards that will facilitate comparative approaches to invertebrate genomics and collaborations across the international scientific community. Candidate study taxa include species from Porifera, Ctenophora, Cnidaria, Placozoa, Mollusca, Arthropoda, Echinodermata, Annelida, Bryozoa, and Platyhelminthes, among others. GIGA will target 7000 noninsect/nonnematode species, with an emphasis on marine taxa because of the unrivaled phyletic diversity in the oceans. Priorities for selecting invertebrates for sequencing will include, but are not restricted to, their phylogenetic placement; relevance to organismal, ecological, and conservation research; and their importance to fisheries and human health. We highlight benefits of sequencing both whole genomes (DNA) and transcriptomes and also suggest policies for genomic-level data access and sharing based on transparency and inclusiveness. The GIGA Web site (http://giga.nova.edu) has been launched to facilitate this collaborative venture. PMID:24336862

  5. The Global Invertebrate Genomics Alliance (GIGA): developing community resources to study diverse invertebrate genomes.

    PubMed

    Bracken-Grissom, Heather; Collins, Allen G; Collins, Timothy; Crandall, Keith; Distel, Daniel; Dunn, Casey; Giribet, Gonzalo; Haddock, Steven; Knowlton, Nancy; Martindale, Mark; Medina, Mónica; Messing, Charles; O'Brien, Stephen J; Paulay, Gustav; Putnam, Nicolas; Ravasi, Timothy; Rouse, Greg W; Ryan, Joseph F; Schulze, Anja; Wörheide, Gert; Adamska, Maja; Bailly, Xavier; Breinholt, Jesse; Browne, William E; Diaz, M Christina; Evans, Nathaniel; Flot, Jean-François; Fogarty, Nicole; Johnston, Matthew; Kamel, Bishoy; Kawahara, Akito Y; Laberge, Tammy; Lavrov, Dennis; Michonneau, François; Moroz, Leonid L; Oakley, Todd; Osborne, Karen; Pomponi, Shirley A; Rhodes, Adelaide; Santos, Scott R; Satoh, Nori; Thacker, Robert W; Van de Peer, Yves; Voolstra, Christian R; Welch, David Mark; Winston, Judith; Zhou, Xin

    2014-01-01

    Over 95% of all metazoan (animal) species comprise the "invertebrates," but very few genomes from these organisms have been sequenced. We have, therefore, formed a "Global Invertebrate Genomics Alliance" (GIGA). Our intent is to build a collaborative network of diverse scientists to tackle major challenges (e.g., species selection, sample collection and storage, sequence assembly, annotation, analytical tools) associated with genome/transcriptome sequencing across a large taxonomic spectrum. We aim to promote standards that will facilitate comparative approaches to invertebrate genomics and collaborations across the international scientific community. Candidate study taxa include species from Porifera, Ctenophora, Cnidaria, Placozoa, Mollusca, Arthropoda, Echinodermata, Annelida, Bryozoa, and Platyhelminthes, among others. GIGA will target 7000 noninsect/nonnematode species, with an emphasis on marine taxa because of the unrivaled phyletic diversity in the oceans. Priorities for selecting invertebrates for sequencing will include, but are not restricted to, their phylogenetic placement; relevance to organismal, ecological, and conservation research; and their importance to fisheries and human health. We highlight benefits of sequencing both whole genomes (DNA) and transcriptomes and also suggest policies for genomic-level data access and sharing based on transparency and inclusiveness. The GIGA Web site (http://giga.nova.edu) has been launched to facilitate this collaborative venture.

  6. EVA: Exome Variation Analyzer, an efficient and versatile tool for filtering strategies in medical genomics

    PubMed Central

    2012-01-01

    Background Whole exome sequencing (WES) has become the strategy of choice to identify a coding allelic variant for a rare human monogenic disorder. This approach is a revolution in medical genetics history, impacting both fundamental research, and diagnostic methods leading to personalized medicine. A plethora of efficient algorithms has been developed to ensure the variant discovery. They generally lead to ~20,000 variations that have to be narrow down to find the potential pathogenic allelic variant(s) and the affected gene(s). For this purpose, commonly adopted procedures which implicate various filtering strategies have emerged: exclusion of common variations, type of the allelics variants, pathogenicity effect prediction, modes of inheritance and multiple individuals for exome comparison. To deal with the expansion of WES in medical genomics individual laboratories, new convivial and versatile software tools have to implement these filtering steps. Non-programmer biologists have to be autonomous combining themselves different filtering criteria and conduct a personal strategy depending on their assumptions and study design. Results We describe EVA (Exome Variation Analyzer), a user-friendly web-interfaced software dedicated to the filtering strategies for medical WES. Thanks to different modules, EVA (i) integrates and stores annotated exome variation data as strictly confidential to the project owner, (ii) allows to combine the main filters dealing with common variations, molecular types, inheritance mode and multiple samples, (iii) offers the browsing of annotated data and filtered results in various interactive tables, graphical visualizations and statistical charts, (iv) and finally offers export files and cross-links to external useful databases and softwares for further prioritization of the small subset of sorted candidate variations and genes. We report a demonstrative case study that allowed to identify a new candidate gene related to a rare form of

  7. Hidden Tools for Life Planning and Development.

    ERIC Educational Resources Information Center

    Shepard, John W.; And Others, Eds.

    1991-01-01

    Includes "Leisureman and the Future of Life Planning" (Edwards); "Leisure: A Hidden Counseling Tool" (Zorn, Kimeldorf); "Breakthrough in Productivity" (Faulkner); "Volunteerism" (Schindler-Rainman); "Building Bridges" (Larkin, Burdick); "Religion" (Whitten); "Knowledge of…

  8. Hidden Tools for Life Planning and Development.

    ERIC Educational Resources Information Center

    Shepard, John W.; And Others, Eds.

    1991-01-01

    Includes "Leisureman and the Future of Life Planning" (Edwards); "Leisure: A Hidden Counseling Tool" (Zorn, Kimeldorf); "Breakthrough in Productivity" (Faulkner); "Volunteerism" (Schindler-Rainman); "Building Bridges" (Larkin, Burdick); "Religion" (Whitten); "Knowledge of…

  9. Genome-wide microsatellite characterization and marker development in the sequenced Brassica crop species.

    PubMed

    Shi, Jiaqin; Huang, Shunmou; Zhan, Jiepeng; Yu, Jingyin; Wang, Xinfa; Hua, Wei; Liu, Shengyi; Liu, Guihua; Wang, Hanzhong

    2014-02-01

    Although much research has been conducted, the pattern of microsatellite distribution has remained ambiguous, and the development/utilization of microsatellite markers has still been limited/inefficient in Brassica, due to the lack of genome sequences. In view of this, we conducted genome-wide microsatellite characterization and marker development in three recently sequenced Brassica crops: Brassica rapa, Brassica oleracea and Brassica napus. The analysed microsatellite characteristics of these Brassica species were highly similar or almost identical, which suggests that the pattern of microsatellite distribution is likely conservative in Brassica. The genomic distribution of microsatellites was highly non-uniform and positively or negatively correlated with genes or transposable elements, respectively. Of the total of 115 869, 185 662 and 356 522 simple sequence repeat (SSR) markers developed with high frequencies (408.2, 343.8 and 356.2 per Mb or one every 2.45, 2.91 and 2.81 kb, respectively), most represented new SSR markers, the majority had determined physical positions, and a large number were genic or putative single-locus SSR markers. We also constructed a comprehensive database for the newly developed SSR markers, which was integrated with public Brassica SSR markers and annotated genome components. The genome-wide SSR markers developed in this study provide a useful tool to extend the annotated genome resources of sequenced Brassica species to genetic study/breeding in different Brassica species.

  10. Genome-Wide Microsatellite Characterization and Marker Development in the Sequenced Brassica Crop Species

    PubMed Central

    Shi, Jiaqin; Huang, Shunmou; Zhan, Jiepeng; Yu, Jingyin; Wang, Xinfa; Hua, Wei; Liu, Shengyi; Liu, Guihua; Wang, Hanzhong

    2014-01-01

    Although much research has been conducted, the pattern of microsatellite distribution has remained ambiguous, and the development/utilization of microsatellite markers has still been limited/inefficient in Brassica, due to the lack of genome sequences. In view of this, we conducted genome-wide microsatellite characterization and marker development in three recently sequenced Brassica crops: Brassica rapa, Brassica oleracea and Brassica napus. The analysed microsatellite characteristics of these Brassica species were highly similar or almost identical, which suggests that the pattern of microsatellite distribution is likely conservative in Brassica. The genomic distribution of microsatellites was highly non-uniform and positively or negatively correlated with genes or transposable elements, respectively. Of the total of 115 869, 185 662 and 356 522 simple sequence repeat (SSR) markers developed with high frequencies (408.2, 343.8 and 356.2 per Mb or one every 2.45, 2.91 and 2.81 kb, respectively), most represented new SSR markers, the majority had determined physical positions, and a large number were genic or putative single-locus SSR markers. We also constructed a comprehensive database for the newly developed SSR markers, which was integrated with public Brassica SSR markers and annotated genome components. The genome-wide SSR markers developed in this study provide a useful tool to extend the annotated genome resources of sequenced Brassica species to genetic study/breeding in different Brassica species. PMID:24130371

  11. Development and psychometric evaluation of the genomic nursing concept inventory.

    PubMed

    Ward, Linda D; Haberman, Mel; Barbosa-Leiker, Celestina

    2014-09-01

    Translation of genome science to improve health outcomes requires nurses to develop genomic competency and literacy, and a robust measure of genomic literacy is needed to advance evidence-based nursing education. This study aimed to develop and evaluate the Genomic Nursing Concept Inventory (GNCI), which is a scale to measure understanding of the genetic/genomic concepts most critical to nursing practice. Applying a multistep process, key concepts were drawn from essential nursing genetic/genomic competencies and validated by expert opinion. Surveys and cognitive interviews of baccalaureate nursing (BSN) students informed item development. A 52-item draft inventory was administered to 238 BSN students. Item analysis informed inventory reduction, and the resulting 31-item inventory was tested with 705 BSN students. Scale difficulty was 47%, item difficulty 13% to 84%, and Cronbach’s alpha 0.77. As scale refinement proceeds, the GNCI provides a useful measure of genomic literacy to inform curriculum design and evaluate outcomes in genomic nursing education.

  12. Genomic Tools for Customized Recovery and Detection of Foodborne Shiga Toxigenic Escherichia coli.

    PubMed

    Knowles, Michael; Stinson, Sara; Lambert, Dominic; Carrillo, Catherine; Koziol, Adam; Gauthier, Martine; Blais, Burton

    2016-12-01

    Genomic antimicrobial resistance (AMR) prediction tools have the potential to support foodborne illness outbreak investigations through their application in the analysis of bacterial genomes from causative strains. The AMR marker profile of a strain of interest, initially identified in outbreak-associated clinical samples, may serve as the basis for customization of selective enrichment media, facilitating its recovery from samples in a food safety investigation. Different possibilities for AMR analyses include the use of comprehensive AMR gene databases such as the Comprehensive Antibiotic Resistance Database, which can be mined with in-house bioinformatics alignment tools (e.g., Antimicrobial Resistance Marker Identifier), or publicly available tools based on clinically relevant acquired AMR gene databases (e.g., ResFinder). In combination with a previously reported pipeline (SigSeekr) designed to identify specific DNA sequences associated with a particular strain for its rapid identification by PCR, it should be possible to deploy custom recovery and identification tools for the efficient detection of priority pathogens such as Shiga toxigenic Escherichia coli (STEC) outbreak strains within the time frame of an active investigation. Using a laboratory STEC strain as a model, trimethoprim resistance identified by both Antimicrobial Resistance Marker Identifier and ResFinder was used as the basis for its selective recovery against a background of commensal E. coli bacteria in ground beef samples. Enrichment in modified tryptic soy broth containing trimethoprim greatly enhanced the recovery of low numbers of model strain cells inoculated in ground beef samples, as verified by the enumeration of colonies on plating media using a strain-specific PCR method to determine the recovery efficiency for the target strain. We discuss the relative merits of different AMR marker prediction tools for this purpose and describe how such tools can be utilized to good effect in a

  13. Plant Aquaporins: Genome-Wide Identification, Transcriptomics, Proteomics, and Advanced Analytical Tools

    PubMed Central

    Deshmukh, Rupesh K.; Sonah, Humira; Bélanger, Richard R.

    2016-01-01

    Aquaporins (AQPs) are channel-forming integral membrane proteins that facilitate the movement of water and many other small molecules. Compared to animals, plants contain a much higher number of AQPs in their genome. Homology-based identification of AQPs in sequenced species is feasible because of the high level of conservation of protein sequences across plant species. Genome-wide characterization of AQPs has highlighted several important aspects such as distribution, genetic organization, evolution and conserved features governing solute specificity. From a functional point of view, the understanding of AQP transport system has expanded rapidly with the help of transcriptomics and proteomics data. The efficient analysis of enormous amounts of data generated through omic scale studies has been facilitated through computational advancements. Prediction of protein tertiary structures, pore architecture, cavities, phosphorylation sites, heterodimerization, and co-expression networks has become more sophisticated and accurate with increasing computational tools and pipelines. However, the effectiveness of computational approaches is based on the understanding of physiological and biochemical properties, transport kinetics, solute specificity, molecular interactions, sequence variations, phylogeny and evolution of aquaporins. For this purpose, tools like Xenopus oocyte assays, yeast expression systems, artificial proteoliposomes, and lipid membranes have been efficiently exploited to study the many facets that influence solute transport by AQPs. In the present review, we discuss genome-wide identification of AQPs in plants in relation with recent advancements in analytical tools, and their availability and technological challenges as they apply to AQPs. An exhaustive review of omics resources available for AQP research is also provided in order to optimize their efficient utilization. Finally, a detailed catalog of computational tools and analytical pipelines is

  14. Overview of the Development of Personalized Genomic Medicine and Surgery

    PubMed Central

    Gibbs, Richard A.; Wheeler, David A.; Nemunaitis, John; Fisher, William; Goss, John; Chen, Changyi

    2012-01-01

    Personalized genomic medicine and surgery (PGMS) represents a new approach to health care that customizes patients’ medical treatment according to their own genetic information. This new approach is the result of increased knowledge of the human genome and ways this information can be applied by physicians in the medical and surgical management of their patients. A patient’s genotype can yield important information concerning disease susceptibility and the effectiveness of medications, therefore guiding specific, targeted imaging and treatment therapies. This review summarizes major achievements of human genomic studies and applications of genomics in health care. Five years ago we developed a model for the development of PGMS in which genomic profile guides choice of therapy. In this article we discussed our progress, including an updating of the model, and a future vision of PGMS. PMID:21424870

  15. Patient-derived cell models as preclinical tools for genome-directed targeted therapy

    PubMed Central

    Jang, Jiryeon; Park, Kyunghee; Yi, Jun Ho; Hong, Mineui; Ahn, Taejin; Rath, Oliver; Schueler, Julia; Kim, Seung Tae; Do, In-Gu; Lee, Sujin; Park, Se Hoon; Ji, Yong Ick; Kim, Dukwhan; Park, Joon Oh; Park, Young Suk; Kang, Won Ki; Kim, Kyoung-Mee; Park, Woong-Yang; Lim, Ho Yeong; Lee, Jeeyun

    2015-01-01

    Background In this study, we established patient-derived tumor cell (PDC) models using tissues collected from patients with metastatic cancer and assessed whether these models could be used as a tool for genome-based cancer treatment. Methods PDCs were isolated and cultured from malignant effusions including ascites and pleural fluid. Pathological examination, immunohistochemical analysis, and genomic profiling were performed to compare the histological and genomic features of primary tumors, PDCs. An exploratory gene expression profiling assay was performed to further characterize PDCs. Results From January 2012 to May 2013, 176 samples from patients with metastatic cancer were collected. PDC models were successfully established in 130 (73.6%) samples. The median time from specimen collection to passage 1 (P1) was 3 weeks (range, 0.5–4 weeks), while that from P1 to P2 was 2.5 weeks (range, 0.5–5 weeks). Sixteen paired samples of genomic alterations were highly concordant between each primary tumor and progeny PDCs, with an average variant allele frequency (VAF) correlation of 0.878. We compared genomic profiles of the primary tumor (P0), P1 cells, P2 cells, and patient-derived xenografts (PDXs) derived from P2 cells and found that three samples (P0, P1, and P2 cells) were highly correlated (0.99–1.00). Moreover, PDXs showed more than 100 variants, with correlations of only 0.6–0.8 for the other samples. Drug responses of PDCs were reflective of the clinical response to targeted agents in selected patient PDC lines. Conclusion(s) Our results provided evidence that our PDC model was a promising model for preclinical experiments and closely resembled the patient tumor genome and clinical response. PMID:26296973

  16. Ensembl Plants: Integrating Tools for Visualizing, Mining, and Analyzing Plant Genomics Data.

    PubMed

    Bolser, Dan; Staines, Daniel M; Pritchard, Emily; Kersey, Paul

    2016-01-01

    Ensembl Plants ( http://plants.ensembl.org ) is an integrative resource presenting genome-scale information for a growing number of sequenced plant species (currently 33). Data provided includes genome sequence, gene models, functional annotation, and polymorphic loci. Various additional information are provided for variation data, including population structure, individual genotypes, linkage, and phenotype data. In each release, comparative analyses are performed on whole genome and protein sequences, and genome alignments and gene trees are made available that show the implied evolutionary history of each gene family. Access to the data is provided through a genome browser incorporating many specialist interfaces for different data types, and through a variety of additional methods for programmatic access and data mining. These access routes are consistent with those offered through the Ensembl interface for the genomes of non-plant species, including those of plant pathogens, pests, and pollinators.Ensembl Plants is updated 4-5 times a year and is developed in collaboration with our international partners in the Gramene ( http://www.gramene.org ) and transPLANT projects ( http://www.transplantdb.org ).

  17. Engineering plastid genomes: methods, tools, and applications in basic research and biotechnology.

    PubMed

    Bock, Ralph

    2015-01-01

    The small bacterial-type genome of the plastid (chloroplast) can be engineered by genetic transformation, generating cells and plants with transgenic plastid genomes, also referred to as transplastomic plants. The transformation process relies on homologous recombination, thereby facilitating the site-specific alteration of endogenous plastid genes as well as the precisely targeted insertion of foreign genes into the plastid DNA. The technology has been used extensively to analyze chloroplast gene functions and study plastid gene expression at all levels in vivo. Over the years, a large toolbox has been assembled that is now nearly comparable to the techniques available for plant nuclear transformation and that has enabled new applications of transplastomic technology in basic and applied research. This review describes the state of the art in engineering the plastid genomes of algae and land plants (Embryophyta). It provides an overview of the existing tools for plastid genome engineering, discusses current technological limitations, and highlights selected applications that demonstrate the immense potential of chloroplast transformation in several key areas of plant biotechnology.

  18. Workshop on Software Development Tools for Petascale Computing

    SciTech Connect

    Vetter, Jeffrey

    2007-08-01

    Petascale computing systems will soon be available to the DOE science community. Recent studies in the productivity of HPC platforms point to better software environments as a key enabler to science on these systems. To prepare for the deployment and productive use of these petascale platforms, the DOE science and general HPC community must have the software development tools, such as performance analyzers and debuggers that meet application requirements for scalability, functionality, reliability, and ease of use. In this report, we identify and prioritize the research opportunities in the area of software development tools for high performance computing. To facilitate this effort, DOE hosted a group of 55 leading international experts in this area at the Software Development Tools for PetaScale Computing (SDTPC) Workshop, which was held in Washington, D.C. on August 1 and 2, 2007. Software development tools serve as an important interface between the application teams and the target HPC architectures. Broadly speaking, these roles can be decomposed into three categories: performance tools, correctness tools, and development environments. Accordingly, this SDTPC report has four technical thrusts: performance tools, correctness tools, development environment infrastructures, and scalable tool infrastructures. The last thrust primarily targets tool developers per se, rather than end users. Finally, this report identifies non-technical strategic challenges that impact most tool development. The organizing committee emphasizes that many critical areas are outside the scope of this charter; these important areas include system software, compilers, and I/O.

  19. Stakeholder engagement in policy development: challenges and opportunities for human genomics.

    PubMed

    Lemke, Amy A; Harris-Wai, Julie N

    2015-12-01

    Along with rapid advances in human genomics, policies governing genomic data and clinical technologies have proliferated. Stakeholder engagement is widely lauded as an important methodology for improving clinical, scientific, and public health policy decision making. The purpose of this paper is to examine how stakeholder engagement is used to develop policies in genomics research and public health areas, as well as to identify future priorities for conducting evidence-based stakeholder engagements. We focus on exemplars in biobanking and newborn screening to illustrate a variety of current stakeholder engagement in policy-making efforts. Each setting provides an important context for examining the methods of obtaining and integrating informed stakeholder voices into the policy-making process. While many organizations have an interest in engaging stakeholders with regard to genomic policy issues, there is broad divergence with respect to the stakeholders involved, the purpose of engagements, when stakeholders are engaged during policy development, methods of engagement, and the outcomes reported. Stakeholder engagement in genomics policy development is still at a nascent stage. Several challenges of using stakeholder engagement as a tool for genomics policy development remain, and little evidence regarding how to best incorporate stakeholder feedback into policy-making processes is currently available.

  20. Stakeholder engagement in policy development: challenges and opportunities for human genomics

    PubMed Central

    Lemke, Amy A.; Harris-Wai, Julie N.

    2015-01-01

    Along with rapid advances in human genomics, policies governing genomic data and clinical technologies have proliferated. Stakeholder engagement is widely lauded as an important methodology for improving clinical, scientific, and public health policy decision making. The purpose of this paper is to examine how stakeholder engagement is used to develop policies in genomics research and public health areas, as well as to identify future priorities for conducting evidence-based stakeholder engagements. We focus on exemplars in biobanking and newborn screening to illustrate a variety of current stakeholder engagement in policy-making efforts. Each setting provides an important context for examining the methods of obtaining and integrating informed stakeholder voices into the policy-making process. While many organizations have an interest in engaging stakeholders with regard to genomic policy issues, there is broad divergence with respect to the stakeholders involved, the purpose of engagements, when stakeholders are engaged during policy development, methods of engagement, and the outcomes reported. Stakeholder engagement in genomics policy development is still at a nascent stage. Several challenges of using stakeholder engagement as a tool for genomics policy development remain, and little evidence regarding how to best incorporate stakeholder feedback into policy-making processes is currently available. PMID:25764215

  1. Novel R tools for analysis of genome-wide population genetic data with emphasis on clonality

    PubMed Central

    Kamvar, Zhian N.; Brooks, Jonah C.; Grünwald, Niklaus J.

    2015-01-01

    To gain a detailed understanding of how plant microbes evolve and adapt to hosts, pesticides, and other factors, knowledge of the population dynamics and evolutionary history of populations is crucial. Plant pathogen populations are often clonal or partially clonal which requires different analytical tools. With the advent of high throughput sequencing technologies, obtaining genome-wide population genetic data has become easier than ever before. We previously contributed the R package poppr specifically addressing issues with analysis of clonal populations. In this paper we provide several significant extensions to poppr with a focus on large, genome-wide SNP data. Specifically, we provide several new functionalities including the new function mlg.filter to define clone boundaries allowing for inspection and definition of what is a clonal lineage, minimum spanning networks with reticulation, a sliding-window analysis of the index of association, modular bootstrapping of any genetic distance, and analyses across any level of hierarchies. PMID:26113860

  2. Predictive computation of genomic logic processing functions in embryonic development

    PubMed Central

    Peter, Isabelle S.; Faure, Emmanuel; Davidson, Eric H.

    2012-01-01

    Gene regulatory networks (GRNs) control the dynamic spatial patterns of regulatory gene expression in development. Thus, in principle, GRN models may provide system-level, causal explanations of developmental process. To test this assertion, we have transformed a relatively well-established GRN model into a predictive, dynamic Boolean computational model. This Boolean model computes spatial and temporal gene expression according to the regulatory logic and gene interactions specified in a GRN model for embryonic development in the sea urchin. Additional information input into the model included the progressive embryonic geometry and gene expression kinetics. The resulting model predicted gene expression patterns for a large number of individual regulatory genes each hour up to gastrulation (30 h) in four different spatial domains of the embryo. Direct comparison with experimental observations showed that the model predictively computed these patterns with remarkable spatial and temporal accuracy. In addition, we used this model to carry out in silico perturbations of regulatory functions and of embryonic spatial organization. The model computationally reproduced the altered developmental functions observed experimentally. Two major conclusions are that the starting GRN model contains sufficiently complete regulatory information to permit explanation of a complex developmental process of gene expression solely in terms of genomic regulatory code, and that the Boolean model provides a tool with which to test in silico regulatory circuitry and developmental perturbations. PMID:22927416

  3. ARG-ANNOT, a New Bioinformatic Tool To Discover Antibiotic Resistance Genes in Bacterial Genomes

    PubMed Central

    Gupta, Sushim Kumar; Padmanabhan, Babu Roshan; Diene, Seydina M.; Lopez-Rojas, Rafael; Kempf, Marie; Landraud, Luce

    2014-01-01

    ARG-ANNOT (Antibiotic Resistance Gene-ANNOTation) is a new bioinformatic tool that was created to detect existing and putative new antibiotic resistance (AR) genes in bacterial genomes. ARG-ANNOT uses a local BLAST program in Bio-Edit software that allows the user to analyze sequences without a Web interface. All AR genetic determinants were collected from published works and online resources; nucleotide and protein sequences were retrieved from the NCBI GenBank database. After building a database that includes 1,689 antibiotic resistance genes, the software was tested in a blind manner using 100 random sequences selected from the database to verify that the sensitivity and specificity were at 100% even when partial sequences were queried. Notably, BLAST analysis results obtained using the rmtF gene sequence (a new aminoglycoside-modifying enzyme gene sequence that is not included in the database) as a query revealed that the tool was able to link this sequence to short sequences (17 to 40 bp) found in other genes of the rmt family with significant E values. Finally, the analysis of 178 Acinetobacter baumannii and 20 Staphylococcus aureus genomes allowed the detection of a significantly higher number of AR genes than the Resfinder gene analyzer and 11 point mutations in target genes known to be associated with AR. The average time for the analysis of a genome was 3.35 ± 0.13 min. We have created a concise database for BLAST using a Bio-Edit interface that can detect AR genetic determinants in bacterial genomes and can rapidly and easily discover putative new AR genetic determinants. PMID:24145532

  4. Optimized CRISPR/Cas tools for efficient germline and somatic genome engineering in Drosophila.

    PubMed

    Port, Fillip; Chen, Hui-Min; Lee, Tzumin; Bullock, Simon L

    2014-07-22

    The type II clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas) system has emerged recently as a powerful method to manipulate the genomes of various organisms. Here, we report a toolbox for high-efficiency genome engineering of Drosophila melanogaster consisting of transgenic Cas9 lines and versatile guide RNA (gRNA) expression plasmids. Systematic evaluation reveals Cas9 lines with ubiquitous or germ-line-restricted patterns of activity. We also demonstrate differential activity of the same gRNA expressed from different U6 snRNA promoters, with the previously untested U6:3 promoter giving the most potent effect. An appropriate combination of Cas9 and gRNA allows targeting of essential and nonessential genes with transmission rates ranging from 25-100%. We also demonstrate that our optimized CRISPR/Cas tools can be used for offset nicking-based mutagenesis. Furthermore, in combination with oligonucleotide or long double-stranded donor templates, our reagents allow precise genome editing by homology-directed repair with rates that make selection markers unnecessary. Last, we demonstrate a novel application of CRISPR/Cas-mediated technology in revealing loss-of-function phenotypes in somatic cells following efficient biallelic targeting by Cas9 expressed in a ubiquitous or tissue-restricted manner. Our CRISPR/Cas tools will facilitate the rapid evaluation of mutant phenotypes of specific genes and the precise modification of the genome with single-nucleotide precision. Our results also pave the way for high-throughput genetic screening with CRISPR/Cas.

  5. Improvements to pairwise sequence comparison (PASC): a genome-based web tool for virus classification.

    PubMed

    Bao, Yiming; Chetvernin, Vyacheslav; Tatusova, Tatiana

    2014-12-01

    The number of viral genome sequences in the public databases is increasing dramatically, and these sequences are playing an important role in virus classification. Pairwise sequence comparison is a sequence-based virus classification method. A program using this method calculates the pairwise identities of virus sequences within a virus family and displays their distribution, and visual analysis helps to determine demarcations at different taxonomic levels such as strain, species, genus and subfamily. Subsequent comparison of new sequences against existing ones allows viruses from which the new sequences were derived to be classified. Although this method cannot be used as the only criterion for virus classification in some cases, it is a quantitative method and has many advantages over conventional virus classification methods. It has been applied to several virus families, and there is an increasing interest in using this method for other virus families/groups. The Pairwise Sequence Comparison (PASC) classification tool was created at the National Center for Biotechnology Information. The tool's database stores pairwise identities for complete genomes/segments of 56 virus families/groups. Data in the system are updated every day to reflect changes in virus taxonomy and additions of new virus sequences to the public database. The web interface of the tool ( http://www.ncbi.nlm.nih.gov/sutils/pasc/ ) makes it easy to navigate and perform analyses. Multiple new viral genome sequences can be tested simultaneously with this system to suggest the taxonomic position of virus isolates in a specific family. PASC eliminates potential discrepancies in the results caused by different algorithms and/or different data used by researchers.

  6. Reverse Vaccinology: Developing Vaccines in the Era of Genomics

    PubMed Central

    Sette, Alessandro; Rappuoli, Rino

    2012-01-01

    The sequence of microbial genomes made all potential antigens of each pathogen available for vaccine development. This increased by orders of magnitude potential vaccine targets in bacteria, parasites, and large viruses and revealed virtually all their CD4+ and CD8+ T cell epitopes. The genomic information was first used for the development of a vaccine against serogroup B meningococcus, and it is now being used for several other bacterial vaccines. In this review, we will first summarize the impact that genome sequencing has had on vaccine development, and then we will analyze how the genomic information can help further our understanding of immunity to infection or vaccination and lead to the design of better vaccines by diving into the world of T cell immunity. PMID:21029963

  7. Swine models, genomic tools and services to enhance our understanding of human health and diseases.

    PubMed

    Walters, Eric M; Wells, Kevin D; Bryda, Elizabeth C; Schommer, Susan; Prather, Randall S

    2017-03-22

    The pig is becoming increasingly important as a biomedical model. Given the similarities between pigs and humans, a greater understanding of the underlying biology of human health and diseases may come from the pig rather than from classical rodent models. With an increasing need for swine models, it is essential that the genomic tools, models and services be readily available to the scientific community. Many of these are available through the National Swine Resource and Research Center (NSRRC), a facility funded by the US National Institutes of Health at the University of Missouri. The goal of the NSRRC is to provide high-quality biomedical swine models to the scientific community.

  8. Tool Forces Developed During Friction Stir Welding

    NASA Technical Reports Server (NTRS)

    Melendez, M.; Tang, W.; Schmidt, C.; McClure, J. C.; Nunes, A. C.; Murr, L. E.

    2003-01-01

    This paper will describe a technique for measuring the various forces and the torque that exist on the Friction Stir Welding pin tool. Results for various plunge depths, weld speeds, rotational speed, and tool configurations will be presented. Welds made on 6061 aluminum with typical welding conditions require a downward force of 2800 lbs. (12.5 kN) a longitudinal force in the direction of motion of 300 lbs (1.33 kN), a transverse force in the omega x v direction of 30 lbs (135 N). Aluminum 2195 under typical weld conditions requires a downward force of 3100 lbs. (1.38 kN), a longitudinal force of 920 lbs. (4.1 kN), and a transverse force of 45 lbs. (200 N) in the omega x v direction.

  9. Hull Form Design and Optimization Tool Development

    DTIC Science & Technology

    2012-07-01

    Combatants • Unconventional Surface Combatants • Twin - Screw , Open-Stern, UNREP and Auxiliary • Single- Screw , Closed-Stern, UNREP and Auxiliary • Single...Waterplane Area Twin Hull TriSWACH – Trimaran Small Waterplane Area Center Hull UI – User Interface UNREP – Underway Replenishment Naval Surface...resistance prediction tool for SWATH (Small Waterplane Area Twin Hull) ships. An example of a SWATH ship is shown in Figure 11. The input files

  10. Computational Tools for Accelerating Carbon Capture Process Development

    SciTech Connect

    Miller, David; Sahinidis, N V; Cozad, A; Lee, A; Kim, H; Morinelly, J; Eslick, J; Yuan, Z

    2013-06-04

    This presentation reports development of advanced computational tools to accelerate next generation technology development. These tools are to develop an optimized process using rigorous models. They include: Process Models; Simulation-Based Optimization; Optimized Process; Uncertainty Quantification; Algebraic Surrogate Models; and Superstructure Optimization (Determine Configuration).

  11. Spatial organization of genome architecture in neuronal development and disease.

    PubMed

    Fujita, Yuki; Yamashita, Toshihide

    2017-07-28

    Although mammalian genomes encode genetic information in their linear sequences, their fundamental function with regard to gene expression depends on the higher-order structure of chromosomes. Current techniques for the evaluation of chromosomal structure have revealed that genomes are arranged at several hierarchical levels in three-dimensional space. The spatial organization of genomes involves the formation of chromatin loops that bypass a wide range of genomic distances, providing a connection between enhancers and chromosomal domains. Furthermore, they form chromatin domains that are arranged into chromosome territories in the three-dimensional space of the cell nucleus. Recent studies have shown that the spatial organization of the genome is essential for normal brain development and function. Activity-dependent alterations in the spatial organization of the genome can regulate transcriptional activity related to neuronal plasticity. Disruptions in the higher-order chromatin architecture have been implicated in neuropsychiatric disorders, such as cognitive dysfunction and anxiety. Here, we discuss the growing interest in the role of genome organization in brain development and neurological disorders. Copyright © 2017. Published by Elsevier Ltd.

  12. Algal Functional Annotation Tool from the DOE-UCLA Institute for Genomics and Proteomics

    DOE Data Explorer

    Lopez, David

    The Algal Functional Annotation Tool is a bioinformatics resource to visualize pathway maps, identify enriched biological terms, or convert gene identifiers to elucidate biological function in silico. These types of analysis have been catered to support lists of gene identifiers, such as those coming from transcriptome gene expression analysis. By analyzing the functional annotation of an interesting set of genes, common biological motifs may be elucidated and a first-pass analysis can point further research in the right direction. Currently, the following databases have been parsed, processed, and added to the tool: 1( Kyoto Encyclopedia of Genes and Genomes (KEGG) Pathways Database, 2) MetaCyc Encyclopedia of Metabolic Pathways, 3) Panther Pathways Database, 4) Reactome Pathways Database, 5) Gene Ontology, 6) MapMan Ontology, 7) KOG (Eukaryotic Clusters of Orthologous Groups), 5)Pfam, 6) InterPro.

  13. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research.

    PubMed

    Conesa, Ana; Götz, Stefan; García-Gómez, Juan Miguel; Terol, Javier; Talón, Manuel; Robles, Montserrat

    2005-09-15

    We present here Blast2GO (B2G), a research tool designed with the main purpose of enabling Gene Ontology (GO) based data mining on sequence data for which no GO annotation is yet available. B2G joints in one application GO annotation based on similarity searches with statistical analysis and highlighted visualization on directed acyclic graphs. This tool offers a suitable platform for functional genomics research in non-model species. B2G is an intuitive and interactive desktop application that allows monitoring and comprehension of the whole annotation and analysis process. Blast2GO is freely available via Java Web Start at http://www.blast2go.de. http://www.blast2go.de -> Evaluation.

  14. Developing a platform for genomic medicine in Mexico.

    PubMed

    Jimenez-Sanchez, Gerardo

    2003-04-11

    Mexico is preparing to develop a genomic medicine program focused on national health problems. Modern Mexicans result from an admixture of more than 65 native Indian groups with Spaniards, leading to a unique genetic makeup and a characteristic set of disease susceptibilities. Since 1999, more than 100 experts from different fields have joined efforts with government, academia, and industry to identify priorities and goals for genomic medicine in Mexico. The plan includes establishment of an Institute of Genomic Medicine with strong intramural and extramural programs. This project is expected to ease the social and financial burden of health problems in Mexico.

  15. GAPIT version 2: an enhanced integrated tool for genomic association and prediction

    USDA-ARS?s Scientific Manuscript database

    Most human diseases and agriculturally important traits are complex. Dissecting their genetic architecture requires continued development of innovative and powerful statistical methods. Corresponding advances in computing tools are critical to efficiently use these statistical innovations and to enh...

  16. Genome Partitioner: A web tool for multi-level partitioning of large-scale DNA constructs for synthetic biology applications

    PubMed Central

    Del Medico, Luca; Christen, Heinz; Christen, Beat

    2017-01-01

    Recent advances in lower-cost DNA synthesis techniques have enabled new innovations in the field of synthetic biology. Still, efficient design and higher-order assembly of genome-scale DNA constructs remains a labor-intensive process. Given the complexity, computer assisted design tools that fragment large DNA sequences into fabricable DNA blocks are needed to pave the way towards streamlined assembly of biological systems. Here, we present the Genome Partitioner software implemented as a web-based interface that permits multi-level partitioning of genome-scale DNA designs. Without the need for specialized computing skills, biologists can submit their DNA designs to a fully automated pipeline that generates the optimal retrosynthetic route for higher-order DNA assembly. To test the algorithm, we partitioned a 783 kb Caulobacter crescentus genome design. We validated the partitioning strategy by assembling a 20 kb test segment encompassing a difficult to synthesize DNA sequence. Successful assembly from 1 kb subblocks into the 20 kb segment highlights the effectiveness of the Genome Partitioner for reducing synthesis costs and timelines for higher-order DNA assembly. The Genome Partitioner is broadly applicable to translate DNA designs into ready to order sequences that can be assembled with standardized protocols, thus offering new opportunities to harness the diversity of microbial genomes for synthetic biology applications. The Genome Partitioner web tool can be accessed at https://christenlab.ethz.ch/GenomePartitioner. PMID:28531174

  17. Developing A SPOT CRM Debriefing Tool

    NASA Technical Reports Server (NTRS)

    Martin, Lynne; Villeda, Eric; Orasanu, Judith; Connors, Mary M. (Technical Monitor)

    1998-01-01

    In a study of CRM LOFT briefings published in 1997, Dismukes, McDonnell & Jobe reported that briefings were not being utilized as fully as they could be and that crews may not be getting the full benefit from LOFT that is possible. On the basis of their findings, they suggested a set of general guidelines for briefings for the industry. Our work builds on this study to try to provide a specific debriefing tool which provides a focus for the strategies that Dismukes et al suggest.

  18. Different whole-genome amplification methods as a preamplification tool in Y-chromosome Loci analysis.

    PubMed

    Maciejewska, Agnieszka; Jakubowska, Joanna; Pawłowski, Ryszard

    2014-06-01

    Degraded and low template DNA is often analyzed in forensic genetics laboratories. Reliable analysis of degraded and low template DNA is of great importance, because its results impact the quality and reliability of expert testimonies. Recently, a number of whole-genome amplification (WGA) methods have been proposed as preamplification tools improving quantity and quality of DNA. We chose, investigated, and compared 7 WGA methods to evaluate their ability to "recover" degraded and nondegraded DNA. These methods include degenerate oligonucleotide primed polymerase chain reaction, primer extension preamplification (PEP) polymerase chain reaction, GenomePlex WGA (Sigma), multiple displacement amplification, GenomiPhi Amplification Kit (Amersham Biosciences), restriction and circularization aided rolling circle amplification, and blunt-end ligation-mediated WGA. Recently, we have described the comparison of these methods' efficiency and reliability using SGMPlus kit. However, Y-chromosome profiling is also often used in analysis of both nondegraded and degraded DNA. This includes criminal cases and investigation of kinship in male linage. Here we demonstrate the impact of WGA methods on Y-chromosome loci (Yfiler) reactivation.The best results for nondegraded DNA were obtained with GenomiPhi kit and PEP method. In the case of degraded DNA (200 base pairs), the most complete profiles were obtained with GenomePlex kit and PEP method. None of the analyzed methods allowed full reactivation of degraded (200 base pairs) DNA in terms of Y-chromosome loci profiling.

  19. Pathway Tools version 13.0: integrated software for pathway/genome informatics and systems biology.

    PubMed

    Karp, Peter D; Paley, Suzanne M; Krummenacker, Markus; Latendresse, Mario; Dale, Joseph M; Lee, Thomas J; Kaipa, Pallavi; Gilham, Fred; Spaulding, Aaron; Popescu, Liviu; Altman, Tomer; Paulsen, Ian; Keseler, Ingrid M; Caspi, Ron

    2010-01-01

    Pathway Tools is a production-quality software environment for creating a type of model-organism database called a Pathway/Genome Database (PGDB). A PGDB such as EcoCyc integrates the evolving understanding of the genes, proteins, metabolic network and regulatory network of an organism. This article provides an overview of Pathway Tools capabilities. The software performs multiple computational inferences including prediction of metabolic pathways, prediction of metabolic pathway hole fillers and prediction of operons. It enables interactive editing of PGDBs by DB curators. It supports web publishing of PGDBs, and provides a large number of query and visualization tools. The software also supports comparative analyses of PGDBs, and provides several systems biology analyses of PGDBs including reachability analysis of metabolic networks, and interactive tracing of metabolites through a metabolic network. More than 800 PGDBs have been created using Pathway Tools by scientists around the world, many of which are curated DBs for important model organisms. Those PGDBs can be exchanged using a peer-to-peer DB sharing system called the PGDB Registry.

  20. Pathway Tools version 13.0: integrated software for pathway/genome informatics and systems biology

    PubMed Central

    Paley, Suzanne M.; Krummenacker, Markus; Latendresse, Mario; Dale, Joseph M.; Lee, Thomas J.; Kaipa, Pallavi; Gilham, Fred; Spaulding, Aaron; Popescu, Liviu; Altman, Tomer; Paulsen, Ian; Keseler, Ingrid M.; Caspi, Ron

    2010-01-01

    Pathway Tools is a production-quality software environment for creating a type of model-organism database called a Pathway/Genome Database (PGDB). A PGDB such as EcoCyc integrates the evolving understanding of the genes, proteins, metabolic network and regulatory network of an organism. This article provides an overview of Pathway Tools capabilities. The software performs multiple computational inferences including prediction of metabolic pathways, prediction of metabolic pathway hole fillers and prediction of operons. It enables interactive editing of PGDBs by DB curators. It supports web publishing of PGDBs, and provides a large number of query and visualization tools. The software also supports comparative analyses of PGDBs, and provides several systems biology analyses of PGDBs including reachability analysis of metabolic networks, and interactive tracing of metabolites through a metabolic network. More than 800 PGDBs have been created using Pathway Tools by scientists around the world, many of which are curated DBs for important model organisms. Those PGDBs can be exchanged using a peer-to-peer DB sharing system called the PGDB Registry. PMID:19955237

  1. PentaPlot: A software tool for the illustration of genome mosaicism

    PubMed Central

    Hamel, Lutz; Zhaxybayeva, Olga; Gogarten, J Peter

    2005-01-01

    Background Dekapentagonal maps depict the phylogenetic relationships of five genomes in a visually appealing diagram and can be viewed as an alternative to a single evolutionary consensus tree. In particular, the generated maps focus attention on those gene families that significantly deviate from the consensus or plurality phylogeny. PentaPlot is a software tool that computes such dekapentagonal maps given an appropriate probability support matrix. Results The visualization with dekapentagonal maps critically depends on the optimal layout of unrooted tree topologies representing different evolutionary relationships among five organisms along the vertices of the dekapentagon. This is a difficult optimization problem given the large number of possible layouts. At its core our tool utilizes a genetic algorithm with demes and a local search strategy to search for the optimal layout. The hybrid genetic algorithm performs satisfactorily even in those cases where the chosen genomes are so divergent that little phylogenetic information has survived in the individual gene families. Conclusion PentaPlot is being made publicly available as an open source project at . PMID:15938752

  2. Hordeum chilense genome, a useful tool to investigate the endosperm yellow pigment content in the Triticeae

    PubMed Central

    2012-01-01

    Background The wild barley Hordeum chilense fulfills some requirements for being a useful tool to investigate the endosperm yellow pigment content (YPC) in the Triticeae including its diploid constitution, the availability of genetic resources (addition and deletion stocks and a high density genetic map) and, especially, its high seed YPC not silenced in tritordeums (amphiploids derived from H. chilense and wheat). Thus, the aim of this work was to test the utility of the H. chilense genome for investigating the YPC in the Triticeae. Results Twelve genes related to endosperm carotenoid content and/or YPC in grasses (Dxr, Hdr [synonym ispH], Ggpps1, Psy2, Psy3, Pds, Zds, e-Lcy, b-Lcy, Hyd3, Ccd1 and Ppo1) were identified, and mapped in H. chilense using rice genes to identify orthologs from barley, wheat, sorghum and maize. Macrocolinearity studies revealed that gene positions were in agreement in H. vulgare and H. chilense. Additionally, three main regions associated with YPC were identified in chromosomes 2Hch, 3Hch and 7Hch in H. chilense, the former being the most significant one. Conclusions The results obtained are consistent with previous findings in wheat and suggest that Ggpps1, Zds and Hyd3 on chromosome 2Hch may be considered candidate genes in wheat for further studies in YPC improvement. Considering the syntenic location of carotenoid genes in H. chilense, we have concluded that the Hch genome may constitute a valuable tool for YPC studies in the Triticeae. PMID:23122232

  3. Hordeum chilense genome, a useful tool to investigate the endosperm yellow pigment content in the Triticeae.

    PubMed

    Rodríguez-Suárez, Cristina; Atienza, Sergio G

    2012-11-02

    The wild barley Hordeum chilense fulfills some requirements for being a useful tool to investigate the endosperm yellow pigment content (YPC) in the Triticeae including its diploid constitution, the availability of genetic resources (addition and deletion stocks and a high density genetic map) and, especially, its high seed YPC not silenced in tritordeums (amphiploids derived from H. chilense and wheat). Thus, the aim of this work was to test the utility of the H. chilense genome for investigating the YPC in the Triticeae. Twelve genes related to endosperm carotenoid content and/or YPC in grasses (Dxr, Hdr [synonym ispH], Ggpps1, Psy2, Psy3, Pds, Zds, e-Lcy, b-Lcy, Hyd3, Ccd1 and Ppo1) were identified, and mapped in H. chilense using rice genes to identify orthologs from barley, wheat, sorghum and maize. Macrocolinearity studies revealed that gene positions were in agreement in H. vulgare and H. chilense. Additionally, three main regions associated with YPC were identified in chromosomes 2Hch, 3Hch and 7Hch in H. chilense, the former being the most significant one. The results obtained are consistent with previous findings in wheat and suggest that Ggpps1, Zds and Hyd3 on chromosome 2Hch may be considered candidate genes in wheat for further studies in YPC improvement. Considering the syntenic location of carotenoid genes in H. chilense, we have concluded that the Hch genome may constitute a valuable tool for YPC studies in the Triticeae.

  4. Machine tool evaluation and machining operation development

    SciTech Connect

    Morris, T.O.; Kegg, R.

    1997-03-15

    The purpose of this CRADA was to support Cincinnati Milacron`s needs in fabricating precision components, from difficult to machine materials, while maintaining and enhancing the precision manufacturing skills of the Oak Ridge Complex. Oak Ridge and Cincinnati Milacron personnel worked in a team relationship wherein each contributed equally to the success of the program. Process characterization, control technologies, machine tool capabilities, and environmental issues were the primary focus areas. In general, Oak Ridge contributed a wider range of expertise in machine tool testing and monitoring, and environmental testing on machining fluids to the defined tasks while Cincinnati Milacron personnel provided equipment, operations-specific knowledge and shop-floor services to each task. Cincinnati Milacron was very pleased with the results of all of the CRADA tasks. However, some of the environmental tasks were not carried through to a desired completion due to an expanding realization of need as the work progressed. This expansion of the desired goals then exceeded the time length of the CRADA. Discussions are underway on continuing these tasks under either a Work for Others agreement or some alternate funding.

  5. Applying CASE Tools for On-Board Software Development

    NASA Astrophysics Data System (ADS)

    Brammer, U.; Hönle, A.

    For many space projects the software development is facing great pressure with respect to quality, costs and schedule. One way to cope with these challenges is the application of CASE tools for automatic generation of code and documentation. This paper describes two CASE tools: Rhapsody (I-Logix) featuring UML and ISG (BSSE) that provides modeling of finite state machines. Both tools have been used at Kayser-Threde in different space projects for the development of on-board software. The tools are discussed with regard to the full software development cycle.

  6. A flexible multi-species genome-wide 60K SNP chip developed from pooled resequencing of 240 Eucalyptus tree genomes across 12 species.

    PubMed

    Silva-Junior, Orzenil B; Faria, Danielle A; Grattapaglia, Dario

    2015-06-01

    We used whole genome resequencing of pooled individuals to develop a high-density single-nucleotide polymorphism (SNP) chip for Eucalyptus. Genomes of 240 trees of 12 species were sequenced at 3.5× each, and 46 997 586 raw SNP variants were subject to multivariable filtering metrics toward a multispecies, genome-wide distributed chip content. Of the 60 904 SNPs on the chip, 59 222 were genotyped and 51 204 were polymorphic across 14 Eucalyptus species, providing a 96% genome-wide coverage with 1 SNP/12-20 kb, and 47 069 SNPs at ≤ 10 kb from 30 444 of the 33 917 genes in the Eucalyptus genome. Given the EUChip60K multi-species genotyping flexibility, we show that both the sample size and taxonomic composition of cluster files impact heterozygous call specificity and sensitivity by benchmarking against 'gold standard' genotypes derived from deeply sequenced individual tree genomes. Thousands of SNPs were shared across species, likely representing ancient variants arisen before the split of these taxa, hinting to a recent eucalypt radiation. We show that the variable SNP filtering constraints allowed coverage of the entire site frequency spectrum, mitigating SNP ascertainment bias. The EUChip60K represents an outstanding tool with which to address population genomics questions in Eucalyptus and to empower genomic selection, GWAS and the broader study of complex trait variation in eucalypts. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  7. Using Genomics to Dissect Seed Development (JGI Seventh Annual User Meeting 2012: Genomics of Energy and Environment Meeting)

    ScienceCinema

    Goldberg, Robert [UCLA

    2016-07-12

    Robert Goldberg of UCLA presents "Using Genomics to Dissect Seed Development" at the JGI 7th Annual Users Meeting: Genomics of Energy & Environment Meeting on March 22, 2012 in Walnut Creek, California.

  8. Using Genomics to Dissect Seed Development (JGI Seventh Annual User Meeting 2012: Genomics of Energy and Environment Meeting)

    SciTech Connect

    Goldberg, Robert

    2012-03-21

    Robert Goldberg of UCLA presents "Using Genomics to Dissect Seed Development" at the JGI 7th Annual Users Meeting: Genomics of Energy & Environment Meeting on March 22, 2012 in Walnut Creek, California.

  9. Empowered genome community: leveraging a bioinformatics platform as a citizen-scientist collaboration tool.

    PubMed

    Wendelsdorf, Katherine; Shah, Sohela

    2015-09-01

    There is on-going effort in the biomedical research community to leverage Next Generation Sequencing (NGS) technology to identify genetic variants that affect our health. The main challenge facing researchers is getting enough samples from individuals either sick or healthy - to be able to reliably identify the few variants that are causal for a phenotype among all other variants typically seen among individuals. At the same time, more and more individuals are having their genome sequenced either out of curiosity or to identify the cause of an illness. These individuals may benefit from of a way to view and understand their data. QIAGEN's Ingenuity Variant Analysis is an online application that allows users with and without extensive bioinformatics training to incorporate information from published experiments, genetic databases, and a variety of statistical models to identify variants, from a long list of candidates, that are most likely causal for a phenotype as well as annotate variants with what is already known about them in the literature and databases. Ingenuity Variant Analysis is also an information sharing platform where users may exchange samples and analyses. The Empowered Genome Community (EGC) is a new program in which QIAGEN is making this on-line tool freely available to any individual who wishes to analyze their own genetic sequence. EGC members are then able to make their data available to other Ingenuity Variant Analysis users to be used in research. Here we present and describe the Empowered Genome Community in detail. We also present a preliminary, proof-of-concept study that utilizes the 200 genomes currently available through the EGC. The goal of this program is to allow individuals to access and understand their own data as well as facilitate citizen-scientist collaborations that can drive research forward and spur quality scientific dialogue in the general public.

  10. Development of CAM System for Multi-Tasking Machine Tools

    NASA Astrophysics Data System (ADS)

    Kubota, Koji; Kotani, Takashi; Nakamoto, Keiichi; Ishida, Tohru; Takeuchi, Yoshimi

    Multi-tasking machine tools are very useful to manufacture complicated workpiece efficiently. However, it is very difficult to generate NC data when an operator uses multi-tasking machine tools. Currently, there are some CAM systems for multi-tasking machine tools. However, there are such problems as the manual allocation of parts to generate tool paths, the difficulty of self-interference recognition without a simulator, and so on. Therefore, this study deals with the development of CAM system which can recognize part configurations, calculate tool paths automatically for turning and milling operations, and sort them in machining order, based on 3-D CAD data of workpieces.

  11. Development of Asset Management Decision Support Tools for Power Equipment

    NASA Astrophysics Data System (ADS)

    Okamoto, Tatsuki; Takahashi, Tsuguhiro

    Development of asset management decision support tools become very intensive in order to reduce maintenance cost of power equipment due to the liberalization of power business. This article reviews some aspects of present status of asset management decision support tools development for power equipment based on the papers published in international conferences, domestic conventions, and several journals.

  12. Design and development of progressive tool for manufacturing washer

    NASA Astrophysics Data System (ADS)

    Annigeri, Ulhas K.; Raghavendra Ravi Kiran, K.; Deepthi, Y. P.

    2017-07-01

    In a progressive tool the raw material is worked at different station to finally fabricate the component. A progressive tool is a lucrative tool for mass production of components. A lot of automobile and other transport industries develop progressive tool for the production of components. The design of tool involves lot of planning and the same amount of skill of process planning is required in the fabrication of the tool. The design also involves use of thumb rules and standard elements as per experience gained in practice. Manufacturing the press tool is a laborious task as special jigs and fixtures have to be designed for the purpose. Assembly of all the press tool elements is another task where use of accurate measuring instruments for alignment of various tool elements is important. In the present study, design and fabrication of progressive press tool for production of washer has been developed and the press tool has been tried out on a mechanical type of press. The components produced are to dimensions.

  13. Development of a Geometric Spatial Visualization Tool

    ERIC Educational Resources Information Center

    Ganesh, Bibi; Wilhelm, Jennifer; Sherrod, Sonya

    2009-01-01

    This paper documents the development of the Geometric Spatial Assessment. We detail the development of this instrument which was designed to identify middle school students' strategies and advancement in understanding of four geometric concept domains (geometric spatial visualization, spatial projection, cardinal directions, and periodic patterns)…

  14. Developing a Statewide Student Tracking Tool

    ERIC Educational Resources Information Center

    Pai, Wendell; Moschos, Marina; Detlev, Angela; Robinson, Ophelia; Lanneau, Sumi

    2008-01-01

    This chapter describes the development of a state-level student tracking system that enables the state and institutions to follow various cohort types of students across institutional boundaries. The system was developed by the Policy Research and Data Warehousing section of the State Council of Higher Education for Virginia (SCHEV). SCHEV is the…

  15. Development of a Geometric Spatial Visualization Tool

    ERIC Educational Resources Information Center

    Ganesh, Bibi; Wilhelm, Jennifer; Sherrod, Sonya

    2009-01-01

    This paper documents the development of the Geometric Spatial Assessment. We detail the development of this instrument which was designed to identify middle school students' strategies and advancement in understanding of four geometric concept domains (geometric spatial visualization, spatial projection, cardinal directions, and periodic patterns)…

  16. Development of a tool for evaluating multimedia for surgical education.

    PubMed

    Coughlan, Jane; Morar, Sonali S

    2008-09-01

    Educational multimedia has been designed to provide surgical trainees with expert operative information outside of the operating theater. The effectiveness of multimedia (e.g., CD-ROMs) for learning has been a common research topic since the 1990s. To date, however, little discussion has taken place on the mechanisms to evaluate the quality of multimedia-driven teaching. This may be because of a lack of research into the development of appropriate tools for evaluating multimedia, especially for surgical education. This paper reports on a small-scale pilot and exploratory study (n = 12) that developed a tool for surgical multimedia evaluation. The validity of the developed tool was established through adaptation of an existing tool, which was reviewed using experts in surgery, usability, and education. The reliability of the developed tool was tested with surgical trainees who used it to assess a multimedia CD-ROM created for teaching basic surgical skills. The findings contribute to an understanding of surgical trainees' experience of using educational multimedia, in terms of characteristics of the learning material for interface design and content and the process of developing evaluation tools, in terms of inclusion of appropriate assessment criteria. The increasing use of multimedia in medical education necessitates the development of standardized tools for determining the quality of teaching and learning. Little research exists into the development of such tools and so the present work stimulates discussion on how to evaluate surgical training.

  17. Development of genomic SSR markers for fingerprinting lettuce (Lactuca sativa L.) cultivars and mapping genes

    PubMed Central

    2013-01-01

    Background Lettuce (Lactuca sativa L.) is the major crop from the group of leafy vegetables. Several types of molecular markers were developed that are effectively used in lettuce breeding and genetic studies. However only a very limited number of microsattelite-based markers are publicly available. We have employed the method of enriched microsatellite libraries to develop 97 genomic SSR markers. Results Testing of newly developed markers on a set of 36 Lactuca accession (33 L. sativa, and one of each L. serriola L., L. saligna L., and L. virosa L.) revealed that both the genetic heterozygosity (UHe = 0.56) and the number of loci per SSR (Na = 5.50) are significantly higher for genomic SSR markers than for previously developed EST-based SSR markers (UHe = 0.32, Na = 3.56). Fifty-four genomic SSR markers were placed on the molecular linkage map of lettuce. Distribution of markers in the genome appeared to be random, with the exception of possible cluster on linkage group 6. Any combination of 32 genomic SSRs was able to distinguish genotypes of all 36 accessions. Fourteen of newly developed SSR markers originate from fragments with high sequence similarity to resistance gene candidates (RGCs) and RGC pseudogenes. Analysis of molecular variance (AMOVA) of L. sativa accessions showed that approximately 3% of genetic diversity was within accessions, 79% among accessions, and 18% among horticultural types. Conclusions The newly developed genomic SSR markers were added to the pool of previously developed EST-SSRs markers. These two types of SSR-based markers provide useful tools for lettuce cultivar fingerprinting, development of integrated molecular linkage maps, and mapping of genes. PMID:23339733

  18. Development of genomic SSR markers for fingerprinting lettuce (Lactuca sativa L.) cultivars and mapping genes.

    PubMed

    Rauscher, Gilda; Simko, Ivan

    2013-01-22

    Lettuce (Lactuca sativa L.) is the major crop from the group of leafy vegetables. Several types of molecular markers were developed that are effectively used in lettuce breeding and genetic studies. However only a very limited number of microsattelite-based markers are publicly available. We have employed the method of enriched microsatellite libraries to develop 97 genomic SSR markers. Testing of newly developed markers on a set of 36 Lactuca accession (33 L. sativa, and one of each L. serriola L., L. saligna L., and L. virosa L.) revealed that both the genetic heterozygosity (UHe = 0.56) and the number of loci per SSR (Na = 5.50) are significantly higher for genomic SSR markers than for previously developed EST-based SSR markers (UHe = 0.32, Na = 3.56). Fifty-four genomic SSR markers were placed on the molecular linkage map of lettuce. Distribution of markers in the genome appeared to be random, with the exception of possible cluster on linkage group 6. Any combination of 32 genomic SSRs was able to distinguish genotypes of all 36 accessions. Fourteen of newly developed SSR markers originate from fragments with high sequence similarity to resistance gene candidates (RGCs) and RGC pseudogenes. Analysis of molecular variance (AMOVA) of L. sativa accessions showed that approximately 3% of genetic diversity was within accessions, 79% among accessions, and 18% among horticultural types. The newly developed genomic SSR markers were added to the pool of previously developed EST-SSRs markers. These two types of SSR-based markers provide useful tools for lettuce cultivar fingerprinting, development of integrated molecular linkage maps, and mapping of genes.

  19. Recent advances in functional perturbation and genome editing techniques in studying sea urchin development.

    PubMed

    Cui, Miao; Lin, Che-Yi; Su, Yi-Hsien

    2017-06-12

    Studies on the gene regulatory networks (GRNs) of sea urchin embryos have provided a basic understanding of the molecular mechanisms controlling animal development. The causal links in GRNs have been verified experimentally through perturbation of gene functions. Microinjection of antisense morpholino oligonucleotides (MOs) into the egg is the most widely used approach for gene knockdown in sea urchin embryos. The modification of MOs into a membrane-permeable form (vivo-MOs) has allowed gene knockdown at later developmental stages. Recent advances in genome editing tools, such as zinc-finger nucleases, transcription activator-like effector-based nucleases and the clustered regularly interspaced short palindromic repeat/clustered regularly interspaced short palindromic repeat-associated protein 9 (CRISPR/Cas9) system, have provided methods for gene knockout in sea urchins. Here, we review the use of vivo-MOs and genome editing tools in sea urchin studies since the publication of its genome in 2006. Various applications of the CRISPR/Cas9 system and their potential in studying sea urchin development are also discussed. These new tools will provide more sophisticated experimental methods for studying sea urchin development. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  20. CLIPS interface development tools and their application

    NASA Technical Reports Server (NTRS)

    Engel, Bernard A.; Rewerts, Chris C.; Srinivasan, Raghavan; Rogers, Joseph B.; Jones, Don D.

    1990-01-01

    A package of C-based PC user interface development functions has been developed and integrated into CLIPS. The primary function is ASK which provides a means to ask the user questions via multiple choice menus or the keyboard and then returns the user response to CLIPS. A parameter-like structure supplies information for the interface. Another function, SHOW, provides a means to paginate and display text. A third function, TITLE, formats and displays title screens. A similar set of C-based functions that are more general and thus will run on UNIX and machines have also been developed. Seven expert system applications were transformed from commercial development environments into CLIPS and utilize ASK, SHOW, and TITLE. Development of numerous new expert system applications using CLIPS and these interface functions has started. These functions greatly reduce the time required to build interfaces for CLIPS applications.

  1. Genome-Wide SNP Detection, Validation, and Development of an 8K SNP Array for Apple

    PubMed Central

    Chagné, David; Crowhurst, Ross N.; Troggio, Michela; Davey, Mark W.; Gilmore, Barbara; Lawley, Cindy; Vanderzande, Stijn; Hellens, Roger P.; Kumar, Satish; Cestaro, Alessandro; Velasco, Riccardo; Main, Dorrie; Rees, Jasper D.; Iezzoni, Amy; Mockler, Todd; Wilhelm, Larry; Van de Weg, Eric; Gardiner, Susan E.; Bassil, Nahla; Peace, Cameron

    2012-01-01

    As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide evaluation of allelic variation in apple (Malus×domestica) breeding germplasm. For genome-wide SNP discovery, 27 apple cultivars were chosen to represent worldwide breeding germplasm and re-sequenced at low coverage with the Illumina Genome Analyzer II. Following alignment of these sequences to the whole genome sequence of ‘Golden Delicious’, SNPs were identified using SoapSNP. A total of 2,113,120 SNPs were detected, corresponding to one SNP to every 288 bp of the genome. The Illumina GoldenGate® assay was then used to validate a subset of 144 SNPs with a range of characteristics, using a set of 160 apple accessions. This validation assay enabled fine-tuning of the final subset of SNPs for the Illumina Infinium® II system. The set of stringent filtering criteria developed allowed choice of a set of SNPs that not only exhibited an even distribution across the apple genome and a range of minor allele frequencies to ensure utility across germplasm, but also were located in putative exonic regions to maximize genotyping success rate. A total of 7867 apple SNPs was established for the IRSC apple 8K SNP array v1, of which 5554 were polymorphic after evaluation in segregating families and a germplasm collection. This publicly available genomics resource will provide an unprecedented resolution of SNP haplotypes, which will enable marker-locus-trait association discovery, description of the genetic architecture of quantitative traits, investigation of genetic variation (neutral and functional), and genomic selection in apple. PMID:22363718

  2. Developing a 300C Analog Tool for EGS

    SciTech Connect

    Normann, Randy

    2015-03-23

    This paper covers the development of a 300°C geothermal well monitoring tool for supporting future EGS (enhanced geothermal systems) power production. This is the first of 3 tools planed. This is an analog tool designed for monitoring well pressure and temperature. There is discussion on 3 different circuit topologies and the development of the supporting surface electronics and software. There is information on testing electronic circuits and component. One of the major components is the cable used to connect the analog tool to the surface.

  3. DEVELOPMENT OF SOLUBILITY PRODUCT VISUALIZATION TOOLS

    SciTech Connect

    T.F. Turner; A.T. Pauli; J.F. Schabron

    2004-05-01

    Western Research Institute (WRI) has developed software for the visualization of data acquired from solubility tests. The work was performed in conjunction with AB Nynas Petroleum, Nynashamn, Sweden who participated as the corporate cosponsor for this Jointly Sponsored Research (JSR) task. Efforts in this project were split between software development and solubility test development. The Microsoft Windows-compatible software developed inputs up to three solubility data sets, calculates the parameters for six solid body types to fit the data, and interactively displays the results in three dimensions. Several infrared spectroscopy techniques have been examined for potential use in determining bitumen solubility in various solvents. Reflectance, time-averaged absorbance, and transmittance techniques were applied to bitumen samples in single and binary solvent systems. None of the techniques were found to have wide applicability.

  4. DEVELOPING A TOOL FOR ENVIRONMENTALLY PREFERABLE PURCHASING

    EPA Science Inventory

    LCA-based guidance was developed by EPA under the Framework for Responsible Environmental Decision Making (FRED) effort to demonstrate how to conduct a relative comparison between product types to determine environmental preferability. It identifies data collection needs and iss...

  5. DEVELOPING A TOOL FOR ENVIRONMENTALLY PREFERABLE PURCHASING

    EPA Science Inventory

    LCA-based guidance was developed by EPA under the Framework for Responsible Environmental Decision Making (FRED) effort to demonstrate how to conduct a relative comparison between product types to determine environmental preferability. It identifies data collection needs and iss...

  6. Testing Automation Tools for Secure Software Development

    DTIC Science & Technology

    2007-06-01

    Paperwork Reduction Project (0704-0188) Washington DC 20503. 1. AGENCY USE ONLY (Leave blank) 2. REPORT DATE June 2007 3. REPORT TYPE AND DATES...operation of the system. This technique is not particularly effective. Thus, recent research has focused on developing new , more effective software...operation of the system. This technique is not particularly effective. Thus, recent research has focused on developing new , more effective software

  7. Computational tools and resources for prediction and analysis of gene regulatory regions in the chick genome

    PubMed Central

    Khan, Mohsin A. F.; Soto-Jimenez, Luz Mayela; Howe, Timothy; Streit, Andrea; Sosinsky, Alona; Stern, Claudio D.

    2013-01-01

    The discovery of cis-regulatory elements is a challenging problem in bioinformatics, owing to distal locations and context-specific roles of these elements in controlling gene regulation. Here we review the current bioinformatics methodologies and resources available for systematic discovery of cis-acting regulatory elements and conserved transcription factor binding sites in the chick genome. In addition, we propose and make available, a novel workflow using computational tools that integrate CTCF analysis to predict putative insulator elements, enhancer prediction and TFBS analysis. To demonstrate the usefulness of this computational workflow, we then use it to analyze the locus of the gene Sox2 whose developmental expression is known to be controlled by a complex array of cis-acting regulatory elements. The workflow accurately predicts most of the experimentally verified elements along with some that have not yet been discovered. A web version of the CTCF tool, together with instructions for using the workflow can be accessed from http://toolshed.g2.bx.psu.edu/view/mkhan1980/ctcf_analysis. For local installation of the tool, relevant Perl scripts and instructions are provided in the directory named “code” in the supplementary materials. PMID:23355428

  8. Computational tools and resources for prediction and analysis of gene regulatory regions in the chick genome.

    PubMed

    Khan, Mohsin A F; Soto-Jimenez, Luz Mayela; Howe, Timothy; Streit, Andrea; Sosinsky, Alona; Stern, Claudio D

    2013-05-01

    The discovery of cis-regulatory elements is a challenging problem in bioinformatics, owing to distal locations and context-specific roles of these elements in controlling gene regulation. Here we review the current bioinformatics methodologies and resources available for systematic discovery of cis-acting regulatory elements and conserved transcription factor binding sites in the chick genome. In addition, we propose and make available, a novel workflow using computational tools that integrate CTCF analysis to predict putative insulator elements, enhancer prediction, and TFBS analysis. To demonstrate the usefulness of this computational workflow, we then use it to analyze the locus of the gene Sox2 whose developmental expression is known to be controlled by a complex array of cis-acting regulatory elements. The workflow accurately predicts most of the experimentally verified elements along with some that have not yet been discovered. A web version of the CTCF tool, together with instructions for using the workflow can be accessed from http://toolshed.g2.bx.psu.edu/view/mkhan1980/ctcf_analysis. For local installation of the tool, relevant Perl scripts and instructions are provided in the directory named "code" in the supplementary materials.

  9. 78 FR 68459 - Medical Device Development Tools; Draft Guidance for Industry, Tool Developers, and Food and Drug...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-11-14

    ... HUMAN SERVICES Food and Drug Administration Medical Device Development Tools; Draft Guidance for Industry, Tool Developers, and Food and Drug Administration Staff; Availability AGENCY: Food and Drug Administration, HHS. ACTION: Notice. SUMMARY: The Food and Drug Administration (FDA) is announcing the...

  10. Entering the Public Health Genomics Era: Why Must Health Educators Develop Genomic Competencies?

    ERIC Educational Resources Information Center

    Chen, Lei-Shih; Goodson, Patricia

    2007-01-01

    Although the completion of the Human Genome Project will offer new insight into diseases and help develop efficient, personalized treatment or prevention programs, it will also raise new and non-trivial public health issues. Many of these issues fall under the professional purview of public health workers. As members of the public health…

  11. Microcomputers: Tools for Developing Technological Literacy.

    ERIC Educational Resources Information Center

    Liao, Thomas T.

    1983-01-01

    Describes a course in which undergraduate students learn to program microcomputers while learning about its applications and ramifications. Descriptions of software developed for the course are also provided. These include yellow light (traffic flow), domestic electrical energy use/cost, water pollution, and supermarket automation. (CN)

  12. Appreciative Inquiry as an Organizational Development Tool.

    ERIC Educational Resources Information Center

    Martinetz, Charles F.

    2002-01-01

    Defines appreciative inquiry as a change model that uses traditional organizational development processes (team building, strategic planning, business process redesign, management audits) in a new way, both as a philosophy and as a process. Emphasizes collaboration, participation of all voices, and changing the organization rather than the people.…

  13. Tools for Nanotechnology Education Development Program

    SciTech Connect

    Dorothy Moore

    2010-09-27

    The overall focus of this project was the development of reusable, cost-effective educational modules for use with the table top scanning electron microscope (TTSEM). The goal of this project's outreach component was to increase students' exposure to the science and technology of nanoscience.

  14. Microcomputers: Tools for Developing Technological Literacy.

    ERIC Educational Resources Information Center

    Liao, Thomas T.

    1983-01-01

    Describes a course in which undergraduate students learn to program microcomputers while learning about its applications and ramifications. Descriptions of software developed for the course are also provided. These include yellow light (traffic flow), domestic electrical energy use/cost, water pollution, and supermarket automation. (CN)

  15. Developing tools to sustain biological diversity.

    Treesearch

    Randy. Molina

    2004-01-01

    The Biodiversity Initiative strives to provide innovative solutions to the complex problem of managing forests for biodiversity. Although this initiative is in its beginning stages, an initial scoping meeting has already taken place and planning for the next steps is underway. The initiative is developing plans to conduct extensive scoping efforts in the management,...

  16. Appreciative Inquiry as an Organizational Development Tool.

    ERIC Educational Resources Information Center

    Martinetz, Charles F.

    2002-01-01

    Defines appreciative inquiry as a change model that uses traditional organizational development processes (team building, strategic planning, business process redesign, management audits) in a new way, both as a philosophy and as a process. Emphasizes collaboration, participation of all voices, and changing the organization rather than the people.…

  17. The development of tool manufacture in humans: what helps young children make innovative tools?

    PubMed

    Chappell, Jackie; Cutting, Nicola; Apperly, Ian A; Beck, Sarah R

    2013-11-19

    We know that even young children are proficient tool users, but until recently, little was known about how they make tools. Here, we will explore the concepts underlying tool making, and the kinds of information and putative cognitive abilities required for children to manufacture novel tools. We will review the evidence for novel tool manufacture from the comparative literature and present a growing body of data from children suggesting that innovation of the solution to a problem by making a tool is a much more challenging task than previously thought. Children's difficulty with these kinds of tasks does not seem to be explained by perseveration with unmodified tools, difficulty with switching to alternative strategies, task pragmatics or issues with permission. Rather, making novel tools (without having seen an example of the required tool within the context of the task) appears to be hard, because it is an example of an 'ill-structured problem'. In this type of ill-structured problem, the starting conditions and end goal are known, but the transformations and/or actions required to get from one to the other are not specified. We will discuss the implications of these findings for understanding the development of problem-solving in humans and other animals.

  18. The development of tool manufacture in humans: what helps young children make innovative tools?

    PubMed Central

    Chappell, Jackie; Cutting, Nicola; Apperly, Ian A.; Beck, Sarah R.

    2013-01-01

    We know that even young children are proficient tool users, but until recently, little was known about how they make tools. Here, we will explore the concepts underlying tool making, and the kinds of information and putative cognitive abilities required for children to manufacture novel tools. We will review the evidence for novel tool manufacture from the comparative literature and present a growing body of data from children suggesting that innovation of the solution to a problem by making a tool is a much more challenging task than previously thought. Children's difficulty with these kinds of tasks does not seem to be explained by perseveration with unmodified tools, difficulty with switching to alternative strategies, task pragmatics or issues with permission. Rather, making novel tools (without having seen an example of the required tool within the context of the task) appears to be hard, because it is an example of an ‘ill-structured problem’. In this type of ill-structured problem, the starting conditions and end goal are known, but the transformations and/or actions required to get from one to the other are not specified. We will discuss the implications of these findings for understanding the development of problem-solving in humans and other animals. PMID:24101620

  19. Development of Machine Learning Tools in ROOT

    NASA Astrophysics Data System (ADS)

    Gleyzer, S. V.; Moneta, L.; Zapata, Omar A.

    2016-10-01

    ROOT is a framework for large-scale data analysis that provides basic and advanced statistical methods used by the LHC experiments. These include machine learning algorithms from the ROOT-integrated Toolkit for Multivariate Analysis (TMVA). We present several recent developments in TMVA, including a new modular design, new algorithms for variable importance and cross-validation, interfaces to other machine-learning software packages and integration of TMVA with Jupyter, making it accessible with a browser.

  20. Surrogate Analysis and Index Developer (SAID) tool

    USGS Publications Warehouse

    Domanski, Marian M.; Straub, Timothy D.; Landers, Mark N.

    2015-10-01

    The regression models created in SAID can be used in utilities that have been developed to work with the USGS National Water Information System (NWIS) and for the USGS National Real-Time Water Quality (NRTWQ) Web site. The real-time dissemination of predicted SSC and prediction intervals for each time step has substantial potential to improve understanding of sediment-related water quality and associated engineering and ecological management decisions.

  1. Organoid development in cancer genome discovery.

    PubMed

    Gao, Dong; Chen, Yu

    2015-02-01

    The tumor response to most therapeutic agents in cancer is highly unpredictable. Cancer models which can adequately represent tumor heterogeneity and predict in vivo drug sensitivity are intense areas of investigation. Cancer cell lines and patient-derived xenograft models are the most frequently used models in cancer research and anticancer drug screening. Recently, cancer 'organoid' culture conditions have been developed to establish in vitro growth of patient-derived samples at higher efficiency and they are very promising for large scale drug screening and fundamental cancer biology research. Here, we leverage our experience in prostate cancer to discuss the advantages and limitations of these cancer models and summarize the development of cancer organoid culture--a development which may provide a new path towards personalized medicine in the future.

  2. Genome elimination: translating basic research into a future tool for plant breeding.

    PubMed

    Comai, Luca

    2014-06-01

    During the course of our history, humankind has been through different periods of agricultural improvement aimed at enhancing our food supply and the performance of food crops. In recent years, it has become apparent that future crop improvement efforts will require new approaches to address the local challenges of farmers while empowering discovery across industry and academia. New plant breeding approaches are needed to meet this challenge to help feed a growing world population. Here I discuss how a basic research discovery is being translated into a potential future tool for plant breeding, and share the story of researcher Simon Chan, who recognized the potential application of this new approach--genome elimination--for the breeding of staple food crops in Africa and South America.

  3. Tools to kill: Genome of one of the most destructive plant pathogenic fungi Macrophomina phaseolina

    PubMed Central

    2012-01-01

    Background Macrophomina phaseolina is one of the most destructive necrotrophic fungal pathogens that infect more than 500 plant species throughout the world. It can grow rapidly in infected plants and subsequently produces a large amount of sclerotia that plugs the vessels, resulting in wilting of the plant. Results We sequenced and assembled ~49 Mb into 15 super-scaffolds covering 92.83% of the M. phaseolina genome. We predict 14,249 open reading frames (ORFs) of which 9,934 are validated by the transcriptome. This phytopathogen has an abundance of secreted oxidases, peroxidases, and hydrolytic enzymes for degrading cell wall polysaccharides and lignocelluloses to penetrate into the host tissue. To overcome the host plant defense response, M. phaseolina encodes a significant number of P450s, MFS type membrane transporters, glycosidases, transposases, and secondary metabolites in comparison to all sequenced ascomycete species. A strikingly distinct set of carbohydrate esterases (CE) are present in M. phaseolina, with the CE9 and CE10 families remarkably higher than any other fungi. The phenotypic microarray data indicates that M. phaseolina can adapt to a wide range of osmotic and pH environments. As a broad host range pathogen, M. phaseolina possesses a large number of pathogen-host interaction genes including those for adhesion, signal transduction, cell wall breakdown, purine biosynthesis, and potent mycotoxin patulin. Conclusions The M. phaseolina genome provides a framework of the infection process at the cytological and molecular level which uses a diverse arsenal of enzymatic and toxin tools to destroy the host plants. Further understanding of the M. phaseolina genome-based plant-pathogen interactions will be instrumental in designing rational strategies for disease control, essential to ensuring global agricultural crop production and security. PMID:22992219

  4. Tools to kill: genome of one of the most destructive plant pathogenic fungi Macrophomina phaseolina.

    PubMed

    Islam, Md Shahidul; Haque, Md Samiul; Islam, Mohammad Moinul; Emdad, Emdadul Mannan; Halim, Abdul; Hossen, Quazi Md Mosaddeque; Hossain, Md Zakir; Ahmed, Borhan; Rahim, Sifatur; Rahman, Md Sharifur; Alam, Md Monjurul; Hou, Shaobin; Wan, Xuehua; Saito, Jennifer A; Alam, Maqsudul

    2012-09-19

    Macrophomina phaseolina is one of the most destructive necrotrophic fungal pathogens that infect more than 500 plant species throughout the world. It can grow rapidly in infected plants and subsequently produces a large amount of sclerotia that plugs the vessels, resulting in wilting of the plant. We sequenced and assembled ~49 Mb into 15 super-scaffolds covering 92.83% of the M. phaseolina genome. We predict 14,249 open reading frames (ORFs) of which 9,934 are validated by the transcriptome. This phytopathogen has an abundance of secreted oxidases, peroxidases, and hydrolytic enzymes for degrading cell wall polysaccharides and lignocelluloses to penetrate into the host tissue. To overcome the host plant defense response, M. phaseolina encodes a significant number of P450s, MFS type membrane transporters, glycosidases, transposases, and secondary metabolites in comparison to all sequenced ascomycete species. A strikingly distinct set of carbohydrate esterases (CE) are present in M. phaseolina, with the CE9 and CE10 families remarkably higher than any other fungi. The phenotypic microarray data indicates that M. phaseolina can adapt to a wide range of osmotic and pH environments. As a broad host range pathogen, M. phaseolina possesses a large number of pathogen-host interaction genes including those for adhesion, signal transduction, cell wall breakdown, purine biosynthesis, and potent mycotoxin patulin. The M. phaseolina genome provides a framework of the infection process at the cytological and molecular level which uses a diverse arsenal of enzymatic and toxin tools to destroy the host plants. Further understanding of the M. phaseolina genome-based plant-pathogen interactions will be instrumental in designing rational strategies for disease control, essential to ensuring global agricultural crop production and security.

  5. Enhancers as information integration hubs in development: lessons from genomics

    PubMed Central

    Buecker, Christa; Wysocka, Joanna

    2016-01-01

    Transcriptional enhancers are the primary determinants of tissue-specific gene expression. Although the majority of our current knowledge of enhancer elements comes from detailed analyses of individual loci, recent progress in epigenomics has led to the development of methods for comprehensive and conservation-independent annotation of cell type-specific enhancers. Here, we discuss the advantages and limitations of different genomic approaches to enhancer mapping and summarize observations that have been afforded by the genome-wide views of enhancer landscapes, with a focus on development. We propose that enhancers serve as information integration hubs, at which instructions encoded by the genome are read in the context of a specific cellular state, signaling milieu and chromatin environment, allowing for exquisitely precise spatiotemporal control of gene expression during embryogenesis. PMID:22487374

  6. The capsicum transcriptome DB: a “hot” tool for genomic research

    PubMed Central

    Góngora-Castillo, Elsa; Fajardo-Jaime, Rubén; Fernández-Cortes, Araceli; Jofre-Garfias, Alba E; Lozoya-Gloria, Edmundo; Martínez, Octavio; Ochoa-Alejo, Neftalí; Rivera-Bustamante, Rafael

    2012-01-01

    Chili pepper (Capsicum annuum) is an economically important crop with no available public genome sequence. We describe a genomic resource to facilitate Capsicum annuum research. A collection of Expressed Sequence Tags (ESTs) derived from five C. annuum organs (root, stem, leaf, flower and fruit) were sequenced using the Sanger method and multiple leaf transcriptomes were deeply sampled using with GS-pyrosequencing. A hybrid assembly of 1,324,516 raw reads yielded 32,314 high quality contigs as validated by coverage and identity analysis with existing pepper sequences. Overall, 75.5% of the contigs had significant sequence similarity to entries in nucleic acid and protein databases; 23% of the sequences have not been previously reported for C. annuum and expand sequence resources for this species. A MySQL database and a user-friendly Web interface were constructed with search-tools that permit queries of the ESTs including sequence, functional annotation, Gene Ontology classification, metabolic pathways, and assembly information. The Capsicum Transcriptome DB is free available from http://www.bioingenios.ira.cinvestav.mx:81/Joomla/ PMID:22359434

  7. Evaluating IMU communication skills training programme: assessment tool development.

    PubMed

    Yeap, R; Beevi, Z; Lukman, H

    2008-08-01

    This article describes the development of four assessment tools designed to evaluate the communication skills training (CST) programme at the International Medical University (IMU). The tools measure pre-clinical students' 1) perceived competency in basic interpersonal skills, 2) attitude towards patient-centred communication, 3) conceptual knowledge on doctor-patient communication, and 4) acceptance of the CST programme.

  8. The Development and Demonstration of The Metric Assessment Tool

    DTIC Science & Technology

    1993-09-01

    APIT/GCM/LAS/93S-3 A04 Ai 7.5 4 . " . THE DEVELOPMENT AND DEMONSTRATION OF THE METRIC ASSESSMENT TOOL THESIS Cynthia A. Campbell, GS-13, USAF and...Av•JI:a)iity Ccdes Avaii "•tc, or Dist 6pi-claI * iiV AFIT/GCM/LAS/93S-3 THE DEVELOPMENT AND DEMONSTRATION OF THE METRIC ASSESSMENT TOOL THESIS...thesis was to develop and demonstrate a Metric Assessment Tool for the purpose of evaluating and improving process measurements. We narrowed our study

  9. Black raspberry genetic and genomic resources development

    USDA-ARS?s Scientific Manuscript database

    This study incorporates field and laboratory components to advance and streamline identification of a variety of traits of economic interest and to develop molecular markers for marker assisted breeding of black raspberry (Rubus occidentalis). A lack of adapted, disease resistant cultivars has led t...

  10. Developing black raspberry genetic and genomic resources

    USDA-ARS?s Scientific Manuscript database

    This study incorporates field and laboratory components to advance and streamline identification of a variety of traits of economic interest and to develop molecular markers for marker assisted breeding of black raspberry (Rubus occidentalis). A lack of adapted, disease resistant cultivars has led t...

  11. Plasmid vectors and molecular building blocks for the development of genetic manipulation tools for Trypanosoma cruzi.

    PubMed

    Bouvier, León A; Cámara, María de los Milagros; Canepa, Gaspar E; Miranda, Mariana R; Pereira, Claudio A

    2013-01-01

    The post genomic era revealed the need for developing better performing, easier to use and more sophisticated genetic manipulation tools for the study of Trypanosoma cruzi, the etiological agent of Chagas disease. In this work a series of plasmids that allow genetic manipulation of this protozoan parasite were developed. First of all we focused on useful tools to establish selection strategies for different strains and which can be employed as expression vectors. On the other hand molecular building blocks in the form of diverse selectable markers, modifiable fluorescent protein and epitope-tag coding sequences were produced. Both types of modules were harboured in backbone molecules conceived to offer multiple construction and sub-cloning strategies. These can be used to confer new properties to already available genetic manipulation tools or as starting points for whole novel designs. The performance of each plasmid and building block was determined independently. For illustration purposes, some simple direct practical applications were conducted.

  12. Genomics, public health and developing countries: the case of the Mexican National Institute of Genomic Medicine (INMEGEN).

    PubMed

    Séguin, Béatrice; Hardy, Billie-Jo; Singer, Peter A; Daar, Abdallah S

    2008-10-01

    In 2004, the government of Mexico established the National Institute of Genomic Medicine (INMEGEN), to carry out disease-related genomic studies that will address national health problems and stimulate scientific and technological development by generating new commercial products and services in genomic medicine. Towards this end, INMEGEN is carrying out a large-scale genotyping project to map genomic variation within its own population. The initiative is expected to generate a key resource for local researchers to understand disease susceptibility and variation in drug responses, which will contribute to Mexico's goal of developing public health genomics - a field in which Mexico is proving to be a leader amongst emerging economies.

  13. Computational development of the nanoporous materials genome

    NASA Astrophysics Data System (ADS)

    Boyd, Peter G.; Lee, Yongjin; Smit, Berend

    2017-08-01

    There is currently a push towards big data and data mining in materials research to accelerate discovery. Zeolites, metal-organic frameworks and other related crystalline porous materials are not immune to this phenomenon, as evidenced by the proliferation of porous structure databases and computational gas-adsorption screening studies over the past decade. The endeavour to identify the best materials for various gas separation and storage applications has led not only to thousands of synthesized structures, but also to the development of algorithms for building hypothetical materials. The materials databases assembled with these algorithms contain a much wider range of complex pore structures than have been synthesized, with the reasoning being that we have discovered only a small fraction of realizable structures and expanding upon these will accelerate rational design. In this Review, we highlight the methods developed to build these databases, and some of the important outcomes from large-scale computational screening studies.

  14. PrimerSNP: a web tool for whole-genome selection of allele-specific and common primers of phylogenetically-related bacterial genomic sequences

    PubMed Central

    Yao, Jiqiang; Lin, Hong; Van Deynze, Allen; Doddapaneni, Harshavardhan; Francis, Martha; Lemos, Eliana Gertrudes Macedo; Civerolo, Edwin L

    2008-01-01

    Background The increasing number of genomic sequences of bacteria makes it possible to select unique SNPs of a particular strain/species at the whole genome level and thus design specific primers based on the SNPs. The high similarity of genomic sequences among phylogenetically-related bacteria requires the identification of the few loci in the genome that can serve as unique markers for strain differentiation. PrimerSNP attempts to identify reliable strain-specific markers, on which specific primers are designed for pathogen detection purpose. Results PrimerSNP is an online tool to design primers based on strain specific SNPs for multiple strains/species of microorganisms at the whole genome level. The allele-specific primers could distinguish query sequences of one strain from other homologous sequences by standard PCR reaction. Additionally, PrimerSNP provides a feature for designing common primers that can amplify all the homologous sequences of multiple strains/species of microorganisms. PrimerSNP is freely available at . Conclusion PrimerSNP is a high-throughput specific primer generation tool for the differentiation of phylogenetically-related strains/species. Experimental validation showed that this software had a successful prediction rate of 80.4 – 100% for strain specific primer design. PMID:18937861

  15. Drosophila genomes and the development of affordable molecular markers for species genotyping.

    PubMed

    Minuk, Leigh; Civetta, Alberto

    2011-04-01

    The recent completion of genome sequencing of 12 species of Drosophila has provided a powerful resource for hypothesis testing, as well as the development of technical tools. Here we take advantage of genome sequence data from two closely related species of Drosophila, Drosophila simulans and Drosophila sechellia, to quickly identify candidate molecular markers for genotyping based on expected insertion or deletion (indel) differences between species. Out of 64 candidate molecular markers selected along the second and third chromosome of Drosophila, 51 molecular markers were validated using PCR and gel electrophoresis. We found that the 20% error rate was due to sequencing errors in the genome data, although we cannot rule out possible indel polymorphisms. The approach has the advantage of being affordable and quick, as it only requires the use of bioinformatics tools for predictions and a PCR and agarose gel based assay for validation. Moreover, the approach could be easily extended to a wide variety of taxa with the only limitation being the availability of complete or partial genome sequence data.

  16. Developing an undue influence screening tool for Adult Protective Services.

    PubMed

    Quinn, Mary Joy; Nerenberg, Lisa; Navarro, Adria E; Wilber, Kathleen H

    2017-03-01

    The study purpose was to develop and pilot an undue influence screening tool for California's Adult Protective Services (APS) personnel based on the definition of undue influence enacted into California law January 1, 2014. Methods included four focus groups with APS providers (n = 33), piloting the preliminary tool by APS personnel (n = 15), and interviews with four elder abuse experts and two APS administrators. Social service literature-including existing undue influence models-was reviewed, as were existing screening and assessment tools. Using the information from these various sources, the California Undue Influence Screening Tool (CUIST) was developed. It can be applied to APS cases and potentially adapted for use by other professionals and for use in other states. Implementation of the tool into APS practice, policy, procedures, and training of personnel will depend on the initiative of APS management. Future work will need to address the reliability and validity of CUIST.

  17. Thyroid insufficiency in developing rat brain: A genomic analysis.

    EPA Science Inventory

    Thyroid Insufficiency in the Developing Rat Brain: A Genomic Analysis. JE Royland and ME Gilbert, Neurotox. Div., U.S. EPA, RTP, NC, USA. Endocrine disruption (ED) is an area of major concern in environmental neurotoxicity. Severe deficits in thyroid hormone (TH) levels have bee...

  18. Thyroid insufficiency in developing rat brain: A genomic analysis.

    EPA Science Inventory

    Thyroid Insufficiency in the Developing Rat Brain: A Genomic Analysis. JE Royland and ME Gilbert, Neurotox. Div., U.S. EPA, RTP, NC, USA. Endocrine disruption (ED) is an area of major concern in environmental neurotoxicity. Severe deficits in thyroid hormone (TH) levels have bee...

  19. Pan-genome sequence analysis using Panseq: an online tool for the rapid analysis of core and accessory genomic regions

    PubMed Central

    2010-01-01

    Background The pan-genome of a bacterial species consists of a core and an accessory gene pool. The accessory genome is thought to be an important source of genetic variability in bacterial populations and is gained through lateral gene transfer, allowing subpopulations of bacteria to better adapt to specific niches. Low-cost and high-throughput sequencing platforms have created an exponential increase in genome sequence data and an opportunity to study the pan-genomes of many bacterial species. In this study, we describe a new online pan-genome sequence analysis program, Panseq. Results Panseq was used to identify Escherichia coli O157:H7 and E. coli K-12 genomic islands. Within a population of 60 E. coli O157:H7 strains, the existence of 65 accessory genomic regions identified by Panseq analysis was confirmed by PCR. The accessory genome and binary presence/absence data, and core genome and single nucleotide polymorphisms (SNPs) of six L. monocytogenes strains were extracted with Panseq and hierarchically clustered and visualized. The nucleotide core and binary accessory data were also used to construct maximum parsimony (MP) trees, which were compared to the MP tree generated by multi-locus sequence typing (MLST). The topology of the accessory and core trees was identical but differed from the tree produced using seven MLST loci. The Loci Selector module found the most variable and discriminatory combinations of four loci within a 100 loci set among 10 strains in 1 s, compared to the 449 s required to exhaustively search for all possible combinations; it also found the most discriminatory 20 loci from a 96 loci E. coli O157:H7 SNP dataset. Conclusion Panseq determines the core and accessory regions among a collection of genomic sequences based on user-defined parameters. It readily extracts regions unique to a genome or group of genomes, identifies SNPs within shared core genomic regions, constructs files for use in phylogeny programs based on both the presence

  20. Pan-genome sequence analysis using Panseq: an online tool for the rapid analysis of core and accessory genomic regions.

    PubMed

    Laing, Chad; Buchanan, Cody; Taboada, Eduardo N; Zhang, Yongxiang; Kropinski, Andrew; Villegas, Andre; Thomas, James E; Gannon, Victor P J

    2010-09-15

    The pan-genome of a bacterial species consists of a core and an accessory gene pool. The accessory genome is thought to be an important source of genetic variability in bacterial populations and is gained through lateral gene transfer, allowing subpopulations of bacteria to better adapt to specific niches. Low-cost and high-throughput sequencing platforms have created an exponential increase in genome sequence data and an opportunity to study the pan-genomes of many bacterial species. In this study, we describe a new online pan-genome sequence analysis program, Panseq. Panseq was used to identify Escherichia coli O157:H7 and E. coli K-12 genomic islands. Within a population of 60 E. coli O157:H7 strains, the existence of 65 accessory genomic regions identified by Panseq analysis was confirmed by PCR. The accessory genome and binary presence/absence data, and core genome and single nucleotide polymorphisms (SNPs) of six L. monocytogenes strains were extracted with Panseq and hierarchically clustered and visualized. The nucleotide core and binary accessory data were also used to construct maximum parsimony (MP) trees, which were compared to the MP tree generated by multi-locus sequence typing (MLST). The topology of the accessory and core trees was identical but differed from the tree produced using seven MLST loci. The Loci Selector module found the most variable and discriminatory combinations of four loci within a 100 loci set among 10 strains in 1 s, compared to the 449 s required to exhaustively search for all possible combinations; it also found the most discriminatory 20 loci from a 96 loci E. coli O157:H7 SNP dataset. Panseq determines the core and accessory regions among a collection of genomic sequences based on user-defined parameters. It readily extracts regions unique to a genome or group of genomes, identifies SNPs within shared core genomic regions, constructs files for use in phylogeny programs based on both the presence/absence of accessory regions

  1. Genomic Perspectives of Transcriptional Regulation in Forebrain Development

    PubMed Central

    Nord, Alex S.; Pattabiraman, Kartik; Visel, Axel; Rubenstein, John L. R.

    2015-01-01

    The forebrain is the seat of higher order brain functions, and many human neuropsychiatric disorders are due to genetic defects affecting forebrain development, making it imperative to understand the underlying genetic circuitry. Recent progress now makes it possible to begin fully elucidating the genomic regulatory mechanisms that control forebrain gene expression. Herein, we discuss the current knowledge of how transcription factors drive gene expression programs through their interactions with cis-acting genomic elements, such as enhancers; how analyses of chromatin and DNA modifications provide insights into gene expression states; and how these approaches yield insights into the evolution of the human brain. PMID:25569346

  2. Genomic perspectives of transcriptional regulation in forebrain development.

    PubMed

    Nord, Alex S; Pattabiraman, Kartik; Visel, Axel; Rubenstein, John L R

    2015-01-07

    The forebrain is the seat of higher-order brain functions, and many human neuropsychiatric disorders are due to genetic defects affecting forebrain development, making it imperative to understand the underlying genetic circuitry. Recent progress now makes it possible to begin fully elucidating the genomic regulatory mechanisms that control forebrain gene expression. Herein, we discuss the current knowledge of how transcription factors drive gene expression programs through their interactions with cis-acting genomic elements, such as enhancers; how analyses of chromatin and DNA modifications provide insights into gene expression states; and how these approaches yield insights into the evolution of the human brain. Copyright © 2015 Elsevier Inc. All rights reserved.

  3. Development of the major trauma case review tool.

    PubMed

    Curtis, Kate; Mitchell, Rebecca; McCarthy, Amy; Wilson, Kellie; Van, Connie; Kennedy, Belinda; Tall, Gary; Holland, Andrew; Foster, Kim; Dickinson, Stuart; Stelfox, Henry T

    2017-02-28

    As many as half of all patients with major traumatic injuries do not receive the recommended care, with variance in preventable mortality reported across the globe. This variance highlights the need for a comprehensive process for monitoring and reviewing patient care, central to which is a consistent peer-review process that includes trauma system safety and human factors. There is no published, evidence-informed standardised tool that considers these factors for use in adult or paediatric trauma case peer-review. The aim of this research was to develop and validate a trauma case review tool to facilitate clinical review of paediatric trauma patient care in extracting information to facilitate monitoring, inform change and enable loop closure. Development of the trauma case review tool was multi-faceted, beginning with a review of the trauma audit tool literature. Data were extracted from the literature to inform iterative tool development using a consensus approach. Inter-rater agreement was assessed for both the pilot and finalised versions of the tool. The final trauma case review tool contained ten sections, including patient factors (such as pre-existing conditions), presenting problem, a timeline of events, factors contributing to the care delivery problem (including equipment, work environment, staff action, organizational factors), positive aspects of care and the outcome of panel discussion. After refinement, the inter-rater reliability of the human factors and outcome components of the tool improved with an average 86% agreement between raters. This research developed an evidence-informed tool for use in paediatric trauma case review that considers both system safety and human factors to facilitate clinical review of trauma patient care. This tool can be used to identify opportunities for improvement in trauma care and guide quality assurance activities. Validation is required in the adult population.

  4. netview p: a network visualization tool to unravel complex population structure using genome-wide SNPs.

    PubMed

    Steinig, Eike J; Neuditschko, Markus; Khatkar, Mehar S; Raadsma, Herman W; Zenger, Kyall R

    2016-01-01

    Network-based approaches are emerging as valuable tools for the analysis of complex genetic structure in wild and captive populations. netview p combines data quality control with the construction of population networks through mutual k-nearest neighbours thresholds applied to genome-wide SNPs. The program is cross-platform compatible, open-source and efficiently operates on data ranging from hundreds to hundreds of thousands of SNPs. The pipeline was used for the analysis of pedigree data from simulated (n = 750, SNPs = 1279) and captive silver-lipped pearl oysters (n = 415, SNPs = 1107), wild populations of the European hake from the Atlantic and Mediterranean (n = 834, SNPs = 380) and grey wolves from North America (n = 239, SNPs = 78 255). The population networks effectively visualize large- and fine-scale genetic structure within and between populations, including family-level structure and relationships. netview p comprises a network-based addition to other population analysis tools and provides user-friendly access to a complex network analysis pipeline through implementation in python.

  5. Artificial zinc finger DNA binding domains: versatile tools for genome engineering and modulation of gene expression.

    PubMed

    Hossain, Mir A; Barrow, Joeva J; Shen, Yong; Haq, Md Imdadul; Bungert, Jörg

    2015-11-01

    Genome editing and alteration of gene expression by synthetic DNA binding activities gained a lot of momentum over the last decade. This is due to the development of new DNA binding molecules with enhanced binding specificity. The most commonly used DNA binding modules are zinc fingers (ZFs), TALE-domains, and the RNA component of the CRISPR/Cas9 system. These binding modules are fused or linked to either nucleases that cut the DNA and induce DNA repair processes, or to protein domains that activate or repress transcription of genes close to the targeted site in the genome. This review focuses on the structure, design, and applications of ZF DNA binding domains (ZFDBDs). ZFDBDs are relatively small and have been shown to penetrate the cell membrane without additional tags suggesting that they could be delivered to cells without a DNA or RNA intermediate. Advanced algorithms that are based on extensive knowledge of the mode of ZF/DNA interactions are used to design the amino acid composition of ZFDBDs so that they bind to unique sites in the genome. Off-target binding has been a concern for all synthetic DNA binding molecules. Thus, increasing the specificity and affinity of ZFDBDs will have a significant impact on their use in analytical or therapeutic settings.

  6. An integrated environment for developing object-oriented CAE tools

    SciTech Connect

    Hofmann, P.; Ryba, M.; Baitinger, U.G.

    1996-12-31

    This paper presents how object oriented techniques can applied to improve the development of CAE tools. For the design of modular and reusable software systems we use predefined and well tested building blocks. These building blocks are reusable software components based on object-oriented technology which allows the assembling of software systems. Today CAE tools are typically very complex and computation extensive. Therefore we need a concept, that join the advantages of the object-oriented paradigm with the advantages of parallel and distributed programming. So we present a design environment for the development of concurrent-object oriented CAE tools called CoDO.

  7. Limited resources of genome sequencing in developing countries: Challenges and solutions.

    PubMed

    Helmy, Mohamed; Awad, Mohamed; Mosa, Kareem A

    2016-06-01

    The differences between countries in national income, growth, human development and many other factors are used to classify countries into developed and developing countries. There are several classification systems that use different sets of measures and criteria. The most common classifications are the United Nations (UN) and the World Bank (WB) systems. The UN classification system uses the UN Human Development Index (HDI), an indicator that uses statistic of life expectancy, education, and income per capita for countries' classification. While the WB system uses gross national income (GNI) per capita that is calculated using the World Bank Atlas method. According to the UN and WB classification systems, there are 151 and 134 developing countries, respectively, with 89% overlap between the two systems. Developing countries have limited human development, and limited expenditure in education and research, among several other limitations. The biggest challenge facing genomic researchers and clinicians is limited resources. As a result, genomic tools, specifically genome sequencing technologies, which are rapidly becoming indispensable, are not widely available. In this report, we explore the current status of sequencing technologies in developing countries, describe the associated challenges and emphasize potential solutions.

  8. Surveying the maize community for their diversity and pedigree visualization needs to prioritize tool development and curation

    USDA-ARS?s Scientific Manuscript database

    The Maize Genetics and Genomics Database (MaizeGDB) team prepared a survey to identify breeders’ needs for visualizing pedigrees, diversity data, and haplotypes in order to prioritize tool development and curation efforts at MaizeGDB. The survey was distributed to the maize research community on beh...

  9. Consumer acceptance of food crops developed by genome editing.

    PubMed

    Ishii, Tetsuya; Araki, Motoko

    2016-07-01

    One of the major problems regarding consumer acceptance of genetically modified organisms (GMOs) is the possibility that their transgenes could have adverse effects on the environment and/or human health. Genome editing, represented by the CRISPR/Cas9 system, can efficiently achieve transgene-free gene modifications and is anticipated to generate a wide spectrum of plants. However, the public attitude against GMOs suggests that people will initially be unlikely to accept these plants. We herein explored the bottlenecks of consumer acceptance of transgene-free food crops developed by genome editing and made some recommendations. People should not pursue a zero-risk bias regarding such crops. Developers are encouraged to produce cultivars with a trait that would satisfy consumer needs. Moreover, they should carefully investigate off-target mutations in resultant plants and initially refrain from agricultural use of multiplex genome editing for better risk-benefit communication. The government must consider their regulatory status and establish appropriate regulations if necessary. The government also should foster communication between the public and developers. If people are informed of the benefits of genome editing-mediated plant breeding and trust in the relevant regulations, and if careful risk-benefit communication and sincere considerations for the right to know approach are guaranteed, then such transgene-free crops could gradually be integrated into society.

  10. LEMONS – A Tool for the Identification of Splice Junctions in Transcriptomes of Organisms Lacking Reference Genomes

    PubMed Central

    Bouskila, Amos; Chorev, Michal; Carmel, Liran; Mishmar, Dan

    2015-01-01

    RNA-seq is becoming a preferred tool for genomics studies of model and non-model organisms. However, DNA-based analysis of organisms lacking sequenced genomes cannot rely on RNA-seq data alone to isolate most genes of interest, as DNA codes both exons and introns. With this in mind, we designed a novel tool, LEMONS, that exploits the evolutionary conservation of both exon/intron boundary positions and splice junction recognition signals to produce high throughput splice-junction predictions in the absence of a reference genome. When tested on multiple annotated vertebrate mRNA data, LEMONS accurately identified 87% (average) of the splice-junctions. LEMONS was then applied to our updated Mediterranean chameleon transcriptome, which lacks a reference genome, and predicted a total of 90,820 exon-exon junctions. We experimentally verified these splice-junction predictions by amplifying and sequencing twenty randomly selected genes from chameleon DNA templates. Exons and introns were detected in 19 of 20 of the positions predicted by LEMONS. To the best of our knowledge, LEMONS is currently the only experimentally verified tool that can accurately predict splice-junctions in organisms that lack a reference genome. PMID:26606265

  11. Mireval: a web tool for simple microRNA prediction in genome sequences.

    PubMed

    Ritchie, William; Théodule, François-Xavier; Gautheret, Daniel

    2008-06-01

    We have developed an online tool called mirEval which can search sequences of up to 10 000 nt for novel microRNAs in multiple organisms. It is a comprehensive tool, easy to use and very informative. It will allow users with no prior knowledge of in-silico detection of microRNAs to take advantage of the most successful approaches to investigate sequences of interest. The mirEval web server is available at http://tagc.univ-mrs.fr/mireval

  12. Development and evaluation of a genome-wide 6K SNP array for diploid sweet cherry and tetraploid sour cherry

    USDA-ARS?s Scientific Manuscript database

    High-throughput genome scans are important tools for genetic studies and breeding applications. Here, a 6K SNP array for use with the Illumina Infinium® system was developed for diploid sweet cherry (Prunus avium) and allotetraploid sour cherry (P. cerasus). This effort was led by RosBREED, a commun...

  13. DEVELOPMENT OF REMOTE HANFORD CONNECTOR GASKET REPLACEMENT TOOLING FOR DWPF

    SciTech Connect

    Krementz, D.; Coughlin, Jeffrey

    2009-05-05

    The Defense Waste Processing Facility (DWPF) requested the Savannah River National Laboratory (SRNL) to develop tooling and equipment to remotely replace gaskets in mechanical Hanford connectors to reduce personnel radiation exposure as compared to the current hands-on method. It is also expected that radiation levels will continually increase with future waste streams. The equipment is operated in the Remote Equipment Decontamination Cell (REDC), which is equipped with compressed air, two master-slave manipulators (MSM's) and an electro-mechanical manipulator (EMM) arm for operation of the remote tools. The REDC does not provide access to electrical power, so the equipment must be manually or pneumatically operated. The MSM's have a load limit at full extension of ten pounds, which limited the weight of the installation tool. In order to remotely replace Hanford connector gaskets several operations must be performed remotely, these include: removal of the spent gasket and retaining ring (retaining ring is also called snap ring), loading the new snap ring and gasket into the installation tool and installation of the new gasket into the Hanford connector. SRNL developed and tested tools that successfully perform all of the necessary tasks. Removal of snap rings from horizontal and vertical connectors is performed by separate air actuated retaining ring removal tools and is manipulated in the cell by the MSM. In order install a new gasket, the snap ring loader is used to load a new snap ring into a groove in the gasket installation tool. A new gasket is placed on the installation tool and retained by custom springs. An MSM lifts the installation tool and presses the mounted gasket against the connector block. Once the installation tool is in position, the gasket and snap ring are installed onto the connector by pneumatic actuation. All of the tools are located on a custom work table with a pneumatic valve station that directs compressed air to the desired tool and

  14. The development and demonstration of the metric assessment tool

    NASA Astrophysics Data System (ADS)

    Campbell, Cynthia A.; Gutterman, Gregory M.

    1993-09-01

    This study reflects the development and demonstration of the metric assessment tool. The purpose of the tool was to provide individuals the means to assess metrics and make improvements to the process measurement. The tool was developed using two critical attributes: customer satisfaction and process improvement. Once the tool was developed, a metric assessment process was designed to demonstrate the tool. Two metrics were selected from the Aeronautical Systems Center (ASC) and individuals with a working knowledge of the metric and process were selected for the demonstrations. Using a group support system at Armstrong Laboratory, Wright Patterson AFB, the group was asked to identify behaviors which might be motivated from the metric. Once the behaviors were identified, the group evaluated the behaviors against the critical attributes. From this assessment, behaviors were placed on the metric assessment tool. This tool clearly identified deficient behaviors and how they might distort the process measurement. From this information, the group was asked to generate improvement actions which would serve to eliminate or control deficient behaviors. With the elimination or control of deficient behaviors, the process measurement is improved and the organizational objective is better served.

  15. FOCAL: an experimental design tool for systematizing metabolic discoveries and model development

    PubMed Central

    2012-01-01

    Current computational tools can generate and improve genome-scale models based on existing data; however, for many organisms, the data needed to test and refine such models are not available. To facilitate model development, we created the forced coupling algorithm, FOCAL, to identify genetic and environmental conditions such that a reaction becomes essential for an experimentally measurable phenotype. This reaction's conditional essentiality can then be tested experimentally to evaluate whether network connections occur or to create strains with desirable phenotypes. FOCAL allows network connections to be queried, which improves our understanding of metabolism and accuracy of developed models. PMID:23236964

  16. FOCAL: an experimental design tool for systematizing metabolic discoveries and model development.

    PubMed

    Tervo, Christopher J; Reed, Jennifer L

    2012-12-13

    Current computational tools can generate and improve genome-scale models based on existing data; however, for many organisms, the data needed to test and refine such models are not available. To facilitate model development, we created the forced coupling algorithm, FOCAL, to identify genetic and environmental conditions such that a reaction becomes essential for an experimentally measurable phenotype. This reaction's conditional essentiality can then be tested experimentally to evaluate whether network connections occur or to create strains with desirable phenotypes. FOCAL allows network connections to be queried, which improves our understanding of metabolism and accuracy of developed models.

  17. Host-Brucella interactions and the Brucella genome as tools for subunit antigen discovery and immunization against brucellosis.

    PubMed

    Gomez, Gabriel; Adams, Leslie G; Rice-Ficht, Allison; Ficht, Thomas A

    2013-01-01

    Vaccination is the most important approach to counteract infectious diseases. Thus, the development of new and improved vaccines for existing, emerging, and re-emerging diseases is an area of great interest to the scientific community and general public. Traditional approaches to subunit antigen discovery and vaccine development lack consideration for the critical aspects of public safety and activation of relevant protective host immunity. The availability of genomic sequences for pathogenic Brucella spp. and their hosts have led to development of systems-wide analytical tools that have provided a better understanding of host and pathogen physiology while also beginning to unravel the intricacies at the host-pathogen interface. Advances in pathogen biology, host immunology, and host-agent interactions have the potential to serve as a platform for the design and implementation of better-targeted antigen discovery approaches. With emphasis on Brucella spp., we probe the biological aspects of host and pathogen that merit consideration in the targeted design of subunit antigen discovery and vaccine development.

  18. Host-Brucella interactions and the Brucella genome as tools for subunit antigen discovery and immunization against brucellosis

    PubMed Central

    Gomez, Gabriel; Adams, Leslie G.; Rice-Ficht, Allison; Ficht, Thomas A.

    2013-01-01

    Vaccination is the most important approach to counteract infectious diseases. Thus, the development of new and improved vaccines for existing, emerging, and re-emerging diseases is an area of great interest to the scientific community and general public. Traditional approaches to subunit antigen discovery and vaccine development lack consideration for the critical aspects of public safety and activation of relevant protective host immunity. The availability of genomic sequences for pathogenic Brucella spp. and their hosts have led to development of systems-wide analytical tools that have provided a better understanding of host and pathogen physiology while also beginning to unravel the intricacies at the host-pathogen interface. Advances in pathogen biology, host immunology, and host-agent interactions have the potential to serve as a platform for the design and implementation of better-targeted antigen discovery approaches. With emphasis on Brucella spp., we probe the biological aspects of host and pathogen that merit consideration in the targeted design of subunit antigen discovery and vaccine development. PMID:23720712

  19. EXTATIC: ASML's α-tool development for EUVL

    NASA Astrophysics Data System (ADS)

    Meiling, Hans; Benschop, Jos P.; Hartman, Robert A.; Kuerz, Peter; Hoghoj, Peter; Geyl, Roland; Harned, Noreen

    2002-07-01

    Within the recently initiated EXTATIC project a complete full-field lithography exposure tool for he 50-nm technology node is being developed. The goal is to demonstrate the feasibility of extreme UV lithography (EUVL) for 50-nm imaging and to reduce technological risks in the development of EUVL production tools. We describe the EUV MEDEA+) framework in which EXTATIC is executed, and give an update on the status of the (alpha) -tool development. A brief summary of our in-house source-collector module development is given, as well as the general vacuum architecture of the (alpha) -tool is discussed. We discuss defect-free reticle handling, and investigated the uses of V-grooved brackets glued to the side of the reticle to reduce particle generation during takeovers. These takeovers do not only occur in the exposure tool, but also in multilayer deposition equipment, e-beam pattern writers, inspection tools, etc., where similar requirements on particle contamination are present. Finally, we present an update of mirror fabrication technology and show improved mirror figuring and finishing results.

  20. Development of highly efficient, low-cost lignocellulolytic enzyme systems in the post-genomic era.

    PubMed

    Liu, Guodong; Qin, Yuqi; Li, Zhonghai; Qu, Yinbo

    2013-11-01

    The current high cost of lignocellulolytic enzymes is a major bottleneck in the economic bioconversion of lignocellulosic biomass to fuels and chemicals. Fungal lignocellulolytic enzyme systems are secreted at high levels, making them the most promising starting points for further development of highly efficient lignocellulolytic enzyme systems. In this paper, recent advances in improvement of fungal lignocellulolytic enzyme systems are reviewed, with an emphasis on the achievements made using genomic approaches. A general strategy for lignocellulolytic enzyme system development is proposed, including the improvement of the hydrolysis efficiencies and productivities of current enzyme systems. The applications of genomic, transcriptomic and proteomic analysis methods in examining the composition of native enzyme systems, discovery of novel enzymes and synergistic proteins from natural sources, and understanding of regulatory mechanisms for lignocellulolytic enzyme biosynthesis are summarized. By combining systems biology and synthetic biology tools, engineered fungal strains are expected to produce high levels of optimized lignocellulolytic enzyme systems.

  1. Development of a Safety Management Web Tool for Horse Stables

    PubMed Central

    Leppälä, Jarkko; Kolstrup, Christina Lunner; Pinzke, Stefan; Rautiainen, Risto; Saastamoinen, Markku; Särkijärvi, Susanna

    2015-01-01

    Simple Summary A new web tool for equine activities, InnoHorse, was developed to support horse stable managers in business, safety, pasture and manure management. The aim of the safety section of the web tool was to raise awareness of safety issues in daily horse stable activities. This section contains a safety checklist, stable safety map and good practices to support human health and horse welfare and to prevent injuries in horse-related activities. Reviews of the literature and statistics, empirical horse stable case studies, expert panel workshops and stakeholder interviews were utilized in designing the web tool. Abstract Managing a horse stable involves risks, which can have serious consequences for the stable, employees, clients, visitors and horses. Existing industrial or farm production risk management tools are not directly applicable to horse stables and they need to be adapted for use by managers of different types of stables. As a part of the InnoEquine project, an innovative web tool, InnoHorse, was developed to support horse stable managers in business, safety, pasture and manure management. A literature review, empirical horse stable case studies, expert panel workshops and stakeholder interviews were carried out to support the design. The InnoHorse web tool includes a safety section containing a horse stable safety map, stable safety checklists, and examples of good practices in stable safety, horse handling and rescue planning. This new horse stable safety management tool can also help in organizing work processes in horse stables in general. PMID:26569319

  2. Computational Tools for Accelerating Carbon Capture Process Development

    SciTech Connect

    Miller, David

    2013-01-01

    The goals of the work reported are: to develop new computational tools and models to enable industry to more rapidly develop and deploy new advanced energy technologies; to demonstrate the capabilities of the CCSI Toolset on non-proprietary case studies; and to deploy the CCSI Toolset to industry. Challenges of simulating carbon capture (and other) processes include: dealing with multiple scales (particle, device, and whole process scales); integration across scales; verification, validation, and uncertainty; and decision support. The tools cover: risk analysis and decision making; validated, high-fidelity CFD; high-resolution filtered sub-models; process design and optimization tools; advanced process control and dynamics; process models; basic data sub-models; and cross-cutting integration