Science.gov

Sample records for functional genomics tools

  1. Genetic resources offer efficient tools for rice functional genomics research.

    PubMed

    Lo, Shuen-Fang; Fan, Ming-Jen; Hsing, Yue-Ie; Chen, Liang-Jwu; Chen, Shu; Wen, Ien-Chie; Liu, Yi-Lun; Chen, Ku-Ting; Jiang, Mirng-Jier; Lin, Ming-Kuang; Rao, Meng-Yen; Yu, Lin-Chih; Ho, Tuan-Hua David; Yu, Su-May

    2016-05-01

    Rice is an important crop and major model plant for monocot functional genomics studies. With the establishment of various genetic resources for rice genomics, the next challenge is to systematically assign functions to predicted genes in the rice genome. Compared with the robustness of genome sequencing and bioinformatics techniques, progress in understanding the function of rice genes has lagged, hampering the utilization of rice genes for cereal crop improvement. The use of transfer DNA (T-DNA) insertional mutagenesis offers the advantage of uniform distribution throughout the rice genome, but preferentially in gene-rich regions, resulting in direct gene knockout or activation of genes within 20-30 kb up- and downstream of the T-DNA insertion site and high gene tagging efficiency. Here, we summarize the recent progress in functional genomics using the T-DNA-tagged rice mutant population. We also discuss important features of T-DNA activation- and knockout-tagging and promoter-trapping of the rice genome in relation to mutant and candidate gene characterizations and how to more efficiently utilize rice mutant populations and datasets for high-throughput functional genomics and phenomics studies by forward and reverse genetics approaches. These studies may facilitate the translation of rice functional genomics research to improvements of rice and other cereal crops.

  2. Bovine Genome Database: new tools for gleaning function from the Bos taurus genome.

    PubMed

    Elsik, Christine G; Unni, Deepak R; Diesh, Colin M; Tayal, Aditi; Emery, Marianne L; Nguyen, Hung N; Hagen, Darren E

    2016-01-04

    We report an update of the Bovine Genome Database (BGD) (http://BovineGenome.org). The goal of BGD is to support bovine genomics research by providing genome annotation and data mining tools. We have developed new genome and annotation browsers using JBrowse and WebApollo for two Bos taurus genome assemblies, the reference genome assembly (UMD3.1.1) and the alternate genome assembly (Btau_4.6.1). Annotation tools have been customized to highlight priority genes for annotation, and to aid annotators in selecting gene evidence tracks from 91 tissue specific RNAseq datasets. We have also developed BovineMine, based on the InterMine data warehousing system, to integrate the bovine genome, annotation, QTL, SNP and expression data with external sources of orthology, gene ontology, gene interaction and pathway information. BovineMine provides powerful query building tools, as well as customized query templates, and allows users to analyze and download genome-wide datasets. With BovineMine, bovine researchers can use orthology to leverage the curated gene pathways of model organisms, such as human, mouse and rat. BovineMine will be especially useful for gene ontology and pathway analyses in conjunction with GWAS and QTL studies.

  3. Excisable cassettes: new tools for functional analysis of Streptomyces genomes.

    PubMed

    Raynal, Alain; Karray, Fatma; Tuphile, Karine; Darbon-Rongère, Emmanuelle; Pernodet, Jean-Luc

    2006-07-01

    The functional analysis of microbial genomes often requires gene inactivation. We constructed a set of cassettes consisting of single antibiotic resistance genes flanked by the attL and attR sites resulting from site-specific integration of the Streptomyces pSAM2 element. These cassettes can easily be used to inactivate genes by in-frame deletion in Streptomyces by a three-step strategy. In the first step, in Escherichia coli, the cassette is inserted into a cloned copy of the gene to be inactivated. In the second step, the gene is replaced by homologous recombination in Streptomyces, allowing substitution of the wild-type target gene with its inactivated counterpart. In the third step, the cassette can be removed by expression of the pSAM2 genes xis and int. The resulting strains are marker-free and contain an "attB-like" sequence of 33, 34, or 35 bp with no stop codon if the cassette is correctly chosen. Thus, a gene can be disrupted by creating an in-frame deletion, avoiding polar effects if downstream genes are cotranscribed with the target gene. A set of cassettes was constructed to contain a hygromycin or gentamicin resistance gene flanked by the attL and attR sites. The initial constructions carrying convenient cloning sites allow the insertion of any other marker gene. We tested insertion and excision by inserting a cassette into orf3, the third gene of an operon involved in spiramycin biosynthesis. We verified that the cassette exerted a polar effect on the transcription of downstream genes but that, after excision, complementation with orf3 alone restored spiramycin production.

  4. Excisable Cassettes: New Tools for Functional Analysis of Streptomyces Genomes

    PubMed Central

    Raynal, Alain; Karray, Fatma; Tuphile, Karine; Darbon-Rongère, Emmanuelle; Pernodet, Jean-Luc

    2006-01-01

    The functional analysis of microbial genomes often requires gene inactivation. We constructed a set of cassettes consisting of single antibiotic resistance genes flanked by the attL and attR sites resulting from site-specific integration of the Streptomyces pSAM2 element. These cassettes can easily be used to inactivate genes by in-frame deletion in Streptomyces by a three-step strategy. In the first step, in Escherichia coli, the cassette is inserted into a cloned copy of the gene to be inactivated. In the second step, the gene is replaced by homologous recombination in Streptomyces, allowing substitution of the wild-type target gene with its inactivated counterpart. In the third step, the cassette can be removed by expression of the pSAM2 genes xis and int. The resulting strains are marker-free and contain an “attB-like” sequence of 33, 34, or 35 bp with no stop codon if the cassette is correctly chosen. Thus, a gene can be disrupted by creating an in-frame deletion, avoiding polar effects if downstream genes are cotranscribed with the target gene. A set of cassettes was constructed to contain a hygromycin or gentamicin resistance gene flanked by the attL and attR sites. The initial constructions carrying convenient cloning sites allow the insertion of any other marker gene. We tested insertion and excision by inserting a cassette into orf3, the third gene of an operon involved in spiramycin biosynthesis. We verified that the cassette exerted a polar effect on the transcription of downstream genes but that, after excision, complementation with orf3 alone restored spiramycin production. PMID:16820478

  5. New bioinformatic tool for quick identification of functionally relevant endogenous retroviral inserts in human genome.

    PubMed

    Garazha, Andrew; Ivanova, Alena; Suntsova, Maria; Malakhova, Galina; Roumiantsev, Sergey; Zhavoronkov, Alex; Buzdin, Anton

    2015-01-01

    Endogenous retroviruses (ERVs) and LTR retrotransposons (LRs) occupy ∼8% of human genome. Deep sequencing technologies provide clues to understanding of functional relevance of individual ERVs/LRs by enabling direct identification of transcription factor binding sites (TFBS) and other landmarks of functional genomic elements. Here, we performed the genome-wide identification of human ERVs/LRs containing TFBS according to the ENCODE project. We created the first interactive ERV/LRs database that groups the individual inserts according to their familial nomenclature, number of mapped TFBS and divergence from their consensus sequence. Information on any particular element can be easily extracted by the user. We also created a genome browser tool, which enables quick mapping of any ERV/LR insert according to genomic coordinates, known human genes and TFBS. These tools can be used to easily explore functionally relevant individual ERV/LRs, and for studying their impact on the regulation of human genes. Overall, we identified ∼110,000 ERV/LR genomic elements having TFBS. We propose a hypothesis of "domestication" of ERV/LR TFBS by the genome milieu including subsequent stages of initial epigenetic repression, partial functional release, and further mutation-driven reshaping of TFBS in tight coevolution with the enclosing genomic loci.

  6. Molecular tools for functional genomics in filamentous fungi: recent advances and new strategies.

    PubMed

    Jiang, Dewei; Zhu, Wei; Wang, Yunchuan; Sun, Chang; Zhang, Ke-Qin; Yang, Jinkui

    2013-12-01

    Advances in genetic transformation techniques have made important contributions to molecular genetics. Various molecular tools and strategies have been developed for functional genomic analysis of filamentous fungi since the first DNA transformation was successfully achieved in Neurospora crassa in 1973. Increasing amounts of genomic data regarding filamentous fungi are continuously reported and large-scale functional studies have become common in a wide range of fungal species. In this review, various molecular tools used in filamentous fungi are compared and discussed, including methods for genetic transformation (e.g., protoplast transformation, electroporation, and microinjection), the construction of random mutant libraries (e.g., restriction enzyme mediated integration, transposon arrayed gene knockout, and Agrobacterium tumefaciens mediated transformation), and the analysis of gene function (e.g., RNA interference and transcription activator-like effector nucleases). We also focused on practical strategies that could enhance the efficiency of genetic manipulation in filamentous fungi, such as choosing a proper screening system and marker genes, assembling target-cassettes or vectors effectively, and transforming into strains that are deficient in the nonhomologous end joining pathway. In summary, we present an up-to-date review on the different molecular tools and latest strategies that have been successfully used in functional genomics in filamentous fungi.

  7. Algal Functional Annotation Tool from the DOE-UCLA Institute for Genomics and Proteomics

    DOE Data Explorer

    Lopez, David

    The Algal Functional Annotation Tool is a bioinformatics resource to visualize pathway maps, identify enriched biological terms, or convert gene identifiers to elucidate biological function in silico. These types of analysis have been catered to support lists of gene identifiers, such as those coming from transcriptome gene expression analysis. By analyzing the functional annotation of an interesting set of genes, common biological motifs may be elucidated and a first-pass analysis can point further research in the right direction. Currently, the following databases have been parsed, processed, and added to the tool: 1( Kyoto Encyclopedia of Genes and Genomes (KEGG) Pathways Database, 2) MetaCyc Encyclopedia of Metabolic Pathways, 3) Panther Pathways Database, 4) Reactome Pathways Database, 5) Gene Ontology, 6) MapMan Ontology, 7) KOG (Eukaryotic Clusters of Orthologous Groups), 5)Pfam, 6) InterPro.

  8. MAVEN: a tool for visualization and functional analysis of genome-wide association results

    PubMed Central

    Narayanan, Kanchana; Li, Jing

    2010-01-01

    Summary: We describe the features and implementation of a web application tool named MAVEN—for Management, Analysis, Visualization and rEsults shariNg of genome-wide association data using cutting edge technologies. Main capabilities include user data uploading and management, queries using a variety of criteria, visualization of results, interactive selections and seamless integration of users' data with databases at the National Center for Biotechnology Information (NCBI) for functional annotations of single nucleotide polymorphisms (SNPs) and genes. Availability: http://cbc.case.edu/maven Contact: jingli@case.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:19933166

  9. Tnt1 Retrotransposon Mutagenesis: A Tool for Soybean Functional Genomics1[W][OA

    PubMed Central

    Cui, Yaya; Barampuram, Shyam; Stacey, Minviluz G.; Hancock, C. Nathan; Findley, Seth; Mathieu, Melanie; Zhang, Zhanyuan; Parrott, Wayne A.; Stacey, Gary

    2013-01-01

    Insertional mutagenesis is a powerful tool for determining gene function in both model and crop plant species. Tnt1, the transposable element of tobacco (Nicotiana tabacum) cell type 1, is a retrotransposon that replicates via an RNA copy that is reverse transcribed and integrated elsewhere in the plant genome. Based on studies in a variety of plants, Tnt1 appears to be inactive in normal plant tissue but can be reactivated by tissue culture. Our goal was to evaluate the utility of the Tnt1 retrotransposon as a mutagenesis strategy in soybean (Glycine max). Experiments showed that the Tnt1 element was stably transformed into soybean plants by Agrobacterium tumefaciens-mediated transformation. Twenty-seven independent transgenic lines carrying Tnt1 insertions were generated. Southern-blot analysis revealed that the copy number of transposed Tnt1 elements ranged from four to 19 insertions, with an average of approximately eight copies per line. These insertions showed Mendelian segregation and did not transpose under normal growth conditions. Analysis of 99 Tnt1 flanking sequences revealed insertions into 62 (62%) annotated genes, indicating that the element preferentially inserts into protein-coding regions. Tnt1 insertions were found in all 20 soybean chromosomes, indicating that Tnt1 transposed throughout the soybean genome. Furthermore, fluorescence in situ hybridization experiments validated that Tnt1 inserted into multiple chromosomes. Passage of transgenic lines through two different tissue culture treatments resulted in Tnt1 transposition, significantly increasing the number of insertions per line. Thus, our data demonstrate the Tnt1 retrotransposon to be a powerful system that can be used for effective large-scale insertional mutagenesis in soybean. PMID:23124322

  10. Agrobacterium rhizogenes-induced cotton hairy root culture as an alternative tool for cotton functional genomics

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Although well-accepted as the ultimate method for cotton functional genomics, Agrobacterium tumefaciens-mediated cotton transformation is not widely used for functional analyses of cotton genes and their promoters since regeneration of cotton in tissue culture is lengthy and labor intensive. In cer...

  11. An Analysis of Adenovirus Genomes Using Whole Genome Software Tools

    PubMed Central

    Mahadevan, Padmanabhan

    2016-01-01

    The evolution of sequencing technology has lead to an enormous increase in the number of genomes that have been sequenced. This is especially true in the field of virus genomics. In order to extract meaningful biological information from these genomes, whole genome data mining software tools must be utilized. Hundreds of tools have been developed to analyze biological sequence data. However, only some of these tools are user-friendly to biologists. Several of these tools that have been successfully used to analyze adenovirus genomes are described here. These include Artemis, EMBOSS, pDRAW, zPicture, CoreGenes, GeneOrder, and PipMaker. These tools provide functionalities such as visualization, restriction enzyme analysis, alignment, and proteome comparisons that are extremely useful in the bioinformatics analysis of adenovirus genomes. PMID:28293072

  12. Enabling functional genomics with genome engineering.

    PubMed

    Hilton, Isaac B; Gersbach, Charles A

    2015-10-01

    Advances in genome engineering technologies have made the precise control over genome sequence and regulation possible across a variety of disciplines. These tools can expand our understanding of fundamental biological processes and create new opportunities for therapeutic designs. The rapid evolution of these methods has also catalyzed a new era of genomics that includes multiple approaches to functionally characterize and manipulate the regulation of genomic information. Here, we review the recent advances of the most widely adopted genome engineering platforms and their application to functional genomics. This includes engineered zinc finger proteins, TALEs/TALENs, and the CRISPR/Cas9 system as nucleases for genome editing, transcription factors for epigenome editing, and other emerging applications. We also present current and potential future applications of these tools, as well as their current limitations and areas for future advances.

  13. Enabling functional genomics with genome engineering

    PubMed Central

    Hilton, Isaac B.; Gersbach, Charles A.

    2015-01-01

    Advances in genome engineering technologies have made the precise control over genome sequence and regulation possible across a variety of disciplines. These tools can expand our understanding of fundamental biological processes and create new opportunities for therapeutic designs. The rapid evolution of these methods has also catalyzed a new era of genomics that includes multiple approaches to functionally characterize and manipulate the regulation of genomic information. Here, we review the recent advances of the most widely adopted genome engineering platforms and their application to functional genomics. This includes engineered zinc finger proteins, TALEs/TALENs, and the CRISPR/Cas9 system as nucleases for genome editing, transcription factors for epigenome editing, and other emerging applications. We also present current and potential future applications of these tools, as well as their current limitations and areas for future advances. PMID:26430154

  14. Algal functional annotation tool

    SciTech Connect

    2012-07-12

    Abstract BACKGROUND: Progress in genome sequencing is proceeding at an exponential pace, and several new algal genomes are becoming available every year. One of the challenges facing the community is the association of protein sequences encoded in the genomes with biological function. While most genome assembly projects generate annotations for predicted protein sequences, they are usually limited and integrate functional terms from a limited number of databases. Another challenge is the use of annotations to interpret large lists of 'interesting' genes generated by genome-scale datasets. Previously, these gene lists had to be analyzed across several independent biological databases, often on a gene-by-gene basis. In contrast, several annotation databases, such as DAVID, integrate data from multiple functional databases and reveal underlying biological themes of large gene lists. While several such databases have been constructed for animals, none is currently available for the study of algae. Due to renewed interest in algae as potential sources of biofuels and the emergence of multiple algal genome sequences, a significant need has arisen for such a database to process the growing compendiums of algal genomic data. DESCRIPTION: The Algal Functional Annotation Tool is a web-based comprehensive analysis suite integrating annotation data from several pathway, ontology, and protein family databases. The current version provides annotation for the model alga Chlamydomonas reinhardtii, and in the future will include additional genomes. The site allows users to interpret large gene lists by identifying associated functional terms, and their enrichment. Additionally, expression data for several experimental conditions were compiled and analyzed to provide an expression-based enrichment search. A tool to search for functionally-related genes based on gene expression across these conditions is also provided. Other features include dynamic visualization of genes on KEGG

  15. Geochip: A high throughput genomic tool for linking community structure to functions

    SciTech Connect

    Van Nostrand, Joy D.; Liang, Yuting; He, Zhili; Li, Guanghe; Zhou, Jizhong

    2009-01-30

    GeoChip is a comprehensive functional gene array that targets key functional genes involved in the geochemical cycling of N, C, and P, sulfate reduction, metal resistance and reduction, and contaminant degradation. Studies have shown the GeoChip to be a sensitive, specific, and high-throughput tool for microbial community analysis that has the power to link geochemical processes with microbial community structure. However, several challenges remain regarding the development and applications of microarrays for microbial community analysis.

  16. Mutagenesis as a Tool in Plant Genetics, Functional Genomics, and Breeding

    PubMed Central

    Sikora, Per; Chawade, Aakash; Larsson, Mikael; Olsson, Johanna; Olsson, Olof

    2011-01-01

    Plant mutagenesis is rapidly coming of age in the aftermath of recent developments in high-resolution molecular and biochemical techniques. By combining the high variation of mutagenised populations with novel screening methods, traits that are almost impossible to identify by conventional breeding are now being developed and characterised at the molecular level. This paper provides a comprehensive overview of the various techniques and workflows available to researchers today in the field of molecular breeding, and how these tools complement the ones already used in traditional breeding. Both genetic (Targeting Induced Local Lesions in Genomes; TILLING) and phenotypic screens are evaluated. Finally, different ways of bridging the gap between genotype and phenotype are discussed. PMID:22315587

  17. The SOFG Anatomy Entry List (SAEL): An Annotation Tool for Functional Genomics Data

    PubMed Central

    Parkinson, Helen; Aitken, Stuart; Baldock, Richard A.; Bard, Jonathan B. L.; Burger, Albert; Hayamizu, Terry F.; Rector, Alan; Ringwald, Martin; Rogers, Jeremy; Rosse, Cornelius; Stoeckert, Christian J.

    2004-01-01

    A great deal of data in functional genomics studies needs to be annotated with low-resolution anatomical terms. For example, gene expression assays based on manually dissected samples (microarray, SAGE, etc.) need high-level anatomical terms to describe sample origin. First-pass annotation in high-throughput assays (e.g. large-scale in situ gene expression screens or phenotype screens) and bibliographic applications, such as selection of keywords, would also benefit from a minimum set of standard anatomical terms. Although only simple terms are required, the researcher faces serious practical problems of inconsistency and confusion, given the different aims and the range of complexity of existing anatomy ontologies. A Standards and Ontologies for Functional Genomics (SOFG) group therefore initiated discussions between several of the major anatomical ontologies for higher vertebrates. As we report here, one result of these discussions is a simple, accessible, controlled vocabulary of gross anatomical terms, the SOFG Anatomy Entry List (SAEL). The SAEL is available from http://www.sofg.org and is intended as a resource for biologists, curators, bioinformaticians and developers of software supporting functional genomics. It can be used directly for annotation in the contexts described above. Importantly, each term is linked to the corresponding term in each of the major anatomy ontologies. Where the simple list does not provide enough detail or sophistication, therefore, the researcher can use the SAEL to choose the appropriate ontology and move directly to the relevant term as an entry point. The SAEL links will also be used to support computational access to the respective ontologies. PMID:18629134

  18. Bacterial Artificial Chromosomes: A Functional Genomics Tool for the Study of Positive-strand RNA Viruses

    PubMed Central

    Yun, Sang-Im; Song, Byung-Hak; Kim, Jin-Kyoung; Lee, Young-Min

    2015-01-01

    Reverse genetics, an approach to rescue infectious virus entirely from a cloned cDNA, has revolutionized the field of positive-strand RNA viruses, whose genomes have the same polarity as cellular mRNA. The cDNA-based reverse genetics system is a seminal method that enables direct manipulation of the viral genomic RNA, thereby generating recombinant viruses for molecular and genetic studies of both viral RNA elements and gene products in viral replication and pathogenesis. It also provides a valuable platform that allows the development of genetically defined vaccines and viral vectors for the delivery of foreign genes. For many positive-strand RNA viruses such as Japanese encephalitis virus (JEV), however, the cloned cDNAs are unstable, posing a major obstacle to the construction and propagation of the functional cDNA. Here, the present report describes the strategic considerations in creating and amplifying a genetically stable full-length infectious JEV cDNA as a bacterial artificial chromosome (BAC) using the following general experimental procedures: viral RNA isolation, cDNA synthesis, cDNA subcloning and modification, assembly of a full-length cDNA, cDNA linearization, in vitro RNA synthesis, and virus recovery. This protocol provides a general methodology applicable to cloning full-length cDNA for a range of positive-strand RNA viruses, particularly those with a genome of >10 kb in length, into a BAC vector, from which infectious RNAs can be transcribed in vitro with a bacteriophage RNA polymerase. PMID:26780115

  19. Bacterial Artificial Chromosomes: A Functional Genomics Tool for the Study of Positive-strand RNA Viruses.

    PubMed

    Yun, Sang-Im; Song, Byung-Hak; Kim, Jin-Kyoung; Lee, Young-Min

    2015-12-29

    Reverse genetics, an approach to rescue infectious virus entirely from a cloned cDNA, has revolutionized the field of positive-strand RNA viruses, whose genomes have the same polarity as cellular mRNA. The cDNA-based reverse genetics system is a seminal method that enables direct manipulation of the viral genomic RNA, thereby generating recombinant viruses for molecular and genetic studies of both viral RNA elements and gene products in viral replication and pathogenesis. It also provides a valuable platform that allows the development of genetically defined vaccines and viral vectors for the delivery of foreign genes. For many positive-strand RNA viruses such as Japanese encephalitis virus (JEV), however, the cloned cDNAs are unstable, posing a major obstacle to the construction and propagation of the functional cDNA. Here, the present report describes the strategic considerations in creating and amplifying a genetically stable full-length infectious JEV cDNA as a bacterial artificial chromosome (BAC) using the following general experimental procedures: viral RNA isolation, cDNA synthesis, cDNA subcloning and modification, assembly of a full-length cDNA, cDNA linearization, in vitro RNA synthesis, and virus recovery. This protocol provides a general methodology applicable to cloning full-length cDNA for a range of positive-strand RNA viruses, particularly those with a genome of >10 kb in length, into a BAC vector, from which infectious RNAs can be transcribed in vitro with a bacteriophage RNA polymerase.

  20. The use of retroviruses as pharmaceutical tools for target discovery and validation in the field of functional genomics.

    PubMed

    Lorens, J B; Sousa, C; Bennett, M K; Molineaux, S M; Payan, D G

    2001-12-01

    Retrovirally mediated functional genomics enables identification of physiologically relevant cellular therapeutic targets. Unique properties of retroviruses make them ideal tools for the introduction of large and diverse libraries of potential genetic effectors to a variety of cell types. The identification and recovery of intracellular library elements responsible for altered disease responses establishes a direct basis for pharmaceutical development. Recent innovations in retroviral infection efficiency and expression control have broadened application of the methodology to include libraries of mutagenized cDNAs, peptides and ribozyme genetic effectors.

  1. Genome Exploitation and Bioinformatics Tools

    NASA Astrophysics Data System (ADS)

    de Jong, Anne; van Heel, Auke J.; Kuipers, Oscar P.

    Bioinformatic tools can greatly improve the efficiency of bacteriocin screening efforts by limiting the amount of strains. Different classes of bacteriocins can be detected in genomes by looking at different features. Finding small bacteriocins can be especially challenging due to low homology and because small open reading frames (ORFs) are often omitted from annotations. In this chapter, several bioinformatic tools/strategies to identify bacteriocins in genomes are discussed.

  2. Plant functional genomics

    NASA Astrophysics Data System (ADS)

    Holtorf, Hauke; Guitton, Marie-Christine; Reski, Ralf

    2002-04-01

    Functional genome analysis of plants has entered the high-throughput stage. The complete genome information from key species such as Arabidopsis thaliana and rice is now available and will further boost the application of a range of new technologies to functional plant gene analysis. To broadly assign functions to unknown genes, different fast and multiparallel approaches are currently used and developed. These new technologies are based on known methods but are adapted and improved to accommodate for comprehensive, large-scale gene analysis, i.e. such techniques are novel in the sense that their design allows researchers to analyse many genes at the same time and at an unprecedented pace. Such methods allow analysis of the different constituents of the cell that help to deduce gene function, namely the transcripts, proteins and metabolites. Similarly the phenotypic variations of entire mutant collections can now be analysed in a much faster and more efficient way than before. The different methodologies have developed to form their own fields within the functional genomics technological platform and are termed transcriptomics, proteomics, metabolomics and phenomics. Gene function, however, cannot solely be inferred by using only one such approach. Rather, it is only by bringing together all the information collected by different functional genomic tools that one will be able to unequivocally assign functions to unknown plant genes. This review focuses on current technical developments and their impact on the field of plant functional genomics. The lower plant Physcomitrella is introduced as a new model system for gene function analysis, owing to its high rate of homologous recombination.

  3. Navigating yeast genome maintenance with functional genomics.

    PubMed

    Measday, Vivien; Stirling, Peter C

    2016-03-01

    Maintenance of genome integrity is a fundamental requirement of all organisms. To address this, organisms have evolved extremely faithful modes of replication, DNA repair and chromosome segregation to combat the deleterious effects of an unstable genome. Nonetheless, a small amount of genome instability is the driver of evolutionary change and adaptation, and thus a low level of instability is permitted in populations. While defects in genome maintenance almost invariably reduce fitness in the short term, they can create an environment where beneficial mutations are more likely to occur. The importance of this fact is clearest in the development of human cancer, where genome instability is a well-established enabling characteristic of carcinogenesis. This raises the crucial question: what are the cellular pathways that promote genome maintenance and what are their mechanisms? Work in model organisms, in particular the yeast Saccharomyces cerevisiae, has provided the global foundations of genome maintenance mechanisms in eukaryotes. The development of pioneering genomic tools inS. cerevisiae, such as the systematic creation of mutants in all nonessential and essential genes, has enabled whole-genome approaches to identifying genes with roles in genome maintenance. Here, we review the extensive whole-genome approaches taken in yeast, with an emphasis on functional genomic screens, to understand the genetic basis of genome instability, highlighting a range of genetic and cytological screening modalities. By revealing the biological pathways and processes regulating genome integrity, these analyses contribute to the systems-level map of the yeast cell and inform studies of human disease, especially cancer.

  4. Validation of Synthetic CRISPR Reagents as a Tool for Arrayed Functional Genomic Screening

    PubMed Central

    Tan, Jenille; Martin, Scott E.

    2016-01-01

    To date, lentiviral-based CRISPR-Cas9 screens have largely been conducted in pooled format. However, numerous assays are not amenable to pooled approaches, and lentiviral screening in arrayed format presents many challenges. We sought to examine synthetic CRISPR reagents in the context of arrayed screening. Experiments were performed using aberrant DNA replication as an assay. Using synthetic CRISPR RNAs targeting the known control gene GMNN in HCT-116 cells stably expressing Cas9, we observed statistically significant phenotype among the majority of transfected cells within 72 hours. Additional studies revealed near complete loss of GMNN protein and editing of GMNN DNA. We next conducted a screen of synthetic CRISPR RNAs directed against 640 ubiquitin-related genes. Screening identified known and novel DNA replication regulators that were also supported by siRNA gene knockdown. Notably, CRISPR screening identified more statistically significant hits than corresponding siRNA screens run in parallel. These results highlight the possibility of using synthetic CRISPR reagents as an arrayed screening tool. PMID:28030641

  5. Multilevel functional genomics data integration as a tool for understanding physiology: a network biology perspective.

    PubMed

    Davidsen, Peter K; Turan, Nil; Egginton, Stuart; Falciani, Francesco

    2016-02-01

    The overall aim of physiological research is to understand how living systems function in an integrative manner. Consequently, the discipline of physiology has since its infancy attempted to link multiple levels of biological organization. Increasingly this has involved mathematical and computational approaches, typically to model a small number of components spanning several levels of biological organization. With the advent of "omics" technologies, which can characterize the molecular state of a cell or tissue (intended as the level of expression and/or activity of its molecular components), the number of molecular components we can quantify has increased exponentially. Paradoxically, the unprecedented amount of experimental data has made it more difficult to derive conceptual models underlying essential mechanisms regulating mammalian physiology. We present an overview of state-of-the-art methods currently used to identifying biological networks underlying genomewide responses. These are based on a data-driven approach that relies on advanced computational methods designed to "learn" biology from observational data. In this review, we illustrate an application of these computational methodologies using a case study integrating an in vivo model representing the transcriptional state of hypoxic skeletal muscle with a clinical study representing muscle wasting in chronic obstructive pulmonary disease patients. The broader application of these approaches to modeling multiple levels of biological data in the context of modern physiology is discussed.

  6. The GLOBE 3D Genome Platform - towards a novel system-biological paper tool to integrate the huge complexity of genome organization and function.

    PubMed

    Knoch, Tobias A; Lesnussa, Michael; Kepper, Nick; Eussen, Hubert B; Grosveld, Frank G

    2009-01-01

    Genomes are tremendous co-evolutionary holistic systems for molecular storage, processing and fabrication of information. Their system-biological complexity remains, however, still largely mysterious, despite immense sequencing achievements and huge advances in the understanding of the general sequential, three-dimensional and regulatory organization. Here, we present the GLOBE 3D Genome Platform a completely novel grid based virtual "paper" tool and in fact the first system-biological genome browser integrating the holistic complexity of genomes in a single easy comprehensible platform: Based on a detailed study of biophysical and IT requirements, every architectural level from sequence to morphology of one or several genomes can be approached in a real and in a symbolic representation simultaneously and navigated by continuous scale-free zooming within a unique three-dimensional OpenGL and grid driven environment. In principle an unlimited number of multi-dimensional data sets can be visualized, customized in terms of arrangement, shape, colour, and texture etc. as well as accessed and annotated individually or in groups using internal or external data bases/facilities. Any information can be searched and correlated by importing or calculating simple relations in real-time using grid resources. A general correlation and application platform for more complex correlative analysis and a front-end for system-biological simulations both using again the huge capabilities of grid infrastructures is currently under development. Hence, the GLOBE 3D Genome Platform is an example of a grid based approach towards a virtual desktop for genomic work combining the three fundamental distributed resources: i) visual data representation, ii) data access and management, and iii) data analysis and creation. Thus, the GLOBE 3D Genome Platform is the novel system-biology oriented information system urgently needed to access, present, annotate, and to simulate the holistic genome

  7. VISTA - computational tools for comparative genomics

    SciTech Connect

    Frazer, Kelly A.; Pachter, Lior; Poliakov, Alexander; Rubin,Edward M.; Dubchak, Inna

    2004-01-01

    Comparison of DNA sequences from different species is a fundamental method for identifying functional elements in genomes. Here we describe the VISTA family of tools created to assist biologists in carrying out this task. Our first VISTA server at http://www-gsd.lbl.gov/VISTA/ was launched in the summer of 2000 and was designed to align long genomic sequences and visualize these alignments with associated functional annotations. Currently the VISTA site includes multiple comparative genomics tools and provides users with rich capabilities to browse pre-computed whole-genome alignments of large vertebrate genomes and other groups of organisms with VISTA Browser, submit their own sequences of interest to several VISTA servers for various types of comparative analysis, and obtain detailed comparative analysis results for a set of cardiovascular genes. We illustrate capabilities of the VISTA site by the analysis of a 180 kilobase (kb) interval on human chromosome 5 that encodes for the kinesin family member3A (KIF3A) protein.

  8. Strategies and tools for whole genome alignments

    SciTech Connect

    Couronne, Olivier; Poliakov, Alexander; Bray, Nicolas; Ishkhanov,Tigran; Ryaboy, Dmitriy; Rubin, Edward; Pachter, Lior; Dubchak, Inna

    2002-11-25

    The availability of the assembled mouse genome makespossible, for the first time, an alignment and comparison of two largevertebrate genomes. We have investigated different strategies ofalignment for the subsequent analysis of conservation of genomes that areeffective for different quality assemblies. These strategies were appliedto the comparison of the working draft of the human genome with the MouseGenome Sequencing Consortium assembly, as well as other intermediatemouse assemblies. Our methods are fast and the resulting alignmentsexhibit a high degree of sensitivity, covering more than 90 percent ofknown coding exons in the human genome. We have obtained such coveragewhile preserving specificity. With a view towards the end user, we havedeveloped a suite of tools and websites for automatically aligning, andsubsequently browsing and working with whole genome comparisons. Wedescribe the use of these tools to identify conserved non-coding regionsbetween the human and mouse genomes, some of which have not beenidentified by other methods.

  9. Comparative genomics tools applied to bioterrorism defence.

    PubMed

    Slezak, Tom; Kuczmarski, Tom; Ott, Linda; Torres, Clinton; Medeiros, Dan; Smith, Jason; Truitt, Brian; Mulakken, Nisha; Lam, Marisa; Vitalis, Elizabeth; Zemla, Adam; Zhou, Carol Ecale; Gardner, Shea

    2003-06-01

    comparative genomics algorithms may help spur algorithm developers to tackle some of the many remaining problems that need to be addressed. Solutions to these problems will advance a wide range of biological disciplines, only one of which is pathogen detection. For example, exploration in evolution and phylogenetics, annotating gene coding regions, predicting and understanding gene function and regulation, and untangling gene networks all rely on tools for aligning multiple sequences, detecting gene rearrangements and duplications, and visualising genomic data. Two key problems currently needing improved solutions are: (1) aligning incomplete, fragmentary sequence (eg draft genome contigs or arbitrary genome regions) with both complete genomes and other fragmentary sequences; and (2) ordering, aligning and visualising non-colinear gene rearrangements and inversions in addition to the colinear alignments handled by current tools.

  10. Translational and functional oncogenomics. From cancer-oriented genomic screenings to new diagnostic tools and improved cancer treatment.

    PubMed

    Medico, Enzo

    2008-01-01

    We present here an experimental pipeline for the systematic identification and functional characterization of genes with high potential diagnostic and therapeutic value in human cancer. Complementary competences and resources have been brought together in the TRANSFOG Consortium to reach the following integrated research objectives: 1) execution of cancer-oriented genomic screenings on tumor tissues and experimental models and merging of the results to generate a prioritized panel of candidate genes involved in cancer progression and metastasis; 2) setup of systems for high-throughput delivery of full-length cDNAs, for gain-of-function analysis of the prioritized candidate genes; 3) collection of vectors and oligonucleotides for systematic, RNA interference-mediated down-regulation of the candidate genes; 4) adaptation of existing cell-based and model organism assays to a systematic analysis of gain and loss of function of the candidate genes, for identification and preliminary validation of novel potential therapeutic targets; 5) proteomic analysis of signal transduction and protein-protein interaction for better dissection of aberrant cancer signaling pathways; 6) validation of the diagnostic potential of the identified cancer genes towards the clinical use of diagnostic molecular signatures; 7) generation of a shared informatics platform for data handling and gene functional annotation. The results of the first three years of activity of the TRANSFOG Consortium are also briefly presented and discussed.

  11. Cytoscape: the network visualization tool for GenomeSpace workflows

    PubMed Central

    Demchak, Barry; Hull, Tim; Reich, Michael; Liefeld, Ted; Smoot, Michael; Ideker, Trey; Mesirov, Jill P.

    2014-01-01

    Modern genomic analysis often requires workflows incorporating multiple best-of-breed tools. GenomeSpace is a web-based visual workbench that combines a selection of these tools with mechanisms that create data flows between them. One such tool is Cytoscape 3, a popular application that enables analysis and visualization of graph-oriented genomic networks. As Cytoscape runs on the desktop, and not in a web browser, integrating it into GenomeSpace required special care in creating a seamless user experience and enabling appropriate data flows. In this paper, we present the design and operation of the Cytoscape GenomeSpace app, which accomplishes this integration, thereby providing critical analysis and visualization functionality for GenomeSpace users. It has been downloaded over 850 times since the release of its first version in September, 2013. PMID:25165537

  12. DCODE.ORG Anthology of Comparative Genomic Tools

    SciTech Connect

    Loots, G G; Ovcharenko, I

    2005-01-11

    Comparative genomics provides the means to demarcate functional regions in anonymous DNA sequences. The successful application of this method to identifying novel genes is currently shifting to deciphering the noncoding encryption of gene regulation across genomes. To facilitate the use of comparative genomics to practical applications in genetics and genomics we have developed several analytical and visualization tools for the analysis of arbitrary sequences and whole genomes. These tools include two alignment tools: zPicture and Mulan; a phylogenetic shadowing tool: eShadow for identifying lineage- and species-specific functional elements; two evolutionary conserved transcription factor analysis tools: rVista and multiTF; a tool for extracting cis-regulatory modules governing the expression of co-regulated genes, CREME; and a dynamic portal to multiple vertebrate and invertebrate genome alignments, the ECR Browser. Here we briefly describe each one of these tools and provide specific examples on their practical applications. All the tools are publicly available at the http://www.dcode.org/ web site.

  13. TheViral MetaGenome Annotation Pipeline(VMGAP):an automated tool for the functional annotation of viral Metagenomic shotgun sequencing data

    PubMed Central

    Lorenzi, Hernan A.; Hoover, Jeff; Inman, Jason; Safford, Todd; Murphy, Sean; Kagan, Leonid; Williamson, Shannon J.

    2011-01-01

    In the past few years, the field of metagenomics has been growing at an accelerated pace, particularly in response to advancements in new sequencing technologies. The large volume of sequence data from novel organisms generated by metagenomic projects has triggered the development of specialized databases and tools focused on particular groups of organisms or data types. Here we describe a pipeline for the functional annotation of viral metagenomic sequence data. The Viral MetaGenome Annotation Pipeline (VMGAP) pipeline takes advantage of a number of specialized databases, such as collections of mobile genetic elements and environmental metagenomes to improve the classification and functional prediction of viral gene products. The pipeline assigns a functional term to each predicted protein sequence following a suite of comprehensive analyses whose results are ranked according to a priority rules hierarchy. Additional annotation is provided in the form of enzyme commission (EC) numbers, GO/MeGO terms and Hidden Markov Models together with supporting evidence. PMID:21886867

  14. Computational Tools for Genomic Studies in Plants.

    PubMed

    Martinez, Manuel

    2016-12-01

    In recent years, the genomic sequence of numerous plant species including the main crop species has been determined. Computational tools have been developed to deal with the issue of which plant has been sequenced and where is the sequence hosted. In this mini-review, the databases for genome projects, the databases created to host species/clade projects and the databases developed to perform plant comparative genomics are revised. Because of their importance in modern research, an in-depth analysis of the plant comparative genomics databases has been performed. This comparative analysis is focused in the common and specific computational tools developed to achieve the particular objectives of each database. Besides, emerging high-performance bioinformatics tools specific for plant research are commented. What kind of computational approaches should be implemented in next years to efficiently analyze plant genomes is discussed.

  15. Metabolomics as a Hypothesis-Generating Functional Genomics Tool for the Annotation of Arabidopsis thaliana Genes of “Unknown Function”

    PubMed Central

    Quanbeck, Stephanie M.; Brachova, Libuse; Campbell, Alexis A.; Guan, Xin; Perera, Ann; He, Kun; Rhee, Seung Y.; Bais, Preeti; Dickerson, Julie A.; Dixon, Philip; Wohlgemuth, Gert; Fiehn, Oliver; Barkan, Lenore; Lange, Iris; Lange, B. Markus; Lee, Insuk; Cortes, Diego; Salazar, Carolina; Shuman, Joel; Shulaev, Vladimir; Huhman, David V.; Sumner, Lloyd W.; Roth, Mary R.; Welti, Ruth; Ilarslan, Hilal; Wurtele, Eve S.; Nikolau, Basil J.

    2012-01-01

    Metabolomics is the methodology that identifies and measures global pools of small molecules (of less than about 1,000 Da) of a biological sample, which are collectively called the metabolome. Metabolomics can therefore reveal the metabolic outcome of a genetic or environmental perturbation of a metabolic regulatory network, and thus provide insights into the structure and regulation of that network. Because of the chemical complexity of the metabolome and limitations associated with individual analytical platforms for determining the metabolome, it is currently difficult to capture the complete metabolome of an organism or tissue, which is in contrast to genomics and transcriptomics. This paper describes the analysis of Arabidopsis metabolomics data sets acquired by a consortium that includes five analytical laboratories, bioinformaticists, and biostatisticians, which aims to develop and validate metabolomics as a hypothesis-generating functional genomics tool. The consortium is determining the metabolomes of Arabidopsis T-DNA mutant stocks, grown in standardized controlled environment optimized to minimize environmental impacts on the metabolomes. Metabolomics data were generated with seven analytical platforms, and the combined data is being provided to the research community to formulate initial hypotheses about genes of unknown function (GUFs). A public database (www.PlantMetabolomics.org) has been developed to provide the scientific community with access to the data along with tools to allow for its interactive analysis. Exemplary datasets are discussed to validate the approach, which illustrate how initial hypotheses can be generated from the consortium-produced metabolomics data, integrated with prior knowledge to provide a testable hypothesis concerning the functionality of GUFs. PMID:22645570

  16. ParameciumDB in 2011: new tools and new data for functional and comparative genomics of the model ciliate Paramecium tetraurelia.

    PubMed

    Arnaiz, Olivier; Sperling, Linda

    2011-01-01

    ParameciumDB is a community model organism database built with the GMOD toolkit to integrate the genome and biology of the ciliate Paramecium tetraurelia. Over the last four years, post-genomic data from proteome and transcriptome studies has been incorporated along with predicted orthologs in 33 species, annotations from the community and publications from the scientific literature. Available tools include BioMart for complex queries, GBrowse2 for genome browsing, the Apollo genome editor for expert curation of gene models, a Blast server, a motif finder, and a wiki for protocols, nomenclature guidelines and other documentation. In-house tools have been developed for ontology browsing and evaluation of off-target RNAi matches. Now ready for next-generation deep sequencing data and the genomes of other Paramecium species, this open-access resource is available at http://paramecium.cgm.cnrs-gif.fr.

  17. Nongenetic functions of the genome.

    PubMed

    Bustin, Michael; Misteli, Tom

    2016-05-06

    The primary function of the genome is to store, propagate, and express the genetic information that gives rise to a cell's architectural and functional machinery. However, the genome is also a major structural component of the cell. Besides its genetic roles, the genome affects cellular functions by nongenetic means through its physical and structural properties, particularly by exerting mechanical forces and by serving as a scaffold for binding of cellular components. Major cellular processes affected by nongenetic functions of the genome include establishment of nuclear structure, signal transduction, mechanoresponses, cell migration, and vision in nocturnal animals. We discuss the concept, mechanisms, and implications of nongenetic functions of the genome.

  18. Application of Genomic Tools in Plant Breeding

    PubMed Central

    Pérez-de-Castro, A.M.; Vilanova, S.; Cañizares, J.; Pascual, L.; Blanca, J.M.; Díez, M.J.; Prohens, J.; Picó, B.

    2012-01-01

    Plant breeding has been very successful in developing improved varieties using conventional tools and methodologies. Nowadays, the availability of genomic tools and resources is leading to a new revolution of plant breeding, as they facilitate the study of the genotype and its relationship with the phenotype, in particular for complex traits. Next Generation Sequencing (NGS) technologies are allowing the mass sequencing of genomes and transcriptomes, which is producing a vast array of genomic information. The analysis of NGS data by means of bioinformatics developments allows discovering new genes and regulatory sequences and their positions, and makes available large collections of molecular markers. Genome-wide expression studies provide breeders with an understanding of the molecular basis of complex traits. Genomic approaches include TILLING and EcoTILLING, which make possible to screen mutant and germplasm collections for allelic variants in target genes. Re-sequencing of genomes is very useful for the genome-wide discovery of markers amenable for high-throughput genotyping platforms, like SSRs and SNPs, or the construction of high density genetic maps. All these tools and resources facilitate studying the genetic diversity, which is important for germplasm management, enhancement and use. Also, they allow the identification of markers linked to genes and QTLs, using a diversity of techniques like bulked segregant analysis (BSA), fine genetic mapping, or association mapping. These new markers are used for marker assisted selection, including marker assisted backcross selection, ‘breeding by design’, or new strategies, like genomic selection. In conclusion, advances in genomics are providing breeders with new tools and methodologies that allow a great leap forward in plant breeding, including the ‘superdomestication’ of crops and the genetic dissection and breeding for complex traits. PMID:23115520

  19. Galaxy tools to study genome diversity

    PubMed Central

    2013-01-01

    Background Intra-species genetic variation can be used to investigate population structure, selection, and gene flow in non-model vertebrates; and due to the plummeting costs for genome sequencing, it is now possible for small labs to obtain full-genome variation data from their species of interest. However, those labs may not have easy access to, and familiarity with, computational tools to analyze those data. Results We have created a suite of tools for the Galaxy web server aimed at handling nucleotide and amino-acid polymorphisms discovered by full-genome sequencing of several individuals of the same species, or using a SNP genotyping microarray. In addition to providing user-friendly tools, a main goal is to make published analyses reproducible. While most of the examples discussed in this paper deal with nuclear-genome diversity in non-human vertebrates, we also illustrate the application of the tools to fungal genomes, human biomedical data, and mitochondrial sequences. Conclusions This project illustrates that a small group can design, implement, test, document, and distribute a Galaxy tool collection to meet the needs of a particular community of biologists. PMID:24377391

  20. PanGeT: Pan-genomics tool.

    PubMed

    Yuvaraj, Iyyappan; Sridhar, Jayavel; Michael, Daliah; Sekar, Kanagaraj

    2017-02-05

    A decade after the concept of Pan-genome was first introduced; research in this field has spread its tentacles to areas such as pathogenesis of diseases, bacterial evolutionary studies and drug resistance. Gene content-based differentiation of virulent and a virulent strains of bacteria and identification of pathogen specific genes is imperative to understand their physiology and gain insights into the mechanism of genome evolution. Subsequently, this will aid in identifying diagnostic targets and in developing and selecting vaccines. The root of pan-genomic studies, however, is to identify the core genes, dispensable genes and strain specific genes across the genomes belonging to a clade. To this end, we have developed a tool, "PanGeT - Pan-genomics Tool" to compute the 'pan-genome' based on comparisons at the genome as well as the proteome levels. This automated tool is implemented using LaTeX libraries for effective visualization of overall pan-genome through graphical plots. Links to retrieve sequence information and functional annotations have also been provided. PanGeT can be downloaded from http://pranag.physics.iisc.ernet.in/PanGeT/ or https://github.com/PanGeTv1/PanGeT.

  1. Gene Chips and Functional Genomics

    NASA Astrophysics Data System (ADS)

    Hamadeh, Hisham; Afshari, Cynthia

    2000-11-01

    These past few years of scientific discovery will undoubtedly be remembered as the "genomics era," the period in which biologists succeeded in enumerating the sequence of nucleotides making up all, or at least most, of human DNA. And while this achievement has been heralded as a technological feat equal to the moon landing, it is only the first of many advances in DNA technology. Scientists are now faced with the task of understanding the meaning of the DNA sequence. Specifically, they want to learn how the DNA code relates to protein function. An important tool in the study of "functional genomics," is the cDNA microarray—also known as the gene chip. Inspired by computer microchips, gene chips allow scientists to monitor the expression of hundreds, even thousands, of genes in a fraction of the time it used to take to monitor the expression of a single one. By altering the conditions under which a particular tissue expresses genes—say, by exposing it to toxins or growth factors—scientists can determine the suite of genes expressed in different situations and hence start to get a handle on the function of these genes. The authors discuss this important new technology and some of its practical applications.

  2. Functional genomics tools applied to plant metabolism: a survey on plant respiration, its connections and the annotation of complex gene functions

    PubMed Central

    Araújo, Wagner L.; Nunes-Nesi, Adriano; Williams, Thomas C. R.

    2012-01-01

    The application of post-genomic techniques in plant respiration studies has greatly improved our ability to assign functions to gene products. In addition it has also revealed previously unappreciated interactions between distal elements of metabolism. Such results have reinforced the need to consider plant respiratory metabolism as part of a complex network and making sense of such interactions will ultimately require the construction of predictive and mechanistic models. Transcriptomics, proteomics, metabolomics, and the quantification of metabolic flux will be of great value in creating such models both by facilitating the annotation of complex gene function, determining their structure and by furnishing the quantitative data required to test them. In this review, we highlight how these experimental approaches have contributed to our current understanding of plant respiratory metabolism and its interplay with associated process (e.g., photosynthesis, photorespiration, and nitrogen metabolism). We also discuss how data from these techniques may be integrated, with the ultimate aim of identifying mechanisms that control and regulate plant respiration and discovering novel gene functions with potential biotechnological implications. PMID:22973288

  3. GREAT: a web portal for Genome Regulatory Architecture Tools.

    PubMed

    Bouyioukos, Costas; Bucchini, François; Elati, Mohamed; Képès, François

    2016-07-08

    GREAT (Genome REgulatory Architecture Tools) is a novel web portal for tools designed to generate user-friendly and biologically useful analysis of genome architecture and regulation. The online tools of GREAT are freely accessible and compatible with essentially any operating system which runs a modern browser. GREAT is based on the analysis of genome layout -defined as the respective positioning of co-functional genes- and its relation with chromosome architecture and gene expression. GREAT tools allow users to systematically detect regular patterns along co-functional genomic features in an automatic way consisting of three individual steps and respective interactive visualizations. In addition to the complete analysis of regularities, GREAT tools enable the use of periodicity and position information for improving the prediction of transcription factor binding sites using a multi-view machine learning approach. The outcome of this integrative approach features a multivariate analysis of the interplay between the location of a gene and its regulatory sequence. GREAT results are plotted in web interactive graphs and are available for download either as individual plots, self-contained interactive pages or as machine readable tables for downstream analysis. The GREAT portal can be reached at the following URL https://absynth.issb.genopole.fr/GREAT and each individual GREAT tool is available for downloading.

  4. Integrating sequence, evolution and functional genomics in regulatory genomics

    PubMed Central

    Vingron, Martin; Brazma, Alvis; Coulson, Richard; van Helden, Jacques; Manke, Thomas; Palin, Kimmo; Sand, Olivier; Ukkonen, Esko

    2009-01-01

    With genome analysis expanding from the study of genes to the study of gene regulation, 'regulatory genomics' utilizes sequence information, evolution and functional genomics measurements to unravel how regulatory information is encoded in the genome. PMID:19226437

  5. Frageria vesca, a useful tool for Rosaceae Genomics

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The diploid woodland strawberry, Fragaria vesca subsp. vesca forma semperflorens, has many characteristics that make it an ideal plant for strawberry genomics research. A brief review of some of the tools available for using this plant as a reference for studying gene function in strawberry and oth...

  6. The UCSC genome browser and associated tools.

    PubMed

    Kuhn, Robert M; Haussler, David; Kent, W James

    2013-03-01

    The UCSC Genome Browser (http://genome.ucsc.edu) is a graphical viewer for genomic data now in its 13th year. Since the early days of the Human Genome Project, it has presented an integrated view of genomic data of many kinds. Now home to assemblies for 58 organisms, the Browser presents visualization of annotations mapped to genomic coordinates. The ability to juxtapose annotations of many types facilitates inquiry-driven data mining. Gene predictions, mRNA alignments, epigenomic data from the ENCODE project, conservation scores from vertebrate whole-genome alignments and variation data may be viewed at any scale from a single base to an entire chromosome. The Browser also includes many other widely used tools, including BLAT, which is useful for alignments from high-throughput sequencing experiments. Private data uploaded as Custom Tracks and Data Hubs in many formats may be displayed alongside the rich compendium of precomputed data in the UCSC database. The Table Browser is a full-featured graphical interface, which allows querying, filtering and intersection of data tables. The Saved Session feature allows users to store and share customized views, enhancing the utility of the system for organizing multiple trains of thought. Binary Alignment/Map (BAM), Variant Call Format and the Personal Genome Single Nucleotide Polymorphisms (SNPs) data formats are useful for visualizing a large sequencing experiment (whole-genome or whole-exome), where the differences between the data set and the reference assembly may be displayed graphically. Support for high-throughput sequencing extends to compact, indexed data formats, such as BAM, bigBed and bigWig, allowing rapid visualization of large datasets from RNA-seq and ChIP-seq experiments via local hosting.

  7. Bioinformatics tools for analysing viral genomic data.

    PubMed

    Orton, R J; Gu, Q; Hughes, J; Maabar, M; Modha, S; Vattipally, S B; Wilkie, G S; Davison, A J

    2016-04-01

    The field of viral genomics and bioinformatics is experiencing a strong resurgence due to high-throughput sequencing (HTS) technology, which enables the rapid and cost-effective sequencing and subsequent assembly of large numbers of viral genomes. In addition, the unprecedented power of HTS technologies has enabled the analysis of intra-host viral diversity and quasispecies dynamics in relation to important biological questions on viral transmission, vaccine resistance and host jumping. HTS also enables the rapid identification of both known and potentially new viruses from field and clinical samples, thus adding new tools to the fields of viral discovery and metagenomics. Bioinformatics has been central to the rise of HTS applications because new algorithms and software tools are continually needed to process and analyse the large, complex datasets generated in this rapidly evolving area. In this paper, the authors give a brief overview of the main bioinformatics tools available for viral genomic research, with a particular emphasis on HTS technologies and their main applications. They summarise the major steps in various HTS analyses, starting with quality control of raw reads and encompassing activities ranging from consensus and de novo genome assembly to variant calling and metagenomics, as well as RNA sequencing.

  8. Bovine Genome Database: integrated tools for genome annotation and discovery.

    PubMed

    Childers, Christopher P; Reese, Justin T; Sundaram, Jaideep P; Vile, Donald C; Dickens, C Michael; Childs, Kevin L; Salih, Hanni; Bennett, Anna K; Hagen, Darren E; Adelson, David L; Elsik, Christine G

    2011-01-01

    The Bovine Genome Database (BGD; http://BovineGenome.org) strives to improve annotation of the bovine genome and to integrate the genome sequence with other genomics data. BGD includes GBrowse genome browsers, the Apollo Annotation Editor, a quantitative trait loci (QTL) viewer, BLAST databases and gene pages. Genome browsers, available for both scaffold and chromosome coordinate systems, display the bovine Official Gene Set (OGS), RefSeq and Ensembl gene models, non-coding RNA, repeats, pseudogenes, single-nucleotide polymorphism, markers, QTL and alignments to complementary DNAs, ESTs and protein homologs. The Bovine QTL viewer is connected to the BGD Chromosome GBrowse, allowing for the identification of candidate genes underlying QTL. The Apollo Annotation Editor connects directly to the BGD Chado database to provide researchers with remote access to gene evidence in a graphical interface that allows editing and creating new gene models. Researchers may upload their annotations to the BGD server for review and integration into the subsequent release of the OGS. Gene pages display information for individual OGS gene models, including gene structure, transcript variants, functional descriptions, gene symbols, Gene Ontology terms, annotator comments and links to National Center for Biotechnology Information and Ensembl. Each gene page is linked to a wiki page to allow input from the research community.

  9. Genome Calligrapher: A Web Tool for Refactoring Bacterial Genome Sequences for de Novo DNA Synthesis.

    PubMed

    Christen, Matthias; Deutsch, Samuel; Christen, Beat

    2015-08-21

    Recent advances in synthetic biology have resulted in an increasing demand for the de novo synthesis of large-scale DNA constructs. Any process improvement that enables fast and cost-effective streamlining of digitized genetic information into fabricable DNA sequences holds great promise to study, mine, and engineer genomes. Here, we present Genome Calligrapher, a computer-aided design web tool intended for whole genome refactoring of bacterial chromosomes for de novo DNA synthesis. By applying a neutral recoding algorithm, Genome Calligrapher optimizes GC content and removes obstructive DNA features known to interfere with the synthesis of double-stranded DNA and the higher order assembly into large DNA constructs. Subsequent bioinformatics analysis revealed that synthesis constraints are prevalent among bacterial genomes. However, a low level of codon replacement is sufficient for refactoring bacterial genomes into easy-to-synthesize DNA sequences. To test the algorithm, 168 kb of synthetic DNA comprising approximately 20 percent of the synthetic essential genome of the cell-cycle bacterium Caulobacter crescentus was streamlined and then ordered from a commercial supplier of low-cost de novo DNA synthesis. The successful assembly into eight 20 kb segments indicates that Genome Calligrapher algorithm can be efficiently used to refactor difficult-to-synthesize DNA. Genome Calligrapher is broadly applicable to recode biosynthetic pathways, DNA sequences, and whole bacterial genomes, thus offering new opportunities to use synthetic biology tools to explore the functionality of microbial diversity. The Genome Calligrapher web tool can be accessed at https://christenlab.ethz.ch/GenomeCalligrapher  .

  10. Functional genomics of intracellular bacteria.

    PubMed

    de Barsy, Marie; Greub, Gilbert

    2013-07-01

    During the genomic era, a large amount of whole-genome sequences accumulated, which identified many hypothetical proteins of unknown function. Rapidly, functional genomics, which is the research domain that assign a function to a given gene product, has thus been developed. Functional genomics of intracellular pathogenic bacteria exhibit specific peculiarities due to the fastidious growth of most of these intracellular micro-organisms, due to the close interaction with the host cell, due to the risk of contamination of experiments with host cell proteins and, for some strict intracellular bacteria such as Chlamydia, due to the absence of simple genetic system to manipulate the bacterial genome. To identify virulence factors of intracellular pathogenic bacteria, functional genomics often rely on bioinformatic analyses compared with model organisms such as Escherichia coli and Bacillus subtilis. The use of heterologous expression is another common approach. Given the intracellular lifestyle and the many effectors that are used by the intracellular bacteria to corrupt host cell functions, functional genomics is also often targeting the identification of new effectors such as those of the T4SS of Brucella and Legionella.

  11. From data to function: functional modeling of poultry genomics data.

    PubMed

    McCarthy, F M; Lyons, E

    2013-09-01

    One of the challenges of functional genomics is to create a better understanding of the biological system being studied so that the data produced are leveraged to provide gains for agriculture, human health, and the environment. Functional modeling enables researchers to make sense of these data as it reframes a long list of genes or gene products (mRNA, ncRNA, and proteins) by grouping based upon function, be it individual molecular functions or interactions between these molecules or broader biological processes, including metabolic and signaling pathways. However, poultry researchers have been hampered by a lack of functional annotation data, tools, and training to use these data and tools. Moreover, this lack is becoming more critical as new sequencing technologies enable us to generate data not only for an increasingly diverse range of species but also individual genomes and populations of individuals. We discuss the impact of these new sequencing technologies on poultry research, with a specific focus on what functional modeling resources are available for poultry researchers. We also describe key strategies for researchers who wish to functionally model their own data, providing background information about functional modeling approaches, the data and tools to support these approaches, and the strengths and limitations of each. Specifically, we describe methods for functional analysis using Gene Ontology (GO) functional summaries, functional enrichment analysis, and pathways and network modeling. As annotation efforts begin to provide the fundamental data that underpin poultry functional modeling (such as improved gene identification, standardized gene nomenclature, temporal and spatial expression data and gene product function), tool developers are incorporating these data into new and existing tools that are used for functional modeling, and cyberinfrastructure is being developed to provide the necessary extendibility and scalability for storing and

  12. ECR Browser: A Tool For Visualizing And Accessing Data From Comparisons Of Multiple Vertebrate Genomes

    SciTech Connect

    Loots, G G; Ovcharenko, I; Stubbs, L; Nobrega, M A

    2004-01-06

    The increasing number of vertebrate genomes being sequenced in draft or finished form provide a unique opportunity to study and decode the language of DNA sequence through comparative genome alignments. However, novel tools and strategies are required to accommodate this increasing volume of genomic information and to facilitate experimental annotation of genome function. Here we present the ECR Browser, a tool that provides an easy and dynamic access to whole genome alignments of human, mouse, rat and fish sequences. This web-based tool (http://ecrbrowser.dcode.org) provides the starting point for discovery of novel genes, identification of distant gene regulatory elements and prediction of transcription factor binding sites. The genome alignment portal of the ECR Browser also permits fast and automated alignment of any user-submitted sequence to the genome of choice. The interconnection of the ECR browser with other DNA sequence analysis tools creates a unique portal for studying and exploring vertebrate genomes.

  13. From plant genomes to protein families: computational tools

    PubMed Central

    Martinez, Manuel

    2013-01-01

    The development of new high-throughput sequencing technologies has increased dramatically the number of successful genomic projects. Thus, draft genomic sequences of more than 60 plant species are currently available. Suitable bioinformatics tools are being developed to assemble, annotate and analyze the enormous number of sequences produced. In this context, specific plant comparative genomic databases are become powerful tools for gene family annotation in plant clades. In this mini-review, the current state-of-art of genomic projects is glossed. Besides, the computational tools developed to compare genomic data are compiled. PMID:24688740

  14. [Tale nucleases--new tool for genome editing].

    PubMed

    Glazkova, D V; Shipulin, G A

    2014-01-01

    The ability to introduce targeted changes in the genome of living cells or entire organisms enables researchers to meet the challenges of basic life sciences, biotechnology and medicine. Knockdown of target genes in the zygotes gives the opportunity to investigate the functions of these genes in different organisms. Replacement of single nucleotide in the DNA sequence allows to correct mutations in genes and thus to cure hereditary diseases. Adding transgene to specific genomic.loci can be used in biotechnology for generation of organisms with certain properties or cell lines for biopharmaceutical production. Such manipulations of gene sequences in their natural chromosomal context became possible after the emergence of the technology called "genome editing". This technology is based on the induction of a double-strand break in a specific genomic target DNA using endonucleases that recognize the unique sequences in the genome and on subsequent recovery of DNA integrity through the use of cellular repair mechanisms. A necessary tool for the genome editing is a custom-designed endonuclease which is able to recognize selected sequences. The emergence of a new type of programmable endonucleases, which were constructed on the basis of bacterial proteins--TAL-effectors (Transcription activators like effector), has become an important stage in the development of technology and promoted wide spread of the genome editing. This article reviews the history of the discovery of TAL effectors and creation of TALE nucleases, and describes their advantages over zinc finger endonucleases that appeared earlier. A large section is devoted to description of genetic modifications that can be performed using the genome editing.

  15. Endonucleases: new tools to edit the mouse genome.

    PubMed

    Wijshake, Tobias; Baker, Darren J; van de Sluis, Bart

    2014-10-01

    Mouse transgenesis has been instrumental in determining the function of genes in the pathophysiology of human diseases and modification of genes by homologous recombination in mouse embryonic stem cells remains a widely used technology. However, this approach harbors a number of disadvantages, as it is time-consuming and quite laborious. Over the last decade a number of new genome editing technologies have been developed, including zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs) and clustered regularly interspaced short palindromic repeats/CRISPR-associated (CRISPR/Cas). These systems are characterized by a designed DNA binding protein or RNA sequence fused or co-expressed with a non-specific endonuclease, respectively. The engineered DNA binding protein or RNA sequence guides the nuclease to a specific target sequence in the genome to induce a double strand break. The subsequent activation of the DNA repair machinery then enables the introduction of gene modifications at the target site, such as gene disruption, correction or insertion. Nuclease-mediated genome editing has numerous advantages over conventional gene targeting, including increased efficiency in gene editing, reduced generation time of mutant mice, and the ability to mutagenize multiple genes simultaneously. Although nuclease-driven modifications in the genome are a powerful tool to generate mutant mice, there are concerns about off-target cleavage, especially when using the CRISPR/Cas system. Here, we describe the basic principles of these new strategies in mouse genome manipulation, their inherent advantages, and their potential disadvantages compared to current technologies used to study gene function in mouse models. This article is part of a Special Issue entitled: From Genome to Function.

  16. Functional genomics of pathogenic bacteria.

    PubMed Central

    Moxon, E R; Hood, D W; Saunders, N J; Schweda, E K H; Richards, J C

    2002-01-01

    Microbial diseases remain the commonest cause of global mortality and morbidity. Automated-DNA sequencing has revolutionized the investigation of pathogenic microbes by making the immense fund of information contained in their genomes available at reasonable cost. The challenge is how this information can be used to increase current understanding of the biology of commensal and virulence behaviour of pathogens with particular emphasis on in vivo function and novel approaches to prevention. One example of the application of whole-genome-sequence information is afforded by investigations of the pathogenic role of Haemophilus influenzae lipopolysaccharide and its candidacy as a vaccine. PMID:11839188

  17. The integrated microbial genomes (IMG) system in 2007: datacontent and analysis tool extensions

    SciTech Connect

    Markowitz, Victor M.; Szeto, Ernest; Palaniappan, Krishna; Grechkin, Yuri; Chu, Ken; Chen, I-Min A.; Dubchak, Inna; Anderson, Iain; Lykidis, Athanasios; Mavromatis, Konstantinos; Ivanova, Natalia N.; Kyrpides, Nikos C.

    2007-08-01

    The Integrated Microbial Genomes (IMG) system is a data management, analysis and annotation platform for all publicly available genomes. IMG contains both draft and complete JGI microbial genomes integrated with all other publicly available genomes from all three domains of life, together with a large number of plasmids and viruses. IMG provides tools and viewers for analyzing and annotating genomes, genes and functions, individually or in a comparative context. Since its first release in 2005, IMG's data content and analytical capabilities have been constantly expanded through quarterly releases. IMG is provided by the DOE-Joint Genome Institute (JGI) and is available from http://img.jgi.doe.gov.

  18. 2004 Structural, Function and Evolutionary Genomics

    SciTech Connect

    Douglas L. Brutlag Nancy Ryan Gray

    2005-03-23

    This Gordon conference will cover the areas of structural, functional and evolutionary genomics. It will take a systematic approach to genomics, examining the evolution of proteins, protein functional sites, protein-protein interactions, regulatory networks, and metabolic networks. Emphasis will be placed on what we can learn from comparative genomics and entire genomes and proteomes.

  19. REVIEW: Zebrafish: A Renewed Model System For Functional Genomics

    NASA Astrophysics Data System (ADS)

    Wen, Xiao-Yan

    2008-01-01

    In the post genome era, a major goal in molecular biology is to determine the function of the many thousands of genes present in the vertebrate genome. The zebrafish (Danio rerio) provides an almost ideal genetic model to identify the biological roles of these novel genes, in part because their embryos are transparent and develop rapidly. The zebrafish has many advantages over mouse for genome-wide mutagenesis studies, allowing for easier, cheaper and faster functional characterization of novel genes in the vertebrate genome. Many molecular research tools such as chemical mutagenesis, transgenesis, gene trapping, gene knockdown, TILLING, gene targeting, RNAi and chemical genetic screen are now available in zebrafish. Combining all the forward, reverse, and chemical genetic tools, it is expected that zebrafish will make invaluable contribution to vertebrate functional genomics in functional annotation of the genes, modeling human diseases and drug discoveries.

  20. [The place of functional genomics in oncological research].

    PubMed

    Bálint, Bálint L; Nagy, László

    2013-03-01

    The 1000 genomes project changed the way how we see the human genome. The rapid development of the deep sequencing technologies is raising several practical questions, and the way how we answer these questions will affect deeply the future of the oncological reseach in Hungary. In our manuscript we give a short overview of the results of the 1000 genomes project and we present the place of the functional genomic investigations between other genomic tools. Based on the recent development in the field we summarize the challenges that have to be addressed in the next couple of years.

  1. Functional genomic screening to enhance oncolytic virotherapy.

    PubMed

    Mahoney, D J; Stojdl, D F

    2013-02-05

    Functional genomic screening has emerged as a powerful approach for understanding complex biological phenomena. Of the available tools, genome-wide RNA interference (RNAi) technology is unquestionably the most incisive, as it directly probes gene function. Recent applications of RNAi screening have been impressive. Notable amongst these are its use in elucidated mechanism(s) for signal transduction, various aspects of cell biology, tumourigenesis and metastasis, resistance to cancer therapeutics, and the host's response to a pathogen. Herein we discuss how recent RNAi screening efforts have helped turn our attention to the targetability of non-oncogene support pathways for cancer treatment, with a particular focus on a recent study that identified a non-oncogene addiction to the ER stress response as a synergist target for oncolytic virus therapy (OVT). Moreover, we give our thoughts on the future of RNAi screening as a tool to enhance OVT and describe recent technical improvements that are poised to make genome-scale RNAi experiments more sensitive, less noisy, more applicable in vivo, and more easily validated in clinically relevant animal models.

  2. Integrative data-mining tools to link gene and function.

    PubMed

    El Yacoubi, Basma; de Crécy-Lagard, Valérie

    2014-01-01

    Information derived from genomic and post-genomic data can be efficiently used to link gene and function. Several web-based platforms have been developed to mine these types of data by integrating different tools. This method paper is designed to allow the user to navigate these platforms in order to make functional predictions. The main focus is on phylogenetic distribution and physical clustering tools, but other tools such as pathway reconstruction, gene fusions, and analysis of high-throughput experimental data are also surveyed.

  3. Data management tools for genomic applications: A progress report

    SciTech Connect

    Markowitz, V.M.; Chen, I-Min A.

    1993-09-01

    We report in this paper on the development of data management tools that allow scientist to construct and manipulate genomic data bases in terms of application-specific objects and protocols. We are developing tools for specifying genomic database structures, as well as for entering, changing, maintaining, browsing and querying data in genomic data bases. These tools are based on the Object-protocol Model (OPM) developed by us and target commercial relational database management systems which are widely used in molecular biology laboratories. OPM allows scientists to interact with genomic databases in terms of their own frame or reference, namely genomic objects and protocols. Databases developed using the data management tools are easier to use, manage, and adapt.

  4. FDA Bioinformatics Tool for Microbial Genomics Research on Molecular Characterization of Bacterial Foodborne Pathogens Using Microarrays

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background: Advances in microbial genomics and bioinformatics are offering greater insights into the emergence and spread of foodborne pathogens in outbreak scenarios. The Food and Drug Administration (FDA) has developed the genomics tool ArrayTrackTM, which provides extensive functionalities to man...

  5. Neuroscience in the era of functional genomics and systems biology.

    PubMed

    Geschwind, Daniel H; Konopka, Genevieve

    2009-10-15

    Advances in genetics and genomics have fuelled a revolution in discovery-based, or hypothesis-generating, research that provides a powerful complement to the more directly hypothesis-driven molecular, cellular and systems neuroscience. Genetic and functional genomic studies have already yielded important insights into neuronal diversity and function, as well as disease. One of the most exciting and challenging frontiers in neuroscience involves harnessing the power of large-scale genetic, genomic and phenotypic data sets, and the development of tools for data integration and mining. Methods for network analysis and systems biology offer the promise of integrating these multiple levels of data, connecting molecular pathways to nervous system function.

  6. Technology developments in biological tools for targeted genome surgery.

    PubMed

    Teimourian, Shahram; Abdollahzadeh, Rasoul

    2015-01-01

    Different biological tools for targeted genome engineering have recently appeared and these include tools like meganucleases, zinc-finger nucleases and newer technologies including TALENs and CRISPR/Cas systems. transcription activator-like effector nucleases (TALENs) have greatly improved genome editing efficiency by making site-specific DNA double-strand breaks. Several studies have shown the prominence of TALENs in comparison to the meganucleases and zinc-finger nucleases. The most important feature of TALENs that makes them suitable tools for targeted genome editing is the modularity of central repeat domains, meaning that they can be designed to recognize any desirable DNA sequence. In this review, we present a comprehensive and concise description of TALENs technology developments for targeted genome surgery with to the point description and comparison of other tools.

  7. Evolution, language and analogy in functional genomics

    NASA Technical Reports Server (NTRS)

    Benner, S. A.; Gaucher, E. A.

    2001-01-01

    Almost a century ago, Wittgenstein pointed out that theory in science is intricately connected to language. This connection is not a frequent topic in the genomics literature. But a case can be made that functional genomics is today hindered by the paradoxes that Wittgenstein identified. If this is true, until these paradoxes are recognized and addressed, functional genomics will continue to be limited in its ability to extrapolate information from genomic sequences.

  8. Evolution, language and analogy in functional genomics.

    PubMed

    Benner, S A; Gaucher, E A

    2001-07-01

    Almost a century ago, Wittgenstein pointed out that theory in science is intricately connected to language. This connection is not a frequent topic in the genomics literature. But a case can be made that functional genomics is today hindered by the paradoxes that Wittgenstein identified. If this is true, until these paradoxes are recognized and addressed, functional genomics will continue to be limited in its ability to extrapolate information from genomic sequences.

  9. Genomics and public health: development of Web-based training tools for increasing genomic awareness.

    PubMed

    Bodzin, Jennifer; Kardia, Sharon L R; Goldenberg, Aaron; Raup, Sarah F; Bach, Janice V; Citrin, Toby

    2005-04-01

    In 2001, the Centers for Disease Control and Prevention funded three Centers for Genomics and Public Health to develop training tools for increasing genomic awareness. Over the past three years, the centers, working together with the Centers for Disease Control and Prevention's Office of Genomics and Disease Prevention, have developed tools to increase awareness of the impact genomics will have on public health practice, to provide a foundation for understanding basic genomic advances, and to translate the relevance of that information to public health practitioners' own work. These training tools serve to communicate genomic advances and their potential for integration into public heath practice. This paper highlights two of these training tools: 1) Genomics for Public Health Practitioners: The Practical Application of Genomics in Public Health Practice, a Web-based introduction to genomics, and 2) Six Weeks to Genomic Awareness, an in-depth training module on public health genomics. This paper focuses on the processes and collaborative efforts by which these live presentations were developed and delivered as Web-based training sessions.

  10. CoryneBase: Corynebacterium Genomic Resources and Analysis Tools at Your Fingertips

    PubMed Central

    Tan, Mui Fern; Jakubovics, Nick S.; Wee, Wei Yee; Mutha, Naresh V. R.; Wong, Guat Jah; Ang, Mia Yang; Yazdi, Amir Hessam; Choo, Siew Woh

    2014-01-01

    Corynebacteria are used for a wide variety of industrial purposes but some species are associated with human diseases. With increasing number of corynebacterial genomes having been sequenced, comparative analysis of these strains may provide better understanding of their biology, phylogeny, virulence and taxonomy that may lead to the discoveries of beneficial industrial strains or contribute to better management of diseases. To facilitate the ongoing research of corynebacteria, a specialized central repository and analysis platform for the corynebacterial research community is needed to host the fast-growing amount of genomic data and facilitate the analysis of these data. Here we present CoryneBase, a genomic database for Corynebacterium with diverse functionality for the analysis of genomes aimed to provide: (1) annotated genome sequences of Corynebacterium where 165,918 coding sequences and 4,180 RNAs can be found in 27 species; (2) access to comprehensive Corynebacterium data through the use of advanced web technologies for interactive web interfaces; and (3) advanced bioinformatic analysis tools consisting of standard BLAST for homology search, VFDB BLAST for sequence homology search against the Virulence Factor Database (VFDB), Pairwise Genome Comparison (PGC) tool for comparative genomic analysis, and a newly designed Pathogenomics Profiling Tool (PathoProT) for comparative pathogenomic analysis. CoryneBase offers the access of a range of Corynebacterium genomic resources as well as analysis tools for comparative genomics and pathogenomics. It is publicly available at http://corynebacterium.um.edu.my/. PMID:24466021

  11. Serine recombinases as tools for genome engineering.

    PubMed

    Brown, William R A; Lee, Nicholas C O; Xu, Zhengyao; Smith, Margaret C M

    2011-04-01

    The serine recombinases differ mechanistically from the tyrosine recombinases and include proteins such as ϕC31 integrase which, unlike Cre and Flp, promote unidirectional reactions. The serine recombinase family is large and includes many other proteins besides ϕC31 integrase with the potential to be widely used in genome engineering. Here we review the details of the mechanism of the reactions promoted by the serine recombinases and discuss how these not only limit the utility of this class of recombinase but also creates opportunities for the engineering of new enzymes. We discuss the unanswered questions posed by genome engineering experiments in a variety of systems in which the serine recombinases have been used and finally describe more recently discovered serine recombinases that have the potential to be used in genome engineering.

  12. Functional genomics down under: RNAi screening in the Victorian Centre for Functional Genomics.

    PubMed

    Thomas, Daniel W; Gould, Cathryn M; Handoko, Yanny; Simpson, Kaylene J

    2014-05-01

    The Victorian Centre for Functional Genomics (VCFG) is an RNAi screening facility housed at the Peter MacCallum Cancer Centre in Melbourne, Australia. The Peter Mac is Australia's largest dedicated Cancer Research Institute, home to a team of over 520 scientists that focus on understanding the genetic risk of cancer, the molecular events regulating cancer growth and dissemination and improving detection through new diagnostic tools (www.petermac.org). Peter Mac is a well recognised technology leader and established the VCFG with a view to enabling researchers Australia and New Zealand-wide access to cutting edge functional genomics technology, infrastructure and expertise. This review documents the technology platforms operated within the VCFG and provides insight into the workflows and analysis pipelines currently in operation.

  13. Can orangutans (Pongo abelii) infer tool functionality?

    PubMed

    Mulcahy, Nicholas J; Schubiger, Michèle N

    2014-05-01

    It is debatable whether apes can reason about the unobservable properties of tools. We tested orangutans for this ability with a range of tool tasks that they could solve by using observational cues to infer tool functionality. In experiment 1, subjects successfully chose an unbroken tool over a broken one when each tool's middle section was hidden. This prevented seeing which tool was functional but it could be inferred by noting the tools' visible ends that were either disjointed (broken tool) or aligned (unbroken tool). We investigated whether success in experiment 1 was best explained by inferential reasoning or by having a preference per se for a hidden tool with an aligned configuration. We conducted a similar task to experiment 1 and included a functional bent tool that could be arranged to have the same disjointed configuration as the broken tool. The results suggested that subjects had a preference per se for the aligned tool by choosing it regardless of whether it was paired with the broken tool or the functional bent tool. However, further experiments with the bent tool task suggested this preference was a result of additional demands of having to attend to and remember the properties of the tools from the beginning of the task. In our last experiment, we removed these task demands and found evidence that subjects could infer the functionality of a broken tool and an unbroken tool that both looked identical at the time of choice.

  14. Patenting genome research tools and the law.

    PubMed

    Eisenberg, Rebecca

    2003-01-01

    Patenting genes encoding therapeutic proteins was relatively uncontroversial in the early days of biotechnology. Controversy arose in the era of high-throughput DNA sequencing, when gene patents started to look less like patents on drugs and more like patents on scientific information. Evolving scientific and business strategies for exploiting genomic information raised concerns that patents might slow subsequent research. The trend towards stricter enforcement of the utility and disclosure requirements by the patent offices should help clarify the current confusion.

  15. From Loci to Biology: Functional Genomics of Genome-Wide Association for Coronary Disease

    PubMed Central

    Nurnberg, Sylvia T; Zhang, Hanrui; Hand, Nicholas J; Bauer, Robert C; Saleheen, Danish; Reilly, Muredach P; Rader, Daniel J

    2016-01-01

    Genome-wide association studies (GWAS) have provided a rich collection of ~58 CAD loci that suggest the existence of previously unsuspected new biology relevant to atherosclerosis. However, these studies only identify genomic loci associated with CAD and many questions remain even after a genomic locus is definitively implicated, including the nature of the causal variant(s) and the causal gene(s), as well as the directionality of effect. There are a number of tools that can be employed for investigation of the functional genomics of these loci, and progress has been made on a limited number of novel CAD loci. New biology regarding atherosclerosis and CAD will be learned through the functional genomics of these loci and the hope is that at least some of these new pathways relevant to CAD pathogenesis will yield new therapeutic targets for the prevention and treatment of CAD. PMID:26892960

  16. Genomic Tools in Groundnut Breeding Program: Status and Perspectives

    PubMed Central

    Janila, P.; Variath, Murali T.; Pandey, Manish K.; Desmae, Haile; Motagi, Babu N.; Okori, Patrick; Manohar, Surendra S.; Rathnakumar, A. L.; Radhakrishnan, T.; Liao, Boshou; Varshney, Rajeev K.

    2016-01-01

    Groundnut, a nutrient-rich food legume, is cultivated world over. It is valued for its good quality cooking oil, energy and protein rich food, and nutrient-rich fodder. Globally, groundnut improvement programs have developed varieties to meet the preferences of farmers, traders, processors, and consumers. Enhanced yield, tolerance to biotic and abiotic stresses and quality parameters have been the target traits. Spurt in genetic information of groundnut was facilitated by development of molecular markers, genetic, and physical maps, generation of expressed sequence tags (EST), discovery of genes, and identification of quantitative trait loci (QTL) for some important biotic and abiotic stresses and quality traits. The first groundnut variety developed using marker assisted breeding (MAB) was registered in 2003. Since then, USA, China, Japan, and India have begun to use genomic tools in routine groundnut improvement programs. Introgression lines that combine foliar fungal disease resistance and early maturity were developed using MAB. Establishment of marker-trait associations (MTA) paved way to integrate genomic tools in groundnut breeding for accelerated genetic gain. Genomic Selection (GS) tools are employed to improve drought tolerance and pod yield, governed by several minor effect QTLs. Draft genome sequence and low cost genotyping tools such as genotyping by sequencing (GBS) are expected to accelerate use of genomic tools to enhance genetic gains for target traits in groundnut. PMID:27014312

  17. Genomic Tools in Groundnut Breeding Program: Status and Perspectives.

    PubMed

    Janila, P; Variath, Murali T; Pandey, Manish K; Desmae, Haile; Motagi, Babu N; Okori, Patrick; Manohar, Surendra S; Rathnakumar, A L; Radhakrishnan, T; Liao, Boshou; Varshney, Rajeev K

    2016-01-01

    Groundnut, a nutrient-rich food legume, is cultivated world over. It is valued for its good quality cooking oil, energy and protein rich food, and nutrient-rich fodder. Globally, groundnut improvement programs have developed varieties to meet the preferences of farmers, traders, processors, and consumers. Enhanced yield, tolerance to biotic and abiotic stresses and quality parameters have been the target traits. Spurt in genetic information of groundnut was facilitated by development of molecular markers, genetic, and physical maps, generation of expressed sequence tags (EST), discovery of genes, and identification of quantitative trait loci (QTL) for some important biotic and abiotic stresses and quality traits. The first groundnut variety developed using marker assisted breeding (MAB) was registered in 2003. Since then, USA, China, Japan, and India have begun to use genomic tools in routine groundnut improvement programs. Introgression lines that combine foliar fungal disease resistance and early maturity were developed using MAB. Establishment of marker-trait associations (MTA) paved way to integrate genomic tools in groundnut breeding for accelerated genetic gain. Genomic Selection (GS) tools are employed to improve drought tolerance and pod yield, governed by several minor effect QTLs. Draft genome sequence and low cost genotyping tools such as genotyping by sequencing (GBS) are expected to accelerate use of genomic tools to enhance genetic gains for target traits in groundnut.

  18. TALEN and CRISPR/Cas Genome Editing Systems: Tools of Discovery

    PubMed Central

    Nemudryi, A. A.; Valetdinova, K. R.; Medvedev, S. P.; Zakian, S. M.

    2014-01-01

    Precise studies of plant, animal and human genomes enable remarkable opportunities of obtained data application in biotechnology and medicine. However, knowing nucleotide sequences isn’t enough for understanding of particular genomic elements functional relationship and their role in phenotype formation and disease pathogenesis. In post-genomic era methods allowing genomic DNA sequences manipulation, visualization and regulation of gene expression are rapidly evolving. Though, there are few methods, that meet high standards of efficiency, safety and accessibility for a wide range of researchers. In 2011 and 2013 novel methods of genome editing appeared – this are TALEN (Transcription Activator-Like Effector Nucleases) and CRISPR (Clustered Regulatory Interspaced Short Palindromic Repeats)/Cas9 systems. Although TALEN and CRISPR/Cas9 appeared recently, these systems have proved to be effective and reliable tools for genome engineering. Here we generally review application of these systems for genome editing in conventional model objects of current biology, functional genome screening, cell-based human hereditary disease modeling, epigenome studies and visualization of cellular processes. Additionally, we review general strategies for designing TALEN and CRISPR/Cas9 and analyzing their activity. We also discuss some obstacles researcher can face using these genome editing tools. PMID:25349712

  19. The capsicum transcriptome DB: a "hot" tool for genomic research.

    PubMed

    Góngora-Castillo, Elsa; Fajardo-Jaime, Rubén; Fernández-Cortes, Araceli; Jofre-Garfias, Alba E; Lozoya-Gloria, Edmundo; Martínez, Octavio; Ochoa-Alejo, Neftalí; Rivera-Bustamante, Rafael

    2012-01-01

    Chili pepper (Capsicum annuum) is an economically important crop with no available public genome sequence. We describe a genomic resource to facilitate Capsicum annuum research. A collection of Expressed Sequence Tags (ESTs) derived from five C. annuum organs (root, stem, leaf, flower and fruit) were sequenced using the Sanger method and multiple leaf transcriptomes were deeply sampled using with GS-pyrosequencing. A hybrid assembly of 1,324,516 raw reads yielded 32,314 high quality contigs as validated by coverage and identity analysis with existing pepper sequences. Overall, 75.5% of the contigs had significant sequence similarity to entries in nucleic acid and protein databases; 23% of the sequences have not been previously reported for C. annuum and expand sequence resources for this species. A MySQL database and a user-friendly Web interface were constructed with search-tools that permit queries of the ESTs including sequence, functional annotation, Gene Ontology classification, metabolic pathways, and assembly information. The Capsicum Transcriptome DB is free available from http://www.bioingenios.ira.cinvestav.mx:81/Joomla/

  20. Integrative genomic analysis by interoperation of bioinformatics tools in GenomeSpace

    PubMed Central

    Thorvaldsdottir, Helga; Liefeld, Ted; Ocana, Marco; Borges-Rivera, Diego; Pochet, Nathalie; Robinson, James T.; Demchak, Barry; Hull, Tim; Ben-Artzi, Gil; Blankenberg, Daniel; Barber, Galt P.; Lee, Brian T.; Kuhn, Robert M.; Nekrutenko, Anton; Segal, Eran; Ideker, Trey; Reich, Michael; Regev, Aviv; Chang, Howard Y.; Mesirov, Jill P.

    2015-01-01

    Integrative analysis of multiple data types to address complex biomedical questions requires the use of multiple software tools in concert and remains an enormous challenge for most of the biomedical research community. Here we introduce GenomeSpace (http://www.genomespace.org), a cloud-based, cooperative community resource. Seeded as a collaboration of six of the most popular genomics analysis tools, GenomeSpace now supports the streamlined interaction of 20 bioinformatics tools and data resources. To facilitate the ability of non-programming users’ to leverage GenomeSpace in integrative analysis, it offers a growing set of ‘recipes’, short workflows involving a few tools and steps to guide investigators through high utility analysis tasks. PMID:26780094

  1. CGUG: in silico proteome and genome parsing tool for the determination of "core" and unique genes in the analysis of genomes up to ca. 1.9 Mb

    PubMed Central

    Mahadevan, Padmanabhan; King, John F; Seto, Donald

    2009-01-01

    Background Viruses and small-genome bacteria (~2 megabases and smaller) comprise a considerable population in the biosphere and are of interest to many researchers. These genomes are now sequenced at an unprecedented rate and require complementary computational tools to analyze. "CoreGenesUniqueGenes" (CGUG) is an in silico genome data mining tool that determines a "core" set of genes from two to five organisms with genomes in this size range. Core and unique genes may reflect similar niches and needs, and may be used in classifying organisms. Findings CGUG is available at as a web-based on-the-fly tool that performs iterative BLASTP analyses using a reference genome and up to four query genomes to provide a table of genes common to these genomes. The result is an in silico display of genomes and their proteomes, allowing for further analysis. CGUG can be used for "genome annotation by homology", as demonstrated with Chlamydophila and Francisella genomes. Conclusion CGUG is used to reanalyze the ICTV-based classifications of bacteriophages, to reconfirm long-standing relationships and to explore new classifications. These genomes have been problematic in the past, due largely to horizontal gene transfers. CGUG is validated as a tool for reannotating small genome bacteria using more up-to-date annotations by similarity or homology. These serve as an entry point for wet-bench experiments to confirm the functions of these "hypothetical" and "unknown" proteins. PMID:19706165

  2. Open chromatin reveals the functional maize genome

    PubMed Central

    Rodgers-Melnick, Eli; Vera, Daniel L.; Bass, Hank W.

    2016-01-01

    Cellular processes mediated through nuclear DNA must contend with chromatin. Chromatin structural assays can efficiently integrate information across diverse regulatory elements, revealing the functional noncoding genome. In this study, we use a differential nuclease sensitivity assay based on micrococcal nuclease (MNase) digestion to discover open chromatin regions in the maize genome. We find that maize MNase-hypersensitive (MNase HS) regions localize around active genes and within recombination hotspots, focusing biased gene conversion at their flanks. Although MNase HS regions map to less than 1% of the genome, they consistently explain a remarkably large amount (∼40%) of heritable phenotypic variance in diverse complex traits. MNase HS regions are therefore on par with coding sequences as annotations that demarcate the functional parts of the maize genome. These results imply that less than 3% of the maize genome (coding and MNase HS regions) may give rise to the overwhelming majority of phenotypic variation, greatly narrowing the scope of the functional genome. PMID:27185945

  3. The Human Functional Genomics Project: Understanding Generation of Diversity.

    PubMed

    Pappalardo, Jenna L; Hafler, David A

    2016-11-03

    Generation of biologic diversity is a cornerstone of immunity, yet the tools to investigate the causal influence of genetic and environmental factors have been greatly limited. Studies from the Human Functional Genomics Project, presented in Cell and other Cell Press journals, integrate environmental and genetic factors with the direction and magnitude of immune responses to decipher inflammatory disease pathogenesis.

  4. GenomePeek—an online tool for prokaryotic genome and metagenome analysis

    SciTech Connect

    McNair, Katelyn; Edwards, Robert A.

    2015-06-16

    As increases in prokaryotic sequencing take place, a method to quickly and accurately analyze this data is needed. Previous tools are mainly designed for metagenomic analysis and have limitations; such as long runtimes and significant false positive error rates. The online tool GenomePeek (edwards.sdsu.edu/GenomePeek) was developed to analyze both single genome and metagenome sequencing files, quickly and with low error rates. GenomePeek uses a sequence assembly approach where reads to a set of conserved genes are extracted, assembled and then aligned against the highly specific reference database. GenomePeek was found to be faster than traditional approaches while still keeping error rates low, as well as offering unique data visualization options.

  5. GenomePeek—an online tool for prokaryotic genome and metagenome analysis

    DOE PAGES

    McNair, Katelyn; Edwards, Robert A.

    2015-06-16

    As increases in prokaryotic sequencing take place, a method to quickly and accurately analyze this data is needed. Previous tools are mainly designed for metagenomic analysis and have limitations; such as long runtimes and significant false positive error rates. The online tool GenomePeek (edwards.sdsu.edu/GenomePeek) was developed to analyze both single genome and metagenome sequencing files, quickly and with low error rates. GenomePeek uses a sequence assembly approach where reads to a set of conserved genes are extracted, assembled and then aligned against the highly specific reference database. GenomePeek was found to be faster than traditional approaches while still keeping errormore » rates low, as well as offering unique data visualization options.« less

  6. The Functional Genomics Initiative at Oak Ridge National Laboratory

    SciTech Connect

    Johnson, Dabney; Justice, Monica; Beattle, Ken; Buchanan, Michelle; Ramsey, Michael; Ramsey, Rose; Paulus, Michael; Ericson, Nance; Allison, David; Kress, Reid; Mural, Richard; Uberbacher, Ed; Mann, Reinhold

    1997-12-31

    The Functional Genomics Initiative at the Oak Ridge National Laboratory integrates outstanding capabilities in mouse genetics, bioinformatics, and instrumentation. The 50 year investment by the DOE in mouse genetics/mutagenesis has created a one-of-a-kind resource for generating mutations and understanding their biological consequences. It is generally accepted that, through the mouse as a surrogate for human biology, we will come to understand the function of human genes. In addition to this world class program in mammalian genetics, ORNL has also been a world leader in developing bioinformatics tools for the analysis, management and visualization of genomic data. Combining this expertise with new instrumentation technologies will provide a unique capability to understand the consequences of mutations in the mouse at both the organism and molecular levels. The goal of the Functional Genomics Initiative is to develop the technology and methodology necessary to understand gene function on a genomic scale and apply these technologies to megabase regions of the human genome. The effort is scoped so as to create an effective and powerful resource for functional genomics. ORNL is partnering with the Joint Genome Institute and other large scale sequencing centers to sequence several multimegabase regions of both human and mouse genomic DNA, to identify all the genes in these regions, and to conduct fundamental surveys to examine gene function at the molecular and organism level. The Initiative is designed to be a pilot for larger scale deployment in the post-genome era. Technologies will be applied to the examination of gene expression and regulation, metabolism, gene networks, physiology and development.

  7. Complete Genome Sequence of Flavobacteriumpsychrophilum Strain OSU THCO2-90, Used for Functional Genetic Analysis.

    PubMed

    Rochat, Tatiana; Barbier, Paul; Nicolas, Pierre; Loux, Valentin; Pérez-Pascual, David; Guijarro, José A; Bernardet, Jean-François; Duchaud, Eric

    2017-02-23

    We report here the complete annotated genome sequence of Flavobacterium psychrophilum OSU THCO2-90, isolated from Coho salmon (Oncorhynchus kisutch) in Oregon. The genome consists of a circular chromosome with 2,343 predicted open reading frames. This strain has proved to be a valuable tool for functional genomics.

  8. Databases and Web Tools for Cancer Genomics Study

    PubMed Central

    Yang, Yadong; Dong, Xunong; Xie, Bingbing; Ding, Nan; Chen, Juan; Li, Yongjun; Zhang, Qian; Qu, Hongzhu; Fang, Xiangdong

    2015-01-01

    Publicly-accessible resources have promoted the advance of scientific discovery. The era of genomics and big data has brought the need for collaboration and data sharing in order to make effective use of this new knowledge. Here, we describe the web resources for cancer genomics research and rate them on the basis of the diversity of cancer types, sample size, omics data comprehensiveness, and user experience. The resources reviewed include data repository and analysis tools; and we hope such introduction will promote the awareness and facilitate the usage of these resources in the cancer research community. PMID:25707591

  9. BEDTools: the Swiss-army tool for genome feature analysis

    PubMed Central

    Quinlan, Aaron R.

    2014-01-01

    Technological advances have enabled the use of DNA sequencing as a flexible tool to characterize genetic variation and to measure the activity of diverse cellular phenomena such as gene isoform expression and transcription factor binding. Extracting biological insight from the experiments enabled by these advances demands the analysis of large, multi-dimensional datasets. This unit describes the use of the BEDTools toolkit for the exploration of high-throughput genomics datasets. I present several protocols for common genomic analyses and demonstrate how simple BEDTools operations may be combined to create bespoke pipelines addressing complex questions. PMID:25199790

  10. Databases and web tools for cancer genomics study.

    PubMed

    Yang, Yadong; Dong, Xunong; Xie, Bingbing; Ding, Nan; Chen, Juan; Li, Yongjun; Zhang, Qian; Qu, Hongzhu; Fang, Xiangdong

    2015-02-01

    Publicly-accessible resources have promoted the advance of scientific discovery. The era of genomics and big data has brought the need for collaboration and data sharing in order to make effective use of this new knowledge. Here, we describe the web resources for cancer genomics research and rate them on the basis of the diversity of cancer types, sample size, omics data comprehensiveness, and user experience. The resources reviewed include data repository and analysis tools; and we hope such introduction will promote the awareness and facilitate the usage of these resources in the cancer research community.

  11. Virome genomics: a tool for defining the human virome.

    PubMed

    Wylie, Kristine M; Weinstock, George M; Storch, Gregory A

    2013-08-01

    High throughput, deep sequencing assays are powerful tools for gaining insights into virus-host interactions. Sequencing assays can discover novel viruses and describe the genomes of novel and known viruses. Genomic information can predict viral proteins that can be characterized, describe important genes in the host that control infections, and evaluate gene expression of viruses and hosts during infection. Sequencing can also describe variation and evolution of viruses during replication and transmission. This review recounts some of the major advances in the studies of virus-host interactions from the last two years, and discusses the uses of sequencing technologies relating to these studies.

  12. Probing the impact of chromatin conformation on genome editing tools

    PubMed Central

    Chen, Xiaoyu; Rinsma, Marrit; Janssen, Josephine M.; Liu, Jin; Maggio, Ignazio; Gonçalves, Manuel A.F.V.

    2016-01-01

    Transcription activator-like effector nucleases (TALENs) and RNA-guided nucleases derived from clustered, regularly interspaced, short palindromic repeats (CRISPR)-Cas9 systems have become ubiquitous genome editing tools. Despite this, the impact that distinct high-order chromatin conformations have on these sequence-specific designer nucleases is, presently, ill-defined. The same applies to the relative performance of TALENs and CRISPR/Cas9 nucleases at isogenic target sequences subjected to different epigenetic modifications. Here, to address these gaps in our knowledge, we have implemented quantitative cellular systems based on genetic reporters in which the euchromatic and heterochromatic statuses of designer nuclease target sites are stringently controlled by small-molecule drug availability. By using these systems, we demonstrate that TALENs and CRISPR/Cas9 nucleases are both significantly affected by the high-order epigenetic context of their target sequences. In addition, this outcome could also be ascertained for S. pyogenes CRISPR/Cas9 complexes harbouring Cas9 variants whose DNA cleaving specificities are superior to that of the wild-type Cas9 protein. Thus, the herein investigated cellular models will serve as valuable functional readouts for screening and assessing the role of chromatin on designer nucleases based on different platforms or with different architectures or compositions. PMID:27280977

  13. Genome editing strategies: potential tools for eradicating HIV-1/AIDS

    PubMed Central

    Khalili, Kamel; Gordon, Jennifer; Cosentino, Laura; Hu, Wenhui

    2015-01-01

    Current therapy for controlling HIV-1 infection and preventing AIDS progression has profoundly decreased viral replication in cells susceptible to HIV-1 infection, but it does not eliminate the low level of viral replication in latently infected cells which contain integrated copies of HIV-1 proviral DNA. There is an urgent need for the development of HIV-1 genome eradication strategies that will lead to a permanent or “sterile” cure of HIV-1/AIDS. In the past few years, novel nuclease-initiated genome editing tools have been developing rapidly, including ZFNs, TALENs, and the CRISPR/Cas9 system. These surgical knives, which can excise any genome, provide a great opportunity to eradicate the HIV-1 genome by targeting highly conserved regions of the HIV-1 long terminal repeats or essential viral genes. Given the time consuming and costly engineering of target-specific ZFNs and TALENs, the RNA-guided endonuclease Cas9 technology has emerged as a simpler and more versatile technology to allow permanent removal of integrated HIV-1 proviral DNA in eukaryotic cells, and hopefully animal models or human patients. The major unmet challenges of this approach at present include inefficient nuclease gene delivery, potential off-target cleavage, and cell-specific genome targeting. Nanoparticle or lentivirus-mediated delivery of next generation Cas9 technologies including nickase or RNA-guided FokI nuclease (RFN) will further improve the potential for genome editing to become a promising approach for curing HIV-1/AIDS. PMID:25716921

  14. A factor analysis model for functional genomics

    PubMed Central

    Kustra, Rafal; Shioda, Romy; Zhu, Mu

    2006-01-01

    Background Expression array data are used to predict biological functions of uncharacterized genes by comparing their expression profiles to those of characterized genes. While biologically plausible, this is both statistically and computationally challenging. Typical approaches are computationally expensive and ignore correlations among expression profiles and functional categories. Results We propose a factor analysis model (FAM) for functional genomics and give a two-step algorithm, using genome-wide expression data for yeast and a subset of Gene-Ontology Biological Process functional annotations. We show that the predictive performance of our method is comparable to the current best approach while our total computation time was faster by a factor of 4000. We discuss the unique challenges in performance evaluation of algorithms used for genome-wide functions genomics. Finally, we discuss extensions to our method that can incorporate the inherent correlation structure of the functional categories to further improve predictive performance. Conclusion Our factor analysis model is a computationally efficient technique for functional genomics and provides a clear and unified statistical framework with potential for incorporating important gene ontology information to improve predictions. PMID:16630343

  15. Tetrahymena as a Unicellular Model Eukaryote: Genetic and Genomic Tools.

    PubMed

    Ruehle, Marisa D; Orias, Eduardo; Pearson, Chad G

    2016-06-01

    Tetrahymena thermophila is a ciliate model organism whose study has led to important discoveries and insights into both conserved and divergent biological processes. In this review, we describe the tools for the use of Tetrahymena as a model eukaryote, including an overview of its life cycle, orientation to its evolutionary roots, and methodological approaches to forward and reverse genetics. Recent genomic tools have expanded Tetrahymena's utility as a genetic model system. With the unique advantages that Tetrahymena provide, we argue that it will continue to be a model organism of choice.

  16. Recent advances in developing molecular tools for targeted genome engineering of mammalian cells.

    PubMed

    Lim, Kwang-il

    2015-01-01

    Various biological molecules naturally existing in diversified species including fungi, bacteria, and bacteriophage have functionalities for DNA binding and processing. The biological molecules have been recently actively engineered for use in customized genome editing of mammalian cells as the molecule-encoding DNA sequence information and the underlying mechanisms how the molecules work are unveiled. Excitingly, multiple novel methods based on the newly constructed artificial molecular tools have enabled modifications of specific endogenous genetic elements in the genome context at efficiencies that are much higher than that of the conventional homologous recombination based methods. This minireview introduces the most recently spotlighted molecular genome engineering tools with their key features and ongoing modifications for better performance. Such ongoing efforts have mainly focused on the removal of the inherent DNA sequence recognition rigidity from the original molecular platforms, the addition of newly tailored targeting functions into the engineered molecules, and the enhancement of their targeting specificity. Effective targeted genome engineering of mammalian cells will enable not only sophisticated genetic studies in the context of the genome, but also widely-applicable universal therapeutics based on the pinpointing and correction of the disease-causing genetic elements within the genome in the near future.

  17. The CRISPR/Cas Genome-Editing Tool: Application in Improvement of Crops.

    PubMed

    Khatodia, Surender; Bhatotia, Kirti; Passricha, Nishat; Khurana, S M P; Tuteja, Narendra

    2016-01-01

    The Clustered Regularly Interspaced Short Palindromic Repeats associated Cas9/sgRNA system is a novel targeted genome-editing technique derived from bacterial immune system. It is an inexpensive, easy, most user friendly and rapidly adopted genome editing tool transforming to revolutionary paradigm. This technique enables precise genomic modifications in many different organisms and tissues. Cas9 protein is an RNA guided endonuclease utilized for creating targeted double-stranded breaks with only a short RNA sequence to confer recognition of the target in animals and plants. Development of genetically edited (GE) crops similar to those developed by conventional or mutation breeding using this potential technique makes it a promising and extremely versatile tool for providing sustainable productive agriculture for better feeding of rapidly growing population in a changing climate. The emerging areas of research for the genome editing in plants include interrogating gene function, rewiring the regulatory signaling networks and sgRNA library for high-throughput loss-of-function screening. In this review, we have described the broad applicability of the Cas9 nuclease mediated targeted plant genome editing for development of designer crops. The regulatory uncertainty and social acceptance of plant breeding by Cas9 genome editing have also been described. With this powerful and innovative technique the designer GE non-GM plants could further advance climate resilient and sustainable agriculture in the future and maximizing yield by combating abiotic and biotic stresses.

  18. The CRISPR/Cas Genome-Editing Tool: Application in Improvement of Crops

    PubMed Central

    Khatodia, Surender; Bhatotia, Kirti; Passricha, Nishat; Khurana, S. M. P.; Tuteja, Narendra

    2016-01-01

    The Clustered Regularly Interspaced Short Palindromic Repeats associated Cas9/sgRNA system is a novel targeted genome-editing technique derived from bacterial immune system. It is an inexpensive, easy, most user friendly and rapidly adopted genome editing tool transforming to revolutionary paradigm. This technique enables precise genomic modifications in many different organisms and tissues. Cas9 protein is an RNA guided endonuclease utilized for creating targeted double-stranded breaks with only a short RNA sequence to confer recognition of the target in animals and plants. Development of genetically edited (GE) crops similar to those developed by conventional or mutation breeding using this potential technique makes it a promising and extremely versatile tool for providing sustainable productive agriculture for better feeding of rapidly growing population in a changing climate. The emerging areas of research for the genome editing in plants include interrogating gene function, rewiring the regulatory signaling networks and sgRNA library for high-throughput loss-of-function screening. In this review, we have described the broad applicability of the Cas9 nuclease mediated targeted plant genome editing for development of designer crops. The regulatory uncertainty and social acceptance of plant breeding by Cas9 genome editing have also been described. With this powerful and innovative technique the designer GE non-GM plants could further advance climate resilient and sustainable agriculture in the future and maximizing yield by combating abiotic and biotic stresses. PMID:27148329

  19. Spatial Genome Organization and Its Emerging Role as a Potential Diagnosis Tool

    PubMed Central

    Meaburn, Karen J.

    2016-01-01

    In eukaryotic cells the genome is highly spatially organized. Functional relevance of higher order genome organization is implied by the fact that specific genes, and even whole chromosomes, alter spatial position in concert with functional changes within the nucleus, for example with modifications to chromatin or transcription. The exact molecular pathways that regulate spatial genome organization and the full implication to the cell of such an organization remain to be determined. However, there is a growing realization that the spatial organization of the genome can be used as a marker of disease. While global genome organization patterns remain largely conserved in disease, some genes and chromosomes occupy distinct nuclear positions in diseased cells compared to their normal counterparts, with the patterns of reorganization differing between diseases. Importantly, mapping the spatial positioning patterns of specific genomic loci can distinguish cancerous tissue from benign with high accuracy. Genome positioning is an attractive novel biomarker since additional quantitative biomarkers are urgently required in many cancer types. Current diagnostic techniques are often subjective and generally lack the ability to identify aggressive cancer from indolent, which can lead to over- or under-treatment of patients. Proof-of-principle for the use of genome positioning as a diagnostic tool has been provided based on small scale retrospective studies. Future large-scale studies are required to assess the feasibility of bringing spatial genome organization-based diagnostics to the clinical setting and to determine if the positioning patterns of specific loci can be useful biomarkers for cancer prognosis. Since spatial reorganization of the genome has been identified in multiple human diseases, it is likely that spatial genome positioning patterns as a diagnostic biomarker may be applied to many diseases. PMID:27507988

  20. Development of peanut expessed sequence tag-based genomic resources and tools

    Technology Transfer Automated Retrieval System (TEKTRAN)

    U.S. Peanut Genome Initiative (PGI) has widely recognized the need for peanut genome tools and resources development for mitigating peanut allergens and food safety. Genomics such as Expressed Sequence Tag (EST), microarray technologies, and whole genome sequencing provides robotic tools for profili...

  1. GAPIT Version 2: An Enhanced Integrated Tool for Genomic Association and Prediction.

    PubMed

    Tang, You; Liu, Xiaolei; Wang, Jiabo; Li, Meng; Wang, Qishan; Tian, Feng; Su, Zhongbin; Pan, Yuchun; Liu, Di; Lipka, Alexander E; Buckler, Edward S; Zhang, Zhiwu

    2016-07-01

    Most human diseases and agriculturally important traits are complex. Dissecting their genetic architecture requires continued development of innovative and powerful statistical methods. Corresponding advances in computing tools are critical to efficiently use these statistical innovations and to enhance and accelerate biomedical and agricultural research and applications. The genome association and prediction integrated tool (GAPIT) was first released in 2012 and became widely used for genome-wide association studies (GWAS) and genomic prediction. The GAPIT implemented computationally efficient statistical methods, including the compressed mixed linear model (CMLM) and genomic prediction by using genomic best linear unbiased prediction (gBLUP). New state-of-the-art statistical methods have now been implemented in a new, enhanced version of GAPIT. These methods include factored spectrally transformed linear mixed models (FaST-LMM), enriched CMLM (ECMLM), FaST-LMM-Select, and settlement of mixed linear models under progressively exclusive relationship (SUPER). The genomic prediction methods implemented in this new release of the GAPIT include gBLUP based on CMLM, ECMLM, and SUPER. Additionally, the GAPIT was updated to improve its existing output display features and to add new data display and evaluation functions, including new graphing options and capabilities, phenotype simulation, power analysis, and cross-validation. These enhancements make the GAPIT a valuable resource for determining appropriate experimental designs and performing GWAS and genomic prediction. The enhanced R-based GAPIT software package uses state-of-the-art methods to conduct GWAS and genomic prediction. The GAPIT also provides new functions for developing experimental designs and creating publication-ready tabular summaries and graphs to improve the efficiency and application of genomic research.

  2. Genetic and genomic approaches to understanding macrophage identity and function.

    PubMed

    Glass, Christopher K

    2015-04-01

    A major goal of our laboratory is to understand the molecular mechanisms that underlie the development and functions of diverse macrophage phenotypes in health and disease. Recent studies using genetic and genomic approaches suggest a relatively simple model of collaborative and hierarchical interactions between lineage-determining and signal-dependent transcription factors that enable selection and activation of transcriptional enhancers that specify macrophage identity and function. In addition, we have found that it is possible to use natural genetic variation as a powerful tool for advancing our understanding of how the macrophage deciphers the information encoded by the genome to attain specific phenotypes in a context-dependent manner. Here, I will describe our recent efforts to extend genetic and genomic approaches to investigate the roles of distinct tissue environments in determining the phenotypes of different resident populations of macrophages.

  3. Unraveling the 3D genome: genomics tools for multi-scale exploration

    PubMed Central

    Risca, Viviana I.; Greenleaf, William J.

    2015-01-01

    A decade of rapid method development has begun to yield exciting insights into the three-dimensional architecture of the metazoan genome and the roles it may play in regulating transcription. We review here core methods and new tools in the modern genomicist’s toolbox at three length scales, ranging from single base pair to megabase scale chromosomal domains, and discuss the emerging picture of the 3D genome that these tools have revealed. Blind spots remain, especially at intermediate length scales spanning a few nucleosomes, but thanks in part to new technologies that permit targeted alteration of chromatin states and time-resolved studies, the next decade holds great promise for hypothesis-driven research into the mechanisms that drive genome architecture and transcriptional regulation. PMID:25887733

  4. A comparison of tools for the simulation of genomic next-generation sequencing data

    PubMed Central

    Escalona, Merly; Rocha, Sara; Posada, David

    2017-01-01

    Computer simulation of genomic data has become increasingly popular for assessing and validating biological models or to gain understanding about specific datasets. Multiple computational tools for the simulation of next-generation sequencing (NGS) data have been developed in recent years, which could be used to compare existing and new NGS analytical pipelines. Here we review 23 of these tools, highlighting their distinct functionality, requirements and potential applications. We also provide a decision tree for the informed selection of an appropriate NGS simulation tool for the specific question at hand. PMID:27320129

  5. A comparison of tools for the simulation of genomic next-generation sequencing data.

    PubMed

    Escalona, Merly; Rocha, Sara; Posada, David

    2016-08-01

    Computer simulation of genomic data has become increasingly popular for assessing and validating biological models or for gaining an understanding of specific data sets. Several computational tools for the simulation of next-generation sequencing (NGS) data have been developed in recent years, which could be used to compare existing and new NGS analytical pipelines. Here we review 23 of these tools, highlighting their distinct functionality, requirements and potential applications. We also provide a decision tree for the informed selection of an appropriate NGS simulation tool for the specific question at hand.

  6. Tomato functional genomics database (TFGD): a comprehensive collection and analysis package for tomato functional genomics

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Tomato Functional Genomics Database (TFGD; http://ted.bti.cornell.edu) provides a comprehensive systems biology resource to store, mine, analyze, visualize and integrate large-scale tomato functional genomics datasets. The database is expanded from the previously described Tomato Expression Database...

  7. Decoding the genome with an integrative analysis tool: combinatorial CRM Decoder.

    PubMed

    Kang, Keunsoo; Kim, Joomyeong; Chung, Jae Hoon; Lee, Daeyoup

    2011-09-01

    The identification of genome-wide cis-regulatory modules (CRMs) and characterization of their associated epigenetic features are fundamental steps toward the understanding of gene regulatory networks. Although integrative analysis of available genome-wide information can provide new biological insights, the lack of novel methodologies has become a major bottleneck. Here, we present a comprehensive analysis tool called combinatorial CRM decoder (CCD), which utilizes the publicly available information to identify and characterize genome-wide CRMs in a species of interest. CCD first defines a set of the epigenetic features which is significantly associated with a set of known CRMs as a code called 'trace code', and subsequently uses the trace code to pinpoint putative CRMs throughout the genome. Using 61 genome-wide data sets obtained from 17 independent mouse studies, CCD successfully catalogued ∼12 600 CRMs (five distinct classes) including polycomb repressive complex 2 target sites as well as imprinting control regions. Interestingly, we discovered that ∼4% of the identified CRMs belong to at least two different classes named 'multi-functional CRM', suggesting their functional importance for regulating spatiotemporal gene expression. From these examples, we show that CCD can be applied to any potential genome-wide datasets and therefore will shed light on unveiling genome-wide CRMs in various species.

  8. Genome editing strategies: potential tools for eradicating HIV-1/AIDS.

    PubMed

    Khalili, Kamel; Kaminski, Rafal; Gordon, Jennifer; Cosentino, Laura; Hu, Wenhui

    2015-06-01

    Current therapy for controlling human immunodeficiency virus (HIV-1) infection and preventing acquired immunodeficiency syndrome (AIDS) progression has profoundly decreased viral replication in cells susceptible to HIV-1 infection, but it does not eliminate the low level of viral replication in latently infected cells, which contain integrated copies of HIV-1 proviral DNA. There is an urgent need for the development of HIV-1 genome eradication strategies that will lead to a permanent or "sterile" cure of HIV-1/AIDS. In the past few years, novel nuclease-initiated genome editing tools have been developing rapidly, including zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs), and the CRISPR/Cas9 system. These surgical knives, which can excise any genome, provide a great opportunity to eradicate the HIV-1 genome by targeting highly conserved regions of the HIV-1 long terminal repeats or essential viral genes. Given the time consuming and costly engineering of target-specific ZFNs and TALENs, the RNA-guided endonuclease Cas9 technology has emerged as a simpler and more versatile technology to allow permanent removal of integrated HIV-1 proviral DNA in eukaryotic cells, and hopefully animal models or human patients. The major unmet challenges of this approach at present include inefficient nuclease gene delivery, potential off-target cleavage, and cell-specific genome targeting. Nanoparticle or lentivirus-mediated delivery of next generation Cas9 technologies including nickase or RNA-guided FokI nuclease (RFN) will further improve the potential for genome editing to become a promising approach for curing HIV-1/AIDS.

  9. Functional Profiling of Human Fungal Pathogen Genomes

    PubMed Central

    Goranov, Alexi I.; Madhani, Hiten D.

    2015-01-01

    Fungal infections are challenging to diagnose and often difficult to treat, with only a handful of drug classes existing. Understanding the molecular mechanisms by which pathogenic fungi cause human disease is imperative. Here, we discuss how the development and use of genome-scale genetic resources, such as whole-genome knockout collections, can address this unmet need. Using work in Saccharomcyes cerevisiae as a guide, studies of Cryptococcus neoformans and Candida albicans have shown how the challenges of large-scale gene deletion can be overcome, and how such collections can be effectively used to obtain insights into mechanisms of pathogenesis. We conclude that, with concerted efforts, full genome-wide functional analysis of human fungal pathogen genomes is within reach. PMID:25377143

  10. ThermoAlign: a genome-aware primer design tool for tiled amplicon resequencing

    PubMed Central

    Francis, Felix; Dumas, Michael D.; Wisser, Randall J.

    2017-01-01

    Isolating and sequencing specific regions in a genome is a cornerstone of molecular biology. This has been facilitated by computationally encoding the thermodynamics of DNA hybridization for automated design of hybridization and priming oligonucleotides. However, the repetitive composition of genomes challenges the identification of target-specific oligonucleotides, which limits genetics and genomics research on many species. Here, a tool called ThermoAlign was developed that ensures the design of target-specific primer pairs for DNA amplification. This is achieved by evaluating the thermodynamics of hybridization for full-length oligonucleotide-template alignments — thermoalignments — across the genome to identify primers predicted to bind specifically to the target site. For amplification-based resequencing of regions that cannot be amplified by a single primer pair, a directed graph analysis method is used to identify minimum amplicon tiling paths. Laboratory validation by standard and long-range polymerase chain reaction and amplicon resequencing with maize, one of the most repetitive genomes sequenced to date (≈85% repeat content), demonstrated the specificity-by-design functionality of ThermoAlign. ThermoAlign is released under an open source license and bundled in a dependency-free container for wide distribution. It is anticipated that this tool will facilitate multiple applications in genetics and genomics and be useful in the workflow of high-throughput targeted resequencing studies. PMID:28300202

  11. [Genome Editing Tools and their Application in Experimental Ophthalmology].

    PubMed

    Yanik, M; Wende, W; Stieger, K

    2017-01-23

    New genome editing tools in molecular biology are revolutionising precise genome surgery and have greatly influenced experimental ophthalmology too. Aside from the commonly used nuclease-based platforms, such as the zinc-finger nucleases (ZFN) and transcription activator-like effector nucleases (TALEN), CRISPR/Cas systems, clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated (Cas) genes, perform very efficiently in site-specific DNA cleavage within living cells. DNA double strand breaks (DSB) are repaired through two different conserved repair pathways: NHEJ (non-homologous end joining) and HDR (homology directed repair). By using the correct DNA templates, these repair pathways can be used to knock out defective genes or to repair mutations. Genome editing technology lays the ground for new strategies in basic science, biotechnology, and biomedical science, as well as clinical studies with genome editing. Therapeutic gene editing strategies are now concentrating on diseases in the retina, due to the comparatively easy accessibility of the eye and with local application in vivo.

  12. Molecular Tools for Exploring Polyploid Genomes in Plants

    PubMed Central

    Aversano, Riccardo; Ercolano, Maria Raffaella; Caruso, Immacolata; Fasano, Carlo; Rosellini, Daniele; Carputo, Domenico

    2012-01-01

    Polyploidy is a very common phenomenon in the plant kingdom, where even diploid species are often described as paleopolyploids. The polyploid condition may bring about several advantages compared to the diploid state. Polyploids often show phenotypes that are not present in their diploid progenitors or exceed the range of the contributing species. Some of these traits may play a role in heterosis or could favor adaptation to new ecological niches. Advances in genomics and sequencing technology may create unprecedented opportunities for discovering and monitoring the molecular effects of polyploidization. Through this review, we provide an overview of technologies and strategies that may allow an in-depth analysis of polyploid genomes. After introducing some basic aspects on the origin and genetics of polyploids, we highlight the main tools available for genome and gene expression analysis and summarize major findings. In the last part of this review, the implications of next generation sequencing are briefly discussed. The accumulation of knowledge on polyploid formation, maintenance, and divergence at whole-genome and subgenome levels will not only help plant biologists to understand how plants have evolved and diversified, but also assist plant breeders in designing new strategies for crop improvement. PMID:22949863

  13. Genomic Tools in Pea Breeding Programs: Status and Perspectives

    PubMed Central

    Tayeh, Nadim; Aubert, Grégoire; Pilet-Nayel, Marie-Laure; Lejeune-Hénaut, Isabelle; Warkentin, Thomas D.; Burstin, Judith

    2015-01-01

    Pea (Pisum sativum L.) is an annual cool-season legume and one of the oldest domesticated crops. Dry pea seeds contain 22–25% protein, complex starch and fiber constituents, and a rich array of vitamins, minerals, and phytochemicals which make them a valuable source for human consumption and livestock feed. Dry pea ranks third to common bean and chickpea as the most widely grown pulse in the world with more than 11 million tons produced in 2013. Pea breeding has achieved great success since the time of Mendel's experiments in the mid-1800s. However, several traits still require significant improvement for better yield stability in a larger growing area. Key breeding objectives in pea include improving biotic and abiotic stress resistance and enhancing yield components and seed quality. Taking advantage of the diversity present in the pea genepool, many mapping populations have been constructed in the last decades and efforts have been deployed to identify loci involved in the control of target traits and further introgress them into elite breeding materials. Pea now benefits from next-generation sequencing and high-throughput genotyping technologies that are paving the way for genome-wide association studies and genomic selection approaches. This review covers the significant development and deployment of genomic tools for pea breeding in recent years. Future prospects are discussed especially in light of current progress toward deciphering the pea genome. PMID:26640470

  14. An evolutionary classification of genomic function.

    PubMed

    Graur, Dan; Zheng, Yichen; Azevedo, Ricardo B R

    2015-01-28

    The pronouncements of the ENCODE Project Consortium regarding "junk DNA" exposed the need for an evolutionary classification of genomic elements according to their selected-effect function. In the classification scheme presented here, we divide the genome into "functional DNA," that is, DNA sequences that have a selected-effect function, and "rubbish DNA," that is, sequences that do not. Functional DNA is further subdivided into "literal DNA" and "indifferent DNA." In literal DNA, the order of nucleotides is under selection; in indifferent DNA, only the presence or absence of the sequence is under selection. Rubbish DNA is further subdivided into "junk DNA" and "garbage DNA." Junk DNA neither contributes to nor detracts from the fitness of the organism and, hence, evolves under selective neutrality. Garbage DNA, on the other hand, decreases the fitness of its carriers. Garbage DNA exists in the genome only because natural selection is neither omnipotent nor instantaneous. Each of these four functional categories can be 1) transcribed and translated, 2) transcribed but not translated, or 3) not transcribed. The affiliation of a DNA segment to a particular functional category may change during evolution: Functional DNA may become junk DNA, junk DNA may become garbage DNA, rubbish DNA may become functional DNA, and so on; however, determining the functionality or nonfunctionality of a genomic sequence must be based on its present status rather than on its potential to change (or not to change) in the future. Changes in functional affiliation are divided into pseudogenes, Lazarus DNA, zombie DNA, and Jekyll-to-Hyde DNA.

  15. An Evolutionary Classification of Genomic Function

    PubMed Central

    Graur, Dan; Zheng, Yichen; Azevedo, Ricardo B.R.

    2015-01-01

    The pronouncements of the ENCODE Project Consortium regarding “junk DNA” exposed the need for an evolutionary classification of genomic elements according to their selected-effect function. In the classification scheme presented here, we divide the genome into “functional DNA,” that is, DNA sequences that have a selected-effect function, and “rubbish DNA,” that is, sequences that do not. Functional DNA is further subdivided into “literal DNA” and “indifferent DNA.” In literal DNA, the order of nucleotides is under selection; in indifferent DNA, only the presence or absence of the sequence is under selection. Rubbish DNA is further subdivided into “junk DNA” and “garbage DNA.” Junk DNA neither contributes to nor detracts from the fitness of the organism and, hence, evolves under selective neutrality. Garbage DNA, on the other hand, decreases the fitness of its carriers. Garbage DNA exists in the genome only because natural selection is neither omnipotent nor instantaneous. Each of these four functional categories can be 1) transcribed and translated, 2) transcribed but not translated, or 3) not transcribed. The affiliation of a DNA segment to a particular functional category may change during evolution: Functional DNA may become junk DNA, junk DNA may become garbage DNA, rubbish DNA may become functional DNA, and so on; however, determining the functionality or nonfunctionality of a genomic sequence must be based on its present status rather than on its potential to change (or not to change) in the future. Changes in functional affiliation are divided into pseudogenes, Lazarus DNA, zombie DNA, and Jekyll-to-Hyde DNA. PMID:25635041

  16. Open chromatin reveals the functional maize genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Every cellular process mediated through nuclear DNA must contend with chromatin. As results from ENCODE show, open chromatin assays can efficiently integrate across diverse regulatory elements, revealing functional non-coding genome. In this study, we use a MNase hypersensitivity assay to discover o...

  17. Sugarcane genome sequencing by methylation filtration provides tools for genomic research in the genus Saccharum.

    PubMed

    Grativol, Clícia; Regulski, Michael; Bertalan, Marcelo; McCombie, W Richard; da Silva, Felipe Rodrigues; Zerlotini Neto, Adhemar; Vicentini, Renato; Farinelli, Laurent; Hemerly, Adriana Silva; Martienssen, Robert A; Ferreira, Paulo Cavalcanti Gomes

    2014-07-01

    Many economically important crops have large and complex genomes that hamper their sequencing by standard methods such as whole genome shotgun (WGS). Large tracts of methylated repeats occur in plant genomes that are interspersed by hypomethylated gene-rich regions. Gene-enrichment strategies based on methylation profiles offer an alternative to sequencing repetitive genomes. Here, we have applied methyl filtration with McrBC endonuclease digestion to enrich for euchromatic regions in the sugarcane genome. To verify the efficiency of methylation filtration and the assembly quality of sequences submitted to gene-enrichment strategy, we have compared assemblies using methyl-filtered (MF) and unfiltered (UF) libraries. The use of methy filtration allowed a better assembly by filtering out 35% of the sugarcane genome and by producing 1.5× more scaffolds and 1.7× more assembled Mb in length compared with unfiltered dataset. The coverage of sorghum coding sequences (CDS) by MF scaffolds was at least 36% higher than by the use of UF scaffolds. Using MF technology, we increased by 134× the coverage of gene regions of the monoploid sugarcane genome. The MF reads assembled into scaffolds that covered all genes of the sugarcane bacterial artificial chromosomes (BACs), 97.2% of sugarcane expressed sequence tags (ESTs), 92.7% of sugarcane RNA-seq reads and 98.4% of sorghum protein sequences. Analysis of MF scaffolds from encoded enzymes of the sucrose/starch pathway discovered 291 single-nucleotide polymorphisms (SNPs) in the wild sugarcane species, S. spontaneum and S. officinarum. A large number of microRNA genes was also identified in the MF scaffolds. The information achieved by the MF dataset provides a valuable tool for genomic research in the genus Saccharum and for improvement of sugarcane as a biofuel crop.

  18. Sugarcane genome sequencing by methylation filtration provides tools for genomic research in the genus Saccharum

    PubMed Central

    Grativol, Clícia; Regulski, Michael; Bertalan, Marcelo; McCombie, W. Richard; da Silva, Felipe Rodrigues; Neto, Adhemar Zerlotini; Vicentini, Renato; Farinelli, Laurent; Hemerly, Adriana Silva; Martienssen, Robert A.; Ferreira, Paulo Cavalcanti Gomes

    2015-01-01

    SUMMARY Many economically important crops have large and complex genomes, which hampers sequencing of their genome by standard methods such as WGS. Large tracts of methylated repeats occur at plant genomes interspersed by hypomethylated gene-rich regions. Gene enrichment strategies based on methylation profile offer an alternative to sequencing repetitive genomes. Here, we have applied methyl filtration (MF) with McrBC digestion to enrich for euchromatic regions of sugarcane genome. To verify the efficiency of MF and the assembly quality of sequences submitted to gene-enrichment strategy, we have compared assemblies using MF and unfiltered (UF) libraries. The MF allowed the achievement of a better assembly by filtering out 35% of the sugarcane genome and by producing 1.5 times more scaffolds and 1.7 times more assembled Mb compared to unfiltered scaffolds. The coverage of sorghum CDS by MF scaffolds was at least 36% higher than by UF scaffolds. Using MF technology, we increased by 134X the coverage of genic regions of the monoploid sugarcane genome. The MF reads assembled into scaffolds covering all genes at sugarcane BACs, 97.2% of sugarcane ESTs, 92.7% of sugarcane RNA-seq reads and 98.4% of sorghum protein sequences. Analysis of MF scaffolds encoding enzymes of the sucrose/starch pathway discovered 291 SNPs in the wild sugarcane species, S. spontaneum and S. officinarum. A large number of microRNA genes were also identified in the MF scaffolds. The information achieved by the MF dataset provides a valuable tool for genomic research in the genus Saccharum and improvement of sugarcane as a biofuel crop. PMID:24773339

  19. Adapting CRISPR/Cas9 for functional genomics screens.

    PubMed

    Malina, Abba; Katigbak, Alexandra; Cencic, Regina; Maïga, Rayelle Itoua; Robert, Francis; Miura, Hisashi; Pelletier, Jerry

    2014-01-01

    The use of CRISPR/Cas9 (clustered regularly interspaced short palindromic repeats/CRISPR-associated protein) for targeted genome editing has been widely adopted and is considered a "game changing" technology. The ease and rapidity by which this approach can be used to modify endogenous loci in a wide spectrum of cell types and organisms makes it a powerful tool for customizable genetic modifications as well as for large-scale functional genomics. The development of retrovirus-based expression platforms to simultaneously deliver the Cas9 nuclease and single guide (sg) RNAs provides unique opportunities by which to ensure stable and reproducible expression of the editing tools and a broad cell targeting spectrum, while remaining compatible with in vivo genetic screens. Here, we describe methods and highlight considerations for designing and generating sgRNA libraries in all-in-one retroviral vectors for such applications.

  20. Resources for Functional Genomics Studies in Drosophila melanogaster

    PubMed Central

    Mohr, Stephanie E.; Hu, Yanhui; Kim, Kevin; Housden, Benjamin E.; Perrimon, Norbert

    2014-01-01

    Drosophila melanogaster has become a system of choice for functional genomic studies. Many resources, including online databases and software tools, are now available to support design or identification of relevant fly stocks and reagents or analysis and mining of existing functional genomic, transcriptomic, proteomic, etc. datasets. These include large community collections of fly stocks and plasmid clones, “meta” information sites like FlyBase and FlyMine, and an increasing number of more specialized reagents, databases, and online tools. Here, we introduce key resources useful to plan large-scale functional genomics studies in Drosophila and to analyze, integrate, and mine the results of those studies in ways that facilitate identification of highest-confidence results and generation of new hypotheses. We also discuss ways in which existing resources can be used and might be improved and suggest a few areas of future development that would further support large- and small-scale studies in Drosophila and facilitate use of Drosophila information by the research community more generally. PMID:24653003

  1. CrossMap: a versatile tool for coordinate conversion between genome assemblies

    PubMed Central

    Zhao, Hao; Sun, Zhifu; Wang, Jing; Huang, Haojie; Kocher, Jean-Pierre; Wang, Liguo

    2014-01-01

    Motivation: Reference genome assemblies are subject to change and refinement from time to time. Generally, researchers need to convert the results that have been analyzed according to old assemblies to newer versions, or vice versa, to facilitate meta-analysis, direct comparison, data integration and visualization. Several useful conversion tools can convert genome interval files in browser extensible data or general feature format, but none have the functionality to convert files in sequence alignment map or BigWig format. This is a significant gap in computational genomics tools, as these formats are the ones most widely used for representing high-throughput sequencing data, such as RNA-seq, chromatin immunoprecipitation sequencing, DNA-seq, etc. Results: Here we developed CrossMap, a versatile and efficient tool for converting genome coordinates between assemblies. CrossMap supports most of the commonly used file formats, including BAM, sequence alignment map, Wiggle, BigWig, browser extensible data, general feature format, gene transfer format and variant call format. Availability and implementation: CrossMap is written in Python and C. Source code and a comprehensive user’s manual are freely available at: http://crossmap.sourceforge.net/. Contact: Kocher.JeanPierre@mayo.edu or wang.liguo@mayo.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:24351709

  2. Structure-based inference of molecular functions of proteins of unknown function from Berkeley Structural Genomics Center

    SciTech Connect

    Kim, Sung-Hou; Shin, Dong Hae; Hou, Jingtong; Chandonia, John-Marc; Das, Debanu; Choi, In-Geol; Kim, Rosalind; Kim, Sung-Hou

    2007-09-02

    Advances in sequence genomics have resulted in an accumulation of a huge number of protein sequences derived from genome sequences. However, the functions of a large portion of them cannot be inferred based on the current methods of sequence homology detection to proteins of known functions. Three-dimensional structure can have an important impact in providing inference of molecular function (physical and chemical function) of a protein of unknown function. Structural genomics centers worldwide have been determining many 3-D structures of the proteins of unknown functions, and possible molecular functions of them have been inferred based on their structures. Combined with bioinformatics and enzymatic assay tools, the successful acceleration of the process of protein structure determination through high throughput pipelines enables the rapid functional annotation of a large fraction of hypothetical proteins. We present a brief summary of the process we used at the Berkeley Structural Genomics Center to infer molecular functions of proteins of unknown function.

  3. Structure-based inference of molecular functions of proteins of unknown function from Berkeley Structural Genomics Center.

    PubMed

    Shin, Dong Hae; Hou, Jingtong; Chandonia, John-Marc; Das, Debanu; Choi, In-Geol; Kim, Rosalind; Kim, Sung-Hou

    2007-09-01

    Advances in sequence genomics have resulted in an accumulation of a huge number of protein sequences derived from genome sequences. However, the functions of a large portion of them cannot be inferred based on the current methods of sequence homology detection to proteins of known functions. Three-dimensional structure can have an important impact in providing inference of molecular function (physical and chemical function) of a protein of unknown function. Structural genomics centers worldwide have been determining many 3-D structures of the proteins of unknown functions, and possible molecular functions of them have been inferred based on their structures. Combined with bioinformatics and enzymatic assay tools, the successful acceleration of the process of protein structure determination through high throughput pipelines enables the rapid functional annotation of a large fraction of hypothetical proteins. We present a brief summary of the process we used at the Berkeley Structural Genomics Center to infer molecular functions of proteins of unknown function.

  4. Coral Reef Genomics: Developing tools for functional genomics ofcoral symbiosis

    SciTech Connect

    Schwarz, Jodi; Brokstein, Peter; Manohar, Chitra; Coffroth, MaryAlice; Szmant, Alina; Medina, Monica

    2005-03-01

    Symbioses between cnidarians and dinoflagellates in the genus Symbiodinium are widespread in the marine environment. The importance of this symbiosis to reef-building corals and reef nutrient and carbon cycles is well documented, but little is known about the mechanisms by which the partners establish and regulate the symbiosis. Because the dinoflagellate symbionts live inside the cells of their host coral, the interactions between the partners occur on cellular and molecular levels, as each partner alters the expression of genes and proteins to facilitate the partnership. These interactions can examined using high-throughput techniques that allow thousands of genes to be examined simultaneously. We are developing the groundwork so that we can use DNA microarray profiling to identify genes involved in the Montastraea faveolata and Acropora palmata symbioses. Here we report results from the initial steps in this microarray initiative, that is, the construction of cDNA libraries from 4 of 16 target stages, sequencing of 3450 cDNA clones to generate Expressed Sequenced Tags (ESTs), and annotation of the ESTs to identify candidate genes to include in the microarrays. An understanding of how the coral-dinoflagellate symbiosis is regulated will have implications for atmospheric and ocean sciences, conservation biology, the study and diagnosis of coral bleaching and disease, and comparative studies of animal-protest interactions.

  5. Emerging Imaging and Genomic Tools for Developmental Systems Biology.

    PubMed

    Liu, Zhe; Keller, Philipp J

    2016-03-21

    Animal development is a complex and dynamic process orchestrated by exquisitely timed cell lineage commitment, divisions, migration, and morphological changes at the single-cell level. In the past decade, extensive genetic, stem cell, and genomic studies provided crucial insights into molecular underpinnings and the functional importance of genetic pathways governing various cellular differentiation processes. However, it is still largely unknown how the precise coordination of these pathways is achieved at the whole-organism level and how the highly regulated spatiotemporal choreography of development is established in turn. Here, we discuss the latest technological advances in imaging and single-cell genomics that hold great promise for advancing our understanding of this intricate process. We propose an integrated approach that combines such methods to quantitatively decipher in vivo cellular dynamic behaviors and their underlying molecular mechanisms at the systems level with single-cell, single-molecule resolution.

  6. Groundnut improvement: use of genetic and genomic tools

    PubMed Central

    Janila, Pasupuleti; Nigam, S. N.; Pandey, Manish K.; Nagesh, P.; Varshney, Rajeev K.

    2013-01-01

    Groundnut (Arachis hypogaea L.), a self-pollinated legume is an important crop cultivated in 24 million ha world over for extraction of edible oil and food uses. The kernels are rich in oil (48–50%) and protein (25–28%), and are source of several vitamins, minerals, antioxidants, biologically active polyphenols, flavonoids, and isoflavones. Improved varieties of groundnut with high yield potential were developed and released for cultivation world over. The improved varieties belong to different maturity durations and possess resistance to diseases, tolerance to drought, enhanced oil content, and improved quality traits for food uses. Conventional breeding procedures along with the tools for phenotyping were largely used in groundnut improvement programs. Mutations were used to induce variability and wide hybridization was attempted to tap variability from wild species. Low genetic variability has been a bottleneck for groundnut improvement. The vast potential of wild species, reservoir of new alleles remains under-utilized. Development of linkage maps of groundnut during the last decade was followed by identification of markers and quantitative trait loci for the target traits. Consequently, the last decade has witnessed the deployment of molecular breeding approaches to complement the ongoing groundnut improvement programs in USA, China, India, and Japan. The other potential advantages of molecular breeding are the feasibility to target multiple traits for improvement and provide tools to tap new alleles from wild species. The first groundnut variety developed through marker-assisted back-crossing is a root-knot nematode-resistant variety, NemaTAM in USA. The uptake of molecular breeding approaches in groundnut improvement programs by NARS partners in India and many African countries is slow or needs to be initiated in part due to inadequate infrastructure, high genotyping costs, and human capacities. Availability of draft genome sequence for diploid (AA and

  7. Functional genomics approaches in parasitic helminths.

    PubMed

    Hagen, J; Lee, E F; Fairlie, W D; Kalinna, B H

    2012-01-01

    As research on parasitic helminths is moving into the post-genomic era, an enormous effort is directed towards deciphering gene function and to achieve gene annotation. The sequences that are available in public databases undoubtedly hold information that can be utilized for new interventions and control but the exploitation of these resources has until recently remained difficult. Only now, with the emergence of methods to genetically manipulate and transform parasitic worms will it be possible to gain a comprehensive understanding of the molecular mechanisms involved in nutrition, metabolism, developmental switches/maturation and interaction with the host immune system. This review focuses on functional genomics approaches in parasitic helminths that are currently used, to highlight potential applications of these technologies in the areas of cell biology, systems biology and immunobiology of parasitic helminths.

  8. AutoFACT: An Automatic Functional Annotation and Classification Tool

    PubMed Central

    Koski, Liisa B; Gray, Michael W; Lang, B Franz; Burger, Gertraud

    2005-01-01

    Background Assignment of function to new molecular sequence data is an essential step in genomics projects. The usual process involves similarity searches of a given sequence against one or more databases, an arduous process for large datasets. Results We present AutoFACT, a fully automated and customizable annotation tool that assigns biologically informative functions to a sequence. Key features of this tool are that it (1) analyzes nucleotide and protein sequence data; (2) determines the most informative functional description by combining multiple BLAST reports from several user-selected databases; (3) assigns putative metabolic pathways, functional classes, enzyme classes, GeneOntology terms and locus names; and (4) generates output in HTML, text and GFF formats for the user's convenience. We have compared AutoFACT to four well-established annotation pipelines. The error rate of functional annotation is estimated to be only between 1–2%. Comparison of AutoFACT to the traditional top-BLAST-hit annotation method shows that our procedure increases the number of functionally informative annotations by approximately 50%. Conclusion AutoFACT will serve as a useful annotation tool for smaller sequencing groups lacking dedicated bioinformatics staff. It is implemented in PERL and runs on LINUX/UNIX platforms. AutoFACT is available at . PMID:15960857

  9. A New System for Comparative Functional Genomics of Saccharomyces Yeasts

    PubMed Central

    Caudy, Amy A.; Guan, Yuanfang; Jia, Yue; Hansen, Christina; DeSevo, Chris; Hayes, Alicia P.; Agee, Joy; Alvarez-Dominguez, Juan R.; Arellano, Hugo; Barrett, Daniel; Bauerle, Cynthia; Bisaria, Namita; Bradley, Patrick H.; Breunig, J. Scott; Bush, Erin; Cappel, David; Capra, Emily; Chen, Walter; Clore, John; Combs, Peter A.; Doucette, Christopher; Demuren, Olukunle; Fellowes, Peter; Freeman, Sam; Frenkel, Evgeni; Gadala-Maria, Daniel; Gawande, Richa; Glass, David; Grossberg, Samuel; Gupta, Anita; Hammonds-Odie, Latanya; Hoisos, Aaron; Hsi, Jenny; Hsu, Yu-Han Huang; Inukai, Sachi; Karczewski, Konrad J.; Ke, Xiaobo; Kojima, Mina; Leachman, Samuel; Lieber, Danny; Liebowitz, Anna; Liu, Julia; Liu, Yufei; Martin, Trevor; Mena, Jose; Mendoza, Rosa; Myhrvold, Cameron; Millian, Christian; Pfau, Sarah; Raj, Sandeep; Rich, Matt; Rokicki, Joe; Rounds, William; Salazar, Michael; Salesi, Matthew; Sharma, Rajani; Silverman, Sanford; Singer, Cara; Sinha, Sandhya; Staller, Max; Stern, Philip; Tang, Hanlin; Weeks, Sharon; Weidmann, Maxwell; Wolf, Ashley; Young, Carmen; Yuan, Jie; Crutchfield, Christopher; McClean, Megan; Murphy, Coleen T.; Llinás, Manuel; Botstein, David; Troyanskaya, Olga G.; Dunham, Maitreya J.

    2013-01-01

    Whole-genome sequencing, particularly in fungi, has progressed at a tremendous rate. More difficult, however, is experimental testing of the inferences about gene function that can be drawn from comparative sequence analysis alone. We present a genome-wide functional characterization of a sequenced but experimentally understudied budding yeast, Saccharomyces bayanus var. uvarum (henceforth referred to as S. bayanus), allowing us to map changes over the 20 million years that separate this organism from S. cerevisiae. We first created a suite of genetic tools to facilitate work in S. bayanus. Next, we measured the gene-expression response of S. bayanus to a diverse set of perturbations optimized using a computational approach to cover a diverse array of functionally relevant biological responses. The resulting data set reveals that gene-expression patterns are largely conserved, but significant changes may exist in regulatory networks such as carbohydrate utilization and meiosis. In addition to regulatory changes, our approach identified gene functions that have diverged. The functions of genes in core pathways are highly conserved, but we observed many changes in which genes are involved in osmotic stress, peroxisome biogenesis, and autophagy. A surprising number of genes specific to S. bayanus respond to oxidative stress, suggesting the organism may have evolved under different selection pressures than S. cerevisiae. This work expands the scope of genome-scale evolutionary studies from sequence-based analysis to rapid experimental characterization and could be adopted for functional mapping in any lineage of interest. Furthermore, our detailed characterization of S. bayanus provides a valuable resource for comparative functional genomics studies in yeast. PMID:23852385

  10. A new system for comparative functional genomics of Saccharomyces yeasts.

    PubMed

    Caudy, Amy A; Guan, Yuanfang; Jia, Yue; Hansen, Christina; DeSevo, Chris; Hayes, Alicia P; Agee, Joy; Alvarez-Dominguez, Juan R; Arellano, Hugo; Barrett, Daniel; Bauerle, Cynthia; Bisaria, Namita; Bradley, Patrick H; Breunig, J Scott; Bush, Erin; Cappel, David; Capra, Emily; Chen, Walter; Clore, John; Combs, Peter A; Doucette, Christopher; Demuren, Olukunle; Fellowes, Peter; Freeman, Sam; Frenkel, Evgeni; Gadala-Maria, Daniel; Gawande, Richa; Glass, David; Grossberg, Samuel; Gupta, Anita; Hammonds-Odie, Latanya; Hoisos, Aaron; Hsi, Jenny; Hsu, Yu-Han Huang; Inukai, Sachi; Karczewski, Konrad J; Ke, Xiaobo; Kojima, Mina; Leachman, Samuel; Lieber, Danny; Liebowitz, Anna; Liu, Julia; Liu, Yufei; Martin, Trevor; Mena, Jose; Mendoza, Rosa; Myhrvold, Cameron; Millian, Christian; Pfau, Sarah; Raj, Sandeep; Rich, Matt; Rokicki, Joe; Rounds, William; Salazar, Michael; Salesi, Matthew; Sharma, Rajani; Silverman, Sanford; Singer, Cara; Sinha, Sandhya; Staller, Max; Stern, Philip; Tang, Hanlin; Weeks, Sharon; Weidmann, Maxwell; Wolf, Ashley; Young, Carmen; Yuan, Jie; Crutchfield, Christopher; McClean, Megan; Murphy, Coleen T; Llinás, Manuel; Botstein, David; Troyanskaya, Olga G; Dunham, Maitreya J

    2013-09-01

    Whole-genome sequencing, particularly in fungi, has progressed at a tremendous rate. More difficult, however, is experimental testing of the inferences about gene function that can be drawn from comparative sequence analysis alone. We present a genome-wide functional characterization of a sequenced but experimentally understudied budding yeast, Saccharomyces bayanus var. uvarum (henceforth referred to as S. bayanus), allowing us to map changes over the 20 million years that separate this organism from S. cerevisiae. We first created a suite of genetic tools to facilitate work in S. bayanus. Next, we measured the gene-expression response of S. bayanus to a diverse set of perturbations optimized using a computational approach to cover a diverse array of functionally relevant biological responses. The resulting data set reveals that gene-expression patterns are largely conserved, but significant changes may exist in regulatory networks such as carbohydrate utilization and meiosis. In addition to regulatory changes, our approach identified gene functions that have diverged. The functions of genes in core pathways are highly conserved, but we observed many changes in which genes are involved in osmotic stress, peroxisome biogenesis, and autophagy. A surprising number of genes specific to S. bayanus respond to oxidative stress, suggesting the organism may have evolved under different selection pressures than S. cerevisiae. This work expands the scope of genome-scale evolutionary studies from sequence-based analysis to rapid experimental characterization and could be adopted for functional mapping in any lineage of interest. Furthermore, our detailed characterization of S. bayanus provides a valuable resource for comparative functional genomics studies in yeast.

  11. Chemogenetic tools to interrogate brain functions.

    PubMed

    Sternson, Scott M; Roth, Bryan L

    2014-01-01

    Elucidating the roles of neuronal cell types for physiology and behavior is essential for understanding brain functions. Perturbation of neuron electrical activity can be used to probe the causal relationship between neuronal cell types and behavior. New genetically encoded neuron perturbation tools have been developed for remotely controlling neuron function using small molecules that activate engineered receptors that can be targeted to cell types using genetic methods. Here we describe recent progress for approaches using genetically engineered receptors that selectively interact with small molecules. Called "chemogenetics," receptors with diverse cellular functions have been developed that facilitate the selective pharmacological control over a diverse range of cell-signaling processes, including electrical activity, for molecularly defined cell types. These tools have revealed remarkably specific behavioral physiological influences for molecularly defined cell types that are often intermingled with populations having different or even opposite functions.

  12. ScreenBEAM: a novel meta-analysis algorithm for functional genomics screens via Bayesian hierarchical modeling | Office of Cancer Genomics

    Cancer.gov

    Functional genomics (FG) screens, using RNAi or CRISPR technology, have become a standard tool for systematic, genome-wide loss-of-function studies for therapeutic target discovery. As in many large-scale assays, however, off-target effects, variable reagents' potency and experimental noise must be accounted for appropriately control for false positives.

  13. Applying functional genomics research to the study of pig reproduction.

    PubMed

    Pomp, D; Caetano, A R; Bertani, G R; Gladney, C D; Johnson, R K

    2001-01-01

    Functional genomics is an experimental approach that incorporates genome-wide or system-wide experimentation, expanding the scope of biological investigation from studying single genes to studying potentially all genes at once in a systematic manner. This technology is highly appealing because of its high throughput and relatively low cost. Furthermore, analysis of gene expression using microarrays is likely to be more biologically relevant than the conventional paradigm of reductionism, because it has the potential to uncover new biological connections between genes and biochemical pathways. However, functional genomics is still in its infancy, especially with regard to the study of pig reproduction. Currently, efforts are centred on developing the necessary resources to enable high throughput evaluation and comparison of gene expression. However, it is clear that in the near future functional genomics will be applied on a large scale to study the biology and physiology of reproduction in pigs, and to understand better the complex nature of genetic control over polygenic characteristics, such as ovulation rate and litter size. We can look forward to generating a significant amount of new data on differences in gene expression between genotypes, treatments, or at various temporal and spatial coordinates within a variety of reproductively relevant systems. Along with this capability will be the challenge of collating, analysing and interpreting datasets that are orders of magnitude more extensive and complex than those currently used. Furthermore, integration of functional genomics with traditional genetic approaches and with detailed analysis of the proteome and relevant whole animal phenotypes will be required to make full use of this powerful new experimental paradigm as a beneficial research tool.

  14. Functional cell-based uHTS in chemical genomic drug discovery.

    PubMed

    Croston, Glenn E

    2002-03-01

    The availability of genomic information significantly increases the number of potential targets available for drug discovery, although the function of many targets and their relationship to disease is unknown. In a chemical genomic research approach, ultra-high throughput screening (uHTS) of genomic targets takes place early in the drug discovery process, before target validation. Target-selective modulators then provide drug leads and pharmacological research tools to validate target function. Effective implementation of a chemical genomic strategy requires assays that can perform uHTS for large numbers of genomic targets. Cell-based functional assays are capable of the uHTS throughput required for chemical genomic research, and their functional nature provides distinct advantages over ligand-binding assays in the identification of target-selective modulators.

  15. The Plant Ontology: A Tool for Plant Genomics.

    PubMed

    Cooper, Laurel; Jaiswal, Pankaj

    2016-01-01

    The use of controlled, structured vocabularies (ontologies) has become a critical tool for scientists in the post-genomic era of massive datasets. Adoption and integration of common vocabularies and annotation practices enables cross-species comparative analyses and increases data sharing and reusability. The Plant Ontology (PO; http://www.plantontology.org/ ) describes plant anatomy, morphology, and the stages of plant development, and offers a database of plant genomics annotations associated to the PO terms. The scope of the PO has grown from its original design covering only rice, maize, and Arabidopsis, and now includes terms to describe all green plants from angiosperms to green algae.This chapter introduces how the PO and other related ontologies are constructed and organized, including languages and software used for ontology development, and provides an overview of the key features. Detailed instructions illustrate how to search and browse the PO database and access the associated annotation data. Users are encouraged to provide input on the ontology through the online term request form and contribute datasets for integration in the PO database.

  16. Orchidstra: an integrated orchid functional genomics database.

    PubMed

    Su, Chun-lin; Chao, Ya-Ting; Yen, Shao-Hua; Chen, Chun-Yi; Chen, Wan-Chieh; Chang, Yao-Chien Alex; Shih, Ming-Che

    2013-02-01

    A specialized orchid database, named Orchidstra (URL: http://orchidstra.abrc.sinica.edu.tw), has been constructed to collect, annotate and share genomic information for orchid functional genomics studies. The Orchidaceae is a large family of Angiosperms that exhibits extraordinary biodiversity in terms of both the number of species and their distribution worldwide. Orchids exhibit many unique biological features; however, investigation of these traits is currently constrained due to the limited availability of genomic information. Transcriptome information for five orchid species and one commercial hybrid has been included in the Orchidstra database. Altogether, these comprise >380,000 non-redundant orchid transcript sequences, of which >110,000 are protein-coding genes. Sequences from the transcriptome shotgun assembly (TSA) were obtained either from output reads from next-generation sequencing technologies assembled into contigs, or from conventional cDNA library approaches. An annotation pipeline using Gene Ontology, KEGG and Pfam was built to assign gene descriptions and functional annotation to protein-coding genes. Deep sequencing of small RNA was also performed for Phalaenopsis aphrodite to search for microRNAs (miRNAs), extending the information archived for this species to miRNA annotation, precursors and putative target genes. The P. aphrodite transcriptome information was further used to design probes for an oligonucleotide microarray, and expression profiling analysis was carried out. The intensities of hybridized probes derived from microarray assays of various tissues were incorporated into the database as part of the functional evidence. In the future, the content of the Orchidstra database will be expanded with transcriptome data and genomic information from more orchid species.

  17. Functional genomics of Lactobacillus casei establishment in the gut

    PubMed Central

    Licandro-Seraut, Hélène; Scornec, Hélène; Pédron, Thierry; Cavin, Jean-François; Sansonetti, Philippe J.

    2014-01-01

    Although the composition of the gut microbiota and its symbiotic contribution to key host physiological functions are well established, little is known as yet about the bacterial factors that account for this symbiosis. We selected Lactobacillus casei as a model microorganism to proceed to genomewide identification of the functions required for a symbiont to establish colonization in the gut. As a result of our recent development of a transposon-mutagenesis tool that overcomes the barrier that had prevented L. casei random mutagenesis, we developed a signature-tagged mutagenesis approach combining whole-genome reverse genetics using a set of tagged transposons and in vivo screening using the rabbit ligated ileal loop model. After sequencing transposon insertion sites in 9,250 random mutants, we assembled a library of 1,110 independent mutants, all disrupted in a different gene, that provides a representative view of the L. casei genome. By determining the relative quantity of each of the 1,110 mutants before and after the in vivo challenge, we identified a core of 47 L. casei genes necessary for its establishment in the gut. They are involved in housekeeping functions, metabolism (sugar, amino acids), cell wall biogenesis, and adaptation to environment. Hence we provide what is, to our knowledge, the first global functional genomics analysis of L. casei symbiosis. PMID:25024222

  18. FilooT: a visualization tool for exploring genomic data

    NASA Astrophysics Data System (ADS)

    Zeinaly, Mahshid; Soltangheis, Mina; Shaw, Chris D.

    2013-12-01

    In order to enhance analysis of synthetic health data of the IEEE VAST Challenge 2010, we introduce an interactive Visual Analytics tool called FilooT designed as a part of the Interactive Multi-genomic Analysis System (IMAS) project. In this paper, we describe different interactive views of FilooT: the Tabular View for exploring and comparing genetic sequences, the Matrix View for sorting sequences according to the values of different characteristics, the P-value View for finding the most important mutations across a family of sequences, the Graph View for finding related sequences and the Group View to group them for further investigation. We followed the Nested Process Model framework throughout the design process and the evaluation. To understand the tool's design capabilities for target domain analysts, we conducted a User Experience scenario-based study followed by an informal interview. The findings indicated how analysts employ each of the visualization and interaction designs in their Bioinformatics task-analysis process. The critical analysis of the results inspired design informing suggestions.

  19. Complete Genome Sequence of Flavobacterium psychrophilum Strain OSU THCO2-90, Used for Functional Genetic Analysis

    PubMed Central

    Rochat, Tatiana; Barbier, Paul; Nicolas, Pierre; Loux, Valentin; Pérez-Pascual, David; Guijarro, José A.; Bernardet, Jean-François

    2017-01-01

    ABSTRACT We report here the complete annotated genome sequence of Flavobacterium psychrophilum OSU THCO2-90, isolated from Coho salmon (Oncorhynchus kisutch) in Oregon. The genome consists of a circular chromosome with 2,343 predicted open reading frames. This strain has proved to be a valuable tool for functional genomics. PMID:28232446

  20. Novel R tools for analysis of genome-wide population genetic data with emphasis on clonality

    PubMed Central

    Kamvar, Zhian N.; Brooks, Jonah C.; Grünwald, Niklaus J.

    2015-01-01

    To gain a detailed understanding of how plant microbes evolve and adapt to hosts, pesticides, and other factors, knowledge of the population dynamics and evolutionary history of populations is crucial. Plant pathogen populations are often clonal or partially clonal which requires different analytical tools. With the advent of high throughput sequencing technologies, obtaining genome-wide population genetic data has become easier than ever before. We previously contributed the R package poppr specifically addressing issues with analysis of clonal populations. In this paper we provide several significant extensions to poppr with a focus on large, genome-wide SNP data. Specifically, we provide several new functionalities including the new function mlg.filter to define clone boundaries allowing for inspection and definition of what is a clonal lineage, minimum spanning networks with reticulation, a sliding-window analysis of the index of association, modular bootstrapping of any genetic distance, and analyses across any level of hierarchies. PMID:26113860

  1. RNA interference for functional genomics and improvement of cotton (Gossypium species)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    RNA interference (RNAi), is a powerful new technology in the discovery of genetic sequence functions, and has become a valuable tool for functional genomics of cotton (Gossypium ssp.). The rapid adoption of RNAi has replaced previous antisense technology. RNAi has aided in the discovery of function ...

  2. Ascribing Functions to Genes: Journey Towards Genetic Improvement of Rice Via Functional Genomics

    PubMed Central

    Mustafiz, Ananda; Kumari, Sumita; Karan, Ratna

    2016-01-01

    Rice, one of the most important cereal crops for mankind, feeds more than half the world population. Rice has been heralded as a model cereal owing to its small genome size, amenability to easy transformation, high synteny to other cereal crops and availability of complete genome sequence. Moreover, sequence wealth in rice is getting more refined and precise due to resequencing efforts. This humungous resource of sequence data has confronted research fraternity with a herculean challenge as well as an excellent opportunity to functionally validate expressed as well as regulatory portions of the genome. This will not only help us in understanding the genetic basis of plant architecture and physiology but would also steer us towards developing improved cultivars. No single technique can achieve such a mammoth task. Functional genomics through its diverse tools viz. loss and gain of function mutants, multifarious omics strategies like transcriptomics, proteomics, metabolomics and phenomics provide us with the necessary handle. A paradigm shift in technological advances in functional genomics strategies has been instrumental in generating considerable amount of information w.r.t functionality of rice genome. We now have several databases and online resources for functionally validated genes but despite that we are far from reaching the desired milestone of functionally characterizing each and every rice gene. There is an urgent need for a common platform, for information already available in rice, and collaborative efforts between researchers in a concerted manner as well as healthy public-private partnership, for genetic improvement of rice crop better able to handle the pressures of climate change and exponentially increasing population. PMID:27252584

  3. Sequencing Single Cell Microbial Genomes with Microfluidic Amplifications Tools (MICW - Metagenomics Informatics Challenges Workshop: 10K Genomes at a Time)

    ScienceCinema

    Quake, Steve [University of Stanford

    2016-07-12

    Stanford University's Steve Quake on "Sequencing Single Cell Microbial Genomes with Microfluidic Amplification Tools" at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.

  4. A web-based genomic sequence database for the Streptomycetaceae: a tool for systematics and genome mining

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The ARS Microbial Genome Sequence Database (http://199.133.98.43), a web-based database server, was established utilizing the BIGSdb (Bacterial Isolate Genomics Sequence Database) software package, developed at Oxford University, as a tool to manage multi-locus sequence data for the family Streptomy...

  5. Genomic Tools in Cowpea Breeding Programs: Status and Perspectives

    PubMed Central

    Boukar, Ousmane; Fatokun, Christian A.; Huynh, Bao-Lam; Roberts, Philip A.; Close, Timothy J.

    2016-01-01

    Cowpea is one of the most important grain legumes in sub-Saharan Africa (SSA). It provides strong support to the livelihood of small-scale farmers through its contributions to their nutritional security, income generation and soil fertility enhancement. Worldwide about 6.5 million metric tons of cowpea are produced annually on about 14.5 million hectares. The low productivity of cowpea is attributable to numerous abiotic and biotic constraints. The abiotic stress factors comprise drought, low soil fertility, and heat while biotic constraints include insects, diseases, parasitic weeds, and nematodes. Cowpea farmers also have limited access to quality seeds of improved varieties for planting. Some progress has been made through conventional breeding at international and national research institutions in the last three decades. Cowpea improvement could also benefit from modern breeding methods based on molecular genetic tools. A number of advances in cowpea genetic linkage maps, and quantitative trait loci associated with some desirable traits such as resistance to Striga, Macrophomina, Fusarium wilt, bacterial blight, root-knot nematodes, aphids, and foliar thrips have been reported. An improved consensus genetic linkage map has been developed and used to identify QTLs of additional traits. In order to take advantage of these developments single nucleotide polymorphism (SNP) genotyping is being streamlined to establish an efficient workflow supported by genotyping support service (GSS)-client interactions. About 1100 SNPs mapped on the cowpea genome were converted by LGC Genomics to KASP assays. Several cowpea breeding programs have been exploiting these resources to implement molecular breeding, especially for MARS and MABC, to accelerate cowpea variety improvement. The combination of conventional breeding and molecular breeding strategies, with workflow managed through the CGIAR breeding management system (BMS), promises an increase in the number of improved

  6. Genomic Tools in Cowpea Breeding Programs: Status and Perspectives.

    PubMed

    Boukar, Ousmane; Fatokun, Christian A; Huynh, Bao-Lam; Roberts, Philip A; Close, Timothy J

    2016-01-01

    Cowpea is one of the most important grain legumes in sub-Saharan Africa (SSA). It provides strong support to the livelihood of small-scale farmers through its contributions to their nutritional security, income generation and soil fertility enhancement. Worldwide about 6.5 million metric tons of cowpea are produced annually on about 14.5 million hectares. The low productivity of cowpea is attributable to numerous abiotic and biotic constraints. The abiotic stress factors comprise drought, low soil fertility, and heat while biotic constraints include insects, diseases, parasitic weeds, and nematodes. Cowpea farmers also have limited access to quality seeds of improved varieties for planting. Some progress has been made through conventional breeding at international and national research institutions in the last three decades. Cowpea improvement could also benefit from modern breeding methods based on molecular genetic tools. A number of advances in cowpea genetic linkage maps, and quantitative trait loci associated with some desirable traits such as resistance to Striga, Macrophomina, Fusarium wilt, bacterial blight, root-knot nematodes, aphids, and foliar thrips have been reported. An improved consensus genetic linkage map has been developed and used to identify QTLs of additional traits. In order to take advantage of these developments single nucleotide polymorphism (SNP) genotyping is being streamlined to establish an efficient workflow supported by genotyping support service (GSS)-client interactions. About 1100 SNPs mapped on the cowpea genome were converted by LGC Genomics to KASP assays. Several cowpea breeding programs have been exploiting these resources to implement molecular breeding, especially for MARS and MABC, to accelerate cowpea variety improvement. The combination of conventional breeding and molecular breeding strategies, with workflow managed through the CGIAR breeding management system (BMS), promises an increase in the number of improved

  7. GAViT: Genome Assembly Visualization Tool for Short Read Data

    SciTech Connect

    Syed, Aijazuddin; Shapiro, Harris; Tu, Hank; Pangilinan, Jasmyn; Trong, Stephan

    2008-03-14

    It is a challenging job for genome analysts to accurately debug, troubleshoot, and validate genome assembly results. Genome analysts rely on visualization tools to help validate and troubleshoot assembly results, including such problems as mis-assemblies, low-quality regions, and repeats. Short read data adds further complexity and makes it extremely challenging for the visualization tools to scale and to view all needed assembly information. As a result, there is a need for a visualization tool that can scale to display assembly data from the new sequencing technologies. We present Genome Assembly Visualization Tool (GAViT), a highly scalable and interactive assembly visualization tool developed at the DOE Joint Genome Institute (JGI).

  8. From bacterial genome to functionality; case bifidobacteria.

    PubMed

    Ventura, Marco; O'Connell-Motherway, Mary; Leahy, Sinead; Moreno-Munoz, Jose Antonio; Fitzgerald, Gerald F; van Sinderen, Douwe

    2007-11-30

    The availability of complete bacterial genome sequences has significantly furthered our understanding of the genetics, physiology and biochemistry of the microorganisms in question, particularly those that have commercially important applications. Bifidobacteria are among such microorganisms, as they constitute mammalian commensals of biotechnological significance due to their perceived role in maintaining a balanced gastrointestinal (GIT) microflora. Bifidobacteria are therefore frequently used as health-promoting or probiotic components in functional food products. A fundamental understanding of the metabolic activities employed by these commensal bacteria, in particular their capability to utilize a wide range of complex oligosaccharides, can reveal ways to provide in vivo growth advantages relative to other competing gut bacteria or pathogens. Furthermore, an in depth analysis of adaptive responses to nutritional or environmental stresses may provide methodologies to retain viability and improve functionality during commercial preparation, storage and delivery of the probiotic organism.

  9. Selfish drive can trump function when animal mitochondrial genomes compete

    PubMed Central

    Ma, Hansong; O’Farrell, Patrick H.

    2016-01-01

    Mitochondrial genomes compete for transmission from mother to progeny. We explored this competition by introducing a second genome into Drosophila melanogaster to follow transmission. Competitions between closely related genomes favored those functional in electron transport, resulting in a host-beneficial purifying selection1. Contrastingly, matchups between distant genomes often favored those with negligible, negative or lethal consequences, indicating selfish selection. Exhibiting powerful selfish selection, a genome carrying a detrimental mutation displaced a complementing genome leading to population death after several generations. In a different pairing, opposing selfish and purifying selection counterbalanced to give stable transmission of two genomes. Sequencing of recombinant mitochondrial genomes revealed that the non-coding region, containing origins of replication, governs selfish transmission. Uniparental inheritance prevents encounters between distantly related genomes. Nonetheless, within each maternal lineage, constant competition among sibling genomes selects for super-replicators. We suggest that this relentless competition drives positive selection promoting change in the sequences influencing transmission. PMID:27270106

  10. Selfish drive can trump function when animal mitochondrial genomes compete.

    PubMed

    Ma, Hansong; O'Farrell, Patrick H

    2016-07-01

    Mitochondrial genomes compete for transmission from mother to progeny. We explored this competition by introducing a second genome into Drosophila melanogaster to follow transmission. Competitions between closely related genomes favored those functional in electron transport, resulting in a host-beneficial purifying selection. In contrast, matchups between distantly related genomes often favored those with negligible, negative or lethal consequences, indicating selfish selection. Exhibiting powerful selfish selection, a genome carrying a detrimental mutation displaced a complementing genome, leading to population death after several generations. In a different pairing, opposing selfish and purifying selection counterbalanced to give stable transmission of two genomes. Sequencing of recombinant mitochondrial genomes showed that the noncoding region, containing origins of replication, governs selfish transmission. Uniparental inheritance prevents encounters between distantly related genomes. Nonetheless, in each maternal lineage, constant competition among sibling genomes selects for super-replicators. We suggest that this relentless competition drives positive selection, promoting change in the sequences influencing transmission.

  11. proGenomes: a resource for consistent functional and taxonomic annotations of prokaryotic genomes

    PubMed Central

    Mende, Daniel R.; Letunic, Ivica; Huerta-Cepas, Jaime; Li, Simone S.; Forslund, Kristoffer; Sunagawa, Shinichi; Bork, Peer

    2017-01-01

    The availability of microbial genomes has opened many new avenues of research within microbiology. This has been driven primarily by comparative genomics approaches, which rely on accurate and consistent characterization of genomic sequences. It is nevertheless difficult to obtain consistent taxonomic and integrated functional annotations for defined prokaryotic clades. Thus, we developed proGenomes, a resource that provides user-friendly access to currently 25 038 high-quality genomes whose sequences and consistent annotations can be retrieved individually or by taxonomic clade. These genomes are assigned to 5306 consistent and accurate taxonomic species clusters based on previously established methodology. proGenomes also contains functional information for almost 80 million protein-coding genes, including a comprehensive set of general annotations and more focused annotations for carbohydrate-active enzymes and antibiotic resistance genes. Additionally, broad habitat information is provided for many genomes. All genomes and associated information can be downloaded by user-selected clade or multiple habitat-specific sets of representative genomes. We expect that the availability of high-quality genomes with comprehensive functional annotations will promote advances in clinical microbial genomics, functional evolution and other subfields of microbiology. proGenomes is available at http://progenomes.embl.de. PMID:28053165

  12. proGenomes: a resource for consistent functional and taxonomic annotations of prokaryotic genomes.

    PubMed

    Mende, Daniel R; Letunic, Ivica; Huerta-Cepas, Jaime; Li, Simone S; Forslund, Kristoffer; Sunagawa, Shinichi; Bork, Peer

    2017-01-04

    The availability of microbial genomes has opened many new avenues of research within microbiology. This has been driven primarily by comparative genomics approaches, which rely on accurate and consistent characterization of genomic sequences. It is nevertheless difficult to obtain consistent taxonomic and integrated functional annotations for defined prokaryotic clades. Thus, we developed proGenomes, a resource that provides user-friendly access to currently 25 038 high-quality genomes whose sequences and consistent annotations can be retrieved individually or by taxonomic clade. These genomes are assigned to 5306 consistent and accurate taxonomic species clusters based on previously established methodology. proGenomes also contains functional information for almost 80 million protein-coding genes, including a comprehensive set of general annotations and more focused annotations for carbohydrate-active enzymes and antibiotic resistance genes. Additionally, broad habitat information is provided for many genomes. All genomes and associated information can be downloaded by user-selected clade or multiple habitat-specific sets of representative genomes. We expect that the availability of high-quality genomes with comprehensive functional annotations will promote advances in clinical microbial genomics, functional evolution and other subfields of microbiology. proGenomes is available at http://progenomes.embl.de.

  13. E-TALEN: a web tool to design TALENs for genome engineering.

    PubMed

    Heigwer, Florian; Kerr, Grainne; Walther, Nike; Glaeser, Kathrin; Pelz, Oliver; Breinig, Marco; Boutros, Michael

    2013-11-01

    Use of transcription activator-like effector nucleases (TALENs) is a promising new technique in the field of targeted genome engineering, editing and reverse genetics. Its applications span from introducing knockout mutations to endogenous tagging of proteins and targeted excision repair. Owing to this wide range of possible applications, there is a need for fast and user-friendly TALEN design tools. We developed E-TALEN (http://www.e-talen.org), a web-based tool to design TALENs for experiments of varying scale. E-TALEN enables the design of TALENs against a single target or a large number of target genes. We significantly extended previously published design concepts to consider genomic context and different applications. E-TALEN guides the user through an end-to-end design process of de novo TALEN pairs, which are specific to a certain sequence or genomic locus. Furthermore, E-TALEN offers a functionality to predict targeting and specificity for existing TALENs. Owing to the computational complexity of many of the steps in the design of TALENs, particular emphasis has been put on the implementation of fast yet accurate algorithms. We implemented a user-friendly interface, from the input parameters to the presentation of results. An additional feature of E-TALEN is the in-built sequence and annotation database available for many organisms, including human, mouse, zebrafish, Drosophila and Arabidopsis, which can be extended in the future.

  14. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects

    PubMed Central

    2011-01-01

    Background Second-generation sequencing technologies are precipitating major shifts with regards to what kinds of genomes are being sequenced and how they are annotated. While the first generation of genome projects focused on well-studied model organisms, many of today's projects involve exotic organisms whose genomes are largely terra incognita. This complicates their annotation, because unlike first-generation projects, there are no pre-existing 'gold-standard' gene-models with which to train gene-finders. Improvements in genome assembly and the wide availability of mRNA-seq data are also creating opportunities to update and re-annotate previously published genome annotations. Today's genome projects are thus in need of new genome annotation tools that can meet the challenges and opportunities presented by second-generation sequencing technologies. Results We present MAKER2, a genome annotation and data management tool designed for second-generation genome projects. MAKER2 is a multi-threaded, parallelized application that can process second-generation datasets of virtually any size. We show that MAKER2 can produce accurate annotations for novel genomes where training-data are limited, of low quality or even non-existent. MAKER2 also provides an easy means to use mRNA-seq data to improve annotation quality; and it can use these data to update legacy annotations, significantly improving their quality. We also show that MAKER2 can evaluate the quality of genome annotations, and identify and prioritize problematic annotations for manual review. Conclusions MAKER2 is the first annotation engine specifically designed for second-generation genome projects. MAKER2 scales to datasets of any size, requires little in the way of training data, and can use mRNA-seq data to improve annotation quality. It can also update and manage legacy genome annotation datasets. PMID:22192575

  15. Plant Aquaporins: Genome-Wide Identification, Transcriptomics, Proteomics, and Advanced Analytical Tools

    PubMed Central

    Deshmukh, Rupesh K.; Sonah, Humira; Bélanger, Richard R.

    2016-01-01

    Aquaporins (AQPs) are channel-forming integral membrane proteins that facilitate the movement of water and many other small molecules. Compared to animals, plants contain a much higher number of AQPs in their genome. Homology-based identification of AQPs in sequenced species is feasible because of the high level of conservation of protein sequences across plant species. Genome-wide characterization of AQPs has highlighted several important aspects such as distribution, genetic organization, evolution and conserved features governing solute specificity. From a functional point of view, the understanding of AQP transport system has expanded rapidly with the help of transcriptomics and proteomics data. The efficient analysis of enormous amounts of data generated through omic scale studies has been facilitated through computational advancements. Prediction of protein tertiary structures, pore architecture, cavities, phosphorylation sites, heterodimerization, and co-expression networks has become more sophisticated and accurate with increasing computational tools and pipelines. However, the effectiveness of computational approaches is based on the understanding of physiological and biochemical properties, transport kinetics, solute specificity, molecular interactions, sequence variations, phylogeny and evolution of aquaporins. For this purpose, tools like Xenopus oocyte assays, yeast expression systems, artificial proteoliposomes, and lipid membranes have been efficiently exploited to study the many facets that influence solute transport by AQPs. In the present review, we discuss genome-wide identification of AQPs in plants in relation with recent advancements in analytical tools, and their availability and technological challenges as they apply to AQPs. An exhaustive review of omics resources available for AQP research is also provided in order to optimize their efficient utilization. Finally, a detailed catalog of computational tools and analytical pipelines is

  16. Engineering plastid genomes: methods, tools, and applications in basic research and biotechnology.

    PubMed

    Bock, Ralph

    2015-01-01

    The small bacterial-type genome of the plastid (chloroplast) can be engineered by genetic transformation, generating cells and plants with transgenic plastid genomes, also referred to as transplastomic plants. The transformation process relies on homologous recombination, thereby facilitating the site-specific alteration of endogenous plastid genes as well as the precisely targeted insertion of foreign genes into the plastid DNA. The technology has been used extensively to analyze chloroplast gene functions and study plastid gene expression at all levels in vivo. Over the years, a large toolbox has been assembled that is now nearly comparable to the techniques available for plant nuclear transformation and that has enabled new applications of transplastomic technology in basic and applied research. This review describes the state of the art in engineering the plastid genomes of algae and land plants (Embryophyta). It provides an overview of the existing tools for plastid genome engineering, discusses current technological limitations, and highlights selected applications that demonstrate the immense potential of chloroplast transformation in several key areas of plant biotechnology.

  17. The function of genomes in bioenergetic organelles.

    PubMed Central

    Allen, John F

    2003-01-01

    Mitochondria and chloroplasts are energy-transducing organelles of the cytoplasm of eukaryotic cells. They originated as bacterial symbionts whose host cells acquired respiration from the precursor of the mitochondrion, and oxygenic photosynthesis from the precursor of the chloroplast. The host cells also acquired genetic information from their symbionts, eventually incorporating much of it into their own genomes. Genes of the eukaryotic cell nucleus now encode most mitochondrial and chloroplast proteins. Genes are copied and moved between cellular compartments with relative ease, and there is no obvious obstacle to successful import of any protein precursor from the cytosol. So why are any genes at all retained in cytoplasmic organelles? One proposal is that these small but functional genomes provide a location for genes that is close to, and in the same compartment as, their gene products. This co-location facilitates rapid and direct regulatory coupling. Redox control of synthesis de novo is put forward as the common property of those proteins that must be encoded and synthesized within mitochondria and chloroplasts. This testable hypothesis is termed CORR, for co-location for redox regulation. Principles, predictions and consequences of CORR are examined in the context of competing hypotheses and current evidence. PMID:12594916

  18. Population perspectives on functional genomic variation in yeast.

    PubMed

    Skelly, Daniel A; Magwene, Paul M

    2016-03-01

    Advances in high-throughput sequencing have facilitated large-scale surveys of genomic variation in the budding yeast,Saccharomyces cerevisiae These surveys have revealed extensive sequence variation between yeast strains. However, much less is known about how such variation influences the amount and nature of variation for functional genomic traits within and between yeast lineages. We review population-level studies of functional genomic variation, with a particular focus on how population functional genomic approaches can provide insights into both genome function and the evolutionary process. Although variation in functional genomics phenotypes is pervasive, our understanding of the consequences of this variation, either in physiological or evolutionary terms, is still rudimentary and thus motivates increased attention to appropriate null models. To date, much of the focus of population functional genomic studies has been on gene expression variation, but other functional genomic data types are just as likely to reveal important insights at the population level, suggesting a pressing need for more studies that go beyond transcription. Finally, we discuss how a population functional genomic perspective can be a powerful approach for developing a mechanistic understanding of the processes that link genomic variation to organismal phenotypes through gene networks.

  19. CRISPR/Cas9: an advanced tool for editing plant genomes.

    PubMed

    Samanta, Milan Kumar; Dey, Avishek; Gayen, Srimonta

    2016-10-01

    To meet current challenges in agriculture, genome editing using sequence-specific nucleases (SSNs) is a powerful tool for basic and applied plant biology research. Here, we describe the principle and application of available genome editing tools, including zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs) and the clustered regularly interspaced short palindromic repeat associated CRISPR/Cas9 system. Among these SSNs, CRISPR/Cas9 is the most recently characterized and rapidly developing genome editing technology, and has been successfully utilized in a wide variety of organisms. This review specifically illustrates the power of CRISPR/Cas9 as a tool for plant genome engineering, and describes the strengths and weaknesses of the CRISPR/Cas9 technology compared to two well-established genome editing tools, ZFNs and TALENs.

  20. Three-Dimensional Genome Organization and Function in Drosophila

    PubMed Central

    Schwartz, Yuri B.; Cavalli, Giacomo

    2017-01-01

    Understanding how the metazoan genome is used during development and cell differentiation is one of the major challenges in the postgenomic era. Early studies in Drosophila suggested that three-dimensional (3D) chromosome organization plays important regulatory roles in this process and recent technological advances started to reveal connections at the molecular level. Here we will consider general features of the architectural organization of the Drosophila genome, providing historical perspective and insights from recent work. We will compare the linear and spatial segmentation of the fly genome and focus on the two key regulators of genome architecture: insulator components and Polycomb group proteins. With its unique set of genetic tools and a compact, well annotated genome, Drosophila is poised to remain a model system of choice for rapid progress in understanding principles of genome organization and to serve as a proving ground for development of 3D genome-engineering techniques. PMID:28049701

  1. CAGO: a software tool for dynamic visual comparison and correlation measurement of genome organization.

    PubMed

    Chang, Yi-Feng; Chang, Chuan-Hsiung

    2011-01-01

    CAGO (Comparative Analysis of Genome Organization) is developed to address two critical shortcomings of conventional genome atlas plotters: lack of dynamic exploratory functions and absence of signal analysis for genomic properties. With dynamic exploratory functions, users can directly manipulate chromosome tracks of a genome atlas and intuitively identify distinct genomic signals by visual comparison. Signal analysis of genomic properties can further detect inconspicuous patterns from noisy genomic properties and calculate correlations between genomic properties across various genomes. To implement dynamic exploratory functions, CAGO presents each genome atlas in Scalable Vector Graphics (SVG) format and allows users to interact with it using a SVG viewer through JavaScript. Signal analysis functions are implemented using R statistical software and a discrete wavelet transformation package waveslim. CAGO is not only a plotter for generating complex genome atlases, but also a platform for exploring genome atlases with dynamic exploratory functions for visual comparison and with signal analysis for comparing genomic properties across multiple organisms. The web-based application of CAGO, its source code, user guides, video demos, and live examples are publicly available and can be accessed at http://cbs.ym.edu.tw/cago.

  2. Genetic and genomic tools for the marine annelid Platynereis dumerilii.

    PubMed

    Zantke, Juliane; Bannister, Stephanie; Rajan, Vinoth Babu Veedin; Raible, Florian; Tessmar-Raible, Kristin

    2014-05-01

    The bristle worm Platynereis dumerilii displays many interesting biological characteristics. These include its reproductive timing, which is synchronized to the moon phase, its regenerative capacity that is hormonally controlled, and a slow rate of evolution, which permits analyses of ancestral genes and cell types. As a marine annelid, Platynereis is also representative of the marine ecosystem, as well as one of the three large animal subphyla, the Lophotrochozoa. Here, we provide an overview of the molecular resources, functional techniques, and behavioral assays that have recently been established for the bristle worm. This combination of tools now places Platynereis in an excellent position to advance research at the frontiers of neurobiology, chronobiology, evo-devo, and marine biology.

  3. Genetic and Genomic Tools for the Marine Annelid Platynereis dumerilii

    PubMed Central

    Zantke, Juliane; Bannister, Stephanie; Rajan, Vinoth Babu Veedin; Raible, Florian; Tessmar-Raible, Kristin

    2014-01-01

    The bristle worm Platynereis dumerilii displays many interesting biological characteristics. These include its reproductive timing, which is synchronized to the moon phase, its regenerative capacity that is hormonally controlled, and a slow rate of evolution, which permits analyses of ancestral genes and cell types. As a marine annelid, Platynereis is also representative of the marine ecosystem, as well as one of the three large animal subphyla, the Lophotrochozoa. Here, we provide an overview of the molecular resources, functional techniques, and behavioral assays that have recently been established for the bristle worm. This combination of tools now places Platynereis in an excellent position to advance research at the frontiers of neurobiology, chronobiology, evo-devo, and marine biology. PMID:24807110

  4. Blast2GO: A Comprehensive Suite for Functional Analysis in Plant Genomics

    PubMed Central

    Conesa, Ana; Götz, Stefan

    2008-01-01

    Functional annotation of novel sequence data is a primary requirement for the utilization of functional genomics approaches in plant research. In this paper, we describe the Blast2GO suite as a comprehensive bioinformatics tool for functional annotation of sequences and data mining on the resulting annotations, primarily based on the gene ontology (GO) vocabulary. Blast2GO optimizes function transfer from homologous sequences through an elaborate algorithm that considers similarity, the extension of the homology, the database of choice, the GO hierarchy, and the quality of the original annotations. The tool includes numerous functions for the visualization, management, and statistical analysis of annotation results, including gene set enrichment analysis. The application supports InterPro, enzyme codes, KEGG pathways, GO direct acyclic graphs (DAGs), and GOSlim. Blast2GO is a suitable tool for plant genomics research because of its versatility, easy installation, and friendly use. PMID:18483572

  5. BABELOMICS: a systems biology perspective in the functional annotation of genome-scale experiments

    PubMed Central

    Al-Shahrour, Fátima; Minguez, Pablo; Tárraga, Joaquín; Montaner, David; Alloza, Eva; Vaquerizas, Juan M.; Conde, Lucía; Blaschke, Christian; Vera, Javier; Dopazo, Joaquín

    2006-01-01

    We present a new version of Babelomics, a complete suite of web tools for functional analysis of genome-scale experiments, with new and improved tools. New functionally relevant terms have been included such as CisRed motifs or bioentities obtained by text-mining procedures. An improved indexing has considerably speeded up several of the modules. An improved version of the FatiScan method for studying the coordinate behaviour of groups of functionally related genes is presented, along with a similar tool, the Gene Set Enrichment Analysis. Babelomics is now more oriented to test systems biology inspired hypotheses. Babelomics can be found at . PMID:16845052

  6. Functional profiling of the Saccharomyces cerevisiae genome.

    PubMed

    Giaever, Guri; Chu, Angela M; Ni, Li; Connelly, Carla; Riles, Linda; Véronneau, Steeve; Dow, Sally; Lucau-Danila, Ankuta; Anderson, Keith; André, Bruno; Arkin, Adam P; Astromoff, Anna; El-Bakkoury, Mohamed; Bangham, Rhonda; Benito, Rocio; Brachat, Sophie; Campanaro, Stefano; Curtiss, Matt; Davis, Karen; Deutschbauer, Adam; Entian, Karl-Dieter; Flaherty, Patrick; Foury, Francoise; Garfinkel, David J; Gerstein, Mark; Gotte, Deanna; Güldener, Ulrich; Hegemann, Johannes H; Hempel, Svenja; Herman, Zelek; Jaramillo, Daniel F; Kelly, Diane E; Kelly, Steven L; Kötter, Peter; LaBonte, Darlene; Lamb, David C; Lan, Ning; Liang, Hong; Liao, Hong; Liu, Lucy; Luo, Chuanyun; Lussier, Marc; Mao, Rong; Menard, Patrice; Ooi, Siew Loon; Revuelta, Jose L; Roberts, Christopher J; Rose, Matthias; Ross-Macdonald, Petra; Scherens, Bart; Schimmack, Greg; Shafer, Brenda; Shoemaker, Daniel D; Sookhai-Mahadeo, Sharon; Storms, Reginald K; Strathern, Jeffrey N; Valle, Giorgio; Voet, Marleen; Volckaert, Guido; Wang, Ching-yun; Ward, Teresa R; Wilhelmy, Julie; Winzeler, Elizabeth A; Yang, Yonghong; Yen, Grace; Youngman, Elaine; Yu, Kexin; Bussey, Howard; Boeke, Jef D; Snyder, Michael; Philippsen, Peter; Davis, Ronald W; Johnston, Mark

    2002-07-25

    Determining the effect of gene deletion is a fundamental approach to understanding gene function. Conventional genetic screens exhibit biases, and genes contributing to a phenotype are often missed. We systematically constructed a nearly complete collection of gene-deletion mutants (96% of annotated open reading frames, or ORFs) of the yeast Saccharomyces cerevisiae. DNA sequences dubbed 'molecular bar codes' uniquely identify each strain, enabling their growth to be analysed in parallel and the fitness contribution of each gene to be quantitatively assessed by hybridization to high-density oligonucleotide arrays. We show that previously known and new genes are necessary for optimal growth under six well-studied conditions: high salt, sorbitol, galactose, pH 8, minimal medium and nystatin treatment. Less than 7% of genes that exhibit a significant increase in messenger RNA expression are also required for optimal growth in four of the tested conditions. Our results validate the yeast gene-deletion collection as a valuable resource for functional genomics.

  7. Pathway Tools version 19.0 update: software for pathway/genome informatics and systems biology.

    PubMed

    Karp, Peter D; Latendresse, Mario; Paley, Suzanne M; Krummenacker, Markus; Ong, Quang D; Billington, Richard; Kothari, Anamika; Weaver, Daniel; Lee, Thomas; Subhraveti, Pallavi; Spaulding, Aaron; Fulcher, Carol; Keseler, Ingrid M; Caspi, Ron

    2016-09-01

    Pathway Tools is a bioinformatics software environment with a broad set of capabilities. The software provides genome-informatics tools such as a genome browser, sequence alignments, a genome-variant analyzer and comparative-genomics operations. It offers metabolic-informatics tools, such as metabolic reconstruction, quantitative metabolic modeling, prediction of reaction atom mappings and metabolic route search. Pathway Tools also provides regulatory-informatics tools, such as the ability to represent and visualize a wide range of regulatory interactions. This article outlines the advances in Pathway Tools in the past 5 years. Major additions include components for metabolic modeling, metabolic route search, computation of atom mappings and estimation of compound Gibbs free energies of formation; addition of editors for signaling pathways, for genome sequences and for cellular architecture; storage of gene essentiality data and phenotype data; display of multiple alignments, and of signaling and electron-transport pathways; and development of Python and web-services application programming interfaces. Scientists around the world have created more than 9800 Pathway/Genome Databases by using Pathway Tools, many of which are curated databases for important model organisms.

  8. Phytozome: a Tool for Green Plant Comparative Genomics

    DOE Data Explorer

    Phytozome is a joint project of the Department of Energy's Joint Genome Institute and the Center for Integrative Genomics to facilitate comparative genomic studies amongst green plants. Clusters of orthologous and paralogous genes that represent the modern descendents of ancestral gene sets are constructed at key phylogenetic nodes. These clusters allow easy access to clade specific orthology/paralogy relationships as well as clade specific genes and gene expansions. As of release v4.0, Phytozome provides access to nine sequenced and annotated green plant genomes, eight of which have been clustered into gene families at six evolutionarily significant nodes. Where possible, each gene has been annotated with PFAM, KOG, KEGG, and PANTHER assignments, and publicly available annotations from RefSeq, UniProt, TAIR, JGI are hyper-linked and searchable. [Copied from the Overview at http://www.phytozome.net/Phytozome_info.php

  9. CSTminer: a web tool for the identification of coding and noncoding conserved sequence tags through cross-species genome comparison.

    PubMed

    Castrignanò, Tiziana; Canali, Alessandro; Grillo, Giorgio; Liuni, Sabino; Mignone, Flavio; Pesole, Graziano

    2004-07-01

    The identification and characterization of genome tracts that are highly conserved across species during evolution may contribute significantly to the functional annotation of whole-genome sequences. Indeed, such sequences are likely to correspond to known or unknown coding exons or regulatory motifs. Here, we present a web server implementing a previously developed algorithm that, by comparing user-submitted genome sequences, is able to identify statistically significant conserved blocks and assess their coding or noncoding nature through the measure of a coding potential score. The web tool, available at http://www.caspur.it/CSTminer/, is dynamically interconnected with the Ensembl genome resources and produces a graphical output showing a map of detected conserved sequences and annotated gene features.

  10. A statistical framework to predict functional non-coding regions in the human genome through integrated analysis of annotation data.

    PubMed

    Lu, Qiongshi; Hu, Yiming; Sun, Jiehuan; Cheng, Yuwei; Cheung, Kei-Hoi; Zhao, Hongyu

    2015-05-27

    Identifying functional regions in the human genome is a major goal in human genetics. Great efforts have been made to functionally annotate the human genome either through computational predictions, such as genomic conservation, or high-throughput experiments, such as the ENCODE project. These efforts have resulted in a rich collection of functional annotation data of diverse types that need to be jointly analyzed for integrated interpretation and annotation. Here we present GenoCanyon, a whole-genome annotation method that performs unsupervised statistical learning using 22 computational and experimental annotations thereby inferring the functional potential of each position in the human genome. With GenoCanyon, we are able to predict many of the known functional regions. The ability of predicting functional regions as well as its generalizable statistical framework makes GenoCanyon a unique and powerful tool for whole-genome annotation. The GenoCanyon web server is available at http://genocanyon.med.yale.edu.

  11. Comparative genomics of pectinacetylesterases: Insight on function and biology

    PubMed Central

    de Souza, Amancio José; Pauly, Markus

    2015-01-01

    Pectin acetylation influences the gelling ability of this important plant polysaccharide for the food industry. Plant apoplastic pectinacetylesterases (PAEs) play a key role in regulating the degree of pectin acetylation and modifying their expression thus represents one way to engineer plant polysaccharides for food applications. Identifying the major active enzymes within the PAE gene family will aid in our understanding of this biological phenomena as well as provide the tools for direct trait manipulation. Using comparative genomics we propose that there is a minimal set of 4 distinct PAEs in plants. Possible functional diversification of the PAE family in the grasses is also explored with the identification of 3 groups of PAE genes specific to grasses. PMID:26237162

  12. NCBI GEO: archive for functional genomics data sets--update.

    PubMed

    Barrett, Tanya; Wilhite, Stephen E; Ledoux, Pierre; Evangelista, Carlos; Kim, Irene F; Tomashevsky, Maxim; Marshall, Kimberly A; Phillippy, Katherine H; Sherman, Patti M; Holko, Michelle; Yefanov, Andrey; Lee, Hyeseung; Zhang, Naigong; Robertson, Cynthia L; Serova, Nadezhda; Davis, Sean; Soboleva, Alexandra

    2013-01-01

    The Gene Expression Omnibus (GEO, http://www.ncbi.nlm.nih.gov/geo/) is an international public repository for high-throughput microarray and next-generation sequence functional genomic data sets submitted by the research community. The resource supports archiving of raw data, processed data and metadata which are indexed, cross-linked and searchable. All data are freely available for download in a variety of formats. GEO also provides several web-based tools and strategies to assist users to query, analyse and visualize data. This article reports current status and recent database developments, including the release of GEO2R, an R-based web application that helps users analyse GEO data.

  13. Advances in mRNA Silencing and Transgene Expression: a Gateway to Functional Genomics in Schistosomes

    PubMed Central

    Tchoubrieva, Elissaveta B.; Kalinna, Bernd H.

    2013-01-01

    The completion of the WHO Schistosoma Genome Project in 2008, although not fully annotated, provides a golden opportunity to actively pursue fundamental research on the parasites genome. This analysis will aid identification of targets for drugs, vaccines and markers for diagnostic tools as well as for studying the biological basis of drug resistance, infectivity and pathology. For the validation of drug and vaccine targets, the genomic sequence data is only of use if functional analyses can be conducted (in the parasite itself). Until recently, gene manipulation approaches had not been seriously addressed. This situation is now changing and rapid advances have been made in gene silencing and transgenesis of schistosomes. PMID:21415884

  14. The role of chromosome domains in shaping the functional genome.

    PubMed

    Sexton, Tom; Cavalli, Giacomo

    2015-03-12

    The genome must be highly compacted to fit within eukaryotic nuclei but must be accessible to the transcriptional machinery to allow appropriate expression of genes in different cell types and throughout developmental pathways. A growing body of work has shown that the genome, analogously to proteins, forms an ordered, hierarchical structure that closely correlates and may even be causally linked with regulation of functions such as transcription. This review describes our current understanding of how these functional genomic "secondary and tertiary structures" form a blueprint for global nuclear architecture and the potential they hold for understanding and manipulating genomic regulation.

  15. Comprehensive Resources for Tomato Functional Genomics Based on the Miniature Model Tomato Micro-Tom

    PubMed Central

    Matsukura, C; Aoki, K; Fukuda, N; Mizoguchi, T; Asamizu, E; Saito, T; Shibata, D; Ezura, H

    2008-01-01

    Tomato (Solanum lycopersicum L., Solanaceae) is an excellent model plant for genomic research of solanaceous plants, as well as for studying the development, ripening, and metabolism of fruit. In 2003, the International Solanaceae Project (SOL, www.sgn.cornell.edu ) was initiated by members from more than 30 countries, and the tomato genome-sequencing project is currently underway. Genome sequence of tomato obtained by this project will provide a firm foundation for forthcoming genomic studies such as the comparative analysis of genes conserved among the Solanaceae species and the elucidation of the functions of unknown tomato genes. To exploit the wealth of the genome sequence information, there is an urgent need for novel resources and analytical tools for tomato functional genomics. Here, we present an overview of the development of genetic and genomic resources of tomato in the last decade, with a special focus on the activities of Japan SOL and the National Bio-Resource Project in the development of functional genomic resources of a model cultivar, Micro-Tom. PMID:19506732

  16. Genome-editing tools for stem cell biology

    PubMed Central

    Vasileva, E A; Shuvalov, O U; Garabadgiu, A V; Melino, G; Barlev, N A

    2015-01-01

    Human pluripotent stem cells provide a versatile platform for regenerative studies, drug testing and disease modeling. That the expression of only four transcription factors, Oct4, Klf4, Sox2 and c-Myc (OKSM), is sufficient for generation of induced pluripotent stem cells (iPSCs) from differentiated somatic cells has revolutionized the field and also highlighted the importance of OKSM as targets for genome editing. A number of novel genome-editing systems have been developed recently. In this review, we focus on successful applications of several such systems for generation of iPSCs. In particular, we discuss genome-editing systems based on zinc-finger fusion proteins (ZFs), transcription activator-like effectors (TALEs) and an RNA-guided DNA-specific nuclease, Cas9, derived from the bacterial defense system against viruses that utilizes clustered regularly interspaced short palindromic repeats (CRISPR). PMID:26203860

  17. BACs as tools for the study of genomic imprinting.

    PubMed

    Tunster, S J; Van De Pette, M; John, R M

    2011-01-01

    Genomic imprinting in mammals results in the expression of genes from only one parental allele. Imprinting occurs as a consequence of epigenetic marks set down either in the father's or the mother's germ line and affects a very specific category of mammalian gene. A greater understanding of this distinctive phenomenon can be gained from studies using large genomic clones, called bacterial artificial chromosomes (BACs). Here, we review the important applications of BACs to imprinting research, covering physical mapping studies and the use of BACs as transgenes in mice to study gene expression patterns, to identify imprinting centres, and to isolate the consequences of altered gene dosage. We also highlight the significant and unique advantages that rapid BAC engineering brings to genomic imprinting research.

  18. Development and characterization of rice mutants for functional genomic studies and breeding

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Mutagenesis is a powerful tool for creating genetic materials for studying functional genomics, breeding, and understanding the molecular basis of disease resistance. Approximately 100,000 putative mutants of rice (Oryza sativa L.) have been generated with mutagens. Numerous mutant genes involved in...

  19. Using functional genomics to identify molecular markers for fire blight resistance (Erwinia amylovora) in apple (Malus)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Fire blight, caused by Erwinia amylovora (Ea), is a destructive disease of apple (Malus), pear (Pyrus) and some woody ornamentals in the rose family (Rosaceae). The goal of this project is to use a functional genomics approach to develop tools to breed fire blight resistant apples. Six hundred fifty...

  20. Cancer genomics object model: an object model for multiple functional genomics data for cancer research.

    PubMed

    Park, Yu Rang; Lee, Hye Won; Cho, Sung Bum; Kim, Ju Han

    2007-01-01

    The development of functional genomics including transcriptomics, proteomics and metabolomics allow us to monitor a large number of key cellular pathways simultaneously. Several technology-specific data models have been introduced for the representation of functional genomics experimental data, including the MicroArray Gene Expression-Object Model (MAGE-OM), the Proteomics Experiment Data Repository (PEDRo), and the Tissue MicroArray-Object Model (TMA-OM). Despite the increasing number of cancer studies using multiple functional genomics technologies, there is still no integrated data model for multiple functional genomics experimental and clinical data. We propose an object-oriented data model for cancer genomics research, Cancer Genomics Object Model (CaGe-OM). We reference four data models: Functional Genomic-Object Model, MAGE-OM, TMAOM and PEDRo. The clinical and histopathological information models are created by analyzing cancer management workflow and referencing the College of American Pathology Cancer Protocols and National Cancer Institute Common Data Elements. The CaGe-OM provides a comprehensive data model for integrated storage and analysis of clinical and multiple functional genomics data.

  1. NCBI GEO: archive for high-throughput functional genomic data.

    PubMed

    Barrett, Tanya; Troup, Dennis B; Wilhite, Stephen E; Ledoux, Pierre; Rudnev, Dmitry; Evangelista, Carlos; Kim, Irene F; Soboleva, Alexandra; Tomashevsky, Maxim; Marshall, Kimberly A; Phillippy, Katherine H; Sherman, Patti M; Muertter, Rolf N; Edgar, Ron

    2009-01-01

    The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) is the largest public repository for high-throughput gene expression data. Additionally, GEO hosts other categories of high-throughput functional genomic data, including those that examine genome copy number variations, chromatin structure, methylation status and transcription factor binding. These data are generated by the research community using high-throughput technologies like microarrays and, more recently, next-generation sequencing. The database has a flexible infrastructure that can capture fully annotated raw and processed data, enabling compliance with major community-derived scientific reporting standards such as 'Minimum Information About a Microarray Experiment' (MIAME). In addition to serving as a centralized data storage hub, GEO offers many tools and features that allow users to effectively explore, analyze and download expression data from both gene-centric and experiment-centric perspectives. This article summarizes the GEO repository structure, content and operating procedures, as well as recently introduced data mining features. GEO is freely accessible at http://www.ncbi.nlm.nih.gov/geo/.

  2. SigmoID: a user-friendly tool for improving bacterial genome annotation through analysis of transcription control signals

    PubMed Central

    Damienikan, Aliaksandr U.

    2016-01-01

    The majority of bacterial genome annotations are currently automated and based on a ‘gene by gene’ approach. Regulatory signals and operon structures are rarely taken into account which often results in incomplete and even incorrect gene function assignments. Here we present SigmoID, a cross-platform (OS X, Linux and Windows) open-source application aiming at simplifying the identification of transcription regulatory sites (promoters, transcription factor binding sites and terminators) in bacterial genomes and providing assistance in correcting annotations in accordance with regulatory information. SigmoID combines a user-friendly graphical interface to well known command line tools with a genome browser for visualising regulatory elements in genomic context. Integrated access to online databases with regulatory information (RegPrecise and RegulonDB) and web-based search engines speeds up genome analysis and simplifies correction of genome annotation. We demonstrate some features of SigmoID by constructing a series of regulatory protein binding site profiles for two groups of bacteria: Soft Rot Enterobacteriaceae (Pectobacterium and Dickeya spp.) and Pseudomonas spp. Furthermore, we inferred over 900 transcription factor binding sites and alternative sigma factor promoters in the annotated genome of Pectobacterium atrosepticum. These regulatory signals control putative transcription units covering about 40% of the P. atrosepticum chromosome. Reviewing the annotation in cases where it didn’t fit with regulatory information allowed us to correct product and gene names for over 300 loci. PMID:27257541

  3. SigmoID: a user-friendly tool for improving bacterial genome annotation through analysis of transcription control signals.

    PubMed

    Nikolaichik, Yevgeny; Damienikan, Aliaksandr U

    2016-01-01

    The majority of bacterial genome annotations are currently automated and based on a 'gene by gene' approach. Regulatory signals and operon structures are rarely taken into account which often results in incomplete and even incorrect gene function assignments. Here we present SigmoID, a cross-platform (OS X, Linux and Windows) open-source application aiming at simplifying the identification of transcription regulatory sites (promoters, transcription factor binding sites and terminators) in bacterial genomes and providing assistance in correcting annotations in accordance with regulatory information. SigmoID combines a user-friendly graphical interface to well known command line tools with a genome browser for visualising regulatory elements in genomic context. Integrated access to online databases with regulatory information (RegPrecise and RegulonDB) and web-based search engines speeds up genome analysis and simplifies correction of genome annotation. We demonstrate some features of SigmoID by constructing a series of regulatory protein binding site profiles for two groups of bacteria: Soft Rot Enterobacteriaceae (Pectobacterium and Dickeya spp.) and Pseudomonas spp. Furthermore, we inferred over 900 transcription factor binding sites and alternative sigma factor promoters in the annotated genome of Pectobacterium atrosepticum. These regulatory signals control putative transcription units covering about 40% of the P. atrosepticum chromosome. Reviewing the annotation in cases where it didn't fit with regulatory information allowed us to correct product and gene names for over 300 loci.

  4. Public data and open source tools for multi-assay genomic investigation of disease

    PubMed Central

    Kannan, Lavanya; Ramos, Marcel; Re, Angela; El-Hachem, Nehme; Safikhani, Zhaleh; Gendoo, Deena M.A.; Davis, Sean; Gomez-Cabrero, David; Castelo, Robert; Hansen, Kasper D.; Carey, Vincent J.; Morgan, Martin; Culhane, Aedín C.; Haibe-Kains, Benjamin

    2016-01-01

    Molecular interrogation of a biological sample through DNA sequencing, RNA and microRNA profiling, proteomics and other assays, has the potential to provide a systems level approach to predicting treatment response and disease progression, and to developing precision therapies. Large publicly funded projects have generated extensive and freely available multi-assay data resources; however, bioinformatic and statistical methods for the analysis of such experiments are still nascent. We review multi-assay genomic data resources in the areas of clinical oncology, pharmacogenomics and other perturbation experiments, population genomics and regulatory genomics and other areas, and tools for data acquisition. Finally, we review bioinformatic tools that are explicitly geared toward integrative genomic data visualization and analysis. This review provides starting points for accessing publicly available data and tools to support development of needed integrative methods. PMID:26463000

  5. Population genomics as a new tool for wildlife management.

    PubMed

    Gompert, Zachariah

    2012-04-01

    Admixture and introgression have varied effects on population viability and fitness. Admixture might be an important source of new alleles, particularly for small, geographically isolated populations. However, admixture might also cause outbreeding depression if populations are adapted to different ecological or climatic conditions. Because of the emerging use of translocation and admixture as a conservation and wildlife management strategy to reduce genetic load (termed genetic rescue), the possible effects of admixture have practical consequences (Bouzat et al. 2009; Hedrick & Fredrickson 2010). Importantly, genetic load and local adaptation are properties of individual loci and epistatic interactions among loci rather than properties of genomes. Likewise, the outcome and consequences of genetic rescue depend on the fitness effects of individual introduced alleles. In this issue of Molecular Ecology, Miller et al. (2012) use model-based, population genomic analyses to document locus-specific effects of a recent genetic rescue in the bighorn sheep population within the National Bison Range wildlife refuge (NBR; Montana, USA). They find a subset of introduced alleles associated with increased fitness in NBR bighorn sheep, some of which experienced accelerated introgression following their introduction. These loci mark regions of the genome that could constitute the genetic basis of the successful NBR bighorn sheep genetic rescue. Although population genomic analyses are frequently used to study local adaptation and selection (e.g. Hohenlohe et al. 2010; Lawniczak et al. 2010), this study constitutes a novel application of this analytical framework for wildlife management. Moreover, the detailed demographic data available for the NBR bighorn sheep population provide a rare and powerful source of information and allow more robust population genomic inference than is often possible.

  6. High-resolution interrogation of functional elements in the noncoding genome

    PubMed Central

    Sanjana, Neville E.; Wright, Jason; Zheng, Kaijie; Shalem, Ophir; Fontanillas, Pierre; Joung, Julia; Cheng, Christine; Regev, Aviv; Zhang, Feng

    2016-01-01

    The noncoding genome affects gene regulation and disease, yet we lack tools for rapid identification and manipulation of noncoding elements. We develop a CRISPR screen employing ~18,000 sgRNAs targeting >700 kb surrounding the genes NF1, NF2, and CUL3, which are involved in BRAF inhibitor resistance in melanoma. We find that noncoding locations that modulate drug resistance also harbor predictive hallmarks of noncoding function. With a subset of regions at the CUL3 locus, we demonstrate that engineered mutations alter transcription factor occupancy and long-range and local epigenetic environments, implicating these sites in gene regulation and chemotherapeutic resistance. Though our expansion of the potential of pooled CRISPR screens we provide tools for genomic discovery and for elucidating biologically relevant mechanisms of gene regulation. Pooled CRISPR mutagenesis identifies functional elements in the noncoding genome. PMID:27708104

  7. BambooGDB: a bamboo genome database with functional annotation and an analysis platform

    PubMed Central

    Zhao, Hansheng; Peng, Zhenhua; Fei, Benhua; Li, Lubin; Hu, Tao; Gao, Zhimin; Jiang, Zehui

    2014-01-01

    Bamboo, as one of the most important non-timber forest products and fastest-growing plants in the world, represents the only major lineage of grasses that is native to forests. Recent success on the first high-quality draft genome sequence of moso bamboo (Phyllostachys edulis) provides new insights on bamboo genetics and evolution. To further extend our understanding on bamboo genome and facilitate future studies on the basis of previous achievements, here we have developed BambooGDB, a bamboo genome database with functional annotation and analysis platform. The de novo sequencing data, together with the full-length complementary DNA and RNA-seq data of moso bamboo composed the main contents of this database. Based on these sequence data, a comprehensively functional annotation for bamboo genome was made. Besides, an analytical platform composed of comparative genomic analysis, protein–protein interactions network, pathway analysis and visualization of genomic data was also constructed. As discovery tools to understand and identify biological mechanisms of bamboo, the platform can be used as a systematic framework for helping and designing experiments for further validation. Moreover, diverse and powerful search tools and a convenient browser were incorporated to facilitate the navigation of these data. As far as we know, this is the first genome database for bamboo. Through integrating high-throughput sequencing data, a full functional annotation and several analysis modules, BambooGDB aims to provide worldwide researchers with a central genomic resource and an extensible analysis platform for bamboo genome. BambooGDB is freely available at http://www.bamboogdb.org/. Database URL: http://www.bamboogdb.org PMID:24602877

  8. BambooGDB: a bamboo genome database with functional annotation and an analysis platform.

    PubMed

    Zhao, Hansheng; Peng, Zhenhua; Fei, Benhua; Li, Lubin; Hu, Tao; Gao, Zhimin; Jiang, Zehui

    2014-01-01

    Bamboo, as one of the most important non-timber forest products and fastest-growing plants in the world, represents the only major lineage of grasses that is native to forests. Recent success on the first high-quality draft genome sequence of moso bamboo (Phyllostachys edulis) provides new insights on bamboo genetics and evolution. To further extend our understanding on bamboo genome and facilitate future studies on the basis of previous achievements, here we have developed BambooGDB, a bamboo genome database with functional annotation and analysis platform. The de novo sequencing data, together with the full-length complementary DNA and RNA-seq data of moso bamboo composed the main contents of this database. Based on these sequence data, a comprehensively functional annotation for bamboo genome was made. Besides, an analytical platform composed of comparative genomic analysis, protein-protein interactions network, pathway analysis and visualization of genomic data was also constructed. As discovery tools to understand and identify biological mechanisms of bamboo, the platform can be used as a systematic framework for helping and designing experiments for further validation. Moreover, diverse and powerful search tools and a convenient browser were incorporated to facilitate the navigation of these data. As far as we know, this is the first genome database for bamboo. Through integrating high-throughput sequencing data, a full functional annotation and several analysis modules, BambooGDB aims to provide worldwide researchers with a central genomic resource and an extensible analysis platform for bamboo genome. BambooGDB is freely available at http://www.bamboogdb.org/. Database URL: http://www.bamboogdb.org.

  9. New tools for experimental diabetes research: Cellular reprogramming and genome editing

    PubMed Central

    2016-01-01

    Isolated human islets are a rare and precious material for diabetes research. However, their availability is limited, and it is impossible to obtain them from patients with specific genotypes. Human pluripotent stem cells provide an alternative. Induced pluripotent stem cells can be generated from any individual’s somatic cells and differentiated into pancreatic cells. Currently, this approach is limited by the immaturity of the islet-like cells derived from stem cells. However, this approach can already be used to model developmental defects, and the possibilities for studying insulin secretion are continuously improving. In addition, genome editing using the CRISPR/Cas9 technology provides powerful possibilities to study the impact of specific genotypes. The same technology can also be used for transcriptional regulation in order to improve the functional maturation of stem cell-derived islets. These tools are today becoming available for tomorrow’s translational diabetes research. PMID:27007444

  10. Novel tools for an old lineage: Population genomics for cycads.

    PubMed

    Cibrián-Jaramillo, Angelica; Marler, Thomas E

    2011-07-01

    With a ca. 300 million year-old evolutionary history, cycads are often perceived as "living fossils," relicts of their previously widespread dominance. Patterns of genetic variation for a member of the most basal cycad genus, Cycas micronesica, support the notion that cycads are a dynamic group with ongoing diversification. Herein we hypothesize that cycad's hefty genomes enable rapid adaptive change and facilitate specific beneficial interactions with varying assemblages of symbionts. Characterizing population-level genomic patterns of cycads and their symbionts, pollinators in particular, will enlighten our understanding of these mechanisms and of adaptive variation that underlies cycad evolution. In light of rapid climate and landscape change, cycads are a beacon for understanding the ecological processes that ultimately enable species long-term survival.

  11. Genetic screens and functional genomics using CRISPR/Cas9 technology.

    PubMed

    Hartenian, Ella; Doench, John G

    2015-04-01

    Functional genomics attempts to understand the genome by perturbing the flow of information from DNA to RNA to protein, in order to learn how gene dysfunction leads to disease. CRISPR/Cas9 technology is the newest tool in the geneticist's toolbox, allowing researchers to edit DNA with unprecedented ease, speed and accuracy, and representing a novel means to perform genome-wide genetic screens to discover gene function. In this review, we first summarize the discovery and characterization of CRISPR/Cas9, and then compare it to other genome engineering technologies. We discuss its initial use in screening applications, with a focus on optimizing on-target activity and minimizing off-target effects. Finally, we comment on future challenges and opportunities afforded by this technology.

  12. Genomics tools for the unraveling of chromosome architecture

    PubMed Central

    van Steensel, Bas; Dekker, Job

    2010-01-01

    The spatial organization of chromosomes inside the cell nucleus is still poorly understood. This organization is guided by intra- and interchromosomal contacts and by interactions of specific chromosomal loci with relatively fixed nuclear “landmarks” such as the nuclear envelope and the nucleolus. New molecular genome-wide mapping techniques have begun to uncover both types of molecular interactions, providing insights into the fundamental principles of interphase chromosome folding. PMID:20944601

  13. Partnering for functional genomics research conference: Abstracts of poster presentations

    SciTech Connect

    1998-06-01

    This reports contains abstracts of poster presentations presented at the Functional Genomics Research Conference held April 16--17, 1998 in Oak Ridge, Tennessee. Attention is focused on the following areas: mouse mutagenesis and genomics; phenotype screening; gene expression analysis; DNA analysis technology development; bioinformatics; comparative analyses of mouse, human, and yeast sequences; and pilot projects to evaluate methodologies.

  14. Cost-Effective Cloud Computing: A Case Study Using the Comparative Genomics Tool, Roundup

    PubMed Central

    Kudtarkar, Parul; DeLuca, Todd F.; Fusaro, Vincent A.; Tonellato, Peter J.; Wall, Dennis P.

    2010-01-01

    Background Comparative genomics resources, such as ortholog detection tools and repositories are rapidly increasing in scale and complexity. Cloud computing is an emerging technological paradigm that enables researchers to dynamically build a dedicated virtual cluster and may represent a valuable alternative for large computational tools in bioinformatics. In the present manuscript, we optimize the computation of a large-scale comparative genomics resource—Roundup—using cloud computing, describe the proper operating principles required to achieve computational efficiency on the cloud, and detail important procedures for improving cost-effectiveness to ensure maximal computation at minimal costs. Methods Utilizing the comparative genomics tool, Roundup, as a case study, we computed orthologs among 902 fully sequenced genomes on Amazon’s Elastic Compute Cloud. For managing the ortholog processes, we designed a strategy to deploy the web service, Elastic MapReduce, and maximize the use of the cloud while simultaneously minimizing costs. Specifically, we created a model to estimate cloud runtime based on the size and complexity of the genomes being compared that determines in advance the optimal order of the jobs to be submitted. Results We computed orthologous relationships for 245,323 genome-to-genome comparisons on Amazon’s computing cloud, a computation that required just over 200 hours and cost $8,000 USD, at least 40% less than expected under a strategy in which genome comparisons were submitted to the cloud randomly with respect to runtime. Our cost savings projections were based on a model that not only demonstrates the optimal strategy for deploying RSD to the cloud, but also finds the optimal cluster size to minimize waste and maximize usage. Our cost-reduction model is readily adaptable for other comparative genomics tools and potentially of significant benefit to labs seeking to take advantage of the cloud as an alternative to local computing

  15. Kvik: three-tier data exploration tools for flexible analysis of genomic data in epidemiological studies

    PubMed Central

    Fjukstad, Bjørn; Standahl Olsen, Karina; Jareid, Mie; Lund, Eiliv; Bongo, Lars Ailo

    2015-01-01

    Kvik is an open-source framework that we developed for explorative analysis of functional genomics data from large epidemiological studies. Creating such studies requires a significant amount of time and resources. It is therefore usual to reuse the data from one study for several research projects. Often each project requires implementing new analysis code, integration with specific knowledge bases, and specific visualizations. Although existing data exploration tools are available for single study data exploration, no tool provides all the required functionality for multistudy data exploration. We have therefore used the Kvik framework to develop Kvik Pathways, an application for exploring gene expression data in the context of biological pathways. We have used Kvik Pathways to explore data from both a cross-sectional study design and a case-control study within the Norwegian Women and Cancer (NOWAC) cohort. Kvik Pathways follows the three-tier architecture in web applications using a powerful back-end for statistical analyses and retrieval of metadata.In this note, we describe how we used the Kvik framework to develop the Kvik Pathways application. Kvik Pathways was used by our team of epidemiologists toexplore gene expression data from healthy women with high and low plasma ratios of essential fatty acids. PMID:26425340

  16. Emerging Genomic Tools for Legume Breeding: Current Status and Future Prospects

    PubMed Central

    Pandey, Manish K.; Roorkiwal, Manish; Singh, Vikas K.; Ramalingam, Abirami; Kudapa, Himabindu; Thudi, Mahendar; Chitikineni, Anu; Rathore, Abhishek; Varshney, Rajeev K.

    2016-01-01

    Legumes play a vital role in ensuring global nutritional food security and improving soil quality through nitrogen fixation. Accelerated higher genetic gains is required to meet the demand of ever increasing global population. In recent years, speedy developments have been witnessed in legume genomics due to advancements in next-generation sequencing (NGS) and high-throughput genotyping technologies. Reference genome sequences for many legume crops have been reported in the last 5 years. The availability of the draft genome sequences and re-sequencing of elite genotypes for several important legume crops have made it possible to identify structural variations at large scale. Availability of large-scale genomic resources and low-cost and high-throughput genotyping technologies are enhancing the efficiency and resolution of genetic mapping and marker-trait association studies. Most importantly, deployment of molecular breeding approaches has resulted in development of improved lines in some legume crops such as chickpea and groundnut. In order to support genomics-driven crop improvement at a fast pace, the deployment of breeder-friendly genomics and decision support tools seems appear to be critical in breeding programs in developing countries. This review provides an overview of emerging genomics and informatics tools/approaches that will be the key driving force for accelerating genomics-assisted breeding and ultimately ensuring nutritional and food security in developing countries. PMID:27199998

  17. Emerging Genomic Tools for Legume Breeding: Current Status and Future Prospects.

    PubMed

    Pandey, Manish K; Roorkiwal, Manish; Singh, Vikas K; Ramalingam, Abirami; Kudapa, Himabindu; Thudi, Mahendar; Chitikineni, Anu; Rathore, Abhishek; Varshney, Rajeev K

    2016-01-01

    Legumes play a vital role in ensuring global nutritional food security and improving soil quality through nitrogen fixation. Accelerated higher genetic gains is required to meet the demand of ever increasing global population. In recent years, speedy developments have been witnessed in legume genomics due to advancements in next-generation sequencing (NGS) and high-throughput genotyping technologies. Reference genome sequences for many legume crops have been reported in the last 5 years. The availability of the draft genome sequences and re-sequencing of elite genotypes for several important legume crops have made it possible to identify structural variations at large scale. Availability of large-scale genomic resources and low-cost and high-throughput genotyping technologies are enhancing the efficiency and resolution of genetic mapping and marker-trait association studies. Most importantly, deployment of molecular breeding approaches has resulted in development of improved lines in some legume crops such as chickpea and groundnut. In order to support genomics-driven crop improvement at a fast pace, the deployment of breeder-friendly genomics and decision support tools seems appear to be critical in breeding programs in developing countries. This review provides an overview of emerging genomics and informatics tools/approaches that will be the key driving force for accelerating genomics-assisted breeding and ultimately ensuring nutritional and food security in developing countries.

  18. Functional phylogenomics analysis of bacteria and archaea using consistent genome annotation with UniFam

    SciTech Connect

    Chai, Juanjuan; Kora, Guruprasad; Ahn, Tae-Hyuk; Hyatt, Doug; Pan, Chongle

    2014-01-01

    To supply some background, phylogenetic studies have provided detailed knowledge on the evolutionary mechanisms of genes and species in Bacteria and Archaea. However, the evolution of cellular functions, represented by metabolic pathways and biological processes, has not been systematically characterized. Many clades in the prokaryotic tree of life have now been covered by sequenced genomes in GenBank. This enables a large-scale functional phylogenomics study of many computationally inferred cellular functions across all sequenced prokaryotes. Our results show a total of 14,727 GenBank prokaryotic genomes were re-annotated using a new protein family database, UniFam, to obtain consistent functional annotations for accurate comparison. The functional profile of a genome was represented by the biological process Gene Ontology (GO) terms in its annotation. The GO term enrichment analysis differentiated the functional profiles between selected archaeal taxa. 706 prokaryotic metabolic pathways were inferred from these genomes using Pathway Tools and MetaCyc. The consistency between the distribution of metabolic pathways in the genomes and the phylogenetic tree of the genomes was measured using parsimony scores and retention indices. The ancestral functional profiles at the internal nodes of the phylogenetic tree were reconstructed to track the gains and losses of metabolic pathways in evolutionary history. In conclusion, our functional phylogenomics analysis shows divergent functional profiles of taxa and clades. Such function-phylogeny correlation stems from a set of clade-specific cellular functions with low parsimony scores. On the other hand, many cellular functions are sparsely dispersed across many clades with high parsimony scores. These different types of cellular functions have distinct evolutionary patterns reconstructed from the prokaryotic tree.

  19. Functional phylogenomics analysis of bacteria and archaea using consistent genome annotation with UniFam

    DOE PAGES

    Chai, Juanjuan; Kora, Guruprasad; Ahn, Tae-Hyuk; ...

    2014-01-01

    To supply some background, phylogenetic studies have provided detailed knowledge on the evolutionary mechanisms of genes and species in Bacteria and Archaea. However, the evolution of cellular functions, represented by metabolic pathways and biological processes, has not been systematically characterized. Many clades in the prokaryotic tree of life have now been covered by sequenced genomes in GenBank. This enables a large-scale functional phylogenomics study of many computationally inferred cellular functions across all sequenced prokaryotes. Our results show a total of 14,727 GenBank prokaryotic genomes were re-annotated using a new protein family database, UniFam, to obtain consistent functional annotations for accuratemore » comparison. The functional profile of a genome was represented by the biological process Gene Ontology (GO) terms in its annotation. The GO term enrichment analysis differentiated the functional profiles between selected archaeal taxa. 706 prokaryotic metabolic pathways were inferred from these genomes using Pathway Tools and MetaCyc. The consistency between the distribution of metabolic pathways in the genomes and the phylogenetic tree of the genomes was measured using parsimony scores and retention indices. The ancestral functional profiles at the internal nodes of the phylogenetic tree were reconstructed to track the gains and losses of metabolic pathways in evolutionary history. In conclusion, our functional phylogenomics analysis shows divergent functional profiles of taxa and clades. Such function-phylogeny correlation stems from a set of clade-specific cellular functions with low parsimony scores. On the other hand, many cellular functions are sparsely dispersed across many clades with high parsimony scores. These different types of cellular functions have distinct evolutionary patterns reconstructed from the prokaryotic tree.« less

  20. Decoding the ecological function of accessory genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Shiga toxin-producing Escherichia coli O157:H7 primarily resides in cattle asymptomatically, and can be transmitted to humans through food. A study by Lupolova et al applied a machine-learning approach to complex pan-genome information and predicted that only a small subset of bovine isolates have t...

  1. Chemical genomics for studying parasite gene function and interaction

    PubMed Central

    Li, Jian; Yuan, Jing; Chen, Chin-chien; Inglese, James; Su, Xin-zhuan

    2013-01-01

    With the development of new technologies in genome sequencing, gene expression profiling, genotyping, and high-throughput screening of chemical compound libraries, small molecules are playing increasingly important roles in studying gene expression regulation, gene-gene interaction, and gene function. Here we briefly review and discuss some recent advancements in drug target identification and phenotype characterization using combinations of high-throughput screening of small-molecule libraries and various genome-wide methods such as whole genome sequencing, genome-wide association studies, and genome-wide expressional analysis. These approaches can be used to search for new drugs against parasitic infections, to identify drug targets or drug-resistance genes, and to infer gene function. PMID:24215777

  2. The capsicum transcriptome DB: a “hot” tool for genomic research

    PubMed Central

    Góngora-Castillo, Elsa; Fajardo-Jaime, Rubén; Fernández-Cortes, Araceli; Jofre-Garfias, Alba E; Lozoya-Gloria, Edmundo; Martínez, Octavio; Ochoa-Alejo, Neftalí; Rivera-Bustamante, Rafael

    2012-01-01

    Chili pepper (Capsicum annuum) is an economically important crop with no available public genome sequence. We describe a genomic resource to facilitate Capsicum annuum research. A collection of Expressed Sequence Tags (ESTs) derived from five C. annuum organs (root, stem, leaf, flower and fruit) were sequenced using the Sanger method and multiple leaf transcriptomes were deeply sampled using with GS-pyrosequencing. A hybrid assembly of 1,324,516 raw reads yielded 32,314 high quality contigs as validated by coverage and identity analysis with existing pepper sequences. Overall, 75.5% of the contigs had significant sequence similarity to entries in nucleic acid and protein databases; 23% of the sequences have not been previously reported for C. annuum and expand sequence resources for this species. A MySQL database and a user-friendly Web interface were constructed with search-tools that permit queries of the ESTs including sequence, functional annotation, Gene Ontology classification, metabolic pathways, and assembly information. The Capsicum Transcriptome DB is free available from http://www.bioingenios.ira.cinvestav.mx:81/Joomla/ PMID:22359434

  3. Characterizing genomic alterations in cancer by complementary functional associations | Office of Cancer Genomics

    Cancer.gov

    Systematic efforts to sequence the cancer genome have identified large numbers of mutations and copy number alterations in human cancers. However, elucidating the functional consequences of these variants, and their interactions to drive or maintain oncogenic states, remains a challenge in cancer research. We developed REVEALER, a computational method that identifies combinations of mutually exclusive genomic alterations correlated with functional phenotypes, such as the activation or gene dependency of oncogenic pathways or sensitivity to a drug treatment.

  4. Iterative ACORN as a high throughput tool in structural genomics.

    PubMed

    Selvanayagam, S; Velmurugan, D; Yamane, T

    2006-08-01

    High throughput macromolecular structure determination is very essential in structural genomics as the available number of sequence information far exceeds the number of available 3D structures. ACORN, a freely available resource in the CCP4 suite of programs is a comprehensive and efficient program for phasing in the determination of protein structures, when atomic resolution data are available. ACORN with the automatic model-building program ARP/wARP and refinement program REFMAC is a suitable combination for the high throughput structural genomics. ACORN can also be run with secondary structural elements like helices and sheets as inputs with high resolution data. In situations, where ACORN phasing is not sufficient for building the protein model, the fragments (incomplete model/dummy atoms) can again be used as a starting input. Iterative ACORN is proved to work efficiently in the subsequent model building stages in congerin (PDB-ID: lis3) and catalase (PDB-ID: 1gwe) for which models are available.

  5. DivStat: A User-Friendly Tool for Single Nucleotide Polymorphism Analysis of Genomic Diversity

    PubMed Central

    Soares, Inês; Moleirinho, Ana; Oliveira, Gonçalo N. P.; Amorim, António

    2015-01-01

    Recent developments have led to an enormous increase of publicly available large genomic data, including complete genomes. The 1000 Genomes Project was a major contributor, releasing the results of sequencing a large number of individual genomes, and allowing for a myriad of large scale studies on human genetic variation. However, the tools currently available are insufficient when the goal concerns some analyses of data sets encompassing more than hundreds of base pairs and when considering haplotype sequences of single nucleotide polymorphisms (SNPs). Here, we present a new and potent tool to deal with large data sets allowing the computation of a variety of summary statistics of population genetic data, increasing the speed of data analysis. PMID:25756185

  6. Bioinformatics tools and databases for whole genome sequence analysis of Mycobacterium tuberculosis.

    PubMed

    Faksri, Kiatichai; Tan, Jun Hao; Chaiprasert, Angkana; Teo, Yik-Ying; Ong, Rick Twee-Hee

    2016-11-01

    Tuberculosis (TB) is an infectious disease of global public health importance caused by Mycobacterium tuberculosis complex (MTC) in which M. tuberculosis (Mtb) is the major causative agent. Recent advancements in genomic technologies such as next generation sequencing have enabled high throughput cost-effective generation of whole genome sequence information from Mtb clinical isolates, providing new insights into the evolution, genomic diversity and transmission of the Mtb bacteria, including molecular mechanisms of antibiotic resistance. The large volume of sequencing data generated however necessitated effective and efficient management, storage, analysis and visualization of the data and results through development of novel and customized bioinformatics software tools and databases. In this review, we aim to provide a comprehensive survey of the current freely available bioinformatics software tools and publicly accessible databases for genomic analysis of Mtb for identifying disease transmission in molecular epidemiology and in rapid determination of the antibiotic profiles of clinical isolates for prompt and optimal patient treatment.

  7. Defining functional DNA elements in the human genome.

    PubMed

    Kellis, Manolis; Wold, Barbara; Snyder, Michael P; Bernstein, Bradley E; Kundaje, Anshul; Marinov, Georgi K; Ward, Lucas D; Birney, Ewan; Crawford, Gregory E; Dekker, Job; Dunham, Ian; Elnitski, Laura L; Farnham, Peggy J; Feingold, Elise A; Gerstein, Mark; Giddings, Morgan C; Gilbert, David M; Gingeras, Thomas R; Green, Eric D; Guigo, Roderic; Hubbard, Tim; Kent, Jim; Lieb, Jason D; Myers, Richard M; Pazin, Michael J; Ren, Bing; Stamatoyannopoulos, John A; Weng, Zhiping; White, Kevin P; Hardison, Ross C

    2014-04-29

    With the completion of the human genome sequence, attention turned to identifying and annotating its functional DNA elements. As a complement to genetic and comparative genomics approaches, the Encyclopedia of DNA Elements Project was launched to contribute maps of RNA transcripts, transcriptional regulator binding sites, and chromatin states in many cell types. The resulting genome-wide data reveal sites of biochemical activity with high positional resolution and cell type specificity that facilitate studies of gene regulation and interpretation of noncoding variants associated with human disease. However, the biochemically active regions cover a much larger fraction of the genome than do evolutionarily conserved regions, raising the question of whether nonconserved but biochemically active regions are truly functional. Here, we review the strengths and limitations of biochemical, evolutionary, and genetic approaches for defining functional DNA segments, potential sources for the observed differences in estimated genomic coverage, and the biological implications of these discrepancies. We also analyze the relationship between signal intensity, genomic coverage, and evolutionary conservation. Our results reinforce the principle that each approach provides complementary information and that we need to use combinations of all three to elucidate genome function in human biology and disease.

  8. Defining functional DNA elements in the human genome

    PubMed Central

    Kellis, Manolis; Wold, Barbara; Snyder, Michael P.; Bernstein, Bradley E.; Kundaje, Anshul; Marinov, Georgi K.; Ward, Lucas D.; Birney, Ewan; Crawford, Gregory E.; Dekker, Job; Dunham, Ian; Elnitski, Laura L.; Farnham, Peggy J.; Feingold, Elise A.; Gerstein, Mark; Giddings, Morgan C.; Gilbert, David M.; Gingeras, Thomas R.; Green, Eric D.; Guigo, Roderic; Hubbard, Tim; Kent, Jim; Lieb, Jason D.; Myers, Richard M.; Pazin, Michael J.; Ren, Bing; Stamatoyannopoulos, John A.; Weng, Zhiping; White, Kevin P.; Hardison, Ross C.

    2014-01-01

    With the completion of the human genome sequence, attention turned to identifying and annotating its functional DNA elements. As a complement to genetic and comparative genomics approaches, the Encyclopedia of DNA Elements Project was launched to contribute maps of RNA transcripts, transcriptional regulator binding sites, and chromatin states in many cell types. The resulting genome-wide data reveal sites of biochemical activity with high positional resolution and cell type specificity that facilitate studies of gene regulation and interpretation of noncoding variants associated with human disease. However, the biochemically active regions cover a much larger fraction of the genome than do evolutionarily conserved regions, raising the question of whether nonconserved but biochemically active regions are truly functional. Here, we review the strengths and limitations of biochemical, evolutionary, and genetic approaches for defining functional DNA segments, potential sources for the observed differences in estimated genomic coverage, and the biological implications of these discrepancies. We also analyze the relationship between signal intensity, genomic coverage, and evolutionary conservation. Our results reinforce the principle that each approach provides complementary information and that we need to use combinations of all three to elucidate genome function in human biology and disease. PMID:24753594

  9. Cognitive Tools and User-Centered Learning Environments: Rethinking Tools, Functions, and Applications.

    ERIC Educational Resources Information Center

    Iiyoshi, Toru; Hannafin, Michael J.

    This paper introduces and analyzes problems and issues in the design and use of cognitive tools in open, user-centered learning environments. It introduces a classification scheme for tool functions, and showcases several tools in a current educational hypermedia research and development effort. Information-seeking, information-presentation,…

  10. Gnome View: A tool for visual representation of human genome data

    SciTech Connect

    Pelkey, J.E.; Thomas, G.S.; Thurman, D.A.; Lortz, V.B.; Douthart, R.J.

    1993-02-01

    GnomeView is a tool for exploring data generated by the Human Gemone Project. GnomeView provides both graphical and textural styles of data presentation: employs an intuitive window-based graphical query interface: and integrates its underlying genome databases in such a way that the user can navigate smoothly across databases and between different levels of data. This paper describes GnomeView and discusses how it addresses various genome informatics issues.

  11. Conservation and functional element discovery in 20 angiosperm plant genomes.

    PubMed

    Hupalo, Daniel; Kern, Andrew D

    2013-07-01

    Here, we describe the construction of a phylogenetically deep, whole-genome alignment of 20 flowering plants, along with an analysis of plant genome conservation. Each included angiosperm genome was aligned to a reference genome, Arabidopsis thaliana, using the LASTZ/MULTIZ paradigm and tools from the University of California-Santa Cruz Genome Browser source code. In addition to the multiple alignment, we created a local genome browser displaying multiple tracks of newly generated genome annotation, as well as annotation sourced from published data of other research groups. An investigation into A. thaliana gene features present in the aligned A. lyrata genome revealed better conservation of start codons, stop codons, and splice sites within our alignments (51% of features from A. thaliana conserved without interruption in A. lyrata) when compared with previous publicly available plant pairwise alignments (34% of features conserved). The detailed view of conservation across angiosperms revealed not only high coding-sequence conservation but also a large set of previously uncharacterized intergenic conservation. From this, we annotated the collection of conserved features, revealing dozens of putative noncoding RNAs, including some with recorded small RNA expression. Comparing conservation between kingdoms revealed a faster decay of vertebrate genome features when compared with angiosperm genomes. Finally, conserved sequences were searched for folding RNA features, including but not limited to noncoding RNA (ncRNA) genes. Among these, we highlight a double hairpin in the 5'-untranslated region (5'-UTR) of the PRIN2 gene and a putative ncRNA with homology targeting the LAF3 protein.

  12. Genomic Tools in Pearl Millet Breeding for Drought Tolerance: Status and Prospects

    PubMed Central

    Serba, Desalegn D.; Yadav, Rattan S.

    2016-01-01

    Pearl millet [Penisetum glaucum (L) R. Br.] is a hardy cereal crop grown in the arid and semiarid tropics where other cereals are likely to fail to produce economic yields due to drought and heat stresses. Adaptive evolution, a form of natural selection shaped the crop to grow and yield satisfactorily with limited moisture supply or under periodic water deficits in the soil. Drought tolerance is a complex polygenic trait that various morphological and physiological responses are controlled by 100s of genes and significantly influenced by the environment. The development of genomic tools will have enormous potential to improve the efficiency and precision of conventional breeding. The apparent independent domestication events, highly outcrossing nature and traditional cultivation in stressful environments maintained tremendous amount of polymorphism in pearl millet. This high polymorphism of the crop has been revealed by genome mapping that in turn stimulated the mapping and tagging of genomic regions controlling important traits such as drought tolerance. Mapping of a major QTL for terminal drought tolerance in independent populations envisaged the prospect for the development of molecular breeding in pearl millet. To accelerate genetic gains for drought tolerance targeted novel approaches such as establishment of marker-trait associations, genomic selection tools, genome sequence and genotyping-by-sequencing are still limited. Development and application of high throughput genomic tools need to be intensified to improve the breeding efficiency of pearl millet to minimize the impact of climate change on its production. PMID:27920783

  13. Genomic Tools in Pearl Millet Breeding for Drought Tolerance: Status and Prospects.

    PubMed

    Serba, Desalegn D; Yadav, Rattan S

    2016-01-01

    Pearl millet [Penisetum glaucum (L) R. Br.] is a hardy cereal crop grown in the arid and semiarid tropics where other cereals are likely to fail to produce economic yields due to drought and heat stresses. Adaptive evolution, a form of natural selection shaped the crop to grow and yield satisfactorily with limited moisture supply or under periodic water deficits in the soil. Drought tolerance is a complex polygenic trait that various morphological and physiological responses are controlled by 100s of genes and significantly influenced by the environment. The development of genomic tools will have enormous potential to improve the efficiency and precision of conventional breeding. The apparent independent domestication events, highly outcrossing nature and traditional cultivation in stressful environments maintained tremendous amount of polymorphism in pearl millet. This high polymorphism of the crop has been revealed by genome mapping that in turn stimulated the mapping and tagging of genomic regions controlling important traits such as drought tolerance. Mapping of a major QTL for terminal drought tolerance in independent populations envisaged the prospect for the development of molecular breeding in pearl millet. To accelerate genetic gains for drought tolerance targeted novel approaches such as establishment of marker-trait associations, genomic selection tools, genome sequence and genotyping-by-sequencing are still limited. Development and application of high throughput genomic tools need to be intensified to improve the breeding efficiency of pearl millet to minimize the impact of climate change on its production.

  14. Coordinated international action to accelerate genome-to-phenome with FAANG, The Functional Annotation of Animal Genomes project

    Technology Transfer Automated Retrieval System (TEKTRAN)

    We describe the organization of a nascent international effort - the "Functional Annotation of ANimal Genomes" project - whose aim is to produce comprehensive maps of functional elements in the genomes of domesticated animal species....

  15. 3D genome structure modeling by Lorentzian objective function.

    PubMed

    Trieu, Tuan; Cheng, Jianlin

    2016-11-29

    The 3D structure of the genome plays a vital role in biological processes such as gene interaction, gene regulation, DNA replication and genome methylation. Advanced chromosomal conformation capture techniques, such as Hi-C and tethered conformation capture, can generate chromosomal contact data that can be used to computationally reconstruct 3D structures of the genome. We developed a novel restraint-based method that is capable of reconstructing 3D genome structures utilizing both intra-and inter-chromosomal contact data. Our method was robust to noise and performed well in comparison with a panel of existing methods on a controlled simulated data set. On a real Hi-C data set of the human genome, our method produced chromosome and genome structures that are consistent with 3D FISH data and known knowledge about the human chromosome and genome, such as, chromosome territories and the cluster of small chromosomes in the nucleus center with the exception of the chromosome 18. The tool and experimental data are available at https://missouri.box.com/v/LorDG.

  16. Application of Functional Genomics for Bovine Respiratory Disease Diagnostics

    PubMed Central

    Rai, Aswathy N.; Epperson, William B.; Nanduri, Bindu

    2015-01-01

    Bovine respiratory disease (BRD) is the most common economically important disease affecting cattle. For developing accurate diagnostics that can predict disease susceptibility/resistance and stratification, it is necessary to identify the molecular mechanisms that underlie BRD. To study the complex interactions among the bovine host and the multitude of viral and bacterial pathogens, as well as the environmental factors associated with BRD etiology, genome-scale high-throughput functional genomics methods such as microarrays, RNA-seq, and proteomics are helpful. In this review, we summarize the progress made in our understanding of BRD using functional genomics approaches. We also discuss some of the available bioinformatics resources for analyzing high-throughput data, in the context of biological pathways and molecular interactions. Although resources for studying host response to infection are avail-able, the corresponding information is lacking for majority of BRD pathogens, impeding progress in identifying diagnostic signatures for BRD using functional genomics approaches. PMID:26526746

  17. Application of Functional Genomics for Bovine Respiratory Disease Diagnostics.

    PubMed

    Rai, Aswathy N; Epperson, William B; Nanduri, Bindu

    2015-01-01

    Bovine respiratory disease (BRD) is the most common economically important disease affecting cattle. For developing accurate diagnostics that can predict disease susceptibility/resistance and stratification, it is necessary to identify the molecular mechanisms that underlie BRD. To study the complex interactions among the bovine host and the multitude of viral and bacterial pathogens, as well as the environmental factors associated with BRD etiology, genome-scale high-throughput functional genomics methods such as microarrays, RNA-seq, and proteomics are helpful. In this review, we summarize the progress made in our understanding of BRD using functional genomics approaches. We also discuss some of the available bioinformatics resources for analyzing high-throughput data, in the context of biological pathways and molecular interactions. Although resources for studying host response to infection are avail-able, the corresponding information is lacking for majority of BRD pathogens, impeding progress in identifying diagnostic signatures for BRD using functional genomics approaches.

  18. Determining protein function and interaction from genome analysis

    DOEpatents

    Eisenberg, David; Marcotte, Edward M.; Thompson, Michael J.; Pellegrini, Matteo; Yeates, Todd O.

    2004-08-03

    A computational method system, and computer program are provided for inferring functional links from genome sequences. One method is based on the observation that some pairs of proteins A' and B' have homologs in another organism fused into a single protein chain AB. A trans-genome comparison of sequences can reveal these AB sequences, which are Rosetta Stone sequences because they decipher an interaction between A' and B. Another method compares the genomic sequence of two or more organisms to create a phylogenetic profile for each protein indicating its presence or absence across all the genomes. The profile provides information regarding functional links between different families of proteins. In yet another method a combination of the above two methods is used to predict functional links.

  19. Assigning protein functions by comparative genome analysis protein phylogenetic profiles

    DOEpatents

    Pellegrini, Matteo; Marcotte, Edward M.; Thompson, Michael J.; Eisenberg, David; Grothe, Robert; Yeates, Todd O.

    2003-05-13

    A computational method system, and computer program are provided for inferring functional links from genome sequences. One method is based on the observation that some pairs of proteins A' and B' have homologs in another organism fused into a single protein chain AB. A trans-genome comparison of sequences can reveal these AB sequences, which are Rosetta Stone sequences because they decipher an interaction between A' and B. Another method compares the genomic sequence of two or more organisms to create a phylogenetic profile for each protein indicating its presence or absence across all the genomes. The profile provides information regarding functional links between different families of proteins. In yet another method a combination of the above two methods is used to predict functional links.

  20. Functional coverage of the human genome by existing structures, structural genomics targets, and homology models.

    PubMed

    Xie, Lei; Bourne, Philip E

    2005-08-01

    The bias in protein structure and function space resulting from experimental limitations and targeting of particular functional classes of proteins by structural biologists has long been recognized, but never continuously quantified. Using the Enzyme Commission and the Gene Ontology classifications as a reference frame, and integrating structure data from the Protein Data Bank (PDB), target sequences from the structural genomics projects, structure homology derived from the SUPERFAMILY database, and genome annotations from Ensembl and NCBI, we provide a quantified view, both at the domain and whole-protein levels, of the current and projected coverage of protein structure and function space relative to the human genome. Protein structures currently provide at least one domain that covers 37% of the functional classes identified in the genome; whole structure coverage exists for 25% of the genome. If all the structural genomics targets were solved (twice the current number of structures in the PDB), it is estimated that structures of one domain would cover 69% of the functional classes identified and complete structure coverage would be 44%. Homology models from existing experimental structures extend the 37% coverage to 56% of the genome as single domains and 25% to 31% for complete structures. Coverage from homology models is not evenly distributed by protein family, reflecting differing degrees of sequence and structure divergence within families. While these data provide coverage, conversely, they also systematically highlight functional classes of proteins for which structures should be determined. Current key functional families without structure representation are highlighted here; updated information on the "most wanted list" that should be solved is available on a weekly basis from http://function.rcsb.org:8080/pdb/function_distribution/index.html.

  1. Enzyme Function Initiative-Enzyme Similarity Tool (EFI-EST): A web tool for generating protein sequence similarity networks

    PubMed Central

    Gerlt, John A.; Bouvier, Jason T.; Davidson, Daniel B.; Imker, Heidi J.; Sadkhin, Boris; Slater, David R.; Whalen, Katie L.

    2015-01-01

    The Enzyme Function Initiative, an NIH/NIGMS-supported Large-Scale Collaborative Project (EFI; U54GM093342; http://enzymefunction.org/), is focused on devising and disseminating bioinformatics and computational tools as well as experimental strategies for the prediction and assignment of functions (in vitro activities and in vivo physiological/metabolic roles) to uncharacterized enzymes discovered in genome projects. Protein sequence similarity networks (SSNs) are visually powerful tools for analyzing sequence relationships in protein families (H.J. Atkinson, J.H. Morris, T.E. Ferrin, and P.C. Babbitt, PLoS One 2009, 4, e4345). However, the members of the biological/biomedical community have not had access to the capability to generate SSNs for their “favorite” protein families. In this article we announce the EFI-EST (Enzyme Function Initiative-Enzyme Similarity Tool) web tool (http://efi.igb.illinois.edu/efi-est/) that is available without cost for the automated generation of SSNs by the community. The tool can create SSNs for the “closest neighbors” of a user-supplied protein sequence from the UniProt database (Option A) or of members of any user-supplied Pfam and/or InterPro family (Option B). We provide an introduction to SSNs, a description of EFI-EST, and a demonstration of the use of EFI-EST to explore sequence-function space in the OMP decarboxylase superfamily (PF00215). This article is designed as a tutorial that will allow members of the community to use the EFI-EST web tool for exploring sequence/function space in protein families. PMID:25900361

  2. BioViews: Java-Based Tools for Genomic Data Visualization

    PubMed Central

    Helt, Gregg A.; Lewis, Suzanna; Loraine, Ann E.; Rubin, Gerald M.

    1998-01-01

    Visualization tools for bioinformatics ideally should provide universal access to the most current data in an interactive and intuitive graphical user interface. Since the introduction of Java, a language designed for distributed programming over the Web, the technology now exists to build a genomic data visualization tool that meets these requirements. Using Java we have developed a prototype genome browser applet (BioViews) that incorporates a three-level graphical view of genomic data: a physical map, an annotated sequence map, and a DNA sequence display. Annotated biological features are displayed on the physical and sequence-based maps, and the different views are interconnected. The applet is linked to several databases and can retrieve features and display hyperlinked textual data on selected features. In addition to browsing genomic data, different types of analyses can be performed interactively and the results of these analyses visualized alongside prior annotations. Our genome browser is built on top of extensible, reusable graphic components specifically designed for bioinformatics. Other groups can (and do) reuse this work in various ways. Genome centers can reuse large parts of the genome browser with minor modifications, bioinformatics groups working on sequence analysis can reuse components to build front ends for analysis programs, and biology laboratories can reuse components to publish results as dynamic Web documents. PMID:9521932

  3. FEATnotator: A tool for integrated annotation of sequence features and variation, facilitating interpretation in genomics experiments.

    PubMed

    Podicheti, Ram; Mockaitis, Keithanne

    2015-06-01

    As approaches are sought for more efficient and democratized uses of non-model and expanded model genomics references, ease of integration of genomic feature datasets is especially desirable in multidisciplinary research communities. Valuable conclusions are often missed or slowed when researchers refer experimental results to a single reference sequence that lacks integrated pan-genomic and multi-experiment data in accessible formats. Association of genomic positional information, such as results from an expansive variety of next-generation sequencing experiments, with annotated reference features such as genes or predicted protein binding sites, provides the context essential for conclusions and ongoing research. When the experimental system includes polymorphic genomic inputs, rapid calculation of gene structural and protein translational effects of sequence variation from the reference can be invaluable. Here we present FEATnotator, a lightweight, fast and easy to use open source software program that integrates and reports overlap and proximity in genomic information from any user-defined datasets including those from next generation sequencing applications. We illustrate use of the tool by summarizing whole genome sequence variation of a widely used natural isolate of Arabidopsis thaliana in the context of gene models of the reference accession. Previous discovery of a protein coding deletion influencing root development is replicated rapidly. Appropriate even in investigations of a single gene or genic regions such as QTL, comprehensive reports provided by FEATnotator better prepare researchers for interpretation of their experimental results. The tool is available for download at http://featnotator.sourceforge.net.

  4. GEnomes Management Application (GEM.app): A new software tool for large-scale collaborative genome analysis

    PubMed Central

    Gonzalez, Michael A.; Acosta Lebrigio, Rafael F.; Van Booven, Derek; Ulloa, Rick H.; Powell, Eric; Speziani, Fiorella; Tekin, Mustafa; Schule, Rebecca; Zuchner, Stephan

    2015-01-01

    Novel genes are now identified at a rapid pace for many Mendelian disorders, and increasingly, for genetically complex phenotypes. However, new challenges have also become evident: (1) effectively managing larger exome and/or genome datasets, especially for smaller labs; (2) direct hands-on analysis and contextual interpretation of variant data in large genomic datasets; and (3) many small and medium-sized clinical and research-based investigative teams around the world are generating data that, if combined and shared, will significantly increase the opportunities for the entire community to identify new genes. To address these challenges we have developed GEnomes Management Application (GEM.app), a software tool to annotate, manage, visualize, and analyze large genomic datasets (https://genomics.med.miami.edu/). GEM.app currently contains ~1,600 whole exomes from 50 different phenotypes studied by 40 principal investigators from 15 different countries. The focus of GEM.app is on user-friendly analysis for non-bioinformaticians to make NGS data directly accessible. Yet, GEM.app provides powerful and flexible filter options, including single family filtering, across family/phenotype queries, nested filtering, and evaluation of segregation in families. In addition, the system is fast, obtaining results within 4 seconds across ~1,200 exomes. We believe that this system will further enhance identification of genetic causes of human disease. PMID:23463597

  5. DNA-binding protein prediction using plant specific support vector machines: validation and application of a new genome annotation tool.

    PubMed

    Motion, Graham B; Howden, Andrew J M; Huitema, Edgar; Jones, Susan

    2015-12-15

    There are currently 151 plants with draft genomes available but levels of functional annotation for putative protein products are low. Therefore, accurate computational predictions are essential to annotate genomes in the first instance, and to provide focus for the more costly and time consuming functional assays that follow. DNA-binding proteins are an important class of proteins that require annotation, but current computational methods are not applicable for genome wide predictions in plant species. Here, we explore the use of species and lineage specific models for the prediction of DNA-binding proteins in plants. We show that a species specific support vector machine model based on Arabidopsis sequence data is more accurate (accuracy 81%) than a generic model (74%), and based on this we develop a plant specific model for predicting DNA-binding proteins. We apply this model to the tomato proteome and demonstrate its ability to perform accurate high-throughput prediction of DNA-binding proteins. In doing so, we have annotated 36 currently uncharacterised proteins by assigning a putative DNA-binding function. Our model is publically available and we propose it be used in combination with existing tools to help increase annotation levels of DNA-binding proteins encoded in plant genomes.

  6. The Dockstore: enabling modular, community-focused sharing of Docker-based genomics tools and workflows

    PubMed Central

    O'Connor, Brian D.; Yuen, Denis; Chung, Vincent; Duncan, Andrew G.; Liu, Xiang Kun; Patricia, Janice; Paten, Benedict; Stein, Lincoln; Ferretti, Vincent

    2017-01-01

    As genomic datasets continue to grow, the feasibility of downloading data to a local organization and running analysis on a traditional compute environment is becoming increasingly problematic. Current large-scale projects, such as the ICGC PanCancer Analysis of Whole Genomes (PCAWG), the Data Platform for the U.S. Precision Medicine Initiative, and the NIH Big Data to Knowledge Center for Translational Genomics, are using cloud-based infrastructure to both host and perform analysis across large data sets. In PCAWG, over 5,800 whole human genomes were aligned and variant called across 14 cloud and HPC environments; the processed data was then made available on the cloud for further analysis and sharing. If run locally, an operation at this scale would have monopolized a typical academic data centre for many months, and would have presented major challenges for data storage and distribution. However, this scale is increasingly typical for genomics projects and necessitates a rethink of how analytical tools are packaged and moved to the data. For PCAWG, we embraced the use of highly portable Docker images for encapsulating and sharing complex alignment and variant calling workflows across highly variable environments. While successful, this endeavor revealed a limitation in Docker containers, namely the lack of a standardized way to describe and execute the tools encapsulated inside the container. As a result, we created the Dockstore ( https://dockstore.org), a project that brings together Docker images with standardized, machine-readable ways of describing and running the tools contained within. This service greatly improves the sharing and reuse of genomics tools and promotes interoperability with similar projects through emerging web service standards developed by the Global Alliance for Genomics and Health (GA4GH). PMID:28344774

  7. The Dockstore: enabling modular, community-focused sharing of Docker-based genomics tools and workflows.

    PubMed

    O'Connor, Brian D; Yuen, Denis; Chung, Vincent; Duncan, Andrew G; Liu, Xiang Kun; Patricia, Janice; Paten, Benedict; Stein, Lincoln; Ferretti, Vincent

    2017-01-01

    As genomic datasets continue to grow, the feasibility of downloading data to a local organization and running analysis on a traditional compute environment is becoming increasingly problematic. Current large-scale projects, such as the ICGC PanCancer Analysis of Whole Genomes (PCAWG), the Data Platform for the U.S. Precision Medicine Initiative, and the NIH Big Data to Knowledge Center for Translational Genomics, are using cloud-based infrastructure to both host and perform analysis across large data sets. In PCAWG, over 5,800 whole human genomes were aligned and variant called across 14 cloud and HPC environments; the processed data was then made available on the cloud for further analysis and sharing. If run locally, an operation at this scale would have monopolized a typical academic data centre for many months, and would have presented major challenges for data storage and distribution. However, this scale is increasingly typical for genomics projects and necessitates a rethink of how analytical tools are packaged and moved to the data. For PCAWG, we embraced the use of highly portable Docker images for encapsulating and sharing complex alignment and variant calling workflows across highly variable environments. While successful, this endeavor revealed a limitation in Docker containers, namely the lack of a standardized way to describe and execute the tools encapsulated inside the container. As a result, we created the Dockstore ( https://dockstore.org), a project that brings together Docker images with standardized, machine-readable ways of describing and running the tools contained within. This service greatly improves the sharing and reuse of genomics tools and promotes interoperability with similar projects through emerging web service standards developed by the Global Alliance for Genomics and Health (GA4GH).

  8. Assessing the impact of comparative genomic sequence data on the functional annotation of the Drosophila genome

    PubMed Central

    Bergman, Casey M; Pfeiffer, Barret D; Rincón-Limas, Diego E; Hoskins, Roger A; Gnirke, Andreas; Mungall, Chris J; Wang, Adrienne M; Kronmiller, Brent; Pacleb, Joanne; Park, Soo; Stapleton, Mark; Wan, Kenneth; George, Reed A; de Jong, Pieter J; Botas, Juan; Rubin, Gerald M; Celniker, Susan E

    2002-01-01

    Background It is widely accepted that comparative sequence data can aid the functional annotation of genome sequences; however, the most informative species and features of genome evolution for comparison remain to be determined. Results We analyzed conservation in eight genomic regions (apterous, even-skipped, fushi tarazu, twist, and Rhodopsins 1, 2, 3 and 4) from four Drosophila species (D. erecta, D. pseudoobscura, D. willistoni, and D. littoralis) covering more than 500 kb of the D. melanogaster genome. All D. melanogaster genes (and 78-82% of coding exons) identified in divergent species such as D. pseudoobscura show evidence of functional constraint. Addition of a third species can reveal functional constraint in otherwise non-significant pairwise exon comparisons. Microsynteny is largely conserved, with rearrangement breakpoints, novel transposable element insertions, and gene transpositions occurring in similar numbers. Rates of amino-acid substitution are higher in uncharacterized genes relative to genes that have previously been studied. Conserved non-coding sequences (CNCSs) tend to be spatially clustered with conserved spacing between CNCSs, and clusters of CNCSs can be used to predict enhancer sequences. Conclusions Our results provide the basis for choosing species whose genome sequences would be most useful in aiding the functional annotation of coding and cis-regulatory sequences in Drosophila. Furthermore, this work shows how decoding the spatial organization of conserved sequences, such as the clustering of CNCSs, can complement efforts to annotate eukaryotic genomes on the basis of sequence conservation alone. PMID:12537575

  9. In silico Comparison of 19 Porphyromonas gingivalis Strains in Genomics, Phylogenetics, Phylogenomics and Functional Genomics.

    PubMed

    Chen, Tsute; Siddiqui, Huma; Olsen, Ingar

    2017-01-01

    Currently, genome sequences of a total of 19 Porphyromonas gingivalis strains are available, including eight completed genomes (strains W83, ATCC 33277, TDC60, HG66, A7436, AJW4, 381, and A7A1-28) and 11 high-coverage draft sequences (JCVI SC001, F0185, F0566, F0568, F0569, F0570, SJD2, W4087, W50, Ando, and MP4-504) that are assembled into fewer than 300 contigs. The objective was to compare these genomes at both nucleotide and protein sequence levels in order to understand their phylogenetic and functional relatedness. Four copies of 16S rRNA gene sequences were identified in each of the eight complete genomes and one in the other 11 unfinished genomes. These 43 16S rRNA sequences represent only 24 unique sequences and the derived phylogenetic tree suggests a possible evolutionary history for these strains. Phylogenomic comparison based on shared proteins and whole genome nucleotide sequences consistently showed two groups with closely related members: one consisted of ATCC 33277, 381, and HG66, another of W83, W50, and A7436. At least 1,037 core/shared proteins were identified in the 19 P. gingivalis genomes based on the most stringent detecting parameters. Comparative functional genomics based on genome-wide comparisons between NCBI and RAST annotations, as well as additional approaches, revealed functions that are unique or missing in individual P. gingivalis strains, or species-specific in all P. gingivalis strains, when compared to a neighboring species P. asaccharolytica. All the comparative results of this study are available online for download at ftp://www.homd.org/publication_data/20160425/.

  10. In silico Comparison of 19 Porphyromonas gingivalis Strains in Genomics, Phylogenetics, Phylogenomics and Functional Genomics

    PubMed Central

    Chen, Tsute; Siddiqui, Huma; Olsen, Ingar

    2017-01-01

    Currently, genome sequences of a total of 19 Porphyromonas gingivalis strains are available, including eight completed genomes (strains W83, ATCC 33277, TDC60, HG66, A7436, AJW4, 381, and A7A1-28) and 11 high-coverage draft sequences (JCVI SC001, F0185, F0566, F0568, F0569, F0570, SJD2, W4087, W50, Ando, and MP4-504) that are assembled into fewer than 300 contigs. The objective was to compare these genomes at both nucleotide and protein sequence levels in order to understand their phylogenetic and functional relatedness. Four copies of 16S rRNA gene sequences were identified in each of the eight complete genomes and one in the other 11 unfinished genomes. These 43 16S rRNA sequences represent only 24 unique sequences and the derived phylogenetic tree suggests a possible evolutionary history for these strains. Phylogenomic comparison based on shared proteins and whole genome nucleotide sequences consistently showed two groups with closely related members: one consisted of ATCC 33277, 381, and HG66, another of W83, W50, and A7436. At least 1,037 core/shared proteins were identified in the 19 P. gingivalis genomes based on the most stringent detecting parameters. Comparative functional genomics based on genome-wide comparisons between NCBI and RAST annotations, as well as additional approaches, revealed functions that are unique or missing in individual P. gingivalis strains, or species-specific in all P. gingivalis strains, when compared to a neighboring species P. asaccharolytica. All the comparative results of this study are available online for download at ftp://www.homd.org/publication_data/20160425/. PMID:28261563

  11. Mapping by sequencing the Pneumocystis genome using the ordering DNA sequences V3 tool.

    PubMed Central

    Xu, Zheng; Lance, Britton; Vargas, Claudia; Arpinar, Budak; Bhandarkar, Suchendra; Kraemer, Eileen; Kochut, Krys J; Miller, John A; Wagner, Jeff R; Weise, Michael J; Wunderlich, John K; Stringer, James; Smulian, George; Cushion, Melanie T; Arnold, Jonathan

    2003-01-01

    A bioinformatics tool called ODS3 has been created for mapping by sequencing. The tool allows the creation of integrated genomic maps from genetic, physical mapping, and sequencing data and permits an integrated genome map to be stored, retrieved, viewed, and queried in a stand-alone capacity, in a client/server relationship with the Fungal Genome Database (FGDB), and as a web-browsing tool for the FGDB. In that ODS3 is programmed in Java, the tool promotes platform independence and supports export of integrated genome-mapping data in the extensible markup language (XML) for data interchange with other genome information systems. The tool ODS3 is used to create an initial integrated genome map of the AIDS-related fungal pathogen, Pneumocystis carinii. Contig dynamics would indicate that this physical map is approximately 50% complete with approximately 200 contigs. A total of 10 putative multigene families were found. Two of these putative families were previously characterized in P. carinii, namely the major surface glycoproteins (MSGs) and HSP70 proteins; three of these putative families (not previously characterized in P. carinii) were found to be similar to families encoding the HSP60 in Schizosaccharomyces pombe, the heat-shock psi protein in S. pombe, and the RNA synthetase family (i.e., MES1) in Saccharomyces cerevisiae. Physical mapping data are consistent with the 16S, 5.8S, and 26S rDNA genes being single copy in P. carinii. No other fungus outside this genus is known to have the rDNA genes in single copy. PMID:12702676

  12. Accelerating Genome Editing in CHO Cells Using CRISPR Cas9 and CRISPy, a Web-Based Target Finding Tool

    PubMed Central

    Ronda, Carlotta; Pedersen, Lasse Ebdrup; Hansen, Henning Gram; Kallehauge, Thomas Beuchert; Betenbaugh, Michael J; Nielsen, Alex Toftgaard; Kildegaard, Helene Faustrup

    2014-01-01

    Chinese hamster ovary (CHO) cells are widely used in the biopharmaceutical industry as a host for the production of complex pharmaceutical proteins. Thus genome engineering of CHO cells for improved product quality and yield is of great interest. Here, we demonstrate for the first time the efficacy of the CRISPR Cas9 technology in CHO cells by generating site-specific gene disruptions in COSMC and FUT8, both of which encode proteins involved in glycosylation. The tested single guide RNAs (sgRNAs) created an indel frequency up to 47.3% in COSMC, while an indel frequency up to 99.7% in FUT8 was achieved by applying lectin selection. All eight sgRNAs examined in this study resulted in relatively high indel frequencies, demonstrating that the Cas9 system is a robust and efficient genome-editing methodology in CHO cells. Deep sequencing revealed that 85% of the indels created by Cas9 resulted in frameshift mutations at the target sites, with a strong preference for single base indels. Finally, we have developed a user-friendly bioinformatics tool, named “CRISPy” for rapid identification of sgRNA target sequences in the CHO-K1 genome. The CRISPy tool identified 1,970,449 CRISPR targets divided into 27,553 genes and lists the number of off-target sites in the genome. In conclusion, the proven functionality of Cas9 to edit CHO genomes combined with our CRISPy database have the potential to accelerate genome editing and synthetic biology efforts in CHO cells. Biotechnol. Bioeng. 2014; 111: 1604–1616. © 2014 The Authors. Biotechnology and Bioengineering Published by Wiley Periodicals, Inc. PMID:24827782

  13. Accelerating genome editing in CHO cells using CRISPR Cas9 and CRISPy, a web-based target finding tool.

    PubMed

    Ronda, Carlotta; Pedersen, Lasse Ebdrup; Hansen, Henning Gram; Kallehauge, Thomas Beuchert; Betenbaugh, Michael J; Nielsen, Alex Toftgaard; Kildegaard, Helene Faustrup

    2014-08-01

    Chinese hamster ovary (CHO) cells are widely used in the biopharmaceutical industry as a host for the production of complex pharmaceutical proteins. Thus genome engineering of CHO cells for improved product quality and yield is of great interest. Here, we demonstrate for the first time the efficacy of the CRISPR Cas9 technology in CHO cells by generating site-specific gene disruptions in COSMC and FUT8, both of which encode proteins involved in glycosylation. The tested single guide RNAs (sgRNAs) created an indel frequency up to 47.3% in COSMC, while an indel frequency up to 99.7% in FUT8 was achieved by applying lectin selection. All eight sgRNAs examined in this study resulted in relatively high indel frequencies, demonstrating that the Cas9 system is a robust and efficient genome-editing methodology in CHO cells. Deep sequencing revealed that 85% of the indels created by Cas9 resulted in frameshift mutations at the target sites, with a strong preference for single base indels. Finally, we have developed a user-friendly bioinformatics tool, named "CRISPy" for rapid identification of sgRNA target sequences in the CHO-K1 genome. The CRISPy tool identified 1,970,449 CRISPR targets divided into 27,553 genes and lists the number of off-target sites in the genome. In conclusion, the proven functionality of Cas9 to edit CHO genomes combined with our CRISPy database have the potential to accelerate genome editing and synthetic biology efforts in CHO cells.

  14. Functional genomics of lactic acid bacteria: from food to health

    PubMed Central

    2014-01-01

    Genome analysis using next generation sequencing technologies has revolutionized the characterization of lactic acid bacteria and complete genomes of all major groups are now available. Comparative genomics has provided new insights into the natural and laboratory evolution of lactic acid bacteria and their environmental interactions. Moreover, functional genomics approaches have been used to understand the response of lactic acid bacteria to their environment. The results have been instrumental in understanding the adaptation of lactic acid bacteria in artisanal and industrial food fermentations as well as their interactions with the human host. Collectively, this has led to a detailed analysis of genes involved in colonization, persistence, interaction and signaling towards to the human host and its health. Finally, massive parallel genome re-sequencing has provided new opportunities in applied genomics, specifically in the characterization of novel non-GMO strains that have potential to be used in the food industry. Here, we provide an overview of the state of the art of these functional genomics approaches and their impact in understanding, applying and designing lactic acid bacteria for food and health. PMID:25186768

  15. Functional genomics of lactic acid bacteria: from food to health.

    PubMed

    Douillard, François P; de Vos, Willem M

    2014-08-29

    Genome analysis using next generation sequencing technologies has revolutionized the characterization of lactic acid bacteria and complete genomes of all major groups are now available. Comparative genomics has provided new insights into the natural and laboratory evolution of lactic acid bacteria and their environmental interactions. Moreover, functional genomics approaches have been used to understand the response of lactic acid bacteria to their environment. The results have been instrumental in understanding the adaptation of lactic acid bacteria in artisanal and industrial food fermentations as well as their interactions with the human host. Collectively, this has led to a detailed analysis of genes involved in colonization, persistence, interaction and signaling towards to the human host and its health. Finally, massive parallel genome re-sequencing has provided new opportunities in applied genomics, specifically in the characterization of novel non-GMO strains that have potential to be used in the food industry. Here, we provide an overview of the state of the art of these functional genomics approaches and their impact in understanding, applying and designing lactic acid bacteria for food and health.

  16. Exploring the post-genomic world: differing explanatory and manipulatory functions of post-genomic sciences.

    PubMed

    Holmes, Christina; Carlson, Siobhan M; McDonald, Fiona; Jones, Mavis; Graham, Janice

    2016-01-02

    Richard Lewontin proposed that the ability of a scientific field to create a narrative for public understanding garners it social relevance. This article applies Lewontin's conceptual framework of the functions of science (manipulatory and explanatory) to compare and explain the current differences in perceived societal relevance of genetics/genomics and proteomics. We provide three examples to illustrate the social relevance and strong cultural narrative of genetics/genomics for which no counterpart exists for proteomics. We argue that the major difference between genetics/genomics and proteomics is that genomics has a strong explanatory function, due to the strong cultural narrative of heredity. Based on qualitative interviews and observations of proteomics conferences, we suggest that the nature of proteins, lack of public understanding, and theoretical complexity exacerbates this difference for proteomics. Lewontin's framework suggests that social scientists may find that omics sciences affect social relations in different ways than past analyses of genetics.

  17. Exploring the post-genomic world: differing explanatory and manipulatory functions of post-genomic sciences

    PubMed Central

    Holmes, Christina; Carlson, Siobhan M.; McDonald, Fiona; Jones, Mavis; Graham, Janice

    2016-01-01

    Richard Lewontin proposed that the ability of a scientific field to create a narrative for public understanding garners it social relevance. This article applies Lewontin's conceptual framework of the functions of science (manipulatory and explanatory) to compare and explain the current differences in perceived societal relevance of genetics/genomics and proteomics. We provide three examples to illustrate the social relevance and strong cultural narrative of genetics/genomics for which no counterpart exists for proteomics. We argue that the major difference between genetics/genomics and proteomics is that genomics has a strong explanatory function, due to the strong cultural narrative of heredity. Based on qualitative interviews and observations of proteomics conferences, we suggest that the nature of proteins, lack of public understanding, and theoretical complexity exacerbates this difference for proteomics. Lewontin's framework suggests that social scientists may find that omics sciences affect social relations in different ways than past analyses of genetics. PMID:27134568

  18. Functional Analysis of the Human Genome:. Study of Genetic Disease

    NASA Astrophysics Data System (ADS)

    Tsui, Lap-Chee

    2003-04-01

    I will divide my remarks into 3 parts. First, I will give a brief summary of the Human Genome Project. Second, I will describe our work on human chromosome 7 to illustrate how we could contribute to the Project and disease research. Third, I would like to bring across the argument that study of genetic disease is an integral component of the Human Genome Project. In particular, I will use cystic fibrosis as an example to elaborate why I consider disease study is a part of functional genomics.

  19. DFAST and DAGA: web-based integrated genome annotation tools and resources

    PubMed Central

    TANIZAWA, Yasuhiro; FUJISAWA, Takatomo; KAMINUMA, Eli; NAKAMURA, Yasukazu; ARITA, Masanori

    2016-01-01

    Quality assurance and correct taxonomic affiliation of data submitted to public sequence databases have been an everlasting problem. The DDBJ Fast Annotation and Submission Tool (DFAST) is a newly developed genome annotation pipeline with quality and taxonomy assessment tools. To enable annotation of ready-to-submit quality, we also constructed curated reference protein databases tailored for lactic acid bacteria. DFAST was developed so that all the procedures required for DDBJ submission could be done seamlessly online. The online workspace would be especially useful for users not familiar with bioinformatics skills. In addition, we have developed a genome repository, DFAST Archive of Genome Annotation (DAGA), which currently includes 1,421 genomes covering 179 species and 18 subspecies of two genera, Lactobacillus and Pediococcus, obtained from both DDBJ/ENA/GenBank and Sequence Read Archive (SRA). All the genomes deposited in DAGA were annotated consistently and assessed using DFAST. To assess the taxonomic position based on genomic sequence information, we used the average nucleotide identity (ANI), which showed high discriminative power to determine whether two given genomes belong to the same species. We corrected mislabeled or misidentified genomes in the public database and deposited the curated information in DAGA. The repository will improve the accessibility and reusability of genome resources for lactic acid bacteria. By exploiting the data deposited in DAGA, we found intraspecific subgroups in Lactobacillus gasseri and Lactobacillus jensenii, whose variation between subgroups is larger than the well-accepted ANI threshold of 95% to differentiate species. DFAST and DAGA are freely accessible at https://dfast.nig.ac.jp. PMID:27867804

  20. DFAST and DAGA: web-based integrated genome annotation tools and resources.

    PubMed

    Tanizawa, Yasuhiro; Fujisawa, Takatomo; Kaminuma, Eli; Nakamura, Yasukazu; Arita, Masanori

    2016-01-01

    Quality assurance and correct taxonomic affiliation of data submitted to public sequence databases have been an everlasting problem. The DDBJ Fast Annotation and Submission Tool (DFAST) is a newly developed genome annotation pipeline with quality and taxonomy assessment tools. To enable annotation of ready-to-submit quality, we also constructed curated reference protein databases tailored for lactic acid bacteria. DFAST was developed so that all the procedures required for DDBJ submission could be done seamlessly online. The online workspace would be especially useful for users not familiar with bioinformatics skills. In addition, we have developed a genome repository, DFAST Archive of Genome Annotation (DAGA), which currently includes 1,421 genomes covering 179 species and 18 subspecies of two genera, Lactobacillus and Pediococcus, obtained from both DDBJ/ENA/GenBank and Sequence Read Archive (SRA). All the genomes deposited in DAGA were annotated consistently and assessed using DFAST. To assess the taxonomic position based on genomic sequence information, we used the average nucleotide identity (ANI), which showed high discriminative power to determine whether two given genomes belong to the same species. We corrected mislabeled or misidentified genomes in the public database and deposited the curated information in DAGA. The repository will improve the accessibility and reusability of genome resources for lactic acid bacteria. By exploiting the data deposited in DAGA, we found intraspecific subgroups in Lactobacillus gasseri and Lactobacillus jensenii, whose variation between subgroups is larger than the well-accepted ANI threshold of 95% to differentiate species. DFAST and DAGA are freely accessible at https://dfast.nig.ac.jp.

  1. A segmental genomic duplication generates a functional intron

    PubMed Central

    Hellsten, Uffe; Aspden, Julie L.; Rio, Donald C.; Rokhsar, Daniel S.

    2011-01-01

    An intron is an extended genomic feature whose function requires multiple constrained positions—donor and acceptor splice sites, a branch point, a polypyrimidine tract and suitable splicing enhancers—that may be distributed over hundreds or thousands of nucleotides. New introns are therefore unlikely to emerge by incremental accumulation of functional sub-elements. Here we demonstrate that a functional intron can be created de novo in a single step by a segmental genomic duplication. This experiment recapitulates in vivo the birth of an intron that arose in the ancestral jawed vertebrate lineage nearly half-a-billion years ago. PMID:21878908

  2. Primer: genomic and proteomic tools for the molecular dissection of disease.

    PubMed

    Walker, Erin J; Siminovitch, Katherine A

    2007-10-01

    Completion of the Human Genome Project has been rapidly followed by the emergence of high-throughput technologies that combine automation, miniaturization, and many other strategies and tools to enable systematic surveys of genome composition and gene expression. Of particular relevance to the prevention and management of disease are technologies such as high-throughput DNA genotyping, microarray-based gene-expression profiling, and mass spectrometry-facilitated protein profiling--platforms that collectively support the comprehensive analysis of DNA sequence variants across the genome and the global gene and protein expression changes that distinguish health from disease. Now used extensively in all facets of biomedical investigation, genomic and proteomic tools are already beginning to pinpoint molecular variants that influence risk and outcome in common diseases, and to thereby inform and direct development of novel molecular biomarkers and drug targets. As evidenced by recent advances in DNA sequencing methods, continued improvements in the scope, power, and cost efficiency of genomic and proteomic technologies should ensure their capacity to provide the scale and depth of knowledge required for translating genome sequence information into major medical impact.

  3. High-throughput TILLING for functional genomics.

    PubMed

    Till, Bradley J; Colbert, Trenton; Tompa, Rachel; Enns, Linda C; Codomo, Christine A; Johnson, Jessica E; Reynolds, Steven H; Henikoff, Jorja G; Greene, Elizabeth A; Steine, Michael N; Comai, Luca; Henikoff, Steven

    2003-01-01

    Targeting-induced local lesions in genomes (TILLING) is a general strategy for identifying induced point mutations that can be applied to almost any organism. Here, we describe the basic methodology for high-throughput TILLING. Gene segments are amplified using fluorescently tagged primers, and products are denatured and reannealed to form heteroduplexes between the mutated sequence and its wild-type counterpart. These heteroduplexes are substrates for cleavage by the endonuclease CEL I. Following cleavage, products are analyzed on denaturing polyacrylamide gels using the LI-COR DNA analyzer system. High-throughput TILLING has been adopted by the Arabidopsis TILLING Project (ATP) to provide allelic series of point mutations for the general Arabidopsis community.

  4. Deductive genomics: a functional approach to identify innovative drug targets in the post-genome era.

    PubMed

    Stumm, Gabriele; Russ, Andreas; Nehls, Michael

    2002-01-01

    The sequencing of the human genome has generated a drug discovery process that is based on sequence analysis and hypothesis-driven (inductive) prediction of gene function. This approach, which we term inductive genomics, is currently dominating the efforts of the pharmaceutical industry to identify new drug targets. According to recent studies, this sequence-driven discovery process is paradoxically increasing the average cost of drug development, thus falling short of the promise of the Human Genome Project to simplify the creation of much needed novel therapeutics. In the early stages of discovery, the flurry of new gene sequences makes it difficult to pick and prioritize the most promising product candidates for product development, as with existing technologies important decisions have to be based on circumstantial evidence that does not strongly predict therapeutic potential. This is because the physiological function of a potential target cannot be predicted by gene sequence analysis and in vitro technologies alone. In contrast, deductive genomics, or large-scale forward genetics, bridges the gap between sequence and function by providing a function-driven in vivo screen of a highly orthologous mammalian model genome for medically relevant physiological functions and drug targets. This approach allows drug discovery to move beyond the focus on sequence-driven identification of new members of classical drug-able protein families towards the biology-driven identification of innovative targets and biological pathways.

  5. Genomic tools to improve progress and preserve variation for future generations

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Use of genomic tools has greatly decreased generation intervals and increased genetic progress in dairy cattle, but faster selection cycles can also increase rates of inbreeding per unit of time. Average pedigree inbreeding of Holstein cows increased from 4.6% in 2000 to 5.6% in 2009 to 6.6% in 201...

  6. Genomic tools for developing markers for postharvest disease resistance in Rosaceae fruit crops

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A wealth of new plant genomic information and molecular tools have been developed over the past ten years and now the challenge is to learn how to apply this information to address critical production problems, such as disease resistance and abiotic stress tolerance. Malus sieversii, an apple speci...

  7. Global nutrient profiling by Phenotype MicroArrays: a tool complementing genomic and proteomic studies in conidial fungi*

    PubMed Central

    Atanasova, Lea; Druzhinina, Irina S.

    2010-01-01

    Conidial fungi or molds and mildews are widely used in modern biotechnology as producers of antibiotics and other secondary metabolites, industrially important enzymes, chemicals and food. They are also important pathogens of animals including humans and agricultural crops. These various applications and extremely versatile natural phenotypes have led to the constantly growing list of complete genomes which are now available. Functional genomics and proteomics widely exploit the genomic information to study the cell-wide impact of altered genes on the phenotype of an organism and its function. This allows for global analysis of the information flow from DNA to RNA to protein, but it is usually not sufficient for the description of the global phenotype of an organism. More recently, Phenotype MicroArray (PM) technology has been introduced as a tool to characterize the metabolism of a (wild) fungal strain or a mutant. In this article, we review the background of PM applications for fungi and the methodic requirements to obtain reliable results. We also report examples of the versatility of this tool. PMID:20205302

  8. CRISPR-Cas: From the Bacterial Adaptive Immune System to a Versatile Tool for Genome Engineering.

    PubMed

    Kirchner, Marion; Schneider, Sabine

    2015-11-09

    The field of biology has been revolutionized by the recent advancement of an adaptive bacterial immune system as a universal genome engineering tool. Bacteria and archaea use repetitive genomic elements termed clustered regularly interspaced short palindromic repeats (CRISPR) in combination with an RNA-guided nuclease (CRISPR-associated nuclease: Cas) to target and destroy invading DNA. By choosing the appropriate sequence of the guide RNA, this two-component system can be used to efficiently modify, target, and edit genomic loci of interest in plants, insects, fungi, mammalian cells, and whole organisms. This has opened up new frontiers in genome engineering, including the potential to treat or cure human genetic disorders. Now the potential risks as well as the ethical, social, and legal implications of this powerful new technique move into the limelight.

  9. Physical function assessment tools in pediatric rheumatology

    PubMed Central

    Moorthy, Lakshmi Nandini; Peterson, Margaret GE; Harrison, Melanie J; Onel, Karen B; Lehman, Thomas JA

    2008-01-01

    Pediatric rheumatic diseases with predominant musculoskeletal involvement such as juvenile idiopathic arthritis (JIA) and juvenile dermatomyositis(JDM) can cause considerable physical functional impairment and significantly affect the children's quality of life (QOL). Physical function, QOL, health-related QOL (HRQOL) and health status are personal constructs used as outcomes to estimate the impact of these diseases and often used as proxies for each other. The chronic, fluctuating nature of these diseases differs within and between patients, and complicates the measurement of these outcomes. In children, their growing needs and expectations, limited use of age-specific questionnaires, and the use of proxy respondents further influences this evaluation. This article will briefly review the different constructs inclusive of and related to physical function, and the scales used for measuring them. An understanding of these instruments will enable assessment of functional outcome in clinical studies of children with rheumatic diseases, measure the impact of the disease and treatments on their lives, and guide us in formulating appropriate interventions. PMID:18533038

  10. Functional Measurement: An Incredibly Flexible Tool

    ERIC Educational Resources Information Center

    Mullet, Etienne; Morales Martinez, Guadalupe Elizabeth; Makris, Ioannis; Roge, Bernadette; Munoz Sastre, Maria Teresa

    2012-01-01

    Functional Measurement (FM) has been applied to a variety of settings that can be considered as "extreme" settings; that is, settings involving participants with severe cognitive disabilities or involving unusual stimulus material. FM has, as instance, been successfully applied for analyzing (a) numerosity judgments among children as…

  11. Bacterial genome mining of enzymatic tools for alkyne biosynthesis

    PubMed Central

    Zhu, Xuejun; Su, Michael; Manickam, Kadhirvel; Zhang, Wenjun

    2015-01-01

    The alkyne is an important functionality widely used in material science, pharmaceutical science, and chemical biology, but the importance of this functionality is contrasted by the very limited number of enzymes known to be involved in alkyne biosynthesis. We recently reported the first known carrier protein-dependent pathway for terminal alkyne formation, and in silico analysis suggested that this mechanism could be widespread in bacteria. In this paper, we screened additional homologous gene cassettes presumed to be involved in alkyne biosynthesis using both in vitro biochemical study and an E. coli-polyketide synthase (PKS) reporting system for in vivo analysis. We discovered and characterized a new terminal alkyne biosynthetic pathway comprised of TtuA, B, and C from Teredinibacter turnerae T7901. While the acyl-CoA ligase homolog (TtuA) demonstrated promiscuity in the activation and loading of medium-chain fatty acids onto the carrier protein (TtuC), the desaturase homolog (TtuB) showed stringent substrate specificity towards C10 fatty acyl moieties. In addition, TtuB was demonstrated to be a bifunctional desaturase/acetylenase that efficiently catalyzed two sequential O2-dependent dehydrogenation reactions. A novel terminal-alkyne bearing polyketide was further produced upon co-expression of ttuABC and a PKS gene in E. coli. The discovery and characterization of TtuA, B, and C provides us with a new bifunctional desaturase/acetylenase for mechanistic and structural study and expands the scarce enzyme inventory for the biosynthesis of the alkyne functionality, which has important applications in synthetic and chemical biology. PMID:26441143

  12. Conifer genomics and adaptation: at the crossroads of genetic diversity and genome function.

    PubMed

    Prunier, Julien; Verta, Jukka-Pekka; MacKay, John J

    2016-01-01

    Conifers have been understudied at the genomic level despite their worldwide ecological and economic importance but the situation is rapidly changing with the development of next generation sequencing (NGS) technologies. With NGS, genomics research has simultaneously gained in speed, magnitude and scope. In just a few years, genomes of 20-24 gigabases have been sequenced for several conifers, with several others expected in the near future. Biological insights have resulted from recent sequencing initiatives as well as genetic mapping, gene expression profiling and gene discovery research over nearly two decades. We review the knowledge arising from conifer genomics research emphasizing genome evolution and the genomic basis of adaptation, and outline emerging questions and knowledge gaps. We discuss future directions in three areas with potential inputs from NGS technologies: the evolutionary impacts of adaptation in conifers based on the adaptation-by-speciation model; the contributions of genetic variability of gene expression in adaptation; and the development of a broader understanding of genetic diversity and its impacts on genome function. These research directions promise to sustain research aimed at addressing the emerging challenges of adaptation that face conifer trees.

  13. CRISPR-Cas9: Tool for Qualitative and Quantitative Plant Genome Editing

    PubMed Central

    Noman, Ali; Aqeel, Muhammad; He, Shuilin

    2016-01-01

    Recent developments in genome editing techniques have aroused substantial excitement among agricultural scientists. These techniques offer new opportunities for developing improved plant lines with addition of important traits or removal of undesirable traits. Increased adoption of genome editing has been geared by swiftly developing Clustered regularly interspaced short palindromic repeats (CRISPR). This is appearing as driving force for innovative utilization in diverse branches of plant biology. CRISPR-Cas9 mediated genome editing is being used for rapid, easy and efficient alteration of genes among diverse plant species. With approximate completion of conceptual work about CRISPR-Cas9, plant scientists are applying this genome editing tool for crop attributes enhancement. The capability of this system for performing targeted and efficient modifications in genome sequence as well as gene expression will certainly spur novel developments not only in model plants but in crop and ornamental plants as well. Additionally, due to non-involvement of foreign DNA, this technique may help alleviating regulatory issues associated with genetically modified plants. We expect that prevailing challenges in plant science like genomic region manipulation, crop specific vectors etc. will be addressed along with sustained growth of this genome editing tool. In this review, recent progress of CRISPR-Cas9 technology in plants has been summarized and discussed. We reviewed significance of CRISPR-Cas9 for specific and non-traditional aspects of plant life. It also covers strengths of this technique in comparison with other genome editing techniques, e.g., Zinc finger nucleases, Transcription activator-like effector nucleases and potential challenges in coming decades have been described. PMID:27917188

  14. CRISPR-Cas9: Tool for Qualitative and Quantitative Plant Genome Editing.

    PubMed

    Noman, Ali; Aqeel, Muhammad; He, Shuilin

    2016-01-01

    Recent developments in genome editing techniques have aroused substantial excitement among agricultural scientists. These techniques offer new opportunities for developing improved plant lines with addition of important traits or removal of undesirable traits. Increased adoption of genome editing has been geared by swiftly developing Clustered regularly interspaced short palindromic repeats (CRISPR). This is appearing as driving force for innovative utilization in diverse branches of plant biology. CRISPR-Cas9 mediated genome editing is being used for rapid, easy and efficient alteration of genes among diverse plant species. With approximate completion of conceptual work about CRISPR-Cas9, plant scientists are applying this genome editing tool for crop attributes enhancement. The capability of this system for performing targeted and efficient modifications in genome sequence as well as gene expression will certainly spur novel developments not only in model plants but in crop and ornamental plants as well. Additionally, due to non-involvement of foreign DNA, this technique may help alleviating regulatory issues associated with genetically modified plants. We expect that prevailing challenges in plant science like genomic region manipulation, crop specific vectors etc. will be addressed along with sustained growth of this genome editing tool. In this review, recent progress of CRISPR-Cas9 technology in plants has been summarized and discussed. We reviewed significance of CRISPR-Cas9 for specific and non-traditional aspects of plant life. It also covers strengths of this technique in comparison with other genome editing techniques, e.g., Zinc finger nucleases, Transcription activator-like effector nucleases and potential challenges in coming decades have been described.

  15. FuGE: Functional Genomics Experiment Object Model.

    PubMed

    Jones, Andrew R; Pizarro, Angel; Spellman, Paul; Miller, Michael

    2006-01-01

    This is an interim report on the Functional Genomics Experiment (FuGE) Object Model. FuGE is a framework for creating data standards for high-throughput biological experiments, developed by a consortium of researchers from academia and industry. FuGE supports rich annotation of samples, protocols, instruments, and software, as well as providing extension points for technology specific details. It has been adopted by microarray and proteomics standards bodies as a basis for forthcoming standards. It is hoped that standards developers for other omics techniques will join this collaborative effort; widespread adoption will allow uniform annotation of common parts of functional genomics workflows, reduce standard development and learning times through the sharing of consistent practice, and ease the construction of software for accessing and integrating functional genomics data.

  16. Budding off: bringing functional genomics to Candida albicans.

    PubMed

    Anderson, Matthew Z; Bennett, Richard J

    2016-03-01

    Candida species are the most prevalent human fungal pathogens, with Candida albicans being the most clinically relevant species. Candida albicans resides as a commensal of the human gastrointestinal tract but is a frequent cause of opportunistic mucosal and systemic infections. Investigation of C. albicans virulence has traditionally relied on candidate gene approaches, but recent advances in functional genomics have now facilitated global, unbiased studies of gene function. Such studies include comparative genomics (both between and within Candida species), analysis of total RNA expression, and regulation and delineation of protein-DNA interactions. Additionally, large collections of mutant strains have begun to aid systematic screening of clinically relevant phenotypes. Here, we will highlight the development of functional genomics in C. albicans and discuss the use of these approaches to addressing both commensalism and pathogenesis in this species.

  17. TreeQ-VISTA: An Interactive Tree Visualization Tool withFunctional Annotation Query Capabilities

    SciTech Connect

    Gu, Shengyin; Anderson, Iain; Kunin, Victor; Cipriano, Michael; Minovitsky, Simon; Weber, Gunther; Amenta, Nina; Hamann, Bernd; Dubchak,Inna

    2007-05-07

    Summary: We describe a general multiplatform exploratorytool called TreeQ-Vista, designed for presenting functional annotationsin a phylogenetic context. Traits, such as phenotypic and genomicproperties, are interactively queried from a relational database with auser-friendly interface which provides a set of tools for users with orwithout SQL knowledge. The query results are projected onto aphylogenetic tree and can be displayed in multiple color groups. A richset of browsing, grouping and query tools are provided to facilitatetrait exploration, comparison and analysis.Availability: The program,detailed tutorial and examples are available online athttp://genome-test.lbl.gov/vista/TreeQVista.

  18. University of Texas Southwestern Medical Center: Functional Signature Ontology Tool: Triplicate Measurements of Reporter Gene Expression in Response to Individual Genetic and Chemical Perturbations in HCT116 Cells | Office of Cancer Genomics

    Cancer.gov

    The goal of this project is to use an eight-gene expression profile to define functional signatures for small molecules and natural products with heretofore undefined mechanism of action. Two genes in the eight gene set are used as internal controls and do not vary across gene expression array data collected from the public domain. The remaining six genes are found to vary independently across a large collection of publically available gene expression array datasets.  Read the abstract

  19. A close relationship between primary nucleotides sequence structure and the composition of functional genes in the genome of prokaryotes.

    PubMed

    Garcia, Juan A L; Fernández-Guerra, Antoni; Casamayor, Emilio O

    2011-12-01

    Comparative genomics is an essential tool to unravel how genomes change over evolutionary time and to gain clues on the links between functional genomics and evolution. In prokaryotes, the large, good quality, genome sequences available in public databases and the recently developed large-scale computational methods, offer an unprecedent view on the ecology and evolution of microorganisms through comparative genomics. In this work, we examined the links among genome structure (i.e., the sequential distribution of nucleotides itself by detrended fluctuation analysis, DFA) and genomic diversity (i.e., gene functionality by Clusters of Orthologous Genes, COGs) in 828 full sequenced prokaryotic genomes from 548 different bacteria and archaea species. DFA scaling exponent α indicated persistent long-range correlations (fractality) in each genome analyzed. Higher resolution power was found when considering the sequential succession of purine (AG) vs. pyrimidine (CT) bases than either keto (GT) to amino (AC) forms or strongly (GC) vs. weakly (AT) bonded nucleotides. Interestingly, the phyla Aquificae, Fusobacteria, Dictyoglomi, Nitrospirae, and Thermotogae were closer to archaea than to their bacterial counterparts. A strong significant correlation was found between scaling exponent α and COGs distribution, and we consistently observed that the larger α the more heterogeneous was the gene distribution within each functional category, suggesting a close relationship between primary nucleotides sequence structure and functional genes composition.

  20. Bootstrap, Bayesian probability and maximum likelihood mapping: exploring new tools for comparative genome analyses

    PubMed Central

    Zhaxybayeva, Olga; Gogarten, J Peter

    2002-01-01

    Background Horizontal gene transfer (HGT) played an important role in shaping microbial genomes. In addition to genes under sporadic selection, HGT also affects housekeeping genes and those involved in information processing, even ribosomal RNA encoding genes. Here we describe tools that provide an assessment and graphic illustration of the mosaic nature of microbial genomes. Results We adapted the Maximum Likelihood (ML) mapping to the analyses of all detected quartets of orthologous genes found in four genomes. We have automated the assembly and analyses of these quartets of orthologs given the selection of four genomes. We compared the ML-mapping approach to more rigorous Bayesian probability and Bootstrap mapping techniques. The latter two approaches appear to be more conservative than the ML-mapping approach, but qualitatively all three approaches give equivalent results. All three tools were tested on mitochondrial genomes, which presumably were inherited as a single linkage group. Conclusions In some instances of interphylum relationships we find nearly equal numbers of quartets strongly supporting the three possible topologies. In contrast, our analyses of genome quartets containing the cyanobacterium Synechocystis sp. indicate that a large part of the cyanobacterial genome is related to that of low GC Gram positives. Other groups that had been suggested as sister groups to the cyanobacteria contain many fewer genes that group with the Synechocystis orthologs. Interdomain comparisons of genome quartets containing the archaeon Halobacterium sp. revealed that Halobacterium sp. shares more genes with Bacteria that live in the same environment than with Bacteria that are more closely related based on rRNA phylogeny . Many of these genes encode proteins involved in substrate transport and metabolism and in information storage and processing. The performed analyses demonstrate that relationships among prokaryotes cannot be accurately depicted by or inferred from

  1. Structure and Functional Studies on Dengue-2 Virus Genome

    DTIC Science & Technology

    1986-03-01

    AD STRUCTURE AND FUNCTIONAL STUDIES ON DENGUE -2 VIRUS GENOME FINAL Report Lfl C’) Radha Krishnan Padmanabhan, Ph.D. 0) March 1, 1986 Supported by U.S...and Functional Studies on Dengue -2 Virus Genome 12. PERSONAL AUTHOR(S) Radha Krishnan Padmanabhan 13a. TYPE OF REPORT 13b. TIME COVERED 14. DATE OF...3’-end of Dengue RNA in order to facilitate cDNA synthesis by oligo d(T) priming as proposed in the original research project. 2. We also showed that

  2. WWOX: its genomics, partners, and functions.

    PubMed

    Del Mare, Sara; Salah, Zaidoun; Aqeilan, Rami I

    2009-11-01

    The WW domain-containing oxidoreductase (WWOX) spans one of the most active common fragile sites (CFSs) involved in cancer, FRA16D. WWOX encodes a 46-kDa protein that contains two N-terminal WW domains and a central short-chain dehydrogenase/reductase (SDR) domain. Through its WW domain, Wwox interacts with its partners and modulates their functions. Our data indicate that Wwox suppresses the transactivation function of several transcription factors implied in neoplasia by sequestering them in the cytoplasm. Work from our laboratory and other research groups have demonstrated that Wwox participates in a number of cellular processes including growth, differentiation, apoptosis, and tumor suppression. Targeted deletion of the Wwox gene in mice causes increased spontaneous and chemically induced tumor incidence supporting bona fide tumor suppressor function of WWOX. Moreover, generation of the Wwox-deficient mice uncovers, at least in part, some of the physiological in vivo functions of the WWOX gene. This review focuses on recent progress that elucidates Wwox functions in biology and pathology.

  3. Solutions for data integration in functional genomics: a critical assessment and case study.

    PubMed

    Smedley, Damian; Swertz, Morris A; Wolstencroft, Katy; Proctor, Glenn; Zouberakis, Michael; Bard, Jonathan; Hancock, John M; Schofield, Paul

    2008-11-01

    The torrent of data emerging from the application of new technologies to functional genomics and systems biology can no longer be contained within the traditional modes of data sharing and publication with the consequence that data is being deposited in, distributed across and disseminated through an increasing number of databases. The resulting fragmentation poses serious problems for the model organism community which increasingly rely on data mining and computational approaches that require gathering of data from a range of sources. In the light of these problems, the European Commission has funded a coordination action, CASIMIR (coordination and sustainability of international mouse informatics resources), with a remit to assess the technical and social aspects of database interoperability that currently prevent the full realization of the potential of data integration in mouse functional genomics. In this article, we assess the current problems with interoperability, with particular reference to mouse functional genomics, and critically review the technologies that can be deployed to overcome them. We describe a typical use-case where an investigator wishes to gather data on variation, genomic context and metabolic pathway involvement for genes discovered in a genome-wide screen. We go on to develop an automated approach involving an in silico experimental workflow tool, Taverna, using web services, BioMart and MOLGENIS technologies for data retrieval. Finally, we focus on the current impediments to adopting such an approach in a wider context, and strategies to overcome them.

  4. Family history in public health practice: a genomic tool for disease prevention and health promotion.

    PubMed

    Valdez, Rodolfo; Yoon, Paula W; Qureshi, Nadeem; Green, Ridgely Fisk; Khoury, Muin J

    2010-01-01

    Family history is a risk factor for many chronic diseases, including cancer, cardiovascular disease, and diabetes. Professional guidelines usually include family history to assess health risk, initiate interventions, and motivate behavioral changes. The advantages of family history over other genomic tools include a lower cost, greater acceptability, and a reflection of shared genetic and environmental factors. However, the utility of family history in public health has been poorly explored. To establish family history as a public health tool, it needs to be evaluated within the ACCE framework (analytical validity; clinical validity; clinical utility; and ethical, legal, and social issues). Currently, private and public organizations are developing tools to collect standardized family histories of many diseases. Their goal is to create family history tools that have decision support capabilities and are compatible with electronic health records. These advances will help realize the potential of family history as a public health tool.

  5. Integrative genomics to dissect retinoid functions.

    PubMed

    Mendoza-Parra, Marco-Antonio; Gronemeyer, Hinrich

    2014-01-01

    Retinoids and rexinoids, as all other ligands of the nuclear receptor (NR) family, act as ligand-regulated trans-acting transcription factors that bind to cis-acting DNA regulatory elements in the promoter regions of target genes (for reviews see [12, 22, 23, 26, 36]). Ligand binding modulates the communication functions of the receptor with the intracellular environment, which essentially entails receptor-protein and receptor-DNA or receptor-chromatin interactions. In this communication network, the receptor simultaneously serves as both intracellular sensor and regulator of cell/organ functions. Receptors are "intelligent" mediators of the information encoded in the chemical structure of a nuclear receptor ligand, as they interpret this information in the context of cellular identity and cell-physiological status and convert it into a dynamic chain of receptor-protein and receptor-DNA interactions. To process input and output information, they are composed of a modular structure with several domains that have evolved to exert particular molecular recognition functions. As detailed in other chapters in this volume, the main functional domains are the DNA-binding (DBD) and ligand-binding (LBD) [5-7, 38, 56, 71]. The LBD serves as a dual input-output information processor. Inputs, such as ligand binding or receptor phosphorylations, induce allosteric changes in receptor surfaces that serve as docking sites for outputs, such as subunits of transcription and epigenetic machineries or enzyme complexes. The complexity of input and output signals and their interdependencies is far from being understood.

  6. Correcting Inconsistencies and Errors in Bacterial Genome Metadata Using an Automated Curation Tool in Excel (AutoCurE).

    PubMed

    Schmedes, Sarah E; King, Jonathan L; Budowle, Bruce

    2015-01-01

    Whole-genome data are invaluable for large-scale comparative genomic studies. Current sequencing technologies have made it feasible to sequence entire bacterial genomes with relative ease and time with a substantially reduced cost per nucleotide, hence cost per genome. More than 3,000 bacterial genomes have been sequenced and are available at the finished status. Publically available genomes can be readily downloaded; however, there are challenges to verify the specific supporting data contained within the download and to identify errors and inconsistencies that may be present within the organizational data content and metadata. AutoCurE, an automated tool for bacterial genome database curation in Excel, was developed to facilitate local database curation of supporting data that accompany downloaded genomes from the National Center for Biotechnology Information. AutoCurE provides an automated approach to curate local genomic databases by flagging inconsistencies or errors by comparing the downloaded supporting data to the genome reports to verify genome name, RefSeq accession numbers, the presence of archaea, BioProject/UIDs, and sequence file descriptions. Flags are generated for nine metadata fields if there are inconsistencies between the downloaded genomes and genomes reports and if erroneous or missing data are evident. AutoCurE is an easy-to-use tool for local database curation for large-scale genome data prior to downstream analyses.

  7. Suite of tools for statistical N-gram language modeling for pattern mining in whole genome sequences.

    PubMed

    Ganapathiraju, Madhavi K; Mitchell, Asia D; Thahir, Mohamed; Motwani, Kamiya; Ananthasubramanian, Seshan

    2012-12-01

    Genome sequences contain a number of patterns that have biomedical significance. Repetitive sequences of various kinds are a primary component of most of the genomic sequence patterns. We extended the suffix-array based Biological Language Modeling Toolkit to compute n-gram frequencies as well as n-gram language-model based perplexity in windows over the whole genome sequence to find biologically relevant patterns. We present the suite of tools and their application for analysis on whole human genome sequence.

  8. Transcription activator-like effector nucleases (TALENs): a highly efficient and versatile tool for genome editing.

    PubMed

    Sun, Ning; Zhao, Huimin

    2013-07-01

    Transcription activator-like effector (TALE) nucleases (TALENs) have recently emerged as a revolutionary genome editing tool in many different organisms and cell types. The site-specific chromosomal double-strand breaks introduced by TALENs significantly increase the efficiency of genomic modification. The modular nature of the TALE central repeat domains enables researchers to tailor DNA recognition specificity with ease and target essentially any desired DNA sequence. Here, we comprehensively review the development of TALEN technology in terms of scaffold optimization, DNA recognition, and repeat array assembly. In addition, we provide some perspectives on the future development of this technology.

  9. Cas-analyzer: an online tool for assessing genome editing results using NGS data.

    PubMed

    Park, Jeongbin; Lim, Kayeong; Kim, Jin-Soo; Bae, Sangsu

    2017-01-15

    Genome editing with programmable nucleases has been widely adopted in research and medicine. Next generation sequencing (NGS) platforms are now widely used for measuring the frequencies of mutations induced by CRISPR-Cas9 and other programmable nucleases. Here, we present an online tool, Cas-Analyzer, a JavaScript-based implementation for NGS data analysis. Because Cas-Analyzer is completely used at a client-side web browser on-the-fly, there is no need to upload very large NGS datasets to a server, a time-consuming step in genome editing analysis. Currently, Cas-Analyzer supports various programmable nucleases, including single nucleases and paired nucleases.

  10. Getting trichy: tools and approaches to interrogating Trichomonas vaginalis in a post-genome world

    PubMed Central

    Conrad, Melissa D.; Bradic, Martina; Warring, Sally D.; Gorman, Andrew W.; Carlton, Jane M.

    2012-01-01

    Trichomonas vaginalis is a parasite of the urogenital tract in men and women, with a worldwide presence and significant implications for global public health. T. vaginalis research entered the age of genomics with the publication of the first genome sequence in 2007, yet subsequent utilization of other ‘omics’ technologies and methods has been slow. Here, we review some of the tools and approaches available to interrogate T. vaginalis biology, with an emphasis on recent advances and current limitations, and draw attention to areas where further efforts are needed to effectively examine the complex and intriguing biology of the parasite. PMID:23219217

  11. Genomic Functionalization: The Next Revolution In Biology

    SciTech Connect

    Anderson, Peter; Schoeniger, Joseph S.; Imbro, Paula M.

    2014-07-01

    We have implemented a ligand-alignment algorithm into our developed computational pipeline for identifying specificity-determining features (SDFs) in protein-ligand complexes. Given a set of protein-ligand complex structures, the algorithm aligns the complexes by ligand rather than by the C -RMSD or standard approach, providing a single reference frame for extracting SDFs. We anticipate that this ligand-alignment capability will be highly useful for protein function prediction. We already have a database containing > 20 K ligand-protein complex crystal structures taken from the Protein Data Bank. By aligning these proteins to single reference frames using ligand alignment, we can submit the complexes to our pipeline for SDF extraction. The SDFs derived from this training procedure can be used as thumbprints that are hallmarks of individual enzyme classes. These SDF thumbprints may then serve as guides to the prediction of function of new unknown proteins.

  12. PerPlot & PerScan: tools for analysis of DNA curvature-related periodicity in genomic nucleotide sequences

    PubMed Central

    2011-01-01

    Background Periodic spacing of short adenine or thymine runs phased with DNA helical period of ~10.5 bp is associated with intrinsic DNA curvature and deformability, which play important roles in DNA-protein interactions and in the organization of chromosomes in both eukaryotes and prokaryotes. Local differences in DNA sequence periodicity have been linked to differences in gene expression in some organisms. Despite the significance of these periodic patterns, there are virtually no publicly accessible tools for their analysis. Results We present novel tools suitable for assessments of DNA curvature-related sequence periodicity in nucleotide sequences at the genome scale. Utility of the present software is demonstrated on a comparison of sequence periodicities in the genomes of Haemophilus influenzae, Methanocaldococcus jannaschii, Saccharomyces cerevisiae, and Arabidopsis thaliana. The software can be accessed through a web interface and the programs are also available for download. Conclusions The present software is suitable for comparing DNA curvature-related sequence periodicity among different genomes as well as for analysis of intrachromosomal heterogeneity of the sequence periodicity. It provides a quick and convenient way to detect anomalous regions of chromosomes that could have unusual structural and functional properties and/or distinct evolutionary history. PMID:22587738

  13. The rules of tool incorporation: Tool morpho-functional & sensori-motor constraints.

    PubMed

    Cardinali, L; Brozzoli, C; Finos, L; Roy, A C; Farnè, A

    2016-04-01

    Previous studies showed that using tools modifies the agent's body and space representation. However, it is still not clear which rules govern those remapping processes. Here, we studied the differential role played by the morpho-functional characteristics of a tool and the sensori-motor constraints that a tool imposes on the hand. To do so, we asked a group of participants to reach and grasp an object using, in different conditions, two different tools: Pliers, to be acted upon by the index and thumb fingertips, and Sticks, taped to the same two digits. The two tools were equivalent in terms of morpho-functional characteristics, providing index finger and thumb with the same amount of elongation. Crucially, however, they imposed different sensori-motor constraints on the acting fingers. We measured and compared the kinematic profile of free-hand movements performed before and after the use of both devices. As predicted on the basis of their equivalent morpho-functional characteristics, both tools induced similar changes in the fingers (but not the arm) kinematics compatible with the hand being represented as bigger. Furthermore, the different sensori-motor constraints imposed by Pliers and Sticks over the hand, induced differential updates of the hand representation. In particular, the Sticks selectively affected the kinematics of the two fingers they were taped on, whereas Pliers had a more global effect, affecting the kinematics of hand movements not performed during the use of the tool. These results suggest that tool-use induces a rapid update of the hand representation in the brain, not only on the basis of the morpho-functional characteristics of the tool, but also depending on the specific sensori-motor constraints imposed by the tool.

  14. Mutant power: using mutant allele collections for yeast functional genomics.

    PubMed

    Norman, Kaitlyn L; Kumar, Anuj

    2016-03-01

    The budding yeast has long served as a model eukaryote for the functional genomic analysis of highly conserved signaling pathways, cellular processes and mechanisms underlying human disease. The collection of reagents available for genomics in yeast is extensive, encompassing a growing diversity of mutant collections beyond gene deletion sets in the standard wild-type S288C genetic background. We review here three main types of mutant allele collections: transposon mutagen collections, essential gene collections and overexpression libraries. Each collection provides unique and identifiable alleles that can be utilized in genome-wide, high-throughput studies. These genomic reagents are particularly informative in identifying synthetic phenotypes and functions associated with essential genes, including those modeled most effectively in complex genetic backgrounds. Several examples of genomic studies in filamentous/pseudohyphal backgrounds are provided here to illustrate this point. Additionally, the limitations of each approach are examined. Collectively, these mutant allele collections in Saccharomyces cerevisiae and the related pathogenic yeast Candida albicans promise insights toward an advanced understanding of eukaryotic molecular and cellular biology.

  15. Advancing our understanding of functional genome organisation through studies in the fission yeast.

    PubMed

    Olsson, Ida; Bjerling, Pernilla

    2011-02-01

    Significant progress has been made in understanding the functional organisation of the cell nucleus. Still many questions remain to be answered about the relationship between the spatial organisation of the nucleus and the regulation of the genome function. There are many conflicting data in the field making it very difficult to merge published results on mammalian cells into one model on subnuclear chromatin organisation. The fission yeast, Schizosaccharomyces pombe, over the last decades has emerged as a valuable model organism in understanding basic biological mechanisms, especially the cell cycle and chromosome biology. In this review we describe and compare the nuclear organisation in mammalian and fission yeast cells. We believe that fission yeast is a good tool to resolve at least some of the contradictions and unanswered questions concerning functional nuclear architecture, since S. pombe has chromosomes structurally similar to that of human. S. pombe also has the advantage over higher eukaryotes in that the genome can easily be manipulated via homologous recombination making it possible to integrate the tools needed for visualisation of chromosomes using live-cell microscopy. Classical genetic experiments can be used to elucidate what factors are involved in a certain mechanism. The knowledge we have gained during the last few years indicates similarities between the genome organisation in fission yeast and mammalian cells. We therefore propose the use of fission yeast for further advancement of our understanding of functional nuclear organisation.

  16. Resurrection of DNA Function In Vivo from an Extinct Genome

    PubMed Central

    Pask, Andrew J.; Behringer, Richard R.; Renfree, Marilyn B.

    2008-01-01

    There is a burgeoning repository of information available from ancient DNA that can be used to understand how genomes have evolved and to determine the genetic features that defined a particular species. To assess the functional consequences of changes to a genome, a variety of methods are needed to examine extinct DNA function. We isolated a transcriptional enhancer element from the genome of an extinct marsupial, the Tasmanian tiger (Thylacinus cynocephalus or thylacine), obtained from 100 year-old ethanol-fixed tissues from museum collections. We then examined the function of the enhancer in vivo. Using a transgenic approach, it was possible to resurrect DNA function in transgenic mice. The results demonstrate that the thylacine Col2A1 enhancer directed chondrocyte-specific expression in this extinct mammalian species in the same way as its orthologue does in mice. While other studies have examined extinct coding DNA function in vitro, this is the first example of the restoration of extinct non-coding DNA and examination of its function in vivo. Our method using transgenesis can be used to explore the function of regulatory and protein-coding sequences obtained from any extinct species in an in vivo model system, providing important insights into gene evolution and diversity. PMID:18493600

  17. Strategies to explore functional genomics data sets in NCBI's GEO database.

    PubMed

    Wilhite, Stephen E; Barrett, Tanya

    2012-01-01

    The Gene Expression Omnibus (GEO) database is a major repository that stores high-throughput functional genomics data sets that are generated using both microarray-based and sequence-based technologies. Data sets are submitted to GEO primarily by researchers who are publishing their results in journals that require original data to be made freely available for review and analysis. In addition to serving as a public archive for these data, GEO has a suite of tools that allow users to identify, analyze, and visualize data relevant to their specific interests. These tools include sample comparison applications, gene expression profile charts, data set clusters, genome browser tracks, and a powerful search engine that enables users to construct complex queries.

  18. Functional and evolutionary insights from the genomes of three parasitoid Nasonia species.

    PubMed

    Werren, John H; Richards, Stephen; Desjardins, Christopher A; Niehuis, Oliver; Gadau, Jürgen; Colbourne, John K; Werren, John H; Richards, Stephen; Desjardins, Christopher A; Niehuis, Oliver; Gadau, Jürgen; Colbourne, John K; Beukeboom, Leo W; Desplan, Claude; Elsik, Christine G; Grimmelikhuijzen, Cornelis J P; Kitts, Paul; Lynch, Jeremy A; Murphy, Terence; Oliveira, Deodoro C S G; Smith, Christopher D; van de Zande, Louis; Worley, Kim C; Zdobnov, Evgeny M; Aerts, Maarten; Albert, Stefan; Anaya, Victor H; Anzola, Juan M; Barchuk, Angel R; Behura, Susanta K; Bera, Agata N; Berenbaum, May R; Bertossa, Rinaldo C; Bitondi, Márcia M G; Bordenstein, Seth R; Bork, Peer; Bornberg-Bauer, Erich; Brunain, Marleen; Cazzamali, Giuseppe; Chaboub, Lesley; Chacko, Joseph; Chavez, Dean; Childers, Christopher P; Choi, Jeong-Hyeon; Clark, Michael E; Claudianos, Charles; Clinton, Rochelle A; Cree, Andrew G; Cristino, Alexandre S; Dang, Phat M; Darby, Alistair C; de Graaf, Dirk C; Devreese, Bart; Dinh, Huyen H; Edwards, Rachel; Elango, Navin; Elhaik, Eran; Ermolaeva, Olga; Evans, Jay D; Foret, Sylvain; Fowler, Gerald R; Gerlach, Daniel; Gibson, Joshua D; Gilbert, Donald G; Graur, Dan; Gründer, Stefan; Hagen, Darren E; Han, Yi; Hauser, Frank; Hultmark, Da; Hunter, Henry C; Hurst, Gregory D D; Jhangian, Shalini N; Jiang, Huaiyang; Johnson, Reed M; Jones, Andrew K; Junier, Thomas; Kadowaki, Tatsuhiko; Kamping, Albert; Kapustin, Yuri; Kechavarzi, Bobak; Kim, Jaebum; Kim, Jay; Kiryutin, Boris; Koevoets, Tosca; Kovar, Christie L; Kriventseva, Evgenia V; Kucharski, Robert; Lee, Heewook; Lee, Sandra L; Lees, Kristin; Lewis, Lora R; Loehlin, David W; Logsdon, John M; Lopez, Jacqueline A; Lozado, Ryan J; Maglott, Donna; Maleszka, Ryszard; Mayampurath, Anoop; Mazur, Danielle J; McClure, Marcella A; Moore, Andrew D; Morgan, Margaret B; Muller, Jean; Munoz-Torres, Monica C; Muzny, Donna M; Nazareth, Lynne V; Neupert, Susanne; Nguyen, Ngoc B; Nunes, Francis M F; Oakeshott, John G; Okwuonu, Geoffrey O; Pannebakker, Bart A; Pejaver, Vikas R; Peng, Zuogang; Pratt, Stephen C; Predel, Reinhard; Pu, Ling-Ling; Ranson, Hilary; Raychoudhury, Rhitoban; Rechtsteiner, Andreas; Reese, Justin T; Reid, Jeffrey G; Riddle, Megan; Robertson, Hugh M; Romero-Severson, Jeanne; Rosenberg, Miriam; Sackton, Timothy B; Sattelle, David B; Schlüns, Helge; Schmitt, Thomas; Schneider, Martina; Schüler, Andreas; Schurko, Andrew M; Shuker, David M; Simões, Zilá L P; Sinha, Saurabh; Smith, Zachary; Solovyev, Victor; Souvorov, Alexandre; Springauf, Andreas; Stafflinger, Elisabeth; Stage, Deborah E; Stanke, Mario; Tanaka, Yoshiaki; Telschow, Arndt; Trent, Carol; Vattathil, Selina; Verhulst, Eveline C; Viljakainen, Lumi; Wanner, Kevin W; Waterhouse, Robert M; Whitfield, James B; Wilkes, Timothy E; Williamson, Michael; Willis, Judith H; Wolschin, Florian; Wyder, Stefan; Yamada, Takuji; Yi, Soojin V; Zecher, Courtney N; Zhang, Lan; Gibbs, Richard A

    2010-01-15

    We report here genome sequences and comparative analyses of three closely related parasitoid wasps: Nasonia vitripennis, N. giraulti, and N. longicornis. Parasitoids are important regulators of arthropod populations, including major agricultural pests and disease vectors, and Nasonia is an emerging genetic model, particularly for evolutionary and developmental genetics. Key findings include the identification of a functional DNA methylation tool kit; hymenopteran-specific genes including diverse venoms; lateral gene transfers among Pox viruses, Wolbachia, and Nasonia; and the rapid evolution of genes involved in nuclear-mitochondrial interactions that are implicated in speciation. Newly developed genome resources advance Nasonia for genetic research, accelerate mapping and cloning of quantitative trait loci, and will ultimately provide tools and knowledge for further increasing the utility of parasitoids as pest insect-control agents.

  19. Functional and Evolutionary Insights from the Genomes of Three Parasitoid Nasonia Species

    PubMed Central

    2010-01-01

    We report here genome sequences and comparative analyses of three closely related parasitoid wasps: Nasonia vitripennis, N. giraulti, and N. longicornis. Parasitoids are important regulators of arthropod populations, including major agricultural pests and disease vectors, and Nasonia is an emerging genetic model, particularly for evolutionary and developmental genetics. Key findings include the identification of a functional DNA methylation tool kit; hymenopteran-specific genes including diverse venoms; lateral gene transfers among Pox viruses, Wolbachia, and Nasonia; and the rapid evolution of genes involved in nuclear-mitochondrial interactions that are implicated in speciation. Newly developed genome resources advance Nasonia for genetic research, accelerate mapping and cloning of quantitative trait loci, and will ultimately provide tools and knowledge for further increasing the utility of parasitoids as pest insect-control agents. PMID:20075255

  20. MELOGEN: an EST database for melon functional genomics

    PubMed Central

    Gonzalez-Ibeas, Daniel; Blanca, José; Roig, Cristina; González-To, Mireia; Picó, Belén; Truniger, Verónica; Gómez, Pedro; Deleu, Wim; Caño-Delgado, Ana; Arús, Pere; Nuez, Fernando; Garcia-Mas, Jordi; Puigdomènech, Pere; Aranda, Miguel A

    2007-01-01

    Background Melon (Cucumis melo L.) is one of the most important fleshy fruits for fresh consumption. Despite this, few genomic resources exist for this species. To facilitate the discovery of genes involved in essential traits, such as fruit development, fruit maturation and disease resistance, and to speed up the process of breeding new and better adapted melon varieties, we have produced a large collection of expressed sequence tags (ESTs) from eight normalized cDNA libraries from different tissues in different physiological conditions. Results We determined over 30,000 ESTs that were clustered into 16,637 non-redundant sequences or unigenes, comprising 6,023 tentative consensus sequences (contigs) and 10,614 unclustered sequences (singletons). Many potential molecular markers were identified in the melon dataset: 1,052 potential simple sequence repeats (SSRs) and 356 single nucleotide polymorphisms (SNPs) were found. Sixty-nine percent of the melon unigenes showed a significant similarity with proteins in databases. Functional classification of the unigenes was carried out following the Gene Ontology scheme. In total, 9,402 unigenes were mapped to one or more ontology. Remarkably, the distributions of melon and Arabidopsis unigenes followed similar tendencies, suggesting that the melon dataset is representative of the whole melon transcriptome. Bioinformatic analyses primarily focused on potential precursors of melon micro RNAs (miRNAs) in the melon dataset, but many other genes potentially controlling disease resistance and fruit quality traits were also identified. Patterns of transcript accumulation were characterised by Real-Time-qPCR for 20 of these genes. Conclusion The collection of ESTs characterised here represents a substantial increase on the genetic information available for melon. A database (MELOGEN) which contains all EST sequences, contig images and several tools for analysis and data mining has been created. This set of sequences constitutes

  1. Functional annotation from the genome sequence of the giant panda.

    PubMed

    Huo, Tong; Zhang, Yinjie; Lin, Jianping

    2012-08-01

    The giant panda is one of the most critically endangered species due to the fragmentation and loss of its habitat. Studying the functions of proteins in this animal, especially specific trait-related proteins, is therefore necessary to protect the species. In this work, the functions of these proteins were investigated using the genome sequence of the giant panda. Data on 21,001 proteins and their functions were stored in the Giant Panda Protein Database, in which the proteins were divided into two groups: 20,179 proteins whose functions can be predicted by GeneScan formed the known-function group, whereas 822 proteins whose functions cannot be predicted by GeneScan comprised the unknown-function group. For the known-function group, we further classified the proteins by molecular function, biological process, cellular component, and tissue specificity. For the unknown-function group, we developed a strategy in which the proteins were filtered by cross-Blast to identify panda-specific proteins under the assumption that proteins related to the panda-specific traits in the unknown-function group exist. After this filtering procedure, we identified 32 proteins (2 of which are membrane proteins) specific to the giant panda genome as compared against the dog and horse genomes. Based on their amino acid sequences, these 32 proteins were further analyzed by functional classification using SVM-Prot, motif prediction using MyHits, and interacting protein prediction using the Database of Interacting Proteins. Nineteen proteins were predicted to be zinc-binding proteins, thus affecting the activities of nucleic acids. The 32 panda-specific proteins will be further investigated by structural and functional analysis.

  2. The hidden perils of read mapping as a quality assessment tool in genome sequencing

    PubMed Central

    Lehri, B.; Seddon, A. M.; Karlyshev, A. V.

    2017-01-01

    This article provides a comparative analysis of the various methods of genome sequencing focusing on verification of the assembly quality. The results of a comparative assessment of various de novo assembly tools, as well as sequencing technologies, are presented using a recently completed sequence of the genome of Lactobacillus fermentum 3872. In particular, quality of assemblies is assessed by using CLC Genomics Workbench read mapping and Optical mapping developed by OpGen. Over-extension of contigs without prior knowledge of contig location can lead to misassembled contigs, even when commonly used quality indicators such as read mapping suggest that a contig is well assembled. Precautions must also be undertaken when using long read sequencing technology, which may also lead to misassembled contigs. PMID:28225089

  3. MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities

    SciTech Connect

    Kang, Dongwan D.; Froula, Jeff; Egan, Rob; Wang, Zhong

    2015-01-01

    Grouping large genomic fragments assembled from shotgun metagenomic sequences to deconvolute complex microbial communities, or metagenome binning, enables the study of individual organisms and their interactions. Because of the complex nature of these communities, existing metagenome binning methods often miss a large number of microbial species. In addition, most of the tools are not scalable to large datasets. Here we introduce automated software called MetaBAT that integrates empirical probabilistic distances of genome abundance and tetranucleotide frequency for accurate metagenome binning. MetaBAT outperforms alternative methods in accuracy and computational efficiency on both synthetic and real metagenome datasets. Lastly, it automatically forms hundreds of high quality genome bins on a very large assembly consisting millions of contigs in a matter of hours on a single node. MetaBAT is open source software and available at https://bitbucket.org/berkeleylab/metabat.

  4. MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities

    DOE PAGES

    Kang, Dongwan D.; Froula, Jeff; Egan, Rob; ...

    2015-01-01

    Grouping large genomic fragments assembled from shotgun metagenomic sequences to deconvolute complex microbial communities, or metagenome binning, enables the study of individual organisms and their interactions. Because of the complex nature of these communities, existing metagenome binning methods often miss a large number of microbial species. In addition, most of the tools are not scalable to large datasets. Here we introduce automated software called MetaBAT that integrates empirical probabilistic distances of genome abundance and tetranucleotide frequency for accurate metagenome binning. MetaBAT outperforms alternative methods in accuracy and computational efficiency on both synthetic and real metagenome datasets. Lastly, it automatically formsmore » hundreds of high quality genome bins on a very large assembly consisting millions of contigs in a matter of hours on a single node. MetaBAT is open source software and available at https://bitbucket.org/berkeleylab/metabat.« less

  5. Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data

    PubMed Central

    2011-01-01

    Background Progress in genome sequencing is proceeding at an exponential pace, and several new algal genomes are becoming available every year. One of the challenges facing the community is the association of protein sequences encoded in the genomes with biological function. While most genome assembly projects generate annotations for predicted protein sequences, they are usually limited and integrate functional terms from a limited number of databases. Another challenge is the use of annotations to interpret large lists of 'interesting' genes generated by genome-scale datasets. Previously, these gene lists had to be analyzed across several independent biological databases, often on a gene-by-gene basis. In contrast, several annotation databases, such as DAVID, integrate data from multiple functional databases and reveal underlying biological themes of large gene lists. While several such databases have been constructed for animals, none is currently available for the study of algae. Due to renewed interest in algae as potential sources of biofuels and the emergence of multiple algal genome sequences, a significant need has arisen for such a database to process the growing compendiums of algal genomic data. Description The Algal Functional Annotation Tool is a web-based comprehensive analysis suite integrating annotation data from several pathway, ontology, and protein family databases. The current version provides annotation for the model alga Chlamydomonas reinhardtii, and in the future will include additional genomes. The site allows users to interpret large gene lists by identifying associated functional terms, and their enrichment. Additionally, expression data for several experimental conditions were compiled and analyzed to provide an expression-based enrichment search. A tool to search for functionally-related genes based on gene expression across these conditions is also provided. Other features include dynamic visualization of genes on KEGG pathway maps

  6. Nuclease-mediated genome editing: At the front-line of functional genomics technology.

    PubMed

    Sakuma, Tetsushi; Woltjen, Knut

    2014-01-01

    Genome editing with engineered endonucleases is rapidly becoming a staple method in developmental biology studies. Engineered nucleases permit random or designed genomic modification at precise loci through the stimulation of endogenous double-strand break repair. Homology-directed repair following targeted DNA damage is mediated by co-introduction of a custom repair template, allowing the derivation of knock-out and knock-in alleles in animal models previously refractory to classic gene targeting procedures. Currently there are three main types of customizable site-specific nucleases delineated by the source mechanism of DNA binding that guides nuclease activity to a genomic target: zinc-finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs), and clustered regularly interspaced short palindromic repeats (CRISPR). Among these genome engineering tools, characteristics such as the ease of design and construction, mechanism of inducing DNA damage, and DNA sequence specificity all differ, making their application complementary. By understanding the advantages and disadvantages of each method, one may make the best choice for their particular purpose.

  7. Cpf1 Is A Versatile Tool for CRISPR Genome Editing Across Diverse Species of Cyanobacteria

    PubMed Central

    Ungerer, Justin; Pakrasi, Himadri B.

    2016-01-01

    Cyanobacteria are the ideal organisms for the production of a wide range of bioproducts as they can convert CO2 directly into the desired end product using solar energy. Unfortunately, the engineering of cyanobacteria to create efficient cell factories has been impaired by the cumbersome genetic tools that are currently available for these organisms; especially when trying to accumulate multiple modifications. We sought to construct an efficient and precise tool for generating numerous markerless modifications in cyanobacteria using CRISPR technology and the alternative nuclease, Cpf1. In this study we demonstrate rapid engineering of markerless knock-ins, knock-outs and point mutations in each of three model cyanobacteria; Synechococcus, Synechocystis and Anabaena. The markerless nature of cpf1 genome editing will allow for complex genome modification that was not possible with previously existing technology while facilitating the development of cyanobacteria as highly modified biofactories. PMID:28000776

  8. Cpf1 Is A Versatile Tool for CRISPR Genome Editing Across Diverse Species of Cyanobacteria.

    PubMed

    Ungerer, Justin; Pakrasi, Himadri B

    2016-12-21

    Cyanobacteria are the ideal organisms for the production of a wide range of bioproducts as they can convert CO2 directly into the desired end product using solar energy. Unfortunately, the engineering of cyanobacteria to create efficient cell factories has been impaired by the cumbersome genetic tools that are currently available for these organisms; especially when trying to accumulate multiple modifications. We sought to construct an efficient and precise tool for generating numerous markerless modifications in cyanobacteria using CRISPR technology and the alternative nuclease, Cpf1. In this study we demonstrate rapid engineering of markerless knock-ins, knock-outs and point mutations in each of three model cyanobacteria; Synechococcus, Synechocystis and Anabaena. The markerless nature of cpf1 genome editing will allow for complex genome modification that was not possible with previously existing technology while facilitating the development of cyanobacteria as highly modified biofactories.

  9. Immersive virtual environment technology: a promising tool for future social and behavioral genomics research and practice.

    PubMed

    Persky, Susan; McBride, Colleen M

    2009-12-01

    Social and behavioral research needs to get started now if scientists are to direct genomic discoveries to address pressing public health problems. Advancing social and behavioral science will require innovative and rigorous communication methodologies that move researchers beyond reliance on traditional tools and their inherent limitations. One such emerging research tool is immersive virtual environment technology (virtual reality), a methodology that gives researchers the ability to maintain high experimental control and mundane realism of scenarios; portray and manipulate complex, abstract objects and concepts; and implement innovative implicit behavioral measurement. This report suggests the role that immersive virtual environment technology can play in furthering future research in genomics-related education, decision making, test intentions, behavior change, and health-care provider behaviors. Practical implementation and challenges are also discussed.

  10. Structure and Functional Studies on Dengue-2 Virus Genome

    DTIC Science & Technology

    1986-03-01

    AD_ _ _ Lfl oSTRUCTURE AND FUNCTIONAL STUDIES ON DENGUE -2 VIRUS GENOME 0Annual Report Radha Krishnan Padmanabhan, Ph.D. March 1, 1986 Supported by...Studies on Dengue -2 Virus Genome 12 PERSONAL AUTHOR(S) Radha Krishnan Padmanabhan 13a TYPE OF REPORT 1 3b TIME COVERED 14 DATE OF REPORT (Year, Month, Day...analysis of these clones totalling 06 01 14,586 nucleotides: Deduced amino acid sequences of dengue virI 19 ABSTRACT (Continue on reverse of

  11. Knocking down the obstacles to functional genomics data sharing

    PubMed Central

    Simpson, Kaylene J.; Smith, Jennifer A.

    2017-01-01

    This week, Scientific Data published a collection of eight papers that describe datasets from high-throughput functional genomics screens, primarily utilizing RNA interference (RNAi). The publications explore host-pathogen dependencies, innate immune response, disease pathways, and cell morphology and motility at the genome-level. All data, including raw images from the high content screens, are publically available in PubChem BioAssay, figshare, Harvard Dataverse or the Image Data Resource (IDR). Detailed data descriptors enable use of these data for analysis algorithm design, machine learning, data comparisons, as well as generating new scientific hypotheses. PMID:28248922

  12. Knocking down the obstacles to functional genomics data sharing.

    PubMed

    Simpson, Kaylene J; Smith, Jennifer A

    2017-03-01

    This week, Scientific Data published a collection of eight papers that describe datasets from high-throughput functional genomics screens, primarily utilizing RNA interference (RNAi). The publications explore host-pathogen dependencies, innate immune response, disease pathways, and cell morphology and motility at the genome-level. All data, including raw images from the high content screens, are publically available in PubChem BioAssay, figshare, Harvard Dataverse or the Image Data Resource (IDR). Detailed data descriptors enable use of these data for analysis algorithm design, machine learning, data comparisons, as well as generating new scientific hypotheses.

  13. Approaching Functions: Cabri Tools as Instruments of Semiotic Mediation

    ERIC Educational Resources Information Center

    Falcade, Rossana; Laborde, Colette; Mariotti, Maria Alessandra

    2007-01-01

    Assuming that dynamic features of Dynamic Geometry Software may provide a basic representation of both variation and functional dependency, and taking the Vygotskian perspective of semiotic mediation, a teaching experiment was designed with the aim of introducing students to the idea of function. This paper focuses on the use of the Trace tool and…

  14. Complete genome sequence of cyanobacterium Fischerella sp. NIES-3754, providing thermoresistant optogenetic tools.

    PubMed

    Hirose, Yuu; Fujisawa, Takatomo; Ohtsubo, Yoshiyuki; Katayama, Mitsunori; Misawa, Naomi; Wakazuki, Sachiko; Shimura, Yohei; Nakamura, Yasukazu; Kawachi, Masanobu; Yoshikawa, Hirofumi; Eki, Toshihiko; Kanesaki, Yu

    2016-02-20

    Cyanobacterial phytochrome-class photosensors are recently emerging optogenetic tools. We isolated Fischerella sp. strain NIES-3754 from hotspring at Suwa-shrine, Suwa, Nagano, Japan. We determined complete genome sequence of the NIES-3754 strain, which is composed of one chromosome and two putative replicons (total 5,826,863bp containing no gaps). We identified photosensor genes of 5 phytochromes and 9 cyanobacteriochromes, which will facilitate optogenetics of thermophile.

  15. A Semi-Automated Functional Test Data Analysis Tool

    SciTech Connect

    Xu, Peng; Haves, Philip; Kim, Moosung

    2005-05-01

    The growing interest in commissioning is creating a demand that will increasingly be met by mechanical contractors and less experienced commissioning agents. They will need tools to help them perform commissioning effectively and efficiently. The widespread availability of standardized procedures, accessible in the field, will allow commissioning to be specified with greater certainty as to what will be delivered, enhancing the acceptance and credibility of commissioning. In response, a functional test data analysis tool is being developed to analyze the data collected during functional tests for air-handling units. The functional test data analysis tool is designed to analyze test data, assess performance of the unit under test and identify the likely causes of the failure. The tool has a convenient user interface to facilitate manual entry of measurements made during a test. A graphical display shows the measured performance versus the expected performance, highlighting significant differences that indicate the unit is not able to pass the test. The tool is described as semiautomated because the measured data need to be entered manually, instead of being passed from the building control system automatically. However, the data analysis and visualization are fully automated. The tool is designed to be used by commissioning providers conducting functional tests as part of either new building commissioning or retro-commissioning, as well as building owners and operators interested in conducting routine tests periodically to check the performance of their HVAC systems.

  16. Integrative genomics--a basic and essential tool for the development of molecular medicine.

    PubMed

    Ostrowski, Jerzy

    2008-01-01

    Understanding the molecular mechanisms of disease requires the introduction of molecular diagnostics into medical practice. Current medicine employs only elements of molecular diagnostics, and usually on the scale of single genes. Medicine in the post-genomic era will utilize thousands of molecular markers associated with disease that are provided by high-throughput sequencing and functional genomic, proteomic and metabolomic studies. Such a spectrum of techniques will link clinical medicine based on molecularly oriented diagnostics with the prediction and prevention of disease. To achieve this task, large-scale and genome-wide biological and medical data must be combined with biostatistical analyses and bioinformatic modeling of biological systems. The collecting, cataloging and comparison of data from molecular studies and the subsequent development of conclusions create the fundamentals of systems biology. This highly complex analytical process reflects a new scientific paradigm called integrative genomics.

  17. Beyond Drosophila: RNAi in vivo and functional genomics in insects.

    PubMed

    Bellés, Xavier

    2010-01-01

    The increasing availability of insect genomes has revealed a large number of genes with unknown functions and the resulting problem of how to discover these functions. The RNA interference (RNAi) technique, which generates loss-of-function phenotypes by depletion of a chosen transcript, can help to overcome this challenge. RNAi can unveil the functions of new genes, lead to the discovery of new functions for old genes, and find the genes for old functions. Moreover, the possibility of studying the functions of homologous genes in different species can allow comparisons of the genetic networks regulating a given function in different insect groups, thereby facilitating an evolutionary insight into developmental processes. RNAi also has drawbacks and obscure points, however, such as those related to differences in species sensitivity. Disentangling these differences is one of the main challenges in the RNAi field.

  18. Dissecting diabetes/metabolic disease mechanisms using pluripotent stem cells and genome editing tools

    PubMed Central

    Teo, Adrian Kee Keong; Gupta, Manoj K.; Doria, Alessandro; Kulkarni, Rohit N.

    2015-01-01

    Background Diabetes and metabolic syndromes are chronic, devastating diseases with increasing prevalence. Human pluripotent stem cells are gaining popularity in their usage for human in vitro disease modeling. With recent rapid advances in genome editing tools, these cells can now be genetically manipulated with relative ease to study how genes and gene variants contribute to diabetes and metabolic syndromes. Scope of review We highlight the diabetes and metabolic genes and gene variants, which could potentially be studied, using two powerful technologies – human pluripotent stem cells (hPSCs) and genome editing tools – to aid the elucidation of yet elusive mechanisms underlying these complex diseases. Major conclusions hPSCs and the advancing genome editing tools appear to be a timely and potent combination for probing molecular mechanism(s) underlying diseases such as diabetes and metabolic syndromes. The knowledge gained from these hiPSC-based disease modeling studies can potentially be translated into the clinics by guiding clinicians on the appropriate type of medication to use for each condition based on the mechanism of action of the disease. PMID:26413465

  19. Genomic islands predict functional adaptation in marine actinobacteria

    SciTech Connect

    Penn, Kevin; Jenkins, Caroline; Nett, Markus; Udwary, Daniel; Gontang, Erin; McGlinchey, Ryan; Foster, Brian; Lapidus, Alla; Podell, Sheila; Allen, Eric; Moore, Bradley; Jensen, Paul

    2009-04-01

    Linking functional traits to bacterial phylogeny remains a fundamental but elusive goal of microbial ecology 1. Without this information, it becomes impossible to resolve meaningful units of diversity and the mechanisms by which bacteria interact with each other and adapt to environmental change. Ecological adaptations among bacterial populations have been linked to genomic islands, strain-specific regions of DNA that house functionally adaptive traits 2. In the case of environmental bacteria, these traits are largely inferred from bioinformatic or gene expression analyses 2, thus leaving few examples in which the functions of island genes have been experimentally characterized. Here we report the complete genome sequences of Salinispora tropica and S. arenicola, the first cultured, obligate marine Actinobacteria 3. These two species inhabit benthic marine environments and dedicate 8-10percent of their genomes to the biosynthesis of secondary metabolites. Despite a close phylogenetic relationship, 25 of 37 secondary metabolic pathways are species-specific and located within 21 genomic islands, thus providing new evidence linking secondary metabolism to ecological adaptation. Species-specific differences are also observed in CRISPR sequences, suggesting that variations in phage immunity provide fitness advantages that contribute to the cosmopolitan distribution of S. arenicola 4. The two Salinispora genomes have evolved by complex processes that include the duplication and acquisition of secondary metabolite genes, the products of which provide immediate opportunities for molecular diversification and ecological adaptation. Evidence that secondary metabolic pathways are exchanged by Horizontal Gene Transfer (HGT) yet are fixed among globally distributed populations 5 supports a functional role for their products and suggests that pathway acquisition represents a previously unrecognized force driving bacterial diversification

  20. Neocaridina denticulata: A Decapod Crustacean Model for Functional Genomics.

    PubMed

    Mykles, Donald L; Hui, Jerome H L

    2015-11-01

    A decapod crustacean model is needed for understanding the molecular mechanisms underlying physiological processes, such as reproduction, sex determination, molting and growth, immunity, regeneration, and response to stress. Criteria for selection are: life-history traits, adult size, availability and ease of culture, and genomics and genetic manipulation. Three freshwater species are considered: cherry shrimp, Neocaridina denticulata; red swamp crayfish, Procambarus clarkii; and redclaw crayfish, Cherax quadricarinatus. All three are readily available, reproduce year round, and grow rapidly. The crayfish species require more space for culture than does N. denticulata. The transparent cuticle of cherry shrimp provides for direct assessment of reproductive status, stage of molt, and tissue-specific expression of reporter genes, and facilitates screening of mutations affecting phenotype. Moreover, a preliminary genome of N. denticulata is available and efforts toward complete genome sequencing and transcriptome sequencing have been initiated. Neocaridina denticulata possesses the best combination of traits that make it most suitable as a model for functional genomics. The next step is to obtain the complete genome sequence and to develop molecular technologies for the screening of mutants and for manipulating tissue-specific gene expression.

  1. Marine invertebrate lipases: Comparative and functional genomic analysis.

    PubMed

    Rivera-Perez, Crisalejandra

    2015-09-01

    Lipases are key enzymes involved in lipid digestion, storage and mobilization of reserves during fasting or heightened metabolic demand. This is a highly conserved process, essential for survival. The genomes of five marine invertebrate species with distinctive digestive system were screened for the six major lipase families. The two most common families in marine invertebrates, the neutral an acid lipases, are also the main families in mammals and insects. The number of lipases varies two-fold across analyzed genomes. A high degree of orthology with mammalian lipases was observed. Interestingly, 19% of the marine invertebrate lipases have lost motifs required for catalysis. Analysis of the lid and loop regions of the neutral lipases suggests that many marine invertebrates have a functional triacylglycerol hydrolytic activity as well as some acid lipases. A revision of the expression profiles and functional activity on sequences in databases and scientific literature provided information regarding the function of these families of enzymes in marine invertebrates.

  2. Recent advances in tomato functional genomics: utilization of VIGS.

    PubMed

    Sahu, Pranav Pankaj; Puranik, Swati; Khan, Moinuddin; Prasad, Manoj

    2012-10-01

    Tomato unquestionably occupies a significant position in world vegetable production owing to its world-wide consumption. The tomato genome sequencing efforts being recently concluded, it becomes more imperative to recognize important functional genes from this treasure of generated information for improving tomato yield. While much progress has been made in conventional tomato breeding, post-transcriptional gene silencing (PTGS) offers an alternative approach for advancement of tomato functional genomics. In particular, virus-induced gene silencing (VIGS) is increasingly being used as rapid, reliable, and lucrative screening strategy to elucidate gene function. In this review, we focus on the recent advancement made through exploiting the potential of this technique for manipulating different agronomically important traits in tomato by discussing several case studies.

  3. Comparative genomics and functional analysis of niche-specific adaptation in Pseudomonas putida

    SciTech Connect

    Wu X.; van der Lelie D.; Monchy, S.; Taghavi, S.; Zhu, W.; Ramos, J.

    2011-03-01

    Pseudomonas putida is a gram-negative rod-shaped gammaproteobacterium that is found throughout various environments. Members of the species P. putida show a diverse spectrum of metabolic activities, which is indicative of their adaptation to various niches, which includes the ability to live in soils and sediments contaminated with high concentrations of heavy metals and organic contaminants. Pseudomonas putida strains are also found as plant growth-promoting rhizospheric and endophytic bacteria. The genome sequences of several P. putida species have become available and provide a unique tool to study the specific niche adaptation of the various P. putida strains. In this review, we compare the genomes of four P. putida strains: the rhizospheric strain KT2440, the endophytic strain W619, the aromatic hydrocarbon-degrading strain F1 and the manganese-oxidizing strain GB-1. Comparative genomics provided a powerful tool to gain new insights into the adaptation of P. putida to specific lifestyles and environmental niches, and clearly demonstrated that horizontal gene transfer played a key role in this adaptation process, as many of the niche-specific functions were found to be encoded on clearly defined genomic islands.

  4. A Tool for Multiple Targeted Genome Deletions that Is Precise, Scar-Free, and Suitable for Automation.

    PubMed

    Aubrey, Wayne; Riley, Michael C; Young, Michael; King, Ross D; Oliver, Stephen G; Clare, Amanda

    2015-01-01

    Many advances in synthetic biology require the removal of a large number of genomic elements from a genome. Most existing deletion methods leave behind markers, and as there are a limited number of markers, such methods can only be applied a fixed number of times. Deletion methods that recycle markers generally are either imprecise (remove untargeted sequences), or leave scar sequences which can cause genome instability and rearrangements. No existing marker recycling method is automation-friendly. We have developed a novel openly available deletion tool that consists of: 1) a method for deleting genomic elements that can be repeatedly used without limit, is precise, scar-free, and suitable for automation; and 2) software to design the method's primers. Our tool is sequence agnostic and could be used to delete large numbers of coding sequences, promoter regions, transcription factor binding sites, terminators, etc in a single genome. We have validated our tool on the deletion of non-essential open reading frames (ORFs) from S. cerevisiae. The tool is applicable to arbitrary genomes, and we provide primer sequences for the deletion of: 90% of the ORFs from the S. cerevisiae genome, 88% of the ORFs from S. pombe genome, and 85% of the ORFs from the L. lactis genome.

  5. A Tool for Multiple Targeted Genome Deletions that Is Precise, Scar-Free, and Suitable for Automation

    PubMed Central

    Aubrey, Wayne; Riley, Michael C.; Young, Michael; King, Ross D.; Oliver, Stephen G.; Clare, Amanda

    2015-01-01

    Many advances in synthetic biology require the removal of a large number of genomic elements from a genome. Most existing deletion methods leave behind markers, and as there are a limited number of markers, such methods can only be applied a fixed number of times. Deletion methods that recycle markers generally are either imprecise (remove untargeted sequences), or leave scar sequences which can cause genome instability and rearrangements. No existing marker recycling method is automation-friendly. We have developed a novel openly available deletion tool that consists of: 1) a method for deleting genomic elements that can be repeatedly used without limit, is precise, scar-free, and suitable for automation; and 2) software to design the method’s primers. Our tool is sequence agnostic and could be used to delete large numbers of coding sequences, promoter regions, transcription factor binding sites, terminators, etc in a single genome. We have validated our tool on the deletion of non-essential open reading frames (ORFs) from S. cerevisiae. The tool is applicable to arbitrary genomes, and we provide primer sequences for the deletion of: 90% of the ORFs from the S. cerevisiae genome, 88% of the ORFs from S. pombe genome, and 85% of the ORFs from the L. lactis genome. PMID:26630677

  6. Ensembl Plants: Integrating Tools for Visualizing, Mining, and Analyzing Plant Genomics Data.

    PubMed

    Bolser, Dan; Staines, Daniel M; Pritchard, Emily; Kersey, Paul

    2016-01-01

    Ensembl Plants ( http://plants.ensembl.org ) is an integrative resource presenting genome-scale information for a growing number of sequenced plant species (currently 33). Data provided includes genome sequence, gene models, functional annotation, and polymorphic loci. Various additional information are provided for variation data, including population structure, individual genotypes, linkage, and phenotype data. In each release, comparative analyses are performed on whole genome and protein sequences, and genome alignments and gene trees are made available that show the implied evolutionary history of each gene family. Access to the data is provided through a genome browser incorporating many specialist interfaces for different data types, and through a variety of additional methods for programmatic access and data mining. These access routes are consistent with those offered through the Ensembl interface for the genomes of non-plant species, including those of plant pathogens, pests, and pollinators.Ensembl Plants is updated 4-5 times a year and is developed in collaboration with our international partners in the Gramene ( http://www.gramene.org ) and transPLANT projects ( http://www.transplantdb.org ).

  7. A profile-based method for identifying functional divergence of orthologous genes in bacterial genomes

    PubMed Central

    Wheeler, Nicole E.; Barquist, Lars; Kingsley, Robert A.; Gardner, Paul P.

    2016-01-01

    Motivation: Next generation sequencing technologies have provided us with a wealth of information on genetic variation, but predicting the functional significance of this variation is a difficult task. While many comparative genomics studies have focused on gene flux and large scale changes, relatively little attention has been paid to quantifying the effects of single nucleotide polymorphisms and indels on protein function, particularly in bacterial genomics. Results: We present a hidden Markov model based approach we call delta-bitscore (DBS) for identifying orthologous proteins that have diverged at the amino acid sequence level in a way that is likely to impact biological function. We benchmark this approach with several widely used datasets and apply it to a proof-of-concept study of orthologous proteomes in an investigation of host adaptation in Salmonella enterica. We highlight the value of the method in identifying functional divergence of genes, and suggest that this tool may be a better approach than the commonly used dN/dS metric for identifying functionally significant genetic changes occurring in recently diverged organisms. Availability and Implementation: A program implementing DBS for pairwise genome comparisons is freely available at: https://github.com/UCanCompBio/deltaBS. Contact: nicole.wheeler@pg.canterbury.ac.nz or lars.barquist@uni-wuerzburg.de Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27503221

  8. CTDB: An Integrated Chickpea Transcriptome Database for Functional and Applied Genomics

    PubMed Central

    Patel, Ravi K.; Garg, Rohini; Jain, Mukesh

    2015-01-01

    Chickpea is an important grain legume used as a rich source of protein in human diet. The narrow genetic diversity and limited availability of genomic resources are the major constraints in implementing breeding strategies and biotechnological interventions for genetic enhancement of chickpea. We developed an integrated Chickpea Transcriptome Database (CTDB), which provides the comprehensive web interface for visualization and easy retrieval of transcriptome data in chickpea. The database features many tools for similarity search, functional annotation (putative function, PFAM domain and gene ontology) search and comparative gene expression analysis. The current release of CTDB (v2.0) hosts transcriptome datasets with high quality functional annotation from cultivated (desi and kabuli types) and wild chickpea. A catalog of transcription factor families and their expression profiles in chickpea are available in the database. The gene expression data have been integrated to study the expression profiles of chickpea transcripts in major tissues/organs and various stages of flower development. The utilities, such as similarity search, ortholog identification and comparative gene expression have also been implemented in the database to facilitate comparative genomic studies among different legumes and Arabidopsis. Furthermore, the CTDB represents a resource for the discovery of functional molecular markers (microsatellites and single nucleotide polymorphisms) between different chickpea types. We anticipate that integrated information content of this database will accelerate the functional and applied genomic research for improvement of chickpea. The CTDB web service is freely available at http://nipgr.res.in/ctdb.html. PMID:26322998

  9. Use of Genomic Tools to Improve Cattle Health in the Context of Infectious Diseases

    PubMed Central

    Raszek, Mikolaj M.; Guan, Le L.; Plastow, Graham S.

    2016-01-01

    Although infectious diseases impose a heavy economic burden on the cattle industry, the etiology of many disorders that affect livestock is not fully elucidated, and effective countermeasures are often lacking. The main tools available until now have been vaccines, antibiotics and antiparasitic drugs. Although these have been very successful in some cases, the appearance of parasite and microbial resistance to these treatments is a cause of concern. Next-generation sequencing provides important opportunities to tackle problems associated with pathogenic illnesses. This review describes the rapid gains achieved to track disease progression, identify the pathogens involved, and map pathogen interactions with the host. Use of novel genomic tools subsequently aids in treatment development, as well as successful creation of breeding programs aimed toward less susceptible livestock. These may be important tools for mitigating the long term effects of combating infection and helping reduce the reliance on antibiotic treatment. PMID:27014337

  10. Genome-wide functional analysis in Candida albicans.

    PubMed

    Motaung, Thabiso E; Ells, Ruan; Pohl, Carolina H; Albertyn, Jacobus; Tsilo, Toi J

    2017-02-08

    Candida albicans is an important etiological agent of superficial and life-threatening infections in individuals with compromised immune systems. To date, we know of several overlapping genetic networks that govern virulence attributes in this fungal pathogen. Classical use of deletion mutants has led to the discovery of numerous virulence factors over the years, and genome-wide functional analysis has propelled gene discovery at an even faster pace. Indeed, a number of recent studies using large-scale genetic screens followed by genome-wide functional analysis has allowed for the unbiased discovery of many new genes involved in C. albicans biology. Here we share our perspectives on the role of these studies in analyzing fundamental aspects of C. albicans virulence properties.

  11. Relaxation of yeast mitochondrial functions after whole-genome duplication

    PubMed Central

    Jiang, Huifeng; Guan, Wenjun; Pinney, David; Wang, Wen; Gu, Zhenglong

    2008-01-01

    Mitochondria are essential for cellular energy production in most eukaryotic organisms. However, when glucose is abundant, yeast species that underwent whole-genome duplication (WGD) mostly conduct fermentation even under aerobic conditions, and most can survive without a functional mitochondrial genome. In this study, we show that the rate of evolution for the nuclear-encoded mitochondrial genes was greater in post-WGD species than pre-WGD species. Furthermore, codon usage bias was relaxed for these genes in post-WGD yeast species. The codon usage pattern and the distribution of a particular transcription regulatory element suggest that the change to an efficient aerobic fermentation lifestyle in this lineage might have emerged after WGD between the divergence of Kluyveromyces polysporus and Saccharomyces castellii from their common ancestor. This new energy production strategy could have led to the relaxation of mitochondrial function in the relevant yeast species. PMID:18669479

  12. The ethics of functional genomics: same, same, but different?

    PubMed

    Hoeyer, Klaus; Koch, Lene

    2006-09-01

    Respect for human life--a notion of worth uniting all members of the human race--constitutes a sense of anthropocentrism that has long been the justification for the enrollment of animals in experimentation executed to develop therapies to alleviate human suffering. Currently, however, advances in functional genomics are causing a qualitative transformation of the rationale for medical research performed on animals. The notion of human distinctness is being fundamentally challenged when gene sequences similar to those found in humans are identified in different species. In this Opinion article, we would like to highlight an inherent tension brought about by the current developments in functional genomics: a tension between the scientific and the ethical status of gene sequences. Is it reasonable to argue that they are the same for all practical purposes but different in ethical status?

  13. Genomic Tools for Customized Recovery and Detection of Foodborne Shiga Toxigenic Escherichia coli.

    PubMed

    Knowles, Michael; Stinson, Sara; Lambert, Dominic; Carrillo, Catherine; Koziol, Adam; Gauthier, Martine; Blais, Burton

    2016-12-01

    Genomic antimicrobial resistance (AMR) prediction tools have the potential to support foodborne illness outbreak investigations through their application in the analysis of bacterial genomes from causative strains. The AMR marker profile of a strain of interest, initially identified in outbreak-associated clinical samples, may serve as the basis for customization of selective enrichment media, facilitating its recovery from samples in a food safety investigation. Different possibilities for AMR analyses include the use of comprehensive AMR gene databases such as the Comprehensive Antibiotic Resistance Database, which can be mined with in-house bioinformatics alignment tools (e.g., Antimicrobial Resistance Marker Identifier), or publicly available tools based on clinically relevant acquired AMR gene databases (e.g., ResFinder). In combination with a previously reported pipeline (SigSeekr) designed to identify specific DNA sequences associated with a particular strain for its rapid identification by PCR, it should be possible to deploy custom recovery and identification tools for the efficient detection of priority pathogens such as Shiga toxigenic Escherichia coli (STEC) outbreak strains within the time frame of an active investigation. Using a laboratory STEC strain as a model, trimethoprim resistance identified by both Antimicrobial Resistance Marker Identifier and ResFinder was used as the basis for its selective recovery against a background of commensal E. coli bacteria in ground beef samples. Enrichment in modified tryptic soy broth containing trimethoprim greatly enhanced the recovery of low numbers of model strain cells inoculated in ground beef samples, as verified by the enumeration of colonies on plating media using a strain-specific PCR method to determine the recovery efficiency for the target strain. We discuss the relative merits of different AMR marker prediction tools for this purpose and describe how such tools can be utilized to good effect in a

  14. The infectious BAC genomic DNA expression library: a high capacity vector system for functional genomics

    PubMed Central

    Lufino, Michele M. P.; Edser, Pauline A. H.; Quail, Michael A.; Rice, Stephen; Adams, David J.; Wade-Martins, Richard

    2016-01-01

    Gene dosage plays a critical role in a range of cellular phenotypes, yet most cellular expression systems use heterologous cDNA-based vectors which express proteins well above physiological levels. In contrast, genomic DNA expression vectors generate physiologically-relevant levels of gene expression by carrying the whole genomic DNA locus of a gene including its regulatory elements. Here we describe the first genomic DNA expression library generated using the high-capacity herpes simplex virus-1 amplicon technology to deliver bacterial artificial chromosomes (BACs) into cells by viral transduction. The infectious BAC (iBAC) library contains 184,320 clones with an average insert size of 134.5 kb. We show in a Chinese hamster ovary (CHO) disease model cell line and mouse embryonic stem (ES) cells that this library can be used for genetic rescue studies in a range of contexts including the physiological restoration of Ldlr deficiency, and viral receptor expression. The iBAC library represents an important new genetic analysis tool openly available to the research community. PMID:27353647

  15. RNA-Mediated Silencing in Algae: Biological Roles and Tools for Analysis of Gene Function

    PubMed Central

    Cerutti, Heriberto; Ma, Xinrong; Msanne, Joseph; Repas, Timothy

    2011-01-01

    Algae are a large group of aquatic, typically photosynthetic, eukaryotes that include species from very diverse phylogenetic lineages, from those similar to land plants to those related to protist parasites. The recent sequencing of several algal genomes has provided insights into the great complexity of these organisms. Genomic information has also emphasized our lack of knowledge of the functions of many predicted genes, as well as the gene regulatory mechanisms in algae. Core components of the machinery for RNA-mediated silencing show widespread distribution among algal lineages, but they also seem to have been lost entirely from several species with relatively small nuclear genomes. Complex sets of endogenous small RNAs, including candidate microRNAs and small interfering RNAs, have now been identified by high-throughput sequencing in green, red, and brown algae. However, the natural roles of RNA-mediated silencing in algal biology remain poorly understood. Limited evidence suggests that small RNAs may function, in different algae, in defense mechanisms against transposon mobilization, in responses to nutrient deprivation and, possibly, in the regulation of recently evolved developmental processes. From a practical perspective, RNA interference (RNAi) is becoming a promising tool for assessing gene function by sequence-specific knockdown. Transient gene silencing, triggered with exogenously synthesized nucleic acids, and/or stable gene repression, involving genome-integrated transgenes, have been achieved in green algae, diatoms, yellow-green algae, and euglenoids. The development of RNAi technology in conjunction with system level “omics” approaches may provide the tools needed to advance our understanding of algal physiological and metabolic processes. PMID:21803865

  16. Functional Genomics for Epithelial-Mesenchymal Transition in Breast Cancer

    DTIC Science & Technology

    2012-09-01

    leads for therapeutic targeting in breast cancer. We are employing the high throughput functional genomic screens using epithelial mesenchymal...of sequencing from in vitro and in vivo hits in stream. We anticipate completion in the coming year. Body Task 1: To identify gene products...focus on Vim induction at the invasive edge of formed tumors generated by shRNA transduction. Task 2: To identify gene products that may

  17. Unlocking Holocentric Chromosomes: New Perspectives from Comparative and Functional Genomics?

    PubMed Central

    Mandrioli, Mauro; Manicardi, Gian Carlo

    2012-01-01

    The presence of chromosomes with diffuse centromeres (holocentric chromosomes) has been reported in several taxa since more than fifty years, but a full understanding of their origin is still lacking. Comparative and functional genomics are nowadays furnishing new data to better understand holocentric chromosome evolution thus opening new perspectives to analyse karyotype rearrangements in species with holocentric chromosomes in particular evidencing unusual common features, such as the uniform GC content and gene distribution along chromosomes. PMID:23372420

  18. The Princeton Protein Orthology Database (P-POD): A Comparative Genomics Analysis Tool for Biologists

    PubMed Central

    Kang, Fan; Angiuoli, Samuel V.; White, Owen; Botstein, David; Dolinski, Kara

    2007-01-01

    Many biological databases that provide comparative genomics information and tools are now available on the internet. While certainly quite useful, to our knowledge none of the existing databases combine results from multiple comparative genomics methods with manually curated information from the literature. Here we describe the Princeton Protein Orthology Database (P-POD, http://ortholog.princeton.edu), a user-friendly database system that allows users to find and visualize the phylogenetic relationships among predicted orthologs (based on the OrthoMCL method) to a query gene from any of eight eukaryotic organisms, and to see the orthologs in a wider evolutionary context (based on the Jaccard clustering method). In addition to the phylogenetic information, the database contains experimental results manually collected from the literature that can be compared to the computational analyses, as well as links to relevant human disease and gene information via the OMIM, model organism, and sequence databases. Our aim is for the P-POD resource to be extremely useful to typical experimental biologists wanting to learn more about the evolutionary context of their favorite genes. P-POD is based on the commonly used Generic Model Organism Database (GMOD) schema and can be downloaded in its entirety for installation on one's own system. Thus, bioinformaticians and software developers may also find P-POD useful because they can use the P-POD database infrastructure when developing their own comparative genomics resources and database tools. PMID:17712414

  19. Tools for Accurate and Efficient Analysis of Complex Evolutionary Mechanisms in Microbial Genomes. Final Report

    SciTech Connect

    Nakhleh, Luay

    2014-03-12

    I proposed to develop computationally efficient tools for accurate detection and reconstruction of microbes' complex evolutionary mechanisms, thus enabling rapid and accurate annotation, analysis and understanding of their genomes. To achieve this goal, I proposed to address three aspects. (1) Mathematical modeling. A major challenge facing the accurate detection of HGT is that of distinguishing between these two events on the one hand and other events that have similar "effects." I proposed to develop a novel mathematical approach for distinguishing among these events. Further, I proposed to develop a set of novel optimization criteria for the evolutionary analysis of microbial genomes in the presence of these complex evolutionary events. (2) Algorithm design. In this aspect of the project, I proposed to develop an array of e cient and accurate algorithms for analyzing microbial genomes based on the formulated optimization criteria. Further, I proposed to test the viability of the criteria and the accuracy of the algorithms in an experimental setting using both synthetic as well as biological data. (3) Software development. I proposed the nal outcome to be a suite of software tools which implements the mathematical models as well as the algorithms developed.

  20. Nucleotide diversity analysis highlights functionally important genomic regions.

    PubMed

    Tatarinova, Tatiana V; Chekalin, Evgeny; Nikolsky, Yuri; Bruskin, Sergey; Chebotarov, Dmitry; McNally, Kenneth L; Alexandrov, Nickolai

    2016-10-24

    We analyzed functionality and relative distribution of genetic variants across the complete Oryza sativa genome, using the 40 million single nucleotide polymorphisms (SNPs) dataset from the 3,000 Rice Genomes Project (http://snp-seek.irri.org), the largest and highest density SNP collection for any higher plant. We have shown that the DNA-binding transcription factors (TFs) are the most conserved group of genes, whereas kinases and membrane-localized transporters are the most variable ones. TFs may be conserved because they belong to some of the most connected regulatory hubs that modulate transcription of vast downstream gene networks, whereas signaling kinases and transporters need to adapt rapidly to changing environmental conditions. In general, the observed profound patterns of nucleotide variability reveal functionally important genomic regions. As expected, nucleotide diversity is much higher in intergenic regions than within gene bodies (regions spanning gene models), and protein-coding sequences are more conserved than untranslated gene regions. We have observed a sharp decline in nucleotide diversity that begins at about 250 nucleotides upstream of the transcription start and reaches minimal diversity exactly at the transcription start. We found the transcription termination sites to have remarkably symmetrical patterns of SNP density, implying presence of functional sites near transcription termination. Also, nucleotide diversity was significantly lower near 3' UTRs, the area rich with regulatory regions.

  1. Functional Genomics in the Study of Mind-Body Therapies

    PubMed Central

    Niles, Halsey; Mehta, Darshan H.; Corrigan, Alexandra A.; Bhasin, Manoj K.; Denninger, John W.

    2014-01-01

    Background Mind-body therapies (MBTs) are used throughout the world in treatment, disease prevention, and health promotion. However, the mechanisms by which MBTs exert their positive effects are not well understood. Investigations into MBTs using functional genomics have revolutionized the understanding of MBT mechanisms and their effects on human physiology. Methods We searched the literature for the effects of MBTs on functional genomics determinants using MEDLINE, supplemented by a manual search of additional journals and a reference list review. Results We reviewed 15 trials that measured global or targeted transcriptomic, epigenomic, or proteomic changes in peripheral blood. Sample sizes ranged from small pilot studies (n=2) to large trials (n=500). While the reliability of individual genes from trial to trial was often inconsistent, genes related to inflammatory response, particularly those involved in the nuclear factor-kappa B (NF-κB) pathway, were consistently downregulated across most studies. Conclusion In general, existing trials focusing on gene expression changes brought about by MBTs have revealed intriguing connections to the immune system through the NF-κB cascade, to telomere maintenance, and to apoptotic regulation. However, these findings are limited to a small number of trials and relatively small sample sizes. More rigorous randomized controlled trials of healthy subjects and specific disease states are warranted. Future research should investigate functional genomics areas both upstream and downstream of MBT-related gene expression changes—from epigenomics to proteomics and metabolomics. PMID:25598735

  2. Nucleotide diversity analysis highlights functionally important genomic regions

    PubMed Central

    Tatarinova, Tatiana V.; Chekalin, Evgeny; Nikolsky, Yuri; Bruskin, Sergey; Chebotarov, Dmitry; McNally, Kenneth L.; Alexandrov, Nickolai

    2016-01-01

    We analyzed functionality and relative distribution of genetic variants across the complete Oryza sativa genome, using the 40 million single nucleotide polymorphisms (SNPs) dataset from the 3,000 Rice Genomes Project (http://snp-seek.irri.org), the largest and highest density SNP collection for any higher plant. We have shown that the DNA-binding transcription factors (TFs) are the most conserved group of genes, whereas kinases and membrane-localized transporters are the most variable ones. TFs may be conserved because they belong to some of the most connected regulatory hubs that modulate transcription of vast downstream gene networks, whereas signaling kinases and transporters need to adapt rapidly to changing environmental conditions. In general, the observed profound patterns of nucleotide variability reveal functionally important genomic regions. As expected, nucleotide diversity is much higher in intergenic regions than within gene bodies (regions spanning gene models), and protein-coding sequences are more conserved than untranslated gene regions. We have observed a sharp decline in nucleotide diversity that begins at about 250 nucleotides upstream of the transcription start and reaches minimal diversity exactly at the transcription start. We found the transcription termination sites to have remarkably symmetrical patterns of SNP density, implying presence of functional sites near transcription termination. Also, nucleotide diversity was significantly lower near 3′ UTRs, the area rich with regulatory regions. PMID:27774999

  3. Functional Profiling Using the Saccharomyces Genome Deletion Project Collections.

    PubMed

    Nislow, Corey; Wong, Lai Hong; Lee, Amy Huei-Yi; Giaever, Guri

    2016-09-01

    The ability to measure and quantify the fitness of an entire organism requires considerably more complex approaches than simply using traditional "omic" methods that examine, for example, the abundance of RNA transcripts, proteins, or metabolites. The yeast deletion collections represent the only systematic, comprehensive set of null alleles for any organism in which such fitness measurements can be assayed. Generated by the Saccharomyces Genome Deletion Project, these collections allow the systematic and parallel analysis of gene functions using any measurable phenotype. The unique 20-bp molecular barcodes engineered into the genome of each deletion strain facilitate the massively parallel analysis of individual fitness. Here, we present functional genomic protocols for use with the yeast deletion collections. We describe how to maintain, propagate, and store the deletion collections and how to perform growth fitness assays on single and parallel screening platforms. Phenotypic fitness analyses of the yeast mutants, described in brief here, provide important insights into biological functions, mechanisms of drug action, and response to environmental stresses. It is important to bear in mind that the specific assays described in this protocol represent some of the many ways in which these collections can be assayed, and in this description particular attention is paid to maximizing throughput using growth as the phenotypic measure.

  4. MicrobesOnline: an integrated portal for comparative and functional genomics

    SciTech Connect

    Dehal, Paramvir S.; Joachimiak, Marcin P.; Price, Morgan N.; Bates, John T.; Baumohl, Jason K.; Chivian, Dylan; Friedland, Greg D.; Huang, Katherine H.; Keller, Keith; Novichkov, Pavel S.; Dubchak, Inna L.; Alm, Eric J.; Arkin, Adam P.

    2009-09-17

    Since 2003, MicrobesOnline (http://www.microbesonline.org) has been providing a community resource for comparative and functional genome analysis. The portal includes over 1000 complete genomes of bacteria, archaea and fungi and thousands of expression microarrays from diverse organisms ranging from model organisms such as Escherichia coli and Saccharomyces cerevisiae to environmental microbes such as Desulfovibrio vulgaris and Shewanella oneidensis. To assist in annotating genes and in reconstructing their evolutionary history, MicrobesOnline includes a comparative genome browser based on phylogenetic trees for every gene family as well as a species tree. To identify co-regulated genes, MicrobesOnline can search for genes based on their expression profile, and provides tools for identifying regulatory motifs and seeing if they are conserved. MicrobesOnline also includes fast phylogenetic profile searches, comparative views of metabolic pathways, operon predictions, a workbench for sequence analysis and integration with RegTransBase and other microbial genome resources. The next update of MicrobesOnline will contain significant new functionality, including comparative analysis of metagenomic sequence data. Programmatic access to the database, along with source code and documentation, is available at http://microbesonline.org/programmers.html.

  5. MicrobesOnline: an integrated portal for comparative and functional genomics

    SciTech Connect

    Dehal, Paramvir; Joachimiak, Marcin; Price, Morgan; Bates, John; Baumohl, Jason; Chivian, Dylan; Friedland, Greg; Huang, Kathleen; Keller, Keith; Novichkov, Pavel; Dubchak, Inna; Alm, Eric; Arkin, Adam

    2011-07-14

    Since 2003, MicrobesOnline (http://www.microbesonline.org) has been providing a community resource for comparative and functional genome analysis. The portal includes over 1000 complete genomes of bacteria, archaea and fungi and thousands of expression microarrays from diverse organisms ranging from model organisms such as Escherichia coli and Saccharomyces cerevisiae to environmental microbes such as Desulfovibrio vulgaris and Shewanella oneidensis. To assist in annotating genes and in reconstructing their evolutionary history, MicrobesOnline includes a comparative genome browser based on phylogenetic trees for every gene family as well as a species tree. To identify co-regulated genes, MicrobesOnline can search for genes based on their expression profile, and provides tools for identifying regulatory motifs and seeing if they are conserved. MicrobesOnline also includes fast phylogenetic profile searches, comparative views of metabolic pathways, operon predictions, a workbench for sequence analysis and integration with RegTransBase and other microbial genome resources. The next update of MicrobesOnline will contain significant new functionality, including comparative analysis of metagenomic sequence data. Programmatic access to the database, along with source code and documentation, is available at http://microbesonline.org/programmers.html.

  6. Beyond the genome: functional studies of phototrophic sulfur oxidation.

    PubMed

    Hanson, Thomas E; Morgan-Kiss, Rachael M; Chan, Leong-Keat; Hiras, Jennifer

    2010-01-01

    The increasing availability of complete genomic sequences for cultured phototrophic bacteria and assembled metagenomes from environments dominated by phototrophs has reinforced the need for a "post-genomic" analytical effort to test models of cellular structure and function proposed from genomic data. Comparative genomics has produced a testable model for pathways of sulfur compound oxidation in the phototrophic bacteria. In the case of sulfide, two enzymes are predicted to oxidize sulfide: sulfide:quinone oxidoreductase and flavocytochrome c sulfide dehydrogenase. However, these models do not predict which enzyme is important under what conditions. In Chlorobaculum tepidum, a model green sulfur bacterium, a combination of genetics and physiological analysis of mutant strains has led to the realization that this organism contains at least two active sulfide:quinone oxidoreductases and that there is significant interaction between sulfide oxidation and light harvesting. In the case of elemental sulfur, an organothiol intermediate of unknown structure has been proposed to activate elemental sulfur for transport into the cytoplasm where it can be oxidized or assimilated, and recent approaches using classical metabolite analysis have begun to shed light on this issue both in C. tepidum and the purple sulfur bacterium Allochromatium vinosum.

  7. Functional Genomics of Drought Tolerance in Bioenergy Crops

    SciTech Connect

    Yin, Hengfu; Chen, Rick; Yang, Jun; Weston, David; Chen, Jay; Muchero, Wellington; Ye, Ning; Tschaplinski, Timothy J; Wullschleger, Stan D; Cheng, Zong-Ming; Tuskan, Gerald A; Yang, Xiaohan

    2014-01-01

    With the predicted trends in climate change, drought will increasingly impose a grand challenge to biomass production. Most of the bioenergy crops have some degree of drought susceptibility with low water-use efficiency (WUE). It is imperative to improve drought tolerance and WUE in bioenergy crops for sustainable biomass production in arid and semi-arid regions with minimal water input. Genetics and functional genomics can play a critical role in generating knowledge to inform and aid genetic improvement of drought tolerance in bioenergy crops. The molecular aspect of drought response has been extensively investigated in model plants like Arabidopsis, yet our understanding of the molecular mechanisms underlying drought tolerance in bioenergy crops are limited. Crops exhibit various responses to drought stress depending on species and genotype. A rational strategy for studying drought tolerance in bioenergy crops is to translate the knowledge from model plants and pinpoint the unique features associated with individual species and genotypes. In this review, we summarize the general knowledge about drought responsive pathways in plants, with a focus on the identification of commonality and specialty in drought responsive mechanisms among different species and/or genotypes. We describe the genomic resources developed for bioenergy crops and discuss genetic and epigenetic regulation of drought responses. We also examine comparative and evolutionary genomics to leverage the ever-increasing genomics resources and provide new insights beyond what has been known from studies on individual species. Finally, we outline future exploration of drought tolerance using the emerging new technologies.

  8. MEASURING COGNITIVE FUNCTION IN MDD: EMERGING ASSESSMENT TOOLS

    PubMed Central

    Russo, Manuela; Mahon, Katie; Burdick, Katherine E

    2015-01-01

    Cognitive impairment is emerging as an important therapeutic target in patients with psychiatric illnesses, including major depressive disorder (MDD). The objective of this general overview is to briefly review the evidence for cognitive impairment in MDD and to summarize a representative sample of cognitive assessment tools currently available to assess cognitive function in depressed patients. Study results in MDD patients with cognitive dysfunction are somewhat inconsistent, likely due to the heterogeneity of the disorder as well as the use of diverse assessment tools. Measuring cognitive changes in this population is challenging. Cognitive symptoms are typically less severe than in patients with schizophrenia and bipolar disorder, requiring greater sensitivity than afforded by existing tools. Preliminary evidence suggests antidepressant treatments may improve cognitive functioning as a direct result of ameliorating depressive symptoms; however, any procognitive effects have not been elucidated. To evaluate antidepressant efficacy in MDD patients with cognitive dysfunction, a standardized cognitive battery for use in clinical trials is essential. PMID:25421437

  9. Functional genomics of bio-energy plants and related patent activities.

    PubMed

    Jiang, Shu-Ye; Ramachandran, Srinivasan

    2013-04-01

    With dwindling fossil oil resources and increased economic growth of many developing countries due to globalization, energy driven from an alternative source such as bio-energy in a sustainable fashion is the need of the hour. However, production of energy from biological source is relatively expensive due to low starch and sugar contents of bioenergy plants leading to lower oil yield and reduced quality along with lower conversion efficiency of feedstock. In this context genetic improvement of bio-energy plants offers a viable solution. In this manuscript, we reviewed the current status of functional genomics studies and related patent activities in bio-energy plants. Currently, genomes of considerable bio-energy plants have been sequenced or are in progress and also large amount of expression sequence tags (EST) or cDNA sequences are available from them. These studies provide fundamental data for more reliable genome annotation and as a result, several genomes have been annotated in a genome-wide level. In addition to this effort, various mutagenesis tools have also been employed to develop mutant populations for characterization of genes that are involved in bioenergy quantitative traits. With the progress made on functional genomics of important bio-energy plants, more patents were filed with a significant number of them focusing on genes and DNA sequences which may involve in improvement of bio-energy traits including higher yield and quality of starch, sugar and oil. We also believe that these studies will lead to the generation of genetically altered plants with improved tolerance to various abiotic and biotic stresses.

  10. Annotating the Function of the Human Genome with Gene Ontology and Disease Ontology

    PubMed Central

    Hu, Yang; Zhou, Wenyang; Ren, Jun; Dong, Lixiang

    2016-01-01

    Increasing evidences indicated that function annotation of human genome in molecular level and phenotype level is very important for systematic analysis of genes. In this study, we presented a framework named Gene2Function to annotate Gene Reference into Functions (GeneRIFs), in which each functional description of GeneRIFs could be annotated by a text mining tool Open Biomedical Annotator (OBA), and each Entrez gene could be mapped to Human Genome Organisation Gene Nomenclature Committee (HGNC) gene symbol. After annotating all the records about human genes of GeneRIFs, 288,869 associations between 13,148 mRNAs and 7,182 terms, 9,496 associations between 948 microRNAs and 533 terms, and 901 associations between 139 long noncoding RNAs (lncRNAs) and 297 terms were obtained as a comprehensive annotation resource of human genome. High consistency of term frequency of individual gene (Pearson correlation = 0.6401, p = 2.2e − 16) and gene frequency of individual term (Pearson correlation = 0.1298, p = 3.686e − 14) in GeneRIFs and GOA shows our annotation resource is very reliable. PMID:27635398

  11. Annotating the Function of the Human Genome with Gene Ontology and Disease Ontology.

    PubMed

    Hu, Yang; Zhou, Wenyang; Ren, Jun; Dong, Lixiang; Wang, Yadong; Jin, Shuilin; Cheng, Liang

    2016-01-01

    Increasing evidences indicated that function annotation of human genome in molecular level and phenotype level is very important for systematic analysis of genes. In this study, we presented a framework named Gene2Function to annotate Gene Reference into Functions (GeneRIFs), in which each functional description of GeneRIFs could be annotated by a text mining tool Open Biomedical Annotator (OBA), and each Entrez gene could be mapped to Human Genome Organisation Gene Nomenclature Committee (HGNC) gene symbol. After annotating all the records about human genes of GeneRIFs, 288,869 associations between 13,148 mRNAs and 7,182 terms, 9,496 associations between 948 microRNAs and 533 terms, and 901 associations between 139 long noncoding RNAs (lncRNAs) and 297 terms were obtained as a comprehensive annotation resource of human genome. High consistency of term frequency of individual gene (Pearson correlation = 0.6401, p = 2.2e - 16) and gene frequency of individual term (Pearson correlation = 0.1298, p = 3.686e - 14) in GeneRIFs and GOA shows our annotation resource is very reliable.

  12. Genetic Simulation Tools for Post-Genome Wide Association Studies of Complex Diseases

    PubMed Central

    Amos, Christopher I.; Bafna, Vineet; Hauser, Elizabeth R.; Hernandez, Ryan D.; Li, Chun; Liberles, David A.; McAllister, Kimberly; Moore, Jason H.; Paltoo, Dina N.; Papanicolaou, George J.; Peng, Bo; Ritchie, Marylyn D.; Rosenfeld, Gabriel; Witte, John S.

    2014-01-01

    Genetic simulation programs are used to model data under specified assumptions to facilitate the understanding and study of complex genetic systems. Standardized data sets generated using genetic simulation are essential for the development and application of novel analytical tools in genetic epidemiology studies. With continuing advances in high-throughput genomic technologies and generation and analysis of larger, more complex data sets, there is a need for updating current approaches in genetic simulation modeling. To provide a forum to address current and emerging challenges in this area, the National Cancer Institute (NCI) sponsored a workshop, entitled “Genetic Simulation Tools for Post-Genome Wide Association Studies of Complex Diseases” at the National Institutes of Health (NIH) in Bethesda, Maryland on March 11-12, 2014. The goals of the workshop were to: (i) identify opportunities, challenges and resource needs for the development and application of genetic simulation models; (ii) improve the integration of tools for modeling and analysis of simulated data; and (iii) foster collaborations to facilitate development and applications of genetic simulation. During the course of the meeting the group identified challenges and opportunities for the science of simulation, software and methods development, and collaboration. This paper summarizes key discussions at the meeting, and highlights important challenges and opportunities to advance the field of genetic simulation. PMID:25371374

  13. A collection of enhancer trap insertional mutants for functional genomics in tomato.

    PubMed

    Pérez-Martín, Fernando; Yuste-Lisbona, Fernando J; Pineda, Benito; Angarita-Díaz, María Pilar; García-Sogo, Begoña; Antón, Teresa; Sánchez, Sibilla; Giménez, Estela; Atarés, Alejandro; Fernández-Lozano, Antonia; Ortíz-Atienza, Ana; García-Alcázar, Manuel; Castañeda, Laura; Fonseca, Rocío; Capel, Carmen; Goergen, Geraldine; Sánchez, Jorge; Quispe, Jorge L; Capel, Juan; Angosto, Trinidad; Moreno, Vicente; Lozano, Rafael

    2017-03-19

    With the completion of genome sequencing projects, the next challenge is to close the gap between gene annotation and gene functional assignment. Genomic tools to identify gene functions are based on the analysis of phenotypic variations between a wild-type and its mutant; hence, mutant collections are a valuable resource. In this sense, T-DNA collections allow for an easy and straightforward identification of the tagged gene, serving as the basis of both forward and reverse genetic strategies. This study reports on the phenotypic and molecular characterization of an enhancer trap T-DNA collection in tomato (Solanum lycopersicum L.), which has been produced by Agrobacterium-mediated transformation using a binary vector bearing a minimal promoter fused to the uidA reporter gene. Two genes have been isolated from different T-DNA mutants, one of these genes codes for a UTP-glucose-1-phosphate uridylyltransferase involved in programmed cell death and leaf development, which means a novel gene function reported in tomato. Together, our results support that enhancer trapping is a powerful tool to identify novel genes and regulatory elements in tomato and that this T-DNA mutant collection represents a highly valuable resource for functional analyses in this fleshy-fruited model species. This article is protected by copyright. All rights reserved.

  14. GraP: platform for functional genomics analysis of Gossypium raimondii

    PubMed Central

    Zhang, Liwei; Guo, Jinyan; You, Qi; Yi, Xin; Ling, Yi; Xu, Wenying; Hua, Jinping; Su, Zhen

    2015-01-01

    Cotton (Gossypium spp.) is one of the most important natural fiber and oil crops worldwide. Improvement of fiber yield and quality under changing environments attract much attention from cotton researchers; however, a functional analysis platform integrating omics data is still missing. The success of cotton genome sequencing and large amount of available transcriptome data allows the opportunity to establish a comprehensive analysis platform for integrating these data and related information. A comprehensive database, Platform of Functional Genomics Analysis in Gossypium raimondii (GraP), was constructed to provide multi-dimensional analysis, integration and visualization tools. GraP includes updated functional annotation, gene family classifications, protein–protein interaction networks, co-expression networks and microRNA–target pairs. Moreover, gene set enrichment analysis and cis-element significance analysis tools are also provided for gene batch analysis of high-throughput data sets. Based on these effective services, GraP may offer further information for subsequent studies of functional genes and in-depth analysis of high-throughput data. GraP is publically accessible at http://structuralbiology.cau.edu.cn/GraP/, with all data available for downloading. PMID:25982315

  15. ARG-ANNOT, a New Bioinformatic Tool To Discover Antibiotic Resistance Genes in Bacterial Genomes

    PubMed Central

    Gupta, Sushim Kumar; Padmanabhan, Babu Roshan; Diene, Seydina M.; Lopez-Rojas, Rafael; Kempf, Marie; Landraud, Luce

    2014-01-01

    ARG-ANNOT (Antibiotic Resistance Gene-ANNOTation) is a new bioinformatic tool that was created to detect existing and putative new antibiotic resistance (AR) genes in bacterial genomes. ARG-ANNOT uses a local BLAST program in Bio-Edit software that allows the user to analyze sequences without a Web interface. All AR genetic determinants were collected from published works and online resources; nucleotide and protein sequences were retrieved from the NCBI GenBank database. After building a database that includes 1,689 antibiotic resistance genes, the software was tested in a blind manner using 100 random sequences selected from the database to verify that the sensitivity and specificity were at 100% even when partial sequences were queried. Notably, BLAST analysis results obtained using the rmtF gene sequence (a new aminoglycoside-modifying enzyme gene sequence that is not included in the database) as a query revealed that the tool was able to link this sequence to short sequences (17 to 40 bp) found in other genes of the rmt family with significant E values. Finally, the analysis of 178 Acinetobacter baumannii and 20 Staphylococcus aureus genomes allowed the detection of a significantly higher number of AR genes than the Resfinder gene analyzer and 11 point mutations in target genes known to be associated with AR. The average time for the analysis of a genome was 3.35 ± 0.13 min. We have created a concise database for BLAST using a Bio-Edit interface that can detect AR genetic determinants in bacterial genomes and can rapidly and easily discover putative new AR genetic determinants. PMID:24145532

  16. Application of RNA interference in functional genomics studies of a social insect.

    PubMed

    Scharf, Michael E; Zhou, Xuguo; Schwinghammer, Margaret A

    2008-01-01

    Social insects represent a group of organisms that have dual importance from perspectives relating to both basic and applied science. From a basic perspective, social insects serve as excellent model systems for studying social organization, behavioral ecology, neurobiology, and phenotypic plasticity. From applied perspectives, social insects play important roles in the pollination of agricultural crops, in the damage of human structures and commodities, and in cellulose processing in natural ecosystems. With the advent of insect sociogenomics research (and the ability to identify dozens or hundreds of relevant candidate genes from a single experiment) has come a great demand for functional genomics tools for application in gene characterization. To date, RNAi is one of the most powerful tools to have become available for such functional characterizations, and it has broad relevance across a range of insect sociobiology research topics.

  17. Event-based text mining for biology and functional genomics

    PubMed Central

    Thompson, Paul; Nawaz, Raheel; McNaught, John; Kell, Douglas B.

    2015-01-01

    The assessment of genome function requires a mapping between genome-derived entities and biochemical reactions, and the biomedical literature represents a rich source of information about reactions between biological components. However, the increasingly rapid growth in the volume of literature provides both a challenge and an opportunity for researchers to isolate information about reactions of interest in a timely and efficient manner. In response, recent text mining research in the biology domain has been largely focused on the identification and extraction of ‘events’, i.e. categorised, structured representations of relationships between biochemical entities, from the literature. Functional genomics analyses necessarily encompass events as so defined. Automatic event extraction systems facilitate the development of sophisticated semantic search applications, allowing researchers to formulate structured queries over extracted events, so as to specify the exact types of reactions to be retrieved. This article provides an overview of recent research into event extraction. We cover annotated corpora on which systems are trained, systems that achieve state-of-the-art performance and details of the community shared tasks that have been instrumental in increasing the quality, coverage and scalability of recent systems. Finally, several concrete applications of event extraction are covered, together with emerging directions of research. PMID:24907365

  18. Whole-genome sequence-based analysis of thyroid function

    PubMed Central

    Taylor, Peter N.; Porcu, Eleonora; Chew, Shelby; Campbell, Purdey J.; Traglia, Michela; Brown, Suzanne J.; Mullin, Benjamin H.; Shihab, Hashem A.; Min, Josine; Walter, Klaudia; Memari, Yasin; Huang, Jie; Barnes, Michael R.; Beilby, John P.; Charoen, Pimphen; Danecek, Petr; Dudbridge, Frank; Forgetta, Vincenzo; Greenwood, Celia; Grundberg, Elin; Johnson, Andrew D.; Hui, Jennie; Lim, Ee M.; McCarthy, Shane; Muddyman, Dawn; Panicker, Vijay; Perry, John R.B.; Bell, Jordana T.; Yuan, Wei; Relton, Caroline; Gaunt, Tom; Schlessinger, David; Abecasis, Goncalo; Cucca, Francesco; Surdulescu, Gabriela L.; Woltersdorf, Wolfram; Zeggini, Eleftheria; Zheng, Hou-Feng; Toniolo, Daniela; Dayan, Colin M.; Naitza, Silvia; Walsh, John P.; Spector, Tim; Davey Smith, George; Durbin, Richard; Brent Richards, J.; Sanna, Serena; Soranzo, Nicole; Timpson, Nicholas J.; Wilson, Scott G.; Turki, Saeed Al; Anderson, Carl; Anney, Richard; Antony, Dinu; Artigas, Maria Soler; Ayub, Muhammad; Balasubramaniam, Senduran; Barrett, Jeffrey C.; Barroso, Inês; Beales, Phil; Bentham, Jamie; Bhattacharya, Shoumo; Birney, Ewan; Blackwood, Douglas; Bobrow, Martin; Bochukova, Elena; Bolton, Patrick; Bounds, Rebecca; Boustred, Chris; Breen, Gerome; Calissano, Mattia; Carss, Keren; Chatterjee, Krishna; Chen, Lu; Ciampi, Antonio; Cirak, Sebhattin; Clapham, Peter; Clement, Gail; Coates, Guy; Collier, David; Cosgrove, Catherine; Cox, Tony; Craddock, Nick; Crooks, Lucy; Curran, Sarah; Curtis, David; Daly, Allan; Day-Williams, Aaron; Day, Ian N.M.; Down, Thomas; Du, Yuanping; Dunham, Ian; Edkins, Sarah; Ellis, Peter; Evans, David; Faroogi, Sadaf; Fatemifar, Ghazaleh; Fitzpatrick, David R.; Flicek, Paul; Flyod, James; Foley, A. Reghan; Franklin, Christopher S.; Futema, Marta; Gallagher, Louise; Geihs, Matthias; Geschwind, Daniel; Griffin, Heather; Grozeva, Detelina; Guo, Xueqin; Guo, Xiaosen; Gurling, Hugh; Hart, Deborah; Hendricks, Audrey; Holmans, Peter; Howie, Bryan; Huang, Liren; Hubbard, Tim; Humphries, Steve E.; Hurles, Matthew E.; Hysi, Pirro; Jackson, David K.; Jamshidi, Yalda; Jing, Tian; Joyce, Chris; Kaye, Jane; Keane, Thomas; Keogh, Julia; Kemp, John; Kennedy, Karen; Kolb-Kokocinski, Anja; Lachance, Genevieve; Langford, Cordelia; Lawson, Daniel; Lee, Irene; Lek, Monkol; Liang, Jieqin; Lin, Hong; Li, Rui; Li, Yingrui; Liu, Ryan; Lönnqvist, Jouko; Lopes, Margarida; Lotchkova, Valentina; MacArthur, Daniel; Marchini, Jonathan; Maslen, John; Massimo, Mangino; Mathieson, Iain; Marenne, Gaëlle; McGuffin, Peter; McIntosh, Andrew; McKechanie, Andrew G.; McQuillin, Andrew; Metrustry, Sarah; Mitchison, Hannah; Moayyeri, Alireza; Morris, James; Muntoni, Francesco; Northstone, Kate; O'Donnovan, Michael; Onoufriadis, Alexandros; O'Rahilly, Stephen; Oualkacha, Karim; Owen, Michael J.; Palotie, Aarno; Panoutsopoulou, Kalliope; Parker, Victoria; Parr, Jeremy R.; Paternoster, Lavinia; Paunio, Tiina; Payne, Felicity; Pietilainen, Olli; Plagnol, Vincent; Quaye, Lydia; Quai, Michael A.; Raymond, Lucy; Rehnström, Karola; Richards, Brent; Ring, Susan; Ritchie, Graham R.S.; Roberts, Nicola; Savage, David B.; Scambler, Peter; Schiffels, Stephen; Schmidts, Miriam; Schoenmakers, Nadia; Semple, Robert K.; Serra, Eva; Sharp, Sally I.; Shin, So-Youn; Skuse, David; Small, Kerrin; Southam, Lorraine; Spasic-Boskovic, Olivera; Clair, David St; Stalker, Jim; Stevens, Elizabeth; Pourcian, Beate St; Sun, Jianping; Suvisaari, Jaana; Tachmazidou, Ionna; Tobin, Martin D.; Valdes, Ana; Kogelenberg, Margriet Van; Vijayarangakannan, Parthiban; Visscher, Peter M.; Wain, Louise V.; Walters, James T.R.; Wang, Guangbiao; Wang, Jun; Wang, Yu; Ward, Kirsten; Wheeler, Elanor; Whyte, Tamieka; Williams, Hywel; Williamson, Kathleen A.; Wilson, Crispian; Wong, Kim; Xu, ChangJiang; Yang, Jian; Zhang, Fend; Zhang, Pingbo

    2015-01-01

    Normal thyroid function is essential for health, but its genetic architecture remains poorly understood. Here, for the heritable thyroid traits thyrotropin (TSH) and free thyroxine (FT4), we analyse whole-genome sequence data from the UK10K project (N=2,287). Using additional whole-genome sequence and deeply imputed data sets, we report meta-analysis results for common variants (MAF≥1%) associated with TSH and FT4 (N=16,335). For TSH, we identify a novel variant in SYN2 (MAF=23.5%, P=6.15 × 10−9) and a new independent variant in PDE8B (MAF=10.4%, P=5.94 × 10−14). For FT4, we report a low-frequency variant near B4GALT6/SLC25A52 (MAF=3.2%, P=1.27 × 10−9) tagging a rare TTR variant (MAF=0.4%, P=2.14 × 10−11). All common variants explain ≥20% of the variance in TSH and FT4. Analysis of rare variants (MAF<1%) using sequence kernel association testing reveals a novel association with FT4 in NRG1. Our results demonstrate that increased coverage in whole-genome sequence association studies identifies novel variants associated with thyroid function. PMID:25743335

  19. Whole-genome sequence-based analysis of thyroid function.

    PubMed

    Taylor, Peter N; Porcu, Eleonora; Chew, Shelby; Campbell, Purdey J; Traglia, Michela; Brown, Suzanne J; Mullin, Benjamin H; Shihab, Hashem A; Min, Josine; Walter, Klaudia; Memari, Yasin; Huang, Jie; Barnes, Michael R; Beilby, John P; Charoen, Pimphen; Danecek, Petr; Dudbridge, Frank; Forgetta, Vincenzo; Greenwood, Celia; Grundberg, Elin; Johnson, Andrew D; Hui, Jennie; Lim, Ee M; McCarthy, Shane; Muddyman, Dawn; Panicker, Vijay; Perry, John R B; Bell, Jordana T; Yuan, Wei; Relton, Caroline; Gaunt, Tom; Schlessinger, David; Abecasis, Goncalo; Cucca, Francesco; Surdulescu, Gabriela L; Woltersdorf, Wolfram; Zeggini, Eleftheria; Zheng, Hou-Feng; Toniolo, Daniela; Dayan, Colin M; Naitza, Silvia; Walsh, John P; Spector, Tim; Davey Smith, George; Durbin, Richard; Richards, J Brent; Sanna, Serena; Soranzo, Nicole; Timpson, Nicholas J; Wilson, Scott G

    2015-03-06

    Normal thyroid function is essential for health, but its genetic architecture remains poorly understood. Here, for the heritable thyroid traits thyrotropin (TSH) and free thyroxine (FT4), we analyse whole-genome sequence data from the UK10K project (N=2,287). Using additional whole-genome sequence and deeply imputed data sets, we report meta-analysis results for common variants (MAF≥1%) associated with TSH and FT4 (N=16,335). For TSH, we identify a novel variant in SYN2 (MAF=23.5%, P=6.15 × 10(-9)) and a new independent variant in PDE8B (MAF=10.4%, P=5.94 × 10(-14)). For FT4, we report a low-frequency variant near B4GALT6/SLC25A52 (MAF=3.2%, P=1.27 × 10(-9)) tagging a rare TTR variant (MAF=0.4%, P=2.14 × 10(-11)). All common variants explain ≥20% of the variance in TSH and FT4. Analysis of rare variants (MAF<1%) using sequence kernel association testing reveals a novel association with FT4 in NRG1. Our results demonstrate that increased coverage in whole-genome sequence association studies identifies novel variants associated with thyroid function.

  20. Improvements to pairwise sequence comparison (PASC): a genome-based web tool for virus classification.

    PubMed

    Bao, Yiming; Chetvernin, Vyacheslav; Tatusova, Tatiana

    2014-12-01

    The number of viral genome sequences in the public databases is increasing dramatically, and these sequences are playing an important role in virus classification. Pairwise sequence comparison is a sequence-based virus classification method. A program using this method calculates the pairwise identities of virus sequences within a virus family and displays their distribution, and visual analysis helps to determine demarcations at different taxonomic levels such as strain, species, genus and subfamily. Subsequent comparison of new sequences against existing ones allows viruses from which the new sequences were derived to be classified. Although this method cannot be used as the only criterion for virus classification in some cases, it is a quantitative method and has many advantages over conventional virus classification methods. It has been applied to several virus families, and there is an increasing interest in using this method for other virus families/groups. The Pairwise Sequence Comparison (PASC) classification tool was created at the National Center for Biotechnology Information. The tool's database stores pairwise identities for complete genomes/segments of 56 virus families/groups. Data in the system are updated every day to reflect changes in virus taxonomy and additions of new virus sequences to the public database. The web interface of the tool ( http://www.ncbi.nlm.nih.gov/sutils/pasc/ ) makes it easy to navigate and perform analyses. Multiple new viral genome sequences can be tested simultaneously with this system to suggest the taxonomic position of virus isolates in a specific family. PASC eliminates potential discrepancies in the results caused by different algorithms and/or different data used by researchers.

  1. Rainbow: a tool for large-scale whole-genome sequencing data analysis using cloud computing

    PubMed Central

    2013-01-01

    Background Technical improvements have decreased sequencing costs and, as a result, the size and number of genomic datasets have increased rapidly. Because of the lower cost, large amounts of sequence data are now being produced by small to midsize research groups. Crossbow is a software tool that can detect single nucleotide polymorphisms (SNPs) in whole-genome sequencing (WGS) data from a single subject; however, Crossbow has a number of limitations when applied to multiple subjects from large-scale WGS projects. The data storage and CPU resources that are required for large-scale whole genome sequencing data analyses are too large for many core facilities and individual laboratories to provide. To help meet these challenges, we have developed Rainbow, a cloud-based software package that can assist in the automation of large-scale WGS data analyses. Results Here, we evaluated the performance of Rainbow by analyzing 44 different whole-genome-sequenced subjects. Rainbow has the capacity to process genomic data from more than 500 subjects in two weeks using cloud computing provided by the Amazon Web Service. The time includes the import and export of the data using Amazon Import/Export service. The average cost of processing a single sample in the cloud was less than 120 US dollars. Compared with Crossbow, the main improvements incorporated into Rainbow include the ability: (1) to handle BAM as well as FASTQ input files; (2) to split large sequence files for better load balance downstream; (3) to log the running metrics in data processing and monitoring multiple Amazon Elastic Compute Cloud (EC2) instances; and (4) to merge SOAPsnp outputs for multiple individuals into a single file to facilitate downstream genome-wide association studies. Conclusions Rainbow is a scalable, cost-effective, and open-source tool for large-scale WGS data analysis. For human WGS data sequenced by either the Illumina HiSeq 2000 or HiSeq 2500 platforms, Rainbow can be used straight out of

  2. Cajal body function in genome organization and transcriptome diversity.

    PubMed

    Sawyer, Iain A; Sturgill, David; Sung, Myong-Hee; Hager, Gordon L; Dundr, Miroslav

    2016-12-01

    Nuclear bodies contribute to non-random organization of the human genome and nuclear function. Using a major prototypical nuclear body, the Cajal body, as an example, we suggest that these structures assemble at specific gene loci located across the genome as a result of high transcriptional activity. Subsequently, target genes are physically clustered in close proximity in Cajal body-containing cells. However, Cajal bodies are observed in only a limited number of human cell types, including neuronal and cancer cells. Ultimately, Cajal body depletion perturbs splicing kinetics by reducing target small nuclear RNA (snRNA) transcription and limiting the levels of spliceosomal snRNPs, including their modification and turnover following each round of RNA splicing. As such, Cajal bodies are capable of shaping the chromatin interaction landscape and the transcriptome by influencing spliceosome kinetics. Future studies should concentrate on characterizing the direct influence of Cajal bodies upon snRNA gene transcriptional dynamics. Also see the video abstract here.

  3. Functional genomics in osteoarthritis: Past, present, and future

    PubMed Central

    Steinberg, Julia

    2016-01-01

    ABSTRACT Osteoarthritis (OA) is a common complex disease of high public health burden. OA is characterized by the degeneration of affected joints leading to pain and reduced mobility. Over the last few years, several studies have focused on the genomic changes underpinning OA. Here, we provide a comprehensive overview of genome‐wide, non‐hypothesis‐driven functional genomics (methylation, gene, and protein expression) studies of knee and hip OA in humans. Individual studies have generally been limited in sample size and hence power, and have differed in their approaches; nonetheless, some common themes have started to emerge, notably the role played by biological processes related to the extracellular matrix, immune response, the WNT pathway, angiogenesis, and skeletal development. Larger‐scale studies and streamlined, robust methodologies will be needed to further elucidate the biological etiology of OA going forward. © 2016 Orthopaedic Research Society. Published by Wiley Periodicals, Inc. J Orthop Res 34:1105–1110, 2016. PMID:27176659

  4. Genome-wide protein-protein interactions and protein function exploration in cyanobacteria.

    PubMed

    Lv, Qi; Ma, Weimin; Liu, Hui; Li, Jiang; Wang, Huan; Lu, Fang; Zhao, Chen; Shi, Tieliu

    2015-10-22

    Genome-wide network analysis is well implemented to study proteins of unknown function. Here, we effectively explored protein functions and the biological mechanism based on inferred high confident protein-protein interaction (PPI) network in cyanobacteria. We integrated data from seven different sources and predicted 1,997 PPIs, which were evaluated by experiments in molecular mechanism, text mining of literatures in proved direct/indirect evidences, and "interologs" in conservation. Combined the predicted PPIs with known PPIs, we obtained 4,715 no-redundant PPIs (involving 3,231 proteins covering over 90% of genome) to generate the PPI network. Based on the PPI network, terms in Gene ontology (GO) were assigned to function-unknown proteins. Functional modules were identified by dissecting the PPI network into sub-networks and analyzing pathway enrichment, with which we investigated novel function of underlying proteins in protein complexes and pathways. Examples of photosynthesis and DNA repair indicate that the network approach is a powerful tool in protein function analysis. Overall, this systems biology approach provides a new insight into posterior functional analysis of PPIs in cyanobacteria.

  5. Integrative pathway genomics of lung function and airflow obstruction.

    PubMed

    Gharib, Sina A; Loth, Daan W; Soler Artigas, María; Birkland, Timothy P; Wilk, Jemma B; Wain, Louise V; Brody, Jennifer A; Obeidat, Ma'en; Hancock, Dana B; Tang, Wenbo; Rawal, Rajesh; Boezen, H Marike; Imboden, Medea; Huffman, Jennifer E; Lahousse, Lies; Alves, Alexessander C; Manichaikul, Ani; Hui, Jennie; Morrison, Alanna C; Ramasamy, Adaikalavan; Smith, Albert Vernon; Gudnason, Vilmundur; Surakka, Ida; Vitart, Veronique; Evans, David M; Strachan, David P; Deary, Ian J; Hofman, Albert; Gläser, Sven; Wilson, James F; North, Kari E; Zhao, Jing Hua; Heckbert, Susan R; Jarvis, Deborah L; Probst-Hensch, Nicole; Schulz, Holger; Barr, R Graham; Jarvelin, Marjo-Riitta; O'Connor, George T; Kähönen, Mika; Cassano, Patricia A; Hysi, Pirro G; Dupuis, Josée; Hayward, Caroline; Psaty, Bruce M; Hall, Ian P; Parks, William C; Tobin, Martin D; London, Stephanie J

    2015-12-01

    Chronic respiratory disorders are important contributors to the global burden of disease. Genome-wide association studies (GWASs) of lung function measures have identified several trait-associated loci, but explain only a modest portion of the phenotypic variability. We postulated that integrating pathway-based methods with GWASs of pulmonary function and airflow obstruction would identify a broader repertoire of genes and processes influencing these traits. We performed two independent GWASs of lung function and applied gene set enrichment analysis to one of the studies and validated the results using the second GWAS. We identified 131 significantly enriched gene sets associated with lung function and clustered them into larger biological modules involved in diverse processes including development, immunity, cell signaling, proliferation and arachidonic acid. We found that enrichment of gene sets was not driven by GWAS-significant variants or loci, but instead by those with less stringent association P-values. Next, we applied pathway enrichment analysis to a meta-analyzed GWAS of airflow obstruction. We identified several biologic modules that functionally overlapped with those associated with pulmonary function. However, differences were also noted, including enrichment of extracellular matrix (ECM) processes specifically in the airflow obstruction study. Network analysis of the ECM module implicated a candidate gene, matrix metalloproteinase 10 (MMP10), as a putative disease target. We used a knockout mouse model to functionally validate MMP10's role in influencing lung's susceptibility to cigarette smoke-induced emphysema. By integrating pathway analysis with population-based genomics, we unraveled biologic processes underlying pulmonary function traits and identified a candidate gene for obstructive lung disease.

  6. Integrative pathway genomics of lung function and airflow obstruction

    PubMed Central

    Gharib, Sina A.; Loth, Daan W.; Soler Artigas, María; Birkland, Timothy P.; Wilk, Jemma B.; Wain, Louise V.; Brody, Jennifer A.; Obeidat, Ma'en; Hancock, Dana B.; Tang, Wenbo; Rawal, Rajesh; Boezen, H. Marike; Imboden, Medea; Huffman, Jennifer E.; Lahousse, Lies; Alves, Alexessander C.; Manichaikul, Ani; Hui, Jennie; Morrison, Alanna C.; Ramasamy, Adaikalavan; Smith, Albert Vernon; Gudnason, Vilmundur; Surakka, Ida; Vitart, Veronique; Evans, David M.; Strachan, David P.; Deary, Ian J.; Hofman, Albert; Gläser, Sven; Wilson, James F.; North, Kari E.; Zhao, Jing Hua; Heckbert, Susan R.; Jarvis, Deborah L.; Probst-Hensch, Nicole; Schulz, Holger; Barr, R. Graham; Jarvelin, Marjo-Riitta; O'Connor, George T.; Kähönen, Mika; Cassano, Patricia A.; Hysi, Pirro G.; Dupuis, Josée; Hayward, Caroline; Psaty, Bruce M.; Hall, Ian P.; Parks, William C.; Tobin, Martin D.; London, Stephanie J.

    2015-01-01

    Chronic respiratory disorders are important contributors to the global burden of disease. Genome-wide association studies (GWASs) of lung function measures have identified several trait-associated loci, but explain only a modest portion of the phenotypic variability. We postulated that integrating pathway-based methods with GWASs of pulmonary function and airflow obstruction would identify a broader repertoire of genes and processes influencing these traits. We performed two independent GWASs of lung function and applied gene set enrichment analysis to one of the studies and validated the results using the second GWAS. We identified 131 significantly enriched gene sets associated with lung function and clustered them into larger biological modules involved in diverse processes including development, immunity, cell signaling, proliferation and arachidonic acid. We found that enrichment of gene sets was not driven by GWAS-significant variants or loci, but instead by those with less stringent association P-values. Next, we applied pathway enrichment analysis to a meta-analyzed GWAS of airflow obstruction. We identified several biologic modules that functionally overlapped with those associated with pulmonary function. However, differences were also noted, including enrichment of extracellular matrix (ECM) processes specifically in the airflow obstruction study. Network analysis of the ECM module implicated a candidate gene, matrix metalloproteinase 10 (MMP10), as a putative disease target. We used a knockout mouse model to functionally validate MMP10's role in influencing lung's susceptibility to cigarette smoke-induced emphysema. By integrating pathway analysis with population-based genomics, we unraveled biologic processes underlying pulmonary function traits and identified a candidate gene for obstructive lung disease. PMID:26395457

  7. RNA Interference for Functional Genomics and Improvement of Cotton (Gossypium sp.)

    PubMed Central

    Abdurakhmonov, Ibrokhim Y.; Ayubov, Mirzakamol S.; Ubaydullaeva, Khurshida A.; Buriev, Zabardast T.; Shermatov, Shukhrat E.; Ruziboev, Haydarali S.; Shapulatov, Umid M.; Saha, Sukumar; Ulloa, Mauricio; Yu, John Z.; Percy, Richard G.; Devor, Eric J.; Sharma, Govind C.; Sripathi, Venkateswara R.; Kumpatla, Siva P.; van der Krol, Alexander; Kater, Hake D.; Khamidov, Khakimdjan; Salikhov, Shavkat I.; Jenkins, Johnie N.; Abdukarimov, Abdusattor; Pepper, Alan E.

    2016-01-01

    RNA interference (RNAi), is a powerful new technology in the discovery of genetic sequence functions, and has become a valuable tool for functional genomics of cotton (Gossypium sp.). The rapid adoption of RNAi has replaced previous antisense technology. RNAi has aided in the discovery of function and biological roles of many key cotton genes involved in fiber development, fertility and somatic embryogenesis, resistance to important biotic and abiotic stresses, and oil and seed quality improvements as well as the key agronomic traits including yield and maturity. Here, we have comparatively reviewed seminal research efforts in previously used antisense approaches and currently applied breakthrough RNAi studies in cotton, analyzing developed RNAi methodologies, achievements, limitations, and future needs in functional characterizations of cotton genes. We also highlighted needed efforts in the development of RNAi-based cotton cultivars, and their safety and risk assessment, small and large-scale field trials, and commercialization. PMID:26941765

  8. Optical Whole-Genome Restriction Mapping as a Tool for Rapidly Distinguishing and Identifying Bacterial Contaminants in Clinical Samples

    PubMed Central

    Baldwin, James C.

    2015-01-01

    Introduction Optical restriction genome mapping is a technology in which a genome is linearized on a surface and digested with specific restriction enzymes, giving an arrangement of the genome with gaps whose order and size are unique for a given organism. Current applications of this technology include assisting with the correct scaffolding and ordering of genomes in conjunction with whole-genome sequencing, observation of genetic drift and evolution using comparative genomics and epidemiological monitoring of the spread of infections. Here, we investigated the suitability of genome mapping for use in clinical labs as a potential diagnostic tool. Materials and Methods Using whole genome mapping, we investigated the basic performance of the technology for identifying two bacteria of interest for food-safety (Lactobacilli spp. and Enterohemorrhagic Escherichia coli). We further evaluated the performance for identifying multiple organisms from both simple and complex mixtures. Results We were able to successfully generate optical restriction maps of four Lactobacillus species as well as a strain of Enterohemorrhagic Escherichia coli from within a mixed solution, each distinguished using a common compatible restriction enzyme. Finally, we demonstrated that optical restriction maps were successfully obtained and the correct organism identified within a clinical matrix. Conclusion With additional development, whole genome mapping may be a useful clinical tool for rapid invitro diagnostics. PMID:26435946

  9. Physiological and genomic basis of mechanical-functional trade-off in plant vasculature

    PubMed Central

    Sengupta, Sonali; Majumder, Arun Lahiri

    2014-01-01

    Some areas in plant abiotic stress research are not frequently addressed by genomic and molecular tools. One such area is the cross reaction of gravitational force with upward capillary pull of water and the mechanical-functional trade-off in plant vasculature. Although frost, drought and flooding stress greatly impact these physiological processes and consequently plant performance, the genomic and molecular basis of such trade-off is only sporadically addressed and so is its adaptive value. Embolism resistance is an important multiple stress- opposition trait and do offer scopes for critical insight to unravel and modify the input of living cells in the process and their biotechnological intervention may be of great importance. Vascular plants employ different physiological strategies to cope with embolism and variation is observed across the kingdom. The genomic resources in this area have started to emerge and open up possibilities of synthesis, validation and utilization of the new knowledge-base. This review article assesses the research till date on this issue and discusses new possibilities for bridging physiology and genomics of a plant, and foresees its implementation in crop science. PMID:24904619

  10. Automated genomic context analysis and experimental validation platform for discovery of prokaryote transcriptional regulator functions

    DOE PAGES

    Martí-Arbona, Ricardo; Mu, Fangping; Nowak-Lovato, Kristy L.; ...

    2014-12-18

    The clustering of genes in a pathway and the co-location of functionally related genes is widely recognized in prokaryotes. We used these characteristics to predict the metabolic involvement for a Transcriptional Regulator (TR) of unknown function, identified and confirmed its biological activity. A software tool that identifies the genes encoded within a defined genomic neighborhood for the subject TR and its homologs was developed. The output lists of genes in the genetic neighborhoods, their annotated functions, the reactants/products, and identifies the metabolic pathway in which the encoded-proteins function. When a set of TRs of known function was analyzed, we observedmore » that their homologs frequently had conserved genomic neighborhoods that co-located the metabolically related genes regulated by the subject TR. We postulate that TR effectors are metabolites in the identified pathways; indeed the known effectors were present. We analyzed Bxe_B3018 from Burkholderia xenovorans, a TR of unknown function and predicted that this TR was related to the glycine, threonine and serine degradation. We tested the binding of metabolites in these pathways and for those that bound, their ability to modulate TR binding to its specific DNA operator sequence. Using rtPCR, we confirmed that methylglyoxal was an effector of Bxe_3018.« less

  11. Automated genomic context analysis and experimental validation platform for discovery of prokaryote transcriptional regulator functions

    SciTech Connect

    Martí-Arbona, Ricardo; Mu, Fangping; Nowak-Lovato, Kristy L.; Wren, Melinda S.; Unkefer, Clifford J.; Unkefer, Pat J.

    2014-12-18

    The clustering of genes in a pathway and the co-location of functionally related genes is widely recognized in prokaryotes. We used these characteristics to predict the metabolic involvement for a Transcriptional Regulator (TR) of unknown function, identified and confirmed its biological activity. A software tool that identifies the genes encoded within a defined genomic neighborhood for the subject TR and its homologs was developed. The output lists of genes in the genetic neighborhoods, their annotated functions, the reactants/products, and identifies the metabolic pathway in which the encoded-proteins function. When a set of TRs of known function was analyzed, we observed that their homologs frequently had conserved genomic neighborhoods that co-located the metabolically related genes regulated by the subject TR. We postulate that TR effectors are metabolites in the identified pathways; indeed the known effectors were present. We analyzed Bxe_B3018 from Burkholderia xenovorans, a TR of unknown function and predicted that this TR was related to the glycine, threonine and serine degradation. We tested the binding of metabolites in these pathways and for those that bound, their ability to modulate TR binding to its specific DNA operator sequence. Using rtPCR, we confirmed that methylglyoxal was an effector of Bxe_3018.

  12. GDF: A tool for function estimation through grammatical evolution

    NASA Astrophysics Data System (ADS)

    Tsoulos, Ioannis G.; Gavrilis, Dimitris; Dermatas, Evangelos

    2006-04-01

    This article introduces a tool for data fitting that is based on genetic programming and especially on the grammatical evolution technique. The user needs to input a series of points and the accompanied dimensionality n and the tool will produce via the genetic programming paradigm a function f:R→R which is an approximate solution to the symbolic regression problem. The tool is entirely written in ANSI C++ and it can be installed in any UNIX system. Program summaryTitle of program: GDF Catalogue identifier:ADXC Program obtainable from: CPC Program Library, Queen's University of Belfast, N. Ireland Program summary URL:http://cpc.cs.qub.ac.uk/summaries/ADXC Computer for which the program is designed and others on which is has been tested:The tool is designed to be portable in all systems running the GNU C++ compiler Installation: University of Ioannina and University of Patras, Greece Programming language used:GNU-C++ Memory required to execute with typical data:200 KB No. of bits in a word: 32 No. of processors used: 1 Has the code been vectorized or parallelized?: No No. of bytes in distributed program, including test data, etc.: 33 469 No. of lines in distributed program, including test data, etc.: 5704 Distribution format: tar.gz Solution method: Functional forms are being created by genetic programming which are approximations for the symbolic regression problem.

  13. Clinical assessment tools identify functional deficits in fragility fracture patients

    PubMed Central

    Ames, Tyler D; Wee, Corinne E; Le, Khoi M; Wang, Tiffany L; Bishop, Julie Y; Phieffer, Laura S; Quatman, Carmen E

    2016-01-01

    Purpose To identify inexpensive, noninvasive, portable, clinical assessment tools that can be used to assess functional performance measures that may put older patients at risk for falls such as balance, handgrip strength, and lumbopelvic control. Patients and methods Twenty fragility fracture patients and 21 healthy control subjects were evaluated using clinical assessment tools (Nintendo Wii Balance Board [WBB], a handheld dynamometer, and an application for the Apple iPod Touch, the Level Belt) that measure functional performance during activity of daily living tasks. The main outcome measurements were balance (WBB), handgrip strength (handheld dynamometer), and lumbopelvic control (iPod Touch Level Belt), which were compared between fragility fracture patients and healthy controls. Results Fragility fracture patients had lower scores on the vertical component of the WBB Torso Twist task (P=0.042) and greater medial–lateral lumbopelvic sway during a 40 m walk (P=0.026) when compared to healthy controls. Unexpectedly, the fracture patients had significantly higher scores on the left leg (P=0.020) and total components (P=0.010) of the WBB Single Leg Stand task as well as less faults during the left Single Leg Stand task (P=0.003). Conclusion The clinical assessment tools utilized in this study are relatively inexpensive and portable tools of performance measures capable of detecting differences in postural sway between fragility fracture patients and controls. PMID:27217738

  14. MADAP, a flexible clustering tool for the interpretation of one-dimensional genome annotation data.

    PubMed

    Schmid, Christoph D; Sengstag, Thierry; Bucher, Philipp; Delorenzi, Mauro

    2007-07-01

    A recurring task in the analysis of mass genome annotation data from high-throughput technologies is the identification of peaks or clusters in a noisy signal profile. Examples of such applications are the definition of promoters on the basis of transcription start site profiles, the mapping of transcription factor binding sites based on ChIP-chip data and the identification of quantitative trait loci (QTL) from whole genome SNP profiles. Input to such an analysis is a set of genome coordinates associated with counts or intensities. The output consists of a discrete number of peaks with respective volumes, extensions and center positions. We have developed for this purpose a flexible one-dimensional clustering tool, called MADAP, which we make available as a web server and as standalone program. A set of parameters enables the user to customize the procedure to a specific problem. The web server, which returns results in textual and graphical form, is useful for small to medium-scale applications, as well as for evaluation and parameter tuning in view of large-scale applications, requiring a local installation. The program written in C++ can be freely downloaded from ftp://ftp.epd.unil.ch/pub/software/unix/madap. The MADAP web server can be accessed at http://www.isrec.isb-sib.ch/madap/.

  15. Use of genome-editing tools to treat sickle cell disease.

    PubMed

    Tasan, Ipek; Jain, Surbhi; Zhao, Huimin

    2016-09-01

    Recent advances in genome-editing techniques have made it possible to modify any desired DNA sequence by employing programmable nucleases. These next-generation genome-modifying tools are the ideal candidates for therapeutic applications, especially for the treatment of genetic disorders like sickle cell disease (SCD). SCD is an inheritable monogenic disorder which is caused by a point mutation in the β-globin gene. Substantial success has been achieved in the development of supportive therapeutic strategies for SCD, but unfortunately there is still a lack of long-term universal cure. The only existing curative treatment is based on allogeneic stem cell transplantation from healthy donors; however, this treatment is applicable to a limited number of patients only. Hence, a universally applicable therapy is highly desirable. In this review, we will discuss the three programmable nucleases that are commonly used for genome-editing purposes: zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs) and clustered regularly interspaced short palindromic repeats/CRISPR-associated protein 9 (CRISPR/Cas9). We will continue by exemplifying uses of these methods to correct the sickle cell mutation. Additionally, we will present induction of fetal globin expression as an alternative approach to cure sickle cell disease. We will conclude by comparing the three methods and explaining the concerns about their use in therapy.

  16. Different whole-genome amplification methods as a preamplification tool in Y-chromosome Loci analysis.

    PubMed

    Maciejewska, Agnieszka; Jakubowska, Joanna; Pawłowski, Ryszard

    2014-06-01

    Degraded and low template DNA is often analyzed in forensic genetics laboratories. Reliable analysis of degraded and low template DNA is of great importance, because its results impact the quality and reliability of expert testimonies. Recently, a number of whole-genome amplification (WGA) methods have been proposed as preamplification tools improving quantity and quality of DNA. We chose, investigated, and compared 7 WGA methods to evaluate their ability to "recover" degraded and nondegraded DNA. These methods include degenerate oligonucleotide primed polymerase chain reaction, primer extension preamplification (PEP) polymerase chain reaction, GenomePlex WGA (Sigma), multiple displacement amplification, GenomiPhi Amplification Kit (Amersham Biosciences), restriction and circularization aided rolling circle amplification, and blunt-end ligation-mediated WGA. Recently, we have described the comparison of these methods' efficiency and reliability using SGMPlus kit. However, Y-chromosome profiling is also often used in analysis of both nondegraded and degraded DNA. This includes criminal cases and investigation of kinship in male linage. Here we demonstrate the impact of WGA methods on Y-chromosome loci (Yfiler) reactivation.The best results for nondegraded DNA were obtained with GenomiPhi kit and PEP method. In the case of degraded DNA (200 base pairs), the most complete profiles were obtained with GenomePlex kit and PEP method. None of the analyzed methods allowed full reactivation of degraded (200 base pairs) DNA in terms of Y-chromosome loci profiling.

  17. Applying Genomic and Genetic Tools to Understand and Mitigate Damage from Exposure to Toxins

    DTIC Science & Technology

    2011-10-01

    donepezil (brand name Aricept) are all water-soluble AChEIs that are currently approved by the FDA for the treatment of Alzheimer’s disease and have...Huang da, B. T. Sherman, R. A. Lempicki, Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 4...large gene lists . Nucleic Acids Res 37, 1 (Jan, 2009). 13. T. Whistler et al., Impaired immune function in Gulf War Illness. BMC Med Genomics 2, 12

  18. Enhancer function: mechanistic and genome-wide insights come together.

    PubMed

    Plank, Jennifer L; Dean, Ann

    2014-07-03

    Enhancers establish spatial or temporal patterns of gene expression that are critical for development, yet our understanding of how these DNA cis-regulatory elements function from a distance to increase transcription of their target genes and shape the cellular transcriptome has been gleaned primarily from studies of individual genes or gene families. High-throughput sequencing studies place enhancer-gene interactions within the 3D context of chromosome folding, inviting a new look at enhancer function and stimulating provocative new questions. Here, we integrate these whole-genome studies with recent mechanistic studies to illuminate how enhancers physically interact with target genes, how enhancer activity is regulated during development, and the role of noncoding RNAs transcribed from enhancers in their function.

  19. PentaPlot: A software tool for the illustration of genome mosaicism

    PubMed Central

    Hamel, Lutz; Zhaxybayeva, Olga; Gogarten, J Peter

    2005-01-01

    Background Dekapentagonal maps depict the phylogenetic relationships of five genomes in a visually appealing diagram and can be viewed as an alternative to a single evolutionary consensus tree. In particular, the generated maps focus attention on those gene families that significantly deviate from the consensus or plurality phylogeny. PentaPlot is a software tool that computes such dekapentagonal maps given an appropriate probability support matrix. Results The visualization with dekapentagonal maps critically depends on the optimal layout of unrooted tree topologies representing different evolutionary relationships among five organisms along the vertices of the dekapentagon. This is a difficult optimization problem given the large number of possible layouts. At its core our tool utilizes a genetic algorithm with demes and a local search strategy to search for the optimal layout. The hybrid genetic algorithm performs satisfactorily even in those cases where the chosen genomes are so divergent that little phylogenetic information has survived in the individual gene families. Conclusion PentaPlot is being made publicly available as an open source project at . PMID:15938752

  20. Current genome editing tools in gene therapy: new approaches to treat cancer.

    PubMed

    Shuvalov, Oleg; Petukhov, Alexey; Daks, Alexandra; Fedorova, Olga; Ermakov, Alexander; Melino, Gerry; Barlev, Nickolai A

    2015-01-01

    Gene therapy suggests a promising approach to treat genetic diseases by applying genes as pharmaceuticals. Cancer is a complex disease, which strongly depends on a particular genetic make-up and hence can be treated with gene therapy. From about 2,000 clinical trials carried out so far, more than 60% were cancer targeted. Development of precise and effective gene therapy approaches is intimately connected with achievements in the molecular biology techniques. The field of gene therapy was recently revolutionized by the introduction of "programmable" nucleases, including ZFNs, TALENs, and CRISPR, which target specific genomic loci with high efficacy and precision. Furthermore, when combined with DNA transposons for the delivery purposes into cells, these programmable nucleases represent a promising alternative to the conventional viral-mediated gene delivery. In addition to "programmable" nucleases, a new class of TALE- and CRISPR-based "artificial transcription effectors" has been developed to mediate precise regulation of specific genes. In sum, these new molecular tools may be used in a wide plethora of gene therapy strategies. This review highlights the current status of novel genome editing tools and discusses their suitability and perspectives in respect to cancer gene therapy studies.

  1. DECIDE: a Decision Support Tool to Facilitate Parents' Choices Regarding Genome-Wide Sequencing.

    PubMed

    Birch, Patricia; Adam, S; Bansback, N; Coe, R R; Hicklin, J; Lehman, A; Li, K C; Friedman, J M

    2016-12-01

    We describe the rationale, development, and usability testing for an integrated e-learning tool and decision aid for parents facing decisions about genome-wide sequencing (GWS) for their children with a suspected genetic condition. The online tool, DECIDE, is designed to provide decision-support and to promote high quality decisions about undergoing GWS with or without return of optional incidental finding results. DECIDE works by integrating educational material with decision aids. Users may tailor their learning by controlling both the amount of information and its format - text and diagrams and/or short videos. The decision aid guides users to weigh the importance of various relevant factors in their own lives and circumstances. After considering the pros and cons of GWS and return of incidental findings, DECIDE summarizes the user's responses and apparent preferred choices. In a usability study of 16 parents who had already chosen GWS after conventional genetic counselling, all participants found DECIDE to be helpful. Many would have been satisfied to use it alone to guide their GWS decisions, but most would prefer to have the option of consulting a health care professional as well to aid their decision. Further testing is necessary to establish the effectiveness of using DECIDE as an adjunct to or instead of conventional pre-test genetic counselling for clinical genome-wide sequencing.

  2. Molecular tools for investigating ANME community structure and function

    SciTech Connect

    Hallam, Steven J.; Page, Antoine P.; Constan, Lea; Song, Young C.; Norbeck, Angela D.; Brewer, Heather M.; Pasa-Tolic, Ljiljana

    2011-05-20

    Methane production and consumption in anaerobic marine sediments 1 is catalyzed by a series of reversible tetramethanopterin (H4MPT)-linked C1 transfer reactions. Although many of these reactions are conserved between one-carbon compound utilizing microorganisms, two remain diagnostic for archaeal methane metabolism. These include reactions catalyzed by N5-methyltetrahydromethanopterin: coenzyme M methyltransferase and methyl coenzyme M reductase. The latter enzyme is central to C-H bond formation and cleavage underlying methanogenic and reverse methanogenic phenotypes. Here we describe a set of novel tools for the detection and functional analysis of H4MPT-linked C1 transfer reactions mediated by uncultivated anaerobic methane oxidizing archaea (ANME). These tools include polymerase chain reaction primers targeting ANME methyl coenzyme M reductase subunit A subgroups and protein extraction methods from marine sediments compatible with high-resolution mass spectrometry for profiling population structure and functional dynamics. [910, 1,043

  3. The Xenopus ORFeome: A resource that enables functional genomics

    PubMed Central

    Grant, Ian M.; Balcha, Dawit; Hao, Tong; Shen, Yun; Trivedi, Prasad; Patrushev, Ilya; Fortriede, Joshua D.; Karpinka, John B.; Liu, Limin; Zorn, Aaron M.; Stukenberg, P. Todd; Hill, David E.; Gilchrist, Michael J.

    2015-01-01

    Functional characterisation of proteins and large-scale, systems-level studies are enabled by extensive sets of cloned open reading frames (ORFs) in an easily-accessible format that enables many different applications. Here we report the release of the first stage of the Xenopus ORFeome, which contains 8673 ORFs from the Xenopus Gene Collection (XGC) for Xenopus laevis, cloned into a Gateway® donor vector enabling rapid in-frame transfer of the ORFs to expression vectors. This resource represents an estimated 7871 unique genes, approximately 40% of the non-redundant X. laevis gene complement, and includes 2724 genes where the human ortholog has an association with disease. Transfer into the Gateway system was validated by 5′ and 3′ end sequencing of the entire collection and protein expression of a set of test clones. In a parallel process, the underlying ORF predictions from the original XGC collection were re-analysed to verify quality and full-length status, identifying those proteins likely to exhibit truncations when translated. These data are integrated into Xenbase, the Xenopus community database, which associates genomic, expression, function and human disease model metadata to each ORF, enabling end-users to search for ORFeome clones with links to commercial distributors of the collection. When coupled with the experimental advantages of Xenopus eggs and embryos, the ORFeome collection represents a valuable resource for functional genomics and disease modelling. PMID:26391338

  4. Structure and Functional Studies of DEN-2 Virus Genome.

    DTIC Science & Technology

    1982-09-01

    Structure and Functional Studies on Dengue -2 Progress Report Virus Genome 1 Mar 82 - I Sep 82 6. PERFORMING ORO. REPORT NUMBER 7. AUTHOR(e) 8. CONTRACT OR...Identify by block number) Complementdry DNA synthesis of Dengue -2 RNA by avian reverse transcriptase in vitro. The size of the DNA copy of Dengue RNA is in...Unannounced 0 Justification ............ By........... Di.A b-Aio: i Availability Codes S Avail and (or 2 Abstract 1. Dengue -2 RNA (DEN-2 RNA) was extracted

  5. Empowered genome community: leveraging a bioinformatics platform as a citizen-scientist collaboration tool.

    PubMed

    Wendelsdorf, Katherine; Shah, Sohela

    2015-09-01

    There is on-going effort in the biomedical research community to leverage Next Generation Sequencing (NGS) technology to identify genetic variants that affect our health. The main challenge facing researchers is getting enough samples from individuals either sick or healthy - to be able to reliably identify the few variants that are causal for a phenotype among all other variants typically seen among individuals. At the same time, more and more individuals are having their genome sequenced either out of curiosity or to identify the cause of an illness. These individuals may benefit from of a way to view and understand their data. QIAGEN's Ingenuity Variant Analysis is an online application that allows users with and without extensive bioinformatics training to incorporate information from published experiments, genetic databases, and a variety of statistical models to identify variants, from a long list of candidates, that are most likely causal for a phenotype as well as annotate variants with what is already known about them in the literature and databases. Ingenuity Variant Analysis is also an information sharing platform where users may exchange samples and analyses. The Empowered Genome Community (EGC) is a new program in which QIAGEN is making this on-line tool freely available to any individual who wishes to analyze their own genetic sequence. EGC members are then able to make their data available to other Ingenuity Variant Analysis users to be used in research. Here we present and describe the Empowered Genome Community in detail. We also present a preliminary, proof-of-concept study that utilizes the 200 genomes currently available through the EGC. The goal of this program is to allow individuals to access and understand their own data as well as facilitate citizen-scientist collaborations that can drive research forward and spur quality scientific dialogue in the general public.

  6. SitesIdentify: a protein functional site prediction tool

    PubMed Central

    2009-01-01

    Background The rate of protein structures being deposited in the Protein Data Bank surpasses the capacity to experimentally characterise them and therefore computational methods to analyse these structures have become increasingly important. Identifying the region of the protein most likely to be involved in function is useful in order to gain information about its potential role. There are many available approaches to predict functional site, but many are not made available via a publicly-accessible application. Results Here we present a functional site prediction tool (SitesIdentify), based on combining sequence conservation information with geometry-based cleft identification, that is freely available via a web-server. We have shown that SitesIdentify compares favourably to other functional site prediction tools in a comparison of seven methods on a non-redundant set of 237 enzymes with annotated active sites. Conclusion SitesIdentify is able to produce comparable accuracy in predicting functional sites to its closest available counterpart, but in addition achieves improved accuracy for proteins with few characterised homologues. SitesIdentify is available via a webserver at http://www.manchester.ac.uk/bioinformatics/sitesidentify/ PMID:19922660

  7. Wheat EST resources for functional genomics of abiotic stress

    PubMed Central

    Houde, Mario; Belcaid, Mahdi; Ouellet, François; Danyluk, Jean; Monroy, Antonio F; Dryanova, Ani; Gulick, Patrick; Bergeron, Anne; Laroche, André; Links, Matthew G; MacCarthy, Luke; Crosby, William L; Sarhan, Fathey

    2006-01-01

    Background Wheat is an excellent species to study freezing tolerance and other abiotic stresses. However, the sequence of the wheat genome has not been completely characterized due to its complexity and large size. To circumvent this obstacle and identify genes involved in cold acclimation and associated stresses, a large scale EST sequencing approach was undertaken by the Functional Genomics of Abiotic Stress (FGAS) project. Results We generated 73,521 quality-filtered ESTs from eleven cDNA libraries constructed from wheat plants exposed to various abiotic stresses and at different developmental stages. In addition, 196,041 ESTs for which tracefiles were available from the National Science Foundation wheat EST sequencing program and DuPont were also quality-filtered and used in the analysis. Clustering of the combined ESTs with d2_cluster and TGICL yielded a few large clusters containing several thousand ESTs that were refractory to routine clustering techniques. To resolve this problem, the sequence proximity and "bridges" were identified by an e-value distance graph to manually break clusters into smaller groups. Assembly of the resolved ESTs generated a 75,488 unique sequence set (31,580 contigs and 43,908 singletons/singlets). Digital expression analyses indicated that the FGAS dataset is enriched in stress-regulated genes compared to the other public datasets. Over 43% of the unique sequence set was annotated and classified into functional categories according to Gene Ontology. Conclusion We have annotated 29,556 different sequences, an almost 5-fold increase in annotated sequences compared to the available wheat public databases. Digital expression analysis combined with gene annotation helped in the identification of several pathways associated with abiotic stress. The genomic resources and knowledge developed by this project will contribute to a better understanding of the different mechanisms that govern stress tolerance in wheat and other cereals. PMID

  8. OntoMate: a text-mining tool aiding curation at the Rat Genome Database

    PubMed Central

    Liu, Weisong; Laulederkind, Stanley J. F.; Hayman, G. Thomas; Wang, Shur-Jen; Nigam, Rajni; Smith, Jennifer R.; De Pons, Jeff; Dwinell, Melinda R.; Shimoyama, Mary

    2015-01-01

    The Rat Genome Database (RGD) is the premier repository of rat genomic, genetic and physiologic data. Converting data from free text in the scientific literature to a structured format is one of the main tasks of all model organism databases. RGD spends considerable effort manually curating gene, Quantitative Trait Locus (QTL) and strain information. The rapidly growing volume of biomedical literature and the active research in the biological natural language processing (bioNLP) community have given RGD the impetus to adopt text-mining tools to improve curation efficiency. Recently, RGD has initiated a project to use OntoMate, an ontology-driven, concept-based literature search engine developed at RGD, as a replacement for the PubMed (http://www.ncbi.nlm.nih.gov/pubmed) search engine in the gene curation workflow. OntoMate tags abstracts with gene names, gene mutations, organism name and most of the 16 ontologies/vocabularies used at RGD. All terms/ entities tagged to an abstract are listed with the abstract in the search results. All listed terms are linked both to data entry boxes and a term browser in the curation tool. OntoMate also provides user-activated filters for species, date and other parameters relevant to the literature search. Using the system for literature search and import has streamlined the process compared to using PubMed. The system was built with a scalable and open architecture, including features specifically designed to accelerate the RGD gene curation process. With the use of bioNLP tools, RGD has added more automation to its curation workflow. Database URL: http://rgd.mcw.edu PMID:25619558

  9. OntoMate: a text-mining tool aiding curation at the Rat Genome Database.

    PubMed

    Liu, Weisong; Laulederkind, Stanley J F; Hayman, G Thomas; Wang, Shur-Jen; Nigam, Rajni; Smith, Jennifer R; De Pons, Jeff; Dwinell, Melinda R; Shimoyama, Mary

    2015-01-01

    The Rat Genome Database (RGD) is the premier repository of rat genomic, genetic and physiologic data. Converting data from free text in the scientific literature to a structured format is one of the main tasks of all model organism databases. RGD spends considerable effort manually curating gene, Quantitative Trait Locus (QTL) and strain information. The rapidly growing volume of biomedical literature and the active research in the biological natural language processing (bioNLP) community have given RGD the impetus to adopt text-mining tools to improve curation efficiency. Recently, RGD has initiated a project to use OntoMate, an ontology-driven, concept-based literature search engine developed at RGD, as a replacement for the PubMed (http://www.ncbi.nlm.nih.gov/pubmed) search engine in the gene curation workflow. OntoMate tags abstracts with gene names, gene mutations, organism name and most of the 16 ontologies/vocabularies used at RGD. All terms/ entities tagged to an abstract are listed with the abstract in the search results. All listed terms are linked both to data entry boxes and a term browser in the curation tool. OntoMate also provides user-activated filters for species, date and other parameters relevant to the literature search. Using the system for literature search and import has streamlined the process compared to using PubMed. The system was built with a scalable and open architecture, including features specifically designed to accelerate the RGD gene curation process. With the use of bioNLP tools, RGD has added more automation to its curation workflow. Database URL: http://rgd.mcw.edu.

  10. The FUN of identifying gene function in bacterial pathogens; insights from Salmonella functional genomics.

    PubMed

    Hammarlöf, Disa L; Canals, Rocío; Hinton, Jay C D

    2013-10-01

    The availability of thousands of genome sequences of bacterial pathogens poses a particular challenge because each genome contains hundreds of genes of unknown function (FUN). How can we easily discover which FUN genes encode important virulence factors? One solution is to combine two different functional genomic approaches. First, transcriptomics identifies bacterial FUN genes that show differential expression during the process of mammalian infection. Second, global mutagenesis identifies individual FUN genes that the pathogen requires to cause disease. The intersection of these datasets can reveal a small set of candidate genes most likely to encode novel virulence attributes. We demonstrate this approach with the Salmonella infection model, and propose that a similar strategy could be used for other bacterial pathogens.

  11. Cubozoan genome illuminates functional diversification of opsins and photoreceptor evolution.

    PubMed

    Liegertová, Michaela; Pergner, Jiří; Kozmiková, Iryna; Fabian, Peter; Pombinho, Antonio R; Strnad, Hynek; Pačes, Jan; Vlček, Čestmír; Bartůněk, Petr; Kozmik, Zbyněk

    2015-07-08

    Animals sense light primarily by an opsin-based photopigment present in a photoreceptor cell. Cnidaria are arguably the most basal phylum containing a well-developed visual system. The evolutionary history of opsins in the animal kingdom has not yet been resolved. Here, we study the evolution of animal opsins by genome-wide analysis of the cubozoan jellyfish Tripedalia cystophora, a cnidarian possessing complex lens-containing eyes and minor photoreceptors. A large number of opsin genes with distinct tissue- and stage-specific expression were identified. Our phylogenetic analysis unequivocally classifies cubozoan opsins as a sister group to c-opsins and documents lineage-specific expansion of the opsin gene repertoire in the cubozoan genome. Functional analyses provided evidence for the use of the Gs-cAMP signaling pathway in a small set of cubozoan opsins, indicating the possibility that the majority of other cubozoan opsins signal via distinct pathways. Additionally, these tests uncovered subtle differences among individual opsins, suggesting possible fine-tuning for specific photoreceptor tasks. Based on phylogenetic, expression and biochemical analysis we propose that rapid lineage- and species-specific duplications of the intron-less opsin genes and their subsequent functional diversification promoted evolution of a large repertoire of both visual and extraocular photoreceptors in cubozoans.

  12. Phylogenetic Portrait of the Saccharomyces cerevisiae Functional Genome

    PubMed Central

    Gibney, Patrick A.; Hickman, Mark J.; Bradley, Patrick H.; Matese, John C.; Botstein, David

    2013-01-01

    The genome of budding yeast (Saccharomyces cerevisiae) contains approximately 5800 protein-encoding genes, the majority of which are associated with some known biological function. Yet the extent of amino acid sequence conservation of these genes over all phyla has only been partially examined. Here we provide a more comprehensive overview and visualization of the conservation of yeast genes and a means for browsing and exploring the data in detail, down to the individual yeast gene, at http://yeast-phylogroups.princeton.edu. We used data from the OrthoMCL database, which has defined orthologs from approximately 150 completely sequenced genomes, including diverse representatives of the archeal, bacterial, and eukaryotic domains. By clustering genes based on similar patterns of conservation, we organized and visualized all the protein-encoding genes in yeast as a single heat map. Most genes fall into one of eight major clusters, called “phylogroups.” Gene ontology analysis of the phylogroups revealed that they were associated with specific, distinct trends in gene function, generalizations likely to be of interest to a wide range of biologists. PMID:23749449

  13. Cubozoan genome illuminates functional diversification of opsins and photoreceptor evolution

    PubMed Central

    Liegertová, Michaela; Pergner, Jiří; Kozmiková, Iryna; Fabian, Peter; Pombinho, Antonio R.; Strnad, Hynek; Pačes, Jan; Vlček, Čestmír; Bartůněk, Petr; Kozmik, Zbyněk

    2015-01-01

    Animals sense light primarily by an opsin-based photopigment present in a photoreceptor cell. Cnidaria are arguably the most basal phylum containing a well-developed visual system. The evolutionary history of opsins in the animal kingdom has not yet been resolved. Here, we study the evolution of animal opsins by genome-wide analysis of the cubozoan jellyfish Tripedalia cystophora, a cnidarian possessing complex lens-containing eyes and minor photoreceptors. A large number of opsin genes with distinct tissue- and stage-specific expression were identified. Our phylogenetic analysis unequivocally classifies cubozoan opsins as a sister group to c-opsins and documents lineage-specific expansion of the opsin gene repertoire in the cubozoan genome. Functional analyses provided evidence for the use of the Gs-cAMP signaling pathway in a small set of cubozoan opsins, indicating the possibility that the majority of other cubozoan opsins signal via distinct pathways. Additionally, these tests uncovered subtle differences among individual opsins, suggesting possible fine-tuning for specific photoreceptor tasks. Based on phylogenetic, expression and biochemical analysis we propose that rapid lineage- and species-specific duplications of the intron-less opsin genes and their subsequent functional diversification promoted evolution of a large repertoire of both visual and extraocular photoreceptors in cubozoans. PMID:26154478

  14. Identification of Functional Differences in Metabolic Networks Using Comparative Genomics and Constraint-Based Models

    PubMed Central

    Hamilton, Joshua J.; Reed, Jennifer L.

    2012-01-01

    Genome-scale network reconstructions are useful tools for understanding cellular metabolism, and comparisons of such reconstructions can provide insight into metabolic differences between organisms. Recent efforts toward comparing genome-scale models have focused primarily on aligning metabolic networks at the reaction level and then looking at differences and similarities in reaction and gene content. However, these reaction comparison approaches are time-consuming and do not identify the effect network differences have on the functional states of the network. We have developed a bilevel mixed-integer programming approach, CONGA, to identify functional differences between metabolic networks by comparing network reconstructions aligned at the gene level. We first identify orthologous genes across two reconstructions and then use CONGA to identify conditions under which differences in gene content give rise to differences in metabolic capabilities. By seeking genes whose deletion in one or both models disproportionately changes flux through a selected reaction (e.g., growth or by-product secretion) in one model over another, we are able to identify structural metabolic network differences enabling unique metabolic capabilities. Using CONGA, we explore functional differences between two metabolic reconstructions of Escherichia coli and identify a set of reactions responsible for chemical production differences between the two models. We also use this approach to aid in the development of a genome-scale model of Synechococcus sp. PCC 7002. Finally, we propose potential antimicrobial targets in Mycobacterium tuberculosis and Staphylococcus aureus based on differences in their metabolic capabilities. Through these examples, we demonstrate that a gene-centric approach to comparing metabolic networks allows for a rapid comparison of metabolic models at a functional level. Using CONGA, we can identify differences in reaction and gene content which give rise to different

  15. Target selection and determination of function in structural genomics.

    PubMed

    Watson, James D; Todd, Annabel E; Bray, James; Laskowski, Roman A; Edwards, Aled; Joachimiak, Andrzej; Orengo, Christine A; Thornton, Janet M

    2003-01-01

    The first crucial step in any structural genomics project is the selection and prioritization of target proteins for structure determination. There may be a number of selection criteria to be satisfied, including that the proteins have novel folds, that they be representatives of large families for which no structure is known, and so on. The better the selection at this stage, the greater is the value of the structures obtained at the end of the experimental process. This value can be further enhanced once the protein structures have been solved if the functions of the given proteins can also be determined. Here we describe the methods used at either end of the experimental process: firstly, sensitive sequence comparison techniques for selecting a high-quality list of target proteins, and secondly the various computational methods that can be applied to the eventual 3D structures to determine the most likely biochemical function of the proteins in question.

  16. Computational tools and resources for prediction and analysis of gene regulatory regions in the chick genome.

    PubMed

    Khan, Mohsin A F; Soto-Jimenez, Luz Mayela; Howe, Timothy; Streit, Andrea; Sosinsky, Alona; Stern, Claudio D

    2013-05-01

    The discovery of cis-regulatory elements is a challenging problem in bioinformatics, owing to distal locations and context-specific roles of these elements in controlling gene regulation. Here we review the current bioinformatics methodologies and resources available for systematic discovery of cis-acting regulatory elements and conserved transcription factor binding sites in the chick genome. In addition, we propose and make available, a novel workflow using computational tools that integrate CTCF analysis to predict putative insulator elements, enhancer prediction, and TFBS analysis. To demonstrate the usefulness of this computational workflow, we then use it to analyze the locus of the gene Sox2 whose developmental expression is known to be controlled by a complex array of cis-acting regulatory elements. The workflow accurately predicts most of the experimentally verified elements along with some that have not yet been discovered. A web version of the CTCF tool, together with instructions for using the workflow can be accessed from http://toolshed.g2.bx.psu.edu/view/mkhan1980/ctcf_analysis. For local installation of the tool, relevant Perl scripts and instructions are provided in the directory named "code" in the supplementary materials.

  17. Computational tools and resources for prediction and analysis of gene regulatory regions in the chick genome

    PubMed Central

    Khan, Mohsin A. F.; Soto-Jimenez, Luz Mayela; Howe, Timothy; Streit, Andrea; Sosinsky, Alona; Stern, Claudio D.

    2013-01-01

    The discovery of cis-regulatory elements is a challenging problem in bioinformatics, owing to distal locations and context-specific roles of these elements in controlling gene regulation. Here we review the current bioinformatics methodologies and resources available for systematic discovery of cis-acting regulatory elements and conserved transcription factor binding sites in the chick genome. In addition, we propose and make available, a novel workflow using computational tools that integrate CTCF analysis to predict putative insulator elements, enhancer prediction and TFBS analysis. To demonstrate the usefulness of this computational workflow, we then use it to analyze the locus of the gene Sox2 whose developmental expression is known to be controlled by a complex array of cis-acting regulatory elements. The workflow accurately predicts most of the experimentally verified elements along with some that have not yet been discovered. A web version of the CTCF tool, together with instructions for using the workflow can be accessed from http://toolshed.g2.bx.psu.edu/view/mkhan1980/ctcf_analysis. For local installation of the tool, relevant Perl scripts and instructions are provided in the directory named “code” in the supplementary materials. PMID:23355428

  18. Optopharmacological tools for restoring visual function in degenerative retinal diseases

    PubMed Central

    Tochitsky, Ivan; Kramer, Richard H

    2015-01-01

    Retinitis pigmentosa (RP) and age-related macular degeneration (AMD) are progressive retinal diseases that result from the death of rod and cone photoreceptors, ultimately leading to blindness. The only currently approved vision restoration treatment employs an implanted retinal ‘chip’ as a prosthetic device to electrically stimulate retinal neurons that survive after the photoreceptors are gone, thereby restoring light-driven neural signaling to the brain. An alternative strategy has been proposed, which would utilize optogenetic or opto-pharmacological tools to enable direct optical stimulation of surviving retinal neurons. Here, we review the latest studies evaluating the feasibility of these molecular tools as potential therapeutics for restoring visual function in human blinding disease. PMID:25706312

  19. Genome elimination: translating basic research into a future tool for plant breeding.

    PubMed

    Comai, Luca

    2014-06-01

    During the course of our history, humankind has been through different periods of agricultural improvement aimed at enhancing our food supply and the performance of food crops. In recent years, it has become apparent that future crop improvement efforts will require new approaches to address the local challenges of farmers while empowering discovery across industry and academia. New plant breeding approaches are needed to meet this challenge to help feed a growing world population. Here I discuss how a basic research discovery is being translated into a potential future tool for plant breeding, and share the story of researcher Simon Chan, who recognized the potential application of this new approach--genome elimination--for the breeding of staple food crops in Africa and South America.

  20. 'PACLIMS': A component LIM system for high-throughput functional genomic analysis

    PubMed Central

    Donofrio, Nicole; Rajagopalon, Ravi; Brown, Douglas; Diener, Stephen; Windham, Donald; Nolin, Shelly; Floyd, Anna; Mitchell, Thomas; Galadima, Natalia; Tucker, Sara; Orbach, Marc J; Patel, Gayatri; Farman, Mark; Pampanwar, Vishal; Soderlund, Cari; Lee, Yong-Hwan; Dean, Ralph A

    2005-01-01

    Background Recent advances in sequencing techniques leading to cost reduction have resulted in the generation of a growing number of sequenced eukaryotic genomes. Computational tools greatly assist in defining open reading frames and assigning tentative annotations. However, gene functions cannot be asserted without biological support through, among other things, mutational analysis. In taking a genome-wide approach to functionally annotate an entire organism, in this application the ~11,000 predicted genes in the rice blast fungus (Magnaporthe grisea), an effective platform for tracking and storing both the biological materials created and the data produced across several participating institutions was required. Results The platform designed, named PACLIMS, was built to support our high throughput pipeline for generating 50,000 random insertion mutants of Magnaporthe grisea. To be a useful tool for materials and data tracking and storage, PACLIMS was designed to be simple to use, modifiable to accommodate refinement of research protocols, and cost-efficient. Data entry into PACLIMS was simplified through the use of barcodes and scanners, thus reducing the potential human error, time constraints, and labor. This platform was designed in concert with our experimental protocol so that it leads the researchers through each step of the process from mutant generation through phenotypic assays, thus ensuring that every mutant produced is handled in an identical manner and all necessary data is captured. Conclusion Many sequenced eukaryotes have reached the point where computational analyses are no longer sufficient and require biological support for their predicted genes. Consequently, there is an increasing need for platforms that support high throughput genome-wide mutational analyses. While PACLIMS was designed specifically for this project, the source and ideas present in its implementation can be used as a model for other high throughput mutational endeavors. PMID

  1. Sugarcane Functional Genomics: Gene Discovery for Agronomic Trait Development

    PubMed Central

    Menossi, M.; Silva-Filho, M. C.; Vincentz, M.; Van-Sluys, M.-A.; Souza, G. M.

    2008-01-01

    Sugarcane is a highly productive crop used for centuries as the main source of sugar and recently to produce ethanol, a renewable bio-fuel energy source. There is increased interest in this crop due to the impending need to decrease fossil fuel usage. Sugarcane has a highly polyploid genome. Expressed sequence tag (EST) sequencing has significantly contributed to gene discovery and expression studies used to associate function with sugarcane genes. A significant amount of data exists on regulatory events controlling responses to herbivory, drought, and phosphate deficiency, which cause important constraints on yield and on endophytic bacteria, which are highly beneficial. The means to reduce drought, phosphate deficiency, and herbivory by the sugarcane borer have a negative impact on the environment. Improved tolerance for these constraints is being sought. Sugarcane's ability to accumulate sucrose up to 16% of its culm dry weight is a challenge for genetic manipulation. Genome-based technology such as cDNA microarray data indicates genes associated with sugar content that may be used to develop new varieties improved for sucrose content or for traits that restrict the expansion of the cultivated land. The genes can also be used as molecular markers of agronomic traits in traditional breeding programs. PMID:18273390

  2. Integrating functional genomics to accelerate mechanistic personalized medicine

    PubMed Central

    Tyner, Jeffrey W.

    2017-01-01

    The advent of deep sequencing technologies has resulted in the deciphering of tremendous amounts of genetic information. These data have led to major discoveries, and many anecdotes now exist of individual patients whose clinical outcomes have benefited from novel, genetically guided therapeutic strategies. However, the majority of genetic events in cancer are currently undrugged, leading to a biological gap between understanding of tumor genetic etiology and translation to improved clinical approaches. Functional screening has made tremendous strides in recent years with the development of new experimental approaches to studying ex vivo and in vivo drug sensitivity. Numerous discoveries and anecdotes also exist for translation of functional screening into novel clinical strategies; however, the current clinical application of functional screening remains largely confined to small clinical trials at specific academic centers. The intersection between genomic and functional approaches represents an ideal modality to accelerate our understanding of drug sensitivities as they relate to specific genetic events and further understand the full mechanisms underlying drug sensitivity patterns. PMID:28299357

  3. Integrating functional genomics to accelerate mechanistic personalized medicine.

    PubMed

    Tyner, Jeffrey W

    2017-03-01

    The advent of deep sequencing technologies has resulted in the deciphering of tremendous amounts of genetic information. These data have led to major discoveries, and many anecdotes now exist of individual patients whose clinical outcomes have benefited from novel, genetically guided therapeutic strategies. However, the majority of genetic events in cancer are currently undrugged, leading to a biological gap between understanding of tumor genetic etiology and translation to improved clinical approaches. Functional screening has made tremendous strides in recent years with the development of new experimental approaches to studying ex vivo and in vivo drug sensitivity. Numerous discoveries and anecdotes also exist for translation of functional screening into novel clinical strategies; however, the current clinical application of functional screening remains largely confined to small clinical trials at specific academic centers. The intersection between genomic and functional approaches represents an ideal modality to accelerate our understanding of drug sensitivities as they relate to specific genetic events and further understand the full mechanisms underlying drug sensitivity patterns.

  4. Development of Genomic and Genetic Tools for Foxtail Millet, and Use of These Tools in the Improvement of Biomass Production for Bioenergy Crops

    SciTech Connect

    Doust, Andrew, N.

    2011-11-11

    The overall aim of this research was to develop genomic and genetic tools in foxtail millet that will be useful in improving biomass production in bioenergy crops such as switchgrass, napier grass, and pearl millet. A variety of approaches have been implemented, and our lab has been primarily involved in genome analysis and quantitative genetic analysis. Our progress in these activities has been substantially helped by the genomic sequence of foxtail millet produced by the Joint Genome Institute (Bennetzen et al., in prep). In particular, the annotation and analysis of candidate genes for architecture, biomass production and flowering has led to new insights into the control of branching and flowering time, and has shown how closely related flowering time is to vegetative architectural development and biomass accumulation. The differences in genetic control identified at high and low density plantings have direct relevance to the breeding of bioenergy grasses that are tolerant of high planting densities. The developmental analyses have shown how plant architecture changes over time and may indicate which genes may best be manipulated at various times during development to obtain required biomass characteristics. This data contributes to the overall aim of significantly improving genetic and genomic tools in foxtail millet that can be directed to improvement of bioenergy grasses such as switchgrass, where it is important to maximize vegetative growth for greatest biomass production.

  5. Identification of novel biomass-degrading enzymes from genomic dark matter: Populating genomic sequence space with functional annotation.

    PubMed

    Piao, Hailan; Froula, Jeff; Du, Changbin; Kim, Tae-Wan; Hawley, Erik R; Bauer, Stefan; Wang, Zhong; Ivanova, Nathalia; Clark, Douglas S; Klenk, Hans-Peter; Hess, Matthias

    2014-08-01

    Although recent nucleotide sequencing technologies have significantly enhanced our understanding of microbial genomes, the function of ∼35% of genes identified in a genome currently remains unknown. To improve the understanding of microbial genomes and consequently of microbial processes it will be crucial to assign a function to this "genomic dark matter." Due to the urgent need for additional carbohydrate-active enzymes for improved production of transportation fuels from lignocellulosic biomass, we screened the genomes of more than 5,500 microorganisms for hypothetical proteins that are located in the proximity of already known cellulases. We identified, synthesized and expressed a total of 17 putative cellulase genes with insufficient sequence similarity to currently known cellulases to be identified as such using traditional sequence annotation techniques that rely on significant sequence similarity. The recombinant proteins of the newly identified putative cellulases were subjected to enzymatic activity assays to verify their hydrolytic activity towards cellulose and lignocellulosic biomass. Eleven (65%) of the tested enzymes had significant activity towards at least one of the substrates. This high success rate highlights that a gene context-based approach can be used to assign function to genes that are otherwise categorized as "genomic dark matter" and to identify biomass-degrading enzymes that have little sequence similarity to already known cellulases. The ability to assign function to genes that have no related sequence representatives with functional annotation will be important to enhance our understanding of microbial processes and to identify microbial proteins for a wide range of applications.

  6. Functional annotations for the Saccharomyces cerevisiae genome: the knowns and the known unknowns

    PubMed Central

    Christie, Karen R.; Hong, Eurie L.; Cherry, J. Michael

    2011-01-01

    The quest to characterize each of the genes of the yeast Saccharomyces cerevisiae has propelled the development and application of novel high-throughput (HTP) experimental techniques. To handle the enormous amount of information generated by these techniques, new bioinformatics tools and resources are needed. Gene Ontology (GO) annotations curated by the Saccharomyces Genome Database (SGD) have facilitated the development of algorithms that analyze HTP data and help predict functions for poorly characterized genes in S. cerevisiae and other organisms. Here, we describe how published results are incorporated into GO annotations at SGD and why researchers can benefit from using these resources wisely to analyze their HTP data and predict gene functions. PMID:19577472

  7. Deep sequencing is an appropriate tool for the selection of unique Hepatitis C virus (HCV) variants after single genomic amplification.

    PubMed

    Guinoiseau, Thibault; Moreau, Alain; Hohnadel, Guillaume; Ngo-Giang-Huong, Nicole; Brulard, Celine; Vourc'h, Patrick; Goudeau, Alain; Gaudy-Graffin, Catherine

    2017-01-01

    Hepatitis C virus (HCV) evolves rapidly in a single host and circulates as a quasispecies wich is a complex mixture of genetically distinct virus's but closely related namely variants. To identify intra-individual diversity and investigate their functional properties in vitro, it is necessary to define their quasispecies composition and isolate the HCV variants. This is possible using single genome amplification (SGA). This technique, based on serially diluted cDNA to amplify a single cDNA molecule (clonal amplicon), has already been used to determine individual HCV diversity. In these studies, positive PCR reactions from SGA were directly sequenced using Sanger technology. The detection of non-clonal amplicons is necessary for excluding them to facilitate further functional analysis. Here, we compared Next Generation Sequencing (NGS) with De Novo assembly and Sanger sequencing for their ability to distinguish clonal and non-clonal amplicons after SGA on one plasma specimen. All amplicons (n = 42) classified as clonal by NGS were also classified as clonal by Sanger sequencing. No double peaks were seen on electropherograms for non-clonal amplicons with position-specific nucleotide variation below 15% by NGS. Altogether, NGS circumvented many of the difficulties encountered when using Sanger sequencing after SGA and is an appropriate tool to reliability select clonal amplicons for further functional studies.

  8. Deep sequencing is an appropriate tool for the selection of unique Hepatitis C virus (HCV) variants after single genomic amplification

    PubMed Central

    Guinoiseau, Thibault; Moreau, Alain; Hohnadel, Guillaume; Ngo-Giang-Huong, Nicole; Brulard, Celine; Vourc’h, Patrick; Goudeau, Alain; Gaudy-Graffin, Catherine

    2017-01-01

    Hepatitis C virus (HCV) evolves rapidly in a single host and circulates as a quasispecies wich is a complex mixture of genetically distinct virus’s but closely related namely variants. To identify intra-individual diversity and investigate their functional properties in vitro, it is necessary to define their quasispecies composition and isolate the HCV variants. This is possible using single genome amplification (SGA). This technique, based on serially diluted cDNA to amplify a single cDNA molecule (clonal amplicon), has already been used to determine individual HCV diversity. In these studies, positive PCR reactions from SGA were directly sequenced using Sanger technology. The detection of non-clonal amplicons is necessary for excluding them to facilitate further functional analysis. Here, we compared Next Generation Sequencing (NGS) with De Novo assembly and Sanger sequencing for their ability to distinguish clonal and non-clonal amplicons after SGA on one plasma specimen. All amplicons (n = 42) classified as clonal by NGS were also classified as clonal by Sanger sequencing. No double peaks were seen on electropherograms for non-clonal amplicons with position-specific nucleotide variation below 15% by NGS. Altogether, NGS circumvented many of the difficulties encountered when using Sanger sequencing after SGA and is an appropriate tool to reliability select clonal amplicons for further functional studies. PMID:28362878

  9. NCBI GEO: archive for functional genomics data sets--10 years on.

    PubMed

    Barrett, Tanya; Troup, Dennis B; Wilhite, Stephen E; Ledoux, Pierre; Evangelista, Carlos; Kim, Irene F; Tomashevsky, Maxim; Marshall, Kimberly A; Phillippy, Katherine H; Sherman, Patti M; Muertter, Rolf N; Holko, Michelle; Ayanbule, Oluwabukunmi; Yefanov, Andrey; Soboleva, Alexandra

    2011-01-01

    A decade ago, the Gene Expression Omnibus (GEO) database was established at the National Center for Biotechnology Information (NCBI). The original objective of GEO was to serve as a public repository for high-throughput gene expression data generated mostly by microarray technology. However, the research community quickly applied microarrays to non-gene-expression studies, including examination of genome copy number variation and genome-wide profiling of DNA-binding proteins. Because the GEO database was designed with a flexible structure, it was possible to quickly adapt the repository to store these data types. More recently, as the microarray community switches to next-generation sequencing technologies, GEO has again adapted to host these data sets. Today, GEO stores over 20,000 microarray- and sequence-based functional genomics studies, and continues to handle the majority of direct high-throughput data submissions from the research community. Multiple mechanisms are provided to help users effectively search, browse, download and visualize the data at the level of individual genes or entire studies. This paper describes recent database enhancements, including new search and data representation tools, as well as a brief review of how the community uses GEO data. GEO is freely accessible at http://www.ncbi.nlm.nih.gov/geo/.

  10. JGI Fungal Genomics Program

    SciTech Connect

    Grigoriev, Igor V.

    2011-03-14

    Genomes of energy and environment fungi are in focus of the Fungal Genomic Program at the US Department of Energy Joint Genome Institute (JGI). Its key project, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts), and explores fungal diversity by means of genome sequencing and analysis. Over 50 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web-portal, which integrates sequence and functional data with genome analysis tools for user community. Sequence analysis supported by functional genomics leads to developing parts list for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such 'parts' suggested by comparative genomics and functional analysis in these areas are presented here

  11. Genomic Encyclopedia of Fungi

    SciTech Connect

    Grigoriev, Igor

    2012-08-10

    Genomes of fungi relevant to energy and environment are in focus of the Fungal Genomic Program at the US Department of Energy Joint Genome Institute (JGI). Its key project, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts), and explores fungal diversity by means of genome sequencing and analysis. Over 150 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web-portal, which integrates sequence and functional data with genome analysis tools for user community. Sequence analysis supported by functional genomics leads to developing parts list for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such parts suggested by comparative genomics and functional analysis in these areas are presented here.

  12. Functional genomic approaches for understanding the mode of action of Bacillus sp biocontrol strains

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Complete genome sequencing of several Bacillus sp. strains has shed new light on the mode of action of these antagonists of plant pathogens. The use of genomic data mining tools provided the ability to quickly determine the potential of these strains to produce bioactive secondary metabolites. Our B...

  13. ACCELERATORS: A GUI tool for beta function measurement using MATLAB

    NASA Astrophysics Data System (ADS)

    Chen, Guang-Ling; Tian, Shun-Qiang; Jiang, Bo-Cheng; Liu, Gui-Min

    2009-04-01

    The beta function measurement is used to detect the shift in the betatron tune as the strength of an individual quadrupole magnet is varied. A GUI (graphic user interface) tool for the beta function measurement is developed using the MATLAB program language in the Linux environment, which facilitates the commissioning of the Shanghai Synchrotron Radiation Facility (SSRF) storage ring. In this paper, we describe the design of the application and give some measuring results and discussions about the definition of the measurement. The program has been optimized to solve some restrictions of the AT tracking code. After the correction with LOCO (linear optics from closed orbits), the horizontal and the vertical root mean square values (rms values) can be reduced to 0.12 and 0.10.

  14. Functional genomics of the horn fly, Haematobia irritans (Linnaeus, 1758)

    PubMed Central

    2011-01-01

    Background The horn fly, Haematobia irritans (Linnaeus, 1758) (Diptera: Muscidae) is one of the most important ectoparasites of pastured cattle. Horn flies infestations reduce cattle weight gain and milk production. Additionally, horn flies are mechanical vectors of different pathogens that cause disease in cattle. The aim of this study was to conduct a functional genomics study in female horn flies using Expressed Sequence Tags (EST) analysis and RNA interference (RNAi). Results A cDNA library was made from whole abdominal tissues collected from partially fed adult female horn flies. High quality horn fly ESTs (2,160) were sequenced and assembled into 992 unigenes (178 contigs and 814 singlets) representing molecular functions such as serine proteases, cell metabolism, mitochondrial function, transcription and translation, transport, chromatin structure, vitellogenesis, cytoskeleton, DNA replication, cell response to stress and infection, cell proliferation and cell-cell interactions, intracellular trafficking and secretion, and development. Functional analyses were conducted using RNAi for the first time in horn flies. Gene knockdown by RNAi resulted in higher horn fly mortality (protease inhibitor functional group), reduced oviposition (vitellogenin, ferritin and vATPase groups) or both (immune response and 5'-NUC groups) when compared to controls. Silencing of ubiquitination ESTs did not affect horn fly mortality and ovisposition while gene knockdown in the ferritin and vATPse functional groups reduced mortality when compared to controls. Conclusions These results advanced the molecular characterization of this important ectoparasite and suggested candidate protective antigens for the development of vaccines for the control of horn fly infestations. PMID:21310032

  15. Functional Analysis of Shewanella, a cross genome comparison.

    SciTech Connect

    Serres, Margrethe H.

    2009-05-15

    The bacterial genus Shewanella includes a group of highly versatile organisms that have successfully adapted to life in many environments ranging from aquatic (fresh and marine) to sedimentary (lake and marine sediments, subsurface sediments, sea vent). A unique respiratory capability of the Shewanellas, initially observed for Shewanella oneidensis MR-1, is the ability to use metals and metalloids, including radioactive compounds, as electron acceptors. Members of the Shewanella genus have also been shown to degrade environmental pollutants i.e. halogenated compounds, making this group highly applicable for the DOE mission. S. oneidensis MR-1 has in addition been found to utilize a diverse set of nutrients and to have a large set of genes dedicated to regulation and to sensing of the environment. The sequencing of the S. oneidensis MR-1 genome facilitated experimental and bioinformatics analyses by a group of collaborating researchers, the Shewanella Federation. Through the joint effort and with support from Department of Energy S. oneidensis MR-1 has become a model organism of study. Our work has been a functional analysis of S. oneidensis MR-1, both by itself and as part of a comparative study. We have improved the annotation of gene products, assigned metabolic functions, and analyzed protein families present in S. oneidensis MR-1. The data has been applied to analysis of experimental data (i.e. gene expression, proteome) generated for S. oneidensis MR-1. Further, this work has formed the basis for a comparative study of over 20 members of the Shewanella genus. The species and strains selected for genome sequencing represented an evolutionary gradient of DNA relatedness, ranging from close to intermediate, and to distant. The organisms selected have also adapted to a variety of ecological niches. Through our work we have been able to detect and interpret genome similarities and differences between members of the genus. We have in this way contributed to the

  16. Functional genomics of cactus host shifts in Drosophila mojavensis.

    PubMed

    Matzkin, Luciano M; Watts, Thomas D; Bitler, Benjamin G; Machado, Carlos A; Markow, Therese A

    2006-12-01

    Understanding the genetic basis of adaptation to novel environments remains one of the major challenges confronting evolutionary biologists. While newly developed genomic approaches hold considerable promise for addressing this overall question, the relevant tools have not often been available in the most ecologically interesting organisms. Our study organism, Drosophila mojavensis, is a cactophilic Sonoran Desert endemic utilizing four different cactus hosts across its geographical range. Its well-known ecology makes it an attractive system in which to study the evolution of gene expression during adaptation. As a cactophile, D. mojavensis oviposits in the necrotic tissues of cacti, therefore exposing larvae and even adults to the varied and toxic compounds of rotting cacti. We have developed a cDNA microarray of D. mojavensis to examine gene expression associated with cactus host use. Using a population from the Baja California population we examined gene expression differences of third instar larvae when reared in two chemically distinct cactus hosts, agria (Stenocereus gummosus, native host) vs. organpipe (Stenocereus thurberi, alternative host). We have observed differential gene expression associated with cactus host use in genes involved in metabolism and detoxification.

  17. Computational and molecular tools for scalable rAAV-mediated genome editing

    PubMed Central

    Stoimenov, Ivaylo; Ali, Muhammad Akhtar; Pandzic, Tatjana; Sjöblom, Tobias

    2015-01-01

    The rapid discovery of potential driver mutations through large-scale mutational analyses of human cancers generates a need to characterize their cellular phenotypes. Among the techniques for genome editing, recombinant adeno-associated virus (rAAV)-mediated gene targeting is suited for knock-in of single nucleotide substitutions and to a lesser degree for gene knock-outs. However, the generation of gene targeting constructs and the targeting process is time-consuming and labor-intense. To facilitate rAAV-mediated gene targeting, we developed the first software and complementary automation-friendly vector tools to generate optimized targeting constructs for editing human protein encoding genes. By computational approaches, rAAV constructs for editing ∼71% of bases in protein-coding exons were designed. Similarly, ∼81% of genes were predicted to be targetable by rAAV-mediated knock-out. A Gateway-based cloning system for facile generation of rAAV constructs suitable for robotic automation was developed and used in successful generation of targeting constructs. Together, these tools enable automated rAAV targeting construct design, generation as well as enrichment and expansion of targeted cells with desired integrations. PMID:25488813

  18. netview p: a network visualization tool to unravel complex population structure using genome-wide SNPs.

    PubMed

    Steinig, Eike J; Neuditschko, Markus; Khatkar, Mehar S; Raadsma, Herman W; Zenger, Kyall R

    2016-01-01

    Network-based approaches are emerging as valuable tools for the analysis of complex genetic structure in wild and captive populations. netview p combines data quality control with the construction of population networks through mutual k-nearest neighbours thresholds applied to genome-wide SNPs. The program is cross-platform compatible, open-source and efficiently operates on data ranging from hundreds to hundreds of thousands of SNPs. The pipeline was used for the analysis of pedigree data from simulated (n = 750, SNPs = 1279) and captive silver-lipped pearl oysters (n = 415, SNPs = 1107), wild populations of the European hake from the Atlantic and Mediterranean (n = 834, SNPs = 380) and grey wolves from North America (n = 239, SNPs = 78 255). The population networks effectively visualize large- and fine-scale genetic structure within and between populations, including family-level structure and relationships. netview p comprises a network-based addition to other population analysis tools and provides user-friendly access to a complex network analysis pipeline through implementation in python.

  19. Plant Ion Channels: Gene Families, Physiology, and Functional Genomics Analyses

    PubMed Central

    Ward, John M.; Mäser, Pascal; Schroeder, Julian I.

    2016-01-01

    Distinct potassium, anion, and calcium channels in the plasma membrane and vacuolar membrane of plant cells have been identified and characterized by patch clamping. Primarily owing to advances in Arabidopsis genetics and genomics, and yeast functional complementation, many of the corresponding genes have been identified. Recent advances in our understanding of ion channel genes that mediate signal transduction and ion transport are discussed here. Some plant ion channels, for example, ALMT and SLAC anion channel subunits, are unique. The majority of plant ion channel families exhibit homology to animal genes; such families include both hyperpolarization-and depolarization-activated Shaker-type potassium channels, CLC chloride transporters/channels, cyclic nucleotide–gated channels, and ionotropic glutamate receptor homologs. These plant ion channels offer unique opportunities to analyze the structural mechanisms and functions of ion channels. Here we review gene families of selected plant ion channel classes and discuss unique structure-function aspects and their physiological roles in plant cell signaling and transport. PMID:18842100

  20. Reliability and validity of the functional analysis screening tool.

    PubMed

    Iwata, Brian A; Deleon, Iser G; Roscoe, Eileen M

    2013-01-01

    The Functional Analysis Screening Tool (FAST) is a 16-item questionnaire about antecedent and consequent events that might be correlated with the occurrence of problem behavior. Items are organized into 4 functional categories based on contingencies that maintain problem behavior. We assessed interrater reliability of the FAST with 196 problem behaviors through independent administration to pairs of raters (Study 1). Mean item-by-item agreement between pairs of raters was 71.5%. Agreement for individual items ranged from 53.3% to 84.5%. Agreement on FAST outcomes, based on comparison of informants' highest totals, was 64.8%. We assessed the validity of the FAST by comparing its outcomes with results of 69 functional analyses (Study 2). The FAST score predicted the condition of the functional analysis in which the highest rate of problem behavior occurred in 44 cases (63.8%). Potential uses of the FAST in the context of a clinical interview, as well as limitations, are discussed.

  1. Urban streets functionality as a tool for urban pollution management.

    PubMed

    Rey Gozalo, Guillermo; Barrigón Morillas, Juan Miguel; Gómez Escobar, Valentín

    2013-09-01

    Pollution derived from traffic can be considered one of the major problems of modern cities. Although considerable efforts have been devoted to gathering information about pollution and its control, little attention has been paid to the analysis of relationships between pollution distribution and town planning. The existence of these relationships would enable better prediction and prevention of pollution through town planning. In this work, an analysis of one pollutant derived from traffic (urban noise) in 27 cities is presented. Non-parametric tests and ROC analyses were employed, using the equivalent sound level (L(eq)) values as the dependent variable. For the characterization of the pollutant, an alternative concept to accessibility is analyzed: the concept of functionality. Results of statistical inferential analysis showed the existence of significant differences between the sound levels of the different category results, confirming that noise is stratified in the studied cities and that the five categories proposed based in the concept of functionality highlight this noise stratification. Moreover, high sensitivity and low non-specificity were obtained by using ROC analysis. Results of this analysis also showed an overall average value of prediction capacity close to 90%. Therefore, because the proposed categories highlight the noise stratification of the studied pollutant in all the towns studied, the functionality concept can be considered an interesting tool for urban planning and for designing pollution prevention policies. Finally, as traffic is a source of other urban pollutants, the concept of functionality may be a new concept for wide environmental pollution management.

  2. AnnoTALE: bioinformatics tools for identification, annotation, and nomenclature of TALEs from Xanthomonas genomic sequences

    PubMed Central

    Grau, Jan; Reschke, Maik; Erkes, Annett; Streubel, Jana; Morgan, Richard D.; Wilson, Geoffrey G.; Koebnik, Ralf; Boch, Jens

    2016-01-01

    Transcription activator-like effectors (TALEs) are virulence factors, produced by the bacterial plant-pathogen Xanthomonas, that function as gene activators inside plant cells. Although the contribution of individual TALEs to infectivity has been shown, the specific roles of most TALEs, and the overall TALE diversity in Xanthomonas spp. is not known. TALEs possess a highly repetitive DNA-binding domain, which is notoriously difficult to sequence. Here, we describe an improved method for characterizing TALE genes by the use of PacBio sequencing. We present ‘AnnoTALE’, a suite of applications for the analysis and annotation of TALE genes from Xanthomonas genomes, and for grouping similar TALEs into classes. Based on these classes, we propose a unified nomenclature for Xanthomonas TALEs that reveals similarities pointing to related functionalities. This new classification enables us to compare related TALEs and to identify base substitutions responsible for the evolution of TALE specificities. PMID:26876161

  3. LEMONS – A Tool for the Identification of Splice Junctions in Transcriptomes of Organisms Lacking Reference Genomes

    PubMed Central

    Bouskila, Amos; Chorev, Michal; Carmel, Liran; Mishmar, Dan

    2015-01-01

    RNA-seq is becoming a preferred tool for genomics studies of model and non-model organisms. However, DNA-based analysis of organisms lacking sequenced genomes cannot rely on RNA-seq data alone to isolate most genes of interest, as DNA codes both exons and introns. With this in mind, we designed a novel tool, LEMONS, that exploits the evolutionary conservation of both exon/intron boundary positions and splice junction recognition signals to produce high throughput splice-junction predictions in the absence of a reference genome. When tested on multiple annotated vertebrate mRNA data, LEMONS accurately identified 87% (average) of the splice-junctions. LEMONS was then applied to our updated Mediterranean chameleon transcriptome, which lacks a reference genome, and predicted a total of 90,820 exon-exon junctions. We experimentally verified these splice-junction predictions by amplifying and sequencing twenty randomly selected genes from chameleon DNA templates. Exons and introns were detected in 19 of 20 of the positions predicted by LEMONS. To the best of our knowledge, LEMONS is currently the only experimentally verified tool that can accurately predict splice-junctions in organisms that lack a reference genome. PMID:26606265

  4. Microfluidics as a functional tool for cell mechanics.

    PubMed

    Vanapalli, Siva A; Duits, Michel H G; Mugele, Frieder

    2009-01-05

    Living cells are a fascinating demonstration of nature's most intricate and well-coordinated micromechanical objects. They crawl, spread, contract, and relax-thus performing a multitude of complex mechanical functions. Alternatively, they also respond to physical and chemical cues that lead to remodeling of the cytoskeleton. To understand this intricate coupling between mechanical properties, mechanical function and force-induced biochemical signaling requires tools that are capable of both controlling and manipulating the cell microenvironment and measuring the resulting mechanical response. In this review, the power of microfluidics as a functional tool for research in cell mechanics is highlighted. In particular, current literature is discussed to show that microfluidics powered by soft lithographic techniques offers the following capabilities that are of significance for understanding the mechanical behavior of cells: (i) Microfluidics enables the creation of in vitro models of physiological environments in which cell mechanics can be probed. (ii) Microfluidics is an excellent means to deliver physical cues that affect cell mechanics, such as cell shape, fluid flow, substrate topography, and stiffness. (iii) Microfluidics can also expose cells to chemical cues, such as growth factors and drugs, which alter their mechanical behavior. Moreover, these chemical cues can be delivered either at the whole cell or subcellular level. (iv) Microfluidic devices offer the possibility of measuring the intrinsic mechanical properties of cells in a high throughput fashion. (v) Finally, microfluidic methods provide exquisite control over drop size, generation, and manipulation. As a result, droplets are being increasingly used to control the physicochemical environment of cells and as biomimetic analogs of living cells. These powerful attributes of microfluidics should further stimulate novel means of investigating the link between physicochemical cues and the biomechanical

  5. Metabolomic Functional Analysis of Bacterial Genomes: Final Report

    SciTech Connect

    Arp, Daniel J; Sayavedra-Soto, Luis A

    2008-01-01

    The availability of the complete DNA sequence of the bacterial genome of Nitrosomonas europaea offered the opportunity for unprecedented and detailed investigations of function. We studied the function of genes involved in carbohydrate and Fe metabolism. N. europaea has genes for the synthesis and degradation of glycogen and sucrose but cannot grow on substrates other than ammonia and CO2. Granules of glycogen were detected in whole cells by electron microscopy and quantified in cell-free extracts by enzymatic methods. The cellular glycogen and sucrose content varied depending on the composition of the growth medium and cellular growth stage. N. europaea also depends heavily on iron for metabolism of ammonia, is particularly interesting since it lacks genes for siderophore production, and has genes with only low similarity to known iron reductases, yet grows relatively well in medium containing low Fe. By comparing the transcriptomes of cells grown in iron-replete medium versus iron-limited medium, 247 genes were identified as differentially expressed. Mutant strains deficient in genes for sucrose, glycogen and iron metabolism were created and are being used to further our understanding of ammonia oxidizing bacteria.

  6. OCTAMER-BINDING TRANSCRIPTION FACTORS: GENOMICS AND FUNCTIONS

    PubMed Central

    Zhao, Feng-Qi

    2015-01-01

    The Octamer-binding proteins (Oct) are a group of highly conserved transcription factors that specifically bind to the octamer motif (ATGCAAAT) and closely related sequences that are found in promoters and enhancers of a wide variety of both ubiquitously expressed and cell type-specific genes. Oct factors belong to the larger family of POU domain factors that are characterized by the presence of a highly conserved bipartite DNA binding domain, consisting of an amino-terminal specific subdomain (POUS) and a carboxyl-terminal homeo-subdomain (POUH). Eleven Oct proteins have been named (Oct1-11), and currently, eight genes encoding Oct proteins (Oct1, Oct2, Oct3/4, Oct6, Oct7, Oct8, Oct9, and Oct11) have been cloned and characterized. Oct1 and Oct2 are widely expressed in adult tissues, while other Oct proteins are much more restricted in their expression patterns. Oct proteins are implicated in crucial and versatile biological events, such as embryogenesis, neurogenesis, immunity, and body glucose and amino acid metabolism. The aberrant expression and null function of Oct proteins have also been linked to various diseases, including deafness, diabetes and cancer. In this review, I will report both the genomic structure and major functions of individual Oct proteins in physiological and pathological processes. PMID:23747866

  7. Beyond the dna: a prototype for functional genomics

    SciTech Connect

    Albala, J

    2000-03-02

    A prototype oligonucleotide ''functional chip'' has been developed to screen novel DNA repair proteins for their ability to bind or alter different forms of DNA. This chip has been developed as a functional genomics screen for analysis of protein-DNA interactions for novel proteins identified from the Human Genome Project The process of novel gene identification that has ensued as a consequence of available sequence information is remarkable. The challenge how lies in determining the function of newly identified gene products in a time-and cost-effective high-throughput manner. The functional chip is generated by the robotic application of DNA spotted in a microarray format onto a glass slide. Individual proteins are then analyzed against the different form of DNA bound to the slide. Several prototype functional chips were designed to contain various DNA fragments tethered to a glass slide for analysis of protein-DNA binding or enzymatic activity of known proteins. The technology has been developed to screen novel, putative DNA repair proteins for their ability to bind various types of DNA alone and in concert with protein partners. An additional scheme has been devised to screen putative repair enzymes for their ability to process different types of DNA molecules. Current methods to analyze gene expression primarily utilize either of two technologies. The oligonucleotide chip, pioneered by Fodor and co-workers and Affymetrix, Inc., consists of greater than 64,000 oligonucleotides attached in situ to a glass support. The oligonucleotide chip has been used primarily to identify specific mutations in a given gene by hybridization against a fluorescently-labeled substrate. The second method is the microarray, whereby DNA targets are systematically arranged on a glass slide and then hybridized with fluorescently-labeled complex targets for gene expression analysis (Jordan, 1998). By this technique, a large amount of information can be obtained examining global

  8. Functional toxicology: tools to advance the future of toxicity testing

    PubMed Central

    Gaytán, Brandon D.; Vulpe, Chris D.

    2014-01-01

    The increased presence of chemical contaminants in the environment is an undeniable concern to human health and ecosystems. Historically, by relying heavily upon costly and laborious animal-based toxicity assays, the field of toxicology has often neglected examinations of the cellular and molecular mechanisms of toxicity for the majority of compounds—information that, if available, would strengthen risk assessment analyses. Functional toxicology, where cells or organisms with gene deletions or depleted proteins are used to assess genetic requirements for chemical tolerance, can advance the field of toxicity testing by contributing data regarding chemical mechanisms of toxicity. Functional toxicology can be accomplished using available genetic tools in yeasts, other fungi and bacteria, and eukaryotes of increased complexity, including zebrafish, fruit flies, rodents, and human cell lines. Underscored is the value of using less complex systems such as yeasts to direct further studies in more complex systems such as human cell lines. Functional techniques can yield (1) novel insights into chemical toxicity; (2) pathways and mechanisms deserving of further study; and (3) candidate human toxicant susceptibility or resistance genes. PMID:24847352

  9. Ciliobrevins as tools for studying dynein motor function

    PubMed Central

    Roossien, Douglas H.; Miller, Kyle E.; Gallo, Gianluca

    2015-01-01

    Dyneins are a small class of molecular motors that bind to microtubules and walk toward their minus ends. They are essential for the transport and distribution of organelles, signaling complexes and cytoskeletal elements. In addition dyneins generate forces on microtubule arrays that power the beating of cilia and flagella, cell division, migration and growth cone motility. Classical approaches to the study of dynein function in axons involve the depletion of dynein, expression of mutant/truncated forms of the motor, or interference with accessory subunits. By necessity, these approaches require prolonged time periods for the expression or manipulation of cellular dynein levels. With the discovery of the ciliobrevins, a class of cell permeable small molecule inhibitors of dynein, it is now possible to acutely disrupt dynein both globally and locally. In this review, we briefly summarize recent work using ciliobrevins to inhibit dynein and discuss the insights ciliobrevins have provided about dynein function in various cell types with a focus on neurons. We temper this with a discussion of the need for studies that will elucidate the mechanism of action of ciliobrevin and as well as the need for experiments to further analyze the specificity of ciliobreviens for dynein. Although much remains to be learned about ciliobrevins, these small molecules are proving themselves to be valuable novel tools to assess the cellular functions of dynein. PMID:26217180

  10. GeNemo: a search engine for web-based functional genomic data.

    PubMed

    Zhang, Yongqing; Cao, Xiaoyi; Zhong, Sheng

    2016-07-08

    A set of new data types emerged from functional genomic assays, including ChIP-seq, DNase-seq, FAIRE-seq and others. The results are typically stored as genome-wide intensities (WIG/bigWig files) or functional genomic regions (peak/BED files). These data types present new challenges to big data science. Here, we present GeNemo, a web-based search engine for functional genomic data. GeNemo searches user-input data against online functional genomic datasets, including the entire collection of ENCODE and mouse ENCODE datasets. Unlike text-based search engines, GeNemo's searches are based on pattern matching of functional genomic regions. This distinguishes GeNemo from text or DNA sequence searches. The user can input any complete or partial functional genomic dataset, for example, a binding intensity file (bigWig) or a peak file. GeNemo reports any genomic regions, ranging from hundred bases to hundred thousand bases, from any of the online ENCODE datasets that share similar functional (binding, modification, accessibility) patterns. This is enabled by a Markov Chain Monte Carlo-based maximization process, executed on up to 24 parallel computing threads. By clicking on a search result, the user can visually compare her/his data with the found datasets and navigate the identified genomic regions. GeNemo is available at www.genemo.org.

  11. Functional organization of the genome may shape the species boundary in the house mouse.

    PubMed

    Janoušek, Václav; Munclinger, Pavel; Wang, Liuyang; Teeter, Katherine C; Tucker, Priscilla K

    2015-05-01

    Genomic features such as rate of recombination and differentiation have been suggested to play a role in species divergence. However, the relationship of these phenomena to functional organization of the genome in the context of reproductive isolation remains unexplored. Here, we examine genomic characteristics of the species boundaries between two house mouse subspecies (Mus musculus musculus/M. m. domesticus). These taxa form a narrow semipermeable zone of secondary contact across Central Europe. Due to the incomplete nature of reproductive isolation, gene flow in the zone varies across the genome. We present an analysis of genomic differentiation, rate of recombination, and functional composition of genes relative to varying amounts of introgression. We assessed introgression using 1,316 autosomal single nucleotide polymorphism markers, previously genotyped in hybrid populations from three transects. We found a significant relationship between amounts of introgression and both genomic differentiation and rate of recombination with genomic regions of reduced introgression associated with higher genomic differentiation and lower rates of recombination, and the opposite for genomic regions of extensive introgression. We also found a striking functional polarization of genes based on where they are expressed in the cell. Regions of elevated introgression exhibit a disproportionate number of genes involved in signal transduction functioning at the cell periphery, among which olfactory receptor genes were found to be the most prominent group. Conversely, genes expressed intracellularly and involved in DNA binding were the most prevalent in regions of reduced introgression. We hypothesize that functional organization of the genome is an important driver of species divergence.

  12. RNAi-Based Functional Genomics Identifies New Virulence Determinants in Mucormycosis

    PubMed Central

    Sanchis, Marta; Lopez-Fernandez, Loida; Torres-Martínez, Santiago; Garre, Victoriano; Ruiz-Vázquez, Rosa María

    2017-01-01

    Mucorales are an emerging group of human pathogens that are responsible for the lethal disease mucormycosis. Unfortunately, functional studies on the genetic factors behind the virulence of these organisms are hampered by their limited genetic tractability, since they are reluctant to classical genetic tools like transposable elements or gene mapping. Here, we describe an RNAi-based functional genomic platform that allows the identification of new virulence factors through a forward genetic approach firstly described in Mucorales. This platform contains a whole-genome collection of Mucor circinelloides silenced transformants that presented a broad assortment of phenotypes related to the main physiological processes in fungi, including virulence, hyphae morphology, mycelial and yeast growth, carotenogenesis and asexual sporulation. Selection of transformants with reduced virulence allowed the identification of mcplD, which encodes a Phospholipase D, and mcmyo5, encoding a probably essential cargo transporter of the Myosin V family, as required for a fully virulent phenotype of M. circinelloides. Knock-out mutants for those genes showed reduced virulence in both Galleria mellonella and Mus musculus models, probably due to a delayed germination and polarized growth within macrophages. This study provides a robust approach to study virulence in Mucorales and as a proof of concept identified new virulence determinants in M. circinelloides that could represent promising targets for future antifungal therapies. PMID:28107502

  13. Functional Genomic and Advanced Genetic Studies Reveal Novel Insights into the Metabolism, Regulation, and Biology of Haloferax volcanii

    PubMed Central

    Soppa, Jörg

    2011-01-01

    The genome sequence of Haloferax volcanii is available and several comparative genomic in silico studies were performed that yielded novel insight for example into protein export, RNA modifications, small non-coding RNAs, and ubiquitin-like Small Archaeal Modifier Proteins. The full range of functional genomic methods has been established and results from transcriptomic, proteomic and metabolomic studies are discussed. Notably, Hfx. volcanii is together with Halobacterium salinarum the only prokaryotic species for which a translatome analysis has been performed. The results revealed that the fraction of translationally-regulated genes in haloarchaea is as high as in eukaryotes. A highly efficient genetic system has been established that enables the application of libraries as well as the parallel generation of genomic deletion mutants. Facile mutant generation is complemented by the possibility to culture Hfx. volcanii in microtiter plates, allowing the phenotyping of mutant collections. Genetic approaches are currently used to study diverse biological questions–from replication to posttranslational modification—and selected results are discussed. Taken together, the wealth of functional genomic and genetic tools make Hfx. volcanii a bona fide archaeal model species, which has enabled the generation of important results in recent years and will most likely generate further breakthroughs in the future. PMID:22190865

  14. Purdue Ionomics Information Management System. An Integrated Functional Genomics Platform1[C][W][OA

    PubMed Central

    Baxter, Ivan; Ouzzani, Mourad; Orcun, Seza; Kennedy, Brad; Jandhyala, Shrinivas S.; Salt, David E.

    2007-01-01

    The advent of high-throughput phenotyping technologies has created a deluge of information that is difficult to deal with without the appropriate data management tools. These data management tools should integrate defined workflow controls for genomic-scale data acquisition and validation, data storage and retrieval, and data analysis, indexed around the genomic information of the organism of interest. To maximize the impact of these large datasets, it is critical that they are rapidly disseminated to the broader research community, allowing open access for data mining and discovery. We describe here a system that incorporates such functionalities developed around the Purdue University high-throughput ionomics phenotyping platform. The Purdue Ionomics Information Management System (PiiMS) provides integrated workflow control, data storage, and analysis to facilitate high-throughput data acquisition, along with integrated tools for data search, retrieval, and visualization for hypothesis development. PiiMS is deployed as a World Wide Web-enabled system, allowing for integration of distributed workflow processes and open access to raw data for analysis by numerous laboratories. PiiMS currently contains data on shoot concentrations of P, Ca, K, Mg, Cu, Fe, Zn, Mn, Co, Ni, B, Se, Mo, Na, As, and Cd in over 60,000 shoot tissue samples of Arabidopsis (Arabidopsis thaliana), including ethyl methanesulfonate, fast-neutron and defined T-DNA mutants, and natural accession and populations of recombinant inbred lines from over 800 separate experiments, representing over 1,000,000 fully quantitative elemental concentrations. PiiMS is accessible at www.purdue.edu/dp/ionomics. PMID:17189337

  15. Development of 5006 Full-Length CDNAs in Barley: A Tool for Accessing Cereal Genomics Resources

    PubMed Central

    Sato, Kazuhiro; Shin-I, Tadasu; Seki, Motoaki; Shinozaki, Kazuo; Yoshida, Hideya; Takeda, Kazuyoshi; Yamazaki, Yukiko; Conte, Matthieu; Kohara, Yuji

    2009-01-01

    A collection of 5006 full-length (FL) cDNA sequences was developed in barley. Fifteen mRNA samples from various organs and treatments were pooled to develop a cDNA library using the CAP trapper method. More than 60% of the clones were confirmed to have complete coding sequences, based on comparison with rice amino acid and UniProt sequences. Blastn homologies (E<1E-5) to rice genes and Arabidopsis genes were 89 and 47%, respectively. Of the 5028 possible amino acid sequences derived from the 5006 FLcDNAs, 4032 (80.2%) were classified into 1678 GreenPhyl multigenic families. There were 555 cDNAs showing low homology to both rice and Arabidopsis. Gene ontology annotation by InterProScan indicated that many of these cDNAs (71%) have no known molecular functions and may be unique to barley. The cDNAs showed high homology to Barley 1 GeneChip oligo probes (81%) and the wheat gene index (84%). The high homology between FLcDNAs (27%) and mapped barley expressed sequence tag enabled assigning linkage map positions to 151–233 FLcDNAs on each of the seven barley chromosomes. These comprehensive barley FLcDNAs provide strong platform to connect pre-existing genomic and genetic resources and accelerate gene identification and genome analysis in barley and related species. PMID:19150987

  16. Argot2: a large scale function prediction tool relying on semantic similarity of weighted Gene Ontology terms

    PubMed Central

    2012-01-01

    Background Predicting protein function has become increasingly demanding in the era of next generation sequencing technology. The task to assign a curator-reviewed function to every single sequence is impracticable. Bioinformatics tools, easy to use and able to provide automatic and reliable annotations at a genomic scale, are necessary and urgent. In this scenario, the Gene Ontology has provided the means to standardize the annotation classification with a structured vocabulary which can be easily exploited by computational methods. Results Argot2 is a web-based function prediction tool able to annotate nucleic or protein sequences from small datasets up to entire genomes. It accepts as input a list of sequences in FASTA format, which are processed using BLAST and HMMER searches vs UniProKB and Pfam databases respectively; these sequences are then annotated with GO terms retrieved from the UniProtKB-GOA database and the terms are weighted using the e-values from BLAST and HMMER. The weighted GO terms are processed according to both their semantic similarity relations described by the Gene Ontology and their associated score. The algorithm is based on the original idea developed in a previous tool called Argot. The entire engine has been completely rewritten to improve both accuracy and computational efficiency, thus allowing for the annotation of complete genomes. Conclusions The revised algorithm has been already employed and successfully tested during in-house genome projects of grape and apple, and has proven to have a high precision and recall in all our benchmark conditions. It has also been successfully compared with Blast2GO, one of the methods most commonly employed for sequence annotation. The server is freely accessible at http://www.medcomp.medicina.unipd.it/Argot2. PMID:22536960

  17. Nonempirical Double-Hybrid Functionals: An Effective Tool for Chemists.

    PubMed

    Brémond, Eric; Ciofini, Ilaria; Sancho-García, Juan Carlos; Adamo, Carlo

    2016-08-16

    Density functional theory (DFT) emerged in the last two decades as the most reliable tool for the description and prediction of properties of molecular systems and extended materials, coupling in an unprecedented way high accuracy and reasonable computational cost. This success rests also on the development of more and more performing density functional approximations (DFAs). Indeed, the Achilles' heel of DFT is represented by the exchange-correlation contribution to the total energy, which, being unknown, must be approximated. Since the beginning of the 1990s, global hybrids (GH) functionals, where an explicit dependence of the exchange-correlation energy on occupied Kohn-Sham orbitals is introduced thanks to a fraction of Hartree-Fock-like exchange, imposed themselves as the most reliable DFAs for chemical applications. However, if these functionals normally provide results of sufficient accuracy for most of the cases analyzed, some properties, such as thermochemistry or dispersive interactions, can still be significantly improved. A possible way out is represented by the inclusion, into the exchange-correlation functional, of an explicit dependence on virtual Kohn-Sham orbitals via perturbation theory. This leads to a new class of functionals, called double-hybrids (DHs). In this Account, we describe our nonempirical approach to DHs, which, following the line traced by the Perdew-Burke-Ernzerhof approach, allows for the definition of a GH (PBE0) and a DH (QIDH) model. In such a way, a whole family of nonempirical functionals, spanning on the highest rungs of the Perdew's quality scale, is now available and competitive with other-more empirical-DFAs. Discussion of selected cases, ranging from thermochemistry and reactions to weak interactions and excitation energies, not only show the large range of applicability of nonempirical DFAs, but also underline how increasing the number of theoretical constraints parallels with an improvement of the DFA's numerical

  18. Dynamic Visual Acuity: a Functionally Relevant Research Tool

    NASA Technical Reports Server (NTRS)

    Peters, Brian T.; Brady, Rachel A.; Miller, Chris A.; Mulavara, Ajitkumar P.; Wood, Scott J.; Cohen, Helen S.; Bloomberg, Jacob J.

    2010-01-01

    Coordinated movements between the eyes and head are required to maintain a stable retinal image during head and body motion. The vestibulo-ocular reflex (VOR) plays a significant role in this gaze control system that functions well for most daily activities. However, certain environmental conditions or interruptions in normal VOR function can lead to inadequate ocular compensation, resulting in oscillopsia, or blurred vision. It is therefore possible to use acuity to determine when the environmental conditions, VOR function, or the combination of the two is not conductive for maintaining clear vision. Over several years we have designed and tested several tests of dynamic visual acuity (DVA). Early tests used the difference between standing and walking acuity to assess decrements in the gaze stabilization system after spaceflight. Supporting ground-based studies measured the responses from patients with bilateral vestibular dysfunction and explored the effects of visual target viewing distance and gait cycle events on walking acuity. Results from these studies show that DVA is affected by spaceflight, is degraded in patients with vestibular dysfunction, changes with target distance, and is not consistent across the gait cycle. We have recently expanded our research to include studies in which seated subjects are translated or rotated passively. Preliminary results from this work indicate that gaze stabilization ability may differ between similar active and passive conditions, may change with age, and can be affected by the location of the visual target with respect to the axis of motion. Use of DVA as a diagnostic tool is becoming more popular but the functional nature of the acuity outcome measure also makes it ideal for identifying conditions that could lead to degraded vision. By doing so, steps can be taken to alter the problematic environments to improve the man-machine interface and optimize performance.

  19. EggLib: processing, analysis and simulation tools for population genetics and genomics

    PubMed Central

    2012-01-01

    Background With the considerable growth of available nucleotide sequence data over the last decade, integrated and flexible analytical tools have become a necessity. In particular, in the field of population genetics, there is a strong need for automated and reliable procedures to conduct repeatable and rapid polymorphism analyses, coalescent simulations, data manipulation and estimation of demographic parameters under a variety of scenarios. Results In this context, we present EggLib (Evolutionary Genetics and Genomics Library), a flexible and powerful C++/Python software package providing efficient and easy to use computational tools for sequence data management and extensive population genetic analyses on nucleotide sequence data. EggLib is a multifaceted project involving several integrated modules: an underlying computationally efficient C++ library (which can be used independently in pure C++ applications); two C++ programs; a Python package providing, among other features, a high level Python interface to the C++ library; and the egglib script which provides direct access to pre-programmed Python applications. Conclusions EggLib has been designed aiming to be both efficient and easy to use. A wide array of methods are implemented, including file format conversion, sequence alignment edition, coalescent simulations, neutrality tests and estimation of demographic parameters by Approximate Bayesian Computation (ABC). Classes implementing different demographic scenarios for ABC analyses can easily be developed by the user and included to the package. EggLib source code is distributed freely under the GNU General Public License (GPL) from its website http://egglib.sourceforge.net/ where a full documentation and a manual can also be found and downloaded. PMID:22494792

  20. Mammalian-specific genomic functions: Newly acquired traits generated by genomic imprinting and LTR retrotransposon-derived genes in mammals

    PubMed Central

    KANEKO-ISHINO, Tomoko; ISHINO, Fumitoshi

    2015-01-01

    Mammals, including human beings, have evolved a unique viviparous reproductive system and a highly developed central nervous system. How did these unique characteristics emerge in mammalian evolution, and what kinds of changes did occur in the mammalian genomes as evolution proceeded? A key conceptual term in approaching these issues is “mammalian-specific genomic functions”, a concept covering both mammalian-specific epigenetics and genetics. Genomic imprinting and LTR retrotransposon-derived genes are reviewed as the representative, mammalian-specific genomic functions that are essential not only for the current mammalian developmental system, but also mammalian evolution itself. First, the essential roles of genomic imprinting in mammalian development, especially related to viviparous reproduction via placental function, as well as the emergence of genomic imprinting in mammalian evolution, are discussed. Second, we introduce the novel concept of “mammalian-specific traits generated by mammalian-specific genes from LTR retrotransposons”, based on the finding that LTR retrotransposons served as a critical driving force in the mammalian evolution via generating mammalian-specific genes. PMID:26666304

  1. The power of EST sequence data: Relation to Acyrthosiphon pisum genome annotation and functional genomics initiatives

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genes important to aphid biology, survival and reproduction were successfully identified by use of a genomics approach. We created and described the Sequencing, compilation, and annotation of the approxiamtely 525Mb nuclear genome of the pea aphid, Acyrthosiphon pisum, which represents an important ...

  2. Measuring error rates in genomic perturbation screens: gold standards for human functional genomics

    PubMed Central

    Hart, Traver; Brown, Kevin R; Sircoulomb, Fabrice; Rottapel, Robert; Moffat, Jason

    2014-01-01

    Technological advancement has opened the door to systematic genetics in mammalian cells. Genome-scale loss-of-function screens can assay fitness defects induced by partial gene knockdown, using RNA interference, or complete gene knockout, using new CRISPR techniques. These screens can reveal the basic blueprint required for cellular proliferation. Moreover, comparing healthy to cancerous tissue can uncover genes that are essential only in the tumor; these genes are targets for the development of specific anticancer therapies. Unfortunately, progress in this field has been hampered by off-target effects of perturbation reagents and poorly quantified error rates in large-scale screens. To improve the quality of information derived from these screens, and to provide a framework for understanding the capabilities and limitations of CRISPR technology, we derive gold-standard reference sets of essential and nonessential genes, and provide a Bayesian classifier of gene essentiality that outperforms current methods on both RNAi and CRISPR screens. Our results indicate that CRISPR technology is more sensitive than RNAi and that both techniques have nontrivial false discovery rates that can be mitigated by rigorous analytical methods. PMID:24987113

  3. Small-molecule tools for dissecting the roles of SSB/protein interactions in genome maintenance

    SciTech Connect

    Lu, Duo; Bernstein, Douglas A.; Satyshur, Kenneth A.; Keck, James L.

    2010-09-03

    Bacterial single-stranded DNA-binding proteins (SSBs) help to recruit a diverse array of genome maintenance enzymes to their sites of action through direct protein interactions. For all cases examined to date, these interactions are mediated by the evolutionarily conserved C terminus of SSB (SSB-Ct). The essential nature of SSB protein interactions makes inhibitors that block SSB complex formation valuable biochemical tools and attractive potential antibacterial agents. Here, we identify four small molecules that disrupt complexes formed between Escherichia coli SSB and Exonuclease I (ExoI), a well-studied SSB-interacting enzyme. Each compound disrupts ExoI/SSB-Ct peptide complexes and abrogates SSB stimulation of ExoI nuclease activity. Structural and biochemical studies support a model for three of the compounds in which they compete with SSB for binding to ExoI. The fourth appears to rely on an allosteric mechanism to disrupt ExoI/SSB complexes. Subsets of the inhibitors block SSB-Ct complex formation with two other SSB-interaction partners as well, which highlights their utility as reagents for investigating the roles of SSB/protein interactions in diverse DNA replication, recombination, and repair reactions.

  4. Visualizing genome and systems biology: technologies, tools, implementation techniques and trends, past, present and future.

    PubMed

    Pavlopoulos, Georgios A; Malliarakis, Dimitris; Papanikolaou, Nikolas; Theodosiou, Theodosis; Enright, Anton J; Iliopoulos, Ioannis

    2015-01-01

    "Α picture is worth a thousand words." This widely used adage sums up in a few words the notion that a successful visual representation of a concept should enable easy and rapid absorption of large amounts of information. Although, in general, the notion of capturing complex ideas using images is very appealing, would 1000 words be enough to describe the unknown in a research field such as the life sciences? Life sciences is one of the biggest generators of enormous datasets, mainly as a result of recent and rapid technological advances; their complexity can make these datasets incomprehensible without effective visualization methods. Here we discuss the past, present and future of genomic and systems biology visualization. We briefly comment on many visualization and analysis tools and the purposes that they serve. We focus on the latest libraries and programming languages that enable more effective, efficient and faster approaches for visualizing biological concepts, and also comment on the future human-computer interaction trends that would enable for enhancing visualization further.

  5. Bioadhesion in ascidians: a developmental and functional genomics perspective

    PubMed Central

    Pennati, Roberta; Rothbächer, Ute

    2015-01-01

    The development of bioadhesives inspired from marine animals is a promising approach to generate new tissue-compatible medical components. A number of marine species, through their adhesive properties, also represent significant foulers that become increasingly problematic to aquaculture, shipping or local biodiversity. In order to develop more sophisticated man-made glues and/or efficient fouling resistant surfaces, it is important to understand the mechanical, structural and molecular properties of adhesive organs in selected species. Ascidians are marine invertebrates with larvae that opportunistically attach to almost any type of submerged surface to undergo metamorphosis into permanently sessile adults. Not only do they represent a globally important fouling organism, but they are becoming increasingly popular as model organisms for developmental biology. The latter is due to their phylogenetic position as the sister group to the vertebrates and their cellular and molecular accessibility for experimentation. In this paper, we review the mechanisms of larval adhesion in ascidians and draw conclusions from comparative analyses of selected species. We further discuss how knowledge from a developmental and functional genomics point of view can advance our understanding of cellular and molecular signatures and their hierarchical usage in animal adhesive organs. PMID:25657840

  6. An analysis of extensible modelling for functional genomics data

    PubMed Central

    Jones, Andrew R; Paton, Norman W

    2005-01-01

    Background Several data formats have been developed for large scale biological experiments, using a variety of methodologies. Most data formats contain a mechanism for allowing extensions to encode unanticipated data types. Extensions to data formats are important because the experimental methodologies tend to be fairly diverse and rapidly evolving, which hinders the creation of formats that will be stable over time. Results In this paper we review the data formats that exist in functional genomics, some of which have become de facto or de jure standards, with a particular focus on how each domain has been modelled, and how each format allows extensions. We describe the tasks that are frequently performed over data formats and analyse how well each task is supported by a particular modelling structure. Conclusion From our analysis, we make recommendations as to the types of modelling structure that are most suitable for particular types of experimental annotation. There are several standards currently under development that we believe could benefit from systematically following a set of guidelines. PMID:16188029

  7. Strigolactone biology: genes, functional genomics, epigenetics and applications.

    PubMed

    Makhzoum, Abdullah; Yousefzadi, Morteza; Malik, Sonia; Gantet, Pascal; Tremouillaux-Guiller, Jocelyne

    2017-03-01

    Strigolactones (SLs) represent an important new plant hormone class marked by their multifunctional role in plant and rhizosphere interactions. These compounds stimulate hyphal branching in arbuscular mycorrhizal fungi (AMF) and seed germination of root parasitic plants. In addition, they are involved in the control of plant architecture by inhibiting bud outgrowth as well as many other morphological and developmental processes together with other plant hormones such as auxins and cytokinins. The biosynthetic pathway of SLs that are derived from carotenoids was partially decrypted based on the identification of mutants from a variety of plant species. Only a few SL biosynthetic and regulated genes and related regulatory transcription factors have been identified. However, functional genomics and epigenetic studies started to give first elements on the modality of the regulation of SLs related genes. Since they control plant architecture and plant-rhizosphere interaction, SLs start to be used for agronomical and biotechnological applications. Furthermore, the genes involved in the SL biosynthetic pathway and genes regulated by SL constitute interesting targets for plant breeding. Therefore, it is necessary to decipher and better understand the genetic determinants of their regulation at different levels.

  8. Advances in Pig Genomics and Functional Gene Discovery

    PubMed Central

    2003-01-01

    Advances in pig gene identification, mapping and functional analysis have continued to make rapid progress. The porcine genetic linkage map now has nearly 3000 loci, including several hundred genes, and is likely to expand considerably in the next few years, with many more genes and amplified fragment length polymorphism (AFLP) markers being added to the map. The physical genetic map is also growing rapidly and has over 3000 genes and markers. Several recent quantitative trait loci (QTL) scans and candidate gene analyses have identified important chromosomal regions and individual genes associated with traits of economic interest. The commercial pig industry is actively using this information and traditional performance information to improve pig production by marker-assisted selection (MAS). Research to study the co-expression of thousands of genes is now advancing and methods to combine these approaches to aid in gene discovery are under way. The pig's role in xenotransplantation and biomedical research makes the study of its genome important for the study of human disease. This review will briefly describe advances made, directions for future research and the implications for both the pig industry and human health. PMID:18629119

  9. MAKER-P: a tool kit for the rapid creation, management, and quality control of plant genome annotations.

    PubMed

    Campbell, Michael S; Law, MeiYee; Holt, Carson; Stein, Joshua C; Moghe, Gaurav D; Hufnagel, David E; Lei, Jikai; Achawanantakun, Rujira; Jiao, Dian; Lawrence, Carolyn J; Ware, Doreen; Shiu, Shin-Han; Childs, Kevin L; Sun, Yanni; Jiang, Ning; Yandell, Mark

    2014-02-01

    We have optimized and extended the widely used annotation engine MAKER in order to better support plant genome annotation efforts. New features include better parallelization for large repeat-rich plant genomes, noncoding RNA annotation capabilities, and support for pseudogene identification. We have benchmarked the resulting software tool kit, MAKER-P, using the Arabidopsis (Arabidopsis thaliana) and maize (Zea mays) genomes. Here, we demonstrate the ability of the MAKER-P tool kit to automatically update, extend, and revise the Arabidopsis annotations in light of newly available data and to annotate pseudogenes and noncoding RNAs absent from The Arabidopsis Informatics Resource 10 build. Our results demonstrate that MAKER-P can be used to manage and improve the annotations of even Arabidopsis, perhaps the best-annotated plant genome. We have also installed and benchmarked MAKER-P on the Texas Advanced Computing Center. We show that this public resource can de novo annotate the entire Arabidopsis and maize genomes in less than 3 h and produce annotations of comparable quality to those of the current The Arabidopsis Information Resource 10 and maize V2 annotation builds.

  10. CodaChrome: a tool for the visualization of proteome conservation across all fully sequenced bacterial genomes

    PubMed Central

    2014-01-01

    Background The relationships between bacterial genomes are complicated by rampant horizontal gene transfer, varied selection pressures, acquisition of new genes, loss of genes, and divergence of genes, even in closely related lineages. As more and more bacterial genomes are sequenced, organizing and interpreting the incredible amount of relational information that connects them becomes increasingly difficult. Results We have developed CodaChrome (http://www.sourceforge.com/p/codachrome), a one-versus-all proteome comparison tool that allows the user to visually investigate the relationship between a bacterial proteome of interest and the proteomes encoded by every other bacterial genome recorded in GenBank in a massive interactive heat map. This tool has allowed us to rapidly identify the most highly conserved proteins encoded in the bacterial pan-genome, fast-clock genes useful for subtyping of bacterial species, the evolutionary history of an indel in the Sphingobium lineage, and an example of horizontal gene transfer from a member of the genus Enterococcus to a recent ancestor of Helicobacter pylori. Conclusion CodaChrome is a user-friendly and powerful tool for simultaneously visualizing relationships between thousands of proteomes. PMID:24460813

  11. Annotation inconsistencies beyond sequence similarity-based function prediction - phylogeny and genome structure.

    PubMed

    Promponas, Vasilis J; Iliopoulos, Ioannis; Ouzounis, Christos A

    2015-01-01

    The function annotation process in computational biology has increasingly shifted from the traditional characterization of individual biochemical roles of protein molecules to the system-wide detection of entire metabolic pathways and genomic structures. The so-called genome-aware methods broaden misannotation inconsistencies in genome sequences beyond protein function assignments, encompassing phylogenetic anomalies and artifactual genomic regions. We outline three categories of error propagation in databases by providing striking examples - at various levels of appreciation by the community from traditional to emerging, thus raising awareness for future solutions.

  12. Genomic and functional adaptation in surface ocean planktonic prokaryotes.

    PubMed

    Yooseph, Shibu; Nealson, Kenneth H; Rusch, Douglas B; McCrow, John P; Dupont, Christopher L; Kim, Maria; Johnson, Justin; Montgomery, Robert; Ferriera, Steve; Beeson, Karen; Williamson, Shannon J; Tovchigrechko, Andrey; Allen, Andrew E; Zeigler, Lisa A; Sutton, Granger; Eisenstadt, Eric; Rogers, Yu-Hui; Friedman, Robert; Frazier, Marvin; Venter, J Craig

    2010-11-04

    The understanding of marine microbial ecology and metabolism has been hampered by the paucity of sequenced reference genomes. To this end, we report the sequencing of 137 diverse marine isolates collected from around the world. We analysed these sequences, along with previously published marine prokaryotic genomes, in the context of marine metagenomic data, to gain insights into the ecology of the surface ocean prokaryotic picoplankton (0.1-3.0 μm size range). The results suggest that the sequenced genomes define two microbial groups: one composed of only a few taxa that are nearly always abundant in picoplanktonic communities, and the other consisting of many microbial taxa that are rarely abundant. The genomic content of the second group suggests that these microbes are capable of slow growth and survival in energy-limited environments, and rapid growth in energy-rich environments. By contrast, the abundant and cosmopolitan picoplanktonic prokaryotes for which there is genomic representation have smaller genomes, are probably capable of only slow growth and seem to be relatively unable to sense or rapidly acclimate to energy-rich conditions. Their genomic features also lead us to propose that one method used to avoid predation by viruses and/or bacterivores is by means of slow growth and the maintenance of low biomass.

  13. A draft genome of field pennycress (Thlaspi arvense) provides tools for the domestication of a new winter biofuel crop

    PubMed Central

    Dorn, Kevin M.; Fankhauser, Johnathon D.; Wyse, Donald L.; Marks, M. David

    2015-01-01

    Field pennycress (Thlaspi arvense L.) is being domesticated as a new winter cover crop and biofuel species for the Midwestern United States that can be double-cropped between corn and soybeans. A genome sequence will enable the use of new technologies to make improvements in pennycress. To generate a draft genome, a hybrid sequencing approach was used to generate 47 Gb of DNA sequencing reads from both the Illumina and PacBio platforms. These reads were used to assemble 6,768 genomic scaffolds. The draft genome was annotated using the MAKER pipeline, which identified 27,390 predicted protein-coding genes, with almost all of these predicted peptides having significant sequence similarity to Arabidopsis proteins. A comprehensive analysis of pennycress gene homologues involved in glucosinolate biosynthesis, metabolism, and transport pathways revealed high sequence conservation compared with other Brassicaceae species, and helps validate the assembly of the pennycress gene space in this draft genome. Additional comparative genomic analyses indicate that the knowledge gained from years of basic Brassicaceae research will serve as a powerful tool for identifying gene targets whose manipulation can be predicted to result in improvements for pennycress. PMID:25632110

  14. Evolution and function of genomic imprinting in plants

    PubMed Central

    Rodrigues, Jessica A.; Zilberman, Daniel

    2015-01-01

    Genomic imprinting, an inherently epigenetic phenomenon defined by parent of origin-dependent gene expression, is observed in mammals and flowering plants. Genome-scale surveys of imprinted expression and the underlying differential epigenetic marks have led to the discovery of hundreds of imprinted plant genes and confirmed DNA and histone methylation as key regulators of plant imprinting. However, the biological roles of the vast majority of imprinted plant genes are unknown, and the evolutionary forces shaping plant imprinting remain rather opaque. Here, we review the mechanisms of plant genomic imprinting and discuss theories of imprinting evolution and biological significance in light of recent findings. PMID:26680300

  15. Evolution and function of genomic imprinting in plants.

    PubMed

    Rodrigues, Jessica A; Zilberman, Daniel

    2015-12-15

    Genomic imprinting, an inherently epigenetic phenomenon defined by parent of origin-dependent gene expression, is observed in mammals and flowering plants. Genome-scale surveys of imprinted expression and the underlying differential epigenetic marks have led to the discovery of hundreds of imprinted plant genes and confirmed DNA and histone methylation as key regulators of plant imprinting. However, the biological roles of the vast majority of imprinted plant genes are unknown, and the evolutionary forces shaping plant imprinting remain rather opaque. Here, we review the mechanisms of plant genomic imprinting and discuss theories of imprinting evolution and biological significance in light of recent findings.

  16. MISIS-2: A bioinformatics tool for in-depth analysis of small RNAs and representation of consensus master genome in viral quasispecies.

    PubMed

    Seguin, Jonathan; Otten, Patricia; Baerlocher, Loïc; Farinelli, Laurent; Pooggin, Mikhail M

    2016-07-01

    In most eukaryotes, small RNA (sRNA) molecules such as miRNAs, siRNAs and piRNAs regulate gene expression and repress transposons and viruses. AGO/PIWI family proteins sort functional sRNAs based on size, 5'-nucleotide and other sequence features. In plants and some animals, viral sRNAs are extremely diverse and cover the entire viral genome sequences, which allows for de novo reconstruction of a complete viral genome by deep sequencing and bioinformatics analysis of viral sRNAs. Previously, we have developed a tool MISIS to view and analyze sRNA maps of viruses and cellular genome regions which spawn multiple sRNAs. Here we describe a new release of MISIS, MISIS-2, which enables to determine and visualize a consensus sequence and count sRNAs of any chosen sizes and 5'-terminal nucleotide identities. Furthermore we demonstrate the utility of MISIS-2 for identification of single nucleotide polymorphisms (SNPs) at each position of a reference sequence and reconstruction of a consensus master genome in evolving viral quasispecies. MISIS-2 is a Java standalone program. It is freely available along with the source code at the website http://www.fasteris.com/apps.

  17. [The application of genome editing in identification of plant gene function and crop breeding].

    PubMed

    Xiangchun, Zhou; Yongzhong, Xing

    2016-03-01

    Plant genome can be modified via current biotechnology with high specificity and excellent efficiency. Zinc finger nucleases (ZFN), transcription activator-like effector nucleases (TALEN) and clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated 9 (Cas9) system are the key engineered nucleases used in the genome editing. Genome editing techniques enable gene targeted mutagenesis, gene knock-out, gene insertion or replacement at the target sites during the endogenous DNA repair process, including non-homologous end joining (NHEJ) and homologous recombination (HR), triggered by the induction of DNA double-strand break (DSB). Genome editing has been successfully applied in the genome modification of diverse plant species, such as Arabidopsis thaliana, Oryza sativa, and Nicotiana tabacum. In this review, we summarize the application of genome editing in identification of plant gene function and crop breeding. Moreover, we also discuss the improving points of genome editing in crop precision genetic improvement for further study.

  18. From Genetics to Functional Genomics: Improvement in Drought Signaling and Tolerance in Wheat

    PubMed Central

    Budak, Hikmet; Hussain, Babar; Khan, Zaeema; Ozturk, Neslihan Z.; Ullah, Naimat

    2015-01-01

    Drought being a yield limiting factor has become a major threat to international food security. It is a complex trait and drought tolerance response is carried out by various genes, transcription factors (TFs), microRNAs (miRNAs), hormones, proteins, co-factors, ions, and metabolites. This complexity has limited the development of wheat cultivars for drought tolerance by classical breeding. However, attempts have been made to fill the lost genetic diversity by crossing wheat with wild wheat relatives. In recent years, several molecular markers including single nucleotide polymorphisms (SNPs) and quantitative trait loci (QTLs) associated with genes for drought signaling pathways have been reported. Screening of large wheat collections by marker assisted selection (MAS) and transformation of wheat with different genes/TFs has improved drought signaling pathways and tolerance. Several miRNAs also provide drought tolerance to wheat by regulating various TFs/genes. Emergence of OMICS techniques including transcriptomics, proteomics, metabolomics, and ionomics has helped to identify and characterize the genes, proteins, metabolites, and ions involved in drought signaling pathways. Together, all these efforts helped in understanding the complex drought tolerance mechanism. Here, we have reviewed the advances in wide hybridization, MAS, QTL mapping, miRNAs, transgenic technique, genome editing system, and above mentioned functional genomics tools for identification and utility of signaling molecules for improvement in wheat drought tolerance. PMID:26635838

  19. Recent progress on bioinformatics, functional genomics, and metabolomics research of cytochrome P450 and its impact on drug discovery.

    PubMed

    Zhang, Tao; Zhao, Mingzhu; Pang, Yushu; Zhang, Wen; Angela Liu, Limin; Wei, Dong-Qing

    2012-01-01

    The cytochrome P450 superfamily is responsible primarily for human drug metabolism, which is of critical importance for the drug discovery and development. Rapid advancement of bioinformatics, functional genomics and metabolomics has been made over the last decade. These disciplines are essential in target identification, lead discovery and optimization. In this review, we summarize the recent progress on cytochrome P450 and its role on drug metabolism in the context of bioinformatics, functional genomics and metabolomics. Data are integrated into various databases and web-based platforms on cytochrome P450. These research tools and resources are playing an increasingly important role in drug discovery, and are helping in achieving the ultimate goal of personalized medicine, that is, to prescribe personalized drugs according to each person's genetic makeup, metabolic level, and drug disposition.

  20. A Functional Genomic Approach to Chlorinated Ethenes Bioremediation

    NASA Astrophysics Data System (ADS)

    Lee, P. K.; Brodie, E. L.; MacBeth, T. W.; Deeb, R. A.; Sorenson, K. S.; Andersen, G. L.; Alvarez-Cohen, L.

    2007-12-01

    With the recent advances in genomic sciences, a knowledge-based approach can now be taken to optimize the bioremediation of trichloroethene (TCE). During the bioremediation of a heterogeneous subsurface, it is vital to identify and quantify the functionally important microorganisms present, characterize the microbial community and measure their physiological activity. In our field experiments, quantitative PCR (qPCR) was coupled with reverse-transcription (RT) to analyze both copy numbers and transcripts expressed by the 16S rRNA gene and three reductive dehalogenase (RDase) genes as biomarkers of Dehalococcoides spp. in the groundwater of a TCE-DNAPL site at Ft. Lewis (WA) that was serially subjected to biostimulation and bioaugmentation. Genes in the Dehalococcoides genus were targeted as they are the only known organisms that can completely dechlorinate TCE to the innocuous product ethene. Biomarker quantification revealed an overall increase of more than three orders of magnitude in the total Dehalococcoides population and quantification of the more liable and stringently regulated mRNAs confirmed that Dehalococcoides spp. were active. Parallel with our field experiments, laboratory studies were conducted to explore the physiology of Dehalococcoides isolates in order to develop relevant biomarkers that are indicative of the metabolic state of cells. Recently, we verified the function of the nitrogenase operon in Dehalococcoides sp. strain 195 and nitrogenase-encoding genes are ideal biomarker targets to assess cellular nitrogen requirement. To characterize the microbial community, we applied a high-density phylogenetic microarray (16S PhyloChip) that simultaneous monitors over 8,700 unique taxa to track the bacterial and archaeal populations through different phases of treatment. As a measure of species richness, 1,300 to 1,520 taxa were detected in groundwater samples extracted during different stages of treatment as well as in the bioaugmentation culture. We

  1. Computational analysis of conserved coil functional residues in the mitochondrial genomic sequences of dermatophytes

    PubMed Central

    Gupta, Bulbul; Kaur, Jaspreet

    2016-01-01

    Dermatophyte is a group of closely related fungi that have the capacity to invade keratinized tissue of humans and other animals. The infection known as dermatophytosis, caused by members of the genera Microsporum, Trichophyton, and Epidermophyton includes infection to the groin (tinea cruris), beard (tinea barbae), scalp (tinea capitis), feet (tinea pedis), glabrous skin (tinea corporis), nail (tinea unguium), and hand (tinea manuum). The identification of evolutionary relationship between these three genera of dermatophyte is epidemiologically important to understand their pathogenicity. Mitochondrial DNA evolves more rapidly than a nuclear DNA due to higher rate of mutation but is very less affected by genetic recombination, making it an important tool for phylogenetic studies. Thus, here we present a novel scheme to identify the conserved coil functional residues of Trichophyton rubrum, Trichophyton mentagrophytes, Epidermophyton floccosum and Microsporum canis. Protein coding sequences of the mitochondrial genome were aligned for their similar sequences and homology modelling was performed for structure and pocket identification. The results obtained from comparative analysis of the protein sequences revealed the presence of functionally active sites in all the species of the genera Trichophyton and Microsporum. However in Epidermophyton floccosum it was observed in three protein sequences of the five studied. The absence of these conserved coil functional residues in E. floccusum may be correlated with lesser infectivity of this organism. The functional residues identified in the present study could be responsible for the disease and thus can act as putative target sites for drug designing. PMID:28149055

  2. Mercator: a fast and simple web server for genome scale functional annotation of plant sequence data.

    PubMed

    Lohse, Marc; Nagel, Axel; Herter, Thomas; May, Patrick; Schroda, Michael; Zrenner, Rita; Tohge, Takayuki; Fernie, Alisdair R; Stitt, Mark; Usadel, Björn

    2014-05-01

    Next-generation technologies generate an overwhelming amount of gene sequence data. Efficient annotation tools are required to make these data amenable to functional genomics analyses. The Mercator pipeline automatically assigns functional terms to protein or nucleotide sequences. It uses the MapMan 'BIN' ontology, which is tailored for functional annotation of plant 'omics' data. The classification procedure performs parallel sequence searches against reference databases, compiles the results and computes the most likely MapMan BINs for each query. In the current version, the pipeline relies on manually curated reference classifications originating from the three reference organisms (Arabidopsis, Chlamydomonas, rice), various other plant species that have a reviewed SwissProt annotation, and more than 2000 protein domain and family profiles at InterPro, CDD and KOG. Functional annotations predicted by Mercator achieve accuracies above 90% when benchmarked against manual annotation. In addition to mapping files for direct use in the visualization software MapMan, Mercator provides graphical overview charts, detailed annotation information in a convenient web browser interface and a MapMan-to-GO translation table to export results as GO terms. Mercator is available free of charge via http://mapman.gabipd.org/web/guest/app/Mercator.

  3. Functional and Genomic Features of Human Genes Mutated in Neuropsychiatric Disorders

    PubMed Central

    Forero, Diego A.; Prada, Carlos F.; Perry, George

    2016-01-01

    Background: In recent years, a large number of studies around the world have led to the identification of causal genes for hereditary types of common and rare neurological and psychiatric disorders. Objective: To explore the functional and genomic features of known human genes mutated in neuropsychiatric disorders. Methods: A systematic search was used to develop a comprehensive catalog of genes mutated in neuropsychiatric disorders (NPD). Functional enrichment and protein-protein interaction analyses were carried out. A false discovery rate approach was used for correction for multiple testing. Results: We found several functional categories that are enriched among NPD genes, such as gene ontologies, protein domains, tissue expression, signaling pathways and regulation by brain-expressed miRNAs and transcription factors. Sixty six of those NPD genes are known to be druggable. Several topographic parameters of protein-protein interaction networks and the degree of conservation between orthologous genes were identified as significant among NPD genes. Conclusion: These results represent one of the first analyses of enrichment of functional categories of genes known to harbor mutations for NPD. These findings could be useful for a future creation of computational tools for prioritization of novel candidate genes for NPD. PMID:27990183

  4. Chromatin Insulators: Linking genome organization to cellular function

    PubMed Central

    Phillips-Cremins, Jennifer E.; Corces, Victor G.

    2013-01-01

    A growing body of evidence suggests that insulators have a primary role in orchestrating the topological arrangement of higher-order chromatin architecture. Insulator-mediated long-range interactions can influence the epigenetic status of the genome and, in certain contexts, may have important effects on gene expression. Here we discuss higher-order chromatin organization as a unifying mechanism for diverse insulator actions across the genome. PMID:23706817

  5. Rapid tooling for functional prototyping of metal mold processes: Literature review on cast tooling

    SciTech Connect

    Baldwin, M.D.; Hochanadel, P.W.

    1995-11-01

    This report is a literature review on cast tooling with the general focus on AISI H13 tool steel. The review includes processing of both wrought and cast H13 steel along with the accompanying microstructures. Also included is the incorporation of new rapid prototyping technologies, such as Stereolithography and Selective Laser Sintering, into the investment casting of tool steel. The limiting property of using wrought or cast tool steel for die casting is heat checking. Heat checking is addressed in terms of testing procedures, theories regarding the mechanism, and microstructural aspects related to the cracking.

  6. Metabolic classification of microbial genomes using functional probes

    PubMed Central

    2012-01-01

    Background Microorganisms able to grow under artificial culture conditions comprise only a small proportion of the biosphere's total microbial community. Until recently, scientists have been unable to perform thorough analyses of difficult-to-culture microorganisms due to limitations in sequencing technology. As modern techniques have dramatically increased sequencing rates and rapidly expanded the number of sequenced genomes, in addition to traditional taxonomic classifications which focus on the evolutionary relationships of organisms, classifications of the genomes based on alternative points of view may help advance our understanding of the delicate relationships of organisms. Results We have developed a proteome-based method for classifying microbial species. This classification method uses a set of probes comprising short, highly conserved amino acid sequences. For each genome, in silico translation is performed to obtained its proteome, based on which a probe-set frequency pattern is generated. Then, the probe-set frequency patterns are used to cluster the proteomes/genomes. Conclusions Features of the proposed method include a high running speed in challenge of a large number of genomes, and high applicability for classifying organisms with incomplete genome sequences. Moreover, the probe-set clustering method is sensitive to the metabolic phenotypic similarities/differences among species and is thus supposed potential for the classification or differentiation of closely-related organisms. PMID:22537274

  7. Comparative Genome Analysis in the Integrated Microbial Genomes(IMG) System

    SciTech Connect

    Kyrpides, Nikos C.; Markowitz, Victor M.

    2006-03-01

    Comparative genome analysis is critical for the effectiveexploration of a rapidly growing number of complete and draft sequencesfor microbial genomes. The Integrated Microbial Genomes (IMG) system(img.jgi.doe.gov) has been developed as a community resource thatprovides support for comparative analysis of microbial genomes in anintegrated context. IMG allows users to navigate the multidimensionalmicrobial genome data space and focus their analysis on a subset ofgenes, genomes, and functions of interest. IMG provides graphicalviewers, summaries and occurrence profile tools for comparing genes,pathways and functions (terms) across specific genomes. Genes can befurther examined using gene neighborhoods and compared with sequencealignment tools.

  8. GORpipe: a query tool for working with sequence data based on a Genomic Ordered Relational (GOR) architecture

    PubMed Central

    Guðbjartsson, Hákon; Georgsson, Guðmundur Fr.; Guðjónsson, Sigurjón A.; Valdimarsson, Ragnar þór; Sigurðsson, Jóhann H.; Stefánsson, Sigmar K.; Másson, Gísli; Magnússon, Gísli; Pálmason, Vilmundur; Stefánsson, Kári

    2016-01-01

    Motivation: Our aim was to create a general-purpose relational data format and analysis tools to provide an efficient and coherent framework for working with large volumes of DNA sequence data. Results: For this purpose we developed the GORpipe software system. It is based on a genomic ordered architecture and uses a declarative query language that combines features from SQL and shell pipe syntax in a novel manner. The system can for instance be used to annotate sequence variants, find genomic spatial overlap between various types of genomic features, filter and aggregate them in various ways. Availability and Implementation: The GORpipe software is freely available for non-commercial academic usage and can be downloaded from www.nextcode.com/gorpipe. Contact: hakon@wuxinextcode.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27339714

  9. The potential of comparative genomic hybridization as a tool in the differential diagnosis of matrix-producing bone lesions.

    PubMed

    Gebert, Carsten; Brinkschmidt, Christian; Bielack, Stefan; Bernhardt, Thomas; Jürgens, Heribert; Böcker, Werner; Winkelmann, Winfried; Bürger, Horst; Gosheger, Georg

    2006-07-01

    Matrix-producing bone lesions consist of a wide variety of benign and malignant conditions. With respect to morphology, an overlap exists between benign and malignant bone tumors that causes difficulties in the final determination of the tumor. This study was conducted to show the potential of comparative genomic hybridization as a tool in the differential diagnosis of matrix-producing bone lesions. Thirty benign bone tumors were evaluated by conventional comparative genomic hybridization. To test its diagnostic reliability, 5 additional cases were analyzed, all with differential diagnostic difficulties related to morphology and radiology. All were ultimately diagnosed as malignant sarcomas, and unbalanced alterations were detected. In contrast benign tumors or tumor-like lesions did not reveal any chromosomal alterations. Comparative genomic hybridization is a useful adjunct in the complicated differential diagnostic algorithms of matrix-producing bone tumors.

  10. ChroMoS: an integrated web tool for SNP classification, prioritization and functional interpretation

    PubMed Central

    Barenboim, Maxim; Manke, Thomas

    2013-01-01

    Summary: Genome-wide association studies and re-sequencing projects are revealing an increasing number of disease-associated SNPs, a large fraction of which are non-coding. Although they could have relevance for disease susceptibility and progression, the lack of information about regulatory regions impedes the assessment of their functionality. Here we present a web server, ChroMoS (Chromatin Modified SNPs), which combines genetic and epigenetic data with the goal of facilitating SNPs' classification, prioritization and prediction of their functional consequences. ChroMoS uses a large database of SNPs and chromatin states, but allows a user to provide his/her own genetic information. Based on the SNP classification and interactive prioritization, a user can compute the functional impact of multiple SNPs using two prediction tools, one for differential analysis of transcription factor binding (sTRAP) and another for SNPs with potential impact on binding of miRNAs (MicroSNiPer). Availability: Web server, ChroMoS, is freely available at http://epicenter.immunbio.mpg.de/services/chromos. Contact: barenboim@ie-freiburg.mpg.de or manke@ie-freiburg.mpg.de PMID:23782616

  11. MetaSpark: a spark-based distributed processing tool to recruit metagenomic reads to reference genomes.

    PubMed

    Zhou, Wei; Li, Ruilin; Yuan, Shuo; Liu, ChangChun; Yao, Shaowen; Luo, Jing; Niu, Beifang

    2017-01-08

    With the advent of next-generation sequencing, traditional bioinformatics tools are challenged by massive raw metagenomic datasets. One of the bottlenecks of metagenomic studies is lack of large-scale and cloud computing suitable data analysis tools. In this paper, we proposed a Spark -: based tool, called MetaSpark, to recruit metagenomic reads to reference genomes. MetaSpark benefits from the distributed data set (RDD) of Spark, which makes it able to cache data set in memory across cluster nodes and scale well with the datasets. Compared with previous metagenomics recruitment tools, MetaSpark recruited significantly more reads than many programs such as SOAP2, BWA and LAST and increased recruited reads by ∼4% compared with FR-HIT when there were 1 million reads and 0.75 GB references. Different test cases demonstrate MetaSpark's scalability and overall high performance.

  12. Tool use in left brain damage and Alzheimer's disease: What about function and manipulation knowledge?

    PubMed

    Jarry, Christophe; Osiurak, François; Besnard, Jérémy; Baumard, Josselin; Lesourd, Mathieu; Croisile, Bernard; Etcharry-Bouyx, Frédérique; Chauviré, Valérie; Le Gall, Didier

    2016-03-01

    Tool use disorders are usually associated with difficulties in retrieving function and manipulation knowledge. Here, we investigate tool use (Real Tool Use, RTU), function (Functional Association, FA) and manipulation knowledge (Gesture Recognition, GR) in 17 left-brain-damaged (LBD) patients and 14 AD patients (Alzheimer disease). LBD group exhibited predicted deficit on RTU but not on FA and GR while AD patients showed deficits on GR and FA with preserved tool use skills. These findings question the role played by function and manipulation knowledge in actual tool use.

  13. CyMATE: a new tool for methylation analysis of plant genomic DNA after bisulphite sequencing.

    PubMed

    Hetzl, Jennifer; Foerster, Andrea M; Raidl, Günther; Mittelsten Scheid, Ortrun

    2007-08-01

    Cytosine methylation is a hallmark of epigenetic information in the DNA of many fungi, vertebrates and plants. The technique of bisulphite genomic sequencing reveals the methylation state of every individual cytosine in a sequence, and thereby provides high-resolution data on epigenetic diversity; however, the manual evaluation and documentation of large amounts of data is laborious and error-prone. While some software is available for facilitating the analysis of mammalian DNA methylation, which is found nearly exclusively at CG sites, there is no software optimally suited for data from DNA with significant non-CG methylation. We describe CyMATE (Cytosine Methylation Analysis Tool for Everyone) for in silico analysis of DNA sequences after bisulphite conversion of plant DNA, in which methylation is more divergent with respect to sequence context and biological relevance. From aligned sequences, CyMATE includes and distinguishes methylation at CG, CHG and CHH (where H = A, C or T), and can extract both quantitative and qualitative data regarding general and pattern-specific methylation per sequence and per position, i.e. data for individual sites in a sequence and the epigenetic divergence within a sample. In addition, it can provide graphical output from alignments in either an overview or a 'zoom-in' view as pdf files. Detailed information, including a quality control of the sequencing data, is provided in text format. We applied CyMATE to the analysis of DNA methylation at transcriptionally silenced promoters in diploid and polyploid Arabidopsis and found significant hypermethylation, high stability of the methylated state independent of chromosome number, and non-redundant patterns of mC distribution. CyMATE is freely available for non-commercial use at http://www.gmi.oeaw.ac.at/CyMATE.

  14. Screen and clean: a tool for identifying interactions in genome-wide association studies.

    PubMed

    Wu, Jing; Devlin, Bernie; Ringquist, Steven; Trucco, Massimo; Roeder, Kathryn

    2010-04-01

    Epistasis could be an important source of risk for disease. How interacting loci might be discovered is an open question for genome-wide association studies (GWAS). Most researchers limit their statistical analyses to testing individual pairwise interactions (i.e., marginal tests for association). A more effective means of identifying important predictors is to fit models that include many predictors simultaneously (i.e., higher-dimensional models). We explore a procedure called screen and clean (SC) for identifying liability loci, including interactions, by using the lasso procedure, which is a model selection tool for high-dimensional regression. We approach the problem by using a varying dictionary consisting of terms to include in the model. In the first step the lasso dictionary includes only main effects. The most promising single-nucleotide polymorphisms (SNPs) are identified using a screening procedure. Next the lasso dictionary is adjusted to include these main effects and the corresponding interaction terms. Again, promising terms are identified using lasso screening. Then significant terms are identified through the cleaning process. Implementation of SC for GWAS requires algorithms to explore the complex model space induced by the many SNPs genotyped and their interactions. We propose and explore a set of algorithms and find that SC successfully controls Type I error while yielding good power to identify risk loci and their interactions. When the method is applied to data obtained from the Wellcome Trust Case Control Consortium study of Type 1 Diabetes it uncovers evidence supporting interaction within the HLA class II region as well as within Chromosome 12q24.

  15. Screen and Clean: a tool for identifying interactions in genome-wide association studies

    PubMed Central

    Wu, Jing; Devlin, Bernie; Ringquist, Steven; Trucco, Massimo; Roeder, Kathryn

    2010-01-01

    Epistasis could be an important source of risk for disease. How interacting loci might be discovered is an open question for genome-wide association studies (GWAS). Most researchers limit their statistical analyses to testing individual pairwise interactions (i.e., marginal tests for association). A more effective means of identifying important predictors is to fit models that include many predictors simultaneously (i.e., higher dimensional models). We explore a procedure called screen and clean (SC) for identifying liability loci, including interactions, by using the lasso procedure, which is a model selection tool for high dimensional regression. We approach the problem by using a varying dictionary consisting of terms to include in the model. In the first step the lasso dictionary includes only main effects. The most promising SNPs are identified using a screening procedure. Next the lasso dictionary is adjusted to include these main effects and the corresponding interaction terms. Again, promising terms are identified using lasso screening. Then significant terms are identified through the cleaning process. Implementation of SC for GWAS requires algorithms to explore the complex model space induced by the many SNPs genotyped and their interactions. We propose and explore a set of algorithms and find that SC successfully controls Type I error while yielding good power to identify risk loci and their interactions. When the method is applied to data obtained from the Wellcome Trust Case Control Consortium study of Type 1 Diabetes it uncovers evidence supporting interaction within the HLA class II region as well as within Chromosome 12q24. PMID:20088021

  16. Comparative and functional genomics of lipases in holometabolous insects.

    PubMed

    Horne, Irene; Haritos, Victoria S; Oakeshott, John G

    2009-08-01

    Lipases have key roles in insect lipid acquisition, storage and mobilisation and are also fundamental to many physiological processes underpinning insect reproduction, development, defence from pathogens and oxidative stress, and pheromone signalling. We have screened the recently sequenced genomes of five species from four orders of holometabolous insects, the dipterans Drosophila melanogaster and Anopheles gambiae, the hymenopteran Apis mellifera, the moth Bombyx mori and the beetle Tribolium castaneum, for the six major lipase families that are also found in other organisms. The two most numerous families in the insects, the neutral and acid lipases, are also the main families in mammals, albeit not in Caenorhabditis elegans, plants or microbes. Total numbers of the lipases vary two-fold across the five insect species, from numbers similar to those in mammals up to numbers comparable to those seen in C. elegans. Whilst there is a high degree of orthology with mammalian lipases in the other four families, the great majority of the insect neutral and acid lipases have arisen since the insect orders themselves diverged. Intriguingly, about 10% of the insect neutral and acid lipases have lost motifs critical for catalytic function. Examination of the length of lid and loop regions of the neutral lipase sequences suggest that most of the insect lipases lack triacylglycerol (TAG) hydrolysis activity, although the acid lipases all have intact cap domains required for TAG hydrolysis. We have also reviewed the sequence databases and scientific literature for insights into the expression profiles and functions of the insect neutral and acid lipases and the orthologues of the mammalian adipose triglyceride lipase which has a pivotal role in lipid mobilisation. These data suggest that some of the acid and neutral lipase diversity may be due to a requirement for rapid accumulation of dietary lipids. The different roles required of lipases at the four discrete life stages of

  17. Genome-scale functional profiling of the mammalian AP-1 signaling pathway

    PubMed Central

    Chanda, Sumit K.; White, Suhaila; Orth, Anthony P.; Reisdorph, Richard; Miraglia, Loren; Thomas, Russell S.; DeJesus, Paul; Mason, Daniel E.; Huang, Qihong; Vega, Raquel; Yu, De-Hua; Nelson, Christian G.; Smith, Brendan M.; Terry, Robert; Linford, Alicia S.; Yu, Yang; Chirn, Gung-wei; Song, Chuanzheng; Labow, Mark A.; Cohen, Dalia; King, Frederick J.; Peters, Eric C.; Schultz, Peter G.; Vogt, Peter K.; Hogenesch, John B.; Caldwell, Jeremy S.

    2003-01-01

    Large-scale functional genomics approaches are fundamental to the characterization of mammalian transcriptomes annotated by genome sequencing projects. Although current high-throughput strategies systematically survey either transcriptional or biochemical networks, analogous genome-scale investigations that analyze gene function in mammalian cells have yet to be fully realized. Through transient overexpression analysis, we describe the parallel interrogation of ≈20,000 sequence annotated genes in cancer-related signaling pathways. For experimental validation of these genome data, we apply an integrative strategy to characterize previously unreported effectors of activator protein-1 (AP-1) mediated growth and mitogenic response pathways. These studies identify the ADP-ribosylation factor GTPase-activating protein Centaurin α1 and a Tudor domain-containing hypothetical protein as putative AP-1 regulatory oncogenes. These results provide insight into the composition of the AP-1 signaling machinery and validate this approach as a tractable platform for genome-wide functional analysis. PMID:14514886

  18. Functional genomics of physiological plasticity and local adaptation in killifish.

    PubMed

    Whitehead, Andrew; Galvez, Fernando; Zhang, Shujun; Williams, Larissa M; Oleksiak, Marjorie F

    2011-01-01

    Evolutionary solutions to the physiological challenges of life in highly variable habitats can span the continuum from evolution of a cosmopolitan plastic phenotype to the evolution of locally adapted phenotypes. Killifish (Fundulus sp.) have evolved both highly plastic and locally adapted phenotypes within different selective contexts, providing a comparative system in which to explore the genomic underpinnings of physiological plasticity and adaptive variation. Importantly, extensive variation exists among populations and species for tolerance to a variety of stressors, and we exploit this variation in comparative studies to yield insights into the genomic basis of evolved phenotypic variation. Notably, species of Fundulus occupy the continuum of osmotic habitats from freshwater to marine and populations within Fundulus heteroclitus span far greater variation in pollution tolerance than across all species of fish. Here, we explore how transcriptome regulation underpins extreme physiological plasticity on osmotic shock and how genomic and transcriptomic variation is associated with locally evolved pollution tolerance. We show that F. heteroclitus quickly acclimate to extreme osmotic shock by mounting a dramatic rapid transcriptomic response including an early crisis control phase followed by a tissue remodeling phase involving many regulatory pathways. We also show that convergent evolution of locally adapted pollution tolerance involves complex patterns of gene expression and genome sequence variation, which is confounded with body-weight dependence for some genes. Similarly, exploiting the natural phenotypic variation associated with other established and emerging model organisms is likely to greatly accelerate the pace of discovery of the genomic basis of phenotypic variation.

  19. Observing functional actions affects semantic processing of tools: evidence of a motor-to-semantic priming.

    PubMed

    De Bellis, Francesco; Ferrara, Antonia; Errico, Domenico; Panico, Francesco; Sagliano, Laura; Conson, Massimiliano; Trojano, Luigi

    2016-01-01

    Recent evidence shows that activation of motor information can favor identification of related tools, thus suggesting a strict link between motor and conceptual knowledge in cognitive representation of tools. However, the involvement of motor information in further semantic processing has not been elucidated. In three experiments, we aimed to ascertain whether motor information provided by observation of actions could affect processing of conceptual knowledge about tools. In Experiment 1, healthy participants judged whether pairs of tools evoking different functional handgrips had the same function. In Experiment 2 participants judged whether tools were paired with appropriate recipients. Finally, in Experiment 3 we again required functional judgments as in Experiment 1, but also included in the set of stimuli pairs of objects having different function and similar functional handgrips. In all experiments, pictures displaying either functional grasping (aimed to use tools) or structural grasping (just aimed to move tools independently from their use) were presented before each stimulus pair. The results demonstrated that, in comparison with structural grasping, observing functional grasping facilitates judgments about tools' function when objects did not imply the same functional manipulation (Experiment 1), whereas worsened such judgments when objects shared functional grasp (Experiment 3). Instead, action observation did not affect judgments concerning tool-recipient associations (Experiment 2). Our findings support a task-dependent influence of motor information on high-order conceptual tasks and provide further insights into how motor and conceptual processing about tools can interact.

  20. GO-FAANG meeting: A gathering on functional annotation of animal genomes

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The FAANG (Functional Annotation of Animal Genomes) Consortium recently held a Gathering On FAANG (GO-FAANG) Workshop in Washington, DC on October 7-8, 2015. This consortium is a grass-roots organization formed to advance the annotation of newly assembled genomes of non-model organisms (www.faang.or...

  1. Advances in functional genomics for investigating salinity stress tolerance mechanisms in cereals

    PubMed Central

    Shelden, Megan C.; Roessner, Ute

    2013-01-01

    Abiotic stresses such as low water availability and high salinity are major causes of cereal crop yield losses and significantly impact on sustainability. Wheat and barley are two of the most important cereal crops (after maize and rice) and are grown in increasingly hostile environments with soil salinity and drought both expected to increase this century, reducing the availability of arable land. Barley and wheat are classified as glycophytes (salt-sensitive), yet they are more salt-tolerant than other cereal crops such as rice and so are good models for studying salt tolerance in cereals. The exploitation of genetic variation of phenotypic traits through plant breeding could significantly improve growth of cereals in salinity-affected regions, thus leading to improved crop yields. Genetic variation in phenotypic traits for abiotic stress tolerance have been identified in land races and wild germplasm but the molecular basis of these differences is often difficult to determine due to the complex genetic nature of these species. High-throughput functional genomics technologies, such as transcriptomics, metabolomics, proteomics, and ionomics are powerful tools for investigating the molecular responses of plants to abiotic stress. The advancement of these technologies has allowed for the identification and quantification of transcript/metabolites in specific cell types and/or tissues. Using these new technologies on plants will provide a powerful tool to uncovering genetic traits in more complex species such as wheat and barley and provide novel insights into the molecular mechanisms of salinity stress tolerance. PMID:23717314

  2. The plant ontology as a tool for comparative plant anatomy and genomic analyses

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Plant science is now a major player in the fields of genomics, gene expression analysis, phenomics and metabolomics. Recent advances in sequencing technologies have led to a windfall of data, with new species being added rapidly to the list of species whose genomes have been decoded. The Plant Ontol...

  3. solGS: a web-based tool for genomic selection

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genomic selection (GS) promises to improve accuracy in estimating breeding values and genetic gain for quantitative traits compared to traditional breeding methods. Its reliance on high-throughput genome-wide markers and statistical complexity, however, is a serious challenge in data management, ana...

  4. Genome-Wide Association and Functional Follow-Up Reveals New Loci for Kidney Function

    PubMed Central

    Fuchsberger, Christian; Olden, Matthias; Chen, Ming-Huei; Tin, Adrienne; Taliun, Daniel; Li, Man; Gao, Xiaoyi; Gorski, Mathias; Yang, Qiong; Hundertmark, Claudia; Foster, Meredith C.; O'Seaghdha, Conall M.; Glazer, Nicole; Isaacs, Aaron; Liu, Ching-Ti; Smith, Albert V.; O'Connell, Jeffrey R.; Struchalin, Maksim; Tanaka, Toshiko; Li, Guo; Johnson, Andrew D.; Gierman, Hinco J.; Feitosa, Mary; Hwang, Shih-Jen; Atkinson, Elizabeth J.; Lohman, Kurt; Cornelis, Marilyn C.; Johansson, Åsa; Tönjes, Anke; Dehghan, Abbas; Chouraki, Vincent; Holliday, Elizabeth G.; Sorice, Rossella; Kutalik, Zoltan; Lehtimäki, Terho; Esko, Tõnu; Deshmukh, Harshal; Ulivi, Sheila; Chu, Audrey Y.; Murgia, Federico; Trompet, Stella; Imboden, Medea; Kollerits, Barbara; Pistis, Giorgio; Harris, Tamara B.; Launer, Lenore J.; Aspelund, Thor; Eiriksdottir, Gudny; Mitchell, Braxton D.; Boerwinkle, Eric; Schmidt, Helena; Cavalieri, Margherita; Rao, Madhumathi; Hu, Frank B.; Demirkan, Ayse; Oostra, Ben A.; de Andrade, Mariza; Turner, Stephen T.; Ding, Jingzhong; Andrews, Jeanette S.; Freedman, Barry I.; Koenig, Wolfgang; Illig, Thomas; Döring, Angela; Wichmann, H.-Erich; Kolcic, Ivana; Zemunik, Tatijana; Boban, Mladen; Minelli, Cosetta; Wheeler, Heather E.; Igl, Wilmar; Zaboli, Ghazal; Wild, Sarah H.; Wright, Alan F.; Campbell, Harry; Ellinghaus, David; Nöthlings, Ute; Jacobs, Gunnar; Biffar, Reiner; Endlich, Karlhans; Ernst, Florian; Homuth, Georg; Kroemer, Heyo K.; Nauck, Matthias; Stracke, Sylvia; Völker, Uwe; Völzke, Henry; Kovacs, Peter; Stumvoll, Michael; Mägi, Reedik; Hofman, Albert; Uitterlinden, Andre G.; Rivadeneira, Fernando; Aulchenko, Yurii S.; Polasek, Ozren; Hastie, Nick; Vitart, Veronique; Helmer, Catherine; Wang, Jie Jin; Ruggiero, Daniela; Bergmann, Sven; Kähönen, Mika; Viikari, Jorma; Nikopensius, Tiit; Province, Michael; Ketkar, Shamika; Colhoun, Helen; Doney, Alex; Robino, Antonietta; Giulianini, Franco; Krämer, Bernhard K.; Portas, Laura; Ford, Ian; Buckley, Brendan M.; Adam, Martin; Thun, Gian-Andri; Paulweber, Bernhard; Haun, Margot; Sala, Cinzia; Metzger, Marie; Mitchell, Paul; Ciullo, Marina; Kim, Stuart K.; Vollenweider, Peter; Raitakari, Olli; Metspalu, Andres; Palmer, Colin; Gasparini, Paolo; Pirastu, Mario; Jukema, J. Wouter; Probst-Hensch, Nicole M.; Kronenberg, Florian; Toniolo, Daniela; Gudnason, Vilmundur; Shuldiner, Alan R.; Coresh, Josef; Schmidt, Reinhold; Ferrucci, Luigi; Siscovick, David S.; van Duijn, Cornelia M.; Borecki, Ingrid; Kardia, Sharon L. R.; Liu, Yongmei; Curhan, Gary C.; Rudan, Igor; Gyllensten, Ulf; Wilson, James F.; Franke, Andre; Pramstaller, Peter P.; Rettig, Rainer; Prokopenko, Inga; Witteman, Jacqueline C. M.; Hayward, Caroline; Ridker, Paul; Parsa, Afshin; Bochud, Murielle; Heid, Iris M.; Goessling, Wolfram; Chasman, Daniel I.; Kao, W. H. Linda; Fox, Caroline S.

    2012-01-01

    Chronic kidney disease (CKD) is an important public health problem with a genetic component. We performed genome-wide association studies in up to 130,600 European ancestry participants overall, and stratified for key CKD risk factors. We uncovered 6 new loci in association with estimated glomerular filtration rate (eGFR), the primary clinical measure of CKD, in or near MPPED2, DDX1, SLC47A1, CDK12, CASP9, and INO80. Morpholino knockdown of mpped2 and casp9 in zebrafish embryos revealed podocyte and tubular abnormalities with altered dextran clearance, suggesting a role for these genes in renal function. By providing new insights into genes that regulate renal function, these results could further our understanding of the pathogenesis of CKD. PMID:22479191

  5. R loops: new modulators of genome dynamics and function.

    PubMed

    Santos-Pereira, José M; Aguilera, Andrés

    2015-10-01

    R loops are nucleic acid structures composed of an RNA-DNA hybrid and a displaced single-stranded DNA. Recently, evidence has emerged that R loops occur more often in the genome and have greater physiological relevance, including roles in transcription and chromatin structure, than was previously predicted. Importantly, however, R loops are also a major threat to genome stability. For this reason, several DNA and RNA metabolism factors prevent R-loop formation in cells. Dysfunction of these factors causes R-loop accumulation, which leads to replication stress, genome instability, chromatin alterations or gene silencing, phenomena that are frequently associated with cancer and a number of genetic diseases. We review the current knowledge of the mechanisms controlling R loops and their putative relationship with disease.

  6. Parallel Continuous Flow: A Parallel Suffix Tree Construction Tool for Whole Genomes

    PubMed Central

    Farreras, Montse

    2014-01-01

    Abstract The construction of suffix trees for very long sequences is essential for many applications, and it plays a central role in the bioinformatic domain. With the advent of modern sequencing technologies, biological sequence databases have grown dramatically. Also the methodologies required to analyze these data have become more complex everyday, requiring fast queries to multiple genomes. In this article, we present parallel continuous flow (PCF), a parallel suffix tree construction method that is suitable for very long genomes. We tested our method for the suffix tree construction of the entire human genome, about 3GB. We showed that PCF can scale gracefully as the size of the input genome grows. Our method can work with an efficiency of 90% with 36 processors and 55% with 172 processors. We can index the human genome in 7 minutes using 172 processes. PMID:24597675

  7. Parallel continuous flow: a parallel suffix tree construction tool for whole genomes.

    PubMed

    Comin, Matteo; Farreras, Montse

    2014-04-01

    The construction of suffix trees for very long sequences is essential for many applications, and it plays a central role in the bioinformatic domain. With the advent of modern sequencing technologies, biological sequence databases have grown dramatically. Also the methodologies required to analyze these data have become more complex everyday, requiring fast queries to multiple genomes. In this article, we present parallel continuous flow (PCF), a parallel suffix tree construction method that is suitable for very long genomes. We tested our method for the suffix tree construction of the entire human genome, about 3GB. We showed that PCF can scale gracefully as the size of the input genome grows. Our method can work with an efficiency of 90% with 36 processors and 55% with 172 processors. We can index the human genome in 7 minutes using 172 processes.

  8. Whole Genome Sequence Analysis Using JSpecies Tool Establishes Clonal Relationships between Listeria monocytogenes Strains from Epidemiologically Unrelated Listeriosis Outbreaks

    PubMed Central

    Burall, Laurel S.; Grim, Christopher J.; Mammel, Mark K.; Datta, Atin R.

    2016-01-01

    In an effort to build a comprehensive genomic approach to food safety challenges, the FDA has implemented a whole genome sequencing effort, GenomeTrakr, which involves the sequencing and analysis of genomes of foodborne pathogens. As a part of this effort, we routinely sequence whole genomes of Listeria monocytogenes (Lm) isolates associated with human listeriosis outbreaks, as well as those isolated through other sources. To rapidly establish genetic relatedness of these genomes, we evaluated tetranucleotide frequency analysis via the JSpecies program to provide a cursory analysis of strain relatedness. The JSpecies tetranucleotide (tetra) analysis plots standardized (z-score) tetramer word frequencies of two strains against each other and uses linear regression analysis to determine similarity (r2). This tool was able to validate the close relationships between outbreak related strains from four different outbreaks. Included in this study was the analysis of Lm strains isolated during the recent caramel apple outbreak and stone fruit incident in 2014. We identified that many of the isolates from these two outbreaks shared a common 4b variant (4bV) serotype, also designated as IVb-v1, using a qPCR protocol developed in our laboratory. The 4bV serotype is characterized by the presence of a 6.3 Kb DNA segment normally found in serotype 1/2a, 3a, 1/2c and 3c strains but not in serotype 4b or 1/2b strains. We decided to compare these strains at a genomic level using the JSpecies Tetra tool. Specifically, we compared several 4bV and 4b isolates and identified a high level of similarity between the stone fruit and apple 4bV strains, but not the 4b strains co-identified in the caramel apple outbreak or other 4b or 4bV strains in our collection. This finding was further substantiated by a SNP-based analysis. Additionally, we were able to identify close relatedness between isolates from clinical cases from 1993–1994 and a single case from 2011 as well as links between

  9. RELATIONSHIP BETWEEN PHYLOGENETIC DISTRIBUTION AND GENOMIC FEATURES IN NEUROSPORA CRASSA

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In the post-genome era, insufficient functional annotation of predicted genes greatly restricts the potential of mining genome data. We demonstrate that an evolutionary approach, which is independent of functional annotation, has great potential as a tool for genome analysis. We chose the genome o...

  10. An Integrated Tool for Functional Requirements Analysis, Function Allocation and Task Analysis

    SciTech Connect

    Hugo, Jacques

    2015-03-02

    The software application is called “HFE-Trace”. This is an integrated method and tool for the management of Human Factors Engineering analyses and related data. Its primary purpose is to support the coherent and consistent application of the nuclear industry’s best practices for human factors engineering work. The software is a custom M