Science.gov

Sample records for database oriented variant

  1. Analysing 454 amplicon resequencing experiments using the modular and database oriented Variant Identification Pipeline

    PubMed Central

    2010-01-01

    Background Next-generation amplicon sequencing enables high-throughput genetic diagnostics, sequencing multiple genes in several patients together in one sequencing run. Currently, no open-source out-of-the-box software solution exists that reliably reports detected genetic variations and that can be used to improve future sequencing effectiveness by analyzing the PCR reactions. Results We developed an integrated database oriented software pipeline for analysis of 454/Roche GS-FLX amplicon resequencing experiments using Perl and a relational database. The pipeline enables variation detection, variation detection validation, and advanced data analysis, which provides information that can be used to optimize PCR efficiency using traditional means. The modular approach enables customization of the pipeline where needed and allows researchers to adopt their analysis pipeline to their experiments. Clear documentation and training data is available to test and validate the pipeline prior to using it on real sequencing data. Conclusions We designed an open-source database oriented pipeline that enables advanced analysis of 454/Roche GS-FLX amplicon resequencing experiments using SQL-statements. This modular database approach allows easy coupling with other pipeline modules such as variant interpretation or a LIMS system. There is also a set of standard reporting scripts available. PMID:20487544

  2. Developing a DNA variant database.

    PubMed

    Fung, David C Y

    2008-01-01

    Disease- and locus-specific variant databases have been a valuable resource to clinical and research geneticists. With the recent rapid developments in technologies, the number of DNA variants detected in a typical molecular genetics laboratory easily exceeds 1,000. To keep track of the growing inventory of DNA variants, many laboratories employ information technology to store the data as well as distributing the data and its associated information to clinicians and researchers via the Web. While it is a valuable resource, the hosting of a web-accessible database requires collaboration between bioinformaticians and biologists and careful planning to ensure its usability and availability. In this chapter, a series of tutorials on building a local DNA variant database out of a sample dataset will be provided. However, this tutorial will not include programming details on building a web interface and on constructing the web application necessary for web hosting. Instead, an introduction to the two commonly used methods for hosting web-accessible variant databases will be described. Apart from the tutorials, this chapter will also consider the resources and planning required for making a variant database project successful. PMID:18453092

  3. The Saccharomyces Genome Database Variant Viewer.

    PubMed

    Sheppard, Travis K; Hitz, Benjamin C; Engel, Stacia R; Song, Giltae; Balakrishnan, Rama; Binkley, Gail; Costanzo, Maria C; Dalusag, Kyla S; Demeter, Janos; Hellerstedt, Sage T; Karra, Kalpana; Nash, Robert S; Paskov, Kelley M; Skrzypek, Marek S; Weng, Shuai; Wong, Edith D; Cherry, J Michael

    2016-01-01

    The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org) is the authoritative community resource for the Saccharomyces cerevisiae reference genome sequence and its annotation. In recent years, we have moved toward increased representation of sequence variation and allelic differences within S. cerevisiae. The publication of numerous additional genomes has motivated the creation of new tools for their annotation and analysis. Here we present the Variant Viewer: a dynamic open-source web application for the visualization of genomic and proteomic differences. Multiple sequence alignments have been constructed across high quality genome sequences from 11 different S. cerevisiae strains and stored in the SGD. The alignments and summaries are encoded in JSON and used to create a two-tiered dynamic view of the budding yeast pan-genome, available at http://www.yeastgenome.org/variant-viewer. PMID:26578556

  4. Gene Variant Databases and Sharing: Creating a Global Genomic Variant Database for Personalized Medicine.

    PubMed

    Bean, Lora J H; Hegde, Madhuri R

    2016-06-01

    Revolutionary changes in sequencing technology and the desire to develop therapeutics for rare diseases have led to the generation of an enormous amount of genomic data in the last 5 years. Large-scale sequencing done in both research and diagnostic laboratories has linked many new genes to rare diseases, but has also generated a number of variants that we cannot interpret today. It is clear that we remain a long way from a complete understanding of the genomic variation in the human genome and its association with human health and disease. Recent studies identified susceptibility markers to infectious diseases and also the contribution of rare variants to complex diseases in different populations. The sequencing revolution has also led to the creation of a large number of databases that act as "keepers" of data, and in many cases give an interpretation of the effect of the variant. This interpretation is based on reports in the literature, prediction models, and in some cases is accompanied by functional evidence. As we move toward the practice of genomic medicine, and consider its place in "personalized medicine," it is time to ask ourselves how we can aggregate this wealth of data into a single database for multiple users with different goals. PMID:26931283

  5. Object-Oriented Geographical Database Model

    NASA Technical Reports Server (NTRS)

    Johnson, M. L.; Bryant, N.; Sapounas, D.

    1996-01-01

    Terbase is an Object-Oriented database system under development at the Jet Propulsion Laboratory (JPL). Terbase is designed for flexibility, reusability, maintenace ease, multi-user collaboration and independence, and efficiency. This paper details the design and development of Terbase as a geographic data server...

  6. The IPD and IMGT/HLA database: allele variant databases

    PubMed Central

    Robinson, James; Halliwell, Jason A.; Hayhurst, James D.; Flicek, Paul; Parham, Peter; Marsh, Steven G. E.

    2015-01-01

    The Immuno Polymorphism Database (IPD) was developed to provide a centralized system for the study of polymorphism in genes of the immune system. Through the IPD project we have established a central platform for the curation and publication of locus-specific databases involved either directly or related to the function of the Major Histocompatibility Complex in a number of different species. We have collaborated with specialist groups or nomenclature committees that curate the individual sections before they are submitted to IPD for online publication. IPD consists of five core databases, with the IMGT/HLA Database as the primary database. Through the work of the various nomenclature committees, the HLA Informatics Group and in collaboration with the European Bioinformatics Institute we are able to provide public access to this data through the website http://www.ebi.ac.uk/ipd/. The IPD project continues to develop with new tools being added to address scientific developments, such as Next Generation Sequencing, and to address user feedback and requests. Regular updates to the website ensure that new and confirmatory sequences are dispersed to the immunogenetics community, and the wider research and clinical communities. PMID:25414341

  7. Genotype-based databases for variants causing rare diseases.

    PubMed

    Lanthaler, Barbara; Wieser, Stefanie; Deutschmann, Andrea; Schossig, Anna; Fauth, Christine; Zschocke, Johannes; Witsch-Baumgartner, Martina

    2014-10-15

    Inherited diseases are the result of DNA sequence changes. In recessive diseases, the clinical phenotype results from the combined functional effects of variants in both copies of the gene. In some diseases there is often considerable variability of clinical presentation or disease severity, which may be predicted by the genotype. Additional effects may be triggered by environmental factors, as well as genetic modifiers which could be nucleotide polymorphisms in related genes, e.g. maternal ApoE or ABCA1 genotypes which may have an influence on the phenotype of SLOS individuals. Here we report the establishment of genotype variation databases for various rare diseases which provide individual clinical phenotypes associated with genotypes and include data about possible genetic modifiers. These databases aim to be an easy public access to information on rare and private variants with clinical data, which will facilitate the interpretation of genetic variants. The created databases include ACAD8 (isobutyryl-CoA dehydrogenase deficiency (IBD)), ACADSB (short-chain acyl-CoA dehydrogenase (SCAD) deficiency), AUH (3-methylglutaconic aciduria (3-MGCA)), DHCR7 (Smith-Lemli-Opitz syndrome), HMGCS2 (3-hydroxy-3-methylglutaryl-CoA synthase 2 deficiency), HSD17B10 (17-beta-hydroxysteroid dehydrogenase X deficiency), FKBP14 (Ehlers-Danlos syndrome with progressive kyphoscoliosis, myopathy, and hearing loss; EDSKMH) and ROGDI (Kohlschütter-Tönz syndrome). These genes have been selected because of our specific research interests in these rare and metabolic diseases. The aim of the database was to include all identified individuals with variants in these specific genes. Identical genotypes are listed multiple times if they were found in several patients, phenotypic descriptions and biochemical data are included as detailed as possible in view also of validating the proposed pathogenicity of these genotypes. For DHCR7 genetic modifier data (maternal APOE and ABCA1 genotypes) is

  8. Compact variant-rich customized sequence database and a fast and sensitive database search for efficient proteogenomic analyses.

    PubMed

    Park, Heejin; Bae, Junwoo; Kim, Hyunwoo; Kim, Sangok; Kim, Hokeun; Mun, Dong-Gi; Joh, Yoonsung; Lee, Wonyeop; Chae, Sehyun; Lee, Sanghyuk; Kim, Hark Kyun; Hwang, Daehee; Lee, Sang-Won; Paek, Eunok

    2014-12-01

    In proteogenomic analysis, construction of a compact, customized database from mRNA-seq data and a sensitive search of both reference and customized databases are essential to accurately determine protein abundances and structural variations at the protein level. However, these tasks have not been systematically explored, but rather performed in an ad-hoc fashion. Here, we present an effective method for constructing a compact database containing comprehensive sequences of sample-specific variants--single nucleotide variants, insertions/deletions, and stop-codon mutations derived from Exome-seq and RNA-seq data. It, however, occupies less space by storing variant peptides, not variant proteins. We also present an efficient search method for both customized and reference databases. The separate searches of the two databases increase the search time, and a unified search is less sensitive to identify variant peptides due to the smaller size of the customized database, compared to the reference database, in the target-decoy setting. Our method searches the unified database once, but performs target-decoy validations separately. Experimental results show that our approach is as fast as the unified search and as sensitive as the separate searches. Our customized database includes mutation information in the headers of variant peptides, thereby facilitating the inspection of peptide-spectrum matches. PMID:25316439

  9. An object-oriented database for protein structure analysis.

    PubMed

    Gray, P M; Paton, N W; Kemp, G J; Fothergill, J E

    1990-03-01

    An object-oriented database system has been developed which is being used to store protein structure data. The database can be queried using the logic programming language Prolog or the query language Daplex. Queries retrieve information by navigating through a network of objects which represent the primary, secondary and tertiary structures of proteins. Routines written in both Prolog and Daplex can integrate complex calculations with the retrieval of data from the database, and can also be stored in the database for sharing among users. Thus object-oriented databases are better suited to prototyping applications and answering complex queries about protein structure than relational databases. This system has been used to find loops of varying length and anchor positions when modelling homologous protein structures. PMID:2188261

  10. Knowledge Discovery in Variant Databases Using Inductive Logic Programming

    PubMed Central

    Nguyen, Hoan; Luu, Tien-Dao; Poch, Olivier; Thompson, Julie D.

    2013-01-01

    Understanding the effects of genetic variation on the phenotype of an individual is a major goal of biomedical research, especially for the development of diagnostics and effective therapeutic solutions. In this work, we describe the use of a recent knowledge discovery from database (KDD) approach using inductive logic programming (ILP) to automatically extract knowledge about human monogenic diseases. We extracted background knowledge from MSV3d, a database of all human missense variants mapped to 3D protein structure. In this study, we identified 8,117 mutations in 805 proteins with known three-dimensional structures that were known to be involved in human monogenic disease. Our results help to improve our understanding of the relationships between structural, functional or evolutionary features and deleterious mutations. Our inferred rules can also be applied to predict the impact of any single amino acid replacement on the function of a protein. The interpretable rules are available at http://decrypthon.igbmc.fr/kd4v/. PMID:23589683

  11. PKDB: Polycystic Kidney Disease Mutation Database--a gene variant database for autosomal dominant polycystic kidney disease.

    PubMed

    Gout, Alexander M; Martin, Neilson C; Brown, Alastair F; Ravine, David

    2007-07-01

    Autosomal dominant polycystic kidney disease (ADPKD) arises from mutations in the PKD1 and PKD2 genes. The Polycystic Kidney Disease Mutation Database (PKDB) is an internet-accessible relational database containing comprehensive information about germline and somatic disease-causing variants within these two genes, as well as polymorphisms and variants of indeterminate pathogenicity. The PKDB database structure incorporates an interface between these gene variant data and any associated patient clinical data. An initiative of the Polycystic Kidney Disease Foundation, PKDB is a publicly accessible database that aims to streamline the evaluation of PKD1 and PKD2 gene variants detected in samples from those with ADPKD, as well as to assist ongoing clinical and molecular research in the field. As the accurate reporting of nucleotide variants is essential for ensuring the quality of data within PKDB, a mutation checker has been mounted on the PKDB server allowing contributors to assess the accuracy of their PKD1 and PKD2 variant reports. Researchers and clinicians may submit their PKD1/PKD2 gene variants and any associated deidentified clinical data via standardized downloadable data entry forms accessible through the PKDB site. PKDB has been launched with the full details of PKD1 and PKD2 gene variant reports published in 73 peer-reviewed articles. Through a series of user-friendly advanced search facilities, users are able to query the database as required. The PKDB server is accessible at http://pkdb.mayo.edu. PMID:17370309

  12. Object-oriented structures supporting remote sensing databases

    NASA Technical Reports Server (NTRS)

    Wichmann, Keith; Cromp, Robert F.

    1995-01-01

    Object-oriented databases show promise for modeling the complex interrelationships pervasive in scientific domains. To examine the utility of this approach, we have developed an Intelligent Information Fusion System based on this technology, and applied it to the problem of managing an active repository of remotely-sensed satellite scenes. The design and implementation of the system is compared and contrasted with conventional relational database techniques, followed by a presentation of the underlying object-oriented data structures used to enable fast indexing into the data holdings.

  13. A survey of commercial object-oriented database management systems

    NASA Technical Reports Server (NTRS)

    Atkins, John

    1992-01-01

    The object-oriented data model is the culmination of over thirty years of database research. Initially, database research focused on the need to provide information in a consistent and efficient manner to the business community. Early data models such as the hierarchical model and the network model met the goal of consistent and efficient access to data and were substantial improvements over simple file mechanisms for storing and accessing data. However, these models required highly skilled programmers to provide access to the data. Consequently, in the early 70's E.F. Codd, an IBM research computer scientists, proposed a new data model based on the simple mathematical notion of the relation. This model is known as the Relational Model. In the relational model, data is represented in flat tables (or relations) which have no physical or internal links between them. The simplicity of this model fostered the development of powerful but relatively simple query languages that now made data directly accessible to the general database user. Except for large, multi-user database systems, a database professional was in general no longer necessary. Database professionals found that traditional data in the form of character data, dates, and numeric data were easily represented and managed via the relational model. Commercial relational database management systems proliferated and performance of relational databases improved dramatically. However, there was a growing community of potential database users whose needs were not met by the relational model. These users needed to store data with data types not available in the relational model and who required a far richer modelling environment than that provided by the relational model. Indeed, the complexity of the objects to be represented in the model mandated a new approach to database technology. The Object-Oriented Model was the result.

  14. A Uniform Indexing Scheme for Object-Oriented Databases.

    ERIC Educational Resources Information Center

    Gudes, Ehud

    1997-01-01

    Performance is a critical factor hindering the use of object-oriented databases (OODB). This article proposes a new and uniform indexing scheme for enhancing OODBs with advantages for small range, clustered sets queries. Reviews several other indexing schemes; presents the U-index scheme; discusses its performance; and presents experimental…

  15. EMEN2: An Object Oriented Database and Electronic Lab Notebook

    PubMed Central

    Rees, Ian; Langley, Ed; Chiu, Wah; Ludtke, Steven J.

    2013-01-01

    Transmission electron microscopy and associated methods such as single particle analysis, 2-D crystallography, helical reconstruction and tomography, are highly data-intensive experimental sciences, which also have substantial variability in experimental technique. Object-oriented databases present an attractive alternative to traditional relational databases for situations where the experiments themselves are continually evolving. We present EMEN2, an easy to use object-oriented database with a highly flexible infrastructure originally targeted for transmission electron microscopy and tomography, which has been extended to be adaptable for use in virtually any experimental science. It is a pure object-oriented database designed for easy adoption in diverse laboratory environments, and does not require professional database administration. It includes a full featured, dynamic web interface in addition to APIs for programmatic access. EMEN2 installations currently support roughly 800 scientists worldwide with over 1/2 million experimental records and over 20 TB of experimental data. The software is freely available with complete source. PMID:23360752

  16. LOVD v.2.0: the next generation in gene variant databases.

    PubMed

    Fokkema, Ivo F A C; Taschner, Peter E M; Schaafsma, Gerard C P; Celli, J; Laros, Jeroen F J; den Dunnen, Johan T

    2011-05-01

    Locus-Specific DataBases (LSDBs) store information on gene sequence variation associated with human phenotypes and are frequently used as a reference by researchers and clinicians. We developed the Leiden Open-source Variation Database (LOVD) as a platform-independent Web-based LSDB-in-a-Box package. LOVD was designed to be easy to set up and maintain and follows the Human Genome Variation Society (HGVS) recommendations. Here we describe LOVD v.2.0, which adds enhanced flexibility and functionality and has the capacity to store sequence variants in multiple genes per patient. To reduce redundancy, patient and sequence variant data are stored in separate tables. Tables are linked to generate connections between sequence variant data for each gene and every patient. The dynamic structure allows database managers to add custom columns. The database structure supports fast queries and allows storage of sequence variants from high-throughput sequence analysis, as demonstrated by the X-chromosomal Mental Retardation LOVD installation. LOVD contains measures to ensure database security from unauthorized access. Currently, the LOVD Website (http://www.LOVD.nl/) lists 71 public LOVD installations hosting 3,294 gene variant databases with 199,000 variants in 84,000 patients. To promote LSDB standardization and thereby database interoperability, we offer free server space and help to establish an LSDB on our Leiden server. PMID:21520333

  17. The Human Variome Project: ensuring the quality of DNA variant databases in inherited renal disease.

    PubMed

    Savige, Judy; Dalgleish, Raymond; Cotton, Richard Gh; den Dunnen, Johan T; Macrae, Finlay; Povey, Sue

    2015-11-01

    A recent review identified 60 common inherited renal diseases caused by DNA variants in 132 different genes. These diseases can be diagnosed with DNA sequencing, but each gene probably also has a thousand normal variants. Many more normal variants have been characterised by individual laboratories than are reported in the literature or found in publicly accessible collections. At present, testing laboratories must assess each novel change they identify for pathogenicity, even when this has been done elsewhere previously, and the distinction between normal and disease-associated variants is particularly an issue with the recent surge in exomic sequencing and gene discovery projects. The Human Variome Project recommends the establishment of gene-specific DNA variant databases to facilitate the sharing of DNA variants and decisions about likely disease causation. Databases improve diagnostic accuracy and testing efficiency, and reduce costs. They also help with genotype-phenotype correlations and predictive algorithms. The Human Variome Project advocates databases that use standardised descriptions, are up-to-date, include clinical information and are freely available. Currently, the genes affected in the most common inherited renal diseases correspond to 350 different variant databases, many of which are incomplete or have insufficient clinical details for genotype-phenotype correlations. Assistance is needed from nephrologists to maximise the usefulness of these databases for the diagnosis and management of inherited renal disease. PMID:25384529

  18. rVarBase: an updated database for regulatory features of human variants.

    PubMed

    Guo, Liyuan; Du, Yang; Qu, Susu; Wang, Jing

    2016-01-01

    We present here the rVarBase database (http://rv.psych.ac.cn), an updated version of the rSNPBase database, to provide reliable and detailed regulatory annotations for known and novel human variants. This update expands the database to include additional types of human variants, such as copy number variations (CNVs) and novel variants, and include additional types of regulatory features. Now rVarBase annotates variants in three dimensions: chromatin states of the surrounding regions, overlapped regulatory elements and variants' potential target genes. Two new types of regulatory elements (lncRNAs and miRNA target sites) have been introduced to provide additional annotation. Detailed information about variants' overlapping transcription factor binding sites (TFBSs) (often less than 15 bp) within experimentally supported TF-binding regions (∼ 150 bp) is provided, along with the binding motifs of matched TF families. Additional types of extended variants and variant-associated phenotypes were also added. In addition to the enrichment in data content, an element-centric search module was added, and the web interface was refined. In summary, rVarBase hosts more types of human variants and includes more types of up-to-date regulatory information to facilitate in-depth functional research and to provide practical clues for experimental design. PMID:26503253

  19. Mapper: A distributed object-oriented database application

    NASA Technical Reports Server (NTRS)

    Younger, Herbert; O'Reilly, John; Frogner, Bjorn

    1995-01-01

    This paper discusses the results of a Small Business Innovation Research (SBIR) project. The initial application involved decomposition of a large database across multiple processors to improve the speed of processing compound queries. The commercial outcome was a tourist information system with a point-to-point driving direction program called MAPPER. A distributed, object-oriented approach was used for the general design, while a spatial decomposition was used to divide the database into computationally manageable pieces. The resulting system is highly flexible with respect to both modifications and reuse.

  20. Building a genome database using an object-oriented approach.

    PubMed

    Barbasiewicz, Anna; Liu, Lin; Lang, B Franz; Burger, Gertraud

    2002-01-01

    GOBASE is a relational database that integrates data associated with mitochondria and chloroplasts. The most important data in GOBASE, i. e., molecular sequences and taxonomic information, are obtained from the public sequence data repository at the National Center for Biotechnology Information (NCBI), and are validated by our experts. Maintaining a curated genomic database comes with a towering labor cost, due to the shear volume of available genomic sequences and the plethora of annotation errors and omissions in records retrieved from public repositories. Here we describe our approach to increase automation of the database population process, thereby reducing manual intervention. As a first step, we used Unified Modeling Language (UML) to construct a list of potential errors. Each case was evaluated independently, and an expert solution was devised, and represented as a diagram. Subsequently, the UML diagrams were used as templates for writing object-oriented automation programs in the Java programming language. PMID:12542407

  1. Embedding CLIPS in a database-oriented diagnostic system

    NASA Technical Reports Server (NTRS)

    Conway, Tim

    1990-01-01

    This paper describes the integration of C Language Production Systems (CLIPS) into a powerful portable maintenance aid (PMA) system used for flightline diagnostics. The current diagnostic target of the system is the Garrett GTCP85-180L, a gas turbine engine used as an Auxiliary Power Unit (APU) on some C-130 military transport aircraft. This project is a database oriented approach to a generic diagnostic system. CLIPS is used for 'many-to-many' pattern matching within the diagnostics process. Patterns are stored in database format, and CLIPS code is generated by a 'compilation' process on the database. Multiple CLIPS rule sets and working memories (in sequence) are supported and communication between the rule sets is achieved via the export and import commands. Work is continuing on using CLIPS in other portions of the diagnostic system and in re-implementing the diagnostic system in the Ada language.

  2. A revised crustal stress orientation database for Canada

    NASA Astrophysics Data System (ADS)

    Reiter, Karsten; Heidbach, Oliver; Schmitt, Douglas; Haug, Kristine; Ziegler, Moritz; Moeck, Inga

    2014-12-01

    The Canadian database on contemporary crustal stress has not been revised systematically in the past two decades. Here we present the results of our new compilation that contains 514 new data records for the orientation data of maximum compressive horizontal stress and 188 data records that were re-assessed. In total the Canadian stress database has now 1667 data records, which is an increase of about 45%. From these data, a new Canadian Stress map as well as one for the Province of Alberta is presented. To analyse the stress pattern, we use the quasi median on the circle as a smoothing algorithm that generates a smoothed stress map of the maximum compressive horizontal stress orientation on a regular grid. The newly introduced quasi interquartile range on the circle estimates the spreading of the data and is used as a measure for the wave-length of the stress pattern. The result of the hybrid wavelength analysis confirms that long spatial wavelength stress patterns (≥ 1000 km) exist in large areas in Canada. The observed stress pattern is transmitted through the intra-plate regions. The results reveal that shorter spatial wave length variation of the maximum compressive horizontal stress orientation of less than 200 km, prevails particularly in south-eastern and western Canada. Regional stress sources such as density contrasts, active fault systems, crustal structures, etc. might have a significant impact in these regions. In contrast to these variations, the observed stress pattern in the Alberta Basin is very homogeneous and mainly controlled by plate boundary forces and body forces. The influence of curvature of the Rocky Mountains salient in southern Alberta is minimal. The present-day horizontal stress orientations determined herein have important implications for the production of hydrocarbons and geothermal energy in the Alberta Basin.

  3. Rice Annotation Database (RAD): a contig-oriented database for map-based rice genomics.

    PubMed

    Ito, Yuichi; Arikawa, Kohji; Antonio, Baltazar A; Ohta, Isamu; Naito, Shinji; Mukai, Yoshiyuki; Shimano, Atsuko; Masukawa, Masatoshi; Shibata, Michie; Yamamoto, Mayu; Ito, Yukiyo; Yokoyama, Junri; Sakai, Yasumichi; Sakata, Katsumi; Nagamura, Yoshiaki; Namiki, Nobukazu; Matsumoto, Takashi; Higo, Kenichi; Sasaki, Takuji

    2005-01-01

    A contig-oriented database for annotation of the rice genome has been constructed to facilitate map-based rice genomics. The Rice Annotation Database has the following functional features: (i) extensive effort of manual annotations of P1-derived artificial chromosome/bacterial artificial chromosome clones can be merged at chromosome and contig-level; (ii) concise visualization of the annotation information such as the predicted genes, results of various prediction programs (RiceHMM, Genscan, Genscan+, Fgenesh, GeneMark, etc.), homology to expressed sequence tag, full-length cDNA and protein; (iii) user-friendly clone / gene query system; (iv) download functions for nucleotide, amino acid and coding sequences; (v) analysis of various features of the genome (GC-content, average value, etc.); and (vi) genome-wide homology search (BLAST) of contig- and chromosome-level genome sequence to allow comparative analysis with the genome sequence of other organisms. As of October 2004, the database contains a total of 215 Mb sequence with relevant annotation results including 30 000 manually curated genes. The database can provide the latest information on manual annotation as well as a comprehensive structural analysis of various features of the rice genome. The database can be accessed at http://rad.dna.affrc.go.jp/. PMID:15608281

  4. HistoneDB 2.0: a histone database with variants--an integrated resource to explore histones and their variants.

    PubMed

    Draizen, Eli J; Shaytan, Alexey K; Mariño-Ramírez, Leonardo; Talbert, Paul B; Landsman, David; Panchenko, Anna R

    2016-01-01

    Compaction of DNA into chromatin is a characteristic feature of eukaryotic organisms. The core (H2A, H2B, H3, H4) and linker (H1) histone proteins are responsible for this compaction through the formation of nucleosomes and higher order chromatin aggregates. Moreover, histones are intricately involved in chromatin functioning and provide a means for genome dynamic regulation through specific histone variants and histone post-translational modifications. 'HistoneDB 2.0--with variants' is a comprehensive database of histone protein sequences, classified by histone types and variants. All entries in the database are supplemented by rich sequence and structural annotations with many interactive tools to explore and compare sequences of different variants from various organisms. The core of the database is a manually curated set of histone sequences grouped into 30 different variant subsets with variant-specific annotations. The curated set is supplemented by an automatically extracted set of histone sequences from the non-redundant protein database using algorithms trained on the curated set. The interactive web site supports various searching strategies in both datasets: browsing of phylogenetic trees; on-demand generation of multiple sequence alignments with feature annotations; classification of histone-like sequences and browsing of the taxonomic diversity for every histone variant. HistoneDB 2.0 is a resource for the interactive comparative analysis of histone protein sequences and their implications for chromatin function. Database URL: http://www.ncbi.nlm.nih.gov/projects/HistoneDB2.0. PMID:26989147

  5. A reuse oriented Development Database: the HELIOS Object Information System.

    PubMed

    Lavril, M; Doré, L; Zaplétal, E; Jean, F C; Degoulet, P

    1994-12-01

    This paper describes the Development Database of a Software Engineering Environment (SEE), that couples reuse and object-oriented technologies. We propose a classification model for the repository of reusable components that is a support for an efficient retrieval mechanism, and a reusable component model that considers components of large and low-granularity levels (e.g., application frameworks and methods of a given class). The reusable component model is based on the underlying idea that development components are not operational components since they do not have the same purpose. This model first represents each component as an aggregation of other sub-components, together with all information needed for its retrieval (e.g., classification and facets), its under-standing (e.g., informal functional description) and its dependencies with other development components, in particular the applications in which it occurs, since applications are considered also as abstract development components. This approach was tested thanks to an existing application which was loaded into the SEE. From that moment, it was possible to regenerate a new application in a reasonably short time. Moreover, the existence of a retrieval tool permits to validate the development components classification, but also put forward the importance of the qualification step. In particular, the versioning should be carefully processed. Furthermore the fact that all the development objects are built on an homogeneous model allows easier tools management and interapplication reusability. PMID:7882674

  6. Update of MmtDB: a Metazoa mitochondrial DNA variants database.

    PubMed Central

    Attimonelli, M; Calò, D; De Montalvo, A; Lanave, C; Sasanelli, D; Tommaseo Ponzetta, M; Saccone, C

    1998-01-01

    The present paper describes the improvements in MmtDB, a specialised database designed to collect Metazoa mitochondrial DNA variants. Priority in the data collection has been given to Metazoa for which a large amount of variants is available, e.g., for humans. Starting from the sequences available in the Nucleotide Sequence Databases, the redundant sequences have been removed and new sequences from other sources have been added. Value-added information is associated to each variant sequence, e.g., analysed region, experimental method, tissue and cell lines, population data, sex, age, family code and information about the variation events (nucleotide position, involved gene, restriction site gain or loss). Cross-references are introduced to the EMBL Data Library, as well as an internal cross-referencing among MmtDB entries according to tissual, heteroplasmic, familiar and aplotypical correlation. Furthermore MmtDB has a new section, AMmtDB: Aligned Metazoan mitochondrial biosequences. MmtDB can be accessed through the World Wide Web at URL http://WWW.ba.cnr.it/[symbol: see text]areamt08/MmtDBWWW.htm PMID:9399815

  7. SMPD1 Mutation Update: Database and Comprehensive Analysis of Published and Novel Variants.

    PubMed

    Zampieri, Stefania; Filocamo, Mirella; Pianta, Annalisa; Lualdi, Susanna; Gort, Laura; Coll, Maria Jose; Sinnott, Richard; Geberhiwot, Tarekegn; Bembi, Bruno; Dardis, Andrea

    2016-02-01

    Niemann-Pick Types A and B (NPA/B) diseases are autosomal recessive lysosomal storage disorders caused by the deficient activity of acid sphingomyelinase (ASM) because of the mutations in the SMPD1 gene. Here, we provide a comprehensive updated review of already reported and newly identified SMPD1 variants. Among them, 185 have been found in NPA/B patients. Disease-causing variants are equally distributed along the SMPD1 gene; most of them are missense (65.4%) or frameshift (19%) mutations. The most frequently reported mutation worldwide is the p.R610del, clearly associated with an attenuated NP disease type B phenotype. The available information about the impact of 52 SMPD1 variants on ASM mRNA and/or enzymatic activity has been collected and whenever possible, phenotype/genotype correlations were established. In addition, we created a locus-specific database easily accessible at http://www.inpdr.org/genes that catalogs the 417 SMPD1 variants reported to date and provides data on their in silico predicted effects on ASM protein function or mRNA splicing. The information reviewed in this article, providing new insights into the genotype/phenotype correlation, is extremely valuable to facilitate diagnosis and genetic counseling of families affected by NPA/B. PMID:26499107

  8. Database for Safety-Oriented Tracking of Chemicals

    NASA Technical Reports Server (NTRS)

    Stump, Jacob; Carr, Sandra; Plumlee, Debrah; Slater, Andy; Samson, Thomas M.; Holowaty, Toby L.; Skeete, Darren; Haenz, Mary Alice; Hershman, Scot; Raviprakash, Pushpa

    2010-01-01

    SafetyChem is a computer program that maintains a relational database for tracking chemicals and associated hazards at Johnson Space Center (JSC) by use of a Web-based graphical user interface. The SafetyChem database is accessible to authorized users via a JSC intranet. All new chemicals pass through a safety office, where information on hazards, required personal protective equipment (PPE), fire-protection warnings, and target organ effects (TOEs) is extracted from material safety data sheets (MSDSs) and recorded in the database. The database facilitates real-time management of inventory with attention to such issues as stability, shelf life, reduction of waste through transfer of unused chemicals to laboratories that need them, quantification of chemical wastes, and identification of chemicals for which disposal is required. Upon searching the database for a chemical, the user receives information on physical properties of the chemical, hazard warnings, required PPE, a link to the MSDS, and references to the applicable International Standards Organization (ISO) 9000 standard work instructions and the applicable job hazard analysis. Also, to reduce the labor hours needed to comply with reporting requirements of the Occupational Safety and Health Administration, the data can be directly exported into the JSC hazardous- materials database.

  9. HbVar: A relational database of human hemoglobin variants and thalassemia mutations at the globin gene server.

    PubMed

    Hardison, Ross C; Chui, David H K; Giardine, Belinda; Riemer, Cathy; Patrinos, George P; Anagnou, Nicholas; Miller, Webb; Wajcman, Henri

    2002-03-01

    We have constructed a relational database of hemoglobin variants and thalassemia mutations, called HbVar, which can be accessed on the web at http://globin.cse.psu.edu. Extensive information is recorded for each variant and mutation, including a description of the variant and associated pathology, hematology, electrophoretic mobility, methods of isolation, stability information, ethnic occurrence, structure studies, functional studies, and references. The initial information was derived from books by Dr. Titus Huisman and colleagues [Huisman et al., 1996, 1997, 1998]. The current database is updated regularly with the addition of new data and corrections to previous data. Queries can be formulated based on fields in the database. Tables of common categories of variants, such as all those involving the alpha1-globin gene (HBA1) or all those that result in high oxygen affinity, are maintained by automated queries on the database. Users can formulate more precise queries, such as identifying "all beta-globin variants associated with instability and found in Scottish populations." This new database should be useful for clinical diagnosis as well as in fundamental studies of hemoglobin biochemistry, globin gene regulation, and human sequence variation at these loci. PMID:11857738

  10. TMC-SNPdb: an Indian germline variant database derived from whole exome sequences.

    PubMed

    Upadhyay, Pawan; Gardi, Nilesh; Desai, Sanket; Sahoo, Bikram; Singh, Ankita; Togar, Trupti; Iyer, Prajish; Prasad, Ratnam; Chandrani, Pratik; Gupta, Sudeep; Dutt, Amit

    2016-01-01

    Cancer is predominantly a somatic disease. A mutant allele present in a cancer cell genome is considered somatic when it's absent in the paired normal genome along with public SNP databases. The current build of dbSNP, the most comprehensive public SNP database, however inadequately represents several non-European Caucasian populations, posing a limitation in cancer genomic analyses of data from these populations. We present the T: ata M: emorial C: entre-SNP D: ata B: ase (TMC-SNPdb), as the first open source, flexible, upgradable, and freely available SNP database (accessible through dbSNP build 149 and ANNOVAR)-representing 114 309 unique germline variants-generated from whole exome data of 62 normal samples derived from cancer patients of Indian origin. The TMC-SNPdb is presented with a companion subtraction tool that can be executed with command line option or using an easy-to-use graphical user interface with the ability to deplete additional Indian population specific SNPs over and above dbSNP and 1000 Genomes databases. Using an institutional generated whole exome data set of 132 samples of Indian origin, we demonstrate that TMC-SNPdb could deplete 42, 33 and 28% false positive somatic events post dbSNP depletion in Indian origin tongue, gallbladder, and cervical cancer samples, respectively. Beyond cancer somatic analyses, we anticipate utility of the TMC-SNPdb in several Mendelian germline diseases. In addition to dbSNP build 149 and ANNOVAR, the TMC-SNPdb along with the subtraction tool is available for download in the public domain at the following:Database URL: http://www.actrec.gov.in/pi-webpages/AmitDutt/TMCSNP/TMCSNPdp.html. PMID:27402678

  11. The Swiss-Prot variant page and the ModSNP database: a resource for sequence and structure information on human protein variants.

    PubMed

    Yip, Yum L; Scheib, Holger; Diemand, Alexander V; Gattiker, Alexandre; Famiglietti, Livia M; Gasteiger, Elisabeth; Bairoch, Amos

    2004-05-01

    Missense mutation leading to single amino acid polymorphism (SAP) is the type of mutation most frequently related to human diseases. The Swiss-Prot protein knowledgebase records information on such mutations in various sections of a protein entry, namely in the "feature," "comment," and "reference" fields. To facilitate users in obtaining the most relevant information about each human SAP recorded in the knowledgebase, the Swiss-Prot Variant web pages were created to provide a summary of available sequence information, as well as additional structural information on each variant. In particular, the ModSNP database was set up to store information related to SAPs and to manage the modeling of SAPs onto protein structures via an automatic homology modeling pipeline. Currently, among the 16,566 human SAPs recorded in the Swiss-Prot knowledgebase (release 42.5, 21 November 2003), more than 25% have corresponding 3D-models. Of these variants, 47% are related to disease, 26% are polymorphisms, and 27% are not yet clearly classified. The ModSNP database is updated and the subsequent model construction pipeline is launched with each weekly Swiss-Prot release. Thus, the ModSNP database represents a valuable resource for the structural analysis of protein variation. The Swiss-Prot variant pages are accessible from the NiceProt view of a Swiss-Prot entry on the ExPASy server (www.expasy.org/), via a hyperlink created for the stable and unique identifier FTId of each human SAP. PMID:15108278

  12. dbWGFP: a database and web server of human whole-genome single nucleotide variants and their functional predictions

    PubMed Central

    Wu, Jiaxin; Wu, Mengmeng; Li, Lianshuo; Liu, Zhuo; Zeng, Wanwen; Jiang, Rui

    2016-01-01

    The recent advancement of the next generation sequencing technology has enabled the fast and low-cost detection of all genetic variants spreading across the entire human genome, making the application of whole-genome sequencing a tendency in the study of disease-causing genetic variants. Nevertheless, there still lacks a repository that collects predictions of functionally damaging effects of human genetic variants, though it has been well recognized that such predictions play a central role in the analysis of whole-genome sequencing data. To fill this gap, we developed a database named dbWGFP (a database and web server of human whole-genome single nucleotide variants and their functional predictions) that contains functional predictions and annotations of nearly 8.58 billion possible human whole-genome single nucleotide variants. Specifically, this database integrates 48 functional predictions calculated by 17 popular computational methods and 44 valuable annotations obtained from various data sources. Standalone software, user-friendly query services and free downloads of this database are available at http://bioinfo.au.tsinghua.edu.cn/dbwgfp. dbWGFP provides a valuable resource for the analysis of whole-genome sequencing, exome sequencing and SNP array data, thereby complementing existing data sources and computational resources in deciphering genetic bases of human inherited diseases. PMID:26989155

  13. dbWGFP: a database and web server of human whole-genome single nucleotide variants and their functional predictions.

    PubMed

    Wu, Jiaxin; Wu, Mengmeng; Li, Lianshuo; Liu, Zhuo; Zeng, Wanwen; Jiang, Rui

    2016-01-01

    The recent advancement of the next generation sequencing technology has enabled the fast and low-cost detection of all genetic variants spreading across the entire human genome, making the application of whole-genome sequencing a tendency in the study of disease-causing genetic variants. Nevertheless, there still lacks a repository that collects predictions of functionally damaging effects of human genetic variants, though it has been well recognized that such predictions play a central role in the analysis of whole-genome sequencing data. To fill this gap, we developed a database named dbWGFP (a database and web server of human whole-genome single nucleotide variants and their functional predictions) that contains functional predictions and annotations of nearly 8.58 billion possible human whole-genome single nucleotide variants. Specifically, this database integrates 48 functional predictions calculated by 17 popular computational methods and 44 valuable annotations obtained from various data sources. Standalone software, user-friendly query services and free downloads of this database are available at http://bioinfo.au.tsinghua.edu.cn/dbwgfp. dbWGFP provides a valuable resource for the analysis of whole-genome sequencing, exome sequencing and SNP array data, thereby complementing existing data sources and computational resources in deciphering genetic bases of human inherited diseases. PMID:26989155

  14. Reliability database development for use with an object-oriented fault tree evaluation program

    NASA Technical Reports Server (NTRS)

    Heger, A. Sharif; Harringtton, Robert J.; Koen, Billy V.; Patterson-Hine, F. Ann

    1989-01-01

    A description is given of the development of a fault-tree analysis method using object-oriented programming. In addition, the authors discuss the programs that have been developed or are under development to connect a fault-tree analysis routine to a reliability database. To assess the performance of the routines, a relational database simulating one of the nuclear power industry databases has been constructed. For a realistic assessment of the results of this project, the use of one of existing nuclear power reliability databases is planned.

  15. Constraints on Biological Mechanism from Disease Comorbidity Using Electronic Medical Records and Database of Genetic Variants

    PubMed Central

    Bagley, Steven C.; Sirota, Marina; Chen, Richard; Butte, Atul J.; Altman, Russ B.

    2016-01-01

    Patterns of disease co-occurrence that deviate from statistical independence may represent important constraints on biological mechanism, which sometimes can be explained by shared genetics. In this work we study the relationship between disease co-occurrence and commonly shared genetic architecture of disease. Records of pairs of diseases were combined from two different electronic medical systems (Columbia, Stanford), and compared to a large database of published disease-associated genetic variants (VARIMED); data on 35 disorders were available across all three sources, which include medical records for over 1.2 million patients and variants from over 17,000 publications. Based on the sources in which they appeared, disease pairs were categorized as having predominant clinical, genetic, or both kinds of manifestations. Confounding effects of age on disease incidence were controlled for by only comparing diseases when they fall in the same cluster of similarly shaped incidence patterns. We find that disease pairs that are overrepresented in both electronic medical record systems and in VARIMED come from two main disease classes, autoimmune and neuropsychiatric. We furthermore identify specific genes that are shared within these disease groups. PMID:27115429

  16. An intelligent object-oriented database system for materials information

    SciTech Connect

    Smith, F.J.; Krishnamurthy, M.V.; Tripathy, S.R.; Sage, P.

    1995-12-31

    Materials information systems are used normally to provide data that is input to a separate design or test system that then computes the characteristics of some item or entity being designed or examined. This paper concerns the study of an integrated system that includes both the data and design systems together; therefore, it includes not only data on the property of materials but also knowledge on the geometry of the item being designed. In addition, it includes formulas representing the laws of physical science used to carry out the calculation needed to compute the necessary characteristics of the item. This kind of system can determine the optimum materials for the design of an item, rather than simply retrieve material properties. It may replace the simple search and retrieve systems available today, and it is argued that the complexity of such a system could only be achieved using an object-oriented methodology.

  17. VAS: A Vision Advisor System combining agents and object-oriented databases

    NASA Technical Reports Server (NTRS)

    Eilbert, James L.; Lim, William; Mendelsohn, Jay; Braun, Ron; Yearwood, Michael

    1994-01-01

    A model-based approach to identifying and finding the orientation of non-overlapping parts on a tray has been developed. The part models contain both exact and fuzzy descriptions of part features, and are stored in an object-oriented database. Full identification of the parts involves several interacting tasks each of which is handled by a distinct agent. Using fuzzy information stored in the model allowed part features that were essentially at the noise level to be extracted and used for identification. This was done by focusing attention on the portion of the part where the feature must be found if the current hypothesis of the part ID is correct. In going from one set of parts to another the only thing that needs to be changed is the database of part models. This work is part of an effort in developing a Vision Advisor System (VAS) that combines agents and objected-oriented databases.

  18. Construction and assessment of individualized proteogenomic databases for large-scale analysis of nonsynonymous single nucleotide variants.

    PubMed

    Krug, Karsten; Popic, Sasa; Carpy, Alejandro; Taumer, Christoph; Macek, Boris

    2014-12-01

    Next-generation sequencing projects focusing on genomes and transcriptomes identify millions of single nucleotide variants (SNVs), many of which result in single amino acid substitutions. These nonsynonymous (ns) SNVs are typically not incorporated into protein sequence databases used to identify MS/MS data. Here, we perform a comparative analysis of the assembly of nsSNV-containing proteogenomic databases. We use a comprehensive transcriptome and proteome dataset of HeLa cells from the literature to derive and to incorporate SNVs into databases applicable to proteomics search engines, and to assess their performance in the identification of nsSNVs. We assemble the databases by (1) translation of SNV-containing transcripts into all possible reading frames, (2) translation of predicted reading frame, (3) prediction of nsSNVs and subsequent incorporation into canonical protein sequences. We show substantial differences between generated databases in terms of represented nsSNVs and theoretical search space, affecting sensitivity and specificity of database search. We query the databases with >2.2M high-resolution MS/MS spectra using MaxQuant software and identify 451 variant peptides, containing 401 nsSNVs. We conclude that prediction of reading frame and, if applicable, SNV effect result in comprehensive yet compact databases necessary to retain sensitivity in large-scale analysis of nsSNVs called from transcriptomics data. PMID:25251379

  19. Implementation of schema management in STEP-based object-oriented engineering database management system

    NASA Astrophysics Data System (ADS)

    Xiao, Ke; Zhao, Zhige; Sun, Jiaguang

    1996-03-01

    Engineering database management system (EDBMS) is the kernel of CAD/CAM system integration, and object-oriented EDBMS (OOEDBMS) is the best implementation. While STEP is becoming the standard of product data exchange and representation, supporting STEP in engineering database becomes more and more important. In this paper we introduce the architecture of STEP based OOEDBMS in our CAD/CAM integrated system GHCAD. We focus on schema management and three-grade database management in OOEDBMS. Topics such as DDL compiler, transformation from EXPRESS to DDL, DDL tools are discussed. Finally further research directions of schema management in OOEDBMS are present.

  20. An Overview of OOPS+, an Object-Oriented Database Programming Language

    NASA Astrophysics Data System (ADS)

    Laenens, Els; Vermeir, Dirk

    This paper provides a brief introduction to the OOPS+ knowledge-representation language. While basically object-oriented, OOPS+ integrates database concepts as well as classical knowledge-representation techniques such as rule-based inference and demons. In addition, the language supports types as first-class objects, inheritance, imperative function definition, and query facilities based on logic programming.

  1. Design and Implementation of ROCK & ROLL: A Deductive Object-Oriented Database System.

    ERIC Educational Resources Information Center

    Barja, Maria L.; And Others

    1995-01-01

    Presents the design and implementation of a deductive object-oriented database which is built upon a formally defined data model that uses two languages: an imperative programming language called ROCK (Rule Object Computation Kernel), and a logic language called ROLL (Rule Object Logic Language). (LRW)

  2. Compression of Index Term Dictionary in an Inverted-File-Oriented Database: Some Effective Algorithms.

    ERIC Educational Resources Information Center

    Wisniewski, Janusz L.

    1986-01-01

    Discussion of a new method of index term dictionary compression in an inverted-file-oriented database highlights a technique of word coding, which generates short fixed-length codes obtained from the index terms themselves by analysis of monogram and bigram statistical distributions. Substantial savings in communication channel utilization are…

  3. Crystallographic Texture and Orientation Variants in Al2O3-Y3Al5O12 Directionally Solidified Eutectic Crystals

    NASA Technical Reports Server (NTRS)

    Frazer, Colleen S.; Dickey, Elizabeth C.; Sayir, Ali; Farmer, Serene (Technical Monitor)

    2001-01-01

    Eutectic rods of Al2O3 and Y3Al5O12 were grown by a laser-heated float zone method, and their microstructure and crystallographic texture were studied by scanning electron microscopy, electron backscattered diffraction and x-ray diffraction. The composites were found to be highly textured with two twin-related crystallographic orientation relationships between the phases. Electron backscattered diffraction was employed to determine the spatial distribution of the orientational variants within the samples and to define the crystallographic orientation of various microstructural features.

  4. Refined Geographic Distribution of the Oriental ALDH2*504Lys (nee 487Lys) Variant

    PubMed Central

    Li, Hui; Borinskaya, Svetlana; Yoshimura, Kimio; Kal’ina, Nina; Marusin, Andrey; Stepanov, Vadim A.; Qin, Zhendong; Khaliq, Shagufta; Lee, Mi-Young; Yang, Yajun; Mohyuddin, Aisha; Gurwitz, David; Mehdi, Syed Qasim; Rogaev, Evgeny; Jin, Li; Yankovsky, Nikolay K.; Kidd, Judith R.; Kidd, Kenneth K.

    2010-01-01

    Summary Mitochondrial aldehyde dehydrogenase (ALDH2) is one of the most important enzymes in human alcohol metabolism. The oriental ALDH2*504Lys variant functions as a dominant negative greatly reducing activity in heterozygotes and abolishing activity in homozygotes. This allele is associated with serious disorders such as alcohol liver disease, late onset Alzheimer disease, colorectal cancer, and esophageal cancer, and is best known for protection against alcoholism. Many hundreds of papers in various languages have been published on this variant, providing allele frequency data for many different populations. To develop a highly refined global geographic distribution of ALDH2*504Lys, we have collected new data on 4,091 individuals from 86 population samples and assembled published data on a total of 80,691 individuals from 366 population samples. The allele is essentially absent in all parts of the world except East Asia. The ALDH2*504Lys allele has its highest frequency in Southeast China, and occurs in most areas of China, Japan, Korea, Mongolia, and Indochina with frequencies gradually declining radially from Southeast China. As the indigenous populations in South China have much lower frequencies than the southern Han migrants from Central China, we conclude that ALDH2*504Lys was carried by Han Chinese as they spread throughout East Asia. Esophageal cancer, with its highest incidence in East Asia, may be associated with ALDH2*504Lys because of a toxic effect of increased acetaldehyde in the tissue where ingested ethanol has its highest concentration. While the distributions of esophageal cancer and ALDH2*504Lys do not precisely correlate, that does not disprove the hypothesis. In general the study of fine scale geographic distributions of ALDH2*504Lys and diseases may help in understanding the multiple relationships among genes, diseases, environments, and cultures. PMID:19456322

  5. EBSD imaging of orientation relationships and variant groupings in different martensitic alloys and Widmanstätten iron meteorites

    SciTech Connect

    Cayron, Cyril

    2014-08-15

    An automatic method to colorize and quantify the classical Pitsch, Kurdjumov–Sachs, Greninger–Troiano and Nishiyama–Wasserman orientation relationships in the electron backscatter diffraction maps of martensitic/bainitic steels is detailed. Automatic analysis of variant grouping is also presented. The method was applied to low and high carbon steels, and to iron–nickel Widmanstätten meteorites. Many results of recent literature are confirmed. In low carbon steels the individual laths exhibit continuous orientation gradients between the classical orientation relationships, and the laths tend to be grouped by close-packed plane (morphological) packets. A crystallographic scenario describing the formation of the packets is proposed on the base of the one-step model. When the carbon content increases, the orientation spreading is reduced; and martensite tends to form plate groups and burst configurations. In iron–nickel meteorites, the centimeter long Widmanstätten laths do not exhibit continuous orientation gradients but are constituted of subgrains with uniform orientation relationship; the kamacite grains in the plessite regions are grouped into Bain zones, probably due to a recrystallization during the slow cooling of the meteorites. - Highlights: • Analysis of different low and high carbon steels and Widmanstätten meteorites • Automatic color mapping of the classical orientation relationships in EBSD maps • Quantification of variant pairing and grouping tendencies • Crystallographic scenario for the formation of morphological packets.

  6. A comparative study of six European databases of medically oriented Web resources

    PubMed Central

    Abad García, Francisca; González Teruel, Aurora; Bayo Calduch, Patricia; de Ramón Frias, Rosa; Castillo Blasco, Lourdes

    2005-01-01

    Objectives: The paper describes six European medically oriented databases of Web resources, pertaining to five quality-controlled subject gateways, and compares their performance. Method: The characteristics, coverage, procedure for selecting Web resources, record structure, searching possibilities, and existence of user assistance were described for each database. Performance indicators for each database were obtained by means of searches carried out using the key words, “myocardial infarction.” Results: Most of the databases originated in the 1990s in an academic or library context and include all types of Web resources of an international nature. Five databases use Medical Subject Headings. The number of fields per record varies between three and nineteen. The language of the search interfaces is mostly English, and some of them allow searches in other languages. In some databases, the search can be extended to Pubmed. Organizing Medical Networked Information, Catalogue et Index des Sites Médicaux Francophones, and Diseases, Disorders and Related Topics produced the best results. Conclusions: The usefulness of these databases as quick reference resources is clear. In addition, their lack of content overlap means that, for the user, they complement each other. Their continued survival faces three challenges: the instability of the Internet, maintenance costs, and lack of use in spite of their potential usefulness. PMID:16239943

  7. A Toolkit for Active Object-Oriented Databases with Application to Interoperability

    NASA Technical Reports Server (NTRS)

    King, Roger

    1996-01-01

    In our original proposal we stated that our research would 'develop a novel technology that provides a foundation for collaborative information processing.' The essential ingredient of this technology is the notion of 'deltas,' which are first-class values representing collections of proposed updates to a database. The Heraclitus framework provides a variety of algebraic operators for building up, combining, inspecting, and comparing deltas. Deltas can be directly applied to the database to yield a new state, or used 'hypothetically' in queries against the state that would arise if the delta were applied. The central point here is that the step of elevating deltas to 'first-class' citizens in database programming languages will yield tremendous leverage on the problem of supporting updates in collaborative information processing. In short, our original intention was to develop the theoretical and practical foundation for a technology based on deltas in an object-oriented database context, develop a toolkit for active object-oriented databases, and apply this toward collaborative information processing.

  8. A Toolkit for Active Object-Oriented Databases with Application to Interoperability

    NASA Technical Reports Server (NTRS)

    King, Roger

    1996-01-01

    In our original proposal we stated that our research would 'develop a novel technology that provides a foundation for collaborative information processing.' The essential ingredient of this technology is the notion of 'deltas,' which are first-class values representing collections of proposed updates to a database. The Heraclitus framework provides a variety of algebraic operators for building up, combining, inspecting, and comparing deltas. Deltas can be directly applied to the database to yield a new state, or used 'hypothetically' in queries against the state that would arise if the delta were applied. The central point here is that the step of elevating deltas to 'first-class' citizens in database programming languages will yield tremendous leverage on the problem of supporting updates in collaborative information processing. In short, our original intention was to develop the theoretical and practical foundation for a technology based on deltas in an object- oriented database context, develop a toolkit for active object-oriented databases, and apply this toward collaborative information processing.

  9. ARACHNID: A prototype object-oriented database tool for distributed systems

    NASA Technical Reports Server (NTRS)

    Younger, Herbert; Oreilly, John; Frogner, Bjorn

    1994-01-01

    This paper discusses the results of a Phase 2 SBIR project sponsored by NASA and performed by MIMD Systems, Inc. A major objective of this project was to develop specific concepts for improved performance in accessing large databases. An object-oriented and distributed approach was used for the general design, while a geographical decomposition was used as a specific solution. The resulting software framework is called ARACHNID. The Faint Source Catalog developed by NASA was the initial database testbed. This is a database of many giga-bytes, where an order of magnitude improvement in query speed is being sought. This database contains faint infrared point sources obtained from telescope measurements of the sky. A geographical decomposition of this database is an attractive approach to dividing it into pieces. Each piece can then be searched on individual processors with only a weak data linkage between the processors being required. As a further demonstration of the concepts implemented in ARACHNID, a tourist information system is discussed. This version of ARACHNID is the commercial result of the project. It is a distributed, networked, database application where speed, maintenance, and reliability are important considerations. This paper focuses on the design concepts and technologies that form the basis for ARACHNID.

  10. Extensions to the time-oriented database model to support temporal reasoning in medical expert systems.

    PubMed

    Kahn, M G; Fagan, L M; Tu, S

    1991-01-01

    Physicians faced with diagnostic and therapeutic decisions must reason about clinical features that change over time. Database-management systems (DBMS) can increase access to patient data, but most systems are limited in their ability to store and retrieve complex temporal information. The Time-Oriented Databank (TOD) model, the most widely used data model for medical database systems, associates a single time stamp with each observation. The proper analysis of most clinical data requires accounting for multiple concurrent clinical events that may alter the interpretation of the raw data. Most medical DBMSs cannot retrieve patient data indexed by multiple clinical events. We describe two logical extensions to TOD-based databases that solve a set of temporal reasoning problems we encountered in constructing medical expert systems. A key feature of both extensions is that stored data are partitioned into groupings, such as sequential clinical visits, clinical exacerbations, or other abstract events that have clinical decision-making relevance. The temporal network (TNET) is an object-oriented database that extends the temporal reasoning capabilities of ONCOCIN, a medical expert system that provides chemotherapy advice. TNET uses persistent objects to associate observations with intervals of time during which "an event of clinical interest" occurred. A second object-oriented system called the extended temporal network (ETNET), is both an extension and a simplification of TNET. Like TNET, ETNET uses persistent objects to represent relevant intervals; unlike the first system, however, ETNET contains reasoning methods (rules) that can be executed when an event "begins", and that are withdrawn when that event "concludes". TNET and ETNET capture temporal relationships among recorded information that are not represented in TOD-based databases. Although they do not solve all temporal reasoning problems found in medical decision making, these new structures enable patient

  11. Action-Oriented Benchmarking: Using the CEUS Database to Benchmark Commercial Buildings in California

    SciTech Connect

    Mathew, Paul; Mills, Evan; Bourassa, Norman; Brook, Martha

    2008-02-01

    The 2006 Commercial End Use Survey (CEUS) database developed by the California Energy Commission is a far richer source of energy end-use data for non-residential buildings than has previously been available and opens the possibility of creating new and more powerful energy benchmarking processes and tools. In this article--Part 2 of a two-part series--we describe the methodology and selected results from an action-oriented benchmarking approach using the new CEUS database. This approach goes beyond whole-building energy benchmarking to more advanced end-use and component-level benchmarking that enables users to identify and prioritize specific energy efficiency opportunities - an improvement on benchmarking tools typically in use today.

  12. WWW-based access to object-oriented clinical databases: the KHOSPAD project.

    PubMed

    Pinciroli, F; Portoni, L; Combi, C; Violante, F F

    1998-09-01

    KHOSPAD is a project aiming at improving the quality of the process of patient care concerning general practitioner-patient-hospital relationships, using current information and networking technologies. The studied application field is a cardiology division, with hemodynamic laboratory and the population of PTCA patients. Data related to PTCA patients are managed by ARCADIA, an object-oriented database management system developed for the considered clinical setting. We defined a remotely accessible view of ARCADIA medical record, suitable for general practitioners (GPs) caring patients after PTCA, during the follow-up period. Using a PC, a modem and Internet, an authorized GP can consult remotely the medical records of his PTCA patients. Main features of the application are related to the management and display of complex data, specifically characterized by multimedia and temporal features, based on an object-oriented temporal data model. PMID:9861510

  13. ASPicDB: a database of annotated transcript and protein variants generated by alternative splicing

    PubMed Central

    Martelli, Pier L.; D’Antonio, Mattia; Bonizzoni, Paola; Castrignanò, Tiziana; D’Erchia, Anna M.; D’Onorio De Meo, Paolo; Fariselli, Piero; Finelli, Michele; Licciulli, Flavio; Mangiulli, Marina; Mignone, Flavio; Pavesi, Giulio; Picardi, Ernesto; Rizzi, Raffaella; Rossi, Ivan; Valletti, Alessio; Zauli, Andrea; Zambelli, Federico; Casadio, Rita; Pesole, Graziano

    2011-01-01

    Alternative splicing is emerging as a major mechanism for the expansion of the transcriptome and proteome diversity, particularly in human and other vertebrates. However, the proportion of alternative transcripts and proteins actually endowed with functional activity is currently highly debated. We present here a new release of ASPicDB which now provides a unique annotation resource of human protein variants generated by alternative splicing. A total of 256 939 protein variants from 17 191 multi-exon genes have been extensively annotated through state of the art machine learning tools providing information of the protein type (globular and transmembrane), localization, presence of PFAM domains, signal peptides, GPI-anchor propeptides, transmembrane and coiled-coil segments. Furthermore, full-length variants can be now specifically selected based on the annotation of CAGE-tags and polyA signal and/or polyA sites, marking transcription initiation and termination sites, respectively. The retrieval can be carried out at gene, transcript, exon, protein or splice site level allowing the selection of data sets fulfilling one or more features settled by the user. The retrieval interface also enables the selection of protein variants showing specific differences in the annotated features. ASPicDB is available at http://www.caspur.it/ASPicDB/. PMID:21051348

  14. RBP-Var: a database of functional variants involved in regulation mediated by RNA-binding proteins

    PubMed Central

    Mao, Fengbiao; Xiao, Luoyuan; Li, Xianfeng; Liang, Jialong; Teng, Huajing; Cai, Wanshi; Sun, Zhong Sheng

    2016-01-01

    Transcription factors bind to the genome by forming specific contacts with the primary DNA sequence; however, RNA-binding proteins (RBPs) have greater scope to achieve binding specificity through the RNA secondary structure. It has been revealed that single nucleotide variants (SNVs) that alter RNA structure, also known as RiboSNitches, exhibit 3-fold greater local structure changes than replicates of the same DNA sequence, demonstrated by the fact that depletion of RiboSNitches could result in the alteration of specific RNA shapes at thousands of sites, including 3′ UTRs, binding sites of microRNAs and RBPs. However, the network between SNVs and post-transcriptional regulation remains unclear. Here, we developed RBP-Var, a database freely available at http://www.rbp-var.biols.ac.cn/, which provides annotation of functional variants involved in post-transcriptional interaction and regulation. RBP-Var provides an easy-to-use web interface that allows users to rapidly find whether SNVs of interest can transform the secondary structure of RNA and identify RBPs whose binding may be subsequently disrupted. RBP-Var integrates DNA and RNA biology to understand how various genetic variants and post-transcriptional mechanisms cooperate to orchestrate gene expression. In summary, RBP-Var is useful in selecting candidate SNVs for further functional studies and exploring causal SNVs underlying human diseases. PMID:26635394

  15. RBP-Var: a database of functional variants involved in regulation mediated by RNA-binding proteins.

    PubMed

    Mao, Fengbiao; Xiao, Luoyuan; Li, Xianfeng; Liang, Jialong; Teng, Huajing; Cai, Wanshi; Sun, Zhong Sheng

    2016-01-01

    Transcription factors bind to the genome by forming specific contacts with the primary DNA sequence; however, RNA-binding proteins (RBPs) have greater scope to achieve binding specificity through the RNA secondary structure. It has been revealed that single nucleotide variants (SNVs) that alter RNA structure, also known as RiboSNitches, exhibit 3-fold greater local structure changes than replicates of the same DNA sequence, demonstrated by the fact that depletion of RiboSNitches could result in the alteration of specific RNA shapes at thousands of sites, including 3' UTRs, binding sites of microRNAs and RBPs. However, the network between SNVs and post-transcriptional regulation remains unclear. Here, we developed RBP-Var, a database freely available at http://www.rbp-var.biols.ac.cn/, which provides annotation of functional variants involved in post-transcriptional interaction and regulation. RBP-Var provides an easy-to-use web interface that allows users to rapidly find whether SNVs of interest can transform the secondary structure of RNA and identify RBPs whose binding may be subsequently disrupted. RBP-Var integrates DNA and RNA biology to understand how various genetic variants and post-transcriptional mechanisms cooperate to orchestrate gene expression. In summary, RBP-Var is useful in selecting candidate SNVs for further functional studies and exploring causal SNVs underlying human diseases. PMID:26635394

  16. Description and analysis of genetic variants in French hereditary breast and ovarian cancer families recorded in the UMD-BRCA1/BRCA2 databases

    PubMed Central

    Caputo, Sandrine; Benboudjema, Louisa; Sinilnikova, Olga; Rouleau, Etienne; Béroud, Christophe; Lidereau, Rosette

    2012-01-01

    BRCA1 and BRCA2 are the two main genes responsible for predisposition to breast and ovarian cancers, as a result of protein-inactivating monoallelic mutations. It remains to be established whether many of the variants identified in these two genes, so-called unclassified/unknown variants (UVs), contribute to the disease phenotype or are simply neutral variants (or polymorphisms). Given the clinical importance of establishing their status, a nationwide effort to annotate these UVs was launched by laboratories belonging to the French GGC consortium (Groupe Génétique et Cancer), leading to the creation of the UMD-BRCA1/BRCA2 databases (http://www.umd.be/BRCA1/ and http://www.umd.be/BRCA2/). These databases have been endorsed by the French National Cancer Institute (INCa) and are designed to collect all variants detected in France, whether causal, neutral or UV. They differ from other BRCA databases in that they contain co-occurrence data for all variants. Using these data, the GGC French consortium has been able to classify certain UVs also contained in other databases. In this article, we report some novel UVs not contained in the BIC database and explore their impact in cancer predisposition based on a structural approach. PMID:22144684

  17. Building a Tool for Cost-Based Design of Object-Oriented Database Schemas

    NASA Astrophysics Data System (ADS)

    Biskup, Joachim; Menzel, Ralf

    In the traditional waterfall approach for building a software application, the phases of requirements analysis, design, implementation, testing, and maintenance follow one another. Aiming at the efficiency of a database application, we see that the outcome of the implementation phase decisively determines how much time the execution of queries and updates requires and how much space is needed to store the application data. But, these costs of the application result from decisions made not only in the implementation phase but also before that during the design phase. In this paper, we describe a tool to support the cost-based design of database applications. Based on earlier research where we designed a cost-model for an abstract object-oriented database machine, the tool shall provide its user with cost estimates during the design phase. We discuss which modifications and additions to our cost-model we use to build the tool. Specifically, we portray how we adapt the tool to a concrete DBMS. After picturing a design process that employs our tool, we conclude by assessing the achievements of the present work and how we benefited from our earlier underlying research.

  18. TMC-SNPdb: an Indian germline variant database derived from whole exome sequences

    PubMed Central

    Upadhyay, Pawan; Gardi, Nilesh; Desai, Sanket; Sahoo, Bikram; Singh, Ankita; Togar, Trupti; Iyer, Prajish; Prasad, Ratnam; Chandrani, Pratik; Gupta, Sudeep; Dutt, Amit

    2016-01-01

    Cancer is predominantly a somatic disease. A mutant allele present in a cancer cell genome is considered somatic when it’s absent in the paired normal genome along with public SNP databases. The current build of dbSNP, the most comprehensive public SNP database, however inadequately represents several non-European Caucasian populations, posing a limitation in cancer genomic analyses of data from these populations. We present the Tata Memorial Centre-SNP database (TMC-SNPdb), as the first open source, flexible, upgradable, and freely available SNP database (accessible through dbSNP build 149 and ANNOVAR)—representing 114 309 unique germline variants—generated from whole exome data of 62 normal samples derived from cancer patients of Indian origin. The TMC-SNPdb is presented with a companion subtraction tool that can be executed with command line option or using an easy-to-use graphical user interface with the ability to deplete additional Indian population specific SNPs over and above dbSNP and 1000 Genomes databases. Using an institutional generated whole exome data set of 132 samples of Indian origin, we demonstrate that TMC-SNPdb could deplete 42, 33 and 28% false positive somatic events post dbSNP depletion in Indian origin tongue, gallbladder, and cervical cancer samples, respectively. Beyond cancer somatic analyses, we anticipate utility of the TMC-SNPdb in several Mendelian germline diseases. In addition to dbSNP build 149 and ANNOVAR, the TMC-SNPdb along with the subtraction tool is available for download in the public domain at the following: Database URL: http://www.actrec.gov.in/pi-webpages/AmitDutt/TMCSNP/TMCSNPdp.html PMID:27402678

  19. The utilization of neural nets in populating an object-oriented database

    NASA Technical Reports Server (NTRS)

    Campbell, William J.; Hill, Scott E.; Cromp, Robert F.

    1989-01-01

    Existing NASA supported scientific data bases are usually developed, managed and populated in a tedious, error prone and self-limiting way in terms of what can be described in a relational Data Base Management System (DBMS). The next generation Earth remote sensing platforms (i.e., Earth Observation System, (EOS), will be capable of generating data at a rate of over 300 Mbs per second from a suite of instruments designed for different applications. What is needed is an innovative approach that creates object-oriented databases that segment, characterize, catalog and are manageable in a domain-specific context and whose contents are available interactively and in near-real-time to the user community. Described here is work in progress that utilizes an artificial neural net approach to characterize satellite imagery of undefined objects into high-level data objects. The characterized data is then dynamically allocated to an object-oriented data base where it can be reviewed and assessed by a user. The definition, development, and evolution of the overall data system model are steps in the creation of an application-driven knowledge-based scientific information system.

  20. Introduction to CHRS CONNECT - a global extreme precipitation event database using object-oriented approach

    NASA Astrophysics Data System (ADS)

    Nguyen, P.; Thorstensen, A. R.; Liu, H.; Sellars, S. L.; Ashouri, H.; Huynh, P.; Palacios, T.; Li, P.; Tran, H.; Braithwaite, D.; Hsu, K. L.; Gao, X.; Sorooshian, S.

    2015-12-01

    Extreme precipitation events cause natural disasters that impact many parts of the world. Understanding how these events vary in space and time is a key goal in climatology research. The recently developed CHRS CONNECT (Center for Hydrometeorology & Remote Sensing CONNected precipitation objECT) system is a global extreme precipitation event database derived from CHRS's satellite precipitation data products, including PERSIANN (Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks) and PERSIANN-CDR (Climate Data Record). Precipitation data from PERSIANN is hourly, 0.25ox0.25o grid, 60oS - 60oN, from 2000 to 2015, and data from PERSIANN-CDR is daily, 0.25ox0.25o grid, 60oS - 60oN, from 1983 to 2015. We used an advanced method in computer science which represents a data point on a three dimensional grid (longitude, latitude and time) called volumetric pixel or voxel. An object segmentation algorithm was developed to derive precipitation events as objects. In each object, voxels are connected to each other through the 26 connectivity faces (a voxel is connected to a neighboring voxel if they share a common face). The object-oriented algorithm was designed to provide a unique means in which extreme precipitation events and their attributes can be stored in a searchable database. This database is accessible through a user-friendly interface (connect.eng.uci.edu), allowing the user to retrieve events that fit specific criteria of interest such as spatiotemporal domain, maximum intensity, minimum duration and climatology indices. The interface includes several modes for visualization such as total precipitation, event tracking, and event evolution animation. The CHRS CONNECT tool is designed to be used for climatology research related to extreme precipitation events as well as for water resources management applications.

  1. Object {open_quotes}request{close_quotes} based clustering for method processing in object-oriented database system

    SciTech Connect

    Goel, S.; Bhargava, B.

    1996-12-31

    Static grouping (clustering) of component objects in a complex object at the server has been an active area of research in client/server based object oriented database systems. We present a client-driven object grouping approach. A client executing a method makes dynamic decisions and groups objects for a request to the server. The client requires run-time and statically analyzed information for the method to make its decisions. Complex object skeletons are used for navigating the complex object. We have conducted experimental studies to evaluate our approach. We have used a prototype object-oriented database system called O-Raid for our experiments.

  2. HistoneDB 2.0: a histone database with variants—an integrated resource to explore histones and their variants

    PubMed Central

    Draizen, Eli J.; Shaytan, Alexey K.; Mariño-Ramírez, Leonardo; Talbert, Paul B.; Landsman, David; Panchenko, Anna R.

    2016-01-01

    Compaction of DNA into chromatin is a characteristic feature of eukaryotic organisms. The core (H2A, H2B, H3, H4) and linker (H1) histone proteins are responsible for this compaction through the formation of nucleosomes and higher order chromatin aggregates. Moreover, histones are intricately involved in chromatin functioning and provide a means for genome dynamic regulation through specific histone variants and histone post-translational modifications. ‘HistoneDB 2.0 – with variants’ is a comprehensive database of histone protein sequences, classified by histone types and variants. All entries in the database are supplemented by rich sequence and structural annotations with many interactive tools to explore and compare sequences of different variants from various organisms. The core of the database is a manually curated set of histone sequences grouped into 30 different variant subsets with variant-specific annotations. The curated set is supplemented by an automatically extracted set of histone sequences from the non-redundant protein database using algorithms trained on the curated set. The interactive web site supports various searching strategies in both datasets: browsing of phylogenetic trees; on-demand generation of multiple sequence alignments with feature annotations; classification of histone-like sequences and browsing of the taxonomic diversity for every histone variant. HistoneDB 2.0 is a resource for the interactive comparative analysis of histone protein sequences and their implications for chromatin function. Database URL: http://www.ncbi.nlm.nih.gov/projects/HistoneDB2.0 PMID:26989147

  3. Panel sequencing for clinically oriented variant screening and copy number detection in 142 untreated multiple myeloma patients

    PubMed Central

    Kortuem, K M; Braggio, E; Bruins, L; Barrio, S; Shi, C S; Zhu, Y X; Tibes, R; Viswanatha, D; Votruba, P; Ahmann, G; Fonseca, R; Jedlowski, P; Schlam, I; Kumar, S; Bergsagel, P L; Stewart, A K

    2016-01-01

    We employed a customized Multiple Myeloma (MM)-specific Mutation Panel (M3P) to screen a homogenous cohort of 142 untreated MM patients for relevant mutations in a selection of disease-specific genes. M3Pv2.0 includes 77 genes selected for being either actionable targets, potentially related to drug–response or part of known key pathways in MM biology. We identified mutations in potentially actionable genes in 49% of patients and provided prognostic evidence of STAT3 mutations. This panel may serve as a practical alternative to more comprehensive sequencing approaches, providing genomic information in a timely and cost-effective manner, thus allowing clinically oriented variant screening in MM. PMID:26918361

  4. A database of crystal preferred orientation of olivine in upper mantle rocks

    NASA Astrophysics Data System (ADS)

    Mainprice, D.

    2012-12-01

    Olivine is the most volumetrically abundant mineral in the Earth's upper mantle, as such it dominates the mechanical and physical properties and has a controlling influence of the geodynamics of plate tectonics. Since the pioneering work of Hess and others we know that seismic anisotropy of the shallow mantle is related to olivine and it's crystal preferred orientation (CPO). With advent of plate tectonics the understanding of the key role of peridotite rocks became a major scientific objective and the measurement CPO of olivine in upper mantle samples became an important tool for studying the kinematics of these rocks. Our group originally lead by Adolphe Nicolas introduced the systematic use of CPO measured by U-stage for field studies all over the world for over 30 years, this tradition was extended in last 15 years by the use of electron back-scattered diffraction (EBSD) to study of CPO and the associated digital microstructure. It is an appropriate time to analysis this significant database of olivine CPO, which represents the work of our group, both present and former members, as well as collaborating colleagues. It is also interesting to compare the natural record as illustrated by our database in the light of recent experimental results stimulated by the extended ranges in temperature, pressure and finite strain, as well as intrinsic olivine variables such as hydrogen content. To analysis the database, which is heterogeneous because it is constructed from the individual work of many people over a 45 year period containing U-stage data and EBSD measurements (manual indexing point per grain, automatic indexing one point per grain, automatic indexing gridded mapping data) of various formats, we need a flexible software tool that can handle large volumes of data in consistent way. We have used the state-of-art open source MTEX toolbox for quantitative texture analysis. MTEX is a scriptable MATLAB toolbox, which permits all aspects of quantitative texture

  5. Searching biosignal databases by content and context: Research Oriented Integration System for ECG Signals (ROISES).

    PubMed

    Kokkinaki, Alexandra; Chouvarda, Ioanna; Maglaveras, Nicos

    2012-11-01

    Technological advances in textile, biosensor and electrocardiography domain induced the wide spread use of bio-signal acquisition devices leading to the generation of massive bio-signal datasets. Among the most popular bio-signals, electrocardiogram (ECG) possesses the longest tradition in bio-signal monitoring and recording, being a strong and relatively robust signal. As research resources are fostered, research community promotes the need to extract new knowledge from bio-signals towards the adoption of new medical procedures. However, integrated access, query and management of ECGs are impeded by the diversity and heterogeneity of bio-signal storage data formats. In this scope, the proposed work introduces a new methodology for the unified access to bio-signal databases and the accompanying metadata. It allows decoupling information retrieval from actual underlying datasource structures and enables transparent content and context based searching from multiple data resources. Our approach is based on the definition of an interactive global ontology which manipulates the similarities and the differences of the underlying sources to either establish similarity mappings or enrich its terminological structure. We also introduce ROISES (Research Oriented Integration System for ECG Signals), for the definition of complex content based queries against the diverse bio-signal data sources. PMID:21397354

  6. Image Engine: an object-oriented multimedia database for storing, retrieving and sharing medical images and text.

    PubMed

    Lowe, H J

    1993-01-01

    This paper describes Image Engine, an object-oriented, microcomputer-based, multimedia database designed to facilitate the storage and retrieval of digitized biomedical still images, video, and text using inexpensive desktop computers. The current prototype runs on Apple Macintosh computers and allows network database access via peer to peer file sharing protocols. Image Engine supports both free text and controlled vocabulary indexing of multimedia objects. The latter is implemented using the TView thesaurus model developed by the author. The current prototype of Image Engine uses the National Library of Medicine's Medical Subject Headings (MeSH) vocabulary (with UMLS Meta-1 extensions) as its indexing thesaurus. PMID:8130596

  7. Using semantic data modeling techniques to organize an object-oriented database for extending the mass storage model

    NASA Technical Reports Server (NTRS)

    Campbell, William J.; Short, Nicholas M., Jr.; Roelofs, Larry H.; Dorfman, Erik

    1991-01-01

    A methodology for optimizing organization of data obtained by NASA earth and space missions is discussed. The methodology uses a concept based on semantic data modeling techniques implemented in a hierarchical storage model. The modeling is used to organize objects in mass storage devices, relational database systems, and object-oriented databases. The semantic data modeling at the metadata record level is examined, including the simulation of a knowledge base and semantic metadata storage issues. The semantic data model hierarchy and its application for efficient data storage is addressed, as is the mapping of the application structure to the mass storage.

  8. Molecular abnormality of an inactive aldehyde dehydrogenase variant commonly found in Orientals.

    PubMed Central

    Yoshida, A; Huang, I Y; Ikawa, M

    1984-01-01

    Usual human livers contain two major aldehyde dehydrogenase [(ALDH) aldehyde:NAD+ oxidoreductase] isozymes--i.e., a cytosolic ALDH1 component and a mitochondrial ALDH2 component--whereas approximately equal to 50% of Orientals are "atypical" and have only the ALDH1 isozyme and are missing the ALDH2 isozyme. We previously demonstrated that atypical livers contain an enzymatically inactive but immunologically crossreactive material (CRM) corresponding to the ALDH2 component. The enzymatically active ALDH2 obtained from a usual liver and the CRM obtained from an atypical liver were reduced, S-carboxymethylated, and digested by trypsin. Separation of their digests by high-performance reverse-phase chromatography and by two-dimensional paper chromatography and electrophoresis revealed that ALDH2 contained a peptide sequence of -Glu-Leu-Gly-Glu-Ala-Gly-Leu-Gln-Ala-Asn-Val-Gln-Val-Lys- and that the glutamine adjacent to lysine was substituted by lysine in CRM. All other tryptic peptides, including eight peptides containing S-carboxymethylcysteine, were common in ALDH2 and CRM. It is concluded that a point mutation in the human ALDH2 locus produced the glutamine leads to lysine substitution and enzyme inactivation. Images PMID:6582480

  9. Initial experiences with building a health care infrastructure based on Java and object-oriented database technology.

    PubMed Central

    Dionisio, J. D.; Sinha, U.; Dai, B.; Johnson, D. B.; Taira, R. K.

    1999-01-01

    A multi-tiered telemedicine system based on Java and object-oriented database technology has yielded a number of practical insights and experiences on their effectiveness and suitability as implementation bases for a health care infrastructure. The advantages and drawbacks to their use, as seen within the context of the telemedicine system's development, are discussed. Overall, these technologies deliver on their early promise, with a few remaining issues that are due primarily to their relative newness. PMID:10566412

  10. GetData: A filesystem-based, column-oriented database format for time-ordered binary data

    NASA Astrophysics Data System (ADS)

    Wiebe, Donald V.; Netterfield, Calvin B.; Kisner, Theodore S.

    2015-12-01

    The GetData Project is the reference implementation of the Dirfile Standards, a filesystem-based, column-oriented database format for time-ordered binary data. Dirfiles provide a fast, simple format for storing and reading data, suitable for both quicklook and analysis pipelines. GetData provides a C API and bindings exist for various other languages. GetData is distributed under the terms of the GNU Lesser General Public License.

  11. Nomenclature- and database-compatible names for the two Ebola virus variants that emerged in Guinea and the Democratic Republic of the Congo in 2014.

    PubMed

    Kuhn, Jens H; Andersen, Kristian G; Baize, Sylvain; Bào, Yīmíng; Bavari, Sina; Berthet, Nicolas; Blinkova, Olga; Brister, J Rodney; Clawson, Anna N; Fair, Joseph; Gabriel, Martin; Garry, Robert F; Gire, Stephen K; Goba, Augustine; Gonzalez, Jean-Paul; Günther, Stephan; Happi, Christian T; Jahrling, Peter B; Kapetshi, Jimmy; Kobinger, Gary; Kugelman, Jeffrey R; Leroy, Eric M; Maganga, Gael Darren; Mbala, Placide K; Moses, Lina M; Muyembe-Tamfum, Jean-Jacques; N'Faly, Magassouba; Nichol, Stuart T; Omilabu, Sunday A; Palacios, Gustavo; Park, Daniel J; Paweska, Janusz T; Radoshitzky, Sheli R; Rossi, Cynthia A; Sabeti, Pardis C; Schieffelin, John S; Schoepp, Randal J; Sealfon, Rachel; Swanepoel, Robert; Towner, Jonathan S; Wada, Jiro; Wauquier, Nadia; Yozwiak, Nathan L; Formenty, Pierre

    2014-11-01

    In 2014, Ebola virus (EBOV) was identified as the etiological agent of a large and still expanding outbreak of Ebola virus disease (EVD) in West Africa and a much more confined EVD outbreak in Middle Africa. Epidemiological and evolutionary analyses confirmed that all cases of both outbreaks are connected to a single introduction each of EBOV into human populations and that both outbreaks are not directly connected. Coding-complete genomic sequence analyses of isolates revealed that the two outbreaks were caused by two novel EBOV variants, and initial clinical observations suggest that neither of them should be considered strains. Here we present consensus decisions on naming for both variants (West Africa: "Makona", Middle Africa: "Lomela") and provide database-compatible full, shortened, and abbreviated names that are in line with recently established filovirus sub-species nomenclatures. PMID:25421896

  12. SurfaceomeDB: a cancer-orientated database for genes encoding cell surface proteins.

    PubMed

    de Souza, Jorge Estefano Santana; Galante, Pedro Alexandre Favoretto; de Almeida, Renan Valieris Bueno; da Cunha, Julia Pinheiro Chagas; Ohara, Daniel Takatori; Ohno-Machado, Lucila; Old, Lloyd J; de Souza, Sandro José

    2012-01-01

    Cell surface proteins (CSPs) are excellent targets for the development of diagnostic and therapeutic reagents, and it is estimated that 10-20% of all genes in the human genome encode CSPs. In an effort to integrate all data publicly available for genes encoding cell surface proteins, a database (SurfaceomeDB) was developed. SurfaceomeDB is a gene-centered portal containing different types of information, including annotation for gene expression, protein domains, somatic mutations in cancer, and protein-protein interactions for all human genes encoding CSPs. SurfaceomeDB was implemented as an integrative and relational database in a user-friendly web interface, where users can search for gene name, gene annotation, or keywords. There is also a streamlined graphical representation of all data provided and links to the most important data repositories and databases, such as NCBI, UCSC Genome Browser, and EBI. PMID:23390370

  13. A Quality-Control-Oriented Database for a Mesoscale Meteorological Observation Network

    NASA Astrophysics Data System (ADS)

    Lussana, C.; Ranci, M.; Uboldi, F.

    2012-04-01

    In the operational context of a local weather service, data accessibility and quality related issues must be managed by taking into account a wide set of user needs. This work describes the structure and the operational choices made for the operational implementation of a database system storing data from highly automated observing stations, metadata and information on data quality. Lombardy's environmental protection agency, ARPA Lombardia, manages a highly automated mesoscale meteorological network. A Quality Assurance System (QAS) ensures that reliable observational information is collected and disseminated to the users. The weather unit in ARPA Lombardia, at the same time an important QAS component and an intensive data user, has developed a database specifically aimed to: 1) providing quick access to data for operational activities and 2) ensuring data quality for real-time applications, by means of an Automatic Data Quality Control (ADQC) procedure. Quantities stored in the archive include hourly aggregated observations of: precipitation amount, temperature, wind, relative humidity, pressure, global and net solar radiation. The ADQC performs several independent tests on raw data and compares their results in a decision-making procedure. An important ADQC component is the Spatial Consistency Test based on Optimal Interpolation. Interpolated and Cross-Validation analysis values are also stored in the database, providing further information to human operators and useful estimates in case of missing data. The technical solution adopted is based on a LAMP (Linux, Apache, MySQL and Php) system, constituting an open source environment suitable for both development and operational practice. The ADQC procedure itself is performed by R scripts directly interacting with the MySQL database. Users and network managers can access the database by using a set of web-based Php applications.

  14. Object-Oriented Database for Managing Building Modeling Components and Metadata: Preprint

    SciTech Connect

    Long, N.; Fleming, K.; Brackney, L.

    2011-12-01

    Building simulation enables users to explore and evaluate multiple building designs. When tools for optimization, parametrics, and uncertainty analysis are combined with analysis engines, the sheer number of discrete simulation datasets makes it difficult to keep track of the inputs. The integrity of the input data is critical to designers, engineers, and researchers for code compliance, validation, and building commissioning long after the simulations are finished. This paper discusses an application that stores inputs needed for building energy modeling in a searchable, indexable, flexible, and scalable database to help address the problem of managing simulation input data.

  15. Therapeutic target database update 2012: a resource for facilitating target-oriented drug discovery

    PubMed Central

    Zhu, Feng; Shi, Zhe; Qin, Chu; Tao, Lin; Liu, Xin; Xu, Feng; Zhang, Li; Song, Yang; Liu, Xianghui; Zhang, Jingxian; Han, Bucong; Zhang, Peng; Chen, Yuzong

    2012-01-01

    Knowledge and investigation of therapeutic targets (responsible for drug efficacy) and the targeted drugs facilitate target and drug discovery and validation. Therapeutic Target Database (TTD, http://bidd.nus.edu.sg/group/ttd/ttd.asp) has been developed to provide comprehensive information about efficacy targets and the corresponding approved, clinical trial and investigative drugs. Since its last update, major improvements and updates have been made to TTD. In addition to the significant increase of data content (from 1894 targets and 5028 drugs to 2025 targets and 17 816 drugs), we added target validation information (drug potency against target, effect against disease models and effect of target knockout, knockdown or genetic variations) for 932 targets, and 841 quantitative structure activity relationship models for active compounds of 228 chemical types against 121 targets. Moreover, we added the data from our previous drug studies including 3681 multi-target agents against 108 target pairs, 116 drug combinations with their synergistic, additive, antagonistic, potentiative or reductive mechanisms, 1427 natural product-derived approved, clinical trial and pre-clinical drugs and cross-links to the clinical trial information page in the ClinicalTrials.gov database for 770 clinical trial drugs. These updates are useful for facilitating target discovery and validation, drug lead discovery and optimization, and the development of multi-target drugs and drug combinations. PMID:21948793

  16. DOGMA: A Disk-Oriented Graph Matching Algorithm for RDF Databases

    NASA Astrophysics Data System (ADS)

    Bröcheler, Matthias; Pugliese, Andrea; Subrahmanian, V. S.

    RDF is an increasingly important paradigm for the representation of information on the Web. As RDF databases increase in size to approach tens of millions of triples, and as sophisticated graph matching queries expressible in languages like SPARQL become increasingly important, scalability becomes an issue. To date, there is no graph-based indexing method for RDF data where the index was designed in a way that makes it disk-resident. There is therefore a growing need for indexes that can operate efficiently when the index itself resides on disk. In this paper, we first propose the DOGMA index for fast subgraph matching on disk and then develop a basic algorithm to answer queries over this index. This algorithm is then significantly sped up via an optimized algorithm that uses efficient (but correct) pruning strategies when combined with two different extensions of the index. We have implemented a preliminary system and tested it against four existing RDF database systems developed by others. Our experiments show that our algorithm performs very well compared to these systems, with orders of magnitude improvements for complex graph queries.

  17. Patient-Oriented Cancer Information on the Internet: A Comparison of Wikipedia and a Professionally Maintained Database

    PubMed Central

    Rajagopalan, Malolan S.; Khanna, Vineet K.; Leiter, Yaacov; Stott, Meghan; Showalter, Timothy N.; Dicker, Adam P.; Lawrence, Yaacov R.

    2011-01-01

    Purpose: A wiki is a collaborative Web site, such as Wikipedia, that can be freely edited. Because of a wiki's lack of formal editorial control, we hypothesized that the content would be less complete and accurate than that of a professional peer-reviewed Web site. In this study, the coverage, accuracy, and readability of cancer information on Wikipedia were compared with those of the patient-orientated National Cancer Institute's Physician Data Query (PDQ) comprehensive cancer database. Methods: For each of 10 cancer types, medically trained personnel scored PDQ and Wikipedia articles for accuracy and presentation of controversies by using an appraisal form. Reliability was assessed by using interobserver variability and test-retest reproducibility. Readability was calculated from word and sentence length. Results: Evaluators were able to rapidly assess articles (18 minutes/article), with a test-retest reliability of 0.71 and interobserver variability of 0.53. For both Web sites, inaccuracies were rare, less than 2% of information examined. PDQ was significantly more readable than Wikipedia: Flesch-Kincaid grade level 9.6 versus 14.1. There was no difference in depth of coverage between PDQ and Wikipedia (29.9, 34.2, respectively; maximum possible score 72). Controversial aspects of cancer care were relatively poorly discussed in both resources (2.9 and 6.1 for PDQ and Wikipedia, respectively, NS; maximum possible score 18). A planned subanalysis comparing common and uncommon cancers demonstrated no difference. Conclusion: Although the wiki resource had similar accuracy and depth as the professionally edited database, it was significantly less readable. Further research is required to assess how this influences patients' understanding and retention. PMID:22211130

  18. Application of a five-tiered scheme for standardized classification of 2,360 unique mismatch repair gene variants lodged on the InSiGHT locus-specific database

    PubMed Central

    Plazzer, John-Paul; Greenblatt, Marc S.; Akagi, Kiwamu; Al-Mulla, Fahd; Bapat, Bharati; Bernstein, Inge; Capellá, Gabriel; den Dunnen, Johan T.; du Sart, Desiree; Fabre, Aurelie; Farrell, Michael P.; Farrington, Susan M.; Frayling, Ian M.; Frebourg, Thierry; Goldgar, David E.; Heinen, Christopher D.; Holinski-Feder, Elke; Kohonen-Corish, Maija; Robinson, Kristina Lagerstedt; Leung, Suet Yi; Martins, Alexandra; Moller, Pal; Morak, Monika; Nystrom, Minna; Peltomaki, Paivi; Pineda, Marta; Qi, Ming; Ramesar, Rajkumar; Rasmussen, Lene Juel; Royer-Pokora, Brigitte; Scott, Rodney J.; Sijmons, Rolf; Tavtigian, Sean V.; Tops, Carli M.; Weber, Thomas; Wijnen, Juul; Woods, Michael O.; Macrae, Finlay; Genuardi, Maurizio

    2015-01-01

    Clinical classification of sequence variants identified in hereditary disease genes directly affects clinical management of patients and their relatives. The International Society for Gastrointestinal Hereditary Tumours (InSiGHT) undertook a collaborative effort to develop, test and apply a standardized classification scheme to constitutional variants in the Lynch Syndrome genes MLH1, MSH2, MSH6 and PMS2. Unpublished data submission was encouraged to assist variant classification, and recognized by microattribution. The scheme was refined by multidisciplinary expert committee review of clinical and functional data available for variants, applied to 2,360 sequence alterations, and disseminated online. Assessment using validated criteria altered classifications for 66% of 12,006 database entries. Clinical recommendations based on transparent evaluation are now possible for 1,370 variants not obviously protein-truncating from nomenclature. This large-scale endeavor will facilitate consistent management of suspected Lynch Syndrome families, and demonstrates the value of multidisciplinary collaboration for curation and classification of variants in public locus-specific databases. PMID:24362816

  19. Identification, mRNA expression, and functional analysis of chitin synthase 1 gene and its two alternative splicing variants in oriental fruit fly, Bactrocera dorsalis.

    PubMed

    Yang, Wen-Jia; Xu, Kang-Kang; Cong, Lin; Wang, Jin-Jun

    2013-01-01

    Two alternative splicing variants of chitin synthase 1 gene (BdCHS1) were cloned and characterized from the oriental fruit fly, Bactrocera dorsalis (Hendel). The cDNA of both variants (BdCHS1a and BdCHS1b) consisted of 5,552 nucleotides (nt), with an open reading frame (ORF) of 4,776 nt, encoding a protein of 1,592 amino acid residues, plus 685- and 88-nt of 5'- and 3'-noncoding regions, respectively. The alternative splicing site was located between positions 3,784-3,960 and formed a pair of mutually exclusive exons (a/b) that were same in size (177 nt), but showed only 65% identity at the nucleotide level. During B. dorsalis growth and development, BdCHS1 and BdCHS1a were both mainly expressed during the larval-pupal and pupal-adult transitions, while BdCHS1b was mainly expressed during pupal-adult metamorphosis and in the middle of the pupal stage. BdCHS1a was predominately expressed in the integument whereas BdCHS1b was mainly expressed in the trachea. The 20-hydroxyecdysone (20E) induced the expression of BdCHS1 and its variants. Injection of dsRNA of BdCHS1, BdCHS1a, and BdCHS1b into third-instar larvae significantly reduced the expression levels of the corresponding variants, generated phenotypic defects, and killed most of the treated larvae. Furthermore, silencing of BdCHS1 and BdCHS1a had a similar result in that the larva was trapped in old cuticle and died without tanning completely, while silencing of BdCHS1b has no effect on insect morphology. These results demonstrated that BdCHS1 plays an important role in the larval-pupal transition and the expression of BdCHS1 in B. dorsalis is regulated by 20E. PMID:23569438

  20. Identification, mRNA Expression, and Functional Analysis of Chitin Synthase 1 Gene and Its Two Alternative Splicing Variants in Oriental Fruit Fly, Bactrocera dorsalis

    PubMed Central

    Yang, Wen-Jia; Xu, Kang-Kang; Cong, Lin; Wang, Jin-Jun

    2013-01-01

    Two alternative splicing variants of chitin synthase 1 gene (BdCHS1) were cloned and characterized from the oriental fruit fly, Bactrocera dorsalis (Hendel). The cDNA of both variants (BdCHS1a and BdCHS1b) consisted of 5,552 nucleotides (nt), with an open reading frame (ORF) of 4,776 nt, encoding a protein of 1,592 amino acid residues, plus 685- and 88-nt of 5′- and 3′-noncoding regions, respectively. The alternative splicing site was located between positions 3,784-3,960 and formed a pair of mutually exclusive exons (a/b) that were same in size (177 nt), but showed only 65% identity at the nucleotide level. During B. dorsalis growth and development, BdCHS1 and BdCHS1a were both mainly expressed during the larval-pupal and pupal-adult transitions, while BdCHS1b was mainly expressed during pupal-adult metamorphosis and in the middle of the pupal stage. BdCHS1a was predominately expressed in the integument whereas BdCHS1b was mainly expressed in the trachea. The 20-hydroxyecdysone (20E) induced the expression of BdCHS1 and its variants. Injection of dsRNA of BdCHS1, BdCHS1a, and BdCHS1b into third-instar larvae significantly reduced the expression levels of the corresponding variants, generated phenotypic defects, and killed most of the treated larvae. Furthermore, silencing of BdCHS1 and BdCHS1a had a similar result in that the larva was trapped in old cuticle and died without tanning completely, while silencing of BdCHS1b has no effect on insect morphology. These results demonstrated that BdCHS1 plays an important role in the larval-pupal transition and the expression of BdCHS1 in B. dorsalis is regulated by 20E. PMID:23569438

  1. Towards secondary use of heterogeneous radio-oncological data for retrospective clinical trials: service-oriented connection of a central research database with image analysis tools

    NASA Astrophysics Data System (ADS)

    Bougatf, Nina; Bendl, Rolf; Debus, Jürgen

    2015-03-01

    Our overall objective is the utilization of heterogeneous and distributed radio-oncological data in retrospective clinical trials. Previously, we have successfully introduced a central research database for collection of heterogeneous data from distributed systems. The next step is the integration of image analysis tools in the standard retrieval process. Hence, analyses for complex medical questions can be processed automatically and facilitated immensely. In radiation oncology recurrence analysis is a central approach for the evaluation of therapeutic concepts. However, various analysis steps have to be performed like image registration, dose transformation and dose statistics. In this paper we show the integration of image analysis tools in the standard retrieval process by connecting them with our central research database using a service-oriented approach. A concrete problem from recurrence analysis has been selected to prove our concept exemplarily. We implemented service-oriented data collection and analysis tools to use them in a central analysis platform, which is based on a work flow management system. An analysis work flow has been designed that, at first, identifies patients in the research database fulfilling the inclusion criteria. Then the relevant imaging data is collected. Finally the imaging data is analyzed automatically. After the successful work flow execution, the results are available for further evaluation by a physician. As a result, the central research database has been connected successfully with automatic data collection and image analysis tools and the feasibility of our service-oriented approach has been demonstrated. In conclusion, our approach will simplify retrospective clinical trials in our department in future.

  2. Cellulase variants

    SciTech Connect

    Blazej, Robert; Toriello, Nicholas; Emrich, Charles; Cohen, Richard N.; Koppel, Nitzan

    2015-07-14

    This invention provides novel variant cellulolytic enzymes having improved activity and/or stability. In certain embodiments the variant cellulotyic enzymes comprise a glycoside hydrolase with or comprising a substitution at one or more positions corresponding to one or more of residues F64, A226, and/or E246 in Thermobifida fusca Cel9A enzyme. In certain embodiments the glycoside hydrolase is a variant of a family 9 glycoside hydrolase. In certain embodiments the glycoside hydrolase is a variant of a theme B family 9 glycoside hydrolase.

  3. Mesh Oriented datABase

    SciTech Connect

    Tautges, Timothy J.

    2004-04-01

    MOAB is a component for representing and evaluating mesh data. MOAB can store stuctured and unstructured mesh, consisting of elements in the finite element "zoo". The functional interface to MOAB is simple yet powerful, allowing the representation of many types of metadata commonly found on the mesh. MOAB is optimized for efficiency in space and time, based on access to mesh in chunks rather than through individual entities, while also versatile enough to support individual entity access. The MOAB data model consists of a mesh interface instance, mesh entities (vertices and elements), sets, and tags. Entities are addressed through handles rather than pointers, to allow the underlying representation of an entity to change without changing the handle to that entity. Sets are arbitrary groupings of mesh entities and other sets. Sets also support parent/child relationships as a relation distinct from sets containing other sets. The directed-graph provided by set parent/child relationships is useful for modeling topological relations from a geometric model or other metadata. Tags are named data which can be assigned to the mesh as a whole, individual entities, or sets. Tags are a mechanism for attaching data to individual entities and sets are a mechanism for describing relations between entities; the combination of these two mechanisms isa powerful yet simple interface for representing metadata or application-specific data. For example, sets and tags can be used together to describe geometric topology, boundary condition, and inter-processor interface groupings in a mesh. MOAB is used in several ways in various applications. MOAB serves as the underlying mesh data representation in the VERDE mesh verification code. MOAB can also be used as a mesh input mechanism, using mesh readers induded with MOAB, or as a t’anslator between mesh formats, using readers and writers included with MOAB.

  4. Mesh Oriented datABase

    Energy Science and Technology Software Center (ESTSC)

    2004-04-01

    MOAB is a component for representing and evaluating mesh data. MOAB can store stuctured and unstructured mesh, consisting of elements in the finite element "zoo". The functional interface to MOAB is simple yet powerful, allowing the representation of many types of metadata commonly found on the mesh. MOAB is optimized for efficiency in space and time, based on access to mesh in chunks rather than through individual entities, while also versatile enough to support individualmore » entity access. The MOAB data model consists of a mesh interface instance, mesh entities (vertices and elements), sets, and tags. Entities are addressed through handles rather than pointers, to allow the underlying representation of an entity to change without changing the handle to that entity. Sets are arbitrary groupings of mesh entities and other sets. Sets also support parent/child relationships as a relation distinct from sets containing other sets. The directed-graph provided by set parent/child relationships is useful for modeling topological relations from a geometric model or other metadata. Tags are named data which can be assigned to the mesh as a whole, individual entities, or sets. Tags are a mechanism for attaching data to individual entities and sets are a mechanism for describing relations between entities; the combination of these two mechanisms isa powerful yet simple interface for representing metadata or application-specific data. For example, sets and tags can be used together to describe geometric topology, boundary condition, and inter-processor interface groupings in a mesh. MOAB is used in several ways in various applications. MOAB serves as the underlying mesh data representation in the VERDE mesh verification code. MOAB can also be used as a mesh input mechanism, using mesh readers induded with MOAB, or as a t’anslator between mesh formats, using readers and writers included with MOAB.« less

  5. LQTS gene LOVD database.

    PubMed

    Zhang, Tao; Moss, Arthur; Cong, Peikuan; Pan, Min; Chang, Bingxi; Zheng, Liangrong; Fang, Quan; Zareba, Wojciech; Robinson, Jennifer; Lin, Changsong; Li, Zhongxiang; Wei, Junfang; Zeng, Qiang; Qi, Ming

    2010-11-01

    The Long QT Syndrome (LQTS) is a group of genetically heterogeneous disorders that predisposes young individuals to ventricular arrhythmias and sudden death. LQTS is mainly caused by mutations in genes encoding subunits of cardiac ion channels (KCNQ1, KCNH2,SCN5A, KCNE1, and KCNE2). Many other genes involved in LQTS have been described recently(KCNJ2, AKAP9, ANK2, CACNA1C, SCNA4B, SNTA1, and CAV3). We created an online database(http://www.genomed.org/LOVD/introduction.html) that provides information on variants in LQTS-associated genes. As of February 2010, the database contains 1738 unique variants in 12 genes. A total of 950 variants are considered pathogenic, 265 are possible pathogenic, 131 are unknown/unclassified, and 292 have no known pathogenicity. In addition to these mutations collected from published literature, we also submitted information on gene variants, including one possible novel pathogenic mutation in the KCNH2 splice site found in ten Chinese families with documented arrhythmias. The remote user is able to search the data and is encouraged to submit new mutations into the database. The LQTS database will become a powerful tool for both researchers and clinicians. PMID:20809527

  6. Who's Gonna Pay the Piper for Free Online Databases?

    ERIC Educational Resources Information Center

    Jacso, Peter

    1996-01-01

    Discusses new pricing models for some online services and considers the possibilities for the traditional online database market. Topics include multimedia music databases, including copyright implications; other retail-oriented databases; and paying for free databases with advertising. (LRW)

  7. Generalized Database Management System Support for Numeric Database Environments.

    ERIC Educational Resources Information Center

    Dominick, Wayne D.; Weathers, Peggy G.

    1982-01-01

    This overview of potential for utilizing database management systems (DBMS) within numeric database environments highlights: (1) major features, functions, and characteristics of DBMS; (2) applicability to numeric database environment needs and user needs; (3) current applications of DBMS technology; and (4) research-oriented and…

  8. Research directions in object-oriented programming

    SciTech Connect

    Shriver, B.; Wegner, P.

    1987-01-01

    The contributions in this book provide the most complete survey available in programming languages, databases, and programming semantics. The contents include: The Beta Programming Language: Common Objects: An Object-Oriented Programming Language with Encapsulation and Inheritance: Actors: A Conceptual Foundation for Object-Oriented Programmming: Vulcan; A model for Object-Based Inheritance; Definition Groups; Block-Structure and Object-Oriented Languages; A Mechanism for Specifying the Structure of Large Layered Programs; Classification in Object-Oriented Systems; Extensions and Foundations of Object-Oriented Programming; Object-Oriented Specification; Object-Oriented Databases; Development and Implementation of an Object-Oriented Database Management System; Maintaining Consistency in Databases with Changing Types; Object-Oriented Environments; An Object-Oriented Framework for Graphical Programming; A Substrate for Object-Oriented Interface Design.

  9. JAK/STAT signalling--an executable model assembled from molecule-centred modules demonstrating a module-oriented database concept for systems and synthetic biology.

    PubMed

    Blätke, Mary Ann; Dittrich, Anna; Rohr, Christian; Heiner, Monika; Schaper, Fred; Marwan, Wolfgang

    2013-06-01

    Mathematical models of molecular networks regulating biological processes in cells or organisms are most frequently designed as sets of ordinary differential equations. Various modularisation methods have been applied to reduce the complexity of models, to analyse their structural properties, to separate biological processes, or to reuse model parts. Taking the JAK/STAT signalling pathway with the extensive combinatorial cross-talk of its components as a case study, we make a natural approach to modularisation by creating one module for each biomolecule. Each module consists of a Petri net and associated metadata and is organised in a database publically accessible through a web interface (). The Petri net describes the reaction mechanism of a given biomolecule and its functional interactions with other components including relevant conformational states. The database is designed to support the curation, documentation, version control, and update of individual modules, and to assist the user in automatically composing complex models from modules. Biomolecule centred modules, associated metadata, and database support together allow the automatic creation of models by considering differential gene expression in given cell types or under certain physiological conditions or states of disease. Modularity also facilitates exploring the consequences of alternative molecular mechanisms by comparative simulation of automatically created models even for users without mathematical skills. Models may be selectively executed as an ODE system, stochastic, or qualitative models or hybrid and exported in the SBML format. The fully automated generation of models of redesigned networks by metadata-guided modification of modules representing biomolecules with mutated function or specificity is proposed. PMID:23443149

  10. Database computing in HEP

    SciTech Connect

    Day, C.T.; Loken, S.; MacFarlane, J.F. ); May, E.; Lifka, D.; Lusk, E.; Price, L.E. ); Baden, A. . Dept. of Physics); Grossman, R.; Qin, X. . Dept. of Mathematics, Statistics and Computer Science); Cormell, L.; Leibold, P.; Liu, D

    1992-01-01

    The major SSC experiments are expected to produce up to 1 Petabyte of data per year each. Once the primary reconstruction is completed by farms of inexpensive processors. I/O becomes a major factor in further analysis of the data. We believe that the application of database techniques can significantly reduce the I/O performed in these analyses. We present examples of such I/O reductions in prototype based on relational and object-oriented databases of CDF data samples.

  11. Database computing in HEP

    NASA Technical Reports Server (NTRS)

    Day, C. T.; Loken, S.; Macfarlane, J. F.; May, E.; Lifka, D.; Lusk, E.; Price, L. E.; Baden, A.; Grossman, R.; Qin, X.

    1992-01-01

    The major SSC experiments are expected to produce up to 1 Petabyte of data per year each. Once the primary reconstruction is completed by farms of inexpensive processors, I/O becomes a major factor in further analysis of the data. We believe that the application of database techniques can significantly reduce the I/O performed in these analyses. We present examples of such I/O reductions in prototypes based on relational and object-oriented databases of CDF data samples.

  12. MEROPS: the peptidase database

    PubMed Central

    Rawlings, Neil D.; Barrett, Alan J.

    2000-01-01

    Important additions have been made to the MEROPS database (http://www.bi.bbsrc.ac.uk/Merops/Merops.htm ). These include sequence alignments and cladograms for many of the families of peptidases, and these have proved very helpful in the difficult task of distinguishing the sequences of peptidases that are simply species variants of already known enzymes from those that represent novel enzymes. PMID:10592261

  13. Disease variants in genomes of 44 centenarians

    PubMed Central

    Freudenberg-Hua, Yun; Freudenberg, Jan; Vacic, Vladimir; Abhyankar, Avinash; Emde, Anne-Katrin; Ben-Avraham, Danny; Barzilai, Nir; Oschwald, Dayna; Christen, Erika; Koppel, Jeremy; Greenwald, Blaine; Darnell, Robert B; Germer, Soren; Atzmon, Gil; Davies, Peter

    2014-01-01

    To identify previously reported disease mutations that are compatible with extraordinary longevity, we screened the coding regions of the genomes of 44 Ashkenazi Jewish centenarians. Individual genome sequences were generated with 30× coverage on the Illumina HiSeq 2000 and single-nucleotide variants were called with the genome analysis toolkit (GATK). We identified 130 coding variants that were annotated as “pathogenic” or “likely pathogenic” based on the ClinVar database and that are infrequent in the general population. These variants were previously reported to cause a wide range of degenerative, neoplastic, and cardiac diseases with autosomal dominant, autosomal recessive, and X-linked inheritance. Several of these variants are located in genes that harbor actionable incidental findings, according to the recommendations of the American College of Medical Genetics. In addition, we found risk variants for late-onset neurodegenerative diseases, such as the APOE ε4 allele that was even present in a homozygous state in one centenarian who did not develop Alzheimer's disease. Our data demonstrate that the incidental finding of certain reported disease variants in an individual genome may not preclude an extraordinarily long life. When the observed variants are encountered in the context of clinical sequencing, it is thus important to exercise caution in justifying clinical decisions. In genome sequences of 44 Ashkenazi centenarians, we identified many coding variants that were annotated as “pathogenic” or “likely pathogenic” based on the ClinVar database. Our data demonstrate that the incidental finding of certain reported disease variants in an individual genome may not preclude an extraordinarily long life. When the observed variants are encountered in the context of clinical sequencing, it is thus important to exercise caution in justifying clinical decisions. PMID:25333069

  14. Biofuel Database

    National Institute of Standards and Technology Data Gateway

    Biofuel Database (Web, free access)   This database brings together structural, biological, and thermodynamic data for enzymes that are either in current use or are being considered for use in the production of biofuels.

  15. Electronic Databases.

    ERIC Educational Resources Information Center

    Williams, Martha E.

    1985-01-01

    Presents examples of bibliographic, full-text, and numeric databases. Also discusses how to access these databases online, aids to online retrieval, and several issues and trends (including copyright and downloading, transborder data flow, use of optical disc/videodisc technology, and changing roles in database generation and processing). (JN)

  16. Database Administrator

    ERIC Educational Resources Information Center

    Moore, Pam

    2010-01-01

    The Internet and electronic commerce (e-commerce) generate lots of data. Data must be stored, organized, and managed. Database administrators, or DBAs, work with database software to find ways to do this. They identify user needs, set up computer databases, and test systems. They ensure that systems perform as they should and add people to the…

  17. MOAB : a mesh-oriented database.

    SciTech Connect

    Tautges, Timothy James; Ernst, Corey; Stimpson, Clint; Meyers, Ray J.; Merkley, Karl

    2004-04-01

    A finite element mesh is used to decompose a continuous domain into a discretized representation. The finite element method solves PDEs on this mesh by modeling complex functions as a set of simple basis functions with coefficients at mesh vertices and prescribed continuity between elements. The mesh is one of the fundamental types of data linking the various tools in the FEA process (mesh generation, analysis, visualization, etc.). Thus, the representation of mesh data and operations on those data play a very important role in FEA-based simulations. MOAB is a component for representing and evaluating mesh data. MOAB can store structured and unstructured mesh, consisting of elements in the finite element 'zoo'. The functional interface to MOAB is simple yet powerful, allowing the representation of many types of metadata commonly found on the mesh. MOAB is optimized for efficiency in space and time, based on access to mesh in chunks rather than through individual entities, while also versatile enough to support individual entity access. The MOAB data model consists of a mesh interface instance, mesh entities (vertices and elements), sets, and tags. Entities are addressed through handles rather than pointers, to allow the underlying representation of an entity to change without changing the handle to that entity. Sets are arbitrary groupings of mesh entities and other sets. Sets also support parent/child relationships as a relation distinct from sets containing other sets. The directed-graph provided by set parent/child relationships is useful for modeling topological relations from a geometric model or other metadata. Tags are named data which can be assigned to the mesh as a whole, individual entities, or sets. Tags are a mechanism for attaching data to individual entities and sets are a mechanism for describing relations between entities; the combination of these two mechanisms is a powerful yet simple interface for representing metadata or application-specific data. For example, sets and tags can be used together to describe geometric topology, boundary condition, and inter-processor interface groupings in a mesh. MOAB is used in several ways in various applications. MOAB serves as the underlying mesh data representation in the VERDE mesh verification code. MOAB can also be used as a mesh input mechanism, using mesh readers included with MOAB, or as a translator between mesh formats, using readers and writers included with MOAB. The remainder of this report is organized as follows. Section 2, 'Getting Started', provides a few simple examples of using MOAB to perform simple tasks on a mesh. Section 3 discusses the MOAB data model in more detail, including some aspects of the implementation. Section 4 summarizes the MOAB function API. Section 5 describes some of the tools included with MOAB, and the implementation of mesh readers/writers for MOAB. Section 6 contains a brief description of MOAB's relation to the TSTT mesh interface. Section 7 gives a conclusion and future plans for MOAB development. Section 8 gives references cited in this report. A reference description of the full MOAB API is contained in Section 9.

  18. Italian Rett database and biobank.

    PubMed

    Sampieri, Katia; Meloni, Ilaria; Scala, Elisa; Ariani, Francesca; Caselli, Rossella; Pescucci, Chiara; Longo, Ilaria; Artuso, Rosangela; Bruttini, Mirella; Mencarelli, Maria Antonietta; Speciale, Caterina; Causarano, Vincenza; Hayek, Giuseppe; Zappella, Michele; Renieri, Alessandra; Mari, Francesca

    2007-04-01

    Rett syndrome is the second most common cause of severe mental retardation in females, with an incidence of approximately 1 out of 10,000 live female births. In addition to the classic form, a number of Rett variants have been described. MECP2 gene mutations are responsible for about 90% of classic cases and for a lower percentage of variant cases. Recently, CDKL5 mutations have been identified in the early onset seizures variant and other atypical Rett patients. While the high percentage of MECP2 mutations in classic patients supports the hypothesis of a single disease gene, the low frequency of mutated variant cases suggests genetic heterogeneity. Since 1998, we have performed clinical evaluation and molecular analysis of a large number of Italian Rett patients. The Italian Rett Syndrome (RTT) database has been developed to share data and samples of our RTT collection with the scientific community (http://www.biobank.unisi.it). This is the first RTT database that has been connected with a biobank. It allows the user to immediately visualize the list of available RTT samples and, using the "Search by" tool, to rapidly select those with specific clinical and molecular features. By contacting bank curators, users can request the samples of interest for their studies. This database encourages collaboration projects with clinicians and researchers from around the world and provides important resources that will help to better define the pathogenic mechanisms underlying Rett syndrome. PMID:17186495

  19. Variant Selection during Alpha Precipitation in Titanium Alloys: A Simulation Study

    NASA Astrophysics Data System (ADS)

    Shi, Rongpei

    Variant selection of alpha phase during its precipitation from beta matrix plays a key role in determining transformation texture and final mechanical properties of alpha/beta and beta titanium alloys. In this study we develop a three-dimensional quantitative phase field model (PFM) to predict variant selection and microstructure evolution during beta to alpha transformation in polycrystalline Ti-6Al-4V under the influence of different processing variables. The model links its inputs directly to thermodynamic and mobility databases, and incorporates crystallography of BCC to HCP transformation, elastic anisotropy, defects within semi-coherent alpha/beta interfaces and elastic inhomogeneities among different beta grains. In particular, microstructure and transformation texture evolution are treated simultaneously via orientation distribution function (ODF) modeling of alpha/beta two-phase microstructure in beta polycrystalline obtained by PFM. It is found that, for a given undercooling, the development of transformation texture of the alpha phase due to variant selection during precipitation depends on both externally applied stress or strain, initial texture state of parent beta sample and internal stress generated by the precipitation reaction itself. Moreover, the growth of pre-existing widmanstatten alpha precipitates is accompanied by selective nucleation and growth of secondary alpha plates of preferred variants. We further develop a crystallographic model based on the ideal Burgers orientation relationship (BOR) between GBalpha and one of the two adjacent beta grains to investigate how a prior beta grain boundary contributes to variant selection of grain boundary allotriomorph (GBalpha). The model is able to predict all possible special beta grain boundaries where GBalpha is able to maintain BOR with two neighboring grain. In particular, the model has been used to evaluate the validity of all current empirical variant selection rules to obtain more insight of

  20. Statistical databases

    SciTech Connect

    Kogalovskii, M.R.

    1995-03-01

    This paper presents a review of problems related to statistical database systems, which are wide-spread in various fields of activity. Statistical databases (SDB) are referred to as databases that consist of data and are used for statistical analysis. Topics under consideration are: SDB peculiarities, properties of data models adequate for SDB requirements, metadata functions, null-value problems, SDB compromise protection problems, stored data compression techniques, and statistical data representation means. Also examined is whether the present Database Management Systems (DBMS) satisfy the SDB requirements. Some actual research directions in SDB systems are considered.

  1. Database Manager

    ERIC Educational Resources Information Center

    Martin, Andrew

    2010-01-01

    It is normal practice today for organizations to store large quantities of records of related information as computer-based files or databases. Purposeful information is retrieved by performing queries on the data sets. The purpose of DATABASE MANAGER is to communicate to students the method by which the computer performs these queries. This…

  2. Maize databases

    Technology Transfer Automated Retrieval System (TEKTRAN)

    This chapter is a succinct overview of maize data held in the species-specific database MaizeGDB (the Maize Genomics and Genetics Database), and selected multi-species data repositories, such as Gramene/Ensembl Plants, Phytozome, UniProt and the National Center for Biotechnology Information (NCBI), ...

  3. Genome databases

    SciTech Connect

    Courteau, J.

    1991-10-11

    Since the Genome Project began several years ago, a plethora of databases have been developed or are in the works. They range from the massive Genome Data Base at Johns Hopkins University, the central repository of all gene mapping information, to small databases focusing on single chromosomes or organisms. Some are publicly available, others are essentially private electronic lab notebooks. Still others limit access to a consortium of researchers working on, say, a single human chromosome. An increasing number incorporate sophisticated search and analytical software, while others operate as little more than data lists. In consultation with numerous experts in the field, a list has been compiled of some key genome-related databases. The list was not limited to map and sequence databases but also included the tools investigators use to interpret and elucidate genetic data, such as protein sequence and protein structure databases. Because a major goal of the Genome Project is to map and sequence the genomes of several experimental animals, including E. coli, yeast, fruit fly, nematode, and mouse, the available databases for those organisms are listed as well. The author also includes several databases that are still under development - including some ambitious efforts that go beyond data compilation to create what are being called electronic research communities, enabling many users, rather than just one or a few curators, to add or edit the data and tag it as raw or confirmed.

  4. Histone variants and epigenetics.

    PubMed

    Henikoff, Steven; Smith, M Mitchell

    2015-01-01

    Histones package and compact DNA by assembling into nucleosome core particles. Most histones are synthesized at S phase for rapid deposition behind replication forks. In addition, the replacement of histones deposited during S phase by variants that can be deposited independently of replication provide the most fundamental level of chromatin differentiation. Alternative mechanisms for depositing different variants can potentially establish and maintain epigenetic states. Variants have also evolved crucial roles in chromosome segregation, transcriptional regulation, DNA repair, and other processes. Investigations into the evolution, structure, and metabolism of histone variants provide a foundation for understanding the participation of chromatin in important cellular processes and in epigenetic memory. PMID:25561719

  5. BIOMARKERS DATABASE

    EPA Science Inventory

    This database was developed by assembling and evaluating the literature relevant to human biomarkers. It catalogues and evaluates the usefulness of biomarkers of exposure, susceptibility and effect which may be relevant for a longitudinal cohort study. In addition to describing ...

  6. Experiment Databases

    NASA Astrophysics Data System (ADS)

    Vanschoren, Joaquin; Blockeel, Hendrik

    Next to running machine learning algorithms based on inductive queries, much can be learned by immediately querying the combined results of many prior studies. Indeed, all around the globe, thousands of machine learning experiments are being executed on a daily basis, generating a constant stream of empirical information on machine learning techniques. While the information contained in these experiments might have many uses beyond their original intent, results are typically described very concisely in papers and discarded afterwards. If we properly store and organize these results in central databases, they can be immediately reused for further analysis, thus boosting future research. In this chapter, we propose the use of experiment databases: databases designed to collect all the necessary details of these experiments, and to intelligently organize them in online repositories to enable fast and thorough analysis of a myriad of collected results. They constitute an additional, queriable source of empirical meta-data based on principled descriptions of algorithm executions, without reimplementing the algorithms in an inductive database. As such, they engender a very dynamic, collaborative approach to experimentation, in which experiments can be freely shared, linked together, and immediately reused by researchers all over the world. They can be set up for personal use, to share results within a lab or to create open, community-wide repositories. Here, we provide a high-level overview of their design, and use an existing experiment database to answer various interesting research questions about machine learning algorithms and to verify a number of recent studies.

  7. VCF-Miner: GUI-based application for mining variants and annotations stored in VCF files.

    PubMed

    Hart, Steven N; Duffy, Patrick; Quest, Daniel J; Hossain, Asif; Meiners, Mike A; Kocher, Jean-Pierre

    2016-03-01

    Next-generation sequencing platforms are widely used to discover variants associated with disease. The processing of sequencing data involves read alignment, variant calling, variant annotation and variant filtering. The standard file format to hold variant calls is the variant call format (VCF) file. According to the format specifications, any arbitrary annotation can be added to the VCF file for downstream processing. However, most downstream analysis programs disregard annotations already present in the VCF and re-annotate variants using the annotation provided by that particular program. This precludes investigators who have collected information on variants from literature or other sources from including these annotations in the filtering and mining of variants. We have developed VCF-Miner, a graphical user interface-based stand-alone tool, to mine variants and annotation stored in the VCF. Powered by a MongoDB database engine, VCF-Miner enables the stepwise trimming of non-relevant variants. The grouping feature implemented in VCF-Miner can be used to identify somatic variants by contrasting variants in tumor and in normal samples or to identify recessive/dominant variants in family studies. It is not limited to human data, but can also be extended to include non-diploid organisms. It also supports copy number or any other variant type supported by the VCF specification. VCF-Miner can be used on a personal computer or large institutional servers and is freely available for download from http://bioinformaticstools.mayo.edu/research/vcf-miner/. PMID:26210358

  8. VCF-Miner: GUI-based application for mining variants and annotations stored in VCF files

    PubMed Central

    Hart, Steven N.; Duffy, Patrick; Quest, Daniel J.; Hossain, Asif; Meiners, Mike A

    2016-01-01

    Next-generation sequencing platforms are widely used to discover variants associated with disease. The processing of sequencing data involves read alignment, variant calling, variant annotation and variant filtering. The standard file format to hold variant calls is the variant call format (VCF) file. According to the format specifications, any arbitrary annotation can be added to the VCF file for downstream processing. However, most downstream analysis programs disregard annotations already present in the VCF and re-annotate variants using the annotation provided by that particular program. This precludes investigators who have collected information on variants from literature or other sources from including these annotations in the filtering and mining of variants. We have developed VCF-Miner, a graphical user interface-based stand-alone tool, to mine variants and annotation stored in the VCF. Powered by a MongoDB database engine, VCF-Miner enables the stepwise trimming of non-relevant variants. The grouping feature implemented in VCF-Miner can be used to identify somatic variants by contrasting variants in tumor and in normal samples or to identify recessive/dominant variants in family studies. It is not limited to human data, but can also be extended to include non-diploid organisms. It also supports copy number or any other variant type supported by the VCF specification. VCF-Miner can be used on a personal computer or large institutional servers and is freely available for download from http://bioinformaticstools.mayo.edu/research/vcf-miner/. PMID:26210358

  9. Solubility Database

    National Institute of Standards and Technology Data Gateway

    SRD 106 IUPAC-NIST Solubility Database (Web, free access)   These solubilities are compiled from 18 volumes (Click here for List) of the International Union for Pure and Applied Chemistry(IUPAC)-NIST Solubility Data Series. The database includes liquid-liquid, solid-liquid, and gas-liquid systems. Typical solvents and solutes include water, seawater, heavy water, inorganic compounds, and a variety of organic compounds such as hydrocarbons, halogenated hydrocarbons, alcohols, acids, esters and nitrogen compounds. There are over 67,500 solubility measurements and over 1800 references.

  10. The Porcine Immunology and Nutrition Resource Database

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Diverse genomics-based databases have been developed to facilitate research with human and rodent models. Current porcine gene databases, however, lack the nutritional and immunological orientation and robust annotation to design effective molecular tools to study relevant pig models. To address t...

  11. Electronic Databases for Linguistic and Language Research.

    ERIC Educational Resources Information Center

    Oshika, Beatrice T.; Krausse, Sylvia C.

    1992-01-01

    Reviews electronic language data oriented specifically toward linguistic and language research, including machine-readable versions of established dictionaries; text databases, including newspapers and books; speech databases with phonetic/orthographic transcriptions; and bulletin boards and news groups available over computer networks. (17…

  12. Mucopolysaccharidosis: A New Variant?

    ERIC Educational Resources Information Center

    Primrose, D. A.

    1972-01-01

    Described is a possibly new variant of mucopolysaccharidosis characterized by progressive mental and motor deficiency, bone abnormalities, a generalized skin lesion, and abnormal mucopolysaccharides in the urine as seen in a 20-year-old female. (DB)

  13. Normal Variants in Echocardiography.

    PubMed

    Sanchez, Daniel R; Bryg, Robert J

    2016-11-01

    Echocardiography is a powerful and convenient tool used routinely in the cardiac evaluation of many patients. Improved resolution and visualization of cardiac anatomy has led to the discovery of many normal variant structures that have no known pathologic consequence. Importantly, these findings may masquerade as pathology prompting unnecessary further evaluation at the expense of anxiety, cost, or potential harm. This review provides an updated and comprehensive collection of normal anatomic variants on both transthoracic and transesophageal imaging. PMID:27612473

  14. Drinking Water Treatability Database (Database)

    EPA Science Inventory

    The drinking Water Treatability Database (TDB) will provide data taken from the literature on the control of contaminants in drinking water, and will be housed on an interactive, publicly-available USEPA web site. It can be used for identifying effective treatment processes, rec...

  15. Whose Orientations?

    ERIC Educational Resources Information Center

    Gutoff, Joshua

    2010-01-01

    This article presents the author's response to Jon A. Levisohn's article entitled "A Menu of Orientations in the Teaching of Rabbinic Literature." While the "menu" Levisohn describes in his groundbreaking work on orientations to the teaching of rabbinic texts will almost certainly be refined over time, even as it stands this article should be of…

  16. Orienteering injuries

    PubMed Central

    Folan, Jean M.

    1982-01-01

    At the Irish National Orienteering Championships in 1981 a survey of the injuries occurring over the two days of competition was carried out. Of 285 individual competitors there was a percentage injury rate of 5.26%. The article discusses the injuries and aspects of safety in orienteering. Imagesp236-ap237-ap237-bp238-ap239-ap240-a PMID:7159815

  17. Atlas of Computed Tomography Variants

    SciTech Connect

    Kuhns, L.R.; Seeger, J.

    1983-01-01

    Atlas of Computed Tomography Variants is unique in that, while others of its kind may include plain film, roentgen variants, it concentrates solely on CT images of variants which may simulate disease. Organized into four regions, it presents dicussions covering CT variants of the skull, neck and spine; thorax; abdomen; and extremities-featuring a section on the head.

  18. Plant snoRNA database

    PubMed Central

    Brown, John W. S.; Echeverria, Manuel; Qu, Liang-Hu; Lowe, Todd M.; Bachellerie, Jean-Pierre; Hüttenhofer, Alexander; Kastenmayer, James P.; Green, Pamela J.; Shaw, Paul; Marshall, Dave F.

    2003-01-01

    The Plant snoRNA database (http://www.scri.sari.ac.uk/plant_snoRNA/) provides information on small nucleolar RNAs from Arabidopsis and eighteen other plant species. Information includes sequences, expression data, methylation and pseudouridylation target modification sites, initial gene organization (polycistronic, single gene and intronic) and the number of gene variants. The Arabidopsis information is divided into box C/D and box H/ACA snoRNAs, and within each of these groups, by target sites in rRNA, snRNA or unknown. Alignments of orthologous genes and gene variants from different plant species are available for many snoRNA genes. Plant snoRNA genes have been given a standard nomenclature, designed wherever possible, to provide a consistent identity with yeast and human orthologues. PMID:12520043

  19. Analyzing Orientations

    NASA Astrophysics Data System (ADS)

    Ruggles, Clive L. N.

    Archaeoastronomical field survey typically involves the measurement of structural orientations (i.e., orientations along and between built structures) in relation to the visible landscape and particularly the surrounding horizon. This chapter focuses on the process of analyzing the astronomical potential of oriented structures, whether in the field or as a desktop appraisal, with the aim of establishing the archaeoastronomical "facts". It does not address questions of data selection (see instead Chap. 25, "Best Practice for Evaluating the Astronomical Significance of Archaeological Sites", 10.1007/978-1-4614-6141-8_25) or interpretation (see Chap. 24, "Nature and Analysis of Material Evidence Relevant to Archaeoastronomy", 10.1007/978-1-4614-6141-8_22). The main necessity is to determine the azimuth, horizon altitude, and declination in the direction "indicated" by any structural orientation. Normally, there are a range of possibilities, reflecting the various errors and uncertainties in estimating the intended (or, at least, the constructed) orientation, and in more formal approaches an attempt is made to assign a probability distribution extending over a spread of declinations. These probability distributions can then be cumulated in order to visualize and analyze the combined data from several orientations, so as to identify any consistent astronomical associations that can then be correlated with the declinations of particular astronomical objects or phenomena at any era in the past. The whole process raises various procedural and methodological issues and does not proceed in isolation from the consideration of corroborative data, which is essential in order to develop viable cultural interpretations.

  20. Mycosis Fungoides Variants.

    PubMed

    Martínez-Escala, M Estela; González, Belén Rubio; Guitart, Joan

    2014-06-01

    Mycosis fungoides (MF) is a cutaneous T-cell lymphoma that usually manifests as patches and plaques with a propensity for nonphotoexposed areas. MF is a common mimicker of inflammatory and infectious skin diseases, because it can be manifested with a wide variety of clinical and pathologic presentations. These atypical presentations of MF may be difficult to diagnose, requiring a high level of suspicion and careful clinicopathologic correlation. Within this array of clinical presentations, the World Health Organization classification recognizes 3 MF variants: folliculotropic MF, pagetoid reticulosis, and granulomatous slack skin. These 3 variants, as well as hypopigmented MF, are addressed in this article. PMID:26837197

  1. Fine-Mapping of Common Genetic Variants Associated with Colorectal Tumor Risk Identified Potential Functional Variants

    PubMed Central

    Gala, Manish; Abecasis, Goncalo; Bezieau, Stephane; Brenner, Hermann; Butterbach, Katja; Caan, Bette J.; Carlson, Christopher S.; Casey, Graham; Chang-Claude, Jenny; Conti, David V.; Curtis, Keith R.; Duggan, David; Gallinger, Steven; Haile, Robert W.; Harrison, Tabitha A.; Hayes, Richard B.; Hoffmeister, Michael; Hopper, John L.; Hudson, Thomas J.; Jenkins, Mark A.; Küry, Sébastien; Le Marchand, Loic; Leal, Suzanne M.; Newcomb, Polly A.; Nickerson, Deborah A.; Potter, John D.; Schoen, Robert E.; Schumacher, Fredrick R.; Seminara, Daniela; Slattery, Martha L.; Hsu, Li; Chan, Andrew T.; White, Emily; Berndt, Sonja I.; Peters, Ulrike

    2016-01-01

    Genome-wide association studies (GWAS) have identified many common single nucleotide polymorphisms (SNPs) associated with colorectal cancer risk. These SNPs may tag correlated variants with biological importance. Fine-mapping around GWAS loci can facilitate detection of functional candidates and additional independent risk variants. We analyzed 11,900 cases and 14,311 controls in the Genetics and Epidemiology of Colorectal Cancer Consortium and the Colon Cancer Family Registry. To fine-map genomic regions containing all known common risk variants, we imputed high-density genetic data from the 1000 Genomes Project. We tested single-variant associations with colorectal tumor risk for all variants spanning genomic regions 250-kb upstream or downstream of 31 GWAS-identified SNPs (index SNPs). We queried the University of California, Santa Cruz Genome Browser to examine evidence for biological function. Index SNPs did not show the strongest association signals with colorectal tumor risk in their respective genomic regions. Bioinformatics analysis of SNPs showing smaller P-values in each region revealed 21 functional candidates in 12 loci (5q31.1, 8q24, 11q13.4, 11q23, 12p13.32, 12q24.21, 14q22.2, 15q13, 18q21, 19q13.1, 20p12.3, and 20q13.33). We did not observe evidence of additional independent association signals in GWAS-identified regions. Our results support the utility of integrating data from comprehensive fine-mapping with expanding publicly available genomic databases to help clarify GWAS associations and identify functional candidates that warrant more onerous laboratory follow-up. Such efforts may aid the eventual discovery of disease-causing variant(s). PMID:27379672

  2. Fine-Mapping of Common Genetic Variants Associated with Colorectal Tumor Risk Identified Potential Functional Variants.

    PubMed

    Du, Mengmeng; Jiao, Shuo; Bien, Stephanie A; Gala, Manish; Abecasis, Goncalo; Bezieau, Stephane; Brenner, Hermann; Butterbach, Katja; Caan, Bette J; Carlson, Christopher S; Casey, Graham; Chang-Claude, Jenny; Conti, David V; Curtis, Keith R; Duggan, David; Gallinger, Steven; Haile, Robert W; Harrison, Tabitha A; Hayes, Richard B; Hoffmeister, Michael; Hopper, John L; Hudson, Thomas J; Jenkins, Mark A; Küry, Sébastien; Le Marchand, Loic; Leal, Suzanne M; Newcomb, Polly A; Nickerson, Deborah A; Potter, John D; Schoen, Robert E; Schumacher, Fredrick R; Seminara, Daniela; Slattery, Martha L; Hsu, Li; Chan, Andrew T; White, Emily; Berndt, Sonja I; Peters, Ulrike

    2016-01-01

    Genome-wide association studies (GWAS) have identified many common single nucleotide polymorphisms (SNPs) associated with colorectal cancer risk. These SNPs may tag correlated variants with biological importance. Fine-mapping around GWAS loci can facilitate detection of functional candidates and additional independent risk variants. We analyzed 11,900 cases and 14,311 controls in the Genetics and Epidemiology of Colorectal Cancer Consortium and the Colon Cancer Family Registry. To fine-map genomic regions containing all known common risk variants, we imputed high-density genetic data from the 1000 Genomes Project. We tested single-variant associations with colorectal tumor risk for all variants spanning genomic regions 250-kb upstream or downstream of 31 GWAS-identified SNPs (index SNPs). We queried the University of California, Santa Cruz Genome Browser to examine evidence for biological function. Index SNPs did not show the strongest association signals with colorectal tumor risk in their respective genomic regions. Bioinformatics analysis of SNPs showing smaller P-values in each region revealed 21 functional candidates in 12 loci (5q31.1, 8q24, 11q13.4, 11q23, 12p13.32, 12q24.21, 14q22.2, 15q13, 18q21, 19q13.1, 20p12.3, and 20q13.33). We did not observe evidence of additional independent association signals in GWAS-identified regions. Our results support the utility of integrating data from comprehensive fine-mapping with expanding publicly available genomic databases to help clarify GWAS associations and identify functional candidates that warrant more onerous laboratory follow-up. Such efforts may aid the eventual discovery of disease-causing variant(s). PMID:27379672

  3. Towards the Architecture of an Instructional Multimedia Database.

    ERIC Educational Resources Information Center

    Verhagen, Plin W.; Bestebreurtje, R.

    1994-01-01

    Discussion of multimedia databases in education focuses on the development of an adaptable database in The Netherlands that uses optical storage media to hold the audiovisual components. Highlights include types of applications; types of users; accessibility; adaptation; an object-oriented approach; levels of the database architecture; and…

  4. [Variant of Bacillus anthracoides].

    PubMed

    Galanina, L A; Bekhtereva, M N; Kraĭnova, O A

    1979-01-01

    A comparative study of the Bacillus anthracoides culture and its variant has shown that the latter differs drastically from the parent culture in the shape and consistence of colonies, the size of spores and vegetative cells, the rate of spore germination in MPB, and the resistence to steam treatment and chloroactive disinfectants. PMID:423806

  5. Stackfile Database

    NASA Technical Reports Server (NTRS)

    deVarvalho, Robert; Desai, Shailen D.; Haines, Bruce J.; Kruizinga, Gerhard L.; Gilmer, Christopher

    2013-01-01

    This software provides storage retrieval and analysis functionality for managing satellite altimetry data. It improves the efficiency and analysis capabilities of existing database software with improved flexibility and documentation. It offers flexibility in the type of data that can be stored. There is efficient retrieval either across the spatial domain or the time domain. Built-in analysis tools are provided for frequently performed altimetry tasks. This software package is used for storing and manipulating satellite measurement data. It was developed with a focus on handling the requirements of repeat-track altimetry missions such as Topex and Jason. It was, however, designed to work with a wide variety of satellite measurement data [e.g., Gravity Recovery And Climate Experiment -- GRACE). The software consists of several command-line tools for importing, retrieving, and analyzing satellite measurement data.

  6. DBATE: database of alternative transcripts expression.

    PubMed

    Bianchi, Valerio; Colantoni, Alessio; Calderone, Alberto; Ausiello, Gabriele; Ferrè, Fabrizio; Helmer-Citterich, Manuela

    2013-01-01

    The use of high-throughput RNA sequencing technology (RNA-seq) allows whole transcriptome analysis, providing an unbiased and unabridged view of alternative transcript expression. Coupling splicing variant-specific expression with its functional inference is still an open and difficult issue for which we created the DataBase of Alternative Transcripts Expression (DBATE), a web-based repository storing expression values and functional annotation of alternative splicing variants. We processed 13 large RNA-seq panels from human healthy tissues and in disease conditions, reporting expression levels and functional annotations gathered and integrated from different sources for each splicing variant, using a variant-specific annotation transfer pipeline. The possibility to perform complex queries by cross-referencing different functional annotations permits the retrieval of desired subsets of splicing variant expression values that can be visualized in several ways, from simple to more informative. DBATE is intended as a novel tool to help appreciate how, and possibly why, the transcriptome expression is shaped. DATABASE URL: http://bioinformatica.uniroma2.it/DBATE/. PMID:23842462

  7. Carrier frequency of guanidinoacetate methyltransferase deficiency in the general population by functional characterization of missense variants in the GAMT gene.

    PubMed

    Desroches, Caro-Lyne; Patel, Jaina; Wang, Peixiang; Minassian, Berge; Marshall, Christian R; Salomons, Gajja S; Mercimek-Mahmutoglu, Saadet

    2015-12-01

    Guanidinoacetate methyltransferase (GAMT) deficiency is a neurodegenerative disease. Although no symptomatic patients on treatment achieved normal neurodevelopment, three asymptomatic newborns were reported with normal neurodevelopmental outcome on neonatal treatment. GAMT deficiency is therefore a candidate for newborn screening programs, but there are no studies for the carrier frequency of this disease in the general population. To determine carrier frequency of GAMT deficiency, we studied the variants in the GAMT gene reported in the Exome Variant Server database and performed functional characterization of missense variants. We used previously cloned GAMT transcript variant 1 (7 missense variants) and cloned a novel GAMT transcript variant 2 (5 missense variants). The latter was used in Exome Variant Server database according to recommendations of the Human Genome Variation Society. There were 4 missense variants (1 previously reported and 3 novel) with low GAMT enzyme activity indicating pathogenicity. Additionally, there was one novel frameshift and one novel nonsense variant likely pathogenic. There was no measurable GAMT enzyme activity in the wild type of GAMT transcript variant 2. We concluded that GAMT transcript variant 2 is not involved in GAMT protein synthesis. For this reason, Human Genome Variation Society should use mutation nomenclature according to the coding region of the GAMT transcript variant 1. The carrier frequency of GAMT deficiency was 0.123 % in the general population. As early diagnosis results in normal neurodevelopmental outcome, GAMT deficiency should be included in newborn screening programs to diagnose individuals at the asymptomatic stage of the disease to prevent permanent neurodevelopmental disability. PMID:26003046

  8. The 2014 Nucleic Acids Research Database Issue and an updated NAR online Molecular Biology Database Collection

    PubMed Central

    Fernández-Suárez, Xosé M.; Rigden, Daniel J.; Galperin, Michael Y.

    2014-01-01

    The 2014 Nucleic Acids Research Database Issue includes descriptions of 58 new molecular biology databases and recent updates to 123 databases previously featured in NAR or other journals. For convenience, the issue is now divided into eight sections that reflect major subject categories. Among the highlights of this issue are six databases of the transcription factor binding sites in various organisms and updates on such popular databases as CAZy, Database of Genomic Variants (DGV), dbGaP, DrugBank, KEGG, miRBase, Pfam, Reactome, SEED, TCDB and UniProt. There is a strong block of structural databases, which includes, among others, the new RNA Bricks database, updates on PDBe, PDBsum, ArchDB, Gene3D, ModBase, Nucleic Acid Database and the recently revived iPfam database. An update on the NCBI’s MMDB describes VAST+, an improved tool for protein structure comparison. Two articles highlight the development of the Structural Classification of Proteins (SCOP) database: one describes SCOPe, which automates assignment of new structures to the existing SCOP hierarchy; the other one describes the first version of SCOP2, with its more flexible approach to classifying protein structures. This issue also includes a collection of articles on bacterial taxonomy and metagenomics, which includes updates on the List of Prokaryotic Names with Standing in Nomenclature (LPSN), Ribosomal Database Project (RDP), the Silva/LTP project and several new metagenomics resources. The NAR online Molecular Biology Database Collection, http://www.oxfordjournals.org/nar/database/c/, has been expanded to 1552 databases. The entire Database Issue is freely available online on the Nucleic Acids Research website (http://nar.oxfordjournals.org/). PMID:24316579

  9. The 2014 Nucleic Acids Research Database Issue and an updated NAR online Molecular Biology Database Collection.

    PubMed

    Fernández-Suárez, Xosé M; Rigden, Daniel J; Galperin, Michael Y

    2014-01-01

    The 2014 Nucleic Acids Research Database Issue includes descriptions of 58 new molecular biology databases and recent updates to 123 databases previously featured in NAR or other journals. For convenience, the issue is now divided into eight sections that reflect major subject categories. Among the highlights of this issue are six databases of the transcription factor binding sites in various organisms and updates on such popular databases as CAZy, Database of Genomic Variants (DGV), dbGaP, DrugBank, KEGG, miRBase, Pfam, Reactome, SEED, TCDB and UniProt. There is a strong block of structural databases, which includes, among others, the new RNA Bricks database, updates on PDBe, PDBsum, ArchDB, Gene3D, ModBase, Nucleic Acid Database and the recently revived iPfam database. An update on the NCBI's MMDB describes VAST+, an improved tool for protein structure comparison. Two articles highlight the development of the Structural Classification of Proteins (SCOP) database: one describes SCOPe, which automates assignment of new structures to the existing SCOP hierarchy; the other one describes the first version of SCOP2, with its more flexible approach to classifying protein structures. This issue also includes a collection of articles on bacterial taxonomy and metagenomics, which includes updates on the List of Prokaryotic Names with Standing in Nomenclature (LPSN), Ribosomal Database Project (RDP), the Silva/LTP project and several new metagenomics resources. The NAR online Molecular Biology Database Collection, http://www.oxfordjournals.org/nar/database/c/, has been expanded to 1552 databases. The entire Database Issue is freely available online on the Nucleic Acids Research website (http://nar.oxfordjournals.org/). PMID:24316579

  10. Database Marketplace 2002: The Database Universe.

    ERIC Educational Resources Information Center

    Tenopir, Carol; Baker, Gayle; Robinson, William

    2002-01-01

    Reviews the database industry over the past year, including new companies and services, company closures, popular database formats, popular access methods, and changes in existing products and services. Lists 33 firms and their database services; 33 firms and their database products; and 61 company profiles. (LRW)

  11. Industrial Orientation.

    ERIC Educational Resources Information Center

    Rasor, Leslie; Brooks, Valerie

    These eight modules for an industrial orientation class were developed by a project to design an interdisciplinary program of basic skills training for disadvantaged students in a Construction Technology Program (see Note). The Drafting module overviews drafting career opportunities, job markets, salaries, educational requirements, and basic…

  12. Meningococcal vaccine antigen diversity in global databases

    PubMed Central

    Brehony, C; Hill, DM; Lucidarme, J; Borrow, R; Maiden, MC

    2016-01-01

    The lack of an anti-capsular vaccine against serogroup B meningococcal disease has necessitated the exploration of alternative vaccine candidates, mostly proteins exhibiting varying degrees of antigenic variation. Analysis of variants of antigen-encoding genes is facilitated by publicly accessible online sequence repositories, such as the Neisseria PubMLST database and the associated Meningitis Research Foundation Meningococcus Genome Library (MRF-MGL). We investigated six proposed meningococcal vaccine formulations by deducing the prevalence of their components in the isolates represented in these repositories. Despite high diversity, a limited number of antigenic variants of each of the vaccine antigens were prevalent, with strong associations of particular variant combinations with given serogroups and genotypes. In the MRF-MGL and globally, the highest levels of identical sequences were observed with multicomponent/multivariant vaccines. Our analyses further demonstrated that certain combinations of antigen variants were prevalent over periods of decades in widely differing locations, indicating that vaccine formulations containing a judicious choice of antigen variants have potential for long-term protection across geographic regions. The data further indicated that formulations with multiple variants would be especially relevant at times of low disease incidence, as relative diversity was higher. Continued surveillance is required to monitor the changing prevalence of these vaccine antigens. PMID:26676305

  13. Variants of glycoside hydrolases

    SciTech Connect

    Teter, Sarah; Ward, Connie; Cherry, Joel; Jones, Aubrey; Harris, Paul; Yi, Jung

    2013-02-26

    The present invention relates to variants of a parent glycoside hydrolase, comprising a substitution at one or more positions corresponding to positions 21, 94, 157, 205, 206, 247, 337, 350, 373, 383, 438, 455, 467, and 486 of amino acids 1 to 513 of SEQ ID NO: 2, and optionally further comprising a substitution at one or more positions corresponding to positions 8, 22, 41, 49, 57, 113, 193, 196, 226, 227, 246, 251, 255, 259, 301, 356, 371, 411, and 462 of amino acids 1 to 513 of SEQ ID NO: 2 a substitution at one or more positions corresponding to positions 8, 22, 41, 49, 57, 113, 193, 196, 226, 227, 246, 251, 255, 259, 301, 356, 371, 411, and 462 of amino acids 1 to 513 of SEQ ID NO: 2, wherein the variants have glycoside hydrolase activity. The present invention also relates to nucleotide sequences encoding the variant glycoside hydrolases and to nucleic acid constructs, vectors, and host cells comprising the nucleotide sequences.

  14. Variants of glycoside hydrolases

    DOEpatents

    Teter, Sarah; Ward, Connie; Cherry, Joel; Jones, Aubrey; Harris, Paul; Yi, Jung

    2011-04-26

    The present invention relates to variants of a parent glycoside hydrolase, comprising a substitution at one or more positions corresponding to positions 21, 94, 157, 205, 206, 247, 337, 350, 373, 383, 438, 455, 467, and 486 of amino acids 1 to 513 of SEQ ID NO: 2, and optionally further comprising a substitution at one or more positions corresponding to positions 8, 22, 41, 49, 57, 113, 193, 196, 226, 227, 246, 251, 255, 259, 301, 356, 371, 411, and 462 of amino acids 1 to 513 of SEQ ID NO: 2 a substitution at one or more positions corresponding to positions 8, 22, 41, 49, 57, 113, 193, 196, 226, 227, 246, 251, 255, 259, 301, 356, 371, 411, and 462 of amino acids 1 to 513 of SEQ ID NO: 2, wherein the variants have glycoside hydrolase activity. The present invention also relates to nucleotide sequences encoding the variant glycoside hydrolases and to nucleic acid constructs, vectors, and host cells comprising the nucleotide sequences.

  15. Device Oriented Project Controller

    SciTech Connect

    Dalesio, Leo; Kraimer, Martin

    2013-11-20

    This proposal is directed at the issue of developing control systems for very large HEP projects. A de-facto standard in accelerator control is the Experimental Physics and Industrial Control System (EPICS), which has been applied successfully to many physics projects. EPICS is a channel based system that requires that each channel of each device be configured and controlled. In Phase I, the feasibility of a device oriented extension to the distributed channel database was demonstrated by prototyping a device aware version of an EPICS I/O controller that functions with the current version of the channel access communication protocol. Extensions have been made to the grammar to define the database. Only a multi-stage position controller with limit switches was developed in the demonstration, but the grammar should support a full range of functional record types. In phase II, a full set of record types will be developed to support all existing record types, a set of process control functions for closed loop control, and support for experimental beam line control. A tool to configure these records will be developed. A communication protocol will be developed or extensions will be made to Channel Access to support introspection of components of a device. Performance bench marks will be made on both communication protocol and the database. After these records and performance tests are under way, a second of the grammar will be undertaken.

  16. UCSC Data Integrator and Variant Annotation Integrator

    PubMed Central

    Hinrichs, Angie S.; Raney, Brian J.; Speir, Matthew L.; Rhead, Brooke; Casper, Jonathan; Karolchik, Donna; Kuhn, Robert M.; Rosenbloom, Kate R.; Zweig, Ann S.; Haussler, David; Kent, W. James

    2016-01-01

    Summary: Two new tools on the UCSC Genome Browser web site provide improved ways of combining information from multiple datasets, optionally including the user's own custom track data and/or data from track hubs. The Data Integrator combines columns from multiple data tracks, showing all items from the first track along with overlapping items from the other tracks. The Variant Annotation Integrator is tailored to adding functional annotations to variant calls; it offers a more restricted set of underlying data tracks but adds predictions of each variant's consequences for any overlapping or nearby gene transcript. When available, it optionally adds additional annotations including effect prediction scores from dbNSFP for missense mutations, ENCODE regulatory summary tracks and conservation scores. Availability and implementation: The web tools are freely available at http://genome.ucsc.edu/ and the underlying database is available for download at http://hgdownload.cse.ucsc.edu/. The software (written in C and Javascript) is available from https://genome-store.ucsc.edu/ and is freely available for academic and non-profit usage; commercial users must obtain a license. Contact: angie@soe.ucsc.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26740527

  17. Human Variome Project Quality Assessment Criteria for Variation Databases.

    PubMed

    Vihinen, Mauno; Hancock, John M; Maglott, Donna R; Landrum, Melissa J; Schaafsma, Gerard C P; Taschner, Peter

    2016-06-01

    Numerous databases containing information about DNA, RNA, and protein variations are available. Gene-specific variant databases (locus-specific variation databases, LSDBs) are typically curated and maintained for single genes or groups of genes for a certain disease(s). These databases are widely considered as the most reliable information source for a particular gene/protein/disease, but it should also be made clear they may have widely varying contents, infrastructure, and quality. Quality is very important to evaluate because these databases may affect health decision-making, research, and clinical practice. The Human Variome Project (HVP) established a Working Group for Variant Database Quality Assessment. The basic principle was to develop a simple system that nevertheless provides a good overview of the quality of a database. The HVP quality evaluation criteria that resulted are divided into four main components: data quality, technical quality, accessibility, and timeliness. This report elaborates on the developed quality criteria and how implementation of the quality scheme can be achieved. Examples are provided for the current status of the quality items in two different databases, BTKbase, an LSDB, and ClinVar, a central archive of submissions about variants and their clinical significance. PMID:26919176

  18. Database computing in HEP. Progress report

    SciTech Connect

    Day, C.T.; Loken, S.; MacFarlane, J.F.; May, E.; Lifka, D.; Lusk, E.; Price, L.E.; Baden, A.; Grossman, R.; Qin, X.; Cormell, L.; Leibold, P.; Liu, D.; Nixdorf, U.; Scipioni, B.; Song, T.

    1992-12-31

    The major SSC experiments are expected to produce up to 1 Petabyte of data per year each. Once the primary reconstruction is completed by farms of inexpensive processors. I/O becomes a major factor in further analysis of the data. We believe that the application of database techniques can significantly reduce the I/O performed in these analyses. We present examples of such I/O reductions in prototype based on relational and object-oriented databases of CDF data samples.

  19. Overlap in Bibliographic Databases.

    ERIC Educational Resources Information Center

    Hood, William W.; Wilson, Concepcion S.

    2003-01-01

    Examines the topic of Fuzzy Set Theory to determine the overlap of coverage in bibliographic databases. Highlights include examples of comparisons of database coverage; frequency distribution of the degree of overlap; records with maximum overlap; records unique to one database; intra-database duplicates; and overlap in the top ten databases.…

  20. Rare Titin (TTN) Variants in Diseases Associated with Sudden Cardiac Death

    PubMed Central

    Campuzano, Oscar; Sanchez-Molero, Olallo; Mademont-Soler, Irene; Riuró, Helena; Allegue, Catarina; Coll, Monica; Pérez-Serra, Alexandra; Mates, Jesus; Picó, Ferran; Iglesias, Anna; Brugada, Ramon

    2015-01-01

    A leading cause of death in western countries is sudden cardiac death, and can be associated with genetic disease. Next-generation sequencing has allowed thorough analysis of genes associated with this entity, including, most recently, titin. We aimed to identify potentially pathogenic genetic variants in titin. A total of 1126 samples were analyzed using a custom sequencing panel including major genes related to sudden cardiac death. Our cohort was divided into three groups: 432 cases from patients with cardiomyopathies, 130 cases from patients with channelopathies, and 564 post-mortem samples from individuals showing anatomical healthy hearts and non-conclusive causes of death after comprehensive autopsy. None of the patients included had definite pathogenic variants in the genes analyzed by our custom cardio-panel. Retrospective analysis comparing the in-house database and available public databases also was performed. We identified 554 rare variants in titin, 282 of which were novel. Seven were previously reported as pathogenic. Of these 554 variants, 493 were missense variants, 233 of which were novel. Of all variants identified, 399 were unique and 155 were identified at least twice. No definite pathogenic variants were identified in any of genes analyzed. We identified rare, mostly novel, titin variants that seem to play a potentially pathogenic role in sudden cardiac death. Additional studies should be performed to clarify the role of these variants in sudden cardiac death. PMID:26516846

  1. Meta-analysis of gene-level associations for rare variants based on single-variant statistics.

    PubMed

    Hu, Yi-Juan; Berndt, Sonja I; Gustafsson, Stefan; Ganna, Andrea; Hirschhorn, Joel; North, Kari E; Ingelsson, Erik; Lin, Dan-Yu

    2013-08-01

    Meta-analysis of genome-wide association studies (GWASs) has led to the discoveries of many common variants associated with complex human diseases. There is a growing recognition that identifying "causal" rare variants also requires large-scale meta-analysis. The fact that association tests with rare variants are performed at the gene level rather than at the variant level poses unprecedented challenges in the meta-analysis. First, different studies may adopt different gene-level tests, so the results are not compatible. Second, gene-level tests require multivariate statistics (i.e., components of the test statistic and their covariance matrix), which are difficult to obtain. To overcome these challenges, we propose to perform gene-level tests for rare variants by combining the results of single-variant analysis (i.e., p values of association tests and effect estimates) from participating studies. This simple strategy is possible because of an insight that multivariate statistics can be recovered from single-variant statistics, together with the correlation matrix of the single-variant test statistics, which can be estimated from one of the participating studies or from a publicly available database. We show both theoretically and numerically that the proposed meta-analysis approach provides accurate control of the type I error and is as powerful as joint analysis of individual participant data. This approach accommodates any disease phenotype and any study design and produces all commonly used gene-level tests. An application to the GWAS summary results of the Genetic Investigation of ANthropometric Traits (GIANT) consortium reveals rare and low-frequency variants associated with human height. The relevant software is freely available. PMID:23891470

  2. Meta-analysis of Gene-Level Associations for Rare Variants Based on Single-Variant Statistics

    PubMed Central

    Hu, Yi-Juan; Berndt, Sonja I.; Gustafsson, Stefan; Ganna, Andrea; Berndt, Sonja I.; Gustafsson, Stefan; Mägi, Reedik; Ganna, Andrea; Wheeler, Eleanor; Feitosa, Mary F.; Justice, Anne E.; Monda, Keri L.; Croteau-Chonka, Damien C.; Day, Felix R.; Esko, Tõnu; Fall, Tove; Ferreira, Teresa; Gentilini, Davide; Jackson, Anne U.; Luan, Jian’an; Randall, Joshua C.; Vedantam, Sailaja; Willer, Cristen J.; Winkler, Thomas W.; Wood, Andrew R.; Workalemahu, Tsegaselassie; Hu, Yi-Juan; Lee, Sang Hong; Liang, Liming; Lin, Dan-Yu; Min, Josine L.; Neale, Benjamin M.; Thorleifsson, Gudmar; Yang, Jian; Albrecht, Eva; Amin, Najaf; Bragg-Gresham, Jennifer L.; Cadby, Gemma; den Heijer, Martin; Eklund, Niina; Fischer, Krista; Goel, Anuj; Hottenga, Jouke-Jan; Huffman, Jennifer E.; Jarick, Ivonne; Johansson, Åsa; Johnson, Toby; Kanoni, Stavroula; Kleber, Marcus E.; König, Inke R.; Kristiansson, Kati; Kutalik, Zoltán; Lamina, Claudia; Lecoeur, Cecile; Li, Guo; Mangino, Massimo; McArdle, Wendy L.; Medina-Gomez, Carolina; Müller-Nurasyid, Martina; Ngwa, Julius S.; Nolte, Ilja M.; Paternoster, Lavinia; Pechlivanis, Sonali; Perola, Markus; Peters, Marjolein J.; Preuss, Michael; Rose, Lynda M.; Shi, Jianxin; Shungin, Dmitry; Smith, Albert Vernon; Strawbridge, Rona J.; Surakka, Ida; Teumer, Alexander; Trip, Mieke D.; Tyrer, Jonathan; Van Vliet-Ostaptchouk, Jana V.; Vandenput, Liesbeth; Waite, Lindsay L.; Zhao, Jing Hua; Absher, Devin; Asselbergs, Folkert W.; Atalay, Mustafa; Attwood, Antony P.; Balmforth, Anthony J.; Basart, Hanneke; Beilby, John; Bonnycastle, Lori L.; Brambilla, Paolo; Bruinenberg, Marcel; Campbell, Harry; Chasman, Daniel I.; Chines, Peter S.; Collins, Francis S.; Connell, John M.; Cookson, William; de Faire, Ulf; de Vegt, Femmie; Dei, Mariano; Dimitriou, Maria; Edkins, Sarah; Estrada, Karol; Evans, David M.; Farrall, Martin; Ferrario, Marco M.; Ferrières, Jean; Franke, Lude; Frau, Francesca; Gejman, Pablo V.; Grallert, Harald; Grönberg, Henrik; Gudnason, Vilmundur; Hall, Alistair S.; Hall, Per; Hartikainen, Anna-Liisa; Hayward, Caroline; Heard-Costa, Nancy L.; Heath, Andrew C.; Hebebrand, Johannes; Homuth, Georg; Hu, Frank B.; Hunt, Sarah E.; Hyppönen, Elina; Iribarren, Carlos; Jacobs, Kevin B.; Jansson, John-Olov; Jula, Antti; Kähönen, Mika; Kathiresan, Sekar; Kee, Frank; Khaw, Kay-Tee; Kivimaki, Mika; Koenig, Wolfgang; Kraja, Aldi T.; Kumari, Meena; Kuulasmaa, Kari; Kuusisto, Johanna; Laitinen, Jaana H.; Lakka, Timo A.; Langenberg, Claudia; Launer, Lenore J.; Lind, Lars; Lindström, Jaana; Liu, Jianjun; Liuzzi, Antonio; Lokki, Marja-Liisa; Lorentzon, Mattias; Madden, Pamela A.; Magnusson, Patrik K.; Manunta, Paolo; Marek, Diana; März, Winfried; Leach, Irene Mateo; McKnight, Barbara; Medland, Sarah E.; Mihailov, Evelin; Milani, Lili; Montgomery, Grant W.; Mooser, Vincent; Mühleisen, Thomas W.; Munroe, Patricia B.; Musk, Arthur W.; Narisu, Narisu; Navis, Gerjan; Nicholson, George; Nohr, Ellen A.; Ong, Ken K.; Oostra, Ben A.; Palmer, Colin N.A.; Palotie, Aarno; Peden, John F.; Pedersen, Nancy; Peters, Annette; Polasek, Ozren; Pouta, Anneli; Pramstaller, Peter P.; Prokopenko, Inga; Pütter, Carolin; Radhakrishnan, Aparna; Raitakari, Olli; Rendon, Augusto; Rivadeneira, Fernando; Rudan, Igor; Saaristo, Timo E.; Sambrook, Jennifer G.; Sanders, Alan R.; Sanna, Serena; Saramies, Jouko; Schipf, Sabine; Schreiber, Stefan; Schunkert, Heribert; Shin, So-Youn; Signorini, Stefano; Sinisalo, Juha; Skrobek, Boris; Soranzo, Nicole; Stančáková, Alena; Stark, Klaus; Stephens, Jonathan C.; Stirrups, Kathleen; Stolk, Ronald P.; Stumvoll, Michael; Swift, Amy J.; Theodoraki, Eirini V.; Thorand, Barbara; Tregouet, David-Alexandre; Tremoli, Elena; Van der Klauw, Melanie M.; van Meurs, Joyce B.J.; Vermeulen, Sita H.; Viikari, Jorma; Virtamo, Jarmo; Vitart, Veronique; Waeber, Gérard; Wang, Zhaoming; Widén, Elisabeth; Wild, Sarah H.; Willemsen, Gonneke; Winkelmann, Bernhard R.; Witteman, Jacqueline C.M.; Wolffenbuttel, Bruce H.R.; Wong, Andrew; Wright, Alan F.

    2013-01-01

    Meta-analysis of genome-wide association studies (GWASs) has led to the discoveries of many common variants associated with complex human diseases. There is a growing recognition that identifying “causal” rare variants also requires large-scale meta-analysis. The fact that association tests with rare variants are performed at the gene level rather than at the variant level poses unprecedented challenges in the meta-analysis. First, different studies may adopt different gene-level tests, so the results are not compatible. Second, gene-level tests require multivariate statistics (i.e., components of the test statistic and their covariance matrix), which are difficult to obtain. To overcome these challenges, we propose to perform gene-level tests for rare variants by combining the results of single-variant analysis (i.e., p values of association tests and effect estimates) from participating studies. This simple strategy is possible because of an insight that multivariate statistics can be recovered from single-variant statistics, together with the correlation matrix of the single-variant test statistics, which can be estimated from one of the participating studies or from a publicly available database. We show both theoretically and numerically that the proposed meta-analysis approach provides accurate control of the type I error and is as powerful as joint analysis of individual participant data. This approach accommodates any disease phenotype and any study design and produces all commonly used gene-level tests. An application to the GWAS summary results of the Genetic Investigation of ANthropometric Traits (GIANT) consortium reveals rare and low-frequency variants associated with human height. The relevant software is freely available. PMID:23891470

  3. The EXOSAT database system. Available databases.

    NASA Astrophysics Data System (ADS)

    Barron, C.

    1991-02-01

    This User's Guide describes the databases that are currently available by remote login to the EXOSAT/ESTEC site of the EXOSAT database system. This includes where ever possible the following: brief descriptions of each observatory, telescope and instrument references to more complete observatory descriptions a list of the contents of each database and how it was generated, parameter descriptions.

  4. Crystallographic variant selection in {alpha}-{beta} brass

    SciTech Connect

    Stanford, N.; Bate, P.S. . E-mail: pete.bate@man.ac.uk

    2005-02-01

    The transformation texture of {alpha}/{beta} brass with a diffusional Widmanstaetten {alpha} growth morphology has been investigated. Electron micrographs and electron backscattered diffraction was used to determine that the orientation relationship between the {beta} phase and the {alpha} associated with nucleation at {beta} grain boundaries was 44.3 deg <1 1 6>. Crystallographic variant selection was observed across those prior {beta}/{beta} grain boundaries, but this has little effect on the transformation texture due to the crystal symmetry. The effect of the crystallographic variant selection on texture is further weakened by nucleation of diffusional transformed {alpha} in the grain interior.

  5. Using Whole Exome Sequencing to Identify Candidate Genes With Rare Variants In Nonsyndromic Cleft Lip and Palate.

    PubMed

    Aylward, Alana; Cai, Yi; Lee, Andrew; Blue, Elizabeth; Rabinowitz, Daniel; Haddad, Joseph

    2016-07-01

    Studies suggest that nonsyndromic cleft lip and palate (NSCLP) is polygenic with variable penetrance, presenting a challenge in identifying all causal genetic variants. Despite relatively high prevalence of NSCLP among Amerindian populations, no large whole exome sequencing (WES) studies have been completed in this population. Our goal was to identify candidate genes with rare genetic variants for NSCLP in a Honduran population using WES. WES was performed on two to four members of 27 multiplex Honduran families. Genetic variants with a minor allele frequency > 1% in reference databases were removed. Heterozygous variants consistent with dominant disease with incomplete penetrance were ascertained, and variants with predicted functional consequence were prioritized for analysis. Pedigree-specific P-values were calculated as the probability of all affected members in the pedigree being carriers, given that at least one is a carrier. Preliminary results identified 3,727 heterozygous rare variants; 1,282 were predicted to be functionally consequential. Twenty-three genes had variants of interest in ≥3 families, where some genes had different variants in each family, giving a total of 50 variants. Variant validation via Sanger sequencing of the families and unrelated unaffected controls excluded variants that were sequencing errors or common variants not in databases, leaving four genes with candidate variants in ≥3 families. Of these, candidate variants in two genes consistently segregate with NSCLP as a dominant variant with incomplete penetrance: ACSS2 and PHYH. Rare variants found at the same gene in all affected individuals in several families are likely to be directly related to NSCLP. PMID:27229527

  6. Knowledge Abstraction in Chinese Chess Endgame Databases

    NASA Astrophysics Data System (ADS)

    Chen, Bo-Nian; Liu, Pangfeng; Hsu, Shun-Chin; Hsu, Tsan-Sheng

    Retrograde analysis is a well known approach to construct endgame databases. However, the size of the endgame databases are too large to be loaded into the main memory of a computer during tournaments. In this paper, a novel knowledge abstraction strategy is proposed to compress endgame databases. The goal is to obtain succinct knowledge for practical endgames. A specialized goal-oriented search method is described and applied on the important endgame KRKNMM. The method of combining a search algorithm with a small size of knowledge is used to handle endgame positions up to a limited depth, but with a high degree of correctness.

  7. The Ensembl Variant Effect Predictor.

    PubMed

    McLaren, William; Gil, Laurent; Hunt, Sarah E; Riat, Harpreet Singh; Ritchie, Graham R S; Thormann, Anja; Flicek, Paul; Cunningham, Fiona

    2016-01-01

    The Ensembl Variant Effect Predictor is a powerful toolset for the analysis, annotation, and prioritization of genomic variants in coding and non-coding regions. It provides access to an extensive collection of genomic annotation, with a variety of interfaces to suit different requirements, and simple options for configuring and extending analysis. It is open source, free to use, and supports full reproducibility of results. The Ensembl Variant Effect Predictor can simplify and accelerate variant interpretation in a wide range of study designs. PMID:27268795

  8. Design considerations for a space database

    NASA Technical Reports Server (NTRS)

    Moss, Lance M.

    1989-01-01

    Part of the information used in a real-time simulator is stored in the visual database. This information is processed by an image generator and displayed as a real-time visual image. The database must be constructed in a specific format, and it should efficiently utilize the capacities of the image generator that is was created for. A visual simulation is crucially dependent upon the success with which the database provides visual cues and recognizable scenes. For this reason, more and more attention is being paid to the art and science of creating effective real-time visual databases. Investigated here are the database design considerations required for a space-oriented real-time simulator. Space applications often require unique designs that correspond closely to the particular image-generator hardware and visual-database-management software. Specific examples from the databases constructed for NASA and its Evans and Sutherland CT6 image generator illustrate the various design strategies used in a space-simulation environment. These database design considerations are essential for all who would create a space database.

  9. Better prediction of functional effects for sequence variants

    PubMed Central

    2015-01-01

    Elucidating the effects of naturally occurring genetic variation is one of the major challenges for personalized health and personalized medicine. Here, we introduce SNAP2, a novel neural network based classifier that improves over the state-of-the-art in distinguishing between effect and neutral variants. Our method's improved performance results from screening many potentially relevant protein features and from refining our development data sets. Cross-validated on >100k experimentally annotated variants, SNAP2 significantly outperformed other methods, attaining a two-state accuracy (effect/neutral) of 83%. SNAP2 also outperformed combinations of other methods. Performance increased for human variants but much more so for other organisms. Our method's carefully calibrated reliability index informs selection of variants for experimental follow up, with the most strongly predicted half of all effect variants predicted at over 96% accuracy. As expected, the evolutionary information from automatically generated multiple sequence alignments gave the strongest signal for the prediction. However, we also optimized our new method to perform surprisingly well even without alignments. This feature reduces prediction runtime by over two orders of magnitude, enables cross-genome comparisons, and renders our new method as the best solution for the 10-20% of sequence orphans. SNAP2 is available at: https://rostlab.org/services/snap2web Definitions used Delta, input feature that results from computing the difference feature scores for native amino acid and feature scores for variant amino acid; nsSNP, non-synoymous SNP; PMD, Protein Mutant Database; SNAP, Screening for non-acceptable polymorphisms; SNP, single nucleotide polymorphism; variant, any amino acid changing sequence variant. PMID:26110438

  10. Interpreting Secondary Cardiac Disease Variants in an Exome Cohort

    PubMed Central

    Ng, David; Johnston, Jennifer J.; Teer, Jamie K.; Singh, Larry N.; Peller, Lindsey C.; Wynter, Jamila S.; Lewis, Katie L.; Cooper, David N.; Stenson, Peter D.; Mullikin, James C.; Biesecker, Leslie G.

    2013-01-01

    Background Massively parallel sequencing to identify rare variants is widely practiced in medical research and in the clinic. Genome and exome sequencing can identify the genetic cause of a disease (primary results), but can also identify pathogenic variants underlying diseases that are not being sought (secondary or incidental results). A major controversy has developed surrounding the return of secondary results to research participants. We have piloted a method to analyze exomes to identify participants at-risk for cardiac arrhythmias, cardiomyopathies or sudden death. Methods and Results Exome sequencing was performed on 870 participants not selected for arrhythmia, cardiomyopathy, or a family history of sudden death. Exome data from 22 cardiac arrhythmia and 41 cardiomyopathy-associated genes were analyzed using an algorithm that filtered results on genotype quality, frequency, and database information. We identified 1367 variants in the cardiomyopathy genes and 360 variants in the arrhythmia genes. Six participants had pathogenic variants associated with dilated cardiomyopathy (n=1), hypertrophic cardiomyopathy (n=2), left ventricular noncompaction (n=1) or long QT syndrome (n=2). Two of these participants had evidence of cardiomyopathy and one had left ventricular noncompaction on ECHO. Three participants with likely pathogenic variants had prolonged QTc. Family history included unexplained sudden death among relatives. Conclusions Approximately 0.5% of participants in this study had pathogenic variants in known cardiomyopathy or arrhythmia genes. This high frequency may be due to self-selection, false positives, or underestimation of the prevalence of these conditions. We conclude that clinically important cardiomyopathy and dysrhythmia secondary variants can be identified in unselected exomes. PMID:23861362

  11. NEOBASE: databasing the neocortical microcircuit.

    PubMed

    Muhammad, Asif Jan; Markram, Henry

    2005-01-01

    Mammals adapt to a rapidly changing world because of the sophisticated perceptual and cognitive function enabled by the neocortex. The neocortex, which has expanded to constitute nearly 80% of the human brain seems to have arisen from repeated duplication of a stereotypical template of neurons and synaptic circuits with subtle specializations in different brain regions and species. Determining the design and function of this microcircuitry is therefore of paramount importance to understanding normal and abnormal higher brain function. Recent advances in recording synaptically-coupled neurons has allowed rapid dissection of the neocortical microcircuitry thus yielding a massive amount of quantitative anatomical, electrical and gene expression data on the neurons and the synaptic circuits that connect the neurons. Due to the availability of the above mentioned data, it has now become imperative to database the neurons of the microcircuit and their synaptic connections. The NEOBASE project, aims to archive the neocortical microcircuit data in a manner that facilitates development of advanced data mining applications, statistical and bioinformatics analyses tools, custom microcircuit builders, and visualization and simulation applications. The database architecture is based on ROOT, a software environment that allows the construction of an object oriented database with numerous relational capabilities. The proposed architecture allows construction of a database that closely mimics the architecture of the real microcircuit, which facilitates the interface with virtually any application, allows for data format evolution, and aims for full interoperability with other databases. NEOBASE will provide an important resource and research tool for studying the microcircuit basis of normal and abnormal neocortical function. The database will be available to local as well as remote users using Grid based tools and technologies. PMID:15923726

  12. Scripps Genome ADVISER: Annotation and Distributed Variant Interpretation SERver

    PubMed Central

    Pham, Phillip H.; Shipman, William J.; Erikson, Galina A.; Schork, Nicholas J.; Torkamani, Ali

    2015-01-01

    Interpretation of human genomes is a major challenge. We present the Scripps Genome ADVISER (SG-ADVISER) suite, which aims to fill the gap between data generation and genome interpretation by performing holistic, in-depth, annotations and functional predictions on all variant types and effects. The SG-ADVISER suite includes a de-identification tool, a variant annotation web-server, and a user interface for inheritance and annotation-based filtration. SG-ADVISER allows users with no bioinformatics expertise to manipulate large volumes of variant data with ease – without the need to download large reference databases, install software, or use a command line interface. SG-ADVISER is freely available at genomics.scripps.edu/ADVISER. PMID:25706643

  13. Group B streptococcal opacity variants.

    PubMed Central

    Pincus, S H; Cole, R L; Wessels, M R; Corwin, M D; Kamanga-Sollo, E; Hayes, S F; Cieplak, W; Swanson, J

    1992-01-01

    Colony opacity variants were detected for type III group B streptococci (GBS). Transparent colonies predominate in the parent GBS, with occasional colonies having opaque portions. Two stable opaque variants (1.1 and 1.5) were compared with three transparent clones (1.2, 1.3, and 1.4). All grew well on blood agar and on GC medium, but variant 1.1 failed to grow on Todd-Hewitt medium. Scanning and transmission electron microscopy demonstrated that colony opacity correlated with bacterial aggregation status, with opaque variants forming longer and more organized chains. Opaque-transparent switches were observed in both directions for most variants, with transparent to opaque noted most frequently, but 1.5 did not switch at all. Switching of the opacity phenotype was observed both in vitro and in neonatal mice. Relationships between colony opacity and several cell surface phenomena were explored. (i) Opaque variant 1.1 had two surface proteins (46 and 75 kDa) that were either unique or greatly overexpressed. (ii) Variant 1.1 was deficient in type III polysaccharide, while 1.5 lacked group B antigen. Diminished capsular polysaccharide of variant 1.1 was reflected in reduced negative electrophoretic mobility and in increased buoyant density. (iii) Transparent variant colonies growing closest to a penicillin disk were opaque, but colonial variants did not differ in their sensitivity to penicillin. These data indicate that GBS can exist in both opaque and transparent forms, with opaque appearance occurring by multiple routes. Opaque variants grow poorly on Todd-Hewitt medium generally used for isolation of GBS, so any possible relationships between opacity variation and pathogenesis of GBS infection are unknown. Images PMID:1592825

  14. Orienting hypnosis.

    PubMed

    Hope, Anna E; Sugarman, Laurence I

    2015-01-01

    This article presents a new frame for understanding hypnosis and its clinical applications. Despite great potential to transform health and care, hypnosis research and clinical integration is impaired in part by centuries of misrepresentation and ignorance about its demonstrated efficacy. The authors contend that advances in the field are primarily encumbered by the lack of distinct boundaries and definitions. Here, hypnosis, trance, and mind are all redefined and grounded in biological, neurological, and psychological phenomena. Solutions are proposed for boundary and language problems associated with hypnosis. The biological role of novelty stimulating an orienting response that, in turn, potentiates systemic plasticity forms the basis for trance. Hypnosis is merely the skill set that perpetuates and influences trance. This formulation meshes with many aspects of Milton Erickson's legacy and Ernest Rossi's recent theory of mind and health. Implications of this hypothesis for clinical skills, professional training, and research are discussed. PMID:25928677

  15. Variants of windmill nystagmus.

    PubMed

    Choi, Kwang-Dong; Shin, Hae Kyung; Kim, Ji-Soo; Kim, Sung-Hee; Choi, Jae-Hwan; Kim, Hyo-Jung; Zee, David S

    2016-07-01

    Windmill nystagmus is characterized by a clock-like rotation of the beating direction of a jerk nystagmus suggesting separate horizontal and vertical oscillators, usually 90° out of phase. We report oculographic characteristics in three patients with variants of windmill nystagmus in whom the common denominator was profound visual loss due to retinal diseases. Two patients showed a clock-like pattern, while in the third, the nystagmus was largely diagonal (in phase or 180° out of phase) but also periodically changed direction by 180°. We hypothesize that windmill nystagmus is a unique manifestation of "eye movements of the blind." It emerges when the central structures, including the cerebellum, that normally keep eye movements calibrated and gaze steady can no longer perform their task, because they are deprived of the retinal image motion that signals a need for adaptive recalibration. PMID:27159990

  16. Phenylketonuria variants in Ontario.

    PubMed Central

    1976-01-01

    Since mass screening of the newborn population for phenylketonuria (PKU) by the Guthrie test was begun in Ontario in July 1965 many variants of PKU have been recognized in the 96 to 97% screened. Seventy-one cases of classic PKU were detected (four were missed). Of 48 cases of persistent hyperphenylalaninemia discovered, 18 were classified as atypical PKU and 30 as persistent benign hyperphenylalaninemia. Numerous infants with transient hyperphenylalaninemia (initial values over 10 mg/dl in 12), in many instances the result of transient neonatal tyrosinemia, were discovered. There was a slight predominance of males. Serum phenylalanine values of up to 15 mg/dl seemed to be harmless to the developing brain. A survey of 67 247 adults in the general population revealed 1 person with PKU and 1 with persistent benign hyperphenylalaninemia; both had normal intelligence quotients. Of 1548 mothers of retarded children tested, none had hyperphenylalaninemia. PMID:953933

  17. Arginine-Glycine Amidinotransferase Deficiency and Functional Characterization of Missense Variants in GATM.

    PubMed

    DesRoches, Caro-Lyne; Bruun, Theodora; Wang, Peixiang; Marshall, Christian R; Mercimek-Mahmutoglu, Saadet

    2016-09-01

    Arginine-glycine amidinotransferase (GATM) deficiency is an autosomal-recessive disorder caused by pathogenic variants in GATM. Clinical features include intellectual disability, hypotonia, and myopathy. Due to normal neurodevelopment in asymptomatic individuals on creatine monotherapy, GATM deficiency is a good candidate for newborn screening. To determine the carrier frequency of GATM deficiency, we performed functional characterization of rare missense variants in GATM reported as heterozygous in the Exome Variant Server database. To assess phenotype and genotype correlation, we developed a clinical severity scoring system. Two patients with mild phenotype had a nonsense missense variant. Severe phenotype was present in patients with missense as well as truncating variants. There seems to be no phenotype and genotype correlation. We cloned a novel GATM transcript. We found seven missense variants retaining 0% of wild-type GATM activity indicating putative pathogenicity. Based on our study results, high Genomic Evolutionary Rate Profiling conservation score, conserved amino acid substitution in species, and low allele frequency in exome databases would be the most sensitive in silico analysis tools to predict pathogenicity of missense variants. We present first study of the functional characterization of missense variants in GATM as well as clinical severity score of patients with GATM deficiency. PMID:27233232

  18. Databases: Beyond the Basics.

    ERIC Educational Resources Information Center

    Whittaker, Robert

    This presented paper offers an elementary description of database characteristics and then provides a survey of databases that may be useful to the teacher and researcher in Slavic and East European languages and literatures. The survey focuses on commercial databases that are available, usable, and needed. Individual databases discussed include:…

  19. Reflective Database Access Control

    ERIC Educational Resources Information Center

    Olson, Lars E.

    2009-01-01

    "Reflective Database Access Control" (RDBAC) is a model in which a database privilege is expressed as a database query itself, rather than as a static privilege contained in an access control list. RDBAC aids the management of database access controls by improving the expressiveness of policies. However, such policies introduce new interactions…

  20. A case study for a digital seabed database: Bohai Sea engineering geology database

    NASA Astrophysics Data System (ADS)

    Tianyun, Su; Shikui, Zhai; Baohua, Liu; Ruicai, Liang; Yanpeng, Zheng; Yong, Wang

    2006-07-01

    This paper discusses the designing plan of ORACLE-based Bohai Sea engineering geology database structure from requisition analysis, conceptual structure analysis, logical structure analysis, physical structure analysis and security designing. In the study, we used the object-oriented Unified Modeling Language (UML) to model the conceptual structure of the database and used the powerful function of data management which the object-oriented and relational database ORACLE provides to organize and manage the storage space and improve its security performance. By this means, the database can provide rapid and highly effective performance in data storage, maintenance and query to satisfy the application requisition of the Bohai Sea Oilfield Paradigm Area Information System.

  1. EDITORIAL: Optical orientation Optical orientation

    NASA Astrophysics Data System (ADS)

    SAME ADDRESS *, Yuri; Landwehr, Gottfried

    2008-11-01

    priority of the discovery in the literature, which was partly caused by the existence of the Iron Curtain. I had already enjoyed contact with Boris in the 1980s when the two volumes of Landau Level Spectroscopy were being prepared [2]. He was one of the pioneers of magneto-optics in semiconductors. In the 1950s the band structure of germanium and silicon was investigated by magneto-optical methods, mainly in the United States. No excitonic effects were observed and the band structure parameters were determined without taking account of excitons. However, working with cuprous oxide, which is a direct semiconductor with a relative large energy gap, Zakharchenya and his co-worker Seysan showed that in order to obtain correct band structure parameters, it is necessary to take excitons into account [3]. About 1970 Boris started work on optical orientation. Early work by Hanle in Germany in the 1920s on the depolarization of luminescence in mercury vapour by a transverse magnetic field was not appreciated for a long time. Only in the late 1940s did Kastler and co-workers in Paris begin a systematic study of optical pumping, which led to the award of a Nobel prize. The ideas of optical pumping were first applied by Georges Lampel to solid state physics in 1968. He demonstrated optical orientation of free carriers in silicon. The detection method was nuclear magnetic resonance; optically oriented free electrons dynamically polarized the 29Si nuclei of the host lattice. The first optical detection of spin orientation was demonstrated by with the III-V semiconductor GaSb by Parsons. Due to the various interaction mechanisms of spins with their environment, the effects occurring in semiconductors are naturally more complex than those in atoms. Optical detection is now the preferred method to detect spin alignment in semiconductors. The orientation of spins in crystals pumped with circularly polarized light is deduced from the degree of circular polarization of the recombination

  2. Human Mitochondrial Protein Database

    National Institute of Standards and Technology Data Gateway

    SRD 131 Human Mitochondrial Protein Database (Web, free access)   The Human Mitochondrial Protein Database (HMPDb) provides comprehensive data on mitochondrial and human nuclear encoded proteins involved in mitochondrial biogenesis and function. This database consolidates information from SwissProt, LocusLink, Protein Data Bank (PDB), GenBank, Genome Database (GDB), Online Mendelian Inheritance in Man (OMIM), Human Mitochondrial Genome Database (mtDB), MITOMAP, Neuromuscular Disease Center and Human 2-D PAGE Databases. This database is intended as a tool not only to aid in studying the mitochondrion but in studying the associated diseases.

  3. Orientation field estimation for latent fingerprint enhancement.

    PubMed

    Feng, Jianjiang; Zhou, Jie; Jain, Anil K

    2013-04-01

    Identifying latent fingerprints is of vital importance for law enforcement agencies to apprehend criminals and terrorists. Compared to live-scan and inked fingerprints, the image quality of latent fingerprints is much lower, with complex image background, unclear ridge structure, and even overlapping patterns. A robust orientation field estimation algorithm is indispensable for enhancing and recognizing poor quality latents. However, conventional orientation field estimation algorithms, which can satisfactorily process most live-scan and inked fingerprints, do not provide acceptable results for most latents. We believe that a major limitation of conventional algorithms is that they do not utilize prior knowledge of the ridge structure in fingerprints. Inspired by spelling correction techniques in natural language processing, we propose a novel fingerprint orientation field estimation algorithm based on prior knowledge of fingerprint structure. We represent prior knowledge of fingerprints using a dictionary of reference orientation patches. which is constructed using a set of true orientation fields, and the compatibility constraint between neighboring orientation patches. Orientation field estimation for latents is posed as an energy minimization problem, which is solved by loopy belief propagation. Experimental results on the challenging NIST SD27 latent fingerprint database and an overlapped latent fingerprint database demonstrate the advantages of the proposed orientation field estimation algorithm over conventional algorithms. PMID:22826508

  4. Crystallographic variant selection of martensite during fatigue deformation

    NASA Astrophysics Data System (ADS)

    Das, Arpan

    2015-03-01

    Metastable austenitic stainless steels are prone to form deformation-induced martensite under the influence of externally applied stress. Crystallographic variant selection during martensitic transformation of metastable austenite has been investigated thoroughly with respect to the interaction between the applied uniaxial cyclic stress and the resulting accumulated plastic strain during cyclic plastic deformation. The orientation of all the Kurdjomov-Sachs (K-S) variants has been evaluated extensively and compared with the measured orientation of martensite with their corresponding interaction energies by applying the elegant transformation texture model recently developed by Kundu and Bhadeshia. Encouraging correlation between model prediction and experimental data generation for martensite pole figures at many deformed austenite grains has been observed. It has been found that both the applied uniaxial cyclic stress and the accumulated plastic strain are having strong influence on crystallographic variant selection during cyclic plastic deformation. Patel and Cohen's classical theory can be utilized to predict the crystallographic variant selection, if it is correctly used along with the phenomenological theory of martensite crystallography.

  5. Variant terminology. [for aerospace information systems

    NASA Technical Reports Server (NTRS)

    Buchan, Ronald L.

    1991-01-01

    A system called Variant Terminology Switching (VTS) is set forth that is intended to provide computer-assisted spellings for terms that have American and British versions. VTS is based on the use of brackets, parentheses, and other symbols in conjunction with letters that distinguish American and British spellings. The symbols are used in the systems as indicators of actions such as deleting, adding, and replacing letters as well as replacing entire words and concepts. The system is shown to be useful for the intended purpose and also for the recognition of misspellings and for the standardization of computerized input/output. The VTS system is of interest to the development of international retrieval systems for aerospace and other technical databases that enhance the use by the global scientific community.

  6. datOZ: scientific databases readily accessible over the WWW.

    NASA Astrophysics Data System (ADS)

    Ortiz, P. F.

    datOZ: a tool designed to create scientific database systems accessible over the WWW is presented. datOZ can create customized database systems for a variety of scientific applications, (but oriented towards astronomy), keeping the same interface structure and philosophy for all the databases it creates. From the user's point of view, datOZ creates platform-independent databases with multilingual capabilities, which makes data accessible to any user with a connection to the Internet around the world. Some features of the databases created with datOZ include full data retrieval of individual database elements (DBE), partial data retrieval for a user-selected set of DBE's, the possibility of generating graphs with the data selected, and to cross-correlate the data with other databases created with the system. datOZ's databases can contain images, movies, hyper-links, and any other multimedia-type of information.

  7. EDITORIAL: Optical orientation Optical orientation

    NASA Astrophysics Data System (ADS)

    SAME ADDRESS *, Yuri; Landwehr, Gottfried

    2008-11-01

    priority of the discovery in the literature, which was partly caused by the existence of the Iron Curtain. I had already enjoyed contact with Boris in the 1980s when the two volumes of Landau Level Spectroscopy were being prepared [2]. He was one of the pioneers of magneto-optics in semiconductors. In the 1950s the band structure of germanium and silicon was investigated by magneto-optical methods, mainly in the United States. No excitonic effects were observed and the band structure parameters were determined without taking account of excitons. However, working with cuprous oxide, which is a direct semiconductor with a relative large energy gap, Zakharchenya and his co-worker Seysan showed that in order to obtain correct band structure parameters, it is necessary to take excitons into account [3]. About 1970 Boris started work on optical orientation. Early work by Hanle in Germany in the 1920s on the depolarization of luminescence in mercury vapour by a transverse magnetic field was not appreciated for a long time. Only in the late 1940s did Kastler and co-workers in Paris begin a systematic study of optical pumping, which led to the award of a Nobel prize. The ideas of optical pumping were first applied by Georges Lampel to solid state physics in 1968. He demonstrated optical orientation of free carriers in silicon. The detection method was nuclear magnetic resonance; optically oriented free electrons dynamically polarized the 29Si nuclei of the host lattice. The first optical detection of spin orientation was demonstrated by with the III-V semiconductor GaSb by Parsons. Due to the various interaction mechanisms of spins with their environment, the effects occurring in semiconductors are naturally more complex than those in atoms. Optical detection is now the preferred method to detect spin alignment in semiconductors. The orientation of spins in crystals pumped with circularly polarized light is deduced from the degree of circular polarization of the recombination

  8. Oriental noodles.

    PubMed

    Hou, G

    2001-01-01

    Oriental noodles have been consumed for thousands of years and remain an important part in the diet of many Asians. There is a wide variety of noodles in Asia with many local variations as result of differences in culture, climate, region and a host of other factors. In this article noodle classification, formulation, processing and evaluation are reviewed, with emphasis on eight major types. Wheat quality requirements, basic flour specifications, ingredient functions, and production variables are identified for different noodles. In the evaluation of flour for noodle making, three key quality attributes are considered: processability, noodle color and texture. Noodle process behavior is particularly important in the modern industrial production. Each noodle type has its own unique color and texture characteristics. Flour color, protein content, ash content, yellow pigment and polyphenol oxidase activity are important factors responsible for noodle color. Starch characteristics, protein content and quality play major roles in governing the texture of cooked noodles. However, the relative importance of starch and proteins varies considerably with noodle type. Starch pasting quality is the primary trait determining the eating quality of Japanese and Korean noodles that are characterized by soft and elastic texture, while protein quantity and strength are very important to Chinese-type noodles that require firm bite and chewy texture. Other factors such as ingredients added in the noodle formula and processing variables used during noodle preparation also affect the cooked noodle texture as well. PMID:11285682

  9. SpliceVista, a Tool for Splice Variant Identification and Visualization in Shotgun Proteomics Data*

    PubMed Central

    Zhu, Yafeng; Hultin-Rosenberg, Lina; Forshed, Jenny; Branca, Rui M. M.; Orre, Lukas M.; Lehtiö, Janne

    2014-01-01

    Alternative splicing is a pervasive process in eukaryotic organisms. More than 90% of human genes have alternatively spliced products, and aberrant splicing has been shown to be associated with many diseases. Current methods employed in the detection of splice variants include prediction by clustering of expressed sequence tags, exon microarray, and mRNA sequencing, all methods focusing on RNA-level information. There is a lack of tools for analyzing splice variants at the protein level. Here, we present SpliceVista, a tool for splice variant identification and visualization based on mass spectrometry proteomics data. SpliceVista retrieves gene structure and translated sequences from alternative splicing databases and maps MS-identified peptides to splice variants. The visualization module plots the exon composition of each splice variant and aligns identified peptides with transcript positions. If quantitative mass spectrometry data are used, SpliceVista plots the quantitative patterns for each peptide and provides users with the option to cluster peptides based on their quantitative patterns. SpliceVista can identify splice-variant-specific peptides, providing the possibility for variant-specific analysis. The tool was tested on two experimental datasets (PXD000065 and PXD000134). In A431 cells treated with gefitinib, 2983 splice-variant-specific peptides corresponding to 939 splice variants were identified. Through comparison of splice-variant-centric, protein-centric, and gene-centric quantification, several genes (e.g. EIF4H) were found to have differentially regulated splice variants after gefitinib treatment. The same discrepancy between protein-centric and splice-centric quantification was detected in the other dataset, in which induced pluripotent stem cells were compared with parental fibroblast and human embryotic stem cells. In addition, SpliceVista can be used to visualize novel splice variants inferred from peptide-level evidence. In summary, Splice

  10. SpliceVista, a tool for splice variant identification and visualization in shotgun proteomics data.

    PubMed

    Zhu, Yafeng; Hultin-Rosenberg, Lina; Forshed, Jenny; Branca, Rui M M; Orre, Lukas M; Lehtiö, Janne

    2014-06-01

    Alternative splicing is a pervasive process in eukaryotic organisms. More than 90% of human genes have alternatively spliced products, and aberrant splicing has been shown to be associated with many diseases. Current methods employed in the detection of splice variants include prediction by clustering of expressed sequence tags, exon microarray, and mRNA sequencing, all methods focusing on RNA-level information. There is a lack of tools for analyzing splice variants at the protein level. Here, we present SpliceVista, a tool for splice variant identification and visualization based on mass spectrometry proteomics data. SpliceVista retrieves gene structure and translated sequences from alternative splicing databases and maps MS-identified peptides to splice variants. The visualization module plots the exon composition of each splice variant and aligns identified peptides with transcript positions. If quantitative mass spectrometry data are used, SpliceVista plots the quantitative patterns for each peptide and provides users with the option to cluster peptides based on their quantitative patterns. SpliceVista can identify splice-variant-specific peptides, providing the possibility for variant-specific analysis. The tool was tested on two experimental datasets (PXD000065 and PXD000134). In A431 cells treated with gefitinib, 2983 splice-variant-specific peptides corresponding to 939 splice variants were identified. Through comparison of splice-variant-centric, protein-centric, and gene-centric quantification, several genes (e.g. EIF4H) were found to have differentially regulated splice variants after gefitinib treatment. The same discrepancy between protein-centric and splice-centric quantification was detected in the other dataset, in which induced pluripotent stem cells were compared with parental fibroblast and human embryotic stem cells. In addition, SpliceVista can be used to visualize novel splice variants inferred from peptide-level evidence. In summary, Splice

  11. Cellobiohydrolase variants and polynucleotides encoding same

    SciTech Connect

    Wogulis, Mark

    2014-10-14

    The present invention relates to variants of a parent cellobiohydrolase II. The present invention also relates to polynucleotides encoding the variants; nucleic acid constructs, vectors, and host cells comprising the polynucleotides; and methods of using the variants.

  12. Cellobiohydrolase variants and polynucleotides encoding same

    DOEpatents

    Wogulis, Mark

    2013-09-24

    The present invention relates to variants of a parent cellobiohydrolase II. The present invention also relates to polynucleotides encoding the variants; nucleic acid constructs, vectors, and host cells comprising the polynucleotides; and methods of using the variants.

  13. Cellobiohydrolase variants and polynucleotides encoding the same

    DOEpatents

    Wogulis, Mark

    2014-09-09

    The present invention relates to variants of a parent cellobiohydrolase. The present invention also relates to polynucleotides encoding the cellobiohydrolase variants; nucleic acid constructs, vectors, and host cells comprising the polynucleotides; and methods of using the cellobiohydrolase variants.

  14. HABP2 G534E Variant in Papillary Thyroid Carcinoma

    PubMed Central

    Tomsic, Jerneja; Fultz, Rebecca; Liyanarachchi, Sandya; He, Huiling; Senter, Leigha; de la Chapelle, Albert

    2016-01-01

    The main nonmedullary form of thyroid cancer is papillary thyroid carcinoma (PTC) that accounts for 80–90% of all thyroid malignancies. Only 3–10% of PTC patients have a positive family history of PTC yet the familiality is one of the highest of all cancers as measured by case control studies. A handful of genes have been implicated accounting for a small fraction of this genetic predisposition. It was therefore of considerable interest that a mutation in the HABP2 gene was recently implicated in familial PTC. The present work was undertaken to examine the extent of HABP2 variant involvement in PTC. The HABP2 G534E variant (rs7080536) was genotyped in blood DNA from 179 PTC families (one affected individual per family), 1160 sporadic PTC cases and 1395 controls. RNA expression of HABP2 was tested by qPCR in RNA extracted from tumor and normal thyroid tissue from individuals that are homozygous wild-type or heterozygous for the variant. The variant was found to be present in 6.1% familial cases, 8.0% sporadic cases (2 individuals were homozygous for the variant) and 8.7% controls. The variant did not segregate with PTC in one large and 6 smaller families in which it occurred. In keeping with data from the literature and databases the expression of HABP2 was highest in the liver, much lower in 3 other tested tissues (breast, kidney, brain) but not found in thyroid. Given these results showing lack of any involvement we suggest that the putative role of variant HABP2 in PTC should be carefully scrutinized. PMID:26745718

  15. Rare Copy Number Variants

    PubMed Central

    Grozeva, Detelina; Kirov, George; Ivanov, Dobril; Jones, Ian R.; Jones, Lisa; Green, Elaine K.; St Clair, David M.; Young, Allan H.; Ferrier, Nicol; Farmer, Anne E.; McGuffin, Peter; Holmans, Peter A.; Owen, Michael J.; O’Donovan, Michael C.; Craddock, Nick

    2015-01-01

    Context Recent studies suggest that copy number variation in the human genome is extensive and may play an important role in susceptibility to disease, including neuropsychiatric disorders such as schizophrenia and autism. The possible involvement of copy number variants (CNVs) in bipolar disorder has received little attention to date. Objectives To determine whether large (>100 000 base pairs) and rare (found in <1% of the population) CNVs are associated with susceptibility to bipolar disorder and to compare with findings in schizophrenia. Design A genome-wide survey of large, rare CNVs in a case-control sample using a high-density microarray. Setting The Wellcome Trust Case Control Consortium. Participants There were 1697 cases of bipolar disorder and 2806 nonpsychiatric controls. All participants were white UK residents. Main Outcome Measures Overall load of CNVs and presence of rare CNVs. Results The burden of CNVs in bipolar disorder was not increased compared with controls and was significantly less than in schizophrenia cases. The CNVs previously implicated in the etiology of schizophrenia were not more common in cases with bipolar disorder. Conclusions Schizophrenia and bipolar disorder differ with respect to CNV burden in general and association with specific CNVs in particular. Our data are consistent with the possibility that possession of large, rare deletions may modify the phenotype in those at risk of psychosis: those possessing such events are more likely to be diagnosed as having schizophrenia, and those without them are more likely to be diagnosed as having bipolar disorder. PMID:20368508

  16. Conceptual and logical level of database modeling

    NASA Astrophysics Data System (ADS)

    Hunka, Frantisek; Matula, Jiri

    2016-06-01

    Conceptual and logical levels form the top most levels of database modeling. Usually, ORM (Object Role Modeling) and ER diagrams are utilized to capture the corresponding schema. The final aim of business process modeling is to store its results in the form of database solution. For this reason, value oriented business process modeling which utilizes ER diagram to express the modeling entities and relationships between them are used. However, ER diagrams form the logical level of database schema. To extend possibilities of different business process modeling methodologies, the conceptual level of database modeling is needed. The paper deals with the REA value modeling approach to business process modeling using ER-diagrams, and derives conceptual model utilizing ORM modeling approach. Conceptual model extends possibilities for value modeling to other business modeling approaches.

  17. Variants of beta-glucosidase

    SciTech Connect

    Fidantsef, Ana; Lamsa, Michael; Gorre-Clancy, Brian

    2015-07-14

    The present invention relates to variants of a parent beta-glucosidase, comprising a substitution at one or more positions corresponding to positions 142, 183, 266, and 703 of amino acids 1 to 842 of SEQ ID NO: 2 or corresponding to positions 142, 183, 266, and 705 of amino acids 1 to 844 of SEQ ID NO: 70, wherein the variant has beta-glucosidase activity. The present invention also relates to nucleotide sequences encoding the variant beta-glucosidases and to nucleic acid constructs, vectors, and host cells comprising the nucleotide sequences.

  18. Variants of beta-glucosidases

    SciTech Connect

    Fidantsef, Ana; Lamsa, Michael; Gorre-Clancy, Brian

    2014-10-07

    The present invention relates to variants of a parent beta-glucosidase, comprising a substitution at one or more positions corresponding to positions 142, 183, 266, and 703 of amino acids 1 to 842 of SEQ ID NO: 2 or corresponding to positions 142, 183, 266, and 705 of amino acids 1 to 844 of SEQ ID NO: 70, wherein the variant has beta-glucosidase activity. The present invention also relates to nucleotide sequences encoding the variant beta-glucosidases and to nucleic acid constructs, vectors, and host cells comprising the nucleotide sequences.

  19. Variants of beta-glucosidases

    DOEpatents

    Fidantsef, Ana; Lamsa, Michael; Clancy, Brian Gorre

    2008-08-19

    The present invention relates to variants of a parent beta-glucosidase, comprising a substitution at one or more positions corresponding to positions 142, 183, 266, and 703 of amino acids 1 to 842 of SEQ ID NO: 2 or corresponding to positions 142, 183, 266, and 705 of amino acids 1 to 844 of SEQ ID NO: 70, wherein the variant has beta-glucosidase activity. The present invention also relates to nucleotide sequences encoding the variant beta-glucosidases and to nucleic acid constructs, vectors, and host cells comprising the nucleotide sequences.

  20. Variants of beta-glucosidase

    DOEpatents

    Fidantsef, Ana; Lamsa, Michael; Gorre-Clancy, Brian

    2009-12-29

    The present invention relates to variants of a parent beta-glucosidase, comprising a substitution at one or more positions corresponding to positions 142, 183, 266, and 703 of amino acids 1 to 842 of SEQ ID NO: 2 or corresponding to positions 142, 183, 266, and 705 of amino acids 1 to 844 of SEQ ID NO: 70, wherein the variant has beta-glucosidase activity. The present invention also relates to nucleotide sequences encoding the variant beta-glucosidases and to nucleic acid constructs, vectors, and host cells comprising the nucleotide sequences.

  1. An incremental database access method for autonomous interoperable databases

    NASA Technical Reports Server (NTRS)

    Roussopoulos, Nicholas; Sellis, Timos

    1994-01-01

    We investigated a number of design and performance issues of interoperable database management systems (DBMS's). The major results of our investigation were obtained in the areas of client-server database architectures for heterogeneous DBMS's, incremental computation models, buffer management techniques, and query optimization. We finished a prototype of an advanced client-server workstation-based DBMS which allows access to multiple heterogeneous commercial DBMS's. Experiments and simulations were then run to compare its performance with the standard client-server architectures. The focus of this research was on adaptive optimization methods of heterogeneous database systems. Adaptive buffer management accounts for the random and object-oriented access methods for which no known characterization of the access patterns exists. Adaptive query optimization means that value distributions and selectives, which play the most significant role in query plan evaluation, are continuously refined to reflect the actual values as opposed to static ones that are computed off-line. Query feedback is a concept that was first introduced to the literature by our group. We employed query feedback for both adaptive buffer management and for computing value distributions and selectivities. For adaptive buffer management, we use the page faults of prior executions to achieve more 'informed' management decisions. For the estimation of the distributions of the selectivities, we use curve-fitting techniques, such as least squares and splines, for regressing on these values.

  2. Identifying Mendelian disease genes with the Variant Effect Scoring Tool

    PubMed Central

    2013-01-01

    Background Whole exome sequencing studies identify hundreds to thousands of rare protein coding variants of ambiguous significance for human health. Computational tools are needed to accelerate the identification of specific variants and genes that contribute to human disease. Results We have developed the Variant Effect Scoring Tool (VEST), a supervised machine learning-based classifier, to prioritize rare missense variants with likely involvement in human disease. The VEST classifier training set comprised ~ 45,000 disease mutations from the latest Human Gene Mutation Database release and another ~45,000 high frequency (allele frequency >1%) putatively neutral missense variants from the Exome Sequencing Project. VEST outperforms some of the most popular methods for prioritizing missense variants in carefully designed holdout benchmarking experiments (VEST ROC AUC = 0.91, PolyPhen2 ROC AUC = 0.86, SIFT4.0 ROC AUC = 0.84). VEST estimates variant score p-values against a null distribution of VEST scores for neutral variants not included in the VEST training set. These p-values can be aggregated at the gene level across multiple disease exomes to rank genes for probable disease involvement. We tested the ability of an aggregate VEST gene score to identify candidate Mendelian disease genes, based on whole-exome sequencing of a small number of disease cases. We used whole-exome data for two Mendelian disorders for which the causal gene is known. Considering only genes that contained variants in all cases, the VEST gene score ranked dihydroorotate dehydrogenase (DHODH) number 2 of 2253 genes in four cases of Miller syndrome, and myosin-3 (MYH3) number 2 of 2313 genes in three cases of Freeman Sheldon syndrome. Conclusions Our results demonstrate the potential power gain of aggregating bioinformatics variant scores into gene-level scores and the general utility of bioinformatics in assisting the search for disease genes in large-scale exome sequencing studies. VEST is

  3. Are all the previously reported genetic variants in limb girdle muscular dystrophy genes pathogenic?

    PubMed

    Di Fruscio, Giuseppina; Garofalo, Arcomaria; Mutarelli, Margherita; Savarese, Marco; Nigro, Vincenzo

    2016-01-01

    Hundreds of variants in autosomal genes associated with the limb girdle muscular dystrophies (LGMDs) have been reported as being causative. However, in most cases the proof of pathogenicity derives from their non-occurrence in hundreds of healthy controls and/or from segregation studies in small families. The limited statistics of the genetic variations in the general population may hamper a correct interpretation of the effect of variants on the protein. To clarify the meaning of low-frequency variants in LGMD genes, we have selected all variants described as causative in the Leiden Open Variation Database and the Human Gene Mutation Database. We have systematically searched for their frequency in the NHLBI GO Exome Sequencing Project (ESP) and in our internal database. Surprisingly, the ESP contains about 4% of the variants previously associated with a dominant inheritance and about 9% of those associated with a recessive inheritance. The putative disease alleles are much more frequent than those estimated considering the disease prevalence. In conclusion, we hypothesize that a number of disease-associated variants are non-pathogenic and that other variations are not fully penetrant, even if they affect the protein function, suggesting a more complex genetic mechanisms for such heterogeneous disorders. PMID:25898921

  4. Are all the previously reported genetic variants in limb girdle muscular dystrophy genes pathogenic?

    PubMed Central

    Di Fruscio, Giuseppina; Garofalo, Arcomaria; Mutarelli, Margherita; Savarese, Marco; Nigro, Vincenzo

    2015-01-01

    Hundreds of variants in autosomal genes associated to the limb girdle muscular dystrophies (LGMDs) have been reported as being causative. However, in most cases the proof of pathogenicity derives from their non-occurrence in hundreds of healthy controls and/or from segregation studies in small families. The limited statistics of the genetic variations in the general population may hamper a correct interpretation of the effect of variants on the protein. To clarify the meaning of low-frequency variants in LGMD genes, we have selected all variants described as causative in the Leiden Open Variation Database and the Human Gene Mutation Database. We have systematically searched for their frequency in the NHLBI GO Exome Sequencing Project (ESP) and in our internal database. Surprisingly, the ESP contains about 4% of the variants previously associated with a dominant inheritance and about 9% of those associated with a recessive inheritance. The putative disease alleles are much more frequent than those estimated considering the disease prevalence. In conclusion, we hypothesize that a number of disease-associated variants are nonpathogenic and that other variations are not fully penetrant, even if they affect the protein function, suggesting a more complex genetic mechanisms for such heterogeneous disorders. PMID:25898921

  5. Large numbers of genetic variants considered to be pathogenic are common in asymptomatic individuals

    PubMed Central

    Cassa, Christopher A.; Tong, Mark Y.; Jordan, Daniel M.

    2013-01-01

    It is now affordable to order clinically interpreted whole genome sequence reports from clinical laboratories. One major component of these reports is derived from the knowledge base of previously identified pathogenic variants, including research articles, locus specific and other databases. While over 150,000 such pathogenic variants have been identified, many of these were originally discovered in small cohort studies of affected individuals, so their applicability to asymptomatic populations is unclear. We analyzed the prevalence of a large set of pathogenic variants from the medical and scientific literature in a large set of asymptomatic individuals (N=1,092) and found 8.5% of these pathogenic variants in at least one individual. In the average individual in the 1000 Genomes Project, previously identified pathogenic variants occur on average 294 times (σ= 25.5) in homozygous form and 942 times (σ = 68.2) in heterozygous form. We also find that many of these pathogenic variants are frequently occurring: there are 3,744 variants with MAF >= 0.01 (4.6%) and 2,837 variants with MAF >= 0.05 (3.5%). This indicates that many of these variants may be erroneous findings or have lower penetrance than previously expected. PMID:23818451

  6. Physiological Information Database (PID)

    EPA Science Inventory

    EPA has developed a physiological information database (created using Microsoft ACCESS) intended to be used in PBPK modeling. The database contains physiological parameter values for humans from early childhood through senescence as well as similar data for laboratory animal spec...

  7. THE ECOTOX DATABASE

    EPA Science Inventory

    The database provides chemical-specific toxicity information for aquatic life, terrestrial plants, and terrestrial wildlife. ECOTOX is a comprehensive ecotoxicology database and is therefore essential for providing and suppoirting high quality models needed to estimate population...

  8. Network II Database

    Energy Science and Technology Software Center (ESTSC)

    1994-11-07

    The Oak Ridge National Laboratory (ORNL) Rail and Barge Network II Database is a representation of the rail and barge system of the United States. The network is derived from the Federal Rail Administration (FRA) rail database.

  9. Household Products Database: Pesticides

    MedlinePlus

    ... Names Types of Products Manufacturers Ingredients About the Database FAQ Product Recalls Help Glossary Contact Us More ... holders. Information is extracted from Consumer Product Information Database ©2001-2015 by DeLima Associates. All rights reserved. ...

  10. Unusual variants of mycosis fungoides.

    PubMed

    Abeldaño, Alejandra; Arias, Mariana; Benedetti, Adriana; Ochoa, Karina; Maskin, Matías; Pellerano, Graciela; Kien, María Cristina; Chouela, Edgardo

    2011-01-01

    Unusual variants of mycosis fungoides (MF) differ substantially from the classical presentation, and most of them resemble other dermatologic diseases. The authors reviewed files of patients with MF who consulted our clinic between November 1995 and June 2010 to evaluate the relative frequency and clinical behavior of these variants. Among 98 patients with MF, 32 (32.65%) had unusual variants. The most common types included follicular MF (31.25%), hypopigmented MF (18.75%), poiquilodermic MF (15.6%), and erythrodermic MF (12.5%). Less common variants included unilesional MF, bullosa MF, ichthyosiform MF, granulomatous slack skin, and pigmented purpura-like MF. Progressive disease and MF-related death were most commonly associated with follicular MF, bullosa MF, and erythrodermic MF. PMID:21980706

  11. Gene Variants Reduce Opioid Risks

    MedlinePlus

    ... Charts Emerging Trends and Alerts Alcohol Club Drugs Cocaine Hallucinogens Heroin Inhalants Marijuana MDMA (Ecstasy/Molly) Methamphetamine ... a decreased risk for addiction to heroin or cocaine. The other linked variants in two genes— OPRM1 , ...

  12. RVTESTS: an efficient and comprehensive tool for rare variant association analysis using sequence data

    PubMed Central

    Zhan, Xiaowei; Hu, Youna; Li, Bingshan; Abecasis, Goncalo R.; Liu, Dajiang J.

    2016-01-01

    Motivation: Next-generation sequencing technologies have enabled the large-scale assessment of the impact of rare and low-frequency genetic variants for complex human diseases. Gene-level association tests are often performed to analyze rare variants, where multiple rare variants in a gene region are analyzed jointly. Applying gene-level association tests to analyze sequence data often requires integrating multiple heterogeneous sources of information (e.g. annotations, functional prediction scores, allele frequencies, genotypes and phenotypes) to determine the optimal analysis unit and prioritize causal variants. Given the complexity and scale of current sequence datasets and bioinformatics databases, there is a compelling need for more efficient software tools to facilitate these analyses. To answer this challenge, we developed RVTESTS, which implements a broad set of rare variant association statistics and supports the analysis of autosomal and X-linked variants for both unrelated and related individuals. RVTESTS also provides useful companion features for annotating sequence variants, integrating bioinformatics databases, performing data quality control and sample selection. We illustrate the advantages of RVTESTS in functionality and efficiency using the 1000 Genomes Project data. Availability and implementation: RVTESTS is available on Linux, MacOS and Windows. Source code and executable files can be obtained at https://github.com/zhanxw/rvtests Contact: zhanxw@gmail.com; goncalo@umich.edu; dajiang.liu@outlook.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153000

  13. Common hyperspectral image database design

    NASA Astrophysics Data System (ADS)

    Tian, Lixun; Liao, Ningfang; Chai, Ali

    2009-11-01

    This paper is to introduce Common hyperspectral image database with a demand-oriented Database design method (CHIDB), which comprehensively set ground-based spectra, standardized hyperspectral cube, spectral analysis together to meet some applications. The paper presents an integrated approach to retrieving spectral and spatial patterns from remotely sensed imagery using state-of-the-art data mining and advanced database technologies, some data mining ideas and functions were associated into CHIDB to make it more suitable to serve in agriculture, geological and environmental areas. A broad range of data from multiple regions of the electromagnetic spectrum is supported, including ultraviolet, visible, near-infrared, thermal infrared, and fluorescence. CHIDB is based on dotnet framework and designed by MVC architecture including five main functional modules: Data importer/exporter, Image/spectrum Viewer, Data Processor, Parameter Extractor, and On-line Analyzer. The original data were all stored in SQL server2008 for efficient search, query and update, and some advance Spectral image data Processing technology are used such as Parallel processing in C#; Finally an application case is presented in agricultural disease detecting area.

  14. MPlus Database system

    SciTech Connect

    Not Available

    1989-01-20

    The MPlus Database program was developed to keep track of mail received. This system was developed by TRESP for the Department of Energy/Oak Ridge Operations. The MPlus Database program is a PC application, written in dBase III+'' and compiled with Clipper'' into an executable file. The files you need to run the MPLus Database program can be installed on a Bernoulli, or a hard drive. This paper discusses the use of this database.

  15. Aviation Safety Issues Database

    NASA Technical Reports Server (NTRS)

    Morello, Samuel A.; Ricks, Wendell R.

    2009-01-01

    The aviation safety issues database was instrumental in the refinement and substantiation of the National Aviation Safety Strategic Plan (NASSP). The issues database is a comprehensive set of issues from an extremely broad base of aviation functions, personnel, and vehicle categories, both nationally and internationally. Several aviation safety stakeholders such as the Commercial Aviation Safety Team (CAST) have already used the database. This broader interest was the genesis to making the database publically accessible and writing this report.

  16. Mission and Assets Database

    NASA Technical Reports Server (NTRS)

    Baldwin, John; Zendejas, Silvino; Gutheinz, Sandy; Borden, Chester; Wang, Yeou-Fang

    2009-01-01

    Mission and Assets Database (MADB) Version 1.0 is an SQL database system with a Web user interface to centralize information. The database stores flight project support resource requirements, view periods, antenna information, schedule, and forecast results for use in mid-range and long-term planning of Deep Space Network (DSN) assets.

  17. Plant and Crop Databases

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Databases have become an integral part of all aspects of biological research, including basic and applied plant biology. The importance of databases continues to increase as the volume of data from direct and indirect genomics approaches expands. What is not always obvious to users of databases is t...

  18. SDS, a structural disruption score for assessment of missense variant deleteriousness

    PubMed Central

    Preeprem, Thanawadee; Gibson, Greg

    2014-01-01

    We have developed a novel structure-based evaluation for missense variants that explicitly models protein structure and amino acid properties to predict the likelihood that a variant disrupts protein function. A structural disruption score (SDS) is introduced as a measure to depict the likelihood that a case variant is functional. The score is constructed using characteristics that distinguish between causal and neutral variants within a group of proteins. The SDS score is correlated with standard sequence-based deleteriousness, but shows promise for improving discrimination between neutral and causal variants at less conserved sites. The prediction was performed on 3-dimentional structures of 57 gene products whose homozygous SNPs were identified as case-exclusive variants in an exome sequencing study of epilepsy disorders. We contrasted the candidate epilepsy variants with scores for likely benign variants found in the EVS database, and for positive control variants in the same genes that are suspected to promote a range of diseases. To derive a characteristic profile of damaging SNPs, we transformed continuous scores into categorical variables based on the score distribution of each measurement, collected from all possible SNPs in this protein set, where extreme measures were assumed to be deleterious. A second epilepsy dataset was used to replicate the findings. Causal variants tend to receive higher sequence-based deleterious scores, induce larger physico-chemical changes between amino acid pairs, locate in protein domains, buried sites or on conserved protein surface clusters, and cause protein destabilization, relative to negative controls. These measures were agglomerated for each variant. A list of nine high-priority putative functional variants for epilepsy was generated. Our newly developed SDS protocol facilitates SNP prioritization for experimental validation. PMID:24795746

  19. Comprehensive splicing functional analysis of DNA variants of the BRCA2 gene by hybrid minigenes

    PubMed Central

    2012-01-01

    Introduction The underlying pathogenic mechanism of a large fraction of DNA variants of disease-causing genes is the disruption of the splicing process. We aimed to investigate the effect on splicing of the BRCA2 variants c.8488-1G > A (exon 20) and c.9026_9030del (exon 23), as well as 41 BRCA2 variants reported in the Breast Cancer Information Core (BIC) mutation database. Methods DNA variants were analyzed with the splicing prediction programs NNSPLICE and Human Splicing Finder. Functional analyses of candidate variants were performed by lymphocyte RT-PCR and/or hybrid minigene assays. Forty-one BIC variants of exons 19, 20, 23 and 24 were bioinformatically selected and generated by PCR-mutagenesis of the wild type minigenes. Results Lymphocyte RT-PCR of c.8488-1G > A showed intron 19 retention and a 12-nucleotide deletion in exon 20, whereas c.9026_9030del did not show any splicing anomaly. Minigene analysis of c.8488-1G > A displayed the aforementioned aberrant isoforms but also exon 20 skipping. We further evaluated the splicing outcomes of 41 variants of four BRCA2 exons by minigene analysis. Eighteen variants presented splicing aberrations. Most variants (78.9%) disrupted the natural splice sites, whereas four altered putative enhancers/silencers and had a weak effect. Fluorescent RT-PCR of minigenes accurately detected 14 RNA isoforms generated by cryptic site usage, exon skipping and intron retention events. Fourteen variants showed total splicing disruptions and were predicted to truncate or eliminate essential domains of BRCA2. Conclusions A relevant proportion of BRCA2 variants are correlated with splicing disruptions, indicating that RNA analysis is a valuable tool to assess the pathogenicity of a particular DNA change. The minigene system is a straightforward and robust approach to detect variants with an impact on splicing and contributes to a better knowledge of this gene expression step. PMID:22632462

  20. Databases of genomic variation and phenotypes: existing resources and future needs

    PubMed Central

    Johnston, Jennifer J.; Biesecker, Leslie G.

    2013-01-01

    Massively parallel sequencing (MPS) has become an important tool for identifying medically significant variants in both research and the clinic. Accurate variation and genotype–phenotype databases are critical in our ability to make sense of the vast amount of information that MPS generates. The purpose of this review is to summarize the state of the art of variation and genotype–phenotype databases, how they can be used, and opportunities to improve these resources. Our working assumption is that the objective of the clinical genomicist is to identify highly penetrant variants that could explain existing disease or predict disease risk for individual patients or research participants. We have detailed how current databases contribute to this goal providing frequency data, literature reviews and predictions of causation for individual variants. For variant annotation, databases vary greatly in their ease of use, the use of standard mutation nomenclature, the comprehensiveness of the variant cataloging and the degree of expert opinion. Ultimately, we need a dynamic and comprehensive reference database of medically important variants that is easily cross referenced to exome and genome sequence data and allows for an accumulation of expert opinion. PMID:23962721

  1. A Database Practicum for Teaching Database Administration and Software Development at Regis University

    ERIC Educational Resources Information Center

    Mason, Robert T.

    2013-01-01

    This research paper compares a database practicum at the Regis University College for Professional Studies (CPS) with technology oriented practicums at other universities. Successful andragogy for technology courses can motivate students to develop a genuine interest in the subject, share their knowledge with peers and can inspire students to…

  2. DIDA: A curated and annotated digenic diseases database.

    PubMed

    Gazzo, Andrea M; Daneels, Dorien; Cilia, Elisa; Bonduelle, Maryse; Abramowicz, Marc; Van Dooren, Sonia; Smits, Guillaume; Lenaerts, Tom

    2016-01-01

    DIDA (DIgenic diseases DAtabase) is a novel database that provides for the first time detailed information on genes and associated genetic variants involved in digenic diseases, the simplest form of oligogenic inheritance. The database is accessible via http://dida.ibsquare.be and currently includes 213 digenic combinations involved in 44 different digenic diseases. These combinations are composed of 364 distinct variants, which are distributed over 136 distinct genes. The web interface provides browsing and search functionalities, as well as documentation and help pages, general database statistics and references to the original publications from which the data have been collected. The possibility to submit novel digenic data to DIDA is also provided. Creating this new repository was essential as current databases do not allow one to retrieve detailed records regarding digenic combinations. Genes, variants, diseases and digenic combinations in DIDA are annotated with manually curated information and information mined from other online resources. Next to providing a unique resource for the development of new analysis methods, DIDA gives clinical and molecular geneticists a tool to find the most comprehensive information on the digenic nature of their diseases of interest. PMID:26481352

  3. Saada: A Generator of Astronomical Database

    NASA Astrophysics Data System (ADS)

    Michel, L.

    2011-11-01

    Saada transforms a set of heterogeneous FITS files or VOtables of various categories (images, tables, spectra, etc.) in a powerful database deployed on the Web. Databases are located on your host and stay independent of any external server. This job doesn’t require writing code. Saada can mix data of various categories in multiple collections. Data collections can be linked each to others making relevant browsing paths and allowing data-mining oriented queries. Saada supports 4 VO services (Spectra, images, sources and TAP) . Data collections can be published immediately after the deployment of the Web interface.

  4. The Mars Express limbs observations database

    NASA Astrophysics Data System (ADS)

    Gondet, Brigitte; Bibring, Jean-Pierre; Montmessin, Franck; Giuranna, Marco; Hoffmann, Harald; Cardesin, Alejandro

    2015-04-01

    The capability to orient Mars Express allows a great diversity of observations modes, in particular nadir and limb. During day and night limb's observations, 4 out of 7 MEX instruments (the spectrometers: SPICAM, OMEGA, PFS and the high-resolution camera HRSC) work together to provide spectra (.12 µm to 45 µm) of the Martian atmosphere, at each altitude step, with the associated image. We will present the limbs database of more than 10 years in orbit with striking results (dust and clouds detached layers, day and night emissions). The database is now accessible to the scientific community via the ESA/PSA website (www.rssd.esa.int/PSA).

  5. Database computing in HEP. Abstract Only

    SciTech Connect

    Day, C.T.; Loken, S.; Macfarlane, J.F.; May, E.; Lifka, D.; Lusk, E.; Price, L.E.; Baden, A.; Grossman, R.; Qin, X.

    1992-12-31

    The major SSC experiments are expected to produce up to 1 Petabyte of data per year each. Once the primary reconstruction is completed by farms of inexpensive processors, I/O becomes a major factor in further analysis of the data. The authors believe that the application of database techniques can significantly reduce the I/O performed in these analyses. They present examples of such I/O reductions in prototypes based on relational and object-oriented databases of CDF data samples.

  6. Heterogenous database integration in a physician workstation.

    PubMed Central

    Annevelink, J.; Young, C. Y.; Tang, P. C.

    1991-01-01

    We discuss the integration of a variety of data and information sources in a Physician Workstation (PWS), focusing on the integration of data from DHCP, the Veteran Administration's Distributed Hospital Computer Program. We designed a logically centralized, object-oriented data-schema, used by end users and applications to explore the data accessible through an object-oriented database using a declarative query language. We emphasize the use of procedural abstraction to transparently integrate a variety of information sources into the data schema. PMID:1807624

  7. Visualization of multidimensional database

    NASA Astrophysics Data System (ADS)

    Lee, Chung

    2008-01-01

    The concept of multidimensional databases has been extensively researched and wildly used in actual database application. It plays an important role in contemporary information technology, but due to the complexity of its inner structure, the database design is a complicated process and users are having a hard time fully understanding and using the database. An effective visualization tool for higher dimensional information system helps database designers and users alike. Most visualization techniques focus on displaying dimensional data using spreadsheets and charts. This may be sufficient for the databases having three or fewer dimensions but for higher dimensions, various combinations of projection operations are needed and a full grasp of total database architecture is very difficult. This study reviews existing visualization techniques for multidimensional database and then proposes an alternate approach to visualize a database of any dimension by adopting the tool proposed by Kiviat for software engineering processes. In this diagramming method, each dimension is represented by one branch of concentric spikes. This paper documents a C++ based visualization tool with extensive use of OpenGL graphics library and GUI functions. Detailed examples of actual databases demonstrate the feasibility and effectiveness in visualizing multidimensional databases.

  8. Efficient analysis of mouse genome sequences reveal many nonsense variants.

    PubMed

    Steeland, Sophie; Timmermans, Steven; Van Ryckeghem, Sara; Hulpiau, Paco; Saeys, Yvan; Van Montagu, Marc; Vandenbroucke, Roosmarijn E; Libert, Claude

    2016-05-17

    Genetic polymorphisms in coding genes play an important role when using mouse inbred strains as research models. They have been shown to influence research results, explain phenotypical differences between inbred strains, and increase the amount of interesting gene variants present in the many available inbred lines. SPRET/Ei is an inbred strain derived from Mus spretus that has ∼1% sequence difference with the C57BL/6J reference genome. We obtained a listing of all SNPs and insertions/deletions (indels) present in SPRET/Ei from the Mouse Genomes Project (Wellcome Trust Sanger Institute) and processed these data to obtain an overview of all transcripts having nonsynonymous coding sequence variants. We identified 8,883 unique variants affecting 10,096 different transcripts from 6,328 protein-coding genes, which is about 28% of all coding genes. Because only a subset of these variants results in drastic changes in proteins, we focused on variations that are nonsense mutations that ultimately resulted in a gain of a stop codon. These genes were identified by in silico changing the C57BL/6J coding sequences to the SPRET/Ei sequences, converting them to amino acid (AA) sequences, and comparing the AA sequences. All variants and transcripts affected were also stored in a database, which can be browsed using a SPRET/Ei M. spretus variants web tool (www.spretus.org), including a manual. We validated the tool by demonstrating the loss of function of three proteins predicted to be severely truncated, namely Fas, IRAK2, and IFNγR1. PMID:27147605

  9. An Introduction to Database Structure and Database Machines.

    ERIC Educational Resources Information Center

    Detweiler, Karen

    1984-01-01

    Enumerates principal management objectives of database management systems (data independence, quality, security, multiuser access, central control) and criteria for comparison (response time, size, flexibility, other features). Conventional database management systems, relational databases, and database machines used for backend processing are…

  10. Rare missense variants in POT1 predispose to familial cutaneous malignant melanoma.

    PubMed

    Shi, Jianxin; Yang, Xiaohong R; Ballew, Bari; Rotunno, Melissa; Calista, Donato; Fargnoli, Maria Concetta; Ghiorzo, Paola; Bressac-de Paillerets, Brigitte; Nagore, Eduardo; Avril, Marie Francoise; Caporaso, Neil E; McMaster, Mary L; Cullen, Michael; Wang, Zhaoming; Zhang, Xijun; Bruno, William; Pastorino, Lorenza; Queirolo, Paola; Banuls-Roca, Jose; Garcia-Casado, Zaida; Vaysse, Amaury; Mohamdi, Hamida; Riazalhosseini, Yasser; Foglio, Mario; Jouenne, Fanélie; Hua, Xing; Hyland, Paula L; Yin, Jinhu; Vallabhaneni, Haritha; Chai, Weihang; Minghetti, Paola; Pellegrini, Cristina; Ravichandran, Sarangan; Eggermont, Alexander; Lathrop, Mark; Peris, Ketty; Scarra, Giovanna Bianchi; Landi, Giorgio; Savage, Sharon A; Sampson, Joshua N; He, Ji; Yeager, Meredith; Goldin, Lynn R; Demenais, Florence; Chanock, Stephen J; Tucker, Margaret A; Goldstein, Alisa M; Liu, Yie; Landi, Maria Teresa

    2014-05-01

    Although CDKN2A is the most frequent high-risk melanoma susceptibility gene, the underlying genetic factors for most melanoma-prone families remain unknown. Using whole-exome sequencing, we identified a rare variant that arose as a founder mutation in the telomere shelterin gene POT1 (chromosome 7, g.124493086C>T; p.Ser270Asn) in five unrelated melanoma-prone families from Romagna, Italy. Carriers of this variant had increased telomere lengths and numbers of fragile telomeres, suggesting that this variant perturbs telomere maintenance. Two additional rare POT1 variants were identified in all cases sequenced in two separate Italian families, one variant per family, yielding a frequency for POT1 variants comparable to that for CDKN2A mutations in this population. These variants were not found in public databases or in 2,038 genotyped Italian controls. We also identified two rare recurrent POT1 variants in US and French familial melanoma cases. Our findings suggest that POT1 is a major susceptibility gene for familial melanoma in several populations. PMID:24686846

  11. Variobox: automatic detection and annotation of human genetic variants.

    PubMed

    Gaspar, Paulo; Lopes, Pedro; Oliveira, Jorge; Santos, Rosário; Dalgleish, Raymond; Oliveira, José Luís

    2014-02-01

    Triggered by the sequencing of the human genome, personalized medicine has been one of the fastest growing research areas in the last decade. Multiple software and hardware technologies have been developed by several projects, culminating in the exponential growth of genetic data. Considering the technological developments in this field, it is now fairly easy and inexpensive to obtain genetic profiles for unique individuals, such as those performed by several genetic analysis companies. The availability of computational tools that simplify genetic data analysis and the disclosure of biomedical evidences are of utmost importance. We present Variobox, a desktop tool to annotate, analyze, and compare human genes. Variobox obtains variant annotation data from WAVe, protein metadata annotations from Protein Data Bank, and sequences are obtained from Locus Reference Genomic or RefSeq databases. To explore the data, Variobox provides an advanced sequence visualization that enables agile navigation through genetic regions. DNA sequencing data can be compared with reference sequences retrieved from LRG or RefSeq records, identifying and automatically annotating new potential variants. These features and data, ranging from patient sequences to HGVS-compliant variant descriptions, are combined in an intuitive interface to analyze genes and variants. Variobox is a Java application, available at http://bioinformatics.ua.pt/variobox. PMID:24186831

  12. USING BIOBIN TO EXPLORE RARE VARIANT POPULATION STRATIFICATION*

    PubMed Central

    Moore, Carrie B.; Wallace, John R.; Frase, Alex T.; Pendergrass, Sarah A.; Ritchie, Marylyn D.

    2013-01-01

    Rare variants (RVs) will likely explain additional heritability of many common complex diseases; however, the natural frequencies of rare variation across and between human populations are largely unknown. We have developed a powerful, flexible collapsing method called BioBin that utilizes prior biological knowledge using multiple publicly available database sources to direct analyses. Variants can be collapsed according to functional regions, evolutionary conserved regions, regulatory regions, genes, and/or pathways without the need for external files. We conducted an extensive comparison of rare variant burden differences (MAF < 0.03) between two ancestry groups from 1000 Genomes Project data, Yoruba (YRI) and European descent (CEU) individuals. We found that 56.86% of gene bins, 72.73% of intergenic bins, 69.45% of pathway bins, 32.36% of ORegAnno annotated bins, and 9.10% of evolutionary conserved regions (shared with primates) have statistically significant differences in RV burden. Ongoing efforts include examining additional regional characteristics using regulatory regions and protein binding domains. Our results show interesting variant differences between two ancestral populations and demonstrate that population stratification is a pervasive concern for sequence analyses. PMID:23424138

  13. The Autoimmune Disease Database: a dynamically compiled literature-derived database

    PubMed Central

    Karopka, Thomas; Fluck, Juliane; Mevissen, Heinz-Theodor; Glass, Änne

    2006-01-01

    Background Autoimmune diseases are disorders caused by an immune response directed against the body's own organs, tissues and cells. In practice more than 80 clinically distinct diseases, among them systemic lupus erythematosus and rheumatoid arthritis, are classified as autoimmune diseases. Although their etiology is unclear these diseases share certain similarities at the molecular level i.e. susceptibility regions on the chromosomes or the involvement of common genes. To gain an overview of these related diseases it is not feasible to do a literary review but it requires methods of automated analyses of the more than 500,000 Medline documents related to autoimmune disorders. Results In this paper we present the first version of the Autoimmune Disease Database which to our knowledge is the first comprehensive literature-based database covering all known or suspected autoimmune diseases. This dynamically compiled database allows researchers to link autoimmune diseases to the candidate genes or proteins through the use of named entity recognition which identifies genes/proteins in the corresponding Medline abstracts. The Autoimmune Disease Database covers 103 autoimmune disease concepts. This list was expanded to include synonyms and spelling variants yielding a list of over 1,200 disease names. The current version of the database provides links to 541,690 abstracts and over 5,000 unique genes/proteins. Conclusion The Autoimmune Disease Database provides the researcher with a tool to navigate potential gene-disease relationships in Medline abstracts in the context of autoimmune diseases. PMID:16803617

  14. Orientation and Dynamics of Synthetic Transbilayer Polypeptides Containing GpATM Dimerization Motifs

    PubMed Central

    McDonald, Mark C.; Booth, Valerie; Morrow, Michael R.

    2011-01-01

    Deuterium NMR spectroscopy was used to study how the positioning of a dimerization motif within a transbilayer polypeptide influences its orientation and dynamics in bilayers. Three polypeptide variants comprising glycophorin A transmembrane (GpATM) dimerization motifs incorporated into lysine-terminated poly-leucine-alanine helices were mixed into 1-palmitoyl-2-oleoyl-sn-glycero-3-phosphatidylcholine multilamellar vesicles. The variants differed in orientation of the motif segment around the helix axis with respect to the peptide ends. Polypeptides were labeled with methyl-deuterated alanines at positions that were identically situated relative to the peptide ends (Ala-20 and Ala-22) and at two positions within the motif. An analysis of quadrupole splittings revealed similar tilts and orientations of the peptide ends for all three variants, suggesting that average orientations were dominated by interactions at the bilayer surface. For one variant, however, fast orientational fluctuations about the helix axis were significantly smaller. This may indicate some perturbation of peptide dynamics and conformation by interactions that are sensitive to the motif orientation relative to the peptide ends. For the variant that displayed distinct dynamics, one orientation consistent with observed splittings corresponded to the motif being situated such that its two glycines were particularly accessible to adjacent peptides. PMID:21281580

  15. 2010 Worldwide Gasification Database

    DOE Data Explorer

    The 2010 Worldwide Gasification Database describes the current world gasification industry and identifies near-term planned capacity additions. The database lists gasification projects and includes information (e.g., plant location, number and type of gasifiers, syngas capacity, feedstock, and products). The database reveals that the worldwide gasification capacity has continued to grow for the past several decades and is now at 70,817 megawatts thermal (MWth) of syngas output at 144 operating plants with a total of 412 gasifiers.

  16. ITS-90 Thermocouple Database

    National Institute of Standards and Technology Data Gateway

    SRD 60 NIST ITS-90 Thermocouple Database (Web, free access)   Web version of Standard Reference Database 60 and NIST Monograph 175. The database gives temperature -- electromotive force (emf) reference functions and tables for the letter-designated thermocouple types B, E, J, K, N, R, S and T. These reference functions have been adopted as standards by the American Society for Testing and Materials (ASTM) and the International Electrotechnical Commission (IEC).

  17. Human genetic variation database, a reference database of genetic variations in the Japanese population

    PubMed Central

    Higasa, Koichiro; Miyake, Noriko; Yoshimura, Jun; Okamura, Kohji; Niihori, Tetsuya; Saitsu, Hirotomo; Doi, Koichiro; Shimizu, Masakazu; Nakabayashi, Kazuhiko; Aoki, Yoko; Tsurusaki, Yoshinori; Morishita, Shinichi; Kawaguchi, Takahisa; Migita, Osuke; Nakayama, Keiko; Nakashima, Mitsuko; Mitsui, Jun; Narahara, Maiko; Hayashi, Keiko; Funayama, Ryo; Yamaguchi, Daisuke; Ishiura, Hiroyuki; Ko, Wen-Ya; Hata, Kenichiro; Nagashima, Takeshi; Yamada, Ryo; Matsubara, Yoichi; Umezawa, Akihiro; Tsuji, Shoji; Matsumoto, Naomichi; Matsuda, Fumihiko

    2016-01-01

    Whole-genome and -exome resequencing using next-generation sequencers is a powerful approach for identifying genomic variations that are associated with diseases. However, systematic strategies for prioritizing causative variants from many candidates to explain the disease phenotype are still far from being established, because the population-specific frequency spectrum of genetic variation has not been characterized. Here, we have collected exomic genetic variation from 1208 Japanese individuals through a collaborative effort, and aggregated the data into a prevailing catalog. In total, we identified 156 622 previously unreported variants. The allele frequencies for the majority (88.8%) were lower than 0.5% in allele frequency and predicted to be functionally deleterious. In addition, we have constructed a Japanese-specific major allele reference genome by which the number of unique mapping of the short reads in our data has increased 0.045% on average. Our results illustrate the importance of constructing an ethnicity-specific reference genome for identifying rare variants. All the collected data were centralized to a newly developed database to serve as useful resources for exploring pathogenic variations. Public access to the database is available at http://www.genome.med.kyoto-u.ac.jp/SnpDB/. PMID:26911352

  18. Opening CEM vendor databases

    SciTech Connect

    Long, A.; Patel, D.

    1995-12-31

    CEM database performance requirements (i.e., voluminous data storage, rapid response times) often conflict with the concept of an open, accessible database. Utilities would like to use their CEM data for more purposes than simply submitting environmental reports. But in most cases, other uses are inhibited because today`s sophisticated CEM systems incorporate databases that have forsaken openness and accessibility in favor of performance. Several options are available for CEM vendors wishing to move in the direction of open, accessible CEM databases.

  19. Databases for Microbiologists

    PubMed Central

    2015-01-01

    Databases play an increasingly important role in biology. They archive, store, maintain, and share information on genes, genomes, expression data, protein sequences and structures, metabolites and reactions, interactions, and pathways. All these data are critically important to microbiologists. Furthermore, microbiology has its own databases that deal with model microorganisms, microbial diversity, physiology, and pathogenesis. Thousands of biological databases are currently available, and it becomes increasingly difficult to keep up with their development. The purpose of this minireview is to provide a brief survey of current databases that are of interest to microbiologists. PMID:26013493

  20. Veterans Administration Databases

    Cancer.gov

    The Veterans Administration Information Resource Center provides database and informatics experts, customer service, expert advice, information products, and web technology to VA researchers and others.

  1. Backing up DMF Databases

    NASA Technical Reports Server (NTRS)

    Cardo, Nicholas P.; Woodrow, Thomas (Technical Monitor)

    1994-01-01

    A complete backup of the Cray Data Migration Facility (DMF) databases should include the data migration databases, all media specific process' (MSP's) databases, and the journal file. The backup should be able to accomplished without impacting users or stopping DMF. The High Speed Processors group at the Numerical Aerodynamics Simulation (NAS) Facility at NASA Ames Research Center undertook the task of finding an effective and efficient way to backup all DMF databases. This has been accomplished by taking advantage of new features introduced in DMF 2.0 and adding a minor modification to the dmdaemon. This paper discusses the investigation and the changes necessary to implement these enhancements.

  2. ClinVar: public archive of interpretations of clinically relevant variants.

    PubMed

    Landrum, Melissa J; Lee, Jennifer M; Benson, Mark; Brown, Garth; Chao, Chen; Chitipiralla, Shanmuga; Gu, Baoshan; Hart, Jennifer; Hoffman, Douglas; Hoover, Jeffrey; Jang, Wonhee; Katz, Kenneth; Ovetsky, Michael; Riley, George; Sethi, Amanjeev; Tully, Ray; Villamarin-Salomon, Ricardo; Rubinstein, Wendy; Maglott, Donna R

    2016-01-01

    ClinVar (https://www.ncbi.nlm.nih.gov/clinvar/) at the National Center for Biotechnology Information (NCBI) is a freely available archive for interpretations of clinical significance of variants for reported conditions. The database includes germline and somatic variants of any size, type or genomic location. Interpretations are submitted by clinical testing laboratories, research laboratories, locus-specific databases, OMIM®, GeneReviews™, UniProt, expert panels and practice guidelines. In NCBI's Variation submission portal, submitters upload batch submissions or use the Submission Wizard for single submissions. Each submitted interpretation is assigned an accession number prefixed with SCV. ClinVar staff review validation reports with data types such as HGVS (Human Genome Variation Society) expressions; however, clinical significance is reported directly from submitters. Interpretations are aggregated by variant-condition combination and assigned an accession number prefixed with RCV. Clinical significance is calculated for the aggregate record, indicating consensus or conflict in the submitted interpretations. ClinVar uses data standards, such as HGVS nomenclature for variants and MedGen identifiers for conditions. The data are available on the web as variant-specific views; the entire data set can be downloaded via ftp. Programmatic access for ClinVar records is available through NCBI's E-utilities. Future development includes providing a variant-centric XML archive and a web page for details of SCV submissions. PMID:26582918

  3. ClinVar: public archive of interpretations of clinically relevant variants

    PubMed Central

    Landrum, Melissa J.; Lee, Jennifer M.; Benson, Mark; Brown, Garth; Chao, Chen; Chitipiralla, Shanmuga; Gu, Baoshan; Hart, Jennifer; Hoffman, Douglas; Hoover, Jeffrey; Jang, Wonhee; Katz, Kenneth; Ovetsky, Michael; Riley, George; Sethi, Amanjeev; Tully, Ray; Villamarin-Salomon, Ricardo; Rubinstein, Wendy; Maglott, Donna R.

    2016-01-01

    ClinVar (https://www.ncbi.nlm.nih.gov/clinvar/) at the National Center for Biotechnology Information (NCBI) is a freely available archive for interpretations of clinical significance of variants for reported conditions. The database includes germline and somatic variants of any size, type or genomic location. Interpretations are submitted by clinical testing laboratories, research laboratories, locus-specific databases, OMIM®, GeneReviews™, UniProt, expert panels and practice guidelines. In NCBI's Variation submission portal, submitters upload batch submissions or use the Submission Wizard for single submissions. Each submitted interpretation is assigned an accession number prefixed with SCV. ClinVar staff review validation reports with data types such as HGVS (Human Genome Variation Society) expressions; however, clinical significance is reported directly from submitters. Interpretations are aggregated by variant-condition combination and assigned an accession number prefixed with RCV. Clinical significance is calculated for the aggregate record, indicating consensus or conflict in the submitted interpretations. ClinVar uses data standards, such as HGVS nomenclature for variants and MedGen identifiers for conditions. The data are available on the web as variant-specific views; the entire data set can be downloaded via ftp. Programmatic access for ClinVar records is available through NCBI's E-utilities. Future development includes providing a variant-centric XML archive and a web page for details of SCV submissions. PMID:26582918

  4. The EUVE proposal database and scheduling system

    NASA Technical Reports Server (NTRS)

    Christian, C.; Olson, E.; Jelinsky, P.; Samuel, M.

    1992-01-01

    The proposal database and scheduling system for the Extreme Ultraviolet Explorer will be described. The proposal database has been implemented to take input for approved observations selected by the EUVE Peer Review Panel and output target information suitable for the scheduling system to digest. The scheduling system is a hybrid of the SPIKE program and EUVE software which checks spacecraft constraints, produces a proposed schedule and selects spacecraft orientations with optimal configurations for acquiring star trackers, etc. This system has been used to schedule the In Orbit Calibration activities that took place this summer, following the EUVE launch in early June 1992. The implemented strategy has implications for the selection of approved targets, which have impacted the Peer Review process. In addition, it will be discussed how the proposal database, founded on Sybase, controls the processing of EUVE Guest Observer data.

  5. Producing approximate answers to database queries

    NASA Technical Reports Server (NTRS)

    Vrbsky, Susan V.; Liu, Jane W. S.

    1993-01-01

    We have designed and implemented a query processor, called APPROXIMATE, that makes approximate answers available if part of the database is unavailable or if there is not enough time to produce an exact answer. The accuracy of the approximate answers produced improves monotonically with the amount of data retrieved to produce the result. The exact answer is produced if all of the needed data are available and query processing is allowed to continue until completion. The monotone query processing algorithm of APPROXIMATE works within the standard relational algebra framework and can be implemented on a relational database system with little change to the relational architecture. We describe here the approximation semantics of APPROXIMATE that serves as the basis for meaningful approximations of both set-valued and single-valued queries. We show how APPROXIMATE is implemented to make effective use of semantic information, provided by an object-oriented view of the database, and describe the additional overhead required by APPROXIMATE.

  6. Atomic Spectra Database (ASD)

    National Institute of Standards and Technology Data Gateway

    SRD 78 NIST Atomic Spectra Database (ASD) (Web, free access)   This database provides access and search capability for NIST critically evaluated data on atomic energy levels, wavelengths, and transition probabilities that are reasonably up-to-date. The NIST Atomic Spectroscopy Data Center has carried out these critical compilations.

  7. CDS - Database Administrator's Guide

    NASA Astrophysics Data System (ADS)

    Day, J. P.

    This guide aims to instruct the CDS database administrator in: o The CDS file system. o The CDS index files. o The procedure for assimilating a new CDS tape into the database. It is assumed that the administrator has read SUN/79.

  8. Ionic Liquids Database- (ILThermo)

    National Institute of Standards and Technology Data Gateway

    SRD 147 Ionic Liquids Database- (ILThermo) (Web, free access)   IUPAC Ionic Liquids Database, ILThermo, is a free web research tool that allows users worldwide to access an up-to-date data collection from the publications on experimental investigations of thermodynamic, and transport properties of ionic liquids as well as binary and ternary mixtures containing ionic liquids.

  9. Database Searching by Managers.

    ERIC Educational Resources Information Center

    Arnold, Stephen E.

    Managers and executives need the easy and quick access to business and management information that online databases can provide, but many have difficulty articulating their search needs to an intermediary. One possible solution would be to encourage managers and their immediate support staff members to search textual databases directly as they now…

  10. Morchella MLST database

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Welcome to the Morchella MLST database. This dedicated database was set up at the CBS-KNAW Biodiversity Center by Vincent Robert in February 2012, using BioloMICS software (Robert et al., 2011), to facilitate DNA sequence-based identifications of Morchella species via the Internet. The current datab...

  11. First Look: TRADEMARKSCAN Database.

    ERIC Educational Resources Information Center

    Fernald, Anne Conway; Davidson, Alan B.

    1984-01-01

    Describes database produced by Thomson and Thomson and available on Dialog which contains over 700,000 records representing all active federal trademark registrations and applications for registrations filed in United States Patent and Trademark Office. A typical record, special features, database applications, learning to use TRADEMARKSCAN, and…

  12. HIV Structural Database

    National Institute of Standards and Technology Data Gateway

    SRD 102 HIV Structural Database (Web, free access)   The HIV Protease Structural Database is an archive of experimentally determined 3-D structures of Human Immunodeficiency Virus 1 (HIV-1), Human Immunodeficiency Virus 2 (HIV-2) and Simian Immunodeficiency Virus (SIV) Proteases and their complexes with inhibitors or products of substrate cleavage.

  13. Biological Macromolecule Crystallization Database

    National Institute of Standards and Technology Data Gateway

    SRD 21 Biological Macromolecule Crystallization Database (Web, free access)   The Biological Macromolecule Crystallization Database and NASA Archive for Protein Crystal Growth Data (BMCD) contains the conditions reported for the crystallization of proteins and nucleic acids used in X-ray structure determinations and archives the results of microgravity macromolecule crystallization studies.

  14. Assignment to database industy

    NASA Astrophysics Data System (ADS)

    Abe, Kohichiroh

    Various kinds of databases are considered to be essential part in future large sized systems. Information provision only by databases is also considered to be growing as the market becomes mature. This paper discusses how such circumstances have been built and will be developed from now on.

  15. Dictionary as Database.

    ERIC Educational Resources Information Center

    Painter, Derrick

    1996-01-01

    Discussion of dictionaries as databases focuses on the digitizing of The Oxford English dictionary (OED) and the use of Standard Generalized Mark-Up Language (SGML). Topics include the creation of a consortium to digitize the OED, document structure, relational databases, text forms, sequence, and discourse. (LRW)

  16. A Quality System Database

    NASA Technical Reports Server (NTRS)

    Snell, William H.; Turner, Anne M.; Gifford, Luther; Stites, William

    2010-01-01

    A quality system database (QSD), and software to administer the database, were developed to support recording of administrative nonconformance activities that involve requirements for documentation of corrective and/or preventive actions, which can include ISO 9000 internal quality audits and customer complaints.

  17. BioImaging Database

    Energy Science and Technology Software Center (ESTSC)

    2006-10-25

    The Biolmaging Database (BID) is a relational database developed to store the data and meta-data for the 3D gene expression in early Drosophila embryo development on a cellular level. The schema was written to be used with the MySQL DBMS but with minor modifications can be used on any SQL compliant relational DBMS.

  18. The intelligent database machine

    NASA Technical Reports Server (NTRS)

    Yancey, K. E.

    1985-01-01

    The IDM data base was compared with the data base crack to determine whether IDM 500 would better serve the needs of the MSFC data base management system than Oracle. The two were compared and the performance of the IDM was studied. Implementations that work best on which database are implicated. The choice is left to the database administrator.

  19. Build Your Own Database.

    ERIC Educational Resources Information Center

    Jacso, Peter; Lancaster, F. W.

    This book is intended to help librarians and others to produce databases of better value and quality, especially if they have had little previous experience in database construction. Drawing upon almost 40 years of experience in the field of information retrieval, this book emphasizes basic principles and approaches rather than in-depth and…

  20. Database Reviews: Legal Information.

    ERIC Educational Resources Information Center

    Seiser, Virginia

    Detailed reviews of two legal information databases--"Laborlaw I" and "Legal Resource Index"--are presented in this paper. Each database review begins with a bibliographic entry listing the title; producer; vendor; cost per hour contact time; offline print cost per citation; time period covered; frequency of updates; and size of file. A detailed…

  1. Database in Artificial Intelligence.

    ERIC Educational Resources Information Center

    Wilkinson, Julia

    1986-01-01

    Describes a specialist bibliographic database of literature in the field of artificial intelligence created by the Turing Institute (Glasgow, Scotland) using the BRS/Search information retrieval software. The subscription method for end-users--i.e., annual fee entitles user to unlimited access to database, document provision, and printed awareness…

  2. Structural Ceramics Database

    National Institute of Standards and Technology Data Gateway

    SRD 30 NIST Structural Ceramics Database (Web, free access)   The NIST Structural Ceramics Database (WebSCD) provides evaluated materials property data for a wide range of advanced ceramics known variously as structural ceramics, engineering ceramics, and fine ceramics.

  3. National Vulnerability Database (NVD)

    National Institute of Standards and Technology Data Gateway

    National Vulnerability Database (NVD) (Web, free access)   NVD is a comprehensive cyber security vulnerability database that integrates all publicly available U.S. Government vulnerability resources and provides references to industry resources. It is based on and synchronized with the CVE vulnerability naming standard.

  4. Knowledge Discovery in Databases.

    ERIC Educational Resources Information Center

    Norton, M. Jay

    1999-01-01

    Knowledge discovery in databases (KDD) revolves around the investigation and creation of knowledge, processes, algorithms, and mechanisms for retrieving knowledge from data collections. The article is an introductory overview of KDD. The rationale and environment of its development and applications are discussed. Issues related to database design…

  5. Online Database Searching Workbook.

    ERIC Educational Resources Information Center

    Littlejohn, Alice C.; Parker, Joan M.

    Designed primarily for use by first-time searchers, this workbook provides an overview of online searching. Following a brief introduction which defines online searching, databases, and database producers, five steps in carrying out a successful search are described: (1) identifying the main concepts of the search statement; (2) selecting a…

  6. CPDB: Carcinogenic Potency Database.

    PubMed

    Fitzpatrick, Roberta Bronson

    2008-01-01

    The Carcinogenic Potency Database reports analyses of animal cancer tests on 1,547 chemicals. These tests are used in support of cancer risk assessments for humans. Results are searchable and are made available via the National Library of Medicine's (NLM) TOXNET system. This column will provide background information on the database, as well as present search basics. PMID:19042710

  7. The UCSC Genome Browser database: 2014 update

    PubMed Central

    Karolchik, Donna; Barber, Galt P.; Casper, Jonathan; Clawson, Hiram; Cline, Melissa S.; Diekhans, Mark; Dreszer, Timothy R.; Fujita, Pauline A.; Guruvadoo, Luvina; Haeussler, Maximilian; Harte, Rachel A.; Heitner, Steve; Hinrichs, Angie S.; Learned, Katrina; Lee, Brian T.; Li, Chin H.; Raney, Brian J.; Rhead, Brooke; Rosenbloom, Kate R.; Sloan, Cricket A.; Speir, Matthew L.; Zweig, Ann S.; Haussler, David; Kuhn, Robert M.; Kent, W. James

    2014-01-01

    The University of California Santa Cruz (UCSC) Genome Browser (http://genome.ucsc.edu) offers online public access to a growing database of genomic sequence and annotations for a large collection of organisms, primarily vertebrates, with an emphasis on the human and mouse genomes. The Browser’s web-based tools provide an integrated environment for visualizing, comparing, analysing and sharing both publicly available and user-generated genomic data sets. As of September 2013, the database contained genomic sequence and a basic set of annotation ‘tracks’ for ∼90 organisms. Significant new annotations include a 60-species multiple alignment conservation track on the mouse, updated UCSC Genes tracks for human and mouse, and several new sets of variation and ENCODE data. New software tools include a Variant Annotation Integrator that returns predicted functional effects of a set of variants uploaded as a custom track, an extension to UCSC Genes that displays haplotype alleles for protein-coding genes and an expansion of data hubs that includes the capability to display remotely hosted user-provided assembly sequence in addition to annotation data. To improve European access, we have added a Genome Browser mirror (http://genome-euro.ucsc.edu) hosted at Bielefeld University in Germany. PMID:24270787

  8. SuSPect: enhanced prediction of single amino acid variant (SAV) phenotype using network features.

    PubMed

    Yates, Christopher M; Filippis, Ioannis; Kelley, Lawrence A; Sternberg, Michael J E

    2014-07-15

    Whole-genome and exome sequencing studies reveal many genetic variants between individuals, some of which are linked to disease. Many of these variants lead to single amino acid variants (SAVs), and accurate prediction of their phenotypic impact is important. Incorporating sequence conservation and network-level features, we have developed a method, SuSPect (Disease-Susceptibility-based SAV Phenotype Prediction), for predicting how likely SAVs are to be associated with disease. SuSPect performs significantly better than other available batch methods on the VariBench benchmarking dataset, with a balanced accuracy of 82%. SuSPect is available at www.sbg.bio.ic.ac.uk/suspect. The Web site has been implemented in Perl and SQLite and is compatible with modern browsers. An SQLite database of possible missense variants in the human proteome is available to download at www.sbg.bio.ic.ac.uk/suspect/download.html. PMID:24810707

  9. Cascadia Tsunami Deposit Database

    USGS Publications Warehouse

    Peters, Robert; Jaffe, Bruce; Gelfenbaum, Guy; Peterson, Curt

    2003-01-01

    The Cascadia Tsunami Deposit Database contains data on the location and sedimentological properties of tsunami deposits found along the Cascadia margin. Data have been compiled from 52 studies, documenting 59 sites from northern California to Vancouver Island, British Columbia that contain known or potential tsunami deposits. Bibliographical references are provided for all sites included in the database. Cascadia tsunami deposits are usually seen as anomalous sand layers in coastal marsh or lake sediments. The studies cited in the database use numerous criteria based on sedimentary characteristics to distinguish tsunami deposits from sand layers deposited by other processes, such as river flooding and storm surges. Several studies cited in the database contain evidence for more than one tsunami at a site. Data categories include age, thickness, layering, grainsize, and other sedimentological characteristics of Cascadia tsunami deposits. The database documents the variability observed in tsunami deposits found along the Cascadia margin.

  10. Protein sequence databases.

    PubMed

    Apweiler, Rolf; Bairoch, Amos; Wu, Cathy H

    2004-02-01

    A variety of protein sequence databases exist, ranging from simple sequence repositories, which store data with little or no manual intervention in the creation of the records, to expertly curated universal databases that cover all species and in which the original sequence data are enhanced by the manual addition of further information in each sequence record. As the focus of researchers moves from the genome to the proteins encoded by it, these databases will play an even more important role as central comprehensive resources of protein information. Several the leading protein sequence databases are discussed here, with special emphasis on the databases now provided by the Universal Protein Knowledgebase (UniProt) consortium. PMID:15036160

  11. Locus-specific database domain and data content analysis: evolution and content maturation toward clinical use.

    PubMed

    Mitropoulou, Christina; Webb, Adam J; Mitropoulos, Konstantinos; Brookes, Anthony J; Patrinos, George P

    2010-10-01

    Genetic variation databases have become indispensable in many areas of health care. In addition, more and more experts are depositing published and unpublished disease-causing variants of particular genes into locus-specific databases (LSDBs). Some of these databases contain such extensive information that they have become known as knowledge bases. Here, we analyzed 1,188 LSDBs and their content for the presence or absence of 44 content criteria related to database features (general presentation, locus-specific information, database structure) and data content (data collection, summary table of variants, database querying). Our analyses revealed that several elements have helped to advance the field and reduce data heterogeneity, such as the development of specialized database management systems and the creation of data querying tools. We also identified a number of deficiencies, namely, the lack of detailed disease and phenotypic descriptions for each genetic variant and links to relevant patient organizations, which, if addressed, would allow LSDBs to better serve the clinical genetics community. We propose a structure, based on LSDBs and closely related repositories (namely, clinical genetics databases), which would contribute to a federated genetic variation browser and also allow the maintenance of variation data. PMID:20672379

  12. Variant (Swine Origin) Influenza Viruses in Humans

    MedlinePlus

    ... What's this? Submit Button Past Newsletters Variant Influenza Viruses: Background and CDC Risk Assessment and Reporting Language: ... Background CDC Assessment Reporting Background On Variant Influenza Viruses Swine flu viruses do not normally infect humans. ...

  13. Swine Influenza/Variant Influenza Viruses

    MedlinePlus

    ... Humans Key Facts about Human Infections with Variant Viruses Interim Guidance for Clinicians on Human Infections Background, Risk Assessment & Reporting Reported Infections with Variant Influenza Viruses in the United States since 2005 Prevention Treatment ...

  14. BRONCO: Biomedical entity Relation ONcology COrpus for extracting gene-variant-disease-drug relations

    PubMed Central

    Lee, Kyubum; Lee, Sunwon; Park, Sungjoon; Kim, Sunkyu; Kim, Suhkyung; Choi, Kwanghun; Tan, Aik Choon; Kang, Jaewoo

    2016-01-01

    Comprehensive knowledge of genomic variants in a biological context is key for precision medicine. As next-generation sequencing technologies improve, the amount of literature containing genomic variant data, such as new functions or related phenotypes, rapidly increases. Because numerous articles are published every day, it is almost impossible to manually curate all the variant information from the literature. Many researchers focus on creating an improved automated biomedical natural language processing (BioNLP) method that extracts useful variants and their functional information from the literature. However, there is no gold-standard data set that contains texts annotated with variants and their related functions. To overcome these limitations, we introduce a Biomedical entity Relation ONcology COrpus (BRONCO) that contains more than 400 variants and their relations with genes, diseases, drugs and cell lines in the context of cancer and anti-tumor drug screening research. The variants and their relations were manually extracted from 108 full-text articles. BRONCO can be utilized to evaluate and train new methods used for extracting biomedical entity relations from full-text publications, and thus be a valuable resource to the biomedical text mining research community. Using BRONCO, we quantitatively and qualitatively evaluated the performance of three state-of-the-art BioNLP methods. We also identified their shortcomings, and suggested remedies for each method. We implemented post-processing modules for the three BioNLP methods, which improved their performance. Database URL: http://infos.korea.ac.kr/bronco PMID:27074804

  15. BRONCO: Biomedical entity Relation ONcology COrpus for extracting gene-variant-disease-drug relations.

    PubMed

    Lee, Kyubum; Lee, Sunwon; Park, Sungjoon; Kim, Sunkyu; Kim, Suhkyung; Choi, Kwanghun; Tan, Aik Choon; Kang, Jaewoo

    2016-01-01

    Comprehensive knowledge of genomic variants in a biological context is key for precision medicine. As next-generation sequencing technologies improve, the amount of literature containing genomic variant data, such as new functions or related phenotypes, rapidly increases. Because numerous articles are published every day, it is almost impossible to manually curate all the variant information from the literature. Many researchers focus on creating an improved automated biomedical natural language processing (BioNLP) method that extracts useful variants and their functional information from the literature. However, there is no gold-standard data set that contains texts annotated with variants and their related functions. To overcome these limitations, we introduce a Biomedical entity Relation ONcology COrpus (BRONCO) that contains more than 400 variants and their relations with genes, diseases, drugs and cell lines in the context of cancer and anti-tumor drug screening research. The variants and their relations were manually extracted from 108 full-text articles. BRONCO can be utilized to evaluate and train new methods used for extracting biomedical entity relations from full-text publications, and thus be a valuable resource to the biomedical text mining research community. Using BRONCO, we quantitatively and qualitatively evaluated the performance of three state-of-the-art BioNLP methods. We also identified their shortcomings, and suggested remedies for each method. We implemented post-processing modules for the three BioNLP methods, which improved their performance.Database URL:http://infos.korea.ac.kr/bronco. PMID:27074804

  16. Do Variants Associated with Susceptibility to Pancreatic Cancer and Type 2 Diabetes Reciprocally Affect Risk?

    PubMed Central

    Wu, Lang; Rabe, Kari G.; Petersen, Gloria M.

    2015-01-01

    Objectives Although type 2 diabetes mellitus is a known risk factor for pancreatic cancer, the existence of shared genetic susceptibility is largely unknown. We evaluated whether any reported genetic risk variants of either disease found by genome-wide association studies reciprocally confer susceptibility. Methods Data that were generated in previous genome-wide association studies (GENEVA Type 2 Diabetes; PanScan) were obtained through the National Institutes of Health database of Genotypes and Phenotypes (dbGaP). Using the PanScan datasets, we tested for association of 38 variants within 37 genomic regions known to be susceptibility factors for type 2 diabetes. We further examined whether type 2 diabetes variants predispose to pancreatic cancer risk stratified by diabetes status. Correspondingly, we examined the association of fourteen pancreatic cancer susceptibility variants within eight genomic regions in the GENEVA Type 2 Diabetes dataset. Results Four plausible associations of diabetes variants and pancreatic cancer risk were detected at a significance threshold of p = 0.05, and one pancreatic cancer susceptibility variant was associated with diabetes risk at threshold of p = 0.05, but none remained significant after correction for multiple comparisons. Conclusion Currently identified GWAS susceptibility variants are unlikely to explain the potential shared genetic etiology between Type 2 diabetes and pancreatic cancer. PMID:25658847

  17. Lateral orientation (image)

    MedlinePlus

    A lateral orientation is a position away from the midline of the body. For instance, the arms are lateral to the ... ears are lateral to the head. A medial orientation is a position toward the midline of the ...

  18. Theories of Sexual Orientation.

    ERIC Educational Resources Information Center

    Storms, Michael D.

    1980-01-01

    Results indicated homosexuals, heterosexuals, and bisexuals did not differ within each sex on measures of masculinity and femininity. Strong support was obtained for the hypothesis that sexual orientation relates primarily to erotic fantasy orientation. (Author/DB)

  19. Hazard Analysis Database Report

    SciTech Connect

    GRAMS, W.H.

    2000-12-28

    The Hazard Analysis Database was developed in conjunction with the hazard analysis activities conducted in accordance with DOE-STD-3009-94, Preparation Guide for U S . Department of Energy Nonreactor Nuclear Facility Safety Analysis Reports, for HNF-SD-WM-SAR-067, Tank Farms Final Safety Analysis Report (FSAR). The FSAR is part of the approved Authorization Basis (AB) for the River Protection Project (RPP). This document describes, identifies, and defines the contents and structure of the Tank Farms FSAR Hazard Analysis Database and documents the configuration control changes made to the database. The Hazard Analysis Database contains the collection of information generated during the initial hazard evaluations and the subsequent hazard and accident analysis activities. The Hazard Analysis Database supports the preparation of Chapters 3 ,4 , and 5 of the Tank Farms FSAR and the Unreviewed Safety Question (USQ) process and consists of two major, interrelated data sets: (1) Hazard Analysis Database: Data from the results of the hazard evaluations, and (2) Hazard Topography Database: Data from the system familiarization and hazard identification.

  20. PADB : Published Association Database

    PubMed Central

    Rhee, Hwanseok; Lee, Jin-Sung

    2007-01-01

    Background Although molecular pathway information and the International HapMap Project data can help biomedical researchers to investigate the aetiology of complex diseases more effectively, such information is missing or insufficient in current genetic association databases. In addition, only a few of the environmental risk factors are included as gene-environment interactions, and the risk measures of associations are not indexed in any association databases. Description We have developed a published association database (PADB; ) that includes both the genetic associations and the environmental risk factors available in PubMed database. Each genetic risk factor is linked to a molecular pathway database and the HapMap database through human gene symbols identified in the abstracts. And the risk measures such as odds ratios or hazard ratios are extracted automatically from the abstracts when available. Thus, users can review the association data sorted by the risk measures, and genetic associations can be grouped by human genes or molecular pathways. The search results can also be saved to tab-delimited text files for further sorting or analysis. Currently, PADB indexes more than 1,500,000 PubMed abstracts that include 3442 human genes, 461 molecular pathways and about 190,000 risk measures ranging from 0.00001 to 4878.9. Conclusion PADB is a unique online database of published associations that will serve as a novel and powerful resource for reviewing and interpreting huge association data of complex human diseases. PMID:17877839

  1. ResPlan Database

    NASA Technical Reports Server (NTRS)

    Zellers, Michael L.

    2003-01-01

    The main project I was involved in was new application development for the existing CIS0 Database (ResPlan). This database application was developed in Microsoft Access. Initial meetings with Greg Follen, Linda McMillen, Griselle LaFontaine and others identified a few key weaknesses with the existing database. The weaknesses centered around that while the database correctly modeled the structure of Programs, Projects and Tasks, once the data was entered, the database did not capture any dynamic status information, and as such was of limited usefulness. After the initial meetings my goals were identified as follows: Enhance the ResPlan Database to include qualitative and quantitative status information about the Programs, Projects and Tasks Train staff members about the ResPlan database from both the user perspective and the developer perspective Give consideration to a Web Interface for reporting. Initially, the thought was that there would not be adequate time to actually develop the Web Interface, Greg wanted it understood that this was an eventual goal and as such should be a consideration throughout the development process.

  2. Hazard Analysis Database Report

    SciTech Connect

    GAULT, G.W.

    1999-10-13

    The Hazard Analysis Database was developed in conjunction with the hazard analysis activities conducted in accordance with DOE-STD-3009-94, Preparation Guide for US Department of Energy Nonreactor Nuclear Facility Safety Analysis Reports, for the Tank Waste Remediation System (TWRS) Final Safety Analysis Report (FSAR). The FSAR is part of the approved TWRS Authorization Basis (AB). This document describes, identifies, and defines the contents and structure of the TWRS FSAR Hazard Analysis Database and documents the configuration control changes made to the database. The TWRS Hazard Analysis Database contains the collection of information generated during the initial hazard evaluations and the subsequent hazard and accident analysis activities. The database supports the preparation of Chapters 3,4, and 5 of the TWRS FSAR and the USQ process and consists of two major, interrelated data sets: (1) Hazard Evaluation Database--Data from the results of the hazard evaluations; and (2) Hazard Topography Database--Data from the system familiarization and hazard identification.

  3. Database for propagation models

    NASA Technical Reports Server (NTRS)

    Kantak, Anil V.

    1991-01-01

    A propagation researcher or a systems engineer who intends to use the results of a propagation experiment is generally faced with various database tasks such as the selection of the computer software, the hardware, and the writing of the programs to pass the data through the models of interest. This task is repeated every time a new experiment is conducted or the same experiment is carried out at a different location generating different data. Thus the users of this data have to spend a considerable portion of their time learning how to implement the computer hardware and the software towards the desired end. This situation may be facilitated considerably if an easily accessible propagation database is created that has all the accepted (standardized) propagation phenomena models approved by the propagation research community. Also, the handling of data will become easier for the user. Such a database construction can only stimulate the growth of the propagation research it if is available to all the researchers, so that the results of the experiment conducted by one researcher can be examined independently by another, without different hardware and software being used. The database may be made flexible so that the researchers need not be confined only to the contents of the database. Another way in which the database may help the researchers is by the fact that they will not have to document the software and hardware tools used in their research since the propagation research community will know the database already. The following sections show a possible database construction, as well as properties of the database for the propagation research.

  4. The Gaia Parameter Database

    NASA Astrophysics Data System (ADS)

    de Bruijne, J. H. J.; Lammers, U.; Perryman, M. A. C.

    2005-01-01

    The parallel development of many aspects of a complex mission like Gaia, which includes numerous participants in ESA, industrial companies, and a large and active scientific collaboration throughout Europe, makes keeping track of the many design changes, instrument and operational complexities, and numerical values for the data analysis a very challenging problem. A comprehensive, easily-accessible, up-to-date, and definitive compilation of a large range of numerical quantities is required, and the Gaia parameter database has been established to satisfy these needs. The database is a centralised repository containing, besides mathematical, physical, and astronomical constants, many satellite and subsystem design parameters. At the end of 2004, more than 1600 parameters had been included. Version control has been implemented, providing, next to a `live' version with the most recent parameters, well-defined reference versions of the full database contents. The database can be queried or browsed using a regular Web browser (http://www.rssd.esa.int/Gaia/paramdb). Query results are formated by default in HTML. Data can also be retrieved as Fortran-77, Fortran-90, Java, ANSIC, C++, or XML structures for direct inclusion into software codes in these languages. The idea is that all collaborating scientists can use the database parameters and values, once retrieved, directly linked to computational routines. An off-line access mode is also available, enabling users to automatically download the contents of the database. The database will be maintained actively, and significant extensions of the contents are planned. Consistent use in the future of the database by the Gaia community at large, including all industrial teams, will ensure correct numerical values throughout the complex software systems being built up as details of the Gaia design develop. The database is already being used for the telemetry simulation chain in ESTEC, and in the data simulations for GDAAS2.

  5. Variant Humicola grisea CBH1.1

    SciTech Connect

    Goedegeburr, Frits; Gualfetti, Peter; Mitchinson, Colin; Larenas, Edmund

    2013-02-19

    Disclosed are variants of Humicola grisea Cel7A (CBH1.1), H. jecorina CBH1 variant or S. thermophilium CBH1, nucleic acids encoding the same and methods for producing the same. The variant cellulases have the amino acid sequence of a glycosyl hydrolase of family 7A wherein one or more amino acid residues are substituted.

  6. Variant Humicola grisea CBH1.1

    SciTech Connect

    Goedegebuur, Frits; Gualfetti, Peter; Mitchinson, Colin; Larenas, Edmund

    2014-03-18

    Disclosed are variants of Humicola grisea Cel7A (CBH1.1), H. jecorina CBH1 variant or S. thermophilium CBH1, nucleic acids encoding the same and methods for producing the same. The variant cellulases have the amino acid sequence of a glycosyl hydrolase of family 7A wherein one or more amino acid residues are substituted.

  7. Variant humicola grisea CBH1.1

    SciTech Connect

    Goedegebuur, Frits; Gualfetti, Peter; Mitchinson, Colin; Edmund, Larenas

    2014-09-09

    Disclosed are variants of Humicola grisea Cel7A (CBH1.1), H. jecorina CBH1 variant or S. thermophilium CBH1, nucleic acids encoding the same and methods for producing the same. The variant cellulases have the amino acid sequence of a glycosyl hydrolase of family 7A wherein one or more amino acid residues are substituted.

  8. Variant Humicola grisea CBH1.1

    DOEpatents

    Goedegebuur, Frits; Gualfetti, Peter; Mitchinson, Colin; Larenas, Edmund

    2011-08-16

    Disclosed are variants of Humicola grisea Cel7A (CBH1.1), H. jecorina CBH1 variant or S. thermophilium CBH1, nucleic acids encoding the same and methods for producing the same. The variant cellulases have the amino acid sequence of a glycosyl hydrolase of family 7A wherein one or more amino acid residues are substituted.

  9. Variant Humicola grisea CBH1.1

    DOEpatents

    Goedegebuur, Frits; Gualfetti, Peter; Mitchinson, Colin; Larenas, Edmund

    2011-05-31

    Disclosed are variants of Humicola grisea Cel7A (CBH1.1), H. jecorina CBH1 variant or S. thermophilium CBH1, nucleic acids encoding the same and methods for producing the same. The variant cellulases have the amino acid sequence of a glycosyl hydrolase of family 7A wherein one or more amino acid residues are substituted.

  10. Variant Humicola grisea CBH1.1

    DOEpatents

    Goedegebuur, Frits; Gualfetti, Peter; Mitchinson, Colin; Larenas, Edmund

    2008-12-02

    Disclosed are variants of Humicola grisea Cel7A (CBH1.1), H. jecorina CBH1 variant or S. thermophilium CBH1, nucleic acids encoding the same and methods for producing the same. The variant cellulases have the amino acid sequence of a glycosyl hydrolase of family 7A wherein one or more amino acid residues are substituted.

  11. Variant Humicola grisea CBH1.1

    DOEpatents

    Goedegebuur, Frits; Gualfetti, Peter; Mitchinson, Colin; Larenas, Edmund

    2012-08-07

    Disclosed are variants of Humicola grisea Cel7A (CBH1.1), H. jecorina CBH1 variant or S. thermophilium CBH1, nucleic acids encoding the same and methods for producing the same. The variant cellulases have the amino acid sequence of a glycosyl hydrolase of family 7A wherein one or more amino acid residues are substituted.

  12. Human variation databases

    PubMed Central

    Küntzer, Jan; Eggle, Daniela; Klostermann, Stefan; Burtscher, Helmut

    2010-01-01

    More than 100 000 human genetic variations have been described in various genes that are associated with a wide variety of diseases. Such data provides invaluable information for both clinical medicine and basic science. A number of locus-specific databases have been developed to exploit this huge amount of data. However, the scope, format and content of these databases differ strongly and as no standard for variation databases has yet been adopted, the way data is presented varies enormously. This review aims to give an overview of current resources for human variation data in public and commercial resources. PMID:20639550

  13. International Comparisions Database

    National Institute of Standards and Technology Data Gateway

    International Comparisions Database (Web, free access)   The International Comparisons Database (ICDB) serves the U.S. and the Inter-American System of Metrology (SIM) with information based on Appendices B (International Comparisons), C (Calibration and Measurement Capabilities) and D (List of Participating Countries) of the Comit� International des Poids et Mesures (CIPM) Mutual Recognition Arrangement (MRA). The official source of the data is The BIPM key comparison database. The ICDB provides access to results of comparisons of measurements and standards organized by the consultative committees of the CIPM and the Regional Metrology Organizations.

  14. Hybrid Terrain Database

    NASA Technical Reports Server (NTRS)

    Arthur, Trey

    2006-01-01

    A prototype hybrid terrain database is being developed in conjunction with other databases and with hardware and software that constitute subsystems of aerospace cockpit display systems (known in the art as synthetic vision systems) that generate images to increase pilots' situation awareness and eliminate poor visibility as a cause of aviation accidents. The basic idea is to provide a clear view of the world around an aircraft by displaying computer-generated imagery derived from an onboard database of terrain, obstacle, and airport information.

  15. Phase Equilibria Diagrams Database

    National Institute of Standards and Technology Data Gateway

    SRD 31 NIST/ACerS Phase Equilibria Diagrams Database (PC database for purchase)   The Phase Equilibria Diagrams Database contains commentaries and more than 21,000 diagrams for non-organic systems, including those published in all 21 hard-copy volumes produced as part of the ACerS-NIST Phase Equilibria Diagrams Program (formerly titled Phase Diagrams for Ceramists): Volumes I through XIV (blue books); Annuals 91, 92, 93; High Tc Superconductors I & II; Zirconium & Zirconia Systems; and Electronic Ceramics I. Materials covered include oxides as well as non-oxide systems such as chalcogenides and pnictides, phosphates, salt systems, and mixed systems of these classes.

  16. JICST Factual Database(2)

    NASA Astrophysics Data System (ADS)

    Araki, Keisuke

    The computer programme, which builds atom-bond connection tables from nomenclatures, is developed. Chemical substances with their nomenclature and varieties of trivial names or experimental code numbers are inputted. The chemical structures of the database are stereospecifically stored and are able to be searched and displayed according to stereochemistry. Source data are from laws and regulations of Japan, RTECS of US and so on. The database plays a central role within the integrated fact database service of JICST and makes interrelational retrieval possible.

  17. Databases for materials selection

    SciTech Connect

    1996-06-01

    The Cambridge Materials Selector (CMS2.0) materials database was developed by the Engineering Dept. at Cambridge University in the United Kingdom. This database makes it possible to select a material for a specific application from essentially all classes of materials. Genera, Predict, and Socrates software programs from CLI International, Houston, Texas, automate materials selection and corrosion problem-solving tasks. They are said to significantly reduce the time necessary to select a suitable material and/or to assess a corrosion problem and reach cost-effective solutions. This article describes both databases and tells how to use them.

  18. Pigmented Porokeratosis. A Further Variant?

    PubMed

    Tan, Tracy S P; Tallon, Ben

    2016-03-01

    Porokeratosis is a clonal disorder of keratinization characterized by the presence of the cornoid lamella. A number of variants of porokeratosis have been described, based on the clinical features and histologic features of the lesions. The authors present a case of porokeratosis with prominent melanocytic hyperplasia, which was biopsied to clinically exclude melanoma. The authors retrospectively studied cases of porokeratosis to look for the presence of melanocytic hyperplasia. Melanocytic hyperplasia was identified in 8 of 31 cases (25.8%). All of the cases except the index case were clinically nonpigmented but arose in solar damaged skin. This case represents a distinct variant of porokeratosis, and the authors propose the designation pigmented porokeratosis. Melanocytic hyperplasia is a benign condition, and it is important that this is not histologically confused with melanoma in situ, particularly in a context of clinically pigmented lesion. Increased recognition of pigmented porokeratosis is essential to avoid an erroneous diagnosis of melanoma in situ. PMID:26894774

  19. A rigorous approach for selection of optimal variant sets for carrier screening with demonstration of clinical utility.

    PubMed

    Perreault-Micale, Cynthia; Davie, Jocelyn; Breton, Benjamin; Hallam, Stephanie; Greger, Valerie

    2015-07-01

    Carrier screening for certain diseases is recommended by major medical and Ashkenazi Jewish (AJ) societies. Most carrier screening panels test only for common, ethnic-specific variants. However, with formerly isolated ethnic groups becoming increasingly intermixed, this approach is becoming inadequate. Our objective was to develop a rigorous process to curate all variants, for relevant genes, into a database and then apply stringent clinical validity classification criteria to each in order to retain only those with clear evidence for pathogenicity. The resulting variant set, in conjunction with next-generation DNA sequencing (NGS), then affords the capability for an ethnically diverse, comprehensive, highly specific carrier-screening assay. The clinical utility of our approach was demonstrated by screening a pan-ethnic population of 22,864 individuals for Bloom syndrome carrier status using a BLM variant panel comprised of 50 pathogenic variants. In addition to carriers of the common AJ founder variant, we identified 57 carriers of other pathogenic BLM variants. All variants reported had previously been curated and their clinical validity documented, or were of a type that met our stringent, preassigned validity criteria. Thus, it was possible to confidently report an increased number of Bloom's syndrome carriers compared to traditional, ethnicity-based screening, while not reducing the specificity of the screening due to reporting variants of unknown clinical significance. PMID:26247052

  20. A rigorous approach for selection of optimal variant sets for carrier screening with demonstration of clinical utility

    PubMed Central

    Perreault-Micale, Cynthia; Davie, Jocelyn; Breton, Benjamin; Hallam, Stephanie; Greger, Valerie

    2015-01-01

    Carrier screening for certain diseases is recommended by major medical and Ashkenazi Jewish (AJ) societies. Most carrier screening panels test only for common, ethnic-specific variants. However, with formerly isolated ethnic groups becoming increasingly intermixed, this approach is becoming inadequate. Our objective was to develop a rigorous process to curate all variants, for relevant genes, into a database and then apply stringent clinical validity classification criteria to each in order to retain only those with clear evidence for pathogenicity. The resulting variant set, in conjunction with next-generation DNA sequencing (NGS), then affords the capability for an ethnically diverse, comprehensive, highly specific carrier-screening assay. The clinical utility of our approach was demonstrated by screening a pan-ethnic population of 22,864 individuals for Bloom syndrome carrier status using a BLM variant panel comprised of 50 pathogenic variants. In addition to carriers of the common AJ founder variant, we identified 57 carriers of other pathogenic BLM variants. All variants reported had previously been curated and their clinical validity documented, or were of a type that met our stringent, preassigned validity criteria. Thus, it was possible to confidently report an increased number of Bloom’s syndrome carriers compared to traditional, ethnicity-based screening, while not reducing the specificity of the screening due to reporting variants of unknown clinical significance. PMID:26247052

  1. NCCDPHP PUBLICATION DATABASE

    EPA Science Inventory

    This database provides bibliographic citations and abstracts of publications produced by the CDC's National Center for Chronic Disease Prevention and Health Promotion (NCCDPHP) including journal articles, monographs, book chapters, reports, policy documents, and fact sheets. Full...

  2. ARTI Refrigerant Database

    SciTech Connect

    Calm, J.M.

    1994-05-27

    The Refrigerant Database consolidates and facilitates access to information to assist industry in developing equipment using alternative refrigerants. The underlying purpose is to accelerate phase out of chemical compounds of environmental concern.

  3. THE CTEPP DATABASE

    EPA Science Inventory

    The CTEPP (Children's Total Exposure to Persistent Pesticides and Other Persistent Organic Pollutants) database contains a wealth of data on children's aggregate exposures to pollutants in their everyday surroundings. Chemical analysis data for the environmental media and ques...

  4. Hawaii bibliographic database

    USGS Publications Warehouse

    Wright, T.L.; Takahashi, T.J.

    1998-01-01

    The Hawaii bibliographic database has been created to contain all of the literature, from 1779 to the present, pertinent to the volcanological history of the Hawaiian-Emperor volcanic chain. References are entered in a PC- and Macintosh-compatible EndNote Plus bibliographic database with keywords and abstracts or (if no abstract) with annotations as to content. Keywords emphasize location, discipline, process, identification of new chemical data or age determinations, and type of publication. The database is updated approximately three times a year and is available to upload from an ftp site. The bibliography contained 8460 references at the time this paper was submitted for publication. Use of the database greatly enhances the power and completeness of library searches for anyone interested in Hawaiian volcanism.

  5. Chemical Kinetics Database

    National Institute of Standards and Technology Data Gateway

    SRD 17 NIST Chemical Kinetics Database (Web, free access)   The NIST Chemical Kinetics Database includes essentially all reported kinetics results for thermal gas-phase chemical reactions. The database is designed to be searched for kinetics data based on the specific reactants involved, for reactions resulting in specified products, for all the reactions of a particular species, or for various combinations of these. In addition, the bibliography can be searched by author name or combination of names. The database contains in excess of 38,000 separate reaction records for over 11,700 distinct reactant pairs. These data have been abstracted from over 12,000 papers with literature coverage through early 2000.

  6. Requirements Management Database

    Energy Science and Technology Software Center (ESTSC)

    2009-08-13

    This application is a simplified and customized version of the RBA and CTS databases to capture federal, site, and facility requirements, link to actions that must be performed to maintain compliance with their contractual and other requirements.

  7. Navigating Public Microarray Databases

    PubMed Central

    Bähler, Jürg

    2004-01-01

    With the ever-escalating amount of data being produced by genome-wide microarray studies, it is of increasing importance that these data are captured in public databases so that researchers can use this information to complement and enhance their own studies. Many groups have set up databases of expression data, ranging from large repositories, which are designed to comprehensively capture all published data, through to more specialized databases. The public repositories, such as ArrayExpress at the European Bioinformatics Institute contain complete datasets in raw format in addition to processed data, whilst the specialist databases tend to provide downstream analysis of normalized data from more focused studies and data sources. Here we provide a guide to the use of these public microarray resources. PMID:18629145

  8. Nuclear Science References Database

    SciTech Connect

    Pritychenko, B.; Běták, E.; Singh, B.; Totans, J.

    2014-06-15

    The Nuclear Science References (NSR) database together with its associated Web interface, is the world's only comprehensive source of easily accessible low- and intermediate-energy nuclear physics bibliographic information for more than 210,000 articles since the beginning of nuclear science. The weekly-updated NSR database provides essential support for nuclear data evaluation, compilation and research activities. The principles of the database and Web application development and maintenance are described. Examples of nuclear structure, reaction and decay applications are specifically included. The complete NSR database is freely available at the websites of the National Nuclear Data Center (http://www.nndc.bnl.gov/nsr) and the International Atomic Energy Agency (http://www-nds.iaea.org/nsr)

  9. Navigating public microarray databases.

    PubMed

    Penkett, Christopher J; Bähler, Jürg

    2004-01-01

    With the ever-escalating amount of data being produced by genome-wide microarray studies, it is of increasing importance that these data are captured in public databases so that researchers can use this information to complement and enhance their own studies. Many groups have set up databases of expression data, ranging from large repositories, which are designed to comprehensively capture all published data, through to more specialized databases. The public repositories, such as ArrayExpress at the European Bioinformatics Institute contain complete datasets in raw format in addition to processed data, whilst the specialist databases tend to provide downstream analysis of normalized data from more focused studies and data sources. Here we provide a guide to the use of these public microarray resources. PMID:18629145

  10. Household Products Database

    MedlinePlus

    ... Commercial / Institutional Product Names Types of Products Manufacturers Ingredients About the Database FAQ Product Recalls Help Glossary Contact Us More Resources What's under your kitchen sink, in your garage, in your bathroom, and ...

  11. TREATABILITY DATABASE DESCRIPTION

    EPA Science Inventory

    The Drinking Water Treatability Database (TDB) presents referenced information on the control of contaminants in drinking water. It allows drinking water utilities, first responders to spills or emergencies, treatment process designers, research organizations, academics, regulato...

  12. ARTI Refrigerant Database

    SciTech Connect

    Calm, J.M.

    1995-06-01

    The Refrigerant Database consolidates and facilitates access to information to assist industry in developing equipment using alternative refrigerants. The underlying purpose is to accelerate phase out of chemical compounds of environmental concern.

  13. ARTI Refrigerant Database

    SciTech Connect

    Calm, J.M.

    1995-02-01

    The Refrigerant Database consolidates and facilitates access to information to assist industry in developing equipment using alternative refrigerants. The underlying purpose is to accelerate phase-out of chemical compounds of environmental concern.

  14. Querying genomic databases

    SciTech Connect

    Baehr, A.; Hagstrom, R.; Joerg, D.; Overbeek, R.

    1991-09-01

    A natural-language interface has been developed that retrieves genomic information by using a simple subset of English. The interface spares the biologist from the task of learning database-specific query languages and computer programming. Currently, the interface deals with the E. coli genome. It can, however, be readily extended and shows promise as a means of easy access to other sequenced genomic databases as well.

  15. Steam Properties Database

    National Institute of Standards and Technology Data Gateway

    SRD 10 NIST/ASME Steam Properties Database (PC database for purchase)   Based upon the International Association for the Properties of Water and Steam (IAPWS) 1995 formulation for the thermodynamic properties of water and the most recent IAPWS formulations for transport and other properties, this updated version provides water properties over a wide range of conditions according to the accepted international standards.

  16. The ribosomal database project.

    PubMed Central

    Larsen, N; Olsen, G J; Maidak, B L; McCaughey, M J; Overbeek, R; Macke, T J; Marsh, T L; Woese, C R

    1993-01-01

    The Ribosomal Database Project (RDP) is a curated database that offers ribosome data along with related programs and services. The offerings include phylogenetically ordered alignments of ribosomal RNA (rRNA) sequences, derived phylogenetic trees, rRNA secondary structure diagrams and various software packages for handling, analyzing and displaying alignments and trees. The data are available via ftp and electronic mail. Certain analytic services are also provided by the electronic mail server. PMID:8332524

  17. Databases: Peter's Picks and Pans.

    ERIC Educational Resources Information Center

    Jacso, Peter

    1995-01-01

    Reviews the best and worst in databases on disk, CD-ROM, and online, and offers judgments and observations on database characteristics. Two databases are praised and three are criticized. (Author/JMV)

  18. Specialist Bibliographic Databases.

    PubMed

    Gasparyan, Armen Yuri; Yessirkepov, Marlen; Voronov, Alexander A; Trukhachev, Vladimir I; Kostyukova, Elena I; Gerasimov, Alexey N; Kitas, George D

    2016-05-01

    Specialist bibliographic databases offer essential online tools for researchers and authors who work on specific subjects and perform comprehensive and systematic syntheses of evidence. This article presents examples of the established specialist databases, which may be of interest to those engaged in multidisciplinary science communication. Access to most specialist databases is through subscription schemes and membership in professional associations. Several aggregators of information and database vendors, such as EBSCOhost and ProQuest, facilitate advanced searches supported by specialist keyword thesauri. Searches of items through specialist databases are complementary to those through multidisciplinary research platforms, such as PubMed, Web of Science, and Google Scholar. Familiarizing with the functional characteristics of biomedical and nonbiomedical bibliographic search tools is mandatory for researchers, authors, editors, and publishers. The database users are offered updates of the indexed journal lists, abstracts, author profiles, and links to other metadata. Editors and publishers may find particularly useful source selection criteria and apply for coverage of their peer-reviewed journals and grey literature sources. These criteria are aimed at accepting relevant sources with established editorial policies and quality controls. PMID:27134485

  19. Crude Oil Analysis Database

    DOE Data Explorer

    Shay, Johanna Y.

    The composition and physical properties of crude oil vary widely from one reservoir to another within an oil field, as well as from one field or region to another. Although all oils consist of hydrocarbons and their derivatives, the proportions of various types of compounds differ greatly. This makes some oils more suitable than others for specific refining processes and uses. To take advantage of this diversity, one needs access to information in a large database of crude oil analyses. The Crude Oil Analysis Database (COADB) currently satisfies this need by offering 9,056 crude oil analyses. Of these, 8,500 are United States domestic oils. The database contains results of analysis of the general properties and chemical composition, as well as the field, formation, and geographic location of the crude oil sample. [Taken from the Introduction to COAMDATA_DESC.pdf, part of the zipped software and database file at http://www.netl.doe.gov/technologies/oil-gas/Software/database.html] Save the zipped file to your PC. When opened, it will contain PDF documents and a large Excel spreadsheet. It will also contain the database in Microsoft Access 2002.

  20. The comprehensive peptaibiotics database.

    PubMed

    Stoppacher, Norbert; Neumann, Nora K N; Burgstaller, Lukas; Zeilinger, Susanne; Degenkolb, Thomas; Brückner, Hans; Schuhmacher, Rainer

    2013-05-01

    Peptaibiotics are nonribosomally biosynthesized peptides, which - according to definition - contain the marker amino acid α-aminoisobutyric acid (Aib) and possess antibiotic properties. Being known since 1958, a constantly increasing number of peptaibiotics have been described and investigated with a particular emphasis on hypocrealean fungi. Starting from the existing online 'Peptaibol Database', first published in 1997, an exhaustive literature survey of all known peptaibiotics was carried out and resulted in a list of 1043 peptaibiotics. The gathered information was compiled and used to create the new 'The Comprehensive Peptaibiotics Database', which is presented here. The database was devised as a software tool based on Microsoft (MS) Access. It is freely available from the internet at http://peptaibiotics-database.boku.ac.at and can easily be installed and operated on any computer offering a Windows XP/7 environment. It provides useful information on characteristic properties of the peptaibiotics included such as peptide category, group name of the microheterogeneous mixture to which the peptide belongs, amino acid sequence, sequence length, producing fungus, peptide subfamily, molecular formula, and monoisotopic mass. All these characteristics can be used and combined for automated search within the database, which makes The Comprehensive Peptaibiotics Database a versatile tool for the retrieval of valuable information about peptaibiotics. Sequence data have been considered as to December 14, 2012. PMID:23681723

  1. Drinking Water Database

    NASA Technical Reports Server (NTRS)

    Murray, ShaTerea R.

    2004-01-01

    This summer I had the opportunity to work in the Environmental Management Office (EMO) under the Chemical Sampling and Analysis Team or CS&AT. This team s mission is to support Glenn Research Center (GRC) and EM0 by providing chemical sampling and analysis services and expert consulting. Services include sampling and chemical analysis of water, soil, fbels, oils, paint, insulation materials, etc. One of this team s major projects is the Drinking Water Project. This is a project that is done on Glenn s water coolers and ten percent of its sink every two years. For the past two summers an intern had been putting together a database for this team to record the test they had perform. She had successfully created a database but hadn't worked out all the quirks. So this summer William Wilder (an intern from Cleveland State University) and I worked together to perfect her database. We began be finding out exactly what every member of the team thought about the database and what they would change if any. After collecting this data we both had to take some courses in Microsoft Access in order to fix the problems. Next we began looking at what exactly how the database worked from the outside inward. Then we began trying to change the database but we quickly found out that this would be virtually impossible.

  2. The Transporter Classification Database

    PubMed Central

    Saier, Milton H.; Reddy, Vamsee S.; Tamang, Dorjee G.; Västermark, Åke

    2014-01-01

    The Transporter Classification Database (TCDB; http://www.tcdb.org) serves as a common reference point for transport protein research. The database contains more than 10 000 non-redundant proteins that represent all currently recognized families of transmembrane molecular transport systems. Proteins in TCDB are organized in a five level hierarchical system, where the first two levels are the class and subclass, the second two are the family and subfamily, and the last one is the transport system. Superfamilies that contain multiple families are included as hyperlinks to the five tier TC hierarchy. TCDB includes proteins from all types of living organisms and is the only transporter classification system that is both universal and recognized by the International Union of Biochemistry and Molecular Biology. It has been expanded by manual curation, contains extensive text descriptions providing structural, functional, mechanistic and evolutionary information, is supported by unique software and is interconnected to many other relevant databases. TCDB is of increasing usefulness to the international scientific community and can serve as a model for the expansion of database technologies. This manuscript describes an update of the database descriptions previously featured in NAR database issues. PMID:24225317

  3. Specialist Bibliographic Databases

    PubMed Central

    2016-01-01

    Specialist bibliographic databases offer essential online tools for researchers and authors who work on specific subjects and perform comprehensive and systematic syntheses of evidence. This article presents examples of the established specialist databases, which may be of interest to those engaged in multidisciplinary science communication. Access to most specialist databases is through subscription schemes and membership in professional associations. Several aggregators of information and database vendors, such as EBSCOhost and ProQuest, facilitate advanced searches supported by specialist keyword thesauri. Searches of items through specialist databases are complementary to those through multidisciplinary research platforms, such as PubMed, Web of Science, and Google Scholar. Familiarizing with the functional characteristics of biomedical and nonbiomedical bibliographic search tools is mandatory for researchers, authors, editors, and publishers. The database users are offered updates of the indexed journal lists, abstracts, author profiles, and links to other metadata. Editors and publishers may find particularly useful source selection criteria and apply for coverage of their peer-reviewed journals and grey literature sources. These criteria are aimed at accepting relevant sources with established editorial policies and quality controls. PMID:27134485

  4. Mucolipidosis types II and III and non-syndromic stuttering are associated with different variants in the same genes.

    PubMed

    Raza, M Hashim; Domingues, Carlos E F; Webster, Ronald; Sainz, Eduardo; Paris, Emily; Rahn, Rachel; Gutierrez, Joanne; Chow, Ho Ming; Mundorff, Jennifer; Kang, Chang-Soo; Riaz, Naveeda; Basra, Muhammad A R; Khan, Shaheen; Riazuddin, Sheikh; Moretti-Ferreira, Danilo; Braun, Allen; Drayna, Dennis

    2016-04-01

    Homozygous mutations in GNPTAB and GNPTG are classically associated with mucolipidosis II (ML II) alpha/beta and mucolipidosis III (ML III) alpha/beta/gamma, which are rare lysosomal storage disorders characterized by multiple pathologies. Recently, variants in GNPTAB, GNPTG, and the functionally related NAGPA gene have been associated with non-syndromic persistent stuttering. In a worldwide sample of 1013 unrelated individuals with non-syndromic persistent stuttering we found 164 individuals who carried a rare non-synonymous coding variant in one of these three genes. We compared the frequency of these variants with those in population-matched controls and genomic databases, and their location with those reported in mucolipidosis. Stuttering subjects displayed an excess of non-synonymous coding variants compared to controls and individuals in the 1000 Genomes and Exome Sequencing Project databases. We identified a total of 81 different variants in our stuttering cases. Virtually all of these were missense substitutions, only one of which has been previously reported in mucolipidosis, a disease frequently associated with complete loss-of-function mutations. We hypothesize that rare non-synonymous coding variants in GNPTAB, GNPTG, and NAGPA may account for as much as 16% of persistent stuttering cases, and that variants in GNPTAB and GNPTG are at different sites and may in general, cause less severe effects on protein function than those in ML II alpha/beta and ML III alpha/beta/gamma. PMID:26130485

  5. The Problem-Oriented Medical Synopsis: a patient-centered clinical information system.

    PubMed Central

    Stitt, F. W.

    1993-01-01

    A clinical information system consists of four major components: the clinical database, decision support, data analysis (including outcomes), and the development system. We have created such a system using generally available database methodology. The clinical database, for record-keeping, is called the Problem-Oriented Medical Synopsis, and is quite an old system, originating in 1966. We describe the suitability of a problem-oriented model of clinical records management to the relational model of database design, and describe our experience with the database as a departmental information system for patient care and outcomes research. Hybrid, or partially problem-oriented, databases represent an acceptable approach to clinical record-keeping. PMID:8130600

  6. Accumulation of rare variants in the arylsulfatase G (ARSG) gene in task-specific dystonia.

    PubMed

    Nibbeling, Esther; Schaake, Susen; Tijssen, Marina A; Weissbach, Anne; Groen, Justus L; Altenmüller, Eckart; Verbeek, Dineke S; Lohmann, Katja

    2015-05-01

    Musician's dystonia and writer's cramp are examples of task-specific dystonia. Recently, the arylsulfatase G (ARSG) locus was suggested to be associated with musician's dystonia and writer's cramp by a genome-wide association study. To test for the presence of causal variants, the entire coding region and exon-intron boundaries of ARSG were sequenced in DNA samples from 158 musician's dystonia patients which were collected at the University of Music, Drama, and Media (Hanover, Germany), and 72 patients with writer's cramp which were recruited at the Academic Medical Centers in Amsterdam and Groningen, the Netherlands. The frequency of variants within ARSG was compared to publically available data at the exome variant server (EVS) from the NHLBI GO Exome Sequencing Project. We identified 11 single nucleotide variants (SNVs) in the patients including eight non-synonymous substitutions. All variants have previously been reported at EVS including two SNVs with a reported minor allele frequency <1%. One rare missense variant, rs61999318 (p.I493T), was significantly enriched in the group of writer's cramp patients compared to European Americans in EVS database (p = 0.0013). In patients with writer's cramp, there was an overall enrichment for rare, protein-changing variants compared to controls (p < 0.01). In conclusion, we did not detect any conclusive mutation in ARSG. However, we showed an association with rs61999318 in patients with writer's cramp that contributed to an overall enrichment for rare, protein-changing variants in these patients. Thus, our data provide further support for a role of ARSG variants in task-specific dystonia, especially writer's cramp. PMID:25825126

  7. The SSV Evaluation System: A Tool to Prioritize Short Structural Variants for Studies of Possible Regulatory and Causal Variants.

    PubMed

    Saul, Robert; Lutz, Michael W; Burns, Daniel K; Roses, Allen D; Chiba-Falek, Ornit

    2016-09-01

    Short structural variants (SSVs) are short genomic variants (<50 bp) other than SNPs. It has been suggested that SSVs contribute to many human complex traits. However, high-throughput analysis of SSVs presents numerous technical challenges. In order to facilitate the discovery and assessment of SSVs, we have developed a prototype bioinformatics tool, "SSV evaluation system," which is a searchable, annotated database of SSVs in the human genome, with associated customizable scoring software that is used to evaluate and prioritize SSVs that are most likely to have significant biological effects and impact on disease risk. This new bioinformatics tool is a component in a larger strategy that we have been using to discover potentially important SSVs within candidate genomic regions that have been identified in genome-wide association studies, with the goal to prioritize potential functional/causal SSVs and focus the follow-up experiments on a relatively small list of strong candidate SSVs. We describe our strategy and discuss how we have used the SSV evaluation system to discover candidate causal variants related to complex neurodegenerative diseases. We present the SSV evaluation system as a powerful tool to guide genetic investigations aiming to uncover SSVs that underlie human complex diseases including neurodegenerative diseases in aging. PMID:27279261

  8. Great Basin paleontological database

    USGS Publications Warehouse

    Zhang, N.; Blodgett, R.B.; Hofstra, A.H.

    2008-01-01

    The U.S. Geological Survey has constructed a paleontological database for the Great Basin physiographic province that can be served over the World Wide Web for data entry, queries, displays, and retrievals. It is similar to the web-database solution that we constructed for Alaskan paleontological data (www.alaskafossil.org). The first phase of this effort was to compile a paleontological bibliography for Nevada and portions of adjacent states in the Great Basin that has recently been completed. In addition, we are also compiling paleontological reports (Known as E&R reports) of the U.S. Geological Survey, which are another extensive source of l,egacy data for this region. Initial population of the database benefited from a recently published conodont data set and is otherwise focused on Devonian and Mississippian localities because strata of this age host important sedimentary exhalative (sedex) Au, Zn, and barite resources and enormons Carlin-type An deposits. In addition, these strata are the most important petroleum source rocks in the region, and record the transition from extension to contraction associated with the Antler orogeny, the Alamo meteorite impact, and biotic crises associated with global oceanic anoxic events. The finished product will provide an invaluable tool for future geologic mapping, paleontological research, and mineral resource investigations in the Great Basin, making paleontological data acquired over nearly the past 150 yr readily available over the World Wide Web. A description of the structure of the database and the web interface developed for this effort are provided herein. This database is being used ws a model for a National Paleontological Database (which we am currently developing for the U.S. Geological Survey) as well as for other paleontological databases now being developed in other parts of the globe. ?? 2008 Geological Society of America.

  9. Chemical Explosion Database

    NASA Astrophysics Data System (ADS)

    Johansson, Peder; Brachet, Nicolas

    2010-05-01

    A database containing information on chemical explosions, recorded and located by the International Data Center (IDC) of the CTBTO, should be established in the IDC prior to entry into force of the CTBT. Nearly all of the large chemical explosions occur in connection with mining activity. As a first step towards the establishment of this database, a survey of presumed mining areas where sufficiently large explosions are conducted has been done. This is dominated by the large coal mining areas like the Powder River (U.S.), Kuznetsk (Russia), Bowen (Australia) and Ekibastuz (Kazakhstan) basins. There are also several other smaller mining areas, in e.g. Scandinavia, Poland, Kazakhstan and Australia, with large enough explosions for detection. Events in the Reviewed Event Bulletin (REB) of the IDC that are located in or close to these mining areas, and which therefore are candidates for inclusion in the database, have been investigated. Comparison with a database of infrasound events has been done as many mining blasts generate strong infrasound signals and therefore also are included in the infrasound database. Currently there are 66 such REB events in 18 mining areas in the infrasound database. On a yearly basis several hundreds of events in mining areas have been recorded and included in the REB. Establishment of the database of chemical explosions requires confirmation and ground truth information from the States Parties regarding these events. For an explosion reported in the REB, the appropriate authority in whose country the explosion occurred is encouraged, on a voluntary basis, to seek out information on the explosion and communicate this information to the IDC.

  10. Crystal orientation mapping via ion channeling: An alternative to EBSD.

    PubMed

    Langlois, C; Douillard, T; Yuan, H; Blanchard, N P; Descamps-Mandine, A; Van de Moortèle, B; Rigotti, C; Epicier, T

    2015-10-01

    A new method, which we name ion CHanneling ORientation Determination (iCHORD), is proposed to obtain orientation maps on polycrystals via ion channeling. The iChord method exploits the dependence between grain orientation and ion beam induced secondary electron image contrast. At each position of the region of interest, intensity profiles are obtained from a series of images acquired with different orientations with respect to the ion beam. The profiles are then compared to a database of theoretical profiles of known orientation. The Euler triplet associated to the most similar theoretical profile gives the orientation at that position. The proof-of-concept is obtained on a titanium nitride sample. The potentialities of iCHORD as an alternative to EBSD are then discussed. PMID:26094201

  11. MIPS: a database for genomes and protein sequences

    PubMed Central

    Mewes, H. W.; Frishman, D.; Gruber, C.; Geier, B.; Haase, D.; Kaps, A.; Lemcke, K.; Mannhaupt, G.; Pfeiffer, F.; Schüller, C.; Stocker, S.; Weil, B.

    2000-01-01

    The Munich Information Center for Protein Sequences (MIPS-GSF), Martinsried, near Munich, Germany, continues its longstanding tradition to develop and maintain high quality curated genome databases. In addition, efforts have been intensified to cover the wealth of complete genome sequences in a systematic, comprehensive form. Bioinformatics, supporting national as well as European sequencing and functional analysis projects, has resulted in several up-to-date genome-oriented databases. This report describes growing databases reflecting the progress of sequencing the Arabidopsis thaliana (MATDB) and Neurospora crassa genomes (MNCDB), the yeast genome database (MYGD) extended by functional analysis data, the database of annotated human EST-clusters (HIB) and the database of the complete cDNA sequences from the DHGP (German Human Genome Project). It also contains information on the up-to-date database of complete genomes (PEDANT), the classification of protein sequences (ProtFam) and the collection of protein sequence data within the framework of the PIR-International Protein Sequence Database. These databases can be accessed through the MIPS WWW server (http://www. mips.biochem.mpg.de ). PMID:10592176

  12. Convex Image Orientation from Relative Orientations

    NASA Astrophysics Data System (ADS)

    Reich, M.; Heipke, C.

    2016-06-01

    In this paper we propose a novel workflow for the estimation of global image orientations given relative orientations between pairs of overlapping images. Our approach is convex and independent on initial values. First, global rotations are estimated in a relaxed semidefinite program (SDP) and refined in an iterative least squares adjustment in the tangent space of SO(3). A critical aspect is the handling of outliers in the relative orientations. We present a novel heuristic graph based approach for filtering the relative rotations that outperforms state-of-the-art robust rotation averaging algorithms. In a second part we make use of point-observations, tracked over a set of overlapping images and formulate a linear homogeneous system of equations to transfer the scale information between triplets of images, using estimated global rotations and relative translation directions. The final step consists of refining the orientation parameters in a robust bundle adjustment. The proposed approach handles outliers in the homologous points and relative orientations in every step of the processing chain. We demonstrate the robustness of the procedure on synthetic data. Moreover, the performance of our approach is illustrated on real world benchmark data.

  13. Free the data: one laboratory's approach to knowledge-based genomic variant classification and preparation for EMR integration of genomic data.

    PubMed

    Bean, Lora J H; Tinker, Stuart W; da Silva, Cristina; Hegde, Madhuri R

    2013-09-01

    Current technology allows clinical laboratories to rapidly translate research discoveries from small patient cohorts into clinical genetic tests; therefore, a potentially large proportion of sequence variants identified in individuals with clinical features of a genetic disorder remain unpublished. Without a mechanism for clinical laboratories to share data, interpretation of sequence variants may be inconsistent. We describe here the two components of Emory Genetics Laboratory's (EGL) in-house developed data management system. The first is a highly curated variant database with a data structure designed to facilitate sharing of information about variants identified at EGL with curated databases. This system also tracks changes in variant classifications, creating a record of previous cases in need of updated reports when a classification is changed. The second component, EmVClass, is a Web-based interface that allows any user to view the inventory of variants classified at EGL. These software tools provide a solution to two pressing issues faced by clinical genetics laboratories: how to manage a large variant inventory with evolving variant classifications that need to be communicated to healthcare providers and how to make that inventory of variants freely available to the community. PMID:23757202

  14. M2SG: mapping human disease-related genetic variants to protein sequences and genomic loci

    PubMed Central

    Ji, Renkai; Cong, Qian; Li, Wenlin; Grishin, Nick V.

    2013-01-01

    Summary: Online Mendelian Inheritance in Man (OMIM) is a manually curated compendium of human genetic variants and the corresponding phenotypes, mostly human diseases. Instead of directly documenting the native sequences for gene entries, OMIM links its entries to protein and DNA sequences in other databases. However, because of the existence of gene isoforms and errors in OMIM records, mapping a specific OMIM mutation to its corresponding protein sequence is not trivial. Combining computer programs and extensive manual curation of OMIM full-text descriptions and original literature, we mapped 98% of OMIM amino acid substitutions (AASs) and all SwissProt Variant (SwissVar) disease-related AASs to reference sequences and confidently mapped 99.96% of all AASs to the genomic loci. Based on the results, we developed an online database and interactive web server (M2SG) to (i) retrieve the mapped OMIM and SwissVar variants for a given protein sequence; and (ii) obtain related proteins and mutations for an input disease phenotype. This database will be useful for analyzing sequences, understanding the effect of mutations, identifying important genetic variations and designing experiments on a protein of interest. Availability and implementation: The database and web server are freely available at http://prodata.swmed.edu/M2S/mut2seq.cgi. Contact: grishin@chop.swmed.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:24002112

  15. Improved locus-specific database for OPA1 mutations allows inclusion of advanced clinical data.

    PubMed

    Ferré, Marc; Caignard, Angélique; Milea, Dan; Leruez, Stéphanie; Cassereau, Julien; Chevrollier, Arnaud; Amati-Bonneau, Patrizia; Verny, Christophe; Bonneau, Dominique; Procaccio, Vincent; Reynier, Pascal

    2015-01-01

    Autosomal-dominant optic atrophy (ADOA) is the most common inherited optic neuropathy, due to mutations in the optic atrophy 1 gene (OPA1) in about 60%-80% of cases. At present, the clinical heterogeneity of patients carrying OPA1 variants renders genotype-phenotype correlations difficulty. Since 2005, when we published the first locus-specific database (LSDB) dedicated to OPA1, a large amount of new clinical and genetic knowledge has emerged, prompting us to update this database. We have used the Leiden Open-Source Variation Database to develop a clinico-biological database, aiming to add clinical phenotypes related to OPA1 variants. As a first step, we validated this new database by registering several patients previously reported in the literature, as well as new patients from our own institution. Contributors may now make online submissions of clinical and molecular descriptions of phenotypes due to OPA1 variants, including detailed ophthalmological and neurological data, with due respect to patient anonymity. The updated OPA1 LSDB (http://opa1.mitodyn.org/) should prove useful for molecular diagnoses, large-scale variant statistics, and genotype-phenotype correlations in ADOA studies. PMID:25243597

  16. ADANS database specification

    SciTech Connect

    1997-01-16

    The purpose of the Air Mobility Command (AMC) Deployment Analysis System (ADANS) Database Specification (DS) is to describe the database organization and storage allocation and to provide the detailed data model of the physical design and information necessary for the construction of the parts of the database (e.g., tables, indexes, rules, defaults). The DS includes entity relationship diagrams, table and field definitions, reports on other database objects, and a description of the ADANS data dictionary. ADANS is the automated system used by Headquarters AMC and the Tanker Airlift Control Center (TACC) for airlift planning and scheduling of peacetime and contingency operations as well as for deliberate planning. ADANS also supports planning and scheduling of Air Refueling Events by the TACC and the unit-level tanker schedulers. ADANS receives input in the form of movement requirements and air refueling requests. It provides a suite of tools for planners to manipulate these requirements/requests against mobility assets and to develop, analyze, and distribute schedules. Analysis tools are provided for assessing the products of the scheduling subsystems, and editing capabilities support the refinement of schedules. A reporting capability provides formatted screen, print, and/or file outputs of various standard reports. An interface subsystem handles message traffic to and from external systems. The database is an integral part of the functionality summarized above.

  17. The Chandra Bibliography Database

    NASA Astrophysics Data System (ADS)

    Rots, A. H.; Winkelman, S. L.; Paltani, S.; Blecksmith, S. E.; Bright, J. D.

    2004-07-01

    Early in the mission, the Chandra Data Archive started the development of a bibliography database, tracking publications in refereed journals and on-line conference proceedings that are based on Chandra observations, allowing our users to link directly to articles in the ADS from our archive, and to link to the relevant data in the archive from the ADS entries. Subsequently, we have been working closely with the ADS and other data centers, in the context of the ADEC-ITWG, on standardizing the literature-data linking. We have also extended our bibliography database to include all Chandra-related articles and we are also keeping track of the number of citations of each paper. Obviously, in addition to providing valuable services to our users, this database allows us to extract a wide variety of statistical information. The project comprises five components: the bibliography database-proper, a maintenance database, an interactive maintenance tool, a user browsing interface, and a web services component for exchanging information with the ADS. All of these elements are nearly mission-independent and we intend make the package as a whole available for use by other data centers. The capabilities thus provided represent support for an essential component of the Virtual Observatory.

  18. Shuttle Hypervelocity Impact Database

    NASA Technical Reports Server (NTRS)

    Hyde, James L.; Christiansen, Eric L.; Lear, Dana M.

    2011-01-01

    With three missions outstanding, the Shuttle Hypervelocity Impact Database has nearly 3000 entries. The data is divided into tables for crew module windows, payload bay door radiators and thermal protection system regions, with window impacts compromising just over half the records. In general, the database provides dimensions of hypervelocity impact damage, a component level location (i.e., window number or radiator panel number) and the orbiter mission when the impact occurred. Additional detail on the type of particle that produced the damage site is provided when sampling data and definitive analysis results are available. Details and insights on the contents of the database including examples of descriptive statistics will be provided. Post flight impact damage inspection and sampling techniques that were employed during the different observation campaigns will also be discussed. Potential enhancements to the database structure and availability of the data for other researchers will be addressed in the Future Work section. A related database of returned surfaces from the International Space Station will also be introduced.

  19. Shuttle Hypervelocity Impact Database

    NASA Technical Reports Server (NTRS)

    Hyde, James I.; Christiansen, Eric I.; Lear, Dana M.

    2011-01-01

    With three flights remaining on the manifest, the shuttle impact hypervelocity database has over 2800 entries. The data is currently divided into tables for crew module windows, payload bay door radiators and thermal protection system regions, with window impacts compromising just over half the records. In general, the database provides dimensions of hypervelocity impact damage, a component level location (i.e., window number or radiator panel number) and the orbiter mission when the impact occurred. Additional detail on the type of particle that produced the damage site is provided when sampling data and definitive analysis results are available. The paper will provide details and insights on the contents of the database including examples of descriptive statistics using the impact data. A discussion of post flight impact damage inspection and sampling techniques that were employed during the different observation campaigns will be presented. Future work to be discussed will be possible enhancements to the database structure and availability of the data for other researchers. A related database of ISS returned surfaces that are under development will also be introduced.

  20. Using the Reactome Database

    PubMed Central

    Haw, Robin

    2012-01-01

    There is considerable interest in the bioinformatics community in creating pathway databases. The Reactome project (a collaboration between the Ontario Institute for Cancer Research, Cold Spring Harbor Laboratory, New York University Medical Center and the European Bioinformatics Institute) is one such pathway database and collects structured information on all the biological pathways and processes in the human. It is an expert-authored and peer-reviewed, curated collection of well-documented molecular reactions that span the gamut from simple intermediate metabolism to signaling pathways and complex cellular events. This information is supplemented with likely orthologous molecular reactions in mouse, rat, zebrafish, worm and other model organisms. This unit describes how to use the Reactome database to learn the steps of a biological pathway; navigate and browse through the Reactome database; identify the pathways in which a molecule of interest is involved; use the Pathway and Expression analysis tools to search the database for and visualize possible connections within user-supplied experimental data set and Reactome pathways; and the Species Comparison tool to compare human and model organism pathways. PMID:22700314

  1. FishTraits Database

    USGS Publications Warehouse

    Angermeier, Paul L.; Frimpong, Emmanuel A.

    2009-01-01

    The need for integrated and widely accessible sources of species traits data to facilitate studies of ecology, conservation, and management has motivated development of traits databases for various taxa. In spite of the increasing number of traits-based analyses of freshwater fishes in the United States, no consolidated database of traits of this group exists publicly, and much useful information on these species is documented only in obscure sources. The largely inaccessible and unconsolidated traits information makes large-scale analysis involving many fishes and/or traits particularly challenging. FishTraits is a database of >100 traits for 809 (731 native and 78 exotic) fish species found in freshwaters of the conterminous United States, including 37 native families and 145 native genera. The database contains information on four major categories of traits: (1) trophic ecology, (2) body size and reproductive ecology (life history), (3) habitat associations, and (4) salinity and temperature tolerances. Information on geographic distribution and conservation status is also included. Together, we refer to the traits, distribution, and conservation status information as attributes. Descriptions of attributes are available here. Many sources were consulted to compile attributes, including state and regional species accounts and other databases.

  2. NASA Records Database

    NASA Technical Reports Server (NTRS)

    Callac, Christopher; Lunsford, Michelle

    2005-01-01

    The NASA Records Database, comprising a Web-based application program and a database, is used to administer an archive of paper records at Stennis Space Center. The system begins with an electronic form, into which a user enters information about records that the user is sending to the archive. The form is smart : it provides instructions for entering information correctly and prompts the user to enter all required information. Once complete, the form is digitally signed and submitted to the database. The system determines which storage locations are not in use, assigns the user s boxes of records to some of them, and enters these assignments in the database. Thereafter, the software tracks the boxes and can be used to locate them. By use of search capabilities of the software, specific records can be sought by box storage locations, accession numbers, record dates, submitting organizations, or details of the records themselves. Boxes can be marked with such statuses as checked out, lost, transferred, and destroyed. The system can generate reports showing boxes awaiting destruction or transfer. When boxes are transferred to the National Archives and Records Administration (NARA), the system can automatically fill out NARA records-transfer forms. Currently, several other NASA Centers are considering deploying the NASA Records Database to help automate their records archives.

  3. Integrating Paleoecological Databases

    NASA Astrophysics Data System (ADS)

    Blois, Jessica; Goring, Simon; Smith, Alison

    2011-02-01

    Neotoma Consortium Workshop; Madison, Wisconsin, 23-26 September 2010 ; Paleoecology can contribute much to global change science, as paleontological records provide rich information about species range shifts, changes in vegetation composition and productivity, aquatic and terrestrial ecosystem responses to abrupt climate change, and paleoclimate reconstruction, for example. However, while paleoecology is increasingly a multidisciplinary, multiproxy field focused on biotic responses to global change, most paleo databases focus on single-proxy groups. The Neotoma Paleoecology Database (http://www.neotomadb.org) aims to remedy this limitation by integrating discipline-specific databases to facilitate cross-community queries and analyses. In September, Neotoma consortium members and representatives from other databases and data communities met at the University of Wisconsin-Madison to launch the second development phase of Neotoma. The workshop brought together 54 international specialists, including Neotoma data stewards, users, and developers. Goals for the meeting were fourfold: (1) develop working plans for existing data communities; (2) identify new data types and sources; (3) enhance data access, visualization, and analysis on the Neotoma Web site; and (4) coordinate with other databases and cooperate in tool development and sharing.

  4. VIEWCACHE: An incremental database access method for autonomous interoperable databases

    NASA Technical Reports Server (NTRS)

    Roussopoulos, Nick; Sellis, Timoleon

    1991-01-01

    The objective is to illustrate the concept of incremental access to distributed databases. An experimental database management system, ADMS, which has been developed at the University of Maryland, in College Park, uses VIEWCACHE, a database access method based on incremental search. VIEWCACHE is a pointer-based access method that provides a uniform interface for accessing distributed databases and catalogues. The compactness of the pointer structures formed during database browsing and the incremental access method allow the user to search and do inter-database cross-referencing with no actual data movement between database sites. Once the search is complete, the set of collected pointers pointing to the desired data are dereferenced.

  5. Imprecision and Uncertainty in the UFO Database Model.

    ERIC Educational Resources Information Center

    Van Gyseghem, Nancy; De Caluwe, Rita

    1998-01-01

    Discusses how imprecision and uncertainty are dealt with in the UFO (Uncertainty and Fuzziness in an Object-oriented) database model. Such information is expressed by means of possibility distributions, and modeled by means of the proposed concept of "role objects." The role objects model uncertain, tentative information about objects, and thus…

  6. Statistical Profile of Currently Available CD-ROM Database Products.

    ERIC Educational Resources Information Center

    Nicholls, Paul Travis

    1988-01-01

    Survey of currently available CD-ROM products discusses: (1) subject orientation; (2) database type; (3) update frequency; (4) price structure; (5) hardware configuration; (6) retrieval software; and (7) publisher/marketer. Several graphs depict data in these areas. (five references) (MES)

  7. Variant of association of sciences

    NASA Astrophysics Data System (ADS)

    Dubro, V. G.

    2008-03-01

    The variant of association of sciences, offered in the given work, is based on the multilevel description complex (with dynamic treelike structure and with number of levels > 2) systems. In its basis the definition such physical, but dimensionless functions of interactions, as quality and quantity of the information, developed at interaction, lays. Thus levels are defined by hierarchy of interactions and "subordinates" to itself physical functions (space - time systems of readout, energy, pulse, moment of a pulse, information, etc.). The relations, traditional for them, and laws remain, but as special cases of more general (common) relations, i.e. they are formulated anew on some new uniform base of the reconsidered concepts.

  8. C3 variants in Japanese.

    PubMed

    Nishimukai, H; Kitamura, H; Sano, Y; Tamaki, Y

    1985-01-01

    By high-voltage agarose gel electrophoresis, seven phenotypes of C3 were found in Japanese. The allele frequencies for C3*S, C3*S025, C3*S02, C3*F, C3*F06, C3*F065, and C3*F08 were 0.9943, 0.0003, 0.0003, 0.0006, 0.0003, 0.0021, and 0.0021, respectively. CH50, C3/C3c protein concentrations, and C3 hemolytic activities in fresh sera with variant C3 phenotypes were within the normal ranges. PMID:3988301

  9. High Performance Database Management for Earth Sciences

    NASA Technical Reports Server (NTRS)

    Rishe, Naphtali; Barton, David; Urban, Frank; Chekmasov, Maxim; Martinez, Maria; Alvarez, Elms; Gutierrez, Martha; Pardo, Philippe

    1998-01-01

    The High Performance Database Research Center at Florida International University is completing the development of a highly parallel database system based on the semantic/object-oriented approach. This system provides exceptional usability and flexibility. It allows shorter application design and programming cycles and gives the user control via an intuitive information structure. It empowers the end-user to pose complex ad hoc decision support queries. Superior efficiency is provided through a high level of optimization, which is transparent to the user. Manifold reduction in storage size is allowed for many applications. This system allows for operability via internet browsers. The system will be used for the NASA Applications Center program to store remote sensing data, as well as for Earth Science applications.

  10. Open Geoscience Database

    NASA Astrophysics Data System (ADS)

    Bashev, A.

    2012-04-01

    Currently there is an enormous amount of various geoscience databases. Unfortunately the only users of the majority of the databases are their elaborators. There are several reasons for that: incompaitability, specificity of tasks and objects and so on. However the main obstacles for wide usage of geoscience databases are complexity for elaborators and complication for users. The complexity of architecture leads to high costs that block the public access. The complication prevents users from understanding when and how to use the database. Only databases, associated with GoogleMaps don't have these drawbacks, but they could be hardly named "geoscience" Nevertheless, open and simple geoscience database is necessary at least for educational purposes (see our abstract for ESSI20/EOS12). We developed a database and web interface to work with them and now it is accessible at maps.sch192.ru. In this database a result is a value of a parameter (no matter which) in a station with a certain position, associated with metadata: the date when the result was obtained; the type of a station (lake, soil etc); the contributor that sent the result. Each contributor has its own profile, that allows to estimate the reliability of the data. The results can be represented on GoogleMaps space image as a point in a certain position, coloured according to the value of the parameter. There are default colour scales and each registered user can create the own scale. The results can be also extracted in *.csv file. For both types of representation one could select the data by date, object type, parameter type, area and contributor. The data are uploaded in *.csv format: Name of the station; Lattitude(dd.dddddd); Longitude(ddd.dddddd); Station type; Parameter type; Parameter value; Date(yyyy-mm-dd). The contributor is recognised while entering. This is the minimal set of features that is required to connect a value of a parameter with a position and see the results. All the complicated data

  11. Spectrum of DNA variants for non-syndromic deafness in a large cohort from multiple continents.

    PubMed

    Yan, Denise; Tekin, Demet; Bademci, Guney; Foster, Joseph; Cengiz, F Basak; Kannan-Sundhari, Abhiraami; Guo, Shengru; Mittal, Rahul; Zou, Bing; Grati, Mhamed; Kabahuma, Rosemary I; Kameswaran, Mohan; Lasisi, Taye J; Adedeji, Waheed A; Lasisi, Akeem O; Menendez, Ibis; Herrera, Marianna; Carranza, Claudia; Maroofian, Reza; Crosby, Andrew H; Bensaid, Mariem; Masmoudi, Saber; Behnam, Mahdiyeh; Mojarrad, Majid; Feng, Yong; Duman, Duygu; Mawla, Alex M; Nord, Alex S; Blanton, Susan H; Liu, Xue Z; Tekin, Mustafa

    2016-08-01

    Hearing loss is the most common sensory deficit in humans with causative variants in over 140 genes. With few exceptions, however, the population-specific distribution for many of the identified variants/genes is unclear. Until recently, the extensive genetic and clinical heterogeneity of deafness precluded comprehensive genetic analysis. Here, using a custom capture panel (MiamiOtoGenes), we undertook a targeted sequencing of 180 genes in a multi-ethnic cohort of 342 GJB2 mutation-negative deaf probands from South Africa, Nigeria, Tunisia, Turkey, Iran, India, Guatemala, and the United States (South Florida). We detected causative DNA variants in 25 % of multiplex and 7 % of simplex families. The detection rate varied between 0 and 57 % based on ethnicity, with Guatemala and Iran at the lower and higher end of the spectrum, respectively. We detected causative variants within 27 genes without predominant recurring pathogenic variants. The most commonly implicated genes include MYO15A, SLC26A4, USH2A, MYO7A, MYO6, and TRIOBP. Overall, our study highlights the importance of family history and generation of databases for multiple ethnically discrete populations to improve our ability to detect and accurately interpret genetic variants for pathogenicity. PMID:27344577

  12. Protein Structure Databases.

    PubMed

    Laskowski, Roman A

    2016-01-01

    Web-based protein structure databases come in a wide variety of types and levels of information content. Those having the most general interest are the various atlases that describe each experimentally determined protein structure and provide useful links, analyses, and schematic diagrams relating to its 3D structure and biological function. Also of great interest are the databases that classify 3D structures by their folds as these can reveal evolutionary relationships which may be hard to detect from sequence comparison alone. Related to these are the numerous servers that compare folds-particularly useful for newly solved structures, and especially those of unknown function. Beyond these are a vast number of databases for the more specialized user, dealing with specific families, diseases, structural features, and so on. PMID:27115626

  13. Cytochrome P450 database.

    PubMed

    Lisitsa, A V; Gusev, S A; Karuzina, I I; Archakov, A I; Koymans, L

    2001-01-01

    This paper describes a specialized database dedicated exclusively to the cytochrome P450 superfamily. The system provides the impression of superfamily's nomenclature and describes structure and function of different P450 enzymes. Information on P450-catalyzed reactions, substrate preferences, peculiarities of induction and inhibition is available through the database management system. Also the source genes and appropriate translated proteins can be retrieved together with corresponding literature references. Developed programming solution provides the flexible interface for browsing, searching, grouping and reporting the information. Local version of database manager and required data files are distributed on a compact disk. Besides, there is a network version of the software available on Internet. The network version implies the original mechanism, which is useful for the permanent online extension of the data scope. PMID:11769119

  14. Mouse genome database 2016

    PubMed Central

    Bult, Carol J.; Eppig, Janan T.; Blake, Judith A.; Kadin, James A.; Richardson, Joel E.

    2016-01-01

    The Mouse Genome Database (MGD; http://www.informatics.jax.org) is the primary community model organism database for the laboratory mouse and serves as the source for key biological reference data related to mouse genes, gene functions, phenotypes and disease models with a strong emphasis on the relationship of these data to human biology and disease. As the cost of genome-scale sequencing continues to decrease and new technologies for genome editing become widely adopted, the laboratory mouse is more important than ever as a model system for understanding the biological significance of human genetic variation and for advancing the basic research needed to support the emergence of genome-guided precision medicine. Recent enhancements to MGD include new graphical summaries of biological annotations for mouse genes, support for mobile access to the database, tools to support the annotation and analysis of sets of genes, and expanded support for comparative biology through the expansion of homology data. PMID:26578600

  15. National Ambient Radiation Database

    SciTech Connect

    Dziuban, J.; Sears, R.

    2003-02-25

    The U.S. Environmental Protection Agency (EPA) recently developed a searchable database and website for the Environmental Radiation Ambient Monitoring System (ERAMS) data. This site contains nationwide radiation monitoring data for air particulates, precipitation, drinking water, surface water and pasteurized milk. This site provides location-specific as well as national information on environmental radioactivity across several media. It provides high quality data for assessing public exposure and environmental impacts resulting from nuclear emergencies and provides baseline data during routine conditions. The database and website are accessible at www.epa.gov/enviro/. This site contains (1) a query for the general public which is easy to use--limits the amount of information provided, but includes the ability to graph the data with risk benchmarks and (2) a query for a more technical user which allows access to all of the data in the database, (3) background information on ER AMS.

  16. Mouse genome database 2016.

    PubMed

    Bult, Carol J; Eppig, Janan T; Blake, Judith A; Kadin, James A; Richardson, Joel E

    2016-01-01

    The Mouse Genome Database (MGD; http://www.informatics.jax.org) is the primary community model organism database for the laboratory mouse and serves as the source for key biological reference data related to mouse genes, gene functions, phenotypes and disease models with a strong emphasis on the relationship of these data to human biology and disease. As the cost of genome-scale sequencing continues to decrease and new technologies for genome editing become widely adopted, the laboratory mouse is more important than ever as a model system for understanding the biological significance of human genetic variation and for advancing the basic research needed to support the emergence of genome-guided precision medicine. Recent enhancements to MGD include new graphical summaries of biological annotations for mouse genes, support for mobile access to the database, tools to support the annotation and analysis of sets of genes, and expanded support for comparative biology through the expansion of homology data. PMID:26578600

  17. ARTI Refrigerant Database

    SciTech Connect

    Calm, J.M.

    1992-04-30

    The Refrigerant Database consolidates and facilitates access to information to assist industry in developing equipment using alternative refrigerants. The underlying purpose is to accelerate phase out of chemical compounds of environmental concern. The database provides bibliographic citations and abstracts for publications that may be useful in research and design of air- conditioning and refrigeration equipment. The complete documents are not included, though some may be added at a later date. The database identifies sources of specific information on R-32, R-123, R-124, R- 125, R-134a, R-141b, R142b, R-143a, R-152a, R-290 (propane), R-717 (ammonia), ethers, and others as well as azeotropic and zeotropic blends of these fluids. It addresses polyalkylene glycol (PAG), ester, and other lubricants. It also references documents addressing compatibility of refrigerants and lubricants with metals, plastics, elastomers, motor insulation, and other materials used in refrigerant circuits.

  18. The PROSITE database.

    PubMed

    Hulo, Nicolas; Bairoch, Amos; Bulliard, Virginie; Cerutti, Lorenzo; De Castro, Edouard; Langendijk-Genevaux, Petra S; Pagni, Marco; Sigrist, Christian J A

    2006-01-01

    The PROSITE database consists of a large collection of biologically meaningful signatures that are described as patterns or profiles. Each signature is linked to a documentation that provides useful biological information on the protein family, domain or functional site identified by the signature. The PROSITE database is now complemented by a series of rules that can give more precise information about specific residues. During the last 2 years, the documentation and the ScanProsite web pages were redesigned to add more functionalities. The latest version of PROSITE (release 19.11 of September 27, 2005) contains 1329 patterns and 552 profile entries. Over the past 2 years more than 200 domains have been added, and now 52% of UniProtKB/Swiss-Prot entries (release 48.1 of September 27, 2005) have a cross-reference to a PROSITE entry. The database is accessible at http://www.expasy.org/prosite/. PMID:16381852

  19. The PROSITE database

    PubMed Central

    Hulo, Nicolas; Bairoch, Amos; Bulliard, Virginie; Cerutti, Lorenzo; De Castro, Edouard; Langendijk-Genevaux, Petra S.; Pagni, Marco; Sigrist, Christian J. A.

    2006-01-01

    The PROSITE database consists of a large collection of biologically meaningful signatures that are described as patterns or profiles. Each signature is linked to a documentation that provides useful biological information on the protein family, domain or functional site identified by the signature. The PROSITE database is now complemented by a series of rules that can give more precise information about specific residues. During the last 2 years, the documentation and the ScanProsite web pages were redesigned to add more functionalities. The latest version of PROSITE (release 19.11 of September 27, 2005) contains 1329 patterns and 552 profile entries. Over the past 2 years more than 200 domains have been added, and now 52% of UniProtKB/Swiss-Prot entries (release 48.1 of September 27, 2005) have a cross-reference to a PROSITE entry. The database is accessible at . PMID:16381852

  20. Identification of alternative splice variants in Aspergillus flavus through comparison of multiple tandem MS search algorithms

    PubMed Central

    2011-01-01

    Background Database searching is the most frequently used approach for automated peptide assignment and protein inference of tandem mass spectra. The results, however, depend on the sequences in target databases and on search algorithms. Recently by using an alternative splicing database, we identified more proteins than with the annotated proteins in Aspergillus flavus. In this study, we aimed at finding a greater number of eligible splice variants based on newly available transcript sequences and the latest genome annotation. The improved database was then used to compare four search algorithms: Mascot, OMSSA, X! Tandem, and InsPecT. Results The updated alternative splicing database predicted 15833 putative protein variants, 61% more than the previous results. There was transcript evidence for 50% of the updated genes compared to the previous 35% coverage. Database searches were conducted using the same set of spectral data, search parameters, and protein database but with different algorithms. The false discovery rates of the peptide-spectrum matches were estimated < 2%. The numbers of the total identified proteins varied from 765 to 867 between algorithms. Whereas 42% (1651/3891) of peptide assignments were unanimous, the comparison showed that 51% (568/1114) of the RefSeq proteins and 15% (11/72) of the putative splice variants were inferred by all algorithms. 12 plausible isoforms were discovered by focusing on the consensus peptides which were detected by at least three different algorithms. The analysis found different conserved domains in two putative isoforms of UDP-galactose 4-epimerase. Conclusions We were able to detect dozens of new peptides using the improved alternative splicing database with the recently updated annotation of the A. flavus genome. Unlike the identifications of the peptides and the RefSeq proteins, large variations existed between the putative splice variants identified by different algorithms. 12 candidates of putative isoforms

  1. The Neotoma Paleoecology Database

    NASA Astrophysics Data System (ADS)

    Grimm, E. C.; Ashworth, A. C.; Barnosky, A. D.; Betancourt, J. L.; Bills, B.; Booth, R.; Blois, J.; Charles, D. F.; Graham, R. W.; Goring, S. J.; Hausmann, S.; Smith, A. J.; Williams, J. W.; Buckland, P.

    2015-12-01

    The Neotoma Paleoecology Database (www.neotomadb.org) is a multiproxy, open-access, relational database that includes fossil data for the past 5 million years (the late Neogene and Quaternary Periods). Modern distributional data for various organisms are also being made available for calibration and paleoecological analyses. The project is a collaborative effort among individuals from more than 20 institutions worldwide, including domain scientists representing a spectrum of Pliocene-Quaternary fossil data types, as well as experts in information technology. Working groups are active for diatoms, insects, ostracodes, pollen and plant macroscopic remains, testate amoebae, rodent middens, vertebrates, age models, geochemistry and taphonomy. Groups are also active in developing online tools for data analyses and for developing modules for teaching at different levels. A key design concept of NeotomaDB is that stewards for various data types are able to remotely upload and manage data. Cooperatives for different kinds of paleo data, or from different regions, can appoint their own stewards. Over the past year, much progress has been made on development of the steward software-interface that will enable this capability. The steward interface uses web services that provide access to the database. More generally, these web services enable remote programmatic access to the database, which both desktop and web applications can use and which provide real-time access to the most current data. Use of these services can alleviate the need to download the entire database, which can be out-of-date as soon as new data are entered. In general, the Neotoma web services deliver data either from an entire table or from the results of a view. Upon request, new web services can be quickly generated. Future developments will likely expand the spatial and temporal dimensions of the database. NeotomaDB is open to receiving new datasets and stewards from the global Quaternary community

  2. Database Management System

    NASA Technical Reports Server (NTRS)

    1990-01-01

    In 1981 Wayne Erickson founded Microrim, Inc, a company originally focused on marketing a microcomputer version of RIM (Relational Information Manager). Dennis Comfort joined the firm and is now vice president, development. The team developed an advanced spinoff from the NASA system they had originally created, a microcomputer database management system known as R:BASE 4000. Microrim added many enhancements and developed a series of R:BASE products for various environments. R:BASE is now the second largest selling line of microcomputer database management software in the world.

  3. The Genopolis Microarray Database

    PubMed Central

    Splendiani, Andrea; Brandizi, Marco; Even, Gael; Beretta, Ottavio; Pavelka, Norman; Pelizzola, Mattia; Mayhaus, Manuel; Foti, Maria; Mauri, Giancarlo; Ricciardi-Castagnoli, Paola

    2007-01-01

    Background Gene expression databases are key resources for microarray data management and analysis and the importance of a proper annotation of their content is well understood. Public repositories as well as microarray database systems that can be implemented by single laboratories exist. However, there is not yet a tool that can easily support a collaborative environment where different users with different rights of access to data can interact to define a common highly coherent content. The scope of the Genopolis database is to provide a resource that allows different groups performing microarray experiments related to a common subject to create a common coherent knowledge base and to analyse it. The Genopolis database has been implemented as a dedicated system for the scientific community studying dendritic and macrophage cells functions and host-parasite interactions. Results The Genopolis Database system allows the community to build an object based MIAME compliant annotation of their experiments and to store images, raw and processed data from the Affymetrix GeneChip® platform. It supports dynamical definition of controlled vocabularies and provides automated and supervised steps to control the coherence of data and annotations. It allows a precise control of the visibility of the database content to different sub groups in the community and facilitates exports of its content to public repositories. It provides an interactive users interface for data analysis: this allows users to visualize data matrices based on functional lists and sample characterization, and to navigate to other data matrices defined by similarity of expression values as well as functional characterizations of genes involved. A collaborative environment is also provided for the definition and sharing of functional annotation by users. Conclusion The Genopolis Database supports a community in building a common coherent knowledge base and analyse it. This fills a gap between a local

  4. JICST Factual Database(1)

    NASA Astrophysics Data System (ADS)

    Kurosawa, Shinji

    The outline of JICST factual database (JOIS-F), which JICST has started from January, 1988, and its online service are described in this paper. First, the author mentions the circumstances from 1973, when its planning was started, to the present, and its relation to "Project by Special Coordination Founds for Promoting Science and Technology". Secondly, databases, which are now under development aiming to start its services from fiscal 1988 or fiscal 1989, of DNA, metallic material intensity, crystal structure, chemical substance regulations, and so forth, are described. Lastly, its online service is briefly explained.

  5. The Ribosomal Database Project.

    PubMed Central

    Maidak, B L; Larsen, N; McCaughey, M J; Overbeek, R; Olsen, G J; Fogel, K; Blandy, J; Woese, C R

    1994-01-01

    The Ribosomal Database Project (RDP) is a curated database that offers ribosome-related data, analysis services, and associated computer programs. The offerings include phylogenetically ordered alignments of ribosomal RNA (rRNA) sequences, derived phylogenetic trees, rRNA secondary structure diagrams, and various software for handling, analyzing and displaying alignments and trees. The data are available via anonymous ftp (rdp.life.uiuc.edu), electronic mail (server/rdp.life.uiuc.edu) and gopher (rdpgopher.life.uiuc.edu). The electronic mail server also provides ribosomal probe checking, approximate phylogenetic placement of user-submitted sequences, screening for chimeric nature of newly sequenced rRNAs, and automated alignment. PMID:7524021

  6. Efficient hemodynamic event detection utilizing relational databases and wavelet analysis

    NASA Technical Reports Server (NTRS)

    Saeed, M.; Mark, R. G.

    2001-01-01

    Development of a temporal query framework for time-oriented medical databases has hitherto been a challenging problem. We describe a novel method for the detection of hemodynamic events in multiparameter trends utilizing wavelet coefficients in a MySQL relational database. Storage of the wavelet coefficients allowed for a compact representation of the trends, and provided robust descriptors for the dynamics of the parameter time series. A data model was developed to allow for simplified queries along several dimensions and time scales. Of particular importance, the data model and wavelet framework allowed for queries to be processed with minimal table-join operations. A web-based search engine was developed to allow for user-defined queries. Typical queries required between 0.01 and 0.02 seconds, with at least two orders of magnitude improvement in speed over conventional queries. This powerful and innovative structure will facilitate research on large-scale time-oriented medical databases.

  7. Histone variants: emerging players in cancer biology

    PubMed Central

    Vardabasso, Chiara; Hasson, Dan; Ratnakumar, Kajan; Chung, Chi-Yeh; Duarte, Luis F.

    2014-01-01

    Histone variants are key players in shaping chromatin structure, and, thus, in regulating fundamental cellular processes such as chromosome segregation and gene expression. Emerging evidence points towards a role for histone variants in contributing to tumor progression, and, recently, the first cancer-associated mutation in a histone variant-encoding gene was reported. In addition, genetic alterations of the histone chaperones that specifically regulate chromatin incorporation of histone variants are rapidly being uncovered in numerous cancers. Collectively, these findings implicate histone variants as potential drivers of cancer initiation and/or progression, and, therefore, targeting histone deposition or the chromatin remodeling machinery may be of therapeutic value. Here, we review the mammalian histone variants of the H2A and H3 families in their respective cellular functions, and their involvement in tumor biology. PMID:23652611

  8. ClinLabGeneticist: a tool for clinical management of genetic variants from whole exome sequencing in clinical genetic laboratories.

    PubMed

    Wang, Jinlian; Liao, Jun; Zhang, Jinglan; Cheng, Wei-Yi; Hakenberg, Jörg; Ma, Meng; Webb, Bryn D; Ramasamudram-Chakravarthi, Rajasekar; Karger, Lisa; Mehta, Lakshmi; Kornreich, Ruth; Diaz, George A; Li, Shuyu; Edelmann, Lisa; Chen, Rong

    2015-01-01

    Routine clinical application of whole exome sequencing remains challenging due to difficulties in variant interpretation, large dataset management, and workflow integration. We describe a tool named ClinLabGeneticist to implement a workflow in clinical laboratories for management of variant assessment in genetic testing and disease diagnosis. We established an extensive variant annotation data source for the identification of pathogenic variants. A dashboard was deployed to aid a multi-step, hierarchical review process leading to final clinical decisions on genetic variant assessment. In addition, a central database was built to archive all of the genetic testing data, notes, and comments throughout the review process, variant validation data by Sanger sequencing as well as the final clinical reports for future reference. The entire workflow including data entry, distribution of work assignments, variant evaluation and review, selection of variants for validation, report generation, and communications between various personnel is integrated into a single data management platform. Three case studies are presented to illustrate the utility of ClinLabGeneticist. ClinLabGeneticist is freely available to academia at http://rongchenlab.org/software/clinlabgeneticist . PMID:26338694

  9. Exome-based proteogenomics of HEK-293 human cell line: Coding genomic variants identified at the level of shotgun proteome.

    PubMed

    Lobas, Anna A; Karpov, Dmitry S; Kopylov, Arthur T; Solovyeva, Elizaveta M; Ivanov, Mark V; Ilina, Irina Y; Lazarev, Vassily N; Kuznetsova, Ksenia G; Ilgisonis, Ekaterina V; Zgoda, Victor G; Gorshkov, Mikhail V; Moshkovskii, Sergei A

    2016-07-01

    Genomic and proteomic data were integrated into the proteogenomic workflow to identify coding genomic variants of Human Embryonic Kidney 293 (HEK-293) cell line at the proteome level. Shotgun proteome data published by Geiger et al. (2012), Chick et al. (2015), and obtained in this work for HEK-293 were searched against the customized genomic database generated using exome data published by Lin et al. (2014). Overall, 112 unique variants were identified at the proteome level out of ∼1200 coding variants annotated in the exome. Seven identified variants were shared between all the three considered proteomic datasets, and 27 variants were found in any two datasets. Some of the found variants belonged to widely known genomic polymorphisms originated from the germline, while the others were more likely resulting from somatic mutations. At least, eight of the proteins bearing amino acid variants were annotated as cancer-related ones, including p53 tumor suppressor. In all the considered shotgun datasets, the variant peptides were at the ratio of 1:2.5 less likely being identified than the wild-type ones compared with the corresponding theoretical peptides. This can be explained by the presence of the so-called "passenger" mutations in the genes, which were never expressed in HEK-293 cells. All MS data have been deposited in the ProteomeXchange with the dataset identifier PXD002613 (http://proteomecentral.proteomexchange.org/dataset/PXD002613). PMID:27233776

  10. Thirteen new patients with guanidinoacetate methyltransferase deficiency and functional characterization of nineteen novel missense variants in the GAMT gene.

    PubMed

    Mercimek-Mahmutoglu, Saadet; Ndika, Joseph; Kanhai, Warsha; de Villemeur, Thierry Billette; Cheillan, David; Christensen, Ernst; Dorison, Nathalie; Hannig, Vickie; Hendriks, Yvonne; Hofstede, Floris C; Lion-Francois, Laurence; Lund, Allan M; Mundy, Helen; Pitelet, Gaele; Raspall-Chaure, Miquel; Scott-Schwoerer, Jessica A; Szakszon, Katalin; Valayannopoulos, Vassili; Williams, Monique; Salomons, Gajja S

    2014-04-01

    Guanidinoacetate methyltransferase deficiency (GAMT-D) is an autosomal recessively inherited disorder of creatine biosynthesis. Creatine deficiency on cranial proton magnetic resonance spectroscopy, and elevated guanidinoacetate levels in body fluids are the biomarkers of GAMT-D. In 74 patients, 50 different mutations in the GAMT gene have been identified with missense variants being the most common. Clinical and biochemical features of the patients with missense variants were obtained from their physicians using a questionnaire. In 20 patients, 17 missense variants, 25% had a severe, 55% a moderate, and 20% a mild phenotype. The effect of these variants on GAMT enzyme activity was overexpressed using primary GAMT-D fibroblasts: 17 variants retained no significant activity and are therefore considered pathogenic. Two additional variants, c.22C>A (p.Pro8Thr) and c.79T>C (p.Tyr27His) (the latter detected in control cohorts) are in fact not pathogenic as these alleles restored GAMT enzyme activity, although both were predicted to be possibly damaging by in silico analysis. We report 13 new patients with GAMT-D, six novel mutations and functional analysis of 19 missense variants, all being included in our public LOVD database. Our functional assay is important for the confirmation of the pathogenicity of identified missense variants in the GAMT gene. PMID:24415674

  11. Nested Variant of Urothelial Carcinoma

    PubMed Central

    Venyo, Anthony Kodzo-Grey

    2014-01-01

    Background. Nested variant of urothelial carcinoma was added to the WHO's classification in 2004. Aims. To review the literature on nested variant of urothelial carcinoma. Results. About 200 cases of the tumour have been reported so far and it has the ensuing morphological features: large numbers of small confluent irregular nests of bland-appearing, closely packed, haphazardly arranged, and poorly defined urothelial cells infiltrating the lamina propria and the muscularis propria. The tumour has a bland histomorphologic appearance, has an aggressive biological behaviour, and has at times been misdiagnosed as a benign lesion which had led to a significant delay in the establishment of the correct diagnosis and contributing to the advanced stage of the disease. Immunohistochemically, the tumour shares some characteristic features with high-risk conventional urothelial carcinomas such as high proliferation index and loss of p27 expression. However, p53, bcl-2, or EGF-r immunoreactivity is not frequently seen. The tumour must be differentiated from a number of proliferative lesions of the urothelium. Conclusions. Correct and early diagnosis of this tumour is essential to provide early curative treatment to avoid diagnosis at an advanced stage. A multicentre trial is required to identify treatment options that would improve the outcome of this tumour. PMID:24587796

  12. Alternatively Spliced Androgen Receptor Variants

    PubMed Central

    Dehm, Scott M.; Tindall, Donald J.

    2011-01-01

    Alternative splicing is an important mechanism for increasing functional diversity from a limited set of genes. De-regulation of this process is common in diverse pathologic conditions. The androgen receptor (AR) is a steroid receptor transcription factor with functions critical for normal male development as well as the growth and survival of normal and cancerous prostate tissue. Studies of AR function in androgen insensitivity syndrome (AIS) and prostate cancer (PCa) have demonstrated loss-of-function AR alterations in AIS, and gain-of-function AR alterations in PCa. Over the past two decades, AR gene alterations have been identified in various individuals with AIS, which disrupt normal AR splicing patterns and yield dysfunctional AR protein variants. More recently, altered AR splicing patterns have been identified as a mechanism of PCa progression and resistance to androgen-depletion therapy. Several studies have described the synthesis of alternatively spliced transcripts encoding truncated AR isoforms that lack the ligand-binding domain, which is the ultimate target of androgen depletion. Many of these truncated AR isoforms function as constitutively active, ligand-independent transcription factors that can support androgen-independent expression of AR target genes, as well as the androgen-independent growth of PCa cells. In this review, we will summarize the various alternatively spliced AR variants that have been discovered, with a focus on their role and origin in the pathologic conditions of AIS and PCa. PMID:21778211

  13. DataBase on Demand

    NASA Astrophysics Data System (ADS)

    Gaspar Aparicio, R.; Gomez, D.; Coterillo Coz, I.; Wojcik, D.

    2012-12-01

    At CERN a number of key database applications are running on user-managed MySQL database services. The database on demand project was born out of an idea to provide the CERN user community with an environment to develop and run database services outside of the actual centralised Oracle based database services. The Database on Demand (DBoD) empowers the user to perform certain actions that had been traditionally done by database administrators, DBA's, providing an enterprise platform for database applications. It also allows the CERN user community to run different database engines, e.g. presently open community version of MySQL and single instance Oracle database server. This article describes a technology approach to face this challenge, a service level agreement, the SLA that the project provides, and an evolution of possible scenarios.

  14. Reliably Detecting Clinically Important Variants Requires Both Combined Variant Calls and Optimized Filtering Strategies.

    PubMed

    Field, Matthew A; Cho, Vicky; Andrews, T Daniel; Goodnow, Chris C

    2015-01-01

    A diversity of tools is available for identification of variants from genome sequence data. Given the current complexity of incorporating external software into a genome analysis infrastructure, a tendency exists to rely on the results from a single tool alone. The quality of the output variant calls is highly variable however, depending on factors such as sequence library quality as well as the choice of short-read aligner, variant caller, and variant caller filtering strategy. Here we present a two-part study first using the high quality 'genome in a bottle' reference set to demonstrate the significant impact the choice of aligner, variant caller, and variant caller filtering strategy has on overall variant call quality and further how certain variant callers outperform others with increased sample contamination, an important consideration when analyzing sequenced cancer samples. This analysis confirms previous work showing that combining variant calls of multiple tools results in the best quality resultant variant set, for either specificity or sensitivity, depending on whether the intersection or union, of all variant calls is used respectively. Second, we analyze a melanoma cell line derived from a control lymphocyte sample to determine whether software choices affect the detection of clinically important melanoma risk-factor variants finding that only one of the three such variants is unanimously detected under all conditions. Finally, we describe a cogent strategy for implementing a clinical variant detection pipeline; a strategy that requires careful software selection, variant caller filtering optimizing, and combined variant calls in order to effectively minimize false negative variants. While implementing such features represents an increase in complexity and computation the results offer indisputable improvements in data quality. PMID:26600436

  15. Reliably Detecting Clinically Important Variants Requires Both Combined Variant Calls and Optimized Filtering Strategies

    PubMed Central

    Field, Matthew A.; Cho, Vicky

    2015-01-01

    A diversity of tools is available for identification of variants from genome sequence data. Given the current complexity of incorporating external software into a genome analysis infrastructure, a tendency exists to rely on the results from a single tool alone. The quality of the output variant calls is highly variable however, depending on factors such as sequence library quality as well as the choice of short-read aligner, variant caller, and variant caller filtering strategy. Here we present a two-part study first using the high quality ‘genome in a bottle’ reference set to demonstrate the significant impact the choice of aligner, variant caller, and variant caller filtering strategy has on overall variant call quality and further how certain variant callers outperform others with increased sample contamination, an important consideration when analyzing sequenced cancer samples. This analysis confirms previous work showing that combining variant calls of multiple tools results in the best quality resultant variant set, for either specificity or sensitivity, depending on whether the intersection or union, of all variant calls is used respectively. Second, we analyze a melanoma cell line derived from a control lymphocyte sample to determine whether software choices affect the detection of clinically important melanoma risk-factor variants finding that only one of the three such variants is unanimously detected under all conditions. Finally, we describe a cogent strategy for implementing a clinical variant detection pipeline; a strategy that requires careful software selection, variant caller filtering optimizing, and combined variant calls in order to effectively minimize false negative variants. While implementing such features represents an increase in complexity and computation the results offer indisputable improvements in data quality. PMID:26600436

  16. Beyond Rare-Variant Association Testing: Pinpointing Rare Causal Variants in Case-Control Sequencing Study

    PubMed Central

    Lin, Wan-Yu

    2016-01-01

    Rare-variant association testing usually requires some method of aggregation. The next important step is to pinpoint individual rare causal variants among a large number of variants within a genetic region. Recently Ionita-Laza et al. propose a backward elimination (BE) procedure that can identify individual causal variants among the many variants in a gene. The BE procedure removes a variant if excluding this variant can lead to a smaller P-value for the BURDEN test (referred to as “BE-BURDEN”) or the SKAT test (referred to as “BE-SKAT”). We here use the adaptive combination of P-values (ADA) method to pinpoint causal variants. Unlike most gene-based association tests, the ADA statistic is built upon per-site P-values of individual variants. It is straightforward to select important variants given the optimal P-value truncation threshold found by ADA. We performed comprehensive simulations to compare ADA with BE-SKAT and BE-BURDEN. Ranking these three approaches according to positive predictive values (PPVs), the percentage of truly causal variants among the total selected variants, we found ADA > BE-SKAT > BE-BURDEN across all simulation scenarios. We therefore recommend using ADA to pinpoint plausible rare causal variants in a gene. PMID:26903168

  17. The UCSC Genome Browser database: 2016 update.

    PubMed

    Speir, Matthew L; Zweig, Ann S; Rosenbloom, Kate R; Raney, Brian J; Paten, Benedict; Nejad, Parisa; Lee, Brian T; Learned, Katrina; Karolchik, Donna; Hinrichs, Angie S; Heitner, Steve; Harte, Rachel A; Haeussler, Maximilian; Guruvadoo, Luvina; Fujita, Pauline A; Eisenhart, Christopher; Diekhans, Mark; Clawson, Hiram; Casper, Jonathan; Barber, Galt P; Haussler, David; Kuhn, Robert M; Kent, W James

    2016-01-01

    For the past 15 years, the UCSC Genome Browser (http://genome.ucsc.edu/) has served the international research community by offering an integrated platform for viewing and analyzing information from a large database of genome assemblies and their associated annotations. The UCSC Genome Browser has been under continuous development since its inception with new data sets and software features added frequently. Some release highlights of this year include new and updated genome browsers for various assemblies, including bonobo and zebrafish; new gene annotation sets; improvements to track and assembly hub support; and a new interactive tool, the "Data Integrator", for intersecting data from multiple tracks. We have greatly expanded the data sets available on the most recent human assembly, hg38/GRCh38, to include updated gene prediction sets from GENCODE, more phenotype- and disease-associated variants from ClinVar and ClinGen, more genomic regulatory data, and a new multiple genome alignment. PMID:26590259

  18. The UCSC Genome Browser database: 2016 update

    PubMed Central

    Speir, Matthew L.; Zweig, Ann S.; Rosenbloom, Kate R.; Raney, Brian J.; Paten, Benedict; Nejad, Parisa; Lee, Brian T.; Learned, Katrina; Karolchik, Donna; Hinrichs, Angie S.; Heitner, Steve; Harte, Rachel A.; Haeussler, Maximilian; Guruvadoo, Luvina; Fujita, Pauline A.; Eisenhart, Christopher; Diekhans, Mark; Clawson, Hiram; Casper, Jonathan; Barber, Galt P.; Haussler, David; Kuhn, Robert M.; Kent, W. James

    2016-01-01

    For the past 15 years, the UCSC Genome Browser (http://genome.ucsc.edu/) has served the international research community by offering an integrated platform for viewing and analyzing information from a large database of genome assemblies and their associated annotations. The UCSC Genome Browser has been under continuous development since its inception with new data sets and software features added frequently. Some release highlights of this year include new and updated genome browsers for various assemblies, including bonobo and zebrafish; new gene annotation sets; improvements to track and assembly hub support; and a new interactive tool, the “Data Integrator”, for intersecting data from multiple tracks. We have greatly expanded the data sets available on the most recent human assembly, hg38/GRCh38, to include updated gene prediction sets from GENCODE, more phenotype- and disease-associated variants from ClinVar and ClinGen, more genomic regulatory data, and a new multiple genome alignment. PMID:26590259

  19. Incidental Variants Are Critical for Genomics

    PubMed Central

    Biesecker, Leslie G.

    2013-01-01

    The topic of incidental variants detected through exome and genome sequencing is controversial, both in clinical practice and in research. The arguments for and against the deliberate analysis and return of incidental variants focus on issues of clinical validity, clinical utility, autonomy, clinical and research infrastructure and costs, and, in the research arena, therapeutic misconception. These topics are briefly reviewed and an argument is made that these variants are the future of genomic medicine. As a field, we should take full advantage of all opportunities to study these variants by searching them out, returning them to patients and research participants, and studying their utility for predictive medicine. PMID:23643378

  20. The CEBAF Element Database

    SciTech Connect

    Theodore Larrieu, Christopher Slominski, Michele Joyce

    2011-03-01

    With the inauguration of the CEBAF Element Database (CED) in Fall 2010, Jefferson Lab computer scientists have taken a step toward the eventual goal of a model-driven accelerator. Once fully populated, the database will be the primary repository of information used for everything from generating lattice decks to booting control computers to building controls screens. A requirement influencing the CED design is that it provide access to not only present, but also future and past configurations of the accelerator. To accomplish this, an introspective database schema was designed that allows new elements, types, and properties to be defined on-the-fly with no changes to table structure. Used in conjunction with Oracle Workspace Manager, it allows users to query data from any time in the database history with the same tools used to query the present configuration. Users can also check-out workspaces to use as staging areas for upcoming machine configurations. All Access to the CED is through a well-documented Application Programming Interface (API) that is translated automatically from original C++ source code into native libraries for scripting languages such as perl, php, and TCL making access to the CED easy and ubiquitous.

  1. GENERAL PERMITS DATABASE

    EPA Science Inventory

    Resource Purpose:This database was used to provide permit writers with a library of examples for writing general permits. It has not been maintained and is outdated and will be removed. Water Permits Division is trying to determine whether or not to recreate this databas...

  2. Triatomic Spectral Database

    National Institute of Standards and Technology Data Gateway

    SRD 117 Triatomic Spectral Database (Web, free access)   All of the rotational spectral lines observed and reported in the open literature for 55 triatomic molecules have been tabulated. The isotopic molecular species, assigned quantum numbers, observed frequency, estimated measurement uncertainty and reference are given for each transition reported.

  3. Hydrocarbon Spectral Database

    National Institute of Standards and Technology Data Gateway

    SRD 115 Hydrocarbon Spectral Database (Web, free access)   All of the rotational spectral lines observed and reported in the open literature for 91 hydrocarbon molecules have been tabulated. The isotopic molecular species, assigned quantum numbers, observed frequency, estimated measurement uncertainty and reference are given for each transition reported.

  4. Redis database administration tool

    Energy Science and Technology Software Center (ESTSC)

    2013-02-13

    MyRedis is a product of the Lorenz subproject under the ASC Scirntific Data Management effort. MyRedis is a web based utility designed to allow easy administration of instances of Redis databases. It can be usedd to view and manipulate data as well as run commands directly against a variety of different Redis hosts.

  5. NATIONAL NUTRIENTS DATABASE

    EPA Science Inventory

    Resource Purpose:The Nutrient Criteria Program has initiated development of a National relational database application that will be used to store and analyze nutrient data. The ultimate use of these data will be to derive ecoregion- and waterbody-specific numeric nutrient...

  6. NATIONAL ASSESSMENT DATABASE (NAD)

    EPA Science Inventory

    Resource Purpose:The National Assessment Database stores State water quality assessments that are reported under Section 305(b) of the Clean Water Act. The data are stored by individual water quality assessments. Threatened, partially and not supporting waters also have da...

  7. Patent Family Databases.

    ERIC Educational Resources Information Center

    Simmons, Edlyn S.

    1985-01-01

    Reports on retrieval of patent information online and includes definition of patent family, basic and equivalent patents, "parents and children" applications, designated states, patent family databases--International Patent Documentation Center, World Patents Index, APIPAT (American Petroleum Institute), CLAIMS (IFI/Plenum). A table noting country…

  8. The Exoplanet Orbit Database

    NASA Astrophysics Data System (ADS)

    Wright, J. T.; Fakhouri, O.; Marcy, G. W.; Han, E.; Feng, Y.; Johnson, John Asher; Howard, A. W.; Fischer, D. A.; Valenti, J. A.; Anderson, J.; Piskunov, N.

    2011-04-01

    We present a database of well-determined orbital parameters of exoplanets, and their host stars' properties. This database comprises spectroscopic orbital elements measured for 427 planets orbiting 363 stars from radial velocity and transit measurements as reported in the literature. We have also compiled fundamental transit parameters, stellar parameters, and the method used for the planets discovery. This Exoplanet Orbit Database includes all planets with robust, well measured orbital parameters reported in peer-reviewed articles. The database is available in a searchable, filterable, and sortable form online through the Exoplanets Data Explorer table, and the data can be plotted and explored through the Exoplanet Data Explorer plotter. We use the Data Explorer to generate publication-ready plots, giving three examples of the signatures of exoplanet migration and dynamical evolution: We illustrate the character of the apparent correlation between mass and period in exoplanet orbits, the different selection biases between radial velocity and transit surveys, and that the multiplanet systems show a distinct semimajor-axis distribution from apparently singleton systems.

  9. High Performance Buildings Database

    DOE Data Explorer

    The High Performance Buildings Database is a shared resource for the building industry, a unique central repository of in-depth information and data on high-performance, green building projects across the United States and abroad. The database includes information on the energy use, environmental performance, design process, finances, and other aspects of each project. Members of the design and construction teams are listed, as are sources for additional information. In total, up to twelve screens of detailed information are provided for each project profile. Projects range in size from small single-family homes or tenant fit-outs within buildings to large commercial and institutional buildings and even entire campuses. The database is a data repository as well. A series of Web-based data-entry templates allows anyone to enter information about a building project into the database. Once a project has been submitted, each of the partner organizations can review the entry and choose whether or not to publish that particular project on its own Web site.

  10. NATIONAL CONTAMINANT OCCURRENCE DATABASE

    EPA Science Inventory

    Resource Purpose:Under the 1996 Safe Drinking Water Act Amendments, EPA is to assemble a National Drinking Water Occurrence Database (NCOD) by August 1999. The NCOD is a collection of data of documented quality on unregulated and regulated chemical, radiological, microbia...

  11. Database Technologies for RDF

    NASA Astrophysics Data System (ADS)

    Das, Souripriya; Srinivasan, Jagannathan

    Efficient and scalable support for RDF/OWL data storage, loading, inferencing and querying, in conjunction with already available support for enterprise level data and operations reliability requirements, can make databases suitable to act as enterprise-level RDF/OWL repository and hence become a viable platform for building semantic applications for the enterprise environments.

  12. Biomedical term mapping databases.

    PubMed

    Wren, Jonathan D; Chang, Jeffrey T; Pustejovsky, James; Adar, Eytan; Garner, Harold R; Altman, Russ B

    2005-01-01

    Longer words and phrases are frequently mapped onto a shorter form such as abbreviations or acronyms for efficiency of communication. These abbreviations are pervasive in all aspects of biology and medicine and as the amount of biomedical literature grows, so does the number of abbreviations and the average number of definitions per abbreviation. Even more confusing, different authors will often abbreviate the same word/phrase differently. This ambiguity impedes our ability to retrieve information, integrate databases and mine textual databases for content. Efforts to standardize nomenclature, especially those doing so retrospectively, need to be aware of different abbreviatory mappings and spelling variations. To address this problem, there have been several efforts to develop computer algorithms to identify the mapping of terms between short and long form within a large body of literature. To date, four such algorithms have been applied to create online databases that comprehensively map biomedical terms and abbreviations within MEDLINE: ARGH (http://lethargy.swmed.edu/ARGH/argh.asp), the Stanford Biomedical Abbreviation Server (http://bionlp.stanford.edu/abbreviation/), AcroMed (http://medstract.med.tufts.edu/acro1.1/index.htm) and SaRAD (http://www.hpl.hp.com/research/idl/projects/abbrev.html). In addition to serving as useful computational tools, these databases serve as valuable references that help biologists keep up with an ever-expanding vocabulary of terms. PMID:15608198

  13. The Indra Simulation Database

    NASA Astrophysics Data System (ADS)

    Falck, Bridget; Budavari, T.; Cole, S.; Crankshaw, D.; Dobos, L.; Lemson, G.; Neyrinck, M.; Szalay, A.; Wang, J.

    2011-05-01

    We present the Indra suite of cosmological N-body simulations and the design of its companion database. Indra consists of 512 different instances of a 1 Gpc/h-sided box, each with 100 million dark matter particles and the same input cosmology, enabling a characterization of very large-scale modes of the matter power spectrum with galaxy-scale mass resolution and an excellent handle on cosmic variance. Each simulation outputs 64 snapshots, giving over 100 TB of data for the full set of simulations, all of which will be loaded into a SQL database. We discuss the database design for the particle data, consisting of the positions and velocities of each particle; the FOF halos, with links to the particle data so that halo properties can be calculated within the database; and the density field on a power-of-two grid, which can be easily linked to each particle's Peano-Hilbert index. Initial performance tests and example queries will be given. The authors are grateful for support from the Gordon and Betty Moore and the W.M. Keck Foundations.

  14. Diatomic Spectral Database

    National Institute of Standards and Technology Data Gateway

    SRD 114 Diatomic Spectral Database (Web, free access)   All of the rotational spectral lines observed and reported in the open literature for 121 diatomic molecules have been tabulated. The isotopic molecular species, assigned quantum numbers, observed frequency, estimated measurement uncertainty, and reference are given for each transition reported.

  15. Databases and data mining

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Over the course of the past decade, the breadth of information that is made available through online resources for plant biology has increased astronomically, as have the interconnectedness among databases, online tools, and methods of data acquisition and analysis. For maize researchers, the numbe...

  16. Weathering Database Technology

    ERIC Educational Resources Information Center

    Snyder, Robert

    2005-01-01

    Collecting weather data is a traditional part of a meteorology unit at the middle level. However, making connections between the data and weather conditions can be a challenge. One way to make these connections clearer is to enter the data into a database. This allows students to quickly compare different fields of data and recognize which…

  17. The Ribosomal Database Project

    PubMed Central

    Olsen, Gary J.; Overbeek, Ross; Larsen, Niels; Marsh, Terry L.; McCaughey, Michael J.; Maciukenas, Michael A.; Kuan, Wen-Min; Macke, Thomas J.; Xing, Yuqing; Woese, Carl R.

    1992-01-01

    The Ribosomal Database Project (RDP) compiles ribosomal sequences and related data, and redistributes them in aligned and phylogenetically ordered form to its user community. It also offers various software packages for handling, analyzing and displaying sequences. In addition, the RDP offers (or will offer) certain analytic services. At present the project is in an intermediate stage of development. PMID:1598241

  18. LHCb distributed conditions database

    NASA Astrophysics Data System (ADS)

    Clemencic, M.

    2008-07-01

    The LHCb Conditions Database project provides the necessary tools to handle non-event time-varying data. The main users of conditions are reconstruction and analysis processes, which are running on the Grid. To allow efficient access to the data, we need to use a synchronized replica of the content of the database located at the same site as the event data file, i.e. the LHCb Tier1. The replica to be accessed is selected from information stored on LFC (LCG File Catalog) and managed with the interface provided by the LCG developed library CORAL. The plan to limit the submission of jobs to those sites where the required conditions are available will also be presented. LHCb applications are using the Conditions Database framework on a production basis since March 2007. We have been able to collect statistics on the performance and effectiveness of both the LCG library COOL (the library providing conditions handling functionalities) and the distribution framework itself. Stress tests on the CNAF hosted replica of the Conditions Database have been performed and the results will be summarized here.

  19. WETLANDS TREATMENT DATABASE

    EPA Science Inventory

    The U.S. EPA sponsored a project to collect and catalog information from wastewater treatment wetlands into a computer database. PA has also written a user friendly, stand-alone, menu-driven computer program to allow anyone with DOS 3.3 or higher to access the information in the ...

  20. ECOREGION SPATIAL DATABASE

    EPA Science Inventory

    This spatial database contains boundaries and attributes describing Level III ecoregions in EPA Region 8. The ecoregions shown here have been derived from Omernik (1987) and from refinements of Omernik's framework that have been made for other projects. These ongoing or re...

  1. Geologic structure mapping database Spent Fuel Test - Climax, Nevada Test Site

    SciTech Connect

    Yow, J.L. Jr.

    1984-12-04

    Information on over 2500 discontinuities mapped at the SFT-C is contained in the geologic structure mapping database. Over 1800 of these features include complete descriptions of their orientations. This database is now available for use by other researchers. 6 references, 3 figures, 2 tables.

  2. MaizeGDB update: New tools, data, and interface for the maize model organism database

    Technology Transfer Automated Retrieval System (TEKTRAN)

    MaizeGDB is a highly curated, community-oriented database and informatics service to researchers focused on the crop plant and model organism Zea mays ssp. mays. Although some form of the maize community database has existed over the last 25 years, there have only been two major releases. In 1991, ...

  3. The Beltsville Human Nutrition Research Center's Porcine Immunology and Nutrition Resource Database

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Several diverse genomics-based databases have developed to facilitate research with human and rodent models. Current porcine gene databases, however, lack the nutritional and immunological orientation and robust annotation to design effective molecular tools to study relevant pig models. To address ...

  4. The AMMA database

    NASA Astrophysics Data System (ADS)

    Boichard, Jean-Luc; Brissebrat, Guillaume; Cloche, Sophie; Eymard, Laurence; Fleury, Laurence; Mastrorillo, Laurence; Moulaye, Oumarou; Ramage, Karim

    2010-05-01

    The AMMA project includes aircraft, ground-based and ocean measurements, an intensive use of satellite data and diverse modelling studies. Therefore, the AMMA database aims at storing a great amount and a large variety of data, and at providing the data as rapidly and safely as possible to the AMMA research community. In order to stimulate the exchange of information and collaboration between researchers from different disciplines or using different tools, the database provides a detailed description of the products and uses standardized formats. The AMMA database contains: - AMMA field campaigns datasets; - historical data in West Africa from 1850 (operational networks and previous scientific programs); - satellite products from past and future satellites, (re-)mapped on a regular latitude/longitude grid and stored in NetCDF format (CF Convention); - model outputs from atmosphere or ocean operational (re-)analysis and forecasts, and from research simulations. The outputs are processed as the satellite products are. Before accessing the data, any user has to sign the AMMA data and publication policy. This chart only covers the use of data in the framework of scientific objectives and categorically excludes the redistribution of data to third parties and the usage for commercial applications. Some collaboration between data producers and users, and the mention of the AMMA project in any publication is also required. The AMMA database and the associated on-line tools have been fully developed and are managed by two teams in France (IPSL Database Centre, Paris and OMP, Toulouse). Users can access data of both data centres using an unique web portal. This website is composed of different modules : - Registration: forms to register, read and sign the data use chart when an user visits for the first time - Data access interface: friendly tool allowing to build a data extraction request by selecting various criteria like location, time, parameters... The request can

  5. JDD, Inc. Database

    NASA Technical Reports Server (NTRS)

    Miller, David A., Jr.

    2004-01-01

    JDD Inc, is a maintenance and custodial contracting company whose mission is to provide their clients in the private and government sectors "quality construction, construction management and cleaning services in the most efficient and cost effective manners, (JDD, Inc. Mission Statement)." This company provides facilities support for Fort Riley in Fo,rt Riley, Kansas and the NASA John H. Glenn Research Center at Lewis Field here in Cleveland, Ohio. JDD, Inc. is owned and operated by James Vaughn, who started as painter at NASA Glenn and has been working here for the past seventeen years. This summer I worked under Devan Anderson, who is the safety manager for JDD Inc. in the Logistics and Technical Information Division at Glenn Research Center The LTID provides all transportation, secretarial, security needs and contract management of these various services for the center. As a safety manager, my mentor provides Occupational Health and Safety Occupation (OSHA) compliance to all JDD, Inc. employees and handles all other issues (Environmental Protection Agency issues, workers compensation, safety and health training) involving to job safety. My summer assignment was not as considered "groundbreaking research" like many other summer interns have done in the past, but it is just as important and beneficial to JDD, Inc. I initially created a database using a Microsoft Excel program to classify and categorize data pertaining to numerous safety training certification courses instructed by our safety manager during the course of the fiscal year. This early portion of the database consisted of only data (training field index, employees who were present at these training courses and who was absent) from the training certification courses. Once I completed this phase of the database, I decided to expand the database and add as many dimensions to it as possible. Throughout the last seven weeks, I have been compiling more data from day to day operations and been adding the

  6. Diversity of breakpoints of variant Philadelphia chromosomes in chronic myeloid leukemia in Brazilian patients

    PubMed Central

    Chauffaille, Maria de Lourdes Lopes Ferrari; Bandeira, Ana Carolina de Almeida; da Silva, Aline Schiavoni Guarnieri

    2014-01-01

    Background Chronic myeloid leukemia is a myeloproliferative disorder characterized by the Philadelphia chromosome or t(9;22)(q34.1;q11.2), resulting in the break-point cluster region-Abelson tyrosine kinase fusion gene, which encodes a constitutively active tyrosine kinase protein. The Philadelphia chromosome is detected by karyotyping in around 90% of chronic myeloid leukemia patients, but 5–10% may have variant types. Variant Philadelphia chromosomes are characterized by the involvement of another chromosome in addition to chromosome 9 or 22. It can be a simple type of variant when one other chromosome is involved, or complex, in which two or more chromosomes take part in the translocation. Few studies have reported the incidence of variant Philadelphia chromosomes or the breakpoints involved among Brazilian chronic myeloid leukemia patients. Objective The aim of this report is to describe the diversity of the variant Philadelphia chromosomes found and highlight some interesting breakpoint candidates for further studies. Methods the Cytogenetics Section Database was searched for all cases with diagnoses of chronic myeloid leukemia during a 12-year period and all the variant Philadelphia chromosomes were listed. Results Fifty (5.17%) cases out of 1071 Philadelphia-positive chronic myeloid leukemia were variants. The most frequently involved chromosome was 17, followed by chromosomes: 1, 20, 6, 11, 2, 10, 12 and 15. Conclusion Among all the breakpoints seen in this survey, six had previously been described: 11p15, 14q32, 15q11.2, 16p13.1, 17p13 and 17q21. The fact that some regions get more frequently involved in such rare rearrangements calls attention to possible predisposition that should be further studied. Nevertheless, the pathological implication of these variants remains unclear. PMID:25638762

  7. Syndromic X-linked intellectual disability segregating with a missense variant in RLIM.

    PubMed

    Tønne, Elin; Holdhus, Rita; Stansberg, Christine; Stray-Pedersen, Asbjørg; Petersen, Kjell; Brunner, Han G; Gilissen, Christian; Hoischen, Alexander; Prescott, Trine; Steen, Vidar M; Fiskerstrand, Torunn

    2015-12-01

    We describe a three-generation Norwegian family with a novel X-linked intellectual disability (XLID) syndrome characterized by subtle facial dysmorphism, autism and severe feeding problems. By exome sequencing we detected a rare missense variant (c.1067A>G, p.(Tyr356Cys)) in the RLIM gene, in two affected male second cousins. Sanger sequencing confirmed the presence of the variant in the four affected males (none of whom were siblings) and in three mothers available for testing. The variant was not present in 100 normal Norwegian controls, has not been reported in variant databases and is deleterious according to in silico prediction tools. The clinical phenotype and the variant co-segregate, yielding a LOD score of 3.0 for linkage to the shared region (36.09 Mb), which contains 242 genes. No other shared rare variants on the X chromosome were detected in the two affected exome-sequenced individuals, and all female carriers had an extremely skewed X-chromosome inactivation pattern. RLIM encodes RING zinc finger protein 12 (RNF12), an ubiquitin ligase that is essential for X inactivation in mice and that acts as a co-regulator of a range of transcription factors, particularly those containing a LIM homeodomain. Tyrosine in position 356 in RNF12 is located within a highly conserved domain essential for binding such transcription factors. Expression of RNF12 is widespread during embryogenesis, and is particularly high in the outer layers of the cerebral cortex. Functional studies are needed to prove a definite causal relationship between the variant and the phenotype. Subsequent reports may confirm a role for RLIM variants in patients with XLID. PMID:25735484

  8. Curricular Orientations. Chapter Two.

    ERIC Educational Resources Information Center

    Patton, James R.; Polloway, Edward A.

    The purpose of this chapter is to review the major curricular orientations which can be found in special education settings for students with mental disabilities. Program orientations differ along two primary dimensions: the amount of time students spend in special settings or with special education personnel, and the extent to which the…

  9. Orientation in operator algebras

    PubMed Central

    Alfsen, Erik M.; Shultz, Frederic W.

    1998-01-01

    A concept of orientation is relevant for the passage from Jordan structure to associative structure in operator algebras. The research reported in this paper bridges the approach of Connes for von Neumann algebras and ourselves for C*-algebras in a general theory of orientation that is of geometric nature and is related to dynamics. PMID:9618457

  10. Teaching Orienteering. Second Edition.

    ERIC Educational Resources Information Center

    McNeill, Carol; Cory-Wright, Jean; Renfrew, Tom

    The educational value provided by orienteering's blend of navigational and physical skills has given it a permanent place in the primary and secondary school curriculum in the United Kingdom. This book is a reference to orienteering for teachers, leaders, and coaches. It provides a "how to" approach to introducing and developing the skills and…

  11. @neuLink: a service-oriented application for biomedical knowledge discovery.

    PubMed

    Friedrich, Christoph M; Dach, Holger; Gattermayer, Tobias; Engelbrecht, Gerhard; Benkner, Siegfried; Hofmann-Apitius, Martin

    2008-01-01

    We introduce the architecture of @neuLink, a service-oriented environment for biomedical knowledge discovery which has been developed in the course of EU Integrated Project @neurIST. The application integrates data from databases with information extracted from unstructured text sources. Moreover, @neuLink supports the analysis of primary biomolecular data associated with individual patients and thus enables the interpretation of molecular data inside a clinical research environment. Based on an assembly of data services, @neuLink interacts with the complex @neurIST grid infrastructure through a dedicated data access and data mediation service. Data types integrated by @neuLink are covering the entire span of biomolecular entities: from gene names in text to entries in EntrezGene; from mentions of drugs to Drugbank, from information on allelic variants in scientific literature to entries in dbSNP. The architecture of @neuLink allows easy integration of other webservice-based applications and thus the spectrum of analysis capabilities of @neuLink can be extended following the requirements of the users of the @neurIST system. PMID:18560118

  12. PMAG: Relational Database Definition

    NASA Astrophysics Data System (ADS)

    Keizer, P.; Koppers, A.; Tauxe, L.; Constable, C.; Genevey, A.; Staudigel, H.; Helly, J.

    2002-12-01

    The Scripps center for Physical and Chemical Earth References (PACER) was established to help create databases for reference data and make them available to the Earth science community. As part of these efforts PACER supports GERM, REM and PMAG and maintains multiple online databases under the http://earthref.org umbrella website. This website has been built on top of a relational database that allows for the archiving and electronic access to a great variety of data types and formats, permitting data queries using a wide range of metadata. These online databases are designed in Oracle 8.1.5 and they are maintained at the San Diego Supercomputer Center. They are directly available via http://earthref.org/databases/. A prototype of the PMAG relational database is now operational within the existing EarthRef.org framework under http://earthref.org/databases/PMAG/. As will be shown in our presentation, the PMAG design focuses around the general workflow that results in the determination of typical paleo-magnetic analyses. This ensures that individual data points can be traced between the actual analysis and the specimen, sample, site, locality and expedition it belongs to. These relations guarantee traceability of the data by distinguishing between original and derived data, where the actual (raw) measurements are performed on the specimen level, and data on the sample level and higher are then derived products in the database. These relations may also serve to recalculate site means when new data becomes available for that locality. The PMAG data records are extensively described in terms of metadata. These metadata are used when scientists search through this online database in order to view and download their needed data. They minimally include method descriptions for field sampling, laboratory techniques and statistical analyses. They also include selection criteria used during the interpretation of the data and, most importantly, critical information about the

  13. Tautomerism in large databases

    PubMed Central

    Sitzmann, Markus; Ihlenfeldt, Wolf-Dietrich

    2010-01-01

    We have used the Chemical Structure DataBase (CSDB) of the NCI CADD Group, an aggregated collection of over 150 small-molecule databases totaling 103.5 million structure records, to conduct tautomerism analyses on one of the largest currently existing sets of real (i.e. not computer-generated) compounds. This analysis was carried out using calculable chemical structure identifiers developed by the NCI CADD Group, based on hash codes available in the chemoinformatics toolkit CACTVS and a newly developed scoring scheme to define a canonical tautomer for any encountered structure. CACTVS’s tautomerism definition, a set of 21 transform rules expressed in SMIRKS line notation, was used, which takes a comprehensive stance as to the possible types of tautomeric interconversion included. Tautomerism was found to be possible for more than 2/3 of the unique structures in the CSDB. A total of 680 million tautomers were calculated from, and including, the original structure records. Tautomerism overlap within the same individual database (i.e. at least one other entry was present that was really only a different tautomeric representation of the same compound) was found at an average rate of 0.3% of the original structure records, with values as high as nearly 2% for some of the databases in CSDB. Projected onto the set of unique structures (by FICuS identifier), this still occurred in about 1.5% of the cases. Tautomeric overlap across all constituent databases in CSDB was found for nearly 10% of the records in the collection. PMID:20512400

  14. NoSQL technologies for the CMS Conditions Database

    NASA Astrophysics Data System (ADS)

    Sipos, Roland

    2015-12-01

    With the restart of the LHC in 2015, the growth of the CMS Conditions dataset will continue, therefore the need of consistent and highly available access to the Conditions makes a great cause to revisit different aspects of the current data storage solutions. We present a study of alternative data storage backends for the Conditions Databases, by evaluating some of the most popular NoSQL databases to support a key-value representation of the CMS Conditions. The definition of the database infrastructure is based on the need of storing the conditions as BLOBs. Because of this, each condition can reach the size that may require special treatment (splitting) in these NoSQL databases. As big binary objects may be problematic in several database systems, and also to give an accurate baseline, a testing framework extension was implemented to measure the characteristics of the handling of arbitrary binary data in these databases. Based on the evaluation, prototypes of a document store, using a column-oriented and plain key-value store, are deployed. An adaption layer to access the backends in the CMS Offline software was developed to provide transparent support for these NoSQL databases in the CMS context. Additional data modelling approaches and considerations in the software layer, deployment and automatization of the databases are also covered in the research. In this paper we present the results of the evaluation as well as a performance comparison of the prototypes studied.

  15. Mitochondrial DNA variants in obesity.

    PubMed

    Knoll, Nadja; Jarick, Ivonne; Volckmar, Anna-Lena; Klingenspor, Martin; Illig, Thomas; Grallert, Harald; Gieger, Christian; Wichmann, Heinz-Erich; Peters, Annette; Wiegand, Susanna; Biebermann, Heike; Fischer-Posovszky, Pamela; Wabitsch, Martin; Völzke, Henry; Nauck, Matthias; Teumer, Alexander; Rosskopf, Dieter; Rimmbach, Christian; Schreiber, Stefan; Jacobs, Gunnar; Lieb, Wolfgang; Franke, Andre; Hebebrand, Johannes; Hinney, Anke

    2014-01-01

    Heritability estimates for body mass index (BMI) variation are high. For mothers and their offspring higher BMI correlations have been described than for fathers. Variation(s) in the exclusively maternally inherited mitochondrial DNA (mtDNA) might contribute to this parental effect. Thirty-two to 40 mtDNA single nucleotide polymorphisms (SNPs) were available from genome-wide association study SNP arrays (Affymetrix 6.0). For discovery, we analyzed association in a case-control (CC) sample of 1,158 extremely obese children and adolescents and 435 lean adult controls. For independent confirmation, 7,014 population-based adults were analyzed as CC sample of n = 1,697 obese cases (BMI ≥ 30 kg/m2) and n = 2,373 normal weight and lean controls (BMI<25 kg/m2). SNPs were analyzed as single SNPs and haplogroups determined by HaploGrep. Fisher's two-sided exact test was used for association testing. Moreover, the D-loop was re-sequenced (Sanger) in 192 extremely obese children and adolescents and 192 lean adult controls. Association testing of detected variants was performed using Fisher's two-sided exact test. For discovery, nominal association with obesity was found for the frequent allele G of m.8994G/A (rs28358887, p = 0.002) located in ATP6. Haplogroup W was nominally overrepresented in the controls (p = 0.039). These findings could not be confirmed independently. For two of the 252 identified D-loop variants nominal association was detected (m.16292C/T, p = 0.007, m.16189T/C, p = 0.048). Only eight controls carried the m.16292T allele, five of whom belonged to haplogroup W that was initially enriched among these controls. m.16189T/C might create an uninterrupted poly-C tract located near a regulatory element involved in replication of mtDNA. Though follow-up of some D-loop variants still is conceivable, our hypothesis of a contribution of variation in the exclusively maternally inherited mtDNA to the observed larger correlations for BMI between mothers and their

  16. Mitochondrial DNA Variants in Obesity

    PubMed Central

    Knoll, Nadja; Jarick, Ivonne; Volckmar, Anna-Lena; Klingenspor, Martin; Illig, Thomas; Grallert, Harald; Gieger, Christian; Wichmann, Heinz-Erich; Peters, Annette; Wiegand, Susanna; Biebermann, Heike; Fischer-Posovszky, Pamela; Wabitsch, Martin; Völzke, Henry; Nauck, Matthias; Teumer, Alexander; Rosskopf, Dieter; Rimmbach, Christian; Schreiber, Stefan; Jacobs, Gunnar; Lieb, Wolfgang; Franke, Andre; Hebebrand, Johannes; Hinney, Anke

    2014-01-01

    Heritability estimates for body mass index (BMI) variation are high. For mothers and their offspring higher BMI correlations have been described than for fathers. Variation(s) in the exclusively maternally inherited mitochondrial DNA (mtDNA) might contribute to this parental effect. Thirty-two to 40 mtDNA single nucleotide polymorphisms (SNPs) were available from genome-wide association study SNP arrays (Affymetrix 6.0). For discovery, we analyzed association in a case-control (CC) sample of 1,158 extremely obese children and adolescents and 435 lean adult controls. For independent confirmation, 7,014 population-based adults were analyzed as CC sample of n = 1,697 obese cases (BMI≥30 kg/m2) and n = 2,373 normal weight and lean controls (BMI<25 kg/m2). SNPs were analyzed as single SNPs and haplogroups determined by HaploGrep. Fisher's two-sided exact test was used for association testing. Moreover, the D-loop was re-sequenced (Sanger) in 192 extremely obese children and adolescents and 192 lean adult controls. Association testing of detected variants was performed using Fisher's two-sided exact test. For discovery, nominal association with obesity was found for the frequent allele G of m.8994G/A (rs28358887, p = 0.002) located in ATP6. Haplogroup W was nominally overrepresented in the controls (p = 0.039). These findings could not be confirmed independently. For two of the 252 identified D-loop variants nominal association was detected (m.16292C/T, p = 0.007, m.16189T/C, p = 0.048). Only eight controls carried the m.16292T allele, five of whom belonged to haplogroup W that was initially enriched among these controls. m.16189T/C might create an uninterrupted poly-C tract located near a regulatory element involved in replication of mtDNA. Though follow-up of some D-loop variants still is conceivable, our hypothesis of a contribution of variation in the exclusively maternally inherited mtDNA to the observed larger correlations for BMI between

  17. Improved detection of artifactual viral minority variants in high-throughput sequencing data.

    PubMed

    Welkers, Matthijs R A; Jonges, Marcel; Jeeninga, Rienk E; Koopmans, Marion P G; de Jong, Menno D

    2014-01-01

    High-throughput sequencing (HTS) of viral samples provides important information on the presence of viral minority variants. However, detection and accurate quantification is limited by the capacity to distinguish biological from artificial variation. In this study, errors related to the Illumina HiSeq2000 library generation and HTS process were investigated by determining minority variant frequencies in an influenza A/WSN/1933(H1N1) virus reverse-genetics plasmid pool. Errors related to amplification and sequencing were determined using the same plasmid pool, by generation of infectious virus using reverse genetics followed by in duplo reverse-transcriptase PCR (RT-PCR) amplification and HTS in the same sequence run. Results showed that after "best practice" quality control (QC), within the plasmid pool, one minority variant with a frequency >0.5% was identified, while 84 and 139 were identified in the RT-PCR amplified samples, indicating RT-PCR amplification artificially increased variation. Detailed analysis showed that artifactual minority variants could be identified by two major technical characteristics: their predominant presence in a single read orientation and uneven distribution of mismatches over the length of the reads. We demonstrate that by addition of two QC steps 95% of the artifactual minority variants could be identified. When our analysis approach was applied to three clinical samples 68% of the initially identified minority variants were identified as artifacts. Our study clearly demonstrated that, without additional QC steps, overestimation of viral minority variants is very likely to occur, mainly as a consequence of the required RT-PCR amplification step. The improved ability to detect and correct for artifactual minority variants, increases data resolution and could aid both past and future studies incorporating HTS. The source code has been made available through Sourceforge (https://sourceforge.net/projects/mva-ngs). PMID:25657642

  18. IMGT/HLA and the Immuno Polymorphism Database.

    PubMed

    Robinson, James; Halliwell, Jason A; Marsh, Steven G E

    2014-01-01

    The IMGT/HLA Database (http://www.ebi.ac.uk/ipd/imgt/hla/) was first released over 15 years ago, providing the HLA community with a searchable repository of highly curated HLA sequences. The HLA complex is located within the 6p21.3 region of human chromosome 6 and contains more than 220 genes of diverse function. Many of the genes encode proteins of the immune system and are highly polymorphic, with some genes currently having over 3,000 known allelic variants. The Immuno Polymorphism Database (IPD) (http://www.ebi.ac.uk/ipd/) expands on this model, with a further set of specialist databases related to the study of polymorphic genes in the immune system. The IPD project works with specialist groups or nomenclature committees who provide and curate individual sections before they are submitted to IPD for online publication. IPD currently consists of four databases: IPD-KIR contains the allelic sequences of killer-cell immunoglobulin-like receptors; IPD-MHC is a database of sequences of the major histocompatibility complex of different species; IPD-HPA, alloantigens expressed only on platelets; and IPD-ESTDAB, which provides access to the European Searchable Tumour Cell-Line Database, a cell bank of immunologically characterized melanoma cell lines. Through the work of the HLA Informatics Group and in collaboration with the European Bioinformatics Institute we are able to provide public access to this data through the website http://www.ebi.ac.uk/ipd/. PMID:25048120

  19. Genetic variants in microRNA genes: impact on microRNA expression, function, and disease

    PubMed Central

    Cammaerts, Sophia; Strazisar, Mojca; De Rijk, Peter; Del Favero, Jurgen

    2015-01-01

    MicroRNAs (miRNAs) are important regulators of gene expression and like any other gene, their coding sequences are subject to genetic variation. Variants in miRNA genes can have profound effects on miRNA functionality at all levels, including miRNA transcription, maturation, and target specificity, and as such they can also contribute to disease. The impact of variants in miRNA genes is the focus of the present review. To put these effects into context, we first discuss the requirements of miRNA transcripts for maturation. In the last part an overview of available databases and tools and experimental approaches to investigate miRNA variants related to human disease is presented. PMID:26052338

  20. Genetic polymorphisms of pharmacogenomic VIP variants in the lhoba population of southwest China

    PubMed Central

    He, Yongjun; Yang, Hua; Geng, Tingting; Feng, Tian; Yuan, Dongya; Kang, Longli; Luo, Manling; Jin, Tianbo

    2015-01-01

    Background: It is well-established that differences among ethnic groups in drug responses are primarily due to the genetic diversity of pharmacogenes. A number of genes or variants that play a crucial role in drug responses have been designated Very Important Pharmacogenes (VIP) by the PharmGKB database. Clarifying the polymorphic distribution of VIPs in different ethnic groups will aid in personalized medicine for specific populations. Methods: We sequenced 85 VIP variants in the Lhoba population based on the PharmGKB database. The polymorphic distribution of the 85 VIP variants in 100 Lhoba subjects was determined and compared with that of 11 major HapMap populations, including ASW, CEU, CHB, CHD, GIH, JPT, LWK, MEX, MKK, TSI, and YRI. We used χ2 tests to identify significantly different loci between these populations. We downloaded SNP allele frequencies from the ALlele FREquency Database to observe the global genetic variation distribution for these specific loci. And then we used Structure software to perform the genetic structure analysis of 12 populations. Results: Based on comparisons of selected available loci, we found that 23, 28, 16, 10, 20, 16, 24, 19, 22, 21 and 36 of the selected VIP variant genotype frequencies in the Lhoba population differed from those of the ASW, CEU, CHB, CHD, GIH, JPT, LWK, MEX, MKK, TSI, and YRI populations, respectively. In addition, Pairwise FST values and clustering analyses also showed the VIP variants in Lhoba exhibited a close genetic affinity with CHD, CHB and JPT populations. Conclusion: Our results complement pharmacogenomic data on the Lhoba ethnic group and may be helpful in the diagnosis of certain diseases in minorities. PMID:26722533

  1. MitBASE : a comprehensive and integrated mitochondrial DNA database. The present status

    PubMed Central

    Attimonelli, M.; Altamura, N.; Benne, R.; Brennicke, A.; Cooper, J. M.; D’Elia, D.; Montalvo, A. de; Pinto, B. de; De Robertis, M.; Golik, P.; Knoop, V.; Lanave, C.; Lazowska, J.; Licciulli, F.; Malladi, B. S.; Memeo, F.; Monnerot, M.; Pasimeni, R.; Pilbout, S.; Schapira, A. H. V.; Sloof, P.; Saccone, C.

    2000-01-01

    MitBASE is an integrated and comprehensive database of mitochondrial DNA data which collects, under a single interface, databases for Plant, Vertebrate, Invertebrate, Human, Protist and Fungal mtDNA and a Pilot database on nuclear genes involved in mitochondrial biogenesis in Saccharomyces cerevisiae. MitBASE reports all available information from different organisms and from intraspecies variants and mutants. Data have been drawn from the primary databases and from the literature; value adding information has been structured, e.g., editing information on protist mtDNA genomes, pathological information for human mtDNA variants, etc. The different databases, some of which are structured using commercial packages (Microsoft Access, File Maker Pro) while others use a flat-file format, have been integrated under ORACLE. Ad hoc retrieval systems have been devised for some of the above listed databases keeping into account their peculiarities. The database is resident at the EBI and is available at the following site: http://www3.ebi.ac.uk/Research/Mitbase/mitbase.pl . The impact of this project is intended for both basic and applied research. The study of mitochondrial genetic diseases and mitochondrial DNA intraspecies diversity are key topics in several biotechnological fields. The database has been funded within the EU Biotechnology programme. PMID:10592207

  2. On Simplifying Features in OpenStreetMap database

    NASA Astrophysics Data System (ADS)

    Qian, Xinlin; Tao, Kunwang; Wang, Liang

    2015-04-01

    Currently the visualization of OpenStreetMap data is using a tile server which stores map tiles that have been rendered from vector data in advance. However, tiled map are short of functionalities such as data editing and customized styling. To enable these advanced functionality, Client-side processing and rendering of geospatial data is needed. Considering the voluminous size of the OpenStreetMap data, simply sending region queries results of OSM database to client is prohibitive. To make the OSM data retrieved from database adapted for client receiving and rendering, It must be filtered and simplified at server-side to limit its volume. We propose a database extension for OSM database to make it possible to simplifying geospatial objects such as ways and relations during data queries. Several auxiliary tables and PL/pgSQL functions are presented to make the geospatial features can be simplified by omitting unimportant vertices. There are five components in the database extension: Vertices weight computation by polyline and polygon simplification algorithm, Vertices weight storage in auxiliary tables. filtering and selecting of vertices using specific threshold value during spatial queries, assembling of simplified geospatial objects using filtered vertices, vertices weight updating after geospatial objects editing. The database extension is implemented on an OSM APIDB using PL/pgSQL. The database contains a subset of OSM database. The experimental database contains geographic data of United Kingdom which is about 100 million vertices and roughly occupy 100GB disk. JOSM are used to retrieve the data from the database using a revised data accessing API and render the geospatial objects in real-time. When serving simplified data to client, The database allows user to set the bound of the error of simplification or the bound of responding time in each data query. Experimental results show the effectiveness and efficiency of the proposed methods in building a

  3. A global database of soil respiration data

    SciTech Connect

    Bond-Lamberty, Benjamin; Thomson, Allison M.

    2010-06-16

    Soil respiration—RS, the flux of autotropically- and heterotrophically-generated CO2 from the soil to the atmosphere—remains the least well-constrained component of the terrestrial C cycle. Here we introduce the SRDB database, a near-universal compendium of published RS data, and make it available to the scientific community both as a traditional static archive and as a dynamic community database that will be updated over time by interested users. The database encompasses all published studies that report one of the following data measured in the field (not laboratory): annual RS, mean seasonal RS, a seasonal or annual partitioning of RS into its sources fluxes, RS temperature response (Q10), or RS at 10 °C. Its orientation is thus to seasonal and annual fluxes, not shorter-term or chamber-specific measurements. To date, data from 818 studies have been entered into the database, constituting 3379 records. The data span the measurement years 1961-2007 and are dominated by temperate, well-drained forests. We briefly examine some aspects of the SRDB data—mean annual RS fluxes and their correlation with other carbon fluxes, RS variability, temperature sensitivities, and the partitioning of RS source flux–and suggest some potential lines of research that could be explored using these data. The SRDB database described here is available online in a permanent archive as well as via a project-hosting repository; the latter source leverages open-source software technologies to encourage wider participation in the database’s future development. Ultimately, we hope that the updating of, and corrections to, the SRDB will become a shared project, managed by the users of these data in the scientific community.

  4. Scaffold topologies. 2. Analysis of chemical databases.

    PubMed

    Wester, Michael J; Pollock, Sara N; Coutsias, Evangelos A; Allu, Tharun Kumar; Muresan, Sorel; Oprea, Tudor I

    2008-07-01

    We have systematically enumerated graph representations of scaffold topologies for up to eight-ring molecules and four-valence atoms, thus providing coverage of the lower portion of the chemical space of small molecules (Pollock et al. J. Chem. Inf. Model., this issue). Here, we examine scaffold topology distributions for several databases: ChemNavigator and PubChem for commercially available chemicals, the Dictionary of Natural Products, a set of 2742 launched drugs, WOMBAT, a database of medicinal chemistry compounds, and two subsets of PubChem, "actives" and DSSTox comprising toxic substances. We also examined a virtual database of exhaustively enumerated small organic molecules, GDB (Fink et al. Angew. Chem., Int. Ed. 2005, 44, 1504-1508), and we contrast the scaffold topology distribution from these collections to the complete coverage of up to eight-ring molecules. For reasons related, perhaps, to synthetic accessibility and complexity, scaffolds exhibiting six rings or more are poorly represented. Among all collections examined, PubChem has the greatest scaffold topological diversity, whereas GDB is the most limited. More than 50% of all entries (13 000 000+ actual and 13 000 000+ virtual compounds) exhibit only eight distinct topologies, one of which is the nonscaffold topology that represents all treelike structures. However, most of the topologies are represented by a single or very small number of examples. Within topologies, we found that three-way scaffold connections (3-nodes) are much more frequent compared to four-way (4-node) connections. Fused rings have a slightly higher frequency in biologically oriented databases. Scaffold topologies can be the first step toward an efficient coarse-grained classification scheme of the molecules found in chemical databases. PMID:18605681

  5. The GLIMS Glacier Database

    NASA Astrophysics Data System (ADS)

    Raup, B. H.; Khalsa, S. S.; Armstrong, R.

    2007-12-01

    The Global Land Ice Measurements from Space (GLIMS) project has built a geospatial and temporal database of glacier data, composed of glacier outlines and various scalar attributes. These data are being derived primarily from satellite imagery, such as from ASTER and Landsat. Each "snapshot" of a glacier is from a specific time, and the database is designed to store multiple snapshots representative of different times. We have implemented two web-based interfaces to the database; one enables exploration of the data via interactive maps (web map server), while the other allows searches based on text-field constraints. The web map server is an Open Geospatial Consortium (OGC) compliant Web Map Server (WMS) and Web Feature Server (WFS). This means that other web sites can display glacier layers from our site over the Internet, or retrieve glacier features in vector format. All components of the system are implemented using Open Source software: Linux, PostgreSQL, PostGIS (geospatial extensions to the database), MapServer (WMS and WFS), and several supporting components such as Proj.4 (a geographic projection library) and PHP. These tools are robust and provide a flexible and powerful framework for web mapping applications. As a service to the GLIMS community, the database contains metadata on all ASTER imagery acquired over glacierized terrain. Reduced-resolution of the images (browse imagery) can be viewed either as a layer in the MapServer application, or overlaid on the virtual globe within Google Earth. The interactive map application allows the user to constrain by time what data appear on the map. For example, ASTER or glacier outlines from 2002 only, or from Autumn in any year, can be displayed. The system allows users to download their selected glacier data in a choice of formats. The results of a query based on spatial selection (using a mouse) or text-field constraints can be downloaded in any of these formats: ESRI shapefiles, KML (Google Earth), Map

  6. Diversity and impact of rare variants in genes encoding the platelet G protein-coupled receptors.

    PubMed

    Jones, Matthew L; Norman, Jane E; Morgan, Neil V; Mundell, Stuart J; Lordkipanidzé, Marie; Lowe, Gillian C; Daly, Martina E; Simpson, Michael A; Drake, Sian; Watson, Steve P; Mumford, Andrew D

    2015-04-01

    Platelet responses to activating agonists are influenced by common population variants within or near G protein-coupled receptor (GPCR) genes that affect receptor activity. However, the impact of rare GPCR gene variants is unknown. We describe the rare single nucleotide variants (SNVs) in the coding and splice regions of 18 GPCR genes in 7,595 exomes from the 1,000-genomes and Exome Sequencing Project databases and in 31 cases with inherited platelet function disorders (IPFDs). In the population databases, the GPCR gene target regions contained 740 SNVs (318 synonymous, 410 missense, 7 stop gain and 6 splice region) of which 70 % had global minor allele frequency (MAF) < 0.05 %. Functional annotation using six computational algorithms, experimental evidence and structural data identified 156/740 (21 %) SNVs as potentially damaging to GPCR function, most commonly in regions encoding the transmembrane and C-terminal intracellular receptor domains. In 31 index cases with IPFDs (Gi-pathway defect n=15; secretion defect n=11; thromboxane pathway defect n=3 and complex defect n=2) there were 256 SNVs in the target regions of 15 stimulatory platelet GPCRs (34 unique; 12 with MAF< 1 % and 22 with MAF≥ 1 %). These included rare variants predicting R122H, P258T and V207A substitutions in the P2Y12 receptor that were annotated as potentially damaging, but only partially explained the platelet function defects in each case. Our data highlight that potentially damaging variants in platelet GPCR genes have low individual frequencies, but are collectively abundant in the population. Potentially damaging variants are also present in pedigrees with IPFDs and may contribute to complex laboratory phenotypes. PMID:25567036

  7. Diversity and impact of rare variants in genes encoding the platelet G protein-coupled receptors

    PubMed Central

    Jones, Matthew L.; Norman, Jane E.; Morgan, Neil V.; Mundell, Stuart J.; Lordkipanidzé, Marie; Lowe, Gillian C.; Daly, Martina E.; Simpson, Michael A.; Drake, Sian; Watson, Steve P.; Mumford, Andrew D.

    2015-01-01

    Summary Platelet responses to activating agonists are influenced by common population variants within or near G protein-coupled receptor (GPCR) genes that affect receptor activity. However, the impact of rare GPCR gene variants is unknown. We describe the rare single nucleotide variants (SNVs) in the coding and splice regions of 18 GPCR genes in 7,595 exomes from the 1,000-genomes and Exome Sequencing Project databases and in 31 cases with inherited platelet function disorders (IPFDs). In the population databases, the GPCR gene target regions contained 740 SNVs (318 synonymous, 410 missense, 7 stop gain and 6 splice region) of which 70% had global minor allele frequency (MAF) < 0.05%. Functional annotation using six computational algorithms, experimental evidence and structural data identified 156/740 (21%) SNVs as potentially damaging to GPCR function, most commonly in regions encoding the transmembrane and C-terminal intracellular receptor domains. In 31 index cases with IPFDs (Gi-pathway defect n=15; secretion defect n=11; thromboxane pathway defect n=3 and complex defect n=2) there were 256 SNVs in the target regions of 15 stimulatory platelet GPCRs (34 unique; 12 with MAF<1% and 22 with MAF≥1%). These included rare variants predicting R122H, P258T and V207A substitutions in the P2Y12 receptor that were annotated as potentially damaging, but only partially explained the platelet function defects in each case. Our data highlight that potentially damaging variants in platelet GPCR genes have low individual frequencies, but are collectively abundant in the population. Potentially damaging variants are also present in pedigrees with IPFDs and may contribute to complex laboratory phenotypes. PMID:25567036

  8. NATIVE HEALTH DATABASES: NATIVE HEALTH HISTORY DATABASE (NHHD)

    EPA Science Inventory

    The Native Health Databases contain bibliographic information and abstracts of health-related articles, reports, surveys, and other resource documents pertaining to the health and health care of American Indians, Alaska Natives, and Canadian First Nations. The databases provide i...

  9. NATIVE HEALTH DATABASES: NATIVE HEALTH RESEARCH DATABASE (NHRD)

    EPA Science Inventory

    The Native Health Databases contain bibliographic information and abstracts of health-related articles, reports, surveys, and other resource documents pertaining to the health and health care of American Indians, Alaska Natives, and Canadian First Nations. The databases provide i...

  10. MTDH genetic variants in colorectal cancer patients

    PubMed Central

    Gnosa, Sebastian; Ticha, Ivana; Haapaniemi, Staffan; Sun, Xiao-Feng

    2016-01-01

    The colorectal carcinogenesis is a complex process encompassing genetic alterations. The oncoprotein AEG-1, encoded by the MTDH gene, was shown previously to be involved in colorectal cancer (CRC). The aim of this study was to determine the frequency and the spectrum of MTDH variants in tumor tissue, and their relationship to clinicopathological variables in CRC patients. The study included tumors from 356 unselected CRC patients. Mutation analysis of the MTDH gene, including coding region and adjacent intronic sequences, was performed by direct DNA sequencing. The corresponding normal colorectal tissue was analyzed in the carriers of exonic variant to confirm germline or somatic origin. We detected 42 intronic variants, where 25 were novel. Furthermore, we found 8 exonic variants of which four, one missense (c.977C > G-germline) and three frameshift mutations (c.533delA-somatic, c.1340dupA-unknown origin, c.1731delA-unknown origin), were novel. In silico prediction analyses suggested four deleterious variants (c.232G > T, c.533delA, c.1340dupA, and c.1731delA). There were no correlations between the MTDH variants and tumor stage, differentiation or patient survival. We described several novel exonic and intronic variants of the MTDH gene. The detection of likely pathogenic truncating mutations and alterations in functional protein domains indicate their clinical significance, although none of the variants had prognostic potential. PMID:26983693

  11. ARTI Refrigerant Database

    SciTech Connect

    Calm, J.M.

    1992-11-09

    The database provides bibliographic citations and abstracts for publications that may be useful in research and design of air- conditioning and refrigeration equipment. The database identifies sources of specific information on R-32, R-123, R-124, R-125, R-134, R-134a, R-141b, R-142b, R-143a, R-152a, R-245ca, R-290 (propane), R- 717 (ammonia), ethers, and others as well as azeotropic and zeotropic and zeotropic blends of these fluids. It addresses lubricants including alkylbenzene, polyalkylene glycol, ester, and other synthetics as well as mineral oils. It also references documents on compatibility of refrigerants and lubricants with metals, plastics, elastomers, motor insulation, and other materials used in refrigerant circuits. A computerized version is available that includes retrieval software.

  12. Protein Model Database

    SciTech Connect

    Fidelis, K; Adzhubej, A; Kryshtafovych, A; Daniluk, P

    2005-02-23

    The phenomenal success of the genome sequencing projects reveals the power of completeness in revolutionizing biological science. Currently it is possible to sequence entire organisms at a time, allowing for a systemic rather than fractional view of their organization and the various genome-encoded functions. There is an international plan to move towards a similar goal in the area of protein structure. This will not be achieved by experiment alone, but rather by a combination of efforts in crystallography, NMR spectroscopy, and computational modeling. Only a small fraction of structures are expected to be identified experimentally, the remainder to be modeled. Presently there is no organized infrastructure to critically evaluate and present these data to the biological community. The goal of the Protein Model Database project is to create such infrastructure, including (1) public database of theoretically derived protein structures; (2) reliable annotation of protein model quality, (3) novel structure analysis tools, and (4) access to the highest quality modeling techniques available.

  13. OR orientation process.

    PubMed

    Weaver, C S

    1992-12-01

    1. Poor OR orientation may lead to a lack of staff retention. Providing an environment that ensures a successful orientation will result in a productive professional who will be an asset to the staff. 2. It has historically been difficult to identify educational time frames that accurately reflect the needs of perioperative nurses and provide the necessary training. Haphazard learning creates inconsistent results and undue stress for new employees. 3. Learners must be given every opportunity to be successful. An orientation program that uses adult learning principles, preceptors, and a competency skills checklist can help new employees become skillful team members. PMID:1481255

  14. Orientation of Hittite Monuments

    NASA Astrophysics Data System (ADS)

    González-García, A. César; Belmonte, Juan Antonio

    The possible astronomical or topographical orientations of the Hittite monuments of the Bronze Age has remained unexplored until recently. This would provide an important insight into how temporality was imprinted by this culture in sacred spaces and in the landscape. The authors' analysis of a statistically significant sample of Hittite temples - and a few monumental gates - has demonstrated that ancient Hittite monuments were not randomly orientated as previously thought. On the contrary, there were well-defined patterns of orientation that can be interpreted within the context of Hittite culture and religion.

  15. ARTI refrigerant database

    SciTech Connect

    Calm, J.M.

    1996-11-15

    The Refrigerant Database is an information system on alternative refrigerants, associated lubricants, and their use in air conditioning and refrigeration. It consolidates and facilitates access to property, compatibility, environmental, safety, application and other information. It provides corresponding information on older refrigerants, to assist manufacturers and those using alternative refrigerants, to make comparisons and determine differences. The underlying purpose is to accelerate phase out of chemical compounds of environmental concern.

  16. Real Time Baseball Database

    NASA Astrophysics Data System (ADS)

    Fukue, Yasuhiro

    The author describes the system outline, features and operations of "Nikkan Sports Realtime Basaball Database" which was developed and operated by Nikkan Sports Shimbun, K. K. The system enables to input numerical data of professional baseball games as they proceed simultaneously, and execute data updating at realtime, just-in-time. Other than serving as supporting tool for prepareing newspapers it is also available for broadcasting media, general users through NTT dial Q2 and others.

  17. ARTI refrigerant database

    SciTech Connect

    Calm, J.M.

    1999-01-01

    The Refrigerant Database is an information system on alternative refrigerants, associated lubricants, and their use in air conditioning and refrigeration. It consolidates and facilities access to property, compatibility, environmental, safety, application and other information. It provides corresponding information on older refrigerants, to assist manufacturers and those using alternative refrigerants, to make comparisons and determine differences. The underlying purpose is to accelerate phase out of chemical compounds of environmental concern.

  18. ARTI refrigerant database

    SciTech Connect

    Calm, J.M.

    1996-07-01

    The Refrigerant Database is an information system on alternative refrigerants, associated lubricants, and their use in air conditioning and refrigeration. It consolidates and facilitates access to property, compatibility, environmental, safety, application and other information. It provides corresponding information on older refrigerants, to assist manufacturers and those using alternative refrigerants, to make comparisons and determine differences. The underlying purpose is to accelerate phase out of chemical compounds of environmental concern.

  19. ARTI refrigerant database

    SciTech Connect

    Calm, J.M.

    1996-01-15

    The Refrigerant Database is an information system on alternative refrigerants, associated lubricants, and their use in air conditioning and refrigeration. it consolidates and facilitates.access to property, compatibility, environmental, safety, application and other information. It provides corresponding information on older refrigerants, to assist manufacturers and those using alternative refrigerants, to make comparisons and determine differences. The underlying purpose is to accelerate phase out of chemical compounds of environmental concern.

  20. Orienteering: An Annotated Bibliography = Orientierungslauf: Eine kommentierte Bibliographie.

    ERIC Educational Resources Information Center

    Seiler, Roland, Ed.; Hartmann, Wolfgang, Ed.

    1994-01-01

    Annotated bibliography of 220 books, monographs, and journal articles on orienteering published 1984-94, from SPOLIT database of the Federal Institute of Sport Science (Cologne, Germany). Annotations in English or German. Ten sections including psychological, physiological, health, sociological, and environmental aspects; training and coaching;…

  1. ARTI Refrigerant Database

    SciTech Connect

    Cain, J.M.

    1993-04-30

    The Refrigerant Database consolidates and facilitates access to information to assist industry in developing equipment using alternative refrigerants. The underlying purpose is to accelerate phase out of chemical compounds of environmental concern. The database provides bibliographic citations and abstracts for publications that may be useful in research and design of air-conditioning and refrigeration equipment. The complete documents are not included. The database identifies sources of specific information on R-32, R-123, R-124, R-125, R-134, R-134a, R-141b, R-142b, R-143a, R-152a, R-245ca, R-290 (propane), R-717 (ammonia), ethers, and others as well as azeotropic and zeotropic blends of these fluids. It addresses lubricants including alkylbenzene, polyalkylene glycol, ester, and other synthetics as well as mineral oils. It also references documents addressing compatibility of refrigerants and lubricants with metals, plastics, elastomers, motor insulation, and other materials used in refrigerant circuits. Incomplete citations or abstracts are provided for some documents to accelerate availability of the information and will be completed or replaced in future updates.

  2. The Cambridge Structural Database

    PubMed Central

    Groom, Colin R.; Bruno, Ian J.; Lightfoot, Matthew P.; Ward, Suzanna C.

    2016-01-01

    The Cambridge Structural Database (CSD) contains a complete record of all published organic and metal–organic small-molecule crystal structures. The database has been in operation for over 50 years and continues to be the primary means of sharing structural chemistry data and knowledge across disciplines. As well as structures that are made public to support scientific articles, it includes many structures published directly as CSD Communications. All structures are processed both computationally and by expert structural chemistry editors prior to entering the database. A key component of this processing is the reliable association of the chemical identity of the structure studied with the experimental data. This important step helps ensure that data is widely discoverable and readily reusable. Content is further enriched through selective inclusion of additional experimental data. Entries are available to anyone through free CSD community web services. Linking services developed and maintained by the CCDC, combined with the use of standard identifiers, facilitate discovery from other resources. Data can also be accessed through CCDC and third party software applications and through an application programming interface. PMID:27048719

  3. ARTI refrigerant database

    SciTech Connect

    Calm, J.M.

    1997-02-01

    The Refrigerant Database is an information system on alternative refrigerants, associated lubricants, and their use in air conditioning and refrigeration. It consolidates and facilitates access to property, compatibility, environmental, safety, application and other information. It provides corresponding information on older refrigerants, to assist manufacturers and those using alterative refrigerants, to make comparisons and determine differences. The underlying purpose is to accelerate phase out of chemical compounds of environmental concern. The database provides bibliographic citations and abstracts for publications that may be useful in research and design of air-conditioning and refrigeration equipment. The complete documents are not included, though some may be added at a later date. The database identifies sources of specific information on various refrigerants. It addresses lubricants including alkylbenzene, polyalkylene glycol, polyolester, and other synthetics as well as mineral oils. It also references documents addressing compatibility of refrigerants and lubricants with metals, plastics, elastomers, motor insulation, and other materials used in refrigerant circuits. Incomplete citations or abstracts are provided for some documents. They are included to accelerate availability of the information and will be completed or replaced in future updates.

  4. Curcumin Resource Database

    PubMed Central

    Kumar, Anil; Chetia, Hasnahana; Sharma, Swagata; Kabiraj, Debajyoti; Talukdar, Narayan Chandra; Bora, Utpal

    2015-01-01

    Curcumin is one of the most intensively studied diarylheptanoid, Curcuma longa being its principal producer. This apart, a class of promising curcumin analogs has been generated in laboratories, aptly named as Curcuminoids which are showing huge potential in the fields of medicine, food technology, etc. The lack of a universal source of data on curcumin as well as curcuminoids has been felt by the curcumin research community for long. Hence, in an attempt to address this stumbling block, we have developed Curcumin Resource Database (CRDB) that aims to perform as a gateway-cum-repository to access all relevant data and related information on curcumin and its analogs. Currently, this database encompasses 1186 curcumin analogs, 195 molecular targets, 9075 peer reviewed publications, 489 patents and 176 varieties of C. longa obtained by extensive data mining and careful curation from numerous sources. Each data entry is identified by a unique CRDB ID (identifier). Furnished with a user-friendly web interface and in-built search engine, CRDB provides well-curated and cross-referenced information that are hyperlinked with external sources. CRDB is expected to be highly useful to the researchers working on structure as well as ligand-based molecular design of curcumin analogs. Database URL: http://www.crdb.in PMID:26220923

  5. The Cambridge Structural Database.

    PubMed

    Groom, Colin R; Bruno, Ian J; Lightfoot, Matthew P; Ward, Suzanna C

    2016-04-01

    The Cambridge Structural Database (CSD) contains a complete record of all published organic and metal-organic small-molecule crystal structures. The database has been in operation for over 50 years and continues to be the primary means of sharing structural chemistry data and knowledge across disciplines. As well as structures that are made public to support scientific articles, it includes many structures published directly as CSD Communications. All structures are processed both computationally and by expert structural chemistry editors prior to entering the database. A key component of this processing is the reliable association of the chemical identity of the structure studied with the experimental data. This important step helps ensure that data is widely discoverable and readily reusable. Content is further enriched through selective inclusion of additional experimental data. Entries are available to anyone through free CSD community web services. Linking services developed and maintained by the CCDC, combined with the use of standard identifiers, facilitate discovery from other resources. Data can also be accessed through CCDC and third party software applications and through an application programming interface. PMID:27048719

  6. ARTI refrigerant database

    SciTech Connect

    Calm, J.M.

    1998-08-01

    The Refrigerant Database is an information system on alternative refrigerants, associated lubricants, and their use in air conditioning and refrigeration. It consolidates and facilitates access to property, compatibility, environmental, safety, application and other information. It provides corresponding information on older refrigerants, to assist manufactures and those using alternative refrigerants, to make comparisons and determine differences. The underlying purpose is to accelerate phase out of chemical compounds of environmental concern. The database provides bibliographic citations and abstracts for publications that may be useful in research and design of air-conditioning and refrigeration equipment. The complete documents are not included, though some may be added at a later date. The database identifies sources of specific information on many refrigerants including propane, ammonia, water, carbon dioxide, propylene, ethers, and others as well as azeotropic and zeotropic blends of these fluids. It addresses lubricants including alkylbenzene, polyalkylene glycol, polyolester, and other synthetics as well as mineral oils. It also references documents addressing compatibility of refrigerants and lubricants with metals, plastics, elastomers, motor insulation, and other materials used in refrigerant circuits. Incomplete citations or abstracts are provided for some documents. They are included to accelerate availability of the information and will be completed or replaced in future updates.

  7. Human cancer databases (review).

    PubMed

    Pavlopoulou, Athanasia; Spandidos, Demetrios A; Michalopoulos, Ioannis

    2015-01-01

    Cancer is one of the four major non‑communicable diseases (NCD), responsible for ~14.6% of all human deaths. Currently, there are >100 different known types of cancer and >500 genes involved in cancer. Ongoing research efforts have been focused on cancer etiology and therapy. As a result, there is an exponential growth of cancer‑associated data from diverse resources, such as scientific publications, genome‑wide association studies, gene expression experiments, gene‑gene or protein‑protein interaction data, enzymatic assays, epigenomics, immunomics and cytogenetics, stored in relevant repositories. These data are complex and heterogeneous, ranging from unprocessed, unstructured data in the form of raw sequences and polymorphisms to well‑annotated, structured data. Consequently, the storage, mining, retrieval and analysis of these data in an efficient and meaningful manner pose a major challenge to biomedical investigators. In the current review, we present the central, publicly accessible databases that contain data pertinent to cancer, the resources available for delivering and analyzing information from these databases, as well as databases dedicated to specific types of cancer. Examples for this wealth of cancer‑related information and bioinformatic tools have also been provided. PMID:25369839

  8. State Analysis Database Tool

    NASA Technical Reports Server (NTRS)

    Rasmussen, Robert; Bennett, Matthew

    2006-01-01

    The State Analysis Database Tool software establishes a productive environment for collaboration among software and system engineers engaged in the development of complex interacting systems. The tool embodies State Analysis, a model-based system engineering methodology founded on a state-based control architecture (see figure). A state represents a momentary condition of an evolving system, and a model may describe how a state evolves and is affected by other states. The State Analysis methodology is a process for capturing system and software requirements in the form of explicit models and states, and defining goal-based operational plans consistent with the models. Requirements, models, and operational concerns have traditionally been documented in a variety of system engineering artifacts that address different aspects of a mission s lifecycle. In State Analysis, requirements, models, and operations information are State Analysis artifacts that are consistent and stored in a State Analysis Database. The tool includes a back-end database, a multi-platform front-end client, and Web-based administrative functions. The tool is structured to prompt an engineer to follow the State Analysis methodology, to encourage state discovery and model description, and to make software requirements and operations plans consistent with model descriptions.

  9. Filovirus RefSeq Entries: Evaluation and Selection of Filovirus Type Variants, Type Sequences, and Names

    PubMed Central

    Kuhn, Jens H.; Andersen, Kristian G.; Bào, Yīmíng; Bavari, Sina; Becker, Stephan; Bennett, Richard S.; Bergman, Nicholas H.; Blinkova, Olga; Bradfute, Steven; Brister, J. Rodney; Bukreyev, Alexander; Chandran, Kartik; Chepurnov, Alexander A.; Davey, Robert A.; Dietzgen, Ralf G.; Doggett, Norman A.; Dolnik, Olga; Dye, John M.; Enterlein, Sven; Fenimore, Paul W.; Formenty, Pierre; Freiberg, Alexander N.; Garry, Robert F.; Garza, Nicole L.; Gire, Stephen K.; Gonzalez, Jean-Paul; Griffiths, Anthony; Happi, Christian T.; Hensley, Lisa E.; Herbert, Andrew S.; Hevey, Michael C.; Hoenen, Thomas; Honko, Anna N.; Ignatyev, Georgy M.; Jahrling, Peter B.; Johnson, Joshua C.; Johnson, Karl M.; Kindrachuk, Jason; Klenk, Hans-Dieter; Kobinger, Gary; Kochel, Tadeusz J.; Lackemeyer, Matthew G.; Lackner, Daniel F.; Leroy, Eric M.; Lever, Mark S.; Mühlberger, Elke; Netesov, Sergey V.; Olinger, Gene G.; Omilabu, Sunday A.; Palacios, Gustavo; Panchal, Rekha G.; Park, Daniel J.; Patterson, Jean L.; Paweska, Janusz T.; Peters, Clarence J.; Pettitt, James; Pitt, Louise; Radoshitzky, Sheli R.; Ryabchikova, Elena I.; Saphire, Erica Ollmann; Sabeti, Pardis C.; Sealfon, Rachel; Shestopalov, Aleksandr M.; Smither, Sophie J.; Sullivan, Nancy J.; Swanepoel, Robert; Takada, Ayato; Towner, Jonathan S.; van der Groen, Guido; Volchkov, Viktor E.; Volchkova, Valentina A.; Wahl-Jensen, Victoria; Warren, Travis K.; Warfield, Kelly L.; Weidmann, Manfred; Nichol, Stuart T.

    2014-01-01

    Sequence determination of complete or coding-complete genomes of viruses is becoming common practice for supporting the work of epidemiologists, ecologists, virologists, and taxonomists. Sequencing duration and costs are rapidly decreasing, sequencing hardware is under modification for use by non-experts, and software is constantly being improved to simplify sequence data management and analysis. Thus, analysis of virus disease outbreaks on the molecular level is now feasible, including characterization of the evolution of individual virus populations in single patients over time. The increasing accumulation of sequencing data creates a management problem for the curators of commonly used sequence databases and an entry retrieval problem for end users. Therefore, utilizing the data to their fullest potential will require setting nomenclature and annotation standards for virus isolates and associated genomic sequences. The National Center for Biotechnology Information’s (NCBI’s) RefSeq is a non-redundant, curated database for reference (or type) nucleotide sequence records that supplies source data to numerous other databases. Building on recently proposed templates for filovirus variant naming [ ()////variant designation>-], we report consensus decisions from a majority of past and currently active filovirus experts on the eight filovirus type variants and isolates to be represented in RefSeq, their final designations, and their associated sequences. PMID:25256396

  10. Filovirus RefSeq entries: evaluation and selection of filovirus type variants, type sequences, and names.

    PubMed

    Kuhn, Jens H; Andersen, Kristian G; Bào, Yīmíng; Bavari, Sina; Becker, Stephan; Bennett, Richard S; Bergman, Nicholas H; Blinkova, Olga; Bradfute, Steven; Brister, J Rodney; Bukreyev, Alexander; Chandran, Kartik; Chepurnov, Alexander A; Davey, Robert A; Dietzgen, Ralf G; Doggett, Norman A; Dolnik, Olga; Dye, John M; Enterlein, Sven; Fenimore, Paul W; Formenty, Pierre; Freiberg, Alexander N; Garry, Robert F; Garza, Nicole L; Gire, Stephen K; Gonzalez, Jean-Paul; Griffiths, Anthony; Happi, Christian T; Hensley, Lisa E; Herbert, Andrew S; Hevey, Michael C; Hoenen, Thomas; Honko, Anna N; Ignatyev, Georgy M; Jahrling, Peter B; Johnson, Joshua C; Johnson, Karl M; Kindrachuk, Jason; Klenk, Hans-Dieter; Kobinger, Gary; Kochel, Tadeusz J; Lackemeyer, Matthew G; Lackner, Daniel F; Leroy, Eric M; Lever, Mark S; Mühlberger, Elke; Netesov, Sergey V; Olinger, Gene G; Omilabu, Sunday A; Palacios, Gustavo; Panchal, Rekha G; Park, Daniel J; Patterson, Jean L; Paweska, Janusz T; Peters, Clarence J; Pettitt, James; Pitt, Louise; Radoshitzky, Sheli R; Ryabchikova, Elena I; Saphire, Erica Ollmann; Sabeti, Pardis C; Sealfon, Rachel; Shestopalov, Aleksandr M; Smither, Sophie J; Sullivan, Nancy J; Swanepoel, Robert; Takada, Ayato; Towner, Jonathan S; van der Groen, Guido; Volchkov, Viktor E; Volchkova, Valentina A; Wahl-Jensen, Victoria; Warren, Travis K; Warfield, Kelly L; Weidmann, Manfred; Nichol, Stuart T

    2014-09-01

    Sequence determination of complete or coding-complete genomes of viruses is becoming common practice for supporting the work of epidemiologists, ecologists, virologists, and taxonomists. Sequencing duration and costs are rapidly decreasing, sequencing hardware is under modification for use by non-experts, and software is constantly being improved to simplify sequence data management and analysis. Thus, analysis of virus disease outbreaks on the molecular level is now feasible, including characterization of the evolution of individual virus populations in single patients over time. The increasing accumulation of sequencing data creates a management problem for the curators of commonly used sequence databases and an entry retrieval problem for end users. Therefore, utilizing the data to their fullest potential will require setting nomenclature and annotation standards for virus isolates and associated genomic sequences. The National Center for Biotechnology Information's (NCBI's) RefSeq is a non-redundant, curated database for reference (or type) nucleotide sequence records that supplies source data to numerous other databases. Building on recently proposed templates for filovirus variant naming [ ()////variant designation>-], we report consensus decisions from a majority of past and currently active filovirus experts on the eight filovirus type variants and isolates to be represented in RefSeq, their final designations, and their associated sequences. PMID:25256396

  11. The PROTICdb database for 2-DE proteomics.

    PubMed

    Langella, Olivier; Zivy, Michel; Joets, Johann

    2007-01-01

    PROTICdb is a web-based database mainly designed to store and analyze plant proteome data obtained by 2D polyacrylamide gel electrophoresis (2D PAGE) and mass spectrometry (MS). The goals of PROTICdb are (1) to store, track, and query information related to proteomic experiments, i.e., from tissue sampling to protein identification and quantitative measurements; and (2) to integrate information from the user's own expertise and other sources into a knowledge base, used to support data interpretation (e.g., for the determination of allelic variants or products of posttranslational modifications). Data insertion into the relational database of PROTICdb is achieved either by uploading outputs from Mélanie, PDQuest, IM2d, ImageMaster(tm) 2D Platinum v5.0, Progenesis, Sequest, MS-Fit, and Mascot software, or by filling in web forms (experimental design and methods). 2D PAGE-annotated maps can be displayed, queried, and compared through the GelBrowser. Quantitative data can be easily exported in a tabulated format for statistical analyses with any third-party software. PROTICdb is based on the Oracle or the PostgreSQLDataBase Management System (DBMS) and is freely available upon request at http://cms.moulon.inra.fr/content/view/14/44/. PMID:17093318

  12. Making CORBA objects persistent: The object database adapter approach

    SciTech Connect

    Reverbel, F.C.R.

    1997-05-01

    In spite of its remarkable successes in promoting standards for distributed object systems, the Object Management Group (OMG) has not yet settled the issue of object persistence in the Object Request Broker (ORB) environment. The Common Object Request Broker Architecture (CORBA) specification briefly mentions an Object-Oriented Database Adapter that makes objects stored in an object-oriented database accessible through the ORB. This idea is pursued in the Appendix B of the ODMG standard, which identifies a number of issues involved in using an Object Database Management System (ODBMS) in a CORBA environment, and proposes an Object Database Adapter (ODA) to realize the integration of the ORB with the ODBMS. This paper discusses the design and implementation of an ODA that integrates an ORB and an ODBMS with C++ bindings. For the author`s purposes, an ODBMS is a system with programming interfaces. It may be a pure object-oriented DBMS (an OODBMS), or a combination of a relational DBMS and an object-relational mapper.

  13. Spanish personal name variations in national and international biomedical databases: implications for information retrieval and bibliometric studies

    PubMed Central

    Ruiz-Pérez, R.; López-Cózar, E. Delgado; Jiménez-Contreras, E.

    2002-01-01

    Objectives: The study sought to investigate how Spanish names are handled by national and international databases and to identify mistakes that can undermine the usefulness of these databases for locating and retrieving works by Spanish authors. Methods: The authors sampled 172 articles published by authors from the University of Granada Medical School between 1987 and 1996 and analyzed the variations in how each of their names was indexed in Science Citation Index (SCI), MEDLINE, and Índice Médico Español (IME). The number and types of variants that appeared for each author's name were recorded and compared across databases to identify inconsistencies in indexing practices. We analyzed the relationship between variability (number of variants of an author's name) and productivity (number of items the name was associated with as an author), the consequences for retrieval of information, and the most frequent indexing structures used for Spanish names. Results: The proportion of authors who appeared under more then one name was 48.1% in SCI, 50.7% in MEDLINE, and 69.0% in IME. Productivity correlated directly with variability: more than 50% of the authors listed on five to ten items appeared under more than one name in any given database, and close to 100% of the authors listed on more than ten items appeared under two or more variants. Productivity correlated inversely with retrievability: as the number of variants for a name increased, the number of items retrieved under each variant decreased. For the most highly productive authors, the number of items retrieved under each variant tended toward one. The most frequent indexing methods varied between databases. In MEDLINE and IME, names were indexed correctly as “first surname second surname, first name initial middle name initial” (if present) in 41.7% and 49.5% of the records, respectively. However, in SCI, the most frequent method was “first surname, first name initial second name initial” (48.0% of

  14. Passive orientation apparatus

    DOEpatents

    Spletzer, Barry L.; Fischer, Gary J.; Martinez, Michael A.

    2001-01-01

    An apparatus that can return a payload to a known orientation after unknown motion, without requiring external power or complex mechanical systems. The apparatus comprises a faceted cage that causes the system to rest in a stable position and orientation after arbitrary motion. A gimbal is mounted with the faceted cage and holds the payload, allowing the payload to move relative to the stable faceted cage. The payload is thereby placed in a known orientation by the interaction of gravity with the geometry of the faceted cage, the mass of the system, and the motion of the payload and gimbal. No additional energy, control, or mechanical actuation is required. The apparatus is suitable for use in applications requiring positioning of a payload to a known orientation after arbitrary or uncontrolled motion, including remote sensing and mobile robot applications.

  15. Implementing Strategic Orientation

    ERIC Educational Resources Information Center

    Fischer, Arthur K.; Brownback, Sarah

    2012-01-01

    An HRM case dealing with problems and issues of setting up orientation programs which align with corporate strategy. Discussion concerns how such a case can be used to exhibit the alignment between HRM and business strategy.

  16. Orientation of histopathology specimens.

    PubMed

    Burns, A; Adams, J; Endersby, S

    2004-02-01

    We present a simple way of orientating large specimens being sent to the laboratory for histopathological examination by supplementing the pinning of the specimen on a cork board with Polaroid photographs of the specimen and numbered tags. PMID:14706306

  17. Orientations to Reflective Practice.

    ERIC Educational Resources Information Center

    Wellington, Bud; Austin, Patricia

    1996-01-01

    Delineates five orientations to reflective practice: immediate, technical, deliberative, dialectic, and transpersonal, each reflecting different social science bases and beliefs and values about education. Views them as interactive, interdependent, noncompeting, aspects of reflective practice. (SK)

  18. Sexual Orientation (For Parents)

    MedlinePlus

    ... For Kids For Parents MORE ON THIS TOPIC Transgender People Teaching Your Child Tolerance STDs Understanding Early ... and Romance Am I in a Healthy Relationship? Transgender People Sexual Attraction and Orientation Contact Us Print ...

  19. Identification of rare DNA sequence variants in high-risk autism families and their prevalence in a large case/control population

    PubMed Central

    2014-01-01

    Background Genetics clearly plays a major role in the etiology of autism spectrum disorders (ASDs), but studies to date are only beginning to characterize the causal genetic variants responsible. Until recently, studies using multiple extended multi-generation families to identify ASD risk genes had not been undertaken. Methods We identified haplotypes shared among individuals with ASDs in large multiplex families, followed by targeted DNA capture and sequencing to identify potential causal variants. We also assayed the prevalence of the identified variants in a large ASD case/control population. Results We identified 584 non-conservative missense, nonsense, frameshift and splice site variants that might predispose to autism in our high-risk families. Eleven of these variants were observed to have odds ratios greater than 1.5 in a set of 1,541 unrelated children with autism and 5,785 controls. Three variants, in the RAB11FIP5, ABP1, and JMJD7-PLA2G4B genes, each were observed in a single case and not in any controls. These variants also were not seen in public sequence databases, suggesting that they may be rare causal ASD variants. Twenty-eight additional rare variants were observed only in high-risk ASD families. Collectively, these 39 variants identify 36 genes as ASD risk genes. Segregation of sequence variants and of copy number variants previously detected in these families reveals a complex pattern, with only a RAB11FIP5 variant segregating to all affected individuals in one two-generation pedigree. Some affected individuals were found to have multiple potential risk alleles, including sequence variants and copy number variants (CNVs), suggesting that the high incidence of autism in these families could be best explained by variants at multiple loci. Conclusions Our study is the first to use haplotype sharing to identify familial ASD risk loci. In total, we identified 39 variants in 36 genes that may confer a genetic risk of developing autism. The

  20. ThermoData Engine Database

    National Institute of Standards and Technology Data Gateway

    SRD 103 NIST ThermoData Engine Database (PC database for purchase)   ThermoData Engine is the first product fully implementing all major principles of the concept of dynamic data evaluation formulated at NIST/TRC.

  1. NEUSE RIVER WATER QUALITY DATABASE

    EPA Science Inventory

    The Neuse River water quality database is a Microsoft Access application that includes multiple data tables and some associated queries. The database was developed by Prof. Jim Bowen's research group.

  2. A Case for Database Filesystems

    SciTech Connect

    Adams, P A; Hax, J C

    2009-05-13

    Data intensive science is offering new challenges and opportunities for Information Technology and traditional relational databases in particular. Database filesystems offer the potential to store Level Zero data and analyze Level 1 and Level 3 data within the same database system [2]. Scientific data is typically composed of both unstructured files and scalar data. Oracle SecureFiles is a new database filesystem feature in Oracle Database 11g that is specifically engineered to deliver high performance and scalability for storing unstructured or file data inside the Oracle database. SecureFiles presents the best of both the filesystem and the database worlds for unstructured content. Data stored inside SecureFiles can be queried or written at performance levels comparable to that of traditional filesystems while retaining the advantages of the Oracle database.

  3. Dietary Supplement Label Database (DSLD)

    MedlinePlus

    ... Print Report Error T he Dietary Supplement Label Database (DSLD) is a joint project of the National ... participants in the latest survey in the DSLD database (NHANES): The search options: Quick Search, Browse Dietary ...

  4. High Temperature Superconducting Materials Database

    National Institute of Standards and Technology Data Gateway

    SRD 149 NIST High Temperature Superconducting Materials Database (Web, free access)   The NIST High Temperature Superconducting Materials Database (WebHTS) provides evaluated thermal, mechanical, and superconducting property data for oxides and other nonconventional superconductors.

  5. The PIR-International databases.

    PubMed Central

    Barker, W C; George, D G; Mewes, H W; Pfeiffer, F; Tsugita, A

    1993-01-01

    PIR-International is an association of macromolecular sequence data collection centers dedicated to fostering international cooperation as an essential element in the development of scientific databases. PIR-International is most noted for the Protein Sequence Database. This database originated in the early 1960's with the pioneering work of the late Margaret Dayhoff as a research tool for the study of protein evolution and intersequence relationships; it is maintained as a scientific resource, organized by biological concepts, using sequence homology as a guiding principle. PIR-International also maintains a number of other genomic, protein sequence, and sequence-related databases. The databases of PIR-International are made widely available. This paper briefly describes the architecture of the Protein Sequence Database, a number of other PIR-International databases, and mechanisms for providing access to and for distribution of these databases. PMID:8332528

  6. SmallSat Database

    NASA Technical Reports Server (NTRS)

    Petropulos, Dolores; Bittner, David; Murawski, Robert; Golden, Bert

    2015-01-01

    The SmallSat has an unrealized potential in both the private industry and in the federal government. Currently over 70 companies, 50 universities and 17 governmental agencies are involved in SmallSat research and development. In 1994, the U.S. Army Missile and Defense mapped the moon using smallSat imagery. Since then Smart Phones have introduced this imagery to the people of the world as diverse industries watched this trend. The deployment cost of smallSats is also greatly reduced compared to traditional satellites due to the fact that multiple units can be deployed in a single mission. Imaging payloads have become more sophisticated, smaller and lighter. In addition, the growth of small technology obtained from private industries has led to the more widespread use of smallSats. This includes greater revisit rates in imagery, significantly lower costs, the ability to update technology more frequently and the ability to decrease vulnerability of enemy attacks. The popularity of smallSats show a changing mentality in this fast paced world of tomorrow. What impact has this created on the NASA communication networks now and in future years? In this project, we are developing the SmallSat Relational Database which can support a simulation of smallSats within the NASA SCaN Compatability Environment for Networks and Integrated Communications (SCENIC) Modeling and Simulation Lab. The NASA Space Communications and Networks (SCaN) Program can use this modeling to project required network support needs in the next 10 to 15 years. The SmallSat Rational Database could model smallSats just as the other SCaN databases model the more traditional larger satellites, with a few exceptions. One being that the smallSat Database is designed to be built-to-order. The SmallSat database holds various hardware configurations that can be used to model a smallSat. It will require significant effort to develop as the research material can only be populated by hand to obtain the unique data

  7. Assigning Main Orientation to an EOH Descriptor on Multispectral Images

    PubMed Central

    Li, Yong; Shi, Xiang; Wei, Lijun; Zou, Junwei; Chen, Fang

    2015-01-01

    This paper proposes an approach to compute an EOH (edge-oriented histogram) descriptor with main orientation. EOH has a better matching ability than SIFT (scale-invariant feature transform) on multispectral images, but does not assign a main orientation to keypoints. Alternatively, it tends to assign the same main orientation to every keypoint, e.g., zero degrees. This limits EOH to matching keypoints between images of translation misalignment only. Observing this limitation, we propose assigning to keypoints the main orientation that is computed with PIIFD (partial intensity invariant feature descriptor). In the proposed method, SIFT keypoints are detected from images as the extrema of difference of Gaussians, and every keypoint is assigned to the main orientation computed with PIIFD. Then, EOH is computed for every keypoint with respect to its main orientation. In addition, an implementation variant is proposed for fast computation of the EOH descriptor. Experimental results show that the proposed approach performs more robustly than the original EOH on image pairs that have a rotation misalignment. PMID:26140348

  8. An object-oriented approach to simulator postprocessing

    SciTech Connect

    Leach, B.F.; Scherer, P.W.; Starley, G.P.

    1994-08-01

    An interactive, graphical software package provides the ability to view production well data generated by reservoir simulation. The program (KEYPLOT-X) includes several novel concepts, such as use of object-oriented technology for graphical software and a direct-access database structure. The entire application is constructed from a library of elemental objects. Inheritance of properties between objects produces extremely modular code, which greatly enhances maintenance and extendibility. The database has a direct-access hierarchical structure that is object-oriented, simplifying the data access protocol to provide rapid interactivity between the database, applications, and user interface. The overall approach has provided a high degree of functionality and flexibility to engineering applications and a manageable software structure for maintenance and development.

  9. Distributed Object Oriented Geographic Information System

    Energy Science and Technology Software Center (ESTSC)

    1997-02-01

    This interactive, object-oriented, distributed Geographic Information System (GIS) uses the World Wibe Web (WWW) as application medium and distribution mechanism. The software provides distributed access to multiple geo-spatial databases and presents them as if they came from a single coherent database. DOOGIS distributed access comes not only in the form of multiple geo-spatial servers but can break down a single logical server into the constituent physical servers actually storing the data. The program provides formore » dynamic protocol resolution and content handling allowing unknown objects from a particular server to download their handling code. Security and access privileges are negotiated dynamically with each server contacted and each access attempt.« less

  10. Hydrogen Leak Detection Sensor Database

    NASA Technical Reports Server (NTRS)

    Baker, Barton D.

    2010-01-01

    This slide presentation reviews the characteristics of the Hydrogen Sensor database. The database is the result of NASA's continuing interest in and improvement of its ability to detect and assess gas leaks in space applications. The database specifics and a snapshot of an entry in the database are reviewed. Attempts were made to determine the applicability of each of the 65 sensors for ground and/or vehicle use.

  11. Large-scale mass spectrometric detection of variant peptides resulting from non-synonymous nucleotide differences

    PubMed Central

    Sheynkman, Gloria M.; Shortreed, Michael R.; Frey, Brian L.; Scalf, Mark; Smith, Lloyd M.

    2013-01-01

    Each individual carries thousands of non-synonymous single nucleotide variants (nsSNVs) in their genome, each corresponding to a single amino acid polymorphism (SAP) in the encoded proteins. It is important to be able to directly detect and quantify these variations at the protein level in order to study post-transcriptional regulation, differential allelic expression, and other important biological processes. However, such variant peptides are not generally detected in standard proteomic analyses, due to their absence from the generic databases that are employed for mass spectrometry searching. Here, we extend previous work that demonstrated the use of customized SAP databases constructed from sample-matched RNA-Seq data. We collected deep coverage RNA-Seq data from the Jurkat cell line, compiled the set of nsSNVs that are expressed, used this information to construct a customized SAP database, and searched it against deep coverage shotgun MS data obtained from the same sample. This approach enabled detection of 421 SAP peptides mapping to 395 nsSNVs. We compared these peptides to peptides identified from a large generic search database containing all known nsSNVs (dbSNP) and found that more than 70% of the SAP peptides from this dbSNP-derived search were not supported by the RNA-Seq data, and thus are likely false positives. Next, we increased the SAP coverage from the RNA-Seq derived database by utilizing multiple protease digestions, thereby increasing variant detection to 695 SAP peptides mapping to 504 nsSNV sites. These detected SAP peptides corresponded to moderate to high abundance transcripts (30+ transcripts per million, TPM). The SAP peptides included 192 allelic pairs; the relative expression levels of the two alleles were evaluated for 51 of those pairs, and found to be comparable in all cases. PMID:24175627

  12. Functional characterization of Ape1 variants identified in the human population

    PubMed Central

    Hadi, Masood Z.; Coleman, Matthew A.; Fidelis, Krzysztof; Mohrenweiser, Harvey W.; Wilson, David M.

    2000-01-01

    Apurinic/apyrimidinic (AP) sites are common mutagenic and cytotoxic DNA lesions. Ape1 is the major human repair enzyme for abasic sites and incises the phosphodiester backbone 5′ to the lesion to initiate a cascade of events aimed at removing the AP moiety and maintaining genetic integrity. Through resequencing of genomic DNA from 128 unrelated individuals, and searching published reports and sequence databases, seven amino acid substitution variants were identified in the repair domain of human Ape1. Functional characterization revealed that three of the variants, L104R, E126D and R237A, exhibited ∼40–60% reductions in specific incision activity. A fourth variant, D283G, is similar to the previously characterized mutant D283A found to exhibit ∼10% repair capacity. The most common substitution (D148E; observed at an allele frequency of 0.38) had no impact on endonuclease and DNA binding activities, nor did a G306A substitution. A G241R variant showed slightly enhanced endonuclease activity relative to wild-type. In total, four of seven substitutions in the repair domain of Ape1 imparted reduced function. These reduced function variants may represent low penetrance human polymorphisms that associate with increased disease susceptibility. PMID:11024165

  13. Intrahaplotypic Variants Differentiate Complex Linkage Disequilibrium within Human MHC Haplotypes

    PubMed Central

    Lam, Tze Hau; Tay, Matthew Zirui; Wang, Bei; Xiao, Ziwei; Ren, Ee Chee

    2015-01-01

    Distinct regions of long-range genetic fixation in the human MHC region, known as conserved extended haplotypes (CEHs), possess unique genomic characteristics and are strongly associated with numerous diseases. While CEHs appear to be homogeneous by SNP analysis, the nature of fine variations within their genomic structure is unknown. Using multiple, MHC-homozygous cell lines, we demonstrate extensive sequence conservation in two common Asian MHC haplotypes: A33-B58-DR3 and A2-B46-DR9. However, characterization of phase-resolved MHC haplotypes revealed unique intra-CEH patterns of variation and uncovered 127 single nucleotide variants (SNVs) which are missing from public databases. We further show that the strong linkage disequilibrium structure within the human MHC that typically confounds precise identification of genetic features can be resolved using intra-CEH variants, as evidenced by rs3129063 and rs448489, which affect expression of ZFP57, a gene important in methylation and epigenetic regulation. This study demonstrates an improved strategy that can be used towards genetic dissection of diseases. PMID:26593880

  14. Learning to Model Task-Oriented Attention.

    PubMed

    Zou, Xiaochun; Zhao, Xinbo; Wang, Jian; Yang, Yongjia

    2016-01-01

    For many applications in graphics, design, and human computer interaction, it is essential to understand where humans look in a scene with a particular task. Models of saliency can be used to predict fixation locations, but a large body of previous saliency models focused on free-viewing task. They are based on bottom-up computation that does not consider task-oriented image semantics and often does not match actual eye movements. To address this problem, we collected eye tracking data of 11 subjects when they performed some particular search task in 1307 images and annotation data of 2,511 segmented objects with fine contours and 8 semantic attributes. Using this database as training and testing examples, we learn a model of saliency based on bottom-up image features and target position feature. Experimental results demonstrate the importance of the target information in the prediction of task-oriented visual attention. PMID:27247561

  15. Learning to Model Task-Oriented Attention

    PubMed Central

    Zou, Xiaochun; Zhao, Xinbo; Wang, Jian; Yang, Yongjia

    2016-01-01

    For many applications in graphics, design, and human computer interaction, it is essential to understand where humans look in a scene with a particular task. Models of saliency can be used to predict fixation locations, but a large body of previous saliency models focused on free-viewing task. They are based on bottom-up computation that does not consider task-oriented image semantics and often does not match actual eye movements. To address this problem, we collected eye tracking data of 11 subjects when they performed some particular search task in 1307 images and annotation data of 2,511 segmented objects with fine contours and 8 semantic attributes. Using this database as training and testing examples, we learn a model of saliency based on bottom-up image features and target position feature. Experimental results demonstrate the importance of the target information in the prediction of task-oriented visual attention. PMID:27247561

  16. Microbial Properties Database Editor Tutorial

    EPA Science Inventory

    A Microbial Properties Database Editor (MPDBE) has been developed to help consolidate microbial-relevant data to populate a microbial database and support a database editor by which an authorized user can modify physico-microbial properties related to microbial indicators and pat...

  17. Scientific and Technical Document Database

    National Institute of Standards and Technology Data Gateway

    NIST Scientific and Technical Document Database (PC database for purchase)   The images in NIST Special Database 20 contain a very rich set of graphic elements from scientific and technical documents, such as graphs, tables, equations, two column text, maps, pictures, footnotes, annotations, and arrays of such elements.

  18. Ontology-Oriented Programming for Biomedical Informatics.

    PubMed

    Lamy, Jean-Baptiste

    2016-01-01

    Ontologies are now widely used in the biomedical domain. However, it is difficult to manipulate ontologies in a computer program and, consequently, it is not easy to integrate ontologies with databases or websites. Two main approaches have been proposed for accessing ontologies in a computer program: traditional API (Application Programming Interface) and ontology-oriented programming, either static or dynamic. In this paper, we will review these approaches and discuss their appropriateness for biomedical ontologies. We will also present an experience feedback about the integration of an ontology in a computer software during the VIIIP research project. Finally, we will present OwlReady, the solution we developed. PMID:27071878

  19. Rare variants in β-Amyloid precursor protein (APP) and Parkinson's disease.

    PubMed

    Schulte, Eva C; Fukumori, Akio; Mollenhauer, Brit; Hor, Hyun; Arzberger, Thomas; Perneczky, Robert; Kurz, Alexander; Diehl-Schmid, Janine; Hüll, Michael; Lichtner, Peter; Eckstein, Gertrud; Zimprich, Alexander; Haubenberger, Dietrich; Pirker, Walter; Brücke, Thomas; Bereznai, Benjamin; Molnar, Maria J; Lorenzo-Betancor, Oswaldo; Pastor, Pau; Peters, Annette; Gieger, Christian; Estivill, Xavier; Meitinger, Thomas; Kretzschmar, Hans A; Trenkwalder, Claudia; Haass, Christian; Winkelmann, Juliane

    2015-10-01

    Many individuals with Parkinson's disease (PD) develop cognitive deficits, and a phenotypic and molecular overlap between neurodegenerative diseases exists. We investigated the contribution of rare variants in seven genes of known relevance to dementias (β-amyloid precursor protein (APP), PSEN1/2, MAPT (microtubule-associated protein tau), fused in sarcoma (FUS), granulin (GRN) and TAR DNA-binding protein 43 (TDP-43)) to PD and PD plus dementia (PD+D) in a discovery sample of 376 individuals with PD and followed by the genotyping of 25 out of the 27 identified variants with a minor allele frequency <5% in 975 individuals with PD, 93 cases with Lewy body disease on neuropathological examination, 613 individuals with Alzheimer's disease (AD), 182 cases with frontotemporal dementia and 1014 general population controls. Variants identified in APP were functionally followed up by Aβ mass spectrometry in transiently transfected HEK293 cells. PD+D cases harbored more rare variants across all the seven genes than PD individuals without dementia, and rare variants in APP were more common in PD cases overall than in either the AD cases or controls. When additional controls from publically available databases were added, one rare variant in APP (c.1795G>A(p.(E599K))) was significantly associated with the PD phenotype but was not found in either the PD cases or controls of an independent replication sample. One of the identified rare variants (c.2125G>A (p.(G709S))) shifted the Aβ spectrum from Aβ40 to Aβ39 and Aβ37. Although the precise mechanism remains to be elucidated, our data suggest a possible role for APP in modifying the PD phenotype as well as a general contribution of genetic factors to the development of dementia in individuals with PD. PMID:25604855

  20. Identification and functional characterization of rare SHANK2 variants in schizophrenia.

    PubMed

    Peykov, S; Berkel, S; Schoen, M; Weiss, K; Degenhardt, F; Strohmaier, J; Weiss, B; Proepper, C; Schratt, G; Nöthen, M M; Boeckers, T M; Rietschel, M; Rappold, G A

    2015-12-01

    Recent genetic data on schizophrenia (SCZ) have suggested that proteins of the postsynaptic density of excitatory synapses have a role in its etiology. Mutations in the three SHANK genes encoding for postsynaptic scaffolding proteins have been shown to represent risk factors for autism spectrum disorders and other neurodevelopmental disorders. To address if SHANK2 variants are associated with SCZ, we sequenced SHANK2 in 481 patients and 659 unaffected individuals. We identified a significant increase in the number of rare (minor allele frequency<1%) SHANK2 missense variants in SCZ individuals (6.9%) compared with controls (3.9%, P=0.039). Four out of fifteen non-synonymous variants identified in the SCZ cohort (S610Y, R958S, P1119T and A1731S) were selected for functional analysis. Overexpression and knockdown-rescue experiments were carried out in cultured primary hippocampal neurons with a major focus on the analysis of morphological changes. Furthermore, the effect on actin polymerization in fibroblast cell lines was investigated. All four variants revealed functional impairment to various degrees, as a consequence of alterations in spine volume and clustering at synapses and an overall loss of presynaptic contacts. The A1731S variant was identified in four unrelated SCZ patients (0.83%) but not in any of the sequenced controls and public databases (P=4.6 × 10(-5)). Patients with the A1731S variant share an early prodromal phase with an insidious onset of psychiatric symptoms. A1731S overexpression strongly decreased the SHANK2-Bassoon-positive synapse number and diminished the F/G-actin ratio. Our results strongly suggest a causative role of rare SHANK2 variants in SCZ and underline the contribution of SHANK2 gene mutations in a variety of neuropsychiatric disorders. PMID:25560758

  1. Prevalence and spectrum of BRCA germline variants in mainland Chinese familial breast and ovarian cancer patients

    PubMed Central

    Huang, Ye; Cui, Jian; Xiao, Fengxia; Downs, Bradley; Wang, San Ming

    2016-01-01

    Germline mutations in BRCA1 and BRCA2 are the most penetrating genetic predispositions for breast and ovarian cancer, and their presence is largely ethnic-specific. Comprehensive information about the prevalence and spectrum of BRCA mutations has been collected in European and North American populations. However, similar information is lacking in other populations, including the mainland Chinese population despite its large size of 1.4 billion accounting for one fifth of the world's population. Herein, we performed an extensive literature analysis to collect BRCA variants identified from mainland Chinese familial breast and ovarian cancer patients. We observed 137 distinct BRCA1 variants in 409 of 3,844 and 80 distinct BRCA2 variants in 157 of 3,024 mainland Chinese patients, with an estimated prevalence of 10.6% for BRCA1 and 5.2% for BRCA2. Of these variants, only 40.3% in BRCA1 and 42.5% in BRCA2 are listed in current Breast Cancer Information Core database. We observed higher frequent variation in BRCA1 exons 11A, 11C, 11D, and 24 and BRCA2 exon 10 in Chinese patients than in the patients of other populations. The most common pathogenic variant in BRCA1 wasc.981_982delAT in exon 11A, and in BRCA2 c.3195_3198delTAAT in exon 11B and c.5576_5579delTTAA in exon 11E; the most common novel variant in BRCA1 was c.919A>G in exon 10A, and in BRCA2 c.7142delC in exon 14. None of the variants overlap with the founder mutations in other populations. Our analysis indicates that the prevalence of BRCA variation in mainland Chinese familial breast and ovarian cancer patients is at a level similar to but the spectrum is substantially different from the ones of other populations. PMID:26848529

  2. Human Aldehyde Dehydrogenase Genes: Alternatively-Spliced Transcriptional Variants and Their Suggested Nomenclature

    PubMed Central

    Black, William J.; Stagos, Dimitrios; Marchitti, Satori A.; Nebert, Daniel W.; Tipton, Keith F.; Bairoch, Amos; Vasiliou, Vasilis

    2011-01-01

    OBJECTIVE The human aldehyde dehydrogenase (ALDH) gene superfamily consists of 19 genes encoding enzymes critical for NAD(P)+-dependent oxidation of endogenous and exogenous aldehydes, including drugs and environmental toxicants. Mutations in ALDH genes are the molecular basis of several disease states (e.g. Sjögren-Larsson syndrome, pyridoxine-dependent seizures, and type II hyperprolinemia) and may contribute to the etiology of complex diseases such as cancer and Alzheimer’s disease. The aim of this nomenclature update was to identify splice transcriptional variants principally for the human ALDH genes. METHODS Data-mining methods were used to retrieve all human ALDH sequences. Alternatively-spliced transcriptional variants were determined based upon: a) criteria for sequence integrity and genomic alignment; b) evidence of multiple independent cDNA sequences corresponding to a variant sequence; and c) if available, empirical evidence of variants from the literature. RESULTS AND CONCLUSION Alternatively-spliced transcriptional variants and their encoded proteins exist for most of the human ALDH genes; however, their function and significance remain to be established. When compared with the human genome, rat and mouse include an additional gene, Aldh1a7, in the ALDH1A subfamily. In order to avoid confusion when identifying splice variants in various genomes, nomenclature guidelines for the naming of such alternative transcriptional variants and proteins are recommended herein. In addition, a web database (www.aldh.org) has been developed to provide up-to-date information and nomenclature guidelines for the ALDH superfamily. PMID:19823103

  3. Theoretical Orientation to Reading and Teacher Verbal Feedback: A Selected Review of Recent Literature.

    ERIC Educational Resources Information Center

    Martin, Elaine Russo

    1982-01-01

    Documents and journal articles in the ERIC databases are listed which explore relationships between teachers' theoretical orientations and feedback to students during oral reading. Specifically, the research concerns findings from the Theoretical Orientation to Reading Profile, the Propositions about Reading Instruction Profile, and the Feedback…

  4. Electronic Publication of Health Information in an Object-Oriented Environment.

    ERIC Educational Resources Information Center

    Prettyman, Maureen; Antonucci, Robert; Lynch, Paul; Mericle, Lee

    1999-01-01

    The National Library of Medicine is supporting a research project on full-text search and retrieval. The project includes a fully deployed system, HSTAT, to provide access to government-supported health information. The retrieval system is an object-oriented client-server model and the data is stored in an object-oriented database management…

  5. GALT protein database: querying structural and functional features of GALT enzyme.

    PubMed

    d'Acierno, Antonio; Facchiano, Angelo; Marabotti, Anna

    2014-09-01

    Knowledge of the impact of variations on protein structure can enhance the comprehension of the mechanisms of genetic diseases related to that protein. Here, we present a new version of GALT Protein Database, a Web-accessible data repository for the storage and interrogation of structural effects of variations of the enzyme galactose-1-phosphate uridylyltransferase (GALT), the impairment of which leads to classic Galactosemia, a rare genetic disease. This new version of this database now contains the models of 201 missense variants of GALT enzyme, including heterozygous variants, and it allows users not only to retrieve information about the missense variations affecting this protein, but also to investigate their impact on substrate binding, intersubunit interactions, stability, and other structural features. In addition, it allows the interactive visualization of the models of variants collected into the database. We have developed additional tools to improve the use of the database by nonspecialized users. This Web-accessible database (http://bioinformatica.isa.cnr.it/GALT/GALT2.0) represents a model of tools potentially suitable for application to other proteins that are involved in human pathologies and that are subjected to genetic variations. PMID:24990533

  6. Genetic variants in adult liver diseases.

    PubMed

    Dröge, C; Häussinger, D; Keitel, V

    2015-12-01

    In the last decades, understanding of genetic variants contributing to liver disease development has considerably improved through novel genotyping techniques. Genetic variants of single genes are known to be decisive for the development of monogenetic liver diseases of varying severity. Identification of genetic variants is an important part of the diagnostic process, e. g. the majority of patients with high iron [Fe] (HFE)-associated hemochromatosis carry the homozygous mutation p.C282Y. Detection of mutations in genes encoding hepatobiliary transport proteins like familial intrahepatic cholestasis 1 (FIC1), bile salt export pump (BSEP), or multidrug resistance protein 3 (MDR3) is the basis to differentiate various forms of intrahepatic cholestasis. Moreover, genetic variants in a variety of genes are known to act as disease modifiers and represent risk factors for disease progression and the development of cirrhosis or even hepatocellular carcinoma. Success of drug treatment or appearance of severe side effects can also be influenced by specific genetic variants. All these aspects underscore the increasing importance of genetic variants, which in the future may help to identify patients at risk for disease progression or help to guide treatment decisions. In the present overview, specific frequent genetic variants are summarized that play roles in monogenetic liver diseases, forms of intrahepatic cholestasis, gallstone development, fatty liver disease, drug-induced liver injury, and liver disease progression as well as hepatocellular carcinoma development. PMID:26666282

  7. [Takotsubo cardiomyopathy: origin and variants].

    PubMed

    Aronov, D M

    2008-01-01

    This literature review is devoted to the " tako-tsubo " cardiomyopathy - rare type of cardiomyopathy characterized by transient myocardial stunning. In acute phase the disease resembles myocardial infarction. However no involvement of coronary arteries is found at angiography. Echocardiography and ventriculography reveal a- or - hypokinesia of various parts of the left ventricle. Classic (initial) variant of the disease is associated with concomitant apical akinesia and hyperkinesis of basal segments. The heart acquires a distinctive configuration with ballooning apex which resembles device used to trap octopus. The author refers to described by him 11 cases of myocardial damage with infarct-like clinic without changes of coronary arteries in healthy men younger than 35 years (D.M.Aronow, 1968, 1974). These cases occurred during severe physical stress and had in their basis hypercatecholaminemia which led to reversible myocardial damage of the myocardium which corresponded to modern concept of myocardial stunning. During exercise tests these patients had 3 times greater increase of urinal epinephrine excretion compared with 61 patients of the same age with atherosclerotic heart disease. PMID:18991836

  8. Chemokine gene variants in schizophrenia.

    PubMed

    Dasdemir, Selcuk; Kucukali, Cem Ismail; Bireller, Elif Sinem; Tuzun, Erdem; Cakmakoglu, Bedia

    2016-08-01

    Background Chemokines are known to play a major role in driving inflammation and immune responses in several neuroinflammatory diseases, including multiple sclerosis, Alzheimer's disease and Parkinson's disease. Inflammation has also been implicated in the pathogenesis of schizophrenia. Aim We aimed to investigate a potential link between chemokines and schizophrenia and analyze the role of MCP-1-A2518G, SDF-1-3'A, CCR5-delta32, CCR5-A55029G, CXCR4-C138T and CCR2-V64I gene polymorphisms in the Turkish population. Methods Genotyping was conducted by PCR-RFLP based on 140 patients and 123 unrelated healthy controls to show the relation between chemokine gene variants and schizophrenia risk. Results Frequencies of CCR5-A55029G A genotypes and CCR5-A55029G AG genotypes were found higher in patients than the controls and even also CCR2-V64I WT: CCR5-A55029G A and CCR2-V64I 64I: CCR5-A55029G A haplotypes significantly associated according to Bonferroni correction. However, no significant association was found for any of the other polymorphisms with the risk of schizophrenia. Conclusions Our findings suggest that CCR5-A55029G polymorphisms and CCR2-V64I WT: CCR5-A55029G A and CCR2-V64I 64I: CCR5-A55029G A haplotypes might have association with schizophrenia pathogenesis. PMID:26906930

  9. Generation of Antigenic Variants via Gene Conversion: Evidence for Recombination Fitness Selection at the Locus Level in Anaplasma marginale▿

    PubMed Central

    Futse, James E.; Brayton, Kelly A.; Nydam, Seth D.; Palmer, Guy H.

    2009-01-01

    Multiple bacterial and protozoal pathogens utilize gene conversion to generate antigenically variant surface proteins to evade immune clearance and establish persistent infection. Both the donor alleles that encode the variants following recombination into an expression site and the donor loci themselves are under evolutionary selection: the alleles that encode variants that are sufficiently antigenically unique yet retain growth fitness and the loci that allow efficient recombination. We examined allelic usage in generating Anaplasma marginale variants during in vivo infection in the mammalian reservoir host and identified preferential usage of specific alleles in the absence of immune selective pressure, consistent with certain individual alleles having a fitness advantage for in vivo growth. In contrast, the loci themselves appear to have been essentially equally selected for donor function in gene conversion with no significant effect of locus position relative to the expression site or origin of replication. This pattern of preferential allelic usage but lack of locus effect was observed independently for Msp2 and Msp3 variants, both generated by gene conversion. Furthermore, there was no locus effect observed when a single locus contained both msp2 and msp3 alleles in a tail-to-tail orientation flanked by a repeat. These experimental results support the hypothesis that predominance of specific variants reflects in vivo fitness as determined by the encoding allele, independent of locus structure and chromosomal position. Identification of highly fit variants provides targets for vaccines that will prevent the high-level bacteremia associated with acute disease. PMID:19487473

  10. Histone H3 Variants in Trichomonas vaginalis.

    PubMed

    Zubácová, Zuzana; Hostomská, Jitka; Tachezy, Jan

    2012-05-01

    The parabasalid protist Trichomonas vaginalis is a widespread parasite that affects humans, frequently causing vaginitis in infected women. Trichomonad mitosis is marked by the persistence of the nuclear membrane and the presence of an asymmetric extranuclear spindle with no obvious direct connection to the chromosomes. No centromeric markers have been described in T. vaginalis, which has prevented a detailed analysis of mitotic events in this organism. In other eukaryotes, nucleosomes of centromeric chromatin contain the histone H3 variant CenH3. The principal aim of this work was to identify a CenH3 homolog in T. vaginalis. We performed a screen of the T. vaginalis genome to retrieve sequences of canonical and variant H3 histones. Three variant histone H3 proteins were identified, and the subcellular localization of their epitope-tagged variants was determined. The localization of the variant TVAG_185390 could not be distinguished from that of the canonical H3 histone. The sequence of the variant TVAG_087830 closely resembled that of histone H3. The tagged protein colocalized with sites of active transcription, indicating that the variant TVAG_087830 represented H3.3 in T. vaginalis. The third H3 variant (TVAG_224460) was localized to 6 or 12 distinct spots at the periphery of the nucleus, corresponding to the number of chromosomes in G(1) phase and G(2) phase, respectively. We propose that this variant represents the centromeric marker CenH3 and thus can be employed as a tool to study mitosis in T. vaginalis. Furthermore, we suggest that the peripheral distribution of CenH3 within the nucleus results from the association of centromeres with the nuclear envelope throughout the cell cycle. PMID:22408228

  11. Nato-Pco Database

    NASA Astrophysics Data System (ADS)

    Wtv Gmbh

    This new CD-ROM is a reference database. It covers almost twenty years of non-military scientific/technical meetings and publications sponsored by the NATO Science Committee. It contains full references (with keywords and/or abstracts) to more than 30,000 contributions from scientists all over the world and is published in more than 1,000 volumes. With the easy-to-follow menu options of the retrieval software, access to the data is simple and fast. Updates are planned on a yearly basis.

  12. Developing customer databases.

    PubMed

    Rao, S K; Shenbaga, S

    2000-01-01

    There is a growing consensus among pharmaceutical companies that more product and customer-specific approaches to marketing and selling a new drug can result in substantial increases in sales. Marketers and researchers taking a proactive micro-marketing approach to identifying, profiling, and communicating with target customers are likely to facilitate such approaches and outcomes. This article provides a working framework for creating customer databases that can be effectively mined to achieve a variety of such marketing and sales force objectives. PMID:11209474

  13. EMU Lessons Learned Database

    NASA Technical Reports Server (NTRS)

    Matthews, Kevin M., Jr.; Crocker, Lori; Cupples, J. Scott

    2011-01-01

    As manned space exploration takes on the task of traveling beyond low Earth orbit, many problems arise that must be solved in order to make the journey possible. One major task is protecting humans from the harsh space environment. The current method of protecting astronauts during Extravehicular Activity (EVA) is through use of the specially designed Extravehicular Mobility Unit (EMU). As more rigorous EVA conditions need to be endured at new destinations, the suit will need to be tailored and improved in order to accommodate the astronaut. The Objective behind the EMU Lessons Learned Database(LLD) is to be able to create a tool which will assist in the development of next-generation EMUs, along with maintenance and improvement of the current EMU, by compiling data from Failure Investigation and Analysis Reports (FIARs) which have information on past suit failures. FIARs use a system of codes that give more information on the aspects of the failure, but if one is unfamiliar with the EMU they will be unable to decipher the information. A goal of the EMU LLD is to not only compile the information, but to present it in a user-friendly, organized, searchable database accessible to all familiarity levels with the EMU; both newcomers and veterans alike. The EMU LLD originally started as an Excel database, which allowed easy navigation and analysis of the data through pivot charts. Creating an entry requires access to the Problem Reporting And Corrective Action database (PRACA), which contains the original FIAR data for all hardware. FIAR data are then transferred to, defined, and formatted in the LLD. Work is being done to create a web-based version of the LLD in order to increase accessibility to all of Johnson Space Center (JSC), which includes converting entries from Excel to the HTML format. FIARs related to the EMU have been completed in the Excel version, and now focus has shifted to expanding FIAR data in the LLD to include EVA tools and support hardware such as

  14. Developing an orientation program.

    PubMed

    Edwards, K

    1999-01-01

    When the local area experienced tremendous growth and change, the radiology department at Maury Hospital in Columbia, Tennessee looked seriously at its orientation process in preparation for hiring additional personnel. It was an appropriate time for the department to review its orientation process and to develop a manual to serve as both a tool for supervisors and an ongoing reference for new employees. To gather information for the manual, supervisors were asked to identify information they considered vital for new employees to know concerning the daily operations of the department, its policies and procedures, the organizational structure of the hospital, and hospital and departmental computer systems. That information became the basis of the orientation manual, and provided an introduction to the hospital and radiology department; the structure of the organization; an overview of the radiology department; personnel information; operating procedures and computer systems; and various policies and procedures. With the manual complete, the radiology department concentrated on an orientation process that would meet the needs of supervisors who said they had trouble remembering the many details necessary to teach new employees. A pre-orientation checklist was developed, which contained the many details supervisors must handle between the time an employee is hired and arrives for work. The next step was the creation of a checklist for use by the supervisor during a new employee's first week on the job. A final step in the hospital's orientation program is to have each new employee evaluate the entire orientation process. That information is then used to update and revise the manual. PMID:10346648

  15. Databases as an information service

    NASA Technical Reports Server (NTRS)

    Vincent, D. A.

    1983-01-01

    The relationship of databases to information services, and the range of information services users and their needs for information is explored and discussed. It is argued that for database information to be valuable to a broad range of users, it is essential that access methods be provided that are relatively unstructured and natural to information services users who are interested in the information contained in databases, but who are not willing to learn and use traditional structured query languages. Unless this ease of use of databases is considered in the design and application process, the potential benefits from using database systems may not be realized.

  16. Vcfanno: fast, flexible annotation of genetic variants.

    PubMed

    Pedersen, Brent S; Layer, Ryan M; Quinlan, Aaron R

    2016-01-01

    The integration of genome annotations is critical to the identification of genetic variants that are relevant to studies of disease or other traits. However, comprehensive variant annotation with diverse file formats is difficult with existing methods. Here we describe vcfanno, which flexibly extracts and summarizes attributes from multiple annotation files and integrates the annotations within the INFO column of the original VCF file. By leveraging a parallel "chromosome sweeping" algorithm, we demonstrate substantial performance gains by annotating ~85,000 variants per second with 50 attributes from 17 commonly used genome annotation resources. Vcfanno is available at https://github.com/brentp/vcfanno under the MIT license. PMID:27250555

  17. IPD: the Immuno Polymorphism Database.

    PubMed

    Robinson, James; Marsh, Steven G E

    2007-01-01

    The Immuno Polymorphism Database (IPD) (http://www.ebi.ac.uk/ipd/) is a set of specialist databases related to the study of polymorphic genes in the immune system. IPD currently consists of four databases: IPD-KIR, contains the allelic sequences of killer cell immunoglobulin-like receptors (KIRs); IPD-MHC, a database of sequences of the major histocompatibility complex (MHC) of different species; IPD-HPA, alloantigens expressed only on platelets; and IPD-ESTAB, which provides access to the European Searchable Tumour Cell Line Database, a cell bank of immunologically characterized melanoma cell lines. The IPD project works with specialist groups or nomenclature committees who provide and curate individual sections before they are submitted to IPD for online publication. The IPD project stores all the data in a set of related databases. Those sections with similar data, such as IPD-KIR and IPD-MHC, share the same database structure. PMID:18449992

  18. Curcumin Resource Database.

    PubMed

    Kumar, Anil; Chetia, Hasnahana; Sharma, Swagata; Kabiraj, Debajyoti; Talukdar, Narayan Chandra; Bora, Utpal

    2015-01-01

    Curcumin is one of the most intensively studied diarylheptanoid, Curcuma longa being its principal producer. This apart, a class of promising curcumin analogs has been generated in laboratories, aptly named as Curcuminoids which are showing huge potential in the fields of medicine, food technology, etc. The lack of a universal source of data on curcumin as well as curcuminoids has been felt by the curcumin research community for long. Hence, in an attempt to address this stumbling block, we have developed Curcumin Resource Database (CRDB) that aims to perform as a gateway-cum-repository to access all relevant data and related information on curcumin and its analogs. Currently, this database encompasses 1186 curcumin analogs, 195 molecular targets, 9075 peer reviewed publications, 489 patents and 176 varieties of C. longa obtained by extensive data mining and careful curation from numerous sources. Each data entry is identified by a unique CRDB ID (identifier). Furnished with a user-friendly web interface and in-built search engine, CRDB provides well-curated and cross-referenced information that are hyperlinked with external sources. CRDB is expected to be highly useful to the researchers working on structure as well as ligand-based molecular design of curcumin analogs. PMID:26220923

  19. Paleomagnetic database search possible

    NASA Astrophysics Data System (ADS)

    Harbert, William

    I have recently finished an on-line search program which allows remote users to search the “Abase” ASCII version of the World Paleomagnetic Database developed by Lock and McElhinny [1991]. The program is very simple to use and will search the Soviet, non-Soviet, rock unit, and reference databases and create output files that can be downloaded back to a researcher's local system using the ftp command.To use Search, telnet to 130.49.3.1 (earth.eps.pitt.edu) and login as the user “Search.rdquo There is no password, and the user is asked a series of questions, which define the geographic region and ages of interest. The program will also ask for an identifier with which to create the output file names. The program has three modes of operation: text-only, Tektronix graphics, or X11l/R5 graphics; the proper choice depends on the computer hardware that is used by the searcher.

  20. UGTA Photograph Database

    SciTech Connect

    NSTec Environmental Restoration

    2009-04-20

    One of the advantages of the Nevada Test Site (NTS) is that most of the geologic and hydrologic features such as hydrogeologic units (HGUs), hydrostratigraphic units (HSUs), and faults, which are important aspects of flow and transport modeling, are exposed at the surface somewhere in the vicinity of the NTS and thus are available for direct observation. However, due to access restrictions and the remote locations of many of the features, most Underground Test Area (UGTA) participants cannot observe these features directly in the field. Fortunately, National Security Technologies, LLC, geologists and their predecessors have photographed many of these features through the years. During fiscal year 2009, work was done to develop an online photograph database for use by the UGTA community. Photographs were organized, compiled, and imported into Adobe® Photoshop® Elements 7. The photographs were then assigned keyword tags such as alteration type, HGU, HSU, location, rock feature, rock type, and stratigraphic unit. Some fully tagged photographs were then selected and uploaded to the UGTA website. This online photograph database provides easy access for all UGTA participants and can help “ground truth” their analytical and modeling tasks. It also provides new participants a resource to more quickly learn the geology and hydrogeology of the NTS.

  1. Asbestos Exposure Assessment Database

    NASA Technical Reports Server (NTRS)

    Arcot, Divya K.

    2010-01-01

    Exposure to particular hazardous materials in a work environment is dangerous to the employees who work directly with or around the materials as well as those who come in contact with them indirectly. In order to maintain a national standard for safe working environments and protect worker health, the Occupational Safety and Health Administration (OSHA) has set forth numerous precautionary regulations. NASA has been proactive in adhering to these regulations by implementing standards which are often stricter than regulation limits and administering frequent health risk assessments. The primary objective of this project is to create the infrastructure for an Asbestos Exposure Assessment Database specific to NASA Johnson Space Center (JSC) which will compile all of the exposure assessment data into a well-organized, navigable format. The data includes Sample Types, Samples Durations, Crafts of those from whom samples were collected, Job Performance Requirements (JPR) numbers, Phased Contrast Microscopy (PCM) and Transmission Electron Microscopy (TEM) results and qualifiers, Personal Protective Equipment (PPE), and names of industrial hygienists who performed the monitoring. This database will allow NASA to provide OSHA with specific information demonstrating that JSC s work procedures are protective enough to minimize the risk of future disease from the exposures. The data has been collected by the NASA contractors Computer Sciences Corporation (CSC) and Wyle Laboratories. The personal exposure samples were collected from devices worn by laborers working at JSC and by building occupants located in asbestos-containing buildings.

  2. PHENIX RPC Production Database

    NASA Astrophysics Data System (ADS)

    Jones, Timothy

    2008-10-01

    The Pioneering High Energy Nuclear Interaction eXperiment (PHENIX) is located on the Relativistic Heavy Ion Collider (RHIC) ring at Brookhaven National Laboratory. A primary physics goal that can be studied by PHENIX is the origin of the proton spin. One of the types of rare events looked for in the moun arms at PHENIX are single high transverse momentum mouns, which tend to result from the decay of a W bozon. Resistive Plate Chambers (RPCs) will be used as a level 1 trigger to select these events from a large background of low transverse momentum muons. As these RPCs are assembled it is necessary to keep track of the individual parts of each RPC as well as data from various quality assurance tests in a way that will allow the information to be easily accessible years to come as the RPCs are being used. This is done through the use of a database and web page interface that can be used to enter data about the RPCs or to look up information from tests. I will be presenting on how we keep track of the RPCs, their parts, and data from quality assurance tests as they are being assembled as well as how we can retrieve this data after it has been stored in the database.

  3. Reactions of oriented molecules.

    PubMed

    Brooks, P R

    1976-07-01

    Beams of oriented molecules have been used to directly study geometrical requirements in chemical reactions. These studies have shown that reactivity is much greater in some orientations than others and demonstrated the existence of steric effects. For some reactions portions of the orientation results are in good accord with traditional views of steric hindrance, but for others it is clear that our chemical intuition needs recalibrating. Indeed, the information gained from simultaneously orienting the reactants and observing the scattering angle of the products may lead to new insights about the detailed mechanism of certain reactions. Further work must be done to extend the scope and detail of the studies described here. More detailed information is needed on the CH(3)I reaction and the CF(3)I reaction. The effects of alkyl groups of various sizes and alkali metals of various sizes are of interest. In addition, reactions where a long-lived complex is formed should be studied to see if orientation is important. Finally, it would be of interest to apply the technique to the sort of reactions that led to our interest in the first place: the S(N)2 displacements in alkyl halides where the fascinating Walden inversion occurs. PMID:17793988

  4. Petrophysical database of Uganda

    NASA Astrophysics Data System (ADS)

    Ruotoistenmäki, Tapio; Birungi, Nelson R.

    2015-06-01

    The petrophysical database of Uganda contains data on ca. 5800 rock samples collected and analyzed during 2009-2012 in international geological and geophysical projects covering the main part of the land area of Uganda. The parameters included are the susceptibilities and densities of all available field samples. Susceptibilities were measured from the samples from three directions. Using these parameters, we also calculated the ratios of susceptibility maxima/minima reflecting direction homogeneity of magnetic minerals, and estimated the iron content of paramagnetic samples and the magnetite content of ferrimagnetic samples. Statistical and visual analysis of the petrophysical data of Uganda demonstrated their wide variation, thus emphasizing their importance in analyzing the bedrock variations in three dimensions. Using the density-susceptibility diagram, the data can be classified into six main groups: 1. A low density and susceptibility group, consisting of sedimentary and altered rocks. 2. Low-susceptibility, felsic rocks (e.g. quartzites and metasandstones). 3. Paramagnetic, felsic rocks (e.g. granites). 4. Ferrimagnetic, magnetite-containing felsic rocks (e.g. granites). 5. Paramagnetic mafic rocks (e.g. amphibolites and dolerites). 6. Ferrimagnetic, mafic rocks containing magnetite and high-density mafic minerals (mainly dolerites). Moreover, analysis revealed that the parameter distributions of even a single rock type (e.g. granites) can be very variable, forming separate clusters. This demonstrates that the simple calculation of density or susceptibility averages of rock types can be highly erratic. For example, the average can lie between two groups, where only few, if any, samples exist. Therefore, estimation of the representative density and susceptibility must be visually verified from these diagrams. The areal distribution of parameters and their calculated derivatives generally correlate well with the regional distribution of lithological and

  5. Kuru and "new variant" CJD.

    PubMed

    Verdrager, J

    1997-09-01

    Acquired transmissible spongiform encephalopathies in humans include Kuru (a disease which was associated with ritualistic cannibalism in Papua New Guinea), iatrogenic Creutzfeldt-Jakob disease and a newly recognized variant form of Creutzfeldt-Jakob disease (nvCJD). Clinical and neuropathological features of nvCJD are reminiscent of Kuru: early and progressive cerebellar ataxia and numerous characteristic Kuru-type amyloid plaques surrounded by spongiform change. In contrast to typical cases of sporadic CJD, Kuru and nvCJD affect young patients. The newly recognized form of CJD has been identified in ten young people in the UK in 1996, approximately 10 years after the beginning of the bovine spongiform encephalopathy (BSE) epidemic in the UK. Molecular analysis has shown that nvCJD has strain characteristics that are distinct from other types of CJD but similar to those of BSE. In the UK an estimated half a million BSE-infected cows entered the human food chain before the bovine offal ban of 1989. To be effective the oral route probably requires high-infectivity titers which are encountered only in the brain, spinal cord and eyes of naturally infected cows. In patients with Kuru, titers of more than 10(8) infectious doses per gram were reported in the brain tissues. As a result of the estimated very long incubation period of nvCJD (10 to 30 years or more) the predicted nvCJD epidemic will have the shape of a normal distribution curve with a peak expected in 2009. The epidemic may extend until 2030. There is already an example to illustrate such a curve in its descending line: the decline of Kuru deaths following the interruption of ritual cannibalism. PMID:9561604

  6. Concealment of sexual orientation.

    PubMed

    Sylva, David; Rieger, Gerulf; Linsenmeier, Joan A W; Bailey, J Michael

    2010-02-01

    Sex-atypical behaviors may be used to identify a person as homosexual. To shield themselves from prejudice, homosexual people may attempt to conceal these behaviors. It is not clear how effectively they can do so. In Study 1, we asked homosexual participants to conceal their sex-atypical behaviors while talking about the weather. Raters watched videos of the participants and judged the likelihood that each participant was homosexual. Homosexual participants were able to partially conceal signs of their orientation, but they remained distinguishable from heterosexual participants. In Study 2, we tested the ability to conceal signs of one's sexual orientation in a more demanding situation: a mock job interview. In this scenario, homosexual men were even less effective at concealing their orientation. Higher cognitive demands in this new situation may have interfered with their ability to conceal. PMID:19169803

  7. FTO variant associated with malformation syndrome.

    PubMed

    Rohena, Luis; Lawson, Michelle; Guzman, Edwin; Ganapathi, Mythily; Cho, Megan T; Haverfield, Eden; Anyane-Yeboa, Kwame

    2016-04-01

    Common FTO variants are associated with obesity. However, it has recently been shown that homozygous FTO c.947G>A variant, which predicts p.R316Q, and c.956C>T, which predicts p.S319F, are associated with a malformation syndrome inherited in an autosomal recessive pattern. We present a similar homozygous FTO c.965G>A variant that predicts p.R322Q, associated with a lethal malformation syndrome in a consanguineous Yemeni family. Functional studies showed that the p.R316Q, p.S219F, and p.R322Q variants render the FTO protein inactive. We further expand on the phenotype of homozygous FTO loss-of-function mutations to include eye abnormalities, gingival overgrowth, craniosynostosis, and cutaneous photosensitivity. PMID:26697951

  8. PredictSNP2: A Unified Platform for Accurately Evaluating SNP Effects by Exploiting the Different Characteristics of Variants in Distinct Genomic Regions

    PubMed Central

    Brezovský, Jan

    2016-01-01

    An important message taken from human genome sequencing projects is that the human population exhibits approximately 99.9% genetic similarity. Variations in the remaining parts of the genome determine our identity, trace our history and reveal our heritage. The precise delineation of phenotypically causal variants plays a key role in providing accurate personalized diagnosis, prognosis, and treatment of inherited diseases. Several computational methods for achieving such delineation have been reported recently. However, their ability to pinpoint potentially deleterious variants is limited by the fact that their mechanisms of prediction do not account for the existence of different categories of variants. Consequently, their output is biased towards the variant categories that are most strongly represented in the variant databases. Moreover, most such methods provide numeric scores but not binary predictions of the deleteriousness of variants or confidence scores that would be more easily understood by users. We have constructed three datasets covering different types of disease-related variants, which were divided across five categories: (i) regulatory, (ii) splicing, (iii) missense, (iv) synonymous, and (v) nonsense variants. These datasets were used to develop category-optimal decision thresholds and to evaluate six tools for variant prioritization: CADD, DANN, FATHMM, FitCons, FunSeq2 and GWAVA. This evaluation revealed some important advantages of the category-based approach. The results obtained with the five best-performing tools were then combined into a consensus score. Additional comparative analyses showed that in the case of missense variations, protein-based predictors perform better than DNA sequence-based predictors. A user-friendly web interface was developed that provides easy access to the five tools’ predictions, and their consensus scores, in a user-understandable format tailored to the specific features of different categories of variations

  9. PredictSNP2: A Unified Platform for Accurately Evaluating SNP Effects by Exploiting the Different Characteristics of Variants in Distinct Genomic Regions.

    PubMed

    Bendl, Jaroslav; Musil, Miloš; Štourač, Jan; Zendulka, Jaroslav; Damborský, Jiří; Brezovský, Jan

    2016-05-01

    An important message taken from human genome sequencing projects is that the human population exhibits approximately 99.9% genetic similarity. Variations in the remaining parts of the genome determine our identity, trace our history and reveal our heritage. The precise delineation of phenotypically causal variants plays a key role in providing accurate personalized diagnosis, prognosis, and treatment of inherited diseases. Several computational methods for achieving such delineation have been reported recently. However, their ability to pinpoint potentially deleterious variants is limited by the fact that their mechanisms of prediction do not account for the existence of different categories of variants. Consequently, their output is biased towards the variant categories that are most strongly represented in the variant databases. Moreover, most such methods provide numeric scores but not binary predictions of the deleteriousness of variants or confidence scores that would be more easily understood by users. We have constructed three datasets covering different types of disease-related variants, which were divided across five categories: (i) regulatory, (ii) splicing, (iii) missense, (iv) synonymous, and (v) nonsense variants. These datasets were used to develop category-optimal decision thresholds and to evaluate six tools for variant prioritization: CADD, DANN, FATHMM, FitCons, FunSeq2 and GWAVA. This evaluation revealed some important advantages of the category-based approach. The results obtained with the five best-performing tools were then combined into a consensus score. Additional comparative analyses showed that in the case of missense variations, protein-based predictors perform better than DNA sequence-based predictors. A user-friendly web interface was developed that provides easy access to the five tools' predictions, and their consensus scores, in a user-understandable format tailored to the specific features of different categories of variations. To

  10. Topography and pigeon orientation

    NASA Technical Reports Server (NTRS)

    Wagner, G.

    1972-01-01

    Two types of homing experiments with pigeons to determine the influence of topographical features on the orientation behavior of the birds are discussed. The releases and following were conducted by ground experiments in which the birds are tracked by visual observation at points of topographical interest and the helicopter method by which the birds are tracked throughout the entire flight. The ground experiments showed a strong influence of topographical features on initial orientation. The helicopter experiments showed that the ground experiments do not provide adequate information on the manner in which homing occurs.

  11. Orientation and Alignment Echoes

    NASA Astrophysics Data System (ADS)

    Karras, G.; Hertz, E.; Billard, F.; Lavorel, B.; Hartmann, J.-M.; Faucher, O.; Gershnabel, Erez; Prior, Yehiam; Averbukh, Ilya Sh.

    2015-04-01

    We present one of the simplest classical systems featuring the echo phenomenon—a collection of randomly oriented free rotors with dispersed rotational velocities. Following excitation by a pair of time-delayed impulsive kicks, the mean orientation or alignment of the ensemble exhibits multiple echoes and fractional echoes. We elucidate the mechanism of the echo formation by the kick-induced filamentation of phase space, and provide the first experimental demonstration of classical alignment echoes in a thermal gas of CO2 molecules excited by a pair of femtosecond laser pulses.

  12. A Scalable Database Infrastructure

    NASA Astrophysics Data System (ADS)

    Arko, R. A.; Chayes, D. N.

    2001-12-01

    The rapidly increasing volume and complexity of MG&G data, and the growing demand from funding agencies and the user community that it be easily accessible, demand that we improve our approach to data management in order to reach a broader user-base and operate more efficient and effectively. We have chosen an approach based on industry-standard relational database management systems (RDBMS) that use community-wide data specifications, where there is a clear and well-documented external interface that allows use of general purpose as well as customized clients. Rapid prototypes assembled with this approach show significant advantages over the traditional, custom-built data management systems that often use "in-house" legacy file formats, data specifications, and access tools. We have developed an effective database prototype based a public domain RDBMS (PostgreSQL) and metadata standard (FGDC), and used it as a template for several ongoing MG&G database management projects - including ADGRAV (Antarctic Digital Gravity Synthesis), MARGINS, the Community Review system of the Digital Library for Earth Science Education, multibeam swath bathymetry metadata, and the R/V Maurice Ewing onboard acquisition system. By using standard formats and specifications, and working from a common prototype, we are able to reuse code and deploy rapidly. Rather than spend time on low-level details such as storage and indexing (which are built into the RDBMS), we can focus on high-level details such as documentation and quality control. In addition, because many commercial off-the-shelf (COTS) and public domain data browsers and visualization tools have built-in RDBMS support, we can focus on backend development and leave the choice of a frontend client(s) up to the end user. While our prototype is running under an open source RDBMS on a single processor host, the choice of standard components allows this implementation to scale to commercial RDBMS products and multiprocessor servers as

  13. National Geochronological Database

    USGS Publications Warehouse

    Revised by Sloan, Jan; Henry, Christopher D.; Hopkins, Melanie; Ludington, Steve; Original database by Zartman, Robert E.; Bush, Charles A.; Abston, Carl

    2003-01-01

    The National Geochronological Data Base (NGDB) was established by the United States Geological Survey (USGS) to collect and organize published isotopic (also known as radiometric) ages of rocks in the United States. The NGDB (originally known as the Radioactive Age Data Base, RADB) was started in 1974. A committee appointed by the Director of the USGS was given the mission to investigate the feasibility of compiling the published radiometric ages for the United States into a computerized data bank for ready access by the user community. A successful pilot program, which was conducted in 1975 and 1976 for the State of Wyoming, led to a decision to proceed with the compilation of the entire United States. For each dated rock sample reported in published literature, a record containing information on sample location, rock description, analytical data, age, interpretation, and literature citation was constructed and included in the NGDB. The NGDB was originally constructed and maintained on a mainframe computer, and later converted to a Helix Express relational database maintained on an Apple Macintosh desktop computer. The NGDB and a program to search the data files were published and distributed on Compact Disc-Read Only Memory (CD-ROM) in standard ISO 9660 format as USGS Digital Data Series DDS-14 (Zartman and others, 1995). As of May 1994, the NGDB consisted of more than 18,000 records containing over 30,000 individual ages, which is believed to represent approximately one-half the number of ages published for the United States through 1991. Because the organizational unit responsible for maintaining the database was abolished in 1996, and because we wanted to provide the data in more usable formats, we have reformatted the data, checked and edited the information in some records, and provided this online version of the NGDB. This report describes the changes made to the data and formats, and provides instructions for the use of the database in geographic

  14. Integration of NASA/GSFC and USGS Rock Magnetic Databases.

    NASA Astrophysics Data System (ADS)

    Nazarova, K. A.; Glen, J. M.

    2004-05-01

    A global Magnetic Petrology Database (MPDB) was developed and continues to be updated at NASA/Goddard Space Flight Center. The purpose of this database is to provide the geomagnetic community with a comprehensive and user-friendly method of accessing magnetic petrology data via the Internet for a more realistic interpretation of satellite (as well as aeromagnetic and ground) lithospheric magnetic anomalies. The MPDB contains data on rocks from localities around the world (about 19,000 samples) including the Ukranian and Baltic Shields, Kamchatka, Iceland, Urals Mountains, etc. The MPDB is designed, managed and presented on the web as a research oriented database. Several database applications have been specifically developed for data manipulation and analysis of the MPDB. The geophysics unit at the USGS in Menlo Park has over 17,000 rock-property data, largely from sites within the western U.S. This database contains rock-density and rock-magnetic parameters collected for use in gravity and magnetic field modeling, and paleomagnetic studies. Most of these data were taken from surface outcrops and together they span a broad range of rock types. Measurements were made either in-situ at the outcrop, or in the laboratory on hand samples and paleomagnetic cores acquired in the field. The USGS and NASA/GSFC data will be integrated as part of an effort to provide public access to a single, uniformly maintained database. Due to the large number of data and the very large area sampled, the database can yield rock-property statistics on a broad range of rock types; it is thus applicable to study areas beyond the geographic scope of the database. The intent of this effort is to provide incentive for others to further contribute to the database, and a tool with which the geophysical community can entertain studies formerly precluded.

  15. Orientational order in disordered superconductors

    SciTech Connect

    Toner, J. )

    1991-05-13

    Orientational order in weakly pinned Abrikosov flux lattices is studied, taking into account two heretofore neglected effects: dislocations and orientational couplings to the underlying lattice. Without orientational couplings, arbitrarily weak pinning destroys long-ranged orientational order for all spatial dimensions {ital d}{lt}4. Orientational couplings stabilize long-ranged orientational order. For fields along an axis of {ital fourfold} symmetry, {ital sixfold} (hexatic) orientational order is described by a random-field Ising model, and so does not occur in {ital d}=2 (thin films) but does in {ital d}=3 (bulk).

  16. Characterizing Genetic Variants for Clinical Action

    PubMed Central

    Ramos, Erin M.; Din-Lovinescu, Corina; Berg, Jonathan S.; Brooks, Lisa D.; Duncanson, Audrey; Dunn, Michael; Good, Peter; Hubbard, Tim; Jarvik, Gail P.; O'Donnell, Christopher; Sherry, Stephen T.; Aronson, Naomi; Biesecker, Leslie G.; Blumberg, Bruce; Calonge, Ned; Colhoun, Helen M.; Epstein, Robert S.; Flicek, Paul; Gordon, Erynn S.; Green, Eric D.; Green, Robert C.; Hurles, Matthew; Kawamoto, Kensaku; Knaus, William; Ledbetter, David H.; Levy, Howard P.; Lyon, Elaine; Maglott, Donna; McLeod, Howard L.; Rahman, Nazneen; Randhawa, Gurvaneet; Wicklund, Catherine; Manolio, Teri A.; Chisholm, Rex L.; Williams, Marc S.

    2014-01-01

    Genome-wide association studies, DNA sequencing studies, and other genomic studies are finding an increasing number of genetic variants associated with clinical phenotypes that may be useful in developing diagnostic, preventive, and treatment strategies for individual patients. However, few common variants have been integrated into routine clinical practice. The reasons for this are several, but two of the most significant are limited evidence about the clinical implications of the variants and a lack of a comprehensive knowledge base that captures genetic variants, their phenotypic associations, and other pertinent phenotypic information that is openly accessible to clinical groups attempting to interpret sequencing data. As the field of medicine begins to incorporate genome-scale analysis into clinical care, approaches need to be developed for collecting and characterizing data on the clinical implications of variants, developing consensus on their actionability, and making this information available for clinical use. The National Human Genome Research Institute (NHGRI) and the Wellcome Trust thus convened a workshop to consider the processes and resources needed to: 1) identify clinically valid genetic variants; 2) decide whether they are actionable and what the action should be; and 3) provide this information for clinical use. This commentary outlines the key discussion points and recommendations from the workshop. PMID:24634402

  17. Genetic Variants Associated with Colorectal Adenoma Susceptibility

    PubMed Central

    Abulí, Anna; Castells, Antoni; Bujanda, Luis; Lozano, Juan José; Bessa, Xavier; Hernández, Cristina; Álvarez-Urturi, Cristina; Pellisé, Maria; Esteban-Jurado, Clara; Hijona, Elizabeth; Burón, Andrea; Macià, Francesc; Grau, Jaume; Guayta, Rafael

    2016-01-01

    Background Common low-penetrance genetic variants have been consistently associated with colorectal cancer risk. Aim To determine if these genetic variants are associated also with adenoma susceptibility and may improve selection of patients with increased risk for advanced adenomas and/or multiplicity (≥ 3 adenomas). Methods We selected 1,326 patients with increased risk for advanced adenomas and/or multiplicity and 1,252 controls with normal colonoscopy from population-based colorectal cancer screening programs. We conducted a case-control association study analyzing 30 colorectal cancer susceptibility variants in order to investigate the contribution of these variants to the development of subsequent advanced neoplasia and/or multiplicity. Results We found that 14 of the analyzed genetic variants showed a statistically significant association with advanced adenomas and/or multiplicity: the probability of developing these lesions increased with the number of risk alleles reaching a 2.3-fold risk increment in individuals with ≥ 17 risk alleles. Conclusions Nearly half of the genetic variants associated with colorectal cancer risk are also related to advanced adenoma and/or multiplicity predisposition. Assessing the number of risk alleles in individuals within colorectal cancer screening programs may help to identify better a subgroup with increased risk for advanced neoplasia and/or multiplicity in the general population. PMID:27078840

  18. Discovery of rare variants for complex phenotypes.

    PubMed

    Kosmicki, Jack A; Churchhouse, Claire L; Rivas, Manuel A; Neale, Benjamin M

    2016-06-01

    With the rise of sequencing technologies, it is now feasible to assess the role rare variants play in the genetic contribution to complex trait variation. While some of the earlier targeted sequencing studies successfully identified rare variants of large effect, unbiased gene discovery using exome sequencing has experienced limited success for complex traits. Nevertheless, rare variant association studies have demonstrated that rare variants do contribute to phenotypic variability, but sample sizes will likely have to be even larger than those of common variant association studies to be powered for the detection of genes and loci. Large-scale sequencing efforts of tens of thousands of individuals, such as the UK10K Project and aggregation efforts such as the Exome Aggregation Consortium, have made great strides in advancing our knowledge of the landscape of rare variation, but there remain many considerations when studying rare variation in the context of complex traits. We discuss these considerations in this review, presenting a broad range of topics at a high level as an introduction to rare variant analysis in complex traits including the issues of power, study design, sample ascertainment, de novo variation, and statistical testing approaches. Ultimately, as sequencing costs continue to decline, larger sequencing studies will yield clearer insights into the biological consequence of rare mutations and may reveal which genes play a role in the etiology of complex traits. PMID:27221085

  19. Conditionally replicating HIV and SIV variants.

    PubMed

    Das, Atze T; Berkhout, Ben

    2016-05-01

    Conditionally replicating human immunodeficiency virus (HIV) and simian immunodeficiency virus (SIV) variants that can be switched on and off at will are attractive tools for HIV and SIV research. We constructed HIV and SIV variants in which the natural transcription control mechanism was replaced by the doxycycline (dox)-inducible Tet-On gene expression mechanism. These HIV-rtTA and SIV-rtTA variants are fully replication-competent, but replication is critically dependent on dox administration. We here describe how the dox-dependent virus variants may improve the safety of live-attenuated virus vaccines and how they can be used to study the immune responses that correlate with vaccine-induced protection. Furthermore, we review how these variants were initially designed and subsequently optimized by spontaneous viral evolution. These efforts yielded efficiently replicating and tightly dox-controlled HIV-rtTA and SIV-rtTA variants that replicate in a variety of cell and tissue culture systems, and in human immune system (HIS) mice and macaques, respectively. These viruses can be used as a tool in HIV and SIV biology studies and in vaccine research. We review how HIV-rtTA and SIV-rtTA were used to study the role of the viral TAR and Tat elements in virus replication. PMID:25982510

  20. Demography and the Age of Rare Variants

    PubMed Central

    Mathieson, Iain; McVean, Gil

    2014-01-01

    Large whole-genome sequencing projects have provided access to much rare variation in human populations, which is highly informative about population structure and recent demography. Here, we show how the age of rare variants can be estimated from patterns of haplotype sharing and how these ages can be related to historical relationships between populations. We investigate the distribution of the age of variants occurring exactly twice ( variants) in a worldwide sample sequenced by the 1000 Genomes Project, revealing enormous variation across populations. The median age of haplotypes carrying variants is 50 to 160 generations across populations within Europe or Asia, and 170 to 320 generations within Africa. Haplotypes shared between continents are much older with median ages for haplotypes shared between Europe and Asia ranging from 320 to 670 generations. The distribution of the ages of haplotypes is informative about their demography, revealing recent bottlenecks, ancient splits, and more modern connections between populations. We see the effect of selection in the observation that functional variants are significantly younger than nonfunctional variants of the same frequency. This approach is relatively insensitive to mutation rate and complements other nonparametric methods for demographic inference. PMID:25101869