Science.gov

Sample records for comprehensive database including

  1. The comprehensive peptaibiotics database.

    PubMed

    Stoppacher, Norbert; Neumann, Nora K N; Burgstaller, Lukas; Zeilinger, Susanne; Degenkolb, Thomas; Brückner, Hans; Schuhmacher, Rainer

    2013-05-01

    Peptaibiotics are nonribosomally biosynthesized peptides, which - according to definition - contain the marker amino acid α-aminoisobutyric acid (Aib) and possess antibiotic properties. Being known since 1958, a constantly increasing number of peptaibiotics have been described and investigated with a particular emphasis on hypocrealean fungi. Starting from the existing online 'Peptaibol Database', first published in 1997, an exhaustive literature survey of all known peptaibiotics was carried out and resulted in a list of 1043 peptaibiotics. The gathered information was compiled and used to create the new 'The Comprehensive Peptaibiotics Database', which is presented here. The database was devised as a software tool based on Microsoft (MS) Access. It is freely available from the internet at http://peptaibiotics-database.boku.ac.at and can easily be installed and operated on any computer offering a Windows XP/7 environment. It provides useful information on characteristic properties of the peptaibiotics included such as peptide category, group name of the microheterogeneous mixture to which the peptide belongs, amino acid sequence, sequence length, producing fungus, peptide subfamily, molecular formula, and monoisotopic mass. All these characteristics can be used and combined for automated search within the database, which makes The Comprehensive Peptaibiotics Database a versatile tool for the retrieval of valuable information about peptaibiotics. Sequence data have been considered as to December 14, 2012. PMID:23681723

  2. SuperCYP: a comprehensive database on Cytochrome P450 enzymes including a tool for analysis of CYP-drug interactions

    PubMed Central

    Preissner, Saskia; Kroll, Katharina; Dunkel, Mathias; Senger, Christian; Goldsobel, Gady; Kuzman, Daniel; Guenther, Stefan; Winnenburg, Rainer; Schroeder, Michael; Preissner, Robert

    2010-01-01

    Much of the information on the Cytochrome P450 enzymes (CYPs) is spread across literature and the internet. Aggregating knowledge about CYPs into one database makes the search more efficient. Text mining on 57 CYPs and drugs led to a mass of papers, which were screened manually for facts about metabolism, SNPs and their effects on drug degradation. Information was put into a database, which enables the user not only to look up a particular CYP and all metabolized drugs, but also to check tolerability of drug-cocktails and to find alternative combinations, to use metabolic pathways more efficiently. The SuperCYP database contains 1170 drugs with more than 3800 interactions including references. Approximately 2000 SNPs and mutations are listed and ordered according to their effect on expression and/or activity. SuperCYP (http://bioinformatics.charite.de/supercyp) is a comprehensive resource focused on CYPs and drug metabolism. Homology-modeled structures of the CYPs can be downloaded in PDB format and related drugs are available as MOL-files. Within the resource, CYPs can be aligned with each other, drug-cocktails can be ‘mixed’, SNPs, protein point mutations, and their effects can be viewed and corresponding PubMed IDs are given. SuperCYP is meant to be a platform and a starting point for scientists and health professionals for furthering their research. PMID:19934256

  3. The Comprehensive Antibiotic Resistance Database

    PubMed Central

    McArthur, Andrew G.; Waglechner, Nicholas; Nizam, Fazmin; Yan, Austin; Azad, Marisa A.; Baylay, Alison J.; Bhullar, Kirandeep; Canova, Marc J.; De Pascale, Gianfranco; Ejim, Linda; Kalan, Lindsay; King, Andrew M.; Koteva, Kalinka; Morar, Mariya; Mulvey, Michael R.; O'Brien, Jonathan S.; Pawlowski, Andrew C.; Piddock, Laura J. V.; Spanogiannopoulos, Peter; Sutherland, Arlene D.; Tang, Irene; Taylor, Patricia L.; Thaker, Maulik; Wang, Wenliang; Yan, Marie; Yu, Tennison

    2013-01-01

    The field of antibiotic drug discovery and the monitoring of new antibiotic resistance elements have yet to fully exploit the power of the genome revolution. Despite the fact that the first genomes sequenced of free living organisms were those of bacteria, there have been few specialized bioinformatic tools developed to mine the growing amount of genomic data associated with pathogens. In particular, there are few tools to study the genetics and genomics of antibiotic resistance and how it impacts bacterial populations, ecology, and the clinic. We have initiated development of such tools in the form of the Comprehensive Antibiotic Research Database (CARD; http://arpcard.mcmaster.ca). The CARD integrates disparate molecular and sequence data, provides a unique organizing principle in the form of the Antibiotic Resistance Ontology (ARO), and can quickly identify putative antibiotic resistance genes in new unannotated genome sequences. This unique platform provides an informatic tool that bridges antibiotic resistance concerns in health care, agriculture, and the environment. PMID:23650175

  4. Comprehensive T-Matrix Reference Database: A 2012 - 2013 Update

    NASA Technical Reports Server (NTRS)

    Mishchenko, Michael I.; Videen, Gorden; Khlebtsov, Nikolai G.; Wriedt, Thomas

    2013-01-01

    The T-matrix method is one of the most versatile, efficient, and accurate theoretical techniques widely used for numerically exact computer calculations of electromagnetic scattering by single and composite particles, discrete random media, and particles imbedded in complex environments. This paper presents the fifth update to the comprehensive database of peer-reviewed T-matrix publications initiated by us in 2004 and includes relevant publications that have appeared since 2012. It also lists several earlier publications not incorporated in the original database, including Peter Waterman's reports from the 1960s illustrating the history of the T-matrix approach and demonstrating that John Fikioris and Peter Waterman were the true pioneers of the multi-sphere method otherwise known as the generalized Lorenz - Mie theory.

  5. RNAdb—a comprehensive mammalian noncoding RNA database

    PubMed Central

    Pang, Ken C.; Stephen, Stuart; Engström, Pär G.; Tajul-Arifin, Khairina; Chen, Weisan; Wahlestedt, Claes; Lenhard, Boris; Hayashizaki, Yoshihide; Mattick, John S.

    2005-01-01

    In recent years, there have been increasing numbers of transcripts identified that do not encode proteins, many of which are developmentally regulated and appear to have regulatory functions. Here, we describe the construction of a comprehensive mammalian noncoding RNA database (RNAdb) which contains over 800 unique experimentally studied non-coding RNAs (ncRNAs), including many associated with diseases and/or developmental processes. The database is available at http://research.imb.uq.edu.au/RNAdb and is searchable by many criteria. It includes microRNAs and snoRNAs, but not infrastructural RNAs, such as rRNAs and tRNAs, which are catalogued elsewhere. The database also includes over 1100 putative antisense ncRNAs and almost 20 000 putative ncRNAs identified in high-quality murine and human cDNA libraries, with more to be added in the near future. Many of these RNAs are large, and many are spliced, some alternatively. The database will be useful as a foundation for the emerging field of RNomics and the characterization of the roles of ncRNAs in mammalian gene expression and regulation. PMID:15608161

  6. A comprehensive database of Martian landslides

    NASA Astrophysics Data System (ADS)

    Battista Crosta, Giovanni; Vittorio De Blasio, Fabio; Frattini, Paolo; Valbuzzi, Elena

    2016-04-01

    During a long-term project, we have identified and classified a large number (> 3000) of Martian landslides especially but not exclusively from Valles Marineris. This database provides a more complete basis for a statistical study of landslides on Mars and its relationship with geographical and environmental conditions. Landslides have been mapped according to standard geomorphological criteria, delineating both the landslide scar and accumulation limits, associating each scarp to a deposit, and using the program ArcGis for generation of a complete digital dataset. Multiple accumulations from the same source area or from different sources have been differentiated, where possible, to obtain a more complete dataset and to allow more refined analyses. Each landslide has been classified according to a set of criteria including: type, degree of confinement, possible trigger, elevation with respect to datum, geomorphological features, degree of multiplicity, and so on. The runout, fall height, and volume have been measured for each deposit. In fact, the database is revealing a series of trends that may assist at understanding landform processes on Mars and its past climatic conditions. One of the most interesting aspects of our dataset is the presence of a population of landslides whose particularly long mobility deviates from average behavior. While some landslides have travelled unimpeded on a usually flat area, others have travelled against obstacles or mounds. Therefore, landslides are also studied in relation to i) morphologies created by the landslide itself, ii) presence of mounds, barriers or elevations than have affected the movement of the landslide mass. In some extreme cases, the landslide was capable of travelling for several tens of km along the whole valley and upon reaching the opposite side it travelled upslope for several hundreds of meters, which is indication of high travelling speed. In other cases, the high speed is revealed by dynamic deformations

  7. 4DGenome: a comprehensive database of chromatin interactions

    PubMed Central

    Teng, Li; He, Bing; Wang, Jiahui; Tan, Kai

    2015-01-01

    Motivation: The 3D structure of the genome plays a critical role in regulating gene expression. Recent progress in mapping technologies for chromatin interactions has led to a rapid increase in this kind of interaction data. This trend will continue as research in this burgeoning field intensifies. Results: We describe the 4DGenome database that stores chromatin interaction data compiled through comprehensive literature curation. The database currently covers both low- and high-throughput assays, including 3C, 4C-Seq, 5C, Hi-C, ChIA-PET and Capture-C. To complement the set of interactions detected by experimental assays, we also include interactions predicted by a recently developed computational method with demonstrated high accuracy. The database currently contains ∼8 million records, covering 102 cell/tissue types in five organisms. Records in the database are described using a standardized file format, facilitating data exchange. The vast major of the interactions were assigned a confidence score. Using the web interface, users can query and download database records via a number of annotation dimensions. Query results can be visualized along with other genomics datasets via links to the UCSC genome browser. We anticipate that 4DGenome will be a valuable resource for investigating the spatial structure-and-function relationship of genomes. Availability and Implementation: 4Dgenome is freely accessible at http://4dgenome.int-med.uiowa.edu. The database and web interface are implemented in MySQL, Apache and JavaScript with all major browsers supported. Contact: kai-tan@uiowa.edu Supplementary Information: Supplementary Materials are available at Bioinformatics online. PMID:25788621

  8. Dynameomics: A comprehensive database of protein dynamics

    PubMed Central

    van der Kamp, Marc W.; Schaeffer, Richard D.; Jonsson, Amanda L.; Scouras, Alexander D.; Simms, Andrew; Toofanny, Rudesh D.; Benson, Noah C.; Anderson, Peter C.; Merkley, Eric D.; Rysavy, Steve; Bromley, Denny; Beck, David A. C.; Daggett, Valerie

    2010-01-01

    Summary The dynamic behavior of proteins is important for an understanding of their function and folding. We have performed molecular dynamics simulations of the native state and unfolding pathways of over 1000 proteins, representing the majority of folds in globular proteins. These data are stored and organized using an innovative database approach, which can be mined to obtain both general and specific information about the dynamics and folding/unfolding of proteins, relevant subsets thereof, and individual proteins. Here we describe the project in general terms and the type of information contained in the database. Then we provide examples of mining the database for information relevant to protein folding, structure building, the effect of single-nucleotide polymorphisms, and drug design. The native state simulation data and corresponding analyses for the 100 most populated metafolds, together with related resources, are publicly accessible through www.dynameomics.org. PMID:20399180

  9. The peptaibiotics database--a comprehensive online resource.

    PubMed

    Neumann, Nora K N; Stoppacher, Norbert; Zeilinger, Susanne; Degenkolb, Thomas; Brückner, Hans; Schuhmacher, Rainer

    2015-05-01

    In this work, we present the 'Peptaibiotics Database' (PDB), a comprehensive online resource, which intends to cover all Aib-containing non-ribosomal fungal peptides currently described in scientific literature. This database shall extend and update the recently published 'Comprehensive Peptaibiotics Database' and currently consists of 1,297 peptaibiotic sequences. In a literature survey, a total of 235 peptaibiotic sequences published between January 2013 and June 2014 have been compiled, and added to the list of 1,062 peptides in the recently published 'Comprehensive Peptaibiotics Database'. The presented database is intended as a public resource freely accessible to the scientific community at peptaibiotics-database.boku.ac.at. The search options of the previously published repository and the presentation of sequence motif searches have been extended significantly. All of the available search options can be combined to create complex database queries. As a public repository, the presented database enables the easy upload of new peptaibiotic sequences or the correction of existing informations. In addition, an administrative interface for maintenance of the content of the database has been implemented, and the design of the database can be easily extended to store additional information to accommodate future needs of the 'peptaibiomics community'.

  10. Computerized comprehensive data analysis of Lung Imaging Database Consortium (LIDC)

    SciTech Connect

    Tan Jun; Pu Jiantao; Zheng Bin; Wang Xingwei; Leader, Joseph K.

    2010-07-15

    Purpose: Lung Image Database Consortium (LIDC) is the largest public CT image database of lung nodules. In this study, the authors present a comprehensive and the most updated analysis of this dynamically growing database under the help of a computerized tool, aiming to assist researchers to optimally use this database for lung cancer related investigations. Methods: The authors developed a computer scheme to automatically match the nodule outlines marked manually by radiologists on CT images. A large variety of characteristics regarding the annotated nodules in the database including volume, spiculation level, elongation, interobserver variability, as well as the intersection of delineated nodule voxels and overlapping ratio between the same nodules marked by different radiologists are automatically calculated and summarized. The scheme was applied to analyze all 157 examinations with complete annotation data currently available in LIDC dataset. Results: The scheme summarizes the statistical distributions of the abovementioned geometric and diagnosis features. Among the 391 nodules, (1) 365 (93.35%) have principal axis length {<=}20 mm; (2) 120, 75, 76, and 120 were marked by one, two, three, and four radiologists, respectively; and (3) 122 (32.48%) have the maximum volume overlapping ratios {>=}80% for the delineations of two radiologists, while 198 (50.64%) have the maximum volume overlapping ratios <60%. The results also showed that 72.89% of the nodules were assessed with malignancy score between 2 and 4, and only 7.93% of these nodules were considered as severely malignant (malignancy {>=}4). Conclusions: This study demonstrates that LIDC contains examinations covering a diverse distribution of nodule characteristics and it can be a useful resource to assess the performance of the nodule detection and/or segmentation schemes.

  11. Development of a Comprehensive Database System for Safety Analyst.

    PubMed

    Paz, Alexander; Veeramisti, Naveen; Khanal, Indira; Baker, Justin; de la Fuente-Mella, Hanns

    2015-01-01

    This study addressed barriers associated with the use of Safety Analyst, a state-of-the-art tool that has been developed to assist during the entire Traffic Safety Management process but that is not widely used due to a number of challenges as described in this paper. As part of this study, a comprehensive database system and tools to provide data to multiple traffic safety applications, with a focus on Safety Analyst, were developed. A number of data management tools were developed to extract, collect, transform, integrate, and load the data. The system includes consistency-checking capabilities to ensure the adequate insertion and update of data into the database. This system focused on data from roadways, ramps, intersections, and traffic characteristics for Safety Analyst. To test the proposed system and tools, data from Clark County, which is the largest county in Nevada and includes the cities of Las Vegas, Henderson, Boulder City, and North Las Vegas, was used. The database and Safety Analyst together help identify the sites with the potential for safety improvements. Specifically, this study examined the results from two case studies. The first case study, which identified sites having a potential for safety improvements with respect to fatal and all injury crashes, included all roadway elements and used default and calibrated Safety Performance Functions (SPFs). The second case study identified sites having a potential for safety improvements with respect to fatal and all injury crashes, specifically regarding intersections; it used default and calibrated SPFs as well. Conclusions were developed for the calibration of safety performance functions and the classification of site subtypes. Guidelines were provided about the selection of a particular network screening type or performance measure for network screening.

  12. Development of a Comprehensive Database System for Safety Analyst.

    PubMed

    Paz, Alexander; Veeramisti, Naveen; Khanal, Indira; Baker, Justin; de la Fuente-Mella, Hanns

    2015-01-01

    This study addressed barriers associated with the use of Safety Analyst, a state-of-the-art tool that has been developed to assist during the entire Traffic Safety Management process but that is not widely used due to a number of challenges as described in this paper. As part of this study, a comprehensive database system and tools to provide data to multiple traffic safety applications, with a focus on Safety Analyst, were developed. A number of data management tools were developed to extract, collect, transform, integrate, and load the data. The system includes consistency-checking capabilities to ensure the adequate insertion and update of data into the database. This system focused on data from roadways, ramps, intersections, and traffic characteristics for Safety Analyst. To test the proposed system and tools, data from Clark County, which is the largest county in Nevada and includes the cities of Las Vegas, Henderson, Boulder City, and North Las Vegas, was used. The database and Safety Analyst together help identify the sites with the potential for safety improvements. Specifically, this study examined the results from two case studies. The first case study, which identified sites having a potential for safety improvements with respect to fatal and all injury crashes, included all roadway elements and used default and calibrated Safety Performance Functions (SPFs). The second case study identified sites having a potential for safety improvements with respect to fatal and all injury crashes, specifically regarding intersections; it used default and calibrated SPFs as well. Conclusions were developed for the calibration of safety performance functions and the classification of site subtypes. Guidelines were provided about the selection of a particular network screening type or performance measure for network screening. PMID:26167531

  13. Development of a Comprehensive Database System for Safety Analyst

    PubMed Central

    Paz, Alexander; Veeramisti, Naveen; Khanal, Indira; Baker, Justin; de la Fuente-Mella, Hanns

    2015-01-01

    This study addressed barriers associated with the use of Safety Analyst, a state-of-the-art tool that has been developed to assist during the entire Traffic Safety Management process but that is not widely used due to a number of challenges as described in this paper. As part of this study, a comprehensive database system and tools to provide data to multiple traffic safety applications, with a focus on Safety Analyst, were developed. A number of data management tools were developed to extract, collect, transform, integrate, and load the data. The system includes consistency-checking capabilities to ensure the adequate insertion and update of data into the database. This system focused on data from roadways, ramps, intersections, and traffic characteristics for Safety Analyst. To test the proposed system and tools, data from Clark County, which is the largest county in Nevada and includes the cities of Las Vegas, Henderson, Boulder City, and North Las Vegas, was used. The database and Safety Analyst together help identify the sites with the potential for safety improvements. Specifically, this study examined the results from two case studies. The first case study, which identified sites having a potential for safety improvements with respect to fatal and all injury crashes, included all roadway elements and used default and calibrated Safety Performance Functions (SPFs). The second case study identified sites having a potential for safety improvements with respect to fatal and all injury crashes, specifically regarding intersections; it used default and calibrated SPFs as well. Conclusions were developed for the calibration of safety performance functions and the classification of site subtypes. Guidelines were provided about the selection of a particular network screening type or performance measure for network screening. PMID:26167531

  14. Comprehensive Thematic T-matrix Reference Database: a 2013-2014 Update

    NASA Technical Reports Server (NTRS)

    Mishchenko, Michael I.; Zakharova, Nadezhda T.; Khlebtsov, Nikolai G.; Wriedt, Thomas; Videen, Gorden

    2014-01-01

    This paper is the sixth update to the comprehensive thematic database of peer-reviewedT-matrix publications initiated by us in 2004 and includes relevant publications that have appeared since 2013. It also lists several earlier publications not incorporated in the original database and previous updates.

  15. Database trial impact on graduate nursing comprehensive exams.

    PubMed

    Pionke, Katharine; Huckstadt, Alicia

    2015-10-01

    While the authors were doing a test period of databases, the question of whether or not databases affect outcomes of graduate nursing comprehensive examinations came up. This study explored that question through using citation analysis of exams that were taken during a database trial and exams that were not. The findings showed no difference in examination pass/fail rates. While the pass/fail rates did not change, a great deal was learned in terms of citation accuracy and types of materials that students used, leading to discussions about changing how citation and plagiarism awareness were taught. PMID:26512218

  16. THE NATIONAL EXPOSURE RESEARCH LABORATORY'S COMPREHENSIVE HUMAN ACTIVITY DATABASE

    EPA Science Inventory

    EPA's National Exposure Research Laboratory (NERL) has combined data from nine U.S. studies related to human activities into one comprehensive data system that can be accessed via the world-wide web. The data system is called CHAD-Consolidated Human Activity Database-and it is ...

  17. Arabidopsis Hormone Database: a comprehensive genetic and phenotypic information database for plant hormone research in Arabidopsis.

    PubMed

    Peng, Zhi-yu; Zhou, Xin; Li, Linchuan; Yu, Xiangchun; Li, Hongjiang; Jiang, Zhiqiang; Cao, Guangyu; Bai, Mingyi; Wang, Xingchun; Jiang, Caifu; Lu, Haibin; Hou, Xianhui; Qu, Lijia; Wang, Zhiyong; Zuo, Jianru; Fu, Xiangdong; Su, Zhen; Li, Songgang; Guo, Hongwei

    2009-01-01

    Plant hormones are small organic molecules that influence almost every aspect of plant growth and development. Genetic and molecular studies have revealed a large number of genes that are involved in responses to numerous plant hormones, including auxin, gibberellin, cytokinin, abscisic acid, ethylene, jasmonic acid, salicylic acid, and brassinosteroid. Here, we develop an Arabidopsis hormone database, which aims to provide a systematic and comprehensive view of genes participating in plant hormonal regulation, as well as morphological phenotypes controlled by plant hormones. Based on data from mutant studies, transgenic analysis and gene ontology (GO) annotation, we have identified a total of 1026 genes in the Arabidopsis genome that participate in plant hormone functions. Meanwhile, a phenotype ontology is developed to precisely describe myriad hormone-regulated morphological processes with standardized vocabularies. A web interface (http://ahd.cbi.pku.edu.cn) would allow users to quickly get access to information about these hormone-related genes, including sequences, functional category, mutant information, phenotypic description, microarray data and linked publications. Several applications of this database in studying plant hormonal regulation and hormone cross-talk will be presented and discussed.

  18. Arabidopsis Hormone Database: a comprehensive genetic and phenotypic information database for plant hormone research in Arabidopsis.

    PubMed

    Peng, Zhi-yu; Zhou, Xin; Li, Linchuan; Yu, Xiangchun; Li, Hongjiang; Jiang, Zhiqiang; Cao, Guangyu; Bai, Mingyi; Wang, Xingchun; Jiang, Caifu; Lu, Haibin; Hou, Xianhui; Qu, Lijia; Wang, Zhiyong; Zuo, Jianru; Fu, Xiangdong; Su, Zhen; Li, Songgang; Guo, Hongwei

    2009-01-01

    Plant hormones are small organic molecules that influence almost every aspect of plant growth and development. Genetic and molecular studies have revealed a large number of genes that are involved in responses to numerous plant hormones, including auxin, gibberellin, cytokinin, abscisic acid, ethylene, jasmonic acid, salicylic acid, and brassinosteroid. Here, we develop an Arabidopsis hormone database, which aims to provide a systematic and comprehensive view of genes participating in plant hormonal regulation, as well as morphological phenotypes controlled by plant hormones. Based on data from mutant studies, transgenic analysis and gene ontology (GO) annotation, we have identified a total of 1026 genes in the Arabidopsis genome that participate in plant hormone functions. Meanwhile, a phenotype ontology is developed to precisely describe myriad hormone-regulated morphological processes with standardized vocabularies. A web interface (http://ahd.cbi.pku.edu.cn) would allow users to quickly get access to information about these hormone-related genes, including sequences, functional category, mutant information, phenotypic description, microarray data and linked publications. Several applications of this database in studying plant hormonal regulation and hormone cross-talk will be presented and discussed. PMID:19015126

  19. Comprehensive Genetic Database of Expressed Sequence Tags for Coccolithophorids

    NASA Astrophysics Data System (ADS)

    Ranji, Mohammad; Hadaegh, Ahmad R.

    Coccolithophorids are unicellular, marine, golden-brown, single-celled algae (Haptophyta) commonly found in near-surface waters in patchy distributions. They belong to the Phytoplankton family that is known to be responsible for much of the earth reproduction. Phytoplankton, just like plants live based on the energy obtained by Photosynthesis which produces oxygen. Substantial amount of oxygen in the earth's atmosphere is produced by Phytoplankton through Photosynthesis. The single-celled Emiliana Huxleyi is the most commonly known specie of Coccolithophorids and is known for extracting bicarbonate (HCO3) from its environment and producing calcium carbonate to form Coccoliths. Coccolithophorids are one of the world's primary producers, contributing about 15% of the average oceanic phytoplankton biomass to the oceans. They produce elaborate, minute calcite platelets (Coccoliths), covering the cell to form a Coccosphere and supplying up to 60% of the bulk pelagic calcite deposited on the sea floors. In order to understand the genetics of Coccolithophorid and the complexities of their biochemical reactions, we decided to build a database to store a complete profile of these organisms' genomes. Although a variety of such databases currently exist, (http://www.geneservice.co.uk/home/) none have yet been developed to comprehensively address the sequencing efforts underway by the Coccolithophorid research community. This database is called CocooExpress and is available to public (http://bioinfo.csusm.edu) for both data queries and sequence contribution.

  20. ClassLess: A Comprehensive Database of Young Stellar Objects

    NASA Astrophysics Data System (ADS)

    Hillenbrand, Lynne; Baliber, Nairn

    2015-01-01

    We have designed and constructed a database housing published measurements of Young Stellar Objects (YSOs) within ~1 kpc of the Sun. ClassLess, so called because it includes YSOs in all stages of evolution, is a relational database in which user interaction is conducted via HTML web browsers, queries are performed in scientific language, and all data are linked to the sources of publication. Each star is associated with a cluster (or clusters), and both spatially resolved and unresolved measurements are stored, allowing proper use of data from multiple star systems. With this fully searchable tool, myriad ground- and space-based instruments and surveys across wavelength regimes can be exploited. In addition to primary measurements, the database self consistently calculates and serves higher level data products such as extinction, luminosity, and mass. As a result, searches for young stars with specific physical characteristics can be completed with just a few mouse clicks.

  1. Comprehensive T-Matrix Reference Database: A 2007-2009 Update

    NASA Technical Reports Server (NTRS)

    Mishchenko, Michael I.; Zakharova, Nadia T.; Videen, Gorden; Khlebtsov, Nikolai G.; Wriedt, Thomas

    2010-01-01

    The T-matrix method is among the most versatile, efficient, and widely used theoretical techniques for the numerically exact computation of electromagnetic scattering by homogeneous and composite particles, clusters of particles, discrete random media, and particles in the vicinity of an interface separating two half-spaces with different refractive indices. This paper presents an update to the comprehensive database of T-matrix publications compiled by us previously and includes the publications that appeared since 2007. It also lists several earlier publications not included in the original database.

  2. Comprehensive T-matrix Reference Database: A 2009-2011 Update

    NASA Technical Reports Server (NTRS)

    Zakharova, Nadezhda T.; Videen, G.; Khlebtsov, Nikolai G.

    2012-01-01

    The T-matrix method is one of the most versatile and efficient theoretical techniques widely used for the computation of electromagnetic scattering by single and composite particles, discrete random media, and particles in the vicinity of an interface separating two half-spaces with different refractive indices. This paper presents an update to the comprehensive database of peer-reviewed T-matrix publications compiled by us previously and includes the publications that appeared since 2009. It also lists several earlier publications not included in the original database.

  3. FunSimMat: a comprehensive functional similarity database.

    PubMed

    Schlicker, Andreas; Albrecht, Mario

    2008-01-01

    Functional similarity based on Gene Ontology (GO) annotation is used in diverse applications like gene clustering, gene expression data analysis, protein interaction prediction and evaluation. However, there exists no comprehensive resource of functional similarity values although such a database would facilitate the use of functional similarity measures in different applications. Here, we describe FunSimMat (Functional Similarity Matrix, http://funsimmat.bioinf.mpi-inf.mpg.de/), a large new database that provides several different semantic similarity measures for GO terms. It offers various precomputed functional similarity values for proteins contained in UniProtKB and for protein families in Pfam and SMART. The web interface allows users to efficiently perform both semantic similarity searches with GO terms and functional similarity searches with proteins or protein families. All results can be downloaded in tab-delimited files for use with other tools. An additional XML-RPC interface gives automatic online access to FunSimMat for programs and remote services.

  4. Flybrain neuron database: a comprehensive database system of the Drosophila brain neurons.

    PubMed

    Shinomiya, Kazunori; Matsuda, Keiji; Oishi, Takao; Otsuna, Hideo; Ito, Kei

    2011-04-01

    The long history of neuroscience has accumulated information about numerous types of neurons in the brain of various organisms. Because such neurons have been reported in diverse publications without controlled format, it is not easy to keep track of all the known neurons in a particular nervous system. To address this issue we constructed an online database called Flybrain Neuron Database (Flybrain NDB), which serves as a platform to collect and provide information about all the types of neurons published so far in the brain of Drosophila melanogaster. Projection patterns of the identified neurons in diverse areas of the brain were recorded in a unified format, with text-based descriptions as well as images and movies wherever possible. In some cases projection sites and the distribution of the post- and presynaptic sites were determined with greater detail than described in the original publication. Information about the labeling patterns of various antibodies and expression driver strains to visualize identified neurons are provided as a separate sub-database. We also implemented a novel visualization tool with which users can interactively examine three-dimensional reconstruction of the confocal serial section images with desired viewing angles and cross sections. Comprehensive collection and versatile search function of the anatomical information reported in diverse publications make it possible to analyze possible connectivity between different brain regions. We analyzed the preferential connectivity among optic lobe layers and the plausible olfactory sensory map in the lateral horn to show the usefulness of such a database.

  5. ClassLess: A Comprehensive Database of Young Stellar Objects

    NASA Astrophysics Data System (ADS)

    Hillenbrand, Lynne A.; baliber, nairn

    2015-08-01

    We have designed and constructed a database intended to house catalog and literature-published measurements of Young Stellar Objects (YSOs) within ~1 kpc of the Sun. ClassLess, so called because it includes YSOs in all stages of evolution, is a relational database in which user interaction is conducted via HTML web browsers, queries are performed in scientific language, and all data are linked to the sources of publication. Each star is associated with a cluster (or clusters), and both spatially resolved and unresolved measurements are stored, allowing proper use of data from multiple star systems. With this fully searchable tool, myriad ground- and space-based instruments and surveys across wavelength regimes can be exploited. In addition to primary measurements, the database self consistently calculates and serves higher level data products such as extinction, luminosity, and mass. As a result, searches for young stars with specific physical characteristics can be completed with just a few mouse clicks. We are in the database population phase now, and are eager to engage with interested experts worldwide on local galactic star formation and young stellar populations.

  6. The Histone Database: a comprehensive resource for histones and histone fold-containing proteins.

    PubMed

    Mariño-Ramírez, Leonardo; Hsu, Benjamin; Baxevanis, Andreas D; Landsman, David

    2006-03-01

    The Histone Database is a curated and searchable collection of full-length sequences and structures of histones and nonhistone proteins containing histone-like folds, compiled from major public databases. Several new histone fold-containing proteins have been identified, including the huntingtin-interacting protein HYPM. Additionally, based on the recent crystal structure of the Son of Sevenless protein, an interpretation of the sequence analysis of the histone fold domain is presented. The database contains an updated collection of multiple sequence alignments for the four core histones (H2A, H2B, H3, and H4) and the linker histones (H1/H5) from a total of 975 organisms. The database also contains information on the human histone gene complement and provides links to three-dimensional structures of histone and histone fold-containing proteins. The Histone Database is a comprehensive bioinformatics resource for the study of structure and function of histones and histone fold-containing proteins. The database is available at http://research.nhgri.nih.gov/histones/.

  7. sRNATarBase: a comprehensive database of bacterial sRNA targets verified by experiments.

    PubMed

    Cao, Yuan; Wu, Jiayao; Liu, Qian; Zhao, Yalin; Ying, Xiaomin; Cha, Lei; Wang, Ligui; Li, Wuju

    2010-11-01

    Bacterial sRNAs are an emerging class of small regulatory RNAs, 40-500 nt in length, which play a variety of important roles in many biological processes through binding to their mRNA or protein targets. A comprehensive database of experimentally confirmed sRNA targets would be helpful in understanding sRNA functions systematically and provide support for developing prediction models. Here we report on such a database--sRNATarBase. The database holds 138 sRNA-target interactions and 252 noninteraction entries, which were manually collected from peer-reviewed papers. The detailed information for each entry, such as supporting experimental protocols, BLAST-based phylogenetic analysis of sRNA-mRNA target interaction in closely related bacteria, predicted secondary structures for both sRNAs and their targets, and available binding regions, is provided as accurately as possible. This database also provides hyperlinks to other databases including GenBank, SWISS-PROT, and MPIDB. The database is available from the web page http://ccb.bmi.ac.cn/srnatarbase/.

  8. Tomato functional genomics database (TFGD): a comprehensive collection and analysis package for tomato functional genomics

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Tomato Functional Genomics Database (TFGD; http://ted.bti.cornell.edu) provides a comprehensive systems biology resource to store, mine, analyze, visualize and integrate large-scale tomato functional genomics datasets. The database is expanded from the previously described Tomato Expression Database...

  9. circRNADb: A comprehensive database for human circular RNAs with protein-coding annotations

    PubMed Central

    Chen, Xiaoping; Han, Ping; Zhou, Tao; Guo, Xuejiang; Song, Xiaofeng; Li, Yan

    2016-01-01

    It has been known that circular RNAs are widely expressed in human tissues and cells, and play important regulatory roles in physiological or pathological processes. However, there is lack of comprehensively annotated human circular RNAs database. In this study we established a circRNA database, named as circRNADb, containing 32,914 human exonic circRNAs carefully selected from diversified sources. The detailed information of the circRNA, including genomic information, exon splicing, genome sequence, internal ribosome entry site (IRES), open reading frame (ORF) and references were provided in circRNADb. In addition, circRNAs were found to be able to encode proteins, which have not been reported in any species. 16328 circRNAs were annotated to have ORF longer than 100 amino acids, of which 7170 have IRES elements. 46 circRNAs from 37 genes were found to have their corresponding proteins expressed according mass spectrometry. The database provides the function of data search, browse, download, submit and feedback for the user to study particular circular RNA of interest and update the database continually. circRNADb will be built to be a biological information platform for circRNA molecules and related biological functions in the future. The database can be freely available through the web server at http://reprod.njmu.edu.cn/circrnadb. PMID:27725737

  10. LCGbase: A Comprehensive Database for Lineage-Based Co-regulated Genes.

    PubMed

    Wang, Dapeng; Zhang, Yubin; Fan, Zhonghua; Liu, Guiming; Yu, Jun

    2012-01-01

    Animal genes of different lineages, such as vertebrates and arthropods, are well-organized and blended into dynamic chromosomal structures that represent a primary regulatory mechanism for body development and cellular differentiation. The majority of genes in a genome are actually clustered, which are evolutionarily stable to different extents and biologically meaningful when evaluated among genomes within and across lineages. Until now, many questions concerning gene organization, such as what is the minimal number of genes in a cluster and what is the driving force leading to gene co-regulation, remain to be addressed. Here, we provide a user-friendly database-LCGbase (a comprehensive database for lineage-based co-regulated genes)-hosting information on evolutionary dynamics of gene clustering and ordering within animal kingdoms in two different lineages: vertebrates and arthropods. The database is constructed on a web-based Linux-Apache-MySQL-PHP framework and effective interactive user-inquiry service. Compared to other gene annotation databases with similar purposes, our database has three comprehensible advantages. First, our database is inclusive, including all high-quality genome assemblies of vertebrates and representative arthropod species. Second, it is human-centric since we map all gene clusters from other genomes in an order of lineage-ranks (such as primates, mammals, warm-blooded, and reptiles) onto human genome and start the database from well-defined gene pairs (a minimal cluster where the two adjacent genes are oriented as co-directional, convergent, and divergent pairs) to large gene clusters. Furthermore, users can search for any adjacent genes and their detailed annotations. Third, the database provides flexible parameter definitions, such as the distance of transcription start sites between two adjacent genes, which is extendable to genes that flanking the cluster across species. We also provide useful tools for sequence alignment, gene

  11. T3DB: a comprehensively annotated database of common toxins and their targets.

    PubMed

    Lim, Emilia; Pon, Allison; Djoumbou, Yannick; Knox, Craig; Shrivastava, Savita; Guo, An Chi; Neveu, Vanessa; Wishart, David S

    2010-01-01

    In an effort to capture meaningful biological, chemical and mechanistic information about clinically relevant, commonly encountered or important toxins, we have developed the Toxin and Toxin-Target Database (T3DB). The T3DB is a unique bioinformatics resource that compiles comprehensive information about common or ubiquitous toxins and their toxin-targets into a single electronic repository. The database currently contains over 2900 small molecule and peptide toxins, 1300 toxin-targets and more than 33,000 toxin-target associations. Each T3DB record (ToxCard) contains over 80 data fields providing detailed information on chemical properties and descriptors, toxicity values, protein and gene sequences (for both targets and toxins), molecular and cellular interaction data, toxicological data, mechanistic information and references. This information has been manually extracted and manually verified from numerous sources, including other electronic databases, government documents, textbooks and scientific journals. A key focus of the T3DB is on providing 'depth' over 'breadth' with detailed descriptions, mechanisms of action, and information on toxins and toxin-targets. T3DB is fully searchable and supports extensive text, sequence, chemical structure and relational query searches, similar to those found in the Human Metabolome Database (HMDB) and DrugBank. Potential applications of the T3DB include clinical metabolomics, toxin target prediction, toxicity prediction and toxicology education. The T3DB is available online at http://www.t3db.org. PMID:19897546

  12. Comprehensive thematic T-matrix reference database: A 2014-2015 update

    NASA Astrophysics Data System (ADS)

    Mishchenko, Michael I.; Zakharova, Nadezhda T.; Khlebtsov, Nikolai G.; Videen, Gorden; Wriedt, Thomas

    2016-07-01

    The T-matrix method is one of the most versatile and efficient direct computer solvers of the macroscopic Maxwell equations and is widely used for the computation of electromagnetic scattering by single and composite particles, discrete random media, and particles in the vicinity of an interface separating two half-spaces with different refractive indices. This paper is the seventh update to the comprehensive thematic database of peer-reviewed T-matrix publications initiated by us in 2004 and includes relevant publications that have appeared since 2013. It also lists a number of earlier publications overlooked previously.

  13. MIDAS (Material Implementation, Database, and Analysis Source): A comprehensive resource of material properties

    SciTech Connect

    Tang, M; Norquist, P; Barton, N; Durrenberger, K; Florando, J; Attia, A

    2010-12-13

    MIDAS is aimed to be an easy-to-use and comprehensive common source for material properties including both experimental data and models and their parameters. At LLNL, we will develop MIDAS to be the central repository for material strength related data and models with the long-term goal to encompass other material properties. MIDAS will allow the users to upload experimental data and updated models, to view and read materials data and references, to manipulate models and their parameters, and to serve as the central location for the application codes to access the continuously growing model source codes. MIDAS contains a suite of interoperable tools and utilizes components already existing at LLNL: MSD (material strength database), MatProp (database of materials properties files), and MSlib (library of material model source codes). MIDAS requires significant development of the computer science framework for the interfaces between different components. We present the current status of MIDAS and its future development in this paper.

  14. Characterizing Scattering of PSD-Integrated Aggregate Ensembles using a Comprehensive Particle Scattering Database

    NASA Astrophysics Data System (ADS)

    Kuo, Kwo-Sen; Skofronick-Jackson, Gail; Olson, William; Tanelli, Simone; Haddad, Ziad; van Aartsen, Bruce

    2013-04-01

    We have constructed ~10,000 realistic snow particles, including both pristine and aggregate types, with maximum diameter spanning ~100 micron to 15 mm. The scattering property for each of these particles has subsequently been obtained using the open source discrete-dipole-approximation (DDA) code, DDSCAT, at thirteen (13) microwave frequencies ranging from 10 to 190 GHz. A scattering property database has thus been constructed to support precipitation remote sensing involving ice particles in the GPM era. Preliminary trial retrievals, utilizing radar and radiometer measurements from a field campaign and the GPM radar-radiometer combined algorithm currently under development, demonstrates that much better consistency between radar and radiometer is achieved when using scattering properties from this database rather than the ones derived with simplified assumptions. Although this scattering database, using more realistic particles and being more comprehensive in scope, represents a considerable stride forward in the advancement of solid-precipitation remote sensing, there are still important tasks to be accomplished in order to effectively use this database for practical retrievals. One of such tasks is to find parameters, preferably a very small number of them, that can adequately characterize the single-scattering properties for realistic ensembles of these particles, i.e. with realistic particle size distributions, of these particles. This study reports an approach that holds great promise in reducing the seemingly insurmountable complexity in the plethora of particle shapes into a few parameters.

  15. MeioBase: a comprehensive database for meiosis.

    PubMed

    Li, Hao; Meng, Fanrui; Guo, Chunce; Wang, Yingxiang; Xie, Xiaojing; Zhu, Tiansheng; Zhou, Shuigeng; Ma, Hong; Shan, Hongyan; Kong, Hongzhi

    2014-01-01

    Meiosis is a special type of cell division process necessary for the sexual reproduction of all eukaryotes. The ever expanding meiosis research calls for an effective and specialized database that is not readily available yet. To fill this gap, we have developed a knowledge database MeioBase (http://meiosis.ibcas.ac.cn), which is comprised of two core parts, Resources and Tools. In the Resources part, a wealth of meiosis data collected by curation and manual review from published literatures and biological databases are integrated and organized into various sections, such as Cytology, Pathway, Species, Interaction, and Expression. In the Tools part, some useful tools have been integrated into MeioBase, such as Search, Download, Blast, Comparison, My Favorites, Submission, and Advice. With a simplified and efficient web interface, users are able to search against the database with gene model IDs or keywords, and batch download the data for local investigation. We believe that MeioBase can greatly facilitate the researches related to meiosis. PMID:25566299

  16. Comprehensive Health Education Standards (Including Physical Activity Standards). Comprehensive Health Rationale.

    ERIC Educational Resources Information Center

    Arizona State Dept. of Education, Phoenix.

    Arizona's comprehensive health education standards begins with a health rationale for each standard, then details the seven standards: (1) students comprehend concepts related to health promotion and disease prevention; (2) students demonstrate the ability to access accurate health information; (3) students demonstrate the ability to practice…

  17. Comprehensiveness of "Social Work Abstracts" as a Database for Researchers and Practitioners

    ERIC Educational Resources Information Center

    Shek, Daniel T. L.

    2008-01-01

    Objective: The comprehensiveness of "Social Work Abstracts (SWA)" as a database was examined. Method: The coverage of articles of "Research on Social Work Practice" and "Social Work Research" in "SWA" from 1995 to 2005 was examined. "SWA" was compared with other databases on several dimensions. A case study based on the author's publications was…

  18. AIM: A comprehensive Arabidopsis Interactome Module database and related interologs in plants

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Systems biology analysis of protein modules is important for understanding the functional relationships between proteins in the interactome. Here, we present a comprehensive database named AIM for Arabidopsis (Arabidopsis thaliana) interactome modules. The database contains almost 250,000 modules th...

  19. Skeleton Genetics: a comprehensive database for genes and mutations related to genetic skeletal disorders.

    PubMed

    Chen, Chong; Jiang, Yi; Xu, Chenyang; Liu, Xinting; Hu, Lin; Xiang, Yanbao; Chen, Qingshuang; Chen, Denghui; Li, Huanzheng; Xu, Xueqin; Tang, Shaohua

    2016-01-01

    Genetic skeletal disorders (GSD) involving the skeletal system arises through disturbances in the complex processes of skeletal development, growth and homeostasis and remain a diagnostic challenge because of their clinical heterogeneity and genetic variety. Over the past decades, tremendous effort platforms have been made to explore the complex heterogeneity, and massive new genes and mutations have been identified in different GSD, but the information supplied by literature is still limited and it is hard to meet the further needs of scientists and clinicians. In this study, combined with Nosology and Classification of genetic skeletal disorders, we developed the first comprehensive and annotated genetic skeletal disorders database, named 'SkeletonGenetics', which contains information about all GSD-related knowledge including 8225 mutations in 357 genes, with detailed information associated with 481 clinical diseases (2260 clinical phenotype) classified in 42 groups defined by molecular, biochemical and/or radiographic criteria from 1698 publications. Further annotations were performed to each entry including Gene Ontology, pathways analysis, protein-protein interaction, mutation annotations, disease-disease clustering and gene-disease networking. Furthermore, using concise search methods, intuitive graphical displays, convenient browsing functions and constantly updatable features, 'SkeletonGenetics' could serve as a central and integrative database for unveiling the genetic and pathways pre-dispositions of GSD.Database URL: http://101.200.211.232/skeletongenetics/. PMID:27580923

  20. Skeleton Genetics: a comprehensive database for genes and mutations related to genetic skeletal disorders

    PubMed Central

    Chen, Chong; Jiang, Yi; Xu, Chenyang; Liu, Xinting; Hu, Lin; Xiang, Yanbao; Chen, Qingshuang; Chen, Denghui; Li, Huanzheng; Xu, Xueqin; Tang, Shaohua

    2016-01-01

    Genetic skeletal disorders (GSD) involving the skeletal system arises through disturbances in the complex processes of skeletal development, growth and homeostasis and remain a diagnostic challenge because of their clinical heterogeneity and genetic variety. Over the past decades, tremendous effort platforms have been made to explore the complex heterogeneity, and massive new genes and mutations have been identified in different GSD, but the information supplied by literature is still limited and it is hard to meet the further needs of scientists and clinicians. In this study, combined with Nosology and Classification of genetic skeletal disorders, we developed the first comprehensive and annotated genetic skeletal disorders database, named ‘SkeletonGenetics’, which contains information about all GSD-related knowledge including 8225 mutations in 357 genes, with detailed information associated with 481 clinical diseases (2260 clinical phenotype) classified in 42 groups defined by molecular, biochemical and/or radiographic criteria from 1698 publications. Further annotations were performed to each entry including Gene Ontology, pathways analysis, protein–protein interaction, mutation annotations, disease–disease clustering and gene–disease networking. Furthermore, using concise search methods, intuitive graphical displays, convenient browsing functions and constantly updatable features, ‘SkeletonGenetics’ could serve as a central and integrative database for unveiling the genetic and pathways pre-dispositions of GSD. Database URL: http://101.200.211.232/skeletongenetics/ PMID:27580923

  1. uORFdb—a comprehensive literature database on eukaryotic uORF biology

    PubMed Central

    Wethmar, Klaus; Barbosa-Silva, Adriano; Andrade-Navarro, Miguel A.; Leutz, Achim

    2014-01-01

    Approximately half of all human transcripts contain at least one upstream translational initiation site that precedes the main coding sequence (CDS) and gives rise to an upstream open reading frame (uORF). We generated uORFdb, publicly available at http://cbdm.mdc-berlin.de/tools/uorfdb, to serve as a comprehensive literature database on eukaryotic uORF biology. Upstream ORFs affect downstream translation by interfering with the unrestrained progression of ribosomes across the transcript leader sequence. Although the first uORF-related translational activity was observed >30 years ago, and an increasing number of studies link defective uORF-mediated translational control to the development of human diseases, the features that determine uORF-mediated regulation of downstream translation are not well understood. The uORFdb was manually curated from all uORF-related literature listed at the PubMed database. It categorizes individual publications by a variety of denominators including taxon, gene and type of study. Furthermore, the database can be filtered for multiple structural and functional uORF-related properties to allow convenient and targeted access to the complex field of eukaryotic uORF biology. PMID:24163100

  2. Improving pharmaceutical innovation by building a more comprehensive database on drug development and use.

    PubMed

    Daniel, Gregory W; Cazé, Alexis; Romine, Morgan H; Audibert, Céline; Leff, Jonathan S; McClellan, Mark B

    2015-02-01

    New drugs and biologics have had a tremendous impact on the treatment of many diseases. However, available measures suggest that pharmaceutical innovation has remained relatively flat, despite substantial growth in research and development spending. We review recent literature on pharmaceutical innovation to identify limitations in measuring and assessing innovation, and we describe the framework and collaborative approach we are using to develop more comprehensive, publicly available metrics for innovation. Our research teams at the Brookings Institution and Deerfield Institute are collaborating with experts from multiple areas of drug development and regulatory review to identify and collect comprehensive data elements related to key development and regulatory characteristics for each new molecular entity approved over the past several decades in the United States and the European Union. Subsequent phases of our effort will add data on downstream product use and patient outcomes and will also include drugs that have failed or been abandoned in development. Such a database will enable researchers to better analyze the drivers of drug innovation, trends in the output of new medicines, and the effect of policy efforts designed to improve innovation.

  3. Improving pharmaceutical innovation by building a more comprehensive database on drug development and use.

    PubMed

    Daniel, Gregory W; Cazé, Alexis; Romine, Morgan H; Audibert, Céline; Leff, Jonathan S; McClellan, Mark B

    2015-02-01

    New drugs and biologics have had a tremendous impact on the treatment of many diseases. However, available measures suggest that pharmaceutical innovation has remained relatively flat, despite substantial growth in research and development spending. We review recent literature on pharmaceutical innovation to identify limitations in measuring and assessing innovation, and we describe the framework and collaborative approach we are using to develop more comprehensive, publicly available metrics for innovation. Our research teams at the Brookings Institution and Deerfield Institute are collaborating with experts from multiple areas of drug development and regulatory review to identify and collect comprehensive data elements related to key development and regulatory characteristics for each new molecular entity approved over the past several decades in the United States and the European Union. Subsequent phases of our effort will add data on downstream product use and patient outcomes and will also include drugs that have failed or been abandoned in development. Such a database will enable researchers to better analyze the drivers of drug innovation, trends in the output of new medicines, and the effect of policy efforts designed to improve innovation. PMID:25646113

  4. MitBASE : a comprehensive and integrated mitochondrial DNA database. The present status

    PubMed Central

    Attimonelli, M.; Altamura, N.; Benne, R.; Brennicke, A.; Cooper, J. M.; D’Elia, D.; Montalvo, A. de; Pinto, B. de; De Robertis, M.; Golik, P.; Knoop, V.; Lanave, C.; Lazowska, J.; Licciulli, F.; Malladi, B. S.; Memeo, F.; Monnerot, M.; Pasimeni, R.; Pilbout, S.; Schapira, A. H. V.; Sloof, P.; Saccone, C.

    2000-01-01

    MitBASE is an integrated and comprehensive database of mitochondrial DNA data which collects, under a single interface, databases for Plant, Vertebrate, Invertebrate, Human, Protist and Fungal mtDNA and a Pilot database on nuclear genes involved in mitochondrial biogenesis in Saccharomyces cerevisiae. MitBASE reports all available information from different organisms and from intraspecies variants and mutants. Data have been drawn from the primary databases and from the literature; value adding information has been structured, e.g., editing information on protist mtDNA genomes, pathological information for human mtDNA variants, etc. The different databases, some of which are structured using commercial packages (Microsoft Access, File Maker Pro) while others use a flat-file format, have been integrated under ORACLE. Ad hoc retrieval systems have been devised for some of the above listed databases keeping into account their peculiarities. The database is resident at the EBI and is available at the following site: http://www3.ebi.ac.uk/Research/Mitbase/mitbase.pl . The impact of this project is intended for both basic and applied research. The study of mitochondrial genetic diseases and mitochondrial DNA intraspecies diversity are key topics in several biotechnological fields. The database has been funded within the EU Biotechnology programme. PMID:10592207

  5. microPIR2: a comprehensive database for human-mouse comparative study of microRNA-promoter interactions.

    PubMed

    Piriyapongsa, Jittima; Bootchai, Chaiwat; Ngamphiw, Chumpol; Tongsima, Sissades

    2014-01-01

    microRNA (miRNA)-promoter interaction resource (microPIR) is a public database containing over 15 million predicted miRNA target sites located within human promoter sequences. These predicted targets are presented along with their related genomic and experimental data, making the microPIR database the most comprehensive repository of miRNA promoter target sites. Here, we describe major updates of the microPIR database including new target predictions in the mouse genome and revised human target predictions. The updated database (microPIR2) now provides ∼80 million human and 40 million mouse predicted target sites. In addition to being a reference database, microPIR2 is a tool for comparative analysis of target sites on the promoters of human-mouse orthologous genes. In particular, this new feature was designed to identify potential miRNA-promoter interactions conserved between species that could be stronger candidates for further experimental validation. We also incorporated additional supporting information to microPIR2 such as nuclear and cytoplasmic localization of miRNAs and miRNA-disease association. Extra search features were also implemented to enable various investigations of targets of interest. Database URL: http://www4a.biotec.or.th/micropir2

  6. OncomiRdbB: a comprehensive database of microRNAs and their targets in breast cancer

    PubMed Central

    2014-01-01

    Background Given the estimate that 30% of our genes are controlled by microRNAs, it is essential that we understand the precise relationship between microRNAs and their targets. OncomiRs are microRNAs (miRNAs) that have been frequently shown to be deregulated in cancer. However, although several oncomiRs have been identified and characterized, there is as yet no comprehensive compilation of this data which has rendered it underutilized by cancer biologists. There is therefore an unmet need in generating bioinformatic platforms to speed the identification of novel therapeutic targets. Description We describe here OncomiRdbB, a comprehensive database of oncomiRs mined from different existing databases for mouse and humans along with novel oncomiRs that we have validated in human breast cancer samples. The database also lists their respective predicted targets, identified using miRanda, along with their IDs, sequences, chromosome location and detailed description. This database facilitates querying by search strings including microRNA name, sequence, accession number, target genes and organisms. The microRNA networks and their hubs with respective targets at 3'UTR, 5'UTR and exons of different pathway genes were also deciphered using the 'R' algorithm. Conclusion OncomiRdbB is a comprehensive and integrated database of oncomiRs and their targets in breast cancer with multiple query options which will help enhance both understanding of the biology of breast cancer and the development of new and innovative microRNA based diagnostic tools and targets of therapeutic significance. OncomiRdbB is freely available for download through the URL link http://tdb.ccmb.res.in/OncomiRdbB/index.htm. PMID:24428888

  7. 24 CFR 968.320 - HUD review and approval of comprehensive plan (including five-year action plan).

    Code of Federal Regulations, 2010 CFR

    2010-04-01

    ... comprehensive plan (including five-year action plan). 968.320 Section 968.320 Housing and Urban Development... approval of comprehensive plan (including five-year action plan). (a) Submission of comprehensive plan. (1... Plan. After HUD approves the Comprehensive Plan (including the Five-Year Action Plan), or...

  8. 24 CFR 968.320 - HUD review and approval of comprehensive plan (including five-year action plan).

    Code of Federal Regulations, 2011 CFR

    2011-04-01

    ... comprehensive plan (including five-year action plan). 968.320 Section 968.320 Housing and Urban Development... approval of comprehensive plan (including five-year action plan). (a) Submission of comprehensive plan. (1... Plan. After HUD approves the Comprehensive Plan (including the Five-Year Action Plan), or...

  9. Building and Analyzing SURGEDAT: The World's Most Comprehensive Storm Surge Database

    NASA Astrophysics Data System (ADS)

    Needham, H.; Keim, B.

    2012-12-01

    SURGEDAT, the world's most comprehensive tropical storm surge database, has identified and mapped the location and height of hundreds of global storm surge events. This project originated with a study that identified more than 200 tropical surge events along the U.S. Gulf Coast. Spatial analysis of these data reveal that the central and western Gulf Coast observe more frequent and higher magnitude surges, whereas much of the eastern Gulf Coast, including the west coast of Florida, experiences less storm surge activity. Basin-wide return period analysis of these data estimate a 100-year return period of 8.20m, and a 10-year return period of 4.95m. Return period analysis of 10 sub-regions within the basin reveal that the highest surge levels occur in the Southeast Louisiana/ Mississippi zone, which includes the New Orleans metropolitan area. The 100-year surge level in this zone is estimated to be 7.67m. The Southeast Texas/ Southwest Louisiana zone, which includes the Houston metropolitan area, has the second highest surge levels, with a 100-year storm surge estimate of 6.30m. Surge levels are lower on the west coast of Florida, where the 100-year surge level is estimated between three and four meters. Expansion of this work includes mapping all high water marks for each surge event and the creation of a search-by-location web tool, which enables users to see the entire storm surge history for specific locations. In addition, the dataset has expanded internationally and now includes more than 500 surge events, as surges have now been identified in all the major ocean basins that experience tropical cyclones. International partnerships are sought to further expand this work, particularly in Australia, China, Japan, Philippines, India, Bangladesh, Myanmar, Mexico and various countries in Oceania and the Caribbean.; ;

  10. TransportDB: a comprehensive database resource for cytoplasmic membrane transport systems and outer membrane channels.

    PubMed

    Ren, Qinghu; Chen, Kaixi; Paulsen, Ian T

    2007-01-01

    TransportDB (http://www.membranetransport.org/) is a comprehensive database resource of information on cytoplasmic membrane transporters and outer membrane channels in organisms whose complete genome sequences are available. The complete set of membrane transport systems and outer membrane channels of each organism are annotated based on a series of experimental and bioinformatic evidence and classified into different types and families according to their mode of transport, bioenergetics, molecular phylogeny and substrate specificities. User-friendly web interfaces are designed for easy access, query and download of the data. Features of the TransportDB website include text-based and BLAST search tools against known transporter and outer membrane channel proteins; comparison of transporter and outer membrane channel contents from different organisms; known 3D structures of transporters, and phylogenetic trees of transporter families. On individual protein pages, users can find detailed functional annotation, supporting bioinformatic evidence, protein/DNA sequences, publications and cross-referenced external online resource links. TransportDB has now been in existence for over 10 years and continues to be regularly updated with new evidence and data from newly sequenced genomes, as well as having new features added periodically.

  11. Comprehensive national database of tree effects on air quality and human health in the United States.

    PubMed

    Hirabayashi, Satoshi; Nowak, David J

    2016-08-01

    Trees remove air pollutants through dry deposition processes depending upon forest structure, meteorology, and air quality that vary across space and time. Employing nationally available forest, weather, air pollution and human population data for 2010, computer simulations were performed for deciduous and evergreen trees with varying leaf area index for rural and urban areas in every county in the conterminous United States. The results populated a national database of annual air pollutant removal, concentration changes, and reductions in adverse health incidences and costs for NO2, O3, PM2.5 and SO2. The developed database enabled a first order approximation of air quality and associated human health benefits provided by trees with any forest configurations anywhere in the conterminous United States over time. Comprehensive national database of tree effects on air quality and human health in the United States was developed.

  12. Comprehensive national database of tree effects on air quality and human health in the United States.

    PubMed

    Hirabayashi, Satoshi; Nowak, David J

    2016-08-01

    Trees remove air pollutants through dry deposition processes depending upon forest structure, meteorology, and air quality that vary across space and time. Employing nationally available forest, weather, air pollution and human population data for 2010, computer simulations were performed for deciduous and evergreen trees with varying leaf area index for rural and urban areas in every county in the conterminous United States. The results populated a national database of annual air pollutant removal, concentration changes, and reductions in adverse health incidences and costs for NO2, O3, PM2.5 and SO2. The developed database enabled a first order approximation of air quality and associated human health benefits provided by trees with any forest configurations anywhere in the conterminous United States over time. Comprehensive national database of tree effects on air quality and human health in the United States was developed. PMID:27176764

  13. Including the Group Quarters Population in the US Synthesized Population Database

    PubMed Central

    Chasteen, Bernadette M.; Wheaton, William D.; Cooley, Philip C.; Ganapathi, Laxminarayana; Wagener, Diane K.

    2011-01-01

    In 2005, RTI International researchers developed methods to generate synthesized population data on US households for the US Synthesized Population Database. These data are used in agent-based modeling, which simulates large-scale social networks to test how changes in the behaviors of individuals affect the overall network. Group quarters are residences where individuals live in close proximity and interact frequently. Although the Synthesized Population Database represents the population living in households, data for the nation’s group quarters residents are not easily quantified because of US Census Bureau reporting methods designed to protect individuals’ privacy. Including group quarters population data can be an important factor in agent-based modeling because the number of residents and the frequency of their interactions are variables that directly affect modeling results. Particularly with infectious disease modeling, the increased frequency of agent interaction may increase the probability of infectious disease transmission between individuals and the probability of disease outbreaks. This report reviews our methods to synthesize data on group quarters residents to match US Census Bureau data. Our goal in developing the Group Quarters Population Database was to enable its use with RTI’s US Synthesized Population Database in the Modeling of Infectious Diseases Agent Study. PMID:21841972

  14. GLASS: a comprehensive database for experimentally validated GPCR-ligand associations

    PubMed Central

    Chan, Wallace K. B.; Zhang, Hongjiu; Yang, Jianyi; Brender, Jeffrey R.; Hur, Junguk; Özgür, Arzucan; Zhang, Yang

    2015-01-01

    Motivation: G protein-coupled receptors (GPCRs) are probably the most attractive drug target membrane proteins, which constitute nearly half of drug targets in the contemporary drug discovery industry. While the majority of drug discovery studies employ existing GPCR and ligand interactions to identify new compounds, there remains a shortage of specific databases with precisely annotated GPCR-ligand associations. Results: We have developed a new database, GLASS, which aims to provide a comprehensive, manually curated resource for experimentally validated GPCR-ligand associations. A new text-mining algorithm was proposed to collect GPCR-ligand interactions from the biomedical literature, which is then crosschecked with five primary pharmacological datasets, to enhance the coverage and accuracy of GPCR-ligand association data identifications. A special architecture has been designed to allow users for making homologous ligand search with flexible bioactivity parameters. The current database contains ∼500 000 unique entries, of which the vast majority stems from ligand associations with rhodopsin- and secretin-like receptors. The GLASS database should find its most useful application in various in silico GPCR screening and functional annotation studies. Availability and implementation: The website of GLASS database is freely available at http://zhanglab.ccmb.med.umich.edu/GLASS/. Contact: zhng@umich.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25971743

  15. 24 CFR 968.315 - Comprehensive Plan (including five-year action plan).

    Code of Federal Regulations, 2013 CFR

    2013-04-01

    ... comprehensive plan including, but not limited to, the physical and management needs assessments, viability... CFR part 91) for project and neighborhood improvements where public housing units are located or...; (2) A copy of the summary of total preliminary estimated costs to address physical needs by...

  16. CracidMex1: a comprehensive database of global occurrences of cracids (Aves, Galliformes) with distribution in Mexico

    PubMed Central

    Pinilla-Buitrago, Gonzalo; Martínez-Morales, Miguel Angel; González-García, Fernando; Enríquez, Paula L.; Rangel-Salazar, José Luis; Romero, Carlos Alberto Guichard; Navarro-Sigüenza, Adolfo G.; Monterrubio-Rico, Tiberio César; Escalona-Segura, Griselda

    2014-01-01

    Abstract Cracids are among the most vulnerable groups of Neotropical birds. Almost half of the species of this family are included in a conservation risk category. Twelve taxa occur in Mexico, six of which are considered at risk at national level and two are globally endangered. Therefore, it is imperative that high quality, comprehensive, and high-resolution spatial data on the occurrence of these taxa are made available as a valuable tool in the process of defining appropriate management strategies for conservation at a local and global level. We constructed the CracidMex1 database by collating global records of all cracid taxa that occur in Mexico from available electronic databases, museum specimens, publications, “grey literature”, and unpublished records. We generated a database with 23,896 clean, validated, and standardized geographic records. Database quality control was an iterative process that commenced with the consolidation and elimination of duplicate records, followed by the geo-referencing of records when necessary, and their taxonomic and geographic validation using GIS tools and expert knowledge. We followed the geo-referencing protocol proposed by the Mexican National Commission for the Use and Conservation of Biodiversity. We could not estimate the geographic coordinates of 981 records due to inconsistencies or lack of sufficient information in the description of the locality. Given that current records for most of the taxa have some degree of distributional bias, with redundancies at different spatial scales, the CracidMex1 database has allowed us to detect areas where more sampling effort is required to have a better representation of the global spatial occurrence of these cracids. We also found that particular attention needs to be given to taxa identification in those areas where congeners or conspecifics co-occur in order to avoid taxonomic uncertainty. The construction of the CracidMex1 database represents the first comprehensive

  17. CracidMex1: a comprehensive database of global occurrences of cracids (Aves, Galliformes) with distribution in Mexico.

    PubMed

    Pinilla-Buitrago, Gonzalo; Martínez-Morales, Miguel Angel; González-García, Fernando; Enríquez, Paula L; Rangel-Salazar, José Luis; Romero, Carlos Alberto Guichard; Navarro-Sigüenza, Adolfo G; Monterrubio-Rico, Tiberio César; Escalona-Segura, Griselda

    2014-01-01

    Cracids are among the most vulnerable groups of Neotropical birds. Almost half of the species of this family are included in a conservation risk category. Twelve taxa occur in Mexico, six of which are considered at risk at national level and two are globally endangered. Therefore, it is imperative that high quality, comprehensive, and high-resolution spatial data on the occurrence of these taxa are made available as a valuable tool in the process of defining appropriate management strategies for conservation at a local and global level. We constructed the CracidMex1 database by collating global records of all cracid taxa that occur in Mexico from available electronic databases, museum specimens, publications, "grey literature", and unpublished records. We generated a database with 23,896 clean, validated, and standardized geographic records. Database quality control was an iterative process that commenced with the consolidation and elimination of duplicate records, followed by the geo-referencing of records when necessary, and their taxonomic and geographic validation using GIS tools and expert knowledge. We followed the geo-referencing protocol proposed by the Mexican National Commission for the Use and Conservation of Biodiversity. We could not estimate the geographic coordinates of 981 records due to inconsistencies or lack of sufficient information in the description of the locality. Given that current records for most of the taxa have some degree of distributional bias, with redundancies at different spatial scales, the CracidMex1 database has allowed us to detect areas where more sampling effort is required to have a better representation of the global spatial occurrence of these cracids. We also found that particular attention needs to be given to taxa identification in those areas where congeners or conspecifics co-occur in order to avoid taxonomic uncertainty. The construction of the CracidMex1 database represents the first comprehensive research effort to

  18. CracidMex1: a comprehensive database of global occurrences of cracids (Aves, Galliformes) with distribution in Mexico.

    PubMed

    Pinilla-Buitrago, Gonzalo; Martínez-Morales, Miguel Angel; González-García, Fernando; Enríquez, Paula L; Rangel-Salazar, José Luis; Romero, Carlos Alberto Guichard; Navarro-Sigüenza, Adolfo G; Monterrubio-Rico, Tiberio César; Escalona-Segura, Griselda

    2014-01-01

    Cracids are among the most vulnerable groups of Neotropical birds. Almost half of the species of this family are included in a conservation risk category. Twelve taxa occur in Mexico, six of which are considered at risk at national level and two are globally endangered. Therefore, it is imperative that high quality, comprehensive, and high-resolution spatial data on the occurrence of these taxa are made available as a valuable tool in the process of defining appropriate management strategies for conservation at a local and global level. We constructed the CracidMex1 database by collating global records of all cracid taxa that occur in Mexico from available electronic databases, museum specimens, publications, "grey literature", and unpublished records. We generated a database with 23,896 clean, validated, and standardized geographic records. Database quality control was an iterative process that commenced with the consolidation and elimination of duplicate records, followed by the geo-referencing of records when necessary, and their taxonomic and geographic validation using GIS tools and expert knowledge. We followed the geo-referencing protocol proposed by the Mexican National Commission for the Use and Conservation of Biodiversity. We could not estimate the geographic coordinates of 981 records due to inconsistencies or lack of sufficient information in the description of the locality. Given that current records for most of the taxa have some degree of distributional bias, with redundancies at different spatial scales, the CracidMex1 database has allowed us to detect areas where more sampling effort is required to have a better representation of the global spatial occurrence of these cracids. We also found that particular attention needs to be given to taxa identification in those areas where congeners or conspecifics co-occur in order to avoid taxonomic uncertainty. The construction of the CracidMex1 database represents the first comprehensive research effort to

  19. XML Storage for Magnetotelluric Transfer Functions: Towards a Comprehensive Online Reference Database

    NASA Astrophysics Data System (ADS)

    Kelbert, A.; Blum, C.

    2015-12-01

    Magnetotelluric Transfer Functions (MT TFs) represent most of the information about Earth electrical conductivity found in the raw electromagnetic data, providing inputs for further inversion and interpretation. To be useful for scientific interpretation, they must also contain carefully recorded metadata. Making these data available in a discoverable and citable fashion would provide the most benefit to the scientific community, but such a development requires that the metadata is not only present in the file but is also searchable. The most commonly used MT TF format to date, the historical Society of Exploration Geophysicists Electromagnetic Data Interchange Standard 1987 (EDI), no longer supports some of the needs of modern magnetotellurics, most notably accurate error bars recording. Moreover, the inherent heterogeneity of EDI's and other historic MT TF formats has mostly kept the community away from healthy data sharing practices. Recently, the MT team at Oregon State University in collaboration with IRIS Data Management Center developed a new, XML-based format for MT transfer functions, and an online system for long-term storage, discovery and sharing of MT TF data worldwide (IRIS SPUD; www.iris.edu/spud/emtf). The system provides a query page where all of the MT transfer functions collected within the USArray MT experiment and other field campaigns can be searched for and downloaded; an automatic on-the-fly conversion to the historic EDI format is also included. To facilitate conversion to the new, more comprehensive and sustainable, XML format for MT TFs, and to streamline inclusion of historic data into the online database, we developed a set of open source format conversion tools, which can be used for rotation of MT TFs as well as a general XML <-> EDI converter (https://seiscode.iris.washington.edu/projects/emtf-fcu). Here, we report on the newly established collaboration between the USGS Geomagnetism Program and the Oregon State University to gather

  20. CoReCG: a comprehensive database of genes associated with colon-rectal cancer

    PubMed Central

    Agarwal, Rahul; Kumar, Binayak; Jayadev, Msk; Raghav, Dhwani; Singh, Ashutosh

    2016-01-01

    Cancer of large intestine is commonly referred as colorectal cancer, which is also the third most frequently prevailing neoplasm across the globe. Though, much of work is being carried out to understand the mechanism of carcinogenesis and advancement of this disease but, fewer studies has been performed to collate the scattered information of alterations in tumorigenic cells like genes, mutations, expression changes, epigenetic alteration or post translation modification, genetic heterogeneity. Earlier findings were mostly focused on understanding etiology of colorectal carcinogenesis but less emphasis were given for the comprehensive review of the existing findings of individual studies which can provide better diagnostics based on the suggested markers in discrete studies. Colon Rectal Cancer Gene Database (CoReCG), contains 2056 colon-rectal cancer genes information involved in distinct colorectal cancer stages sourced from published literature with an effective knowledge based information retrieval system. Additionally, interactive web interface enriched with various browsing sections, augmented with advance search facility for querying the database is provided for user friendly browsing, online tools for sequence similarity searches and knowledge based schema ensures a researcher friendly information retrieval mechanism. Colorectal cancer gene database (CoReCG) is expected to be a single point source for identification of colorectal cancer-related genes, thereby helping with the improvement of classification, diagnosis and treatment of human cancers. Database URL: lms.snu.edu.in/corecg PMID:27114494

  1. RiceVarMap: a comprehensive database of rice genomic variations

    PubMed Central

    Zhao, Hu; Yao, Wen; Ouyang, Yidan; Yang, Wanneng; Wang, Gongwei; Lian, Xingming; Xing, Yongzhong; Chen, Lingling; Xie, Weibo

    2015-01-01

    Rice Variation Map (RiceVarMap, http:/ricevarmap.ncpgr.cn) is a database of rice genomic variations. The database provides comprehensive information of 6 551 358 single nucleotide polymorphisms (SNPs) and 1 214 627 insertions/deletions (INDELs) identified from sequencing data of 1479 rice accessions. The SNP genotypes of all accessions were imputed and evaluated, resulting in an overall missing data rate of 0.42% and an estimated accuracy greater than 99%. The SNP/INDEL genotypes of all accessions are available for online query and download. Users can search SNPs/INDELs by identifiers of the SNPs/INDELs, genomic regions, gene identifiers and keywords of gene annotation. Allele frequencies within various subpopulations and the effects of the variation that may alter the protein sequence of a gene are also listed for each SNP/INDEL. The database also provides geographical details and phenotype images for various rice accessions. In particular, the database provides tools to construct haplotype networks and design PCR-primers by taking into account surrounding known genomic variations. These data and tools are highly useful for exploring genetic variations and evolution studies of rice and other species. PMID:25274737

  2. An X-Ray Analysis Database of Photoionization Cross Sections Including Variable Ionization

    NASA Technical Reports Server (NTRS)

    Wang, Ping; Cohen, David H.; MacFarlane, Joseph J.; Cassinelli, Joseph P.

    1997-01-01

    Results of research efforts in the following areas are discussed: review of the major theoretical and experimental data of subshell photoionization cross sections and ionization edges of atomic ions to assess the accuracy of the data, and to compile the most reliable of these data in our own database; detailed atomic physics calculations to complement the database for all ions of 17 cosmically abundant elements; reconciling the data from various sources and our own calculations; and fitting cross sections with functional approximations and incorporating these functions into a compact computer code.Also, efforts included adapting an ionization equilibrium code, tabulating results, and incorporating them into the overall program and testing the code (both ionization equilibrium and opacity codes) with existing observational data. The background and scientific applications of this work are discussed. Atomic physics cross section models and calculations are described. Calculation results are compared with available experimental data and other theoretical data. The functional approximations used for fitting cross sections are outlined and applications of the database are discussed.

  3. The H-Invitational Database (H-InvDB), a comprehensive annotation resource for human genes and transcripts*

    PubMed Central

    2008-01-01

    Here we report the new features and improvements in our latest release of the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/), a comprehensive annotation resource for human genes and transcripts. H-InvDB, originally developed as an integrated database of the human transcriptome based on extensive annotation of large sets of full-length cDNA (FLcDNA) clones, now provides annotation for 120 558 human mRNAs extracted from the International Nucleotide Sequence Databases (INSD), in addition to 54 978 human FLcDNAs, in the latest release H-InvDB_4.6. We mapped those human transcripts onto the human genome sequences (NCBI build 36.1) and determined 34 699 human gene clusters, which could define 34 057 (98.1%) protein-coding and 642 (1.9%) non-protein-coding loci; 858 (2.5%) transcribed loci overlapped with predicted pseudogenes. For all these transcripts and genes, we provide comprehensive annotation including gene structures, gene functions, alternative splicing variants, functional non-protein-coding RNAs, functional domains, predicted sub cellular localizations, metabolic pathways, predictions of protein 3D structure, mapping of SNPs and microsatellite repeat motifs, co-localization with orphan diseases, gene expression profiles, orthologous genes, protein–protein interactions (PPI) and annotation for gene families. The current H-InvDB annotation resources consist of two main views: Transcript view and Locus view and eight sub-databases: the DiseaseInfo Viewer, H-ANGEL, the Clustering Viewer, G-integra, the TOPO Viewer, Evola, the PPI view and the Gene family/group. PMID:18089548

  4. Phenol-Explorer: an online comprehensive database on polyphenol contents in foods

    PubMed Central

    Neveu, V.; Perez-Jiménez, J.; Vos, F.; Crespy, V.; du Chaffaut, L.; Mennen, L.; Knox, C.; Eisner, R.; Cruz, J.; Wishart, D.; Scalbert, A.

    2010-01-01

    A number of databases on the plant metabolome describe the chemistry and biosynthesis of plant chemicals. However, no such database is specifically focused on foods and more precisely on polyphenols, one of the major classes of phytochemicals. As antoxidants, polyphenols influence human health and may play a role in the prevention of a number of chronic diseases such as cardiovascular diseases, some cancers or type 2 diabetes. To determine polyphenol intake in populations and study their association with health, it is essential to have detailed information on their content in foods. However this information is not easily collected due to the variety of their chemical structures and the variability of their content in a given food. Phenol-Explorer is the first comprehensive web-based database on polyphenol content in foods. It contains more than 37 000 original data points collected from 638 scientific articles published in peer-reviewed journals. The quality of these data has been evaluated before they were aggregated to produce final representative mean content values for 502 polyphenols in 452 foods. The web interface allows making various queries on the aggregated data to identify foods containing a given polyphenol or polyphenols present in a given food. For each mean content value, it is possible to trace all original content values and their literature sources. Phenol-Explorer is a major step forward in the development of databases on food constituents and the food metabolome. It should help researchers to better understand the role of phytochemicals in the technical and nutritional quality of food, and food manufacturers to develop tailor-made healthy foods. Database URL: http://www.phenol-explorer.eu PMID:20428313

  5. Global catalogue of microorganisms (gcm): a comprehensive database and information retrieval, analysis, and visualization system for microbial resources

    PubMed Central

    2013-01-01

    Background Throughout the long history of industrial and academic research, many microbes have been isolated, characterized and preserved (whenever possible) in culture collections. With the steady accumulation in observational data of biodiversity as well as microbial sequencing data, bio-resource centers have to function as data and information repositories to serve academia, industry, and regulators on behalf of and for the general public. Hence, the World Data Centre for Microorganisms (WDCM) started to take its responsibility for constructing an effective information environment that would promote and sustain microbial research data activities, and bridge the gaps currently present within and outside the microbiology communities. Description Strain catalogue information was collected from collections by online submission. We developed tools for automatic extraction of strain numbers and species names from various sources, including Genbank, Pubmed, and SwissProt. These new tools connect strain catalogue information with the corresponding nucleotide and protein sequences, as well as to genome sequence and references citing a particular strain. All information has been processed and compiled in order to create a comprehensive database of microbial resources, and was named Global Catalogue of Microorganisms (GCM). The current version of GCM contains information of over 273,933 strains, which includes 43,436bacterial, fungal and archaea species from 52 collections in 25 countries and regions. A number of online analysis and statistical tools have been integrated, together with advanced search functions, which should greatly facilitate the exploration of the content of GCM. Conclusion A comprehensive dynamic database of microbial resources has been created, which unveils the resources preserved in culture collections especially for those whose informatics infrastructures are still under development, which should foster cumulative research, facilitating the

  6. A European Flood Database: facilitating comprehensive flood research beyond administrative boundaries

    NASA Astrophysics Data System (ADS)

    Hall, J.; Arheimer, B.; Aronica, G. T.; Bilibashi, A.; Boháč, M.; Bonacci, O.; Borga, M.; Burlando, P.; Castellarin, A.; Chirico, G. B.; Claps, P.; Fiala, K.; Gaál, L.; Gorbachova, L.; Gül, A.; Hannaford, J.; Kiss, A.; Kjeldsen, T.; Kohnová, S.; Koskela, J. J.; Macdonald, N.; Mavrova-Guirguinova, M.; Ledvinka, O.; Mediero, L.; Merz, B.; Merz, R.; Molnar, P.; Montanari, A.; Osuch, M.; Parajka, J.; Perdigão, R. A. P.; Radevski, I.; Renard, B.; Rogger, M.; Salinas, J. L.; Sauquet, E.; Šraj, M.; Szolgay, J.; Viglione, A.; Volpi, E.; Wilson, D.; Zaimi, K.; Blöschl, G.

    2015-06-01

    The current work addresses one of the key building blocks towards an improved understanding of flood processes and associated changes in flood characteristics and regimes in Europe: the development of a comprehensive, extensive European flood database. The presented work results from ongoing cross-border research collaborations initiated with data collection and joint interpretation in mind. A detailed account of the current state, characteristics and spatial and temporal coverage of the European Flood Database, is presented. At this stage, the hydrological data collection is still growing and consists at this time of annual maximum and daily mean discharge series, from over 7000 hydrometric stations of various data series lengths. Moreover, the database currently comprises data from over 50 different data sources. The time series have been obtained from different national and regional data sources in a collaborative effort of a joint European flood research agreement based on the exchange of data, models and expertise, and from existing international data collections and open source websites. These ongoing efforts are contributing to advancing the understanding of regional flood processes beyond individual country boundaries and to a more coherent flood research in Europe.

  7. A comprehensive change detection method for updating the National Land Cover Database to circa 2011

    USGS Publications Warehouse

    Jin, Suming; Yang, Limin; Danielson, Patrick; Homer, Collin G.; Fry, Joyce; Xian, George

    2013-01-01

    The importance of characterizing, quantifying, and monitoring land cover, land use, and their changes has been widely recognized by global and environmental change studies. Since the early 1990s, three U.S. National Land Cover Database (NLCD) products (circa 1992, 2001, and 2006) have been released as free downloads for users. The NLCD 2006 also provides land cover change products between 2001 and 2006. To continue providing updated national land cover and change datasets, a new initiative in developing NLCD 2011 is currently underway. We present a new Comprehensive Change Detection Method (CCDM) designed as a key component for the development of NLCD 2011 and the research results from two exemplar studies. The CCDM integrates spectral-based change detection algorithms including a Multi-Index Integrated Change Analysis (MIICA) model and a novel change model called Zone, which extracts change information from two Landsat image pairs. The MIICA model is the core module of the change detection strategy and uses four spectral indices (CV, RCVMAX, dNBR, and dNDVI) to obtain the changes that occurred between two image dates. The CCDM also includes a knowledge-based system, which uses critical information on historical and current land cover conditions and trends and the likelihood of land cover change, to combine the changes from MIICA and Zone. For NLCD 2011, the improved and enhanced change products obtained from the CCDM provide critical information on location, magnitude, and direction of potential change areas and serve as a basis for further characterizing land cover changes for the nation. An accuracy assessment from the two study areas show 100% agreement between CCDM mapped no-change class with reference dataset, and 18% and 82% disagreement for the change class for WRS path/row p22r39 and p33r33, respectively. The strength of the CCDM is that the method is simple, easy to operate, widely applicable, and capable of capturing a variety of natural and

  8. Development of a Publicly Available, Comprehensive Database of Fiber and Health Outcomes: Rationale and Methods

    PubMed Central

    Livingston, Kara A.; Chung, Mei; Sawicki, Caleigh M.; Lyle, Barbara J.; Wang, Ding Ding; Roberts, Susan B.; McKeown, Nicola M.

    2016-01-01

    Background Dietary fiber is a broad category of compounds historically defined as partially or completely indigestible plant-based carbohydrates and lignin with, more recently, the additional criteria that fibers incorporated into foods as additives should demonstrate functional human health outcomes to receive a fiber classification. Thousands of research studies have been published examining fibers and health outcomes. Objectives (1) Develop a database listing studies testing fiber and physiological health outcomes identified by experts at the Ninth Vahouny Conference; (2) Use evidence mapping methodology to summarize this body of literature. This paper summarizes the rationale, methodology, and resulting database. The database will help both scientists and policy-makers to evaluate evidence linking specific fibers with physiological health outcomes, and identify missing information. Methods To build this database, we conducted a systematic literature search for human intervention studies published in English from 1946 to May 2015. Our search strategy included a broad definition of fiber search terms, as well as search terms for nine physiological health outcomes identified at the Ninth Vahouny Fiber Symposium. Abstracts were screened using a priori defined eligibility criteria and a low threshold for inclusion to minimize the likelihood of rejecting articles of interest. Publications then were reviewed in full text, applying additional a priori defined exclusion criteria. The database was built and published on the Systematic Review Data Repository (SRDR™), a web-based, publicly available application. Conclusions A fiber database was created. This resource will reduce the unnecessary replication of effort in conducting systematic reviews by serving as both a central database archiving PICO (population, intervention, comparator, outcome) data on published studies and as a searchable tool through which this data can be extracted and updated. PMID:27348733

  9. Toxic Substances Control Act (TSCA) Test Submissions Database (TSCATS) - comprehensive update (on magnetic tape). Data file

    SciTech Connect

    1996-07-01

    The Toxic Substances Control Act Test Submissions Database (TSCATS) was developed to make unpublished test data available to the public. The test data are submitted to the U.S. Environmental Protection Agency by industry under the Toxic Substances Control Act. Test data are broadly defined to include case reports, episodic incidents, such as spills, and formal test study presentations. The database allows searching of test submissions formal test study presentations. The database allows searching of test submissions according to specific chemical identity or type of study when used with an appropriate search retrieval software program. Studies are indexed under three broad subject areas: health effects, environmental effects and environmental fate. Additional controlled vocabulary terms are assigned which describe the experimental protocol and test observations. Records identify reference information needed to locate the source document, as well as the submitting organization and reason for submission of the test data.

  10. dEMBF: A Comprehensive Database of Enzymes of Microalgal Biofuel Feedstock.

    PubMed

    Misra, Namrata; Panda, Prasanna Kumar; Parida, Bikram Kumar; Mishra, Barada Kanta

    2016-01-01

    Microalgae have attracted wide attention as one of the most versatile renewable feedstocks for production of biofuel. To develop genetically engineered high lipid yielding algal strains, a thorough understanding of the lipid biosynthetic pathway and the underpinning enzymes is essential. In this work, we have systematically mined the genomes of fifteen diverse algal species belonging to Chlorophyta, Heterokontophyta, Rhodophyta, and Haptophyta, to identify and annotate the putative enzymes of lipid metabolic pathway. Consequently, we have also developed a database, dEMBF (Database of Enzymes of Microalgal Biofuel Feedstock), which catalogues the complete list of identified enzymes along with their computed annotation details including length, hydrophobicity, amino acid composition, subcellular location, gene ontology, KEGG pathway, orthologous group, Pfam domain, intron-exon organization, transmembrane topology, and secondary/tertiary structural data. Furthermore, to facilitate functional and evolutionary study of these enzymes, a collection of built-in applications for BLAST search, motif identification, sequence and phylogenetic analysis have been seamlessly integrated into the database. dEMBF is the first database that brings together all enzymes responsible for lipid synthesis from available algal genomes, and provides an integrative platform for enzyme inquiry and analysis. This database will be extremely useful for algal biofuel research. It can be accessed at http://bbprof.immt.res.in/embf. PMID:26727469

  11. dEMBF: A Comprehensive Database of Enzymes of Microalgal Biofuel Feedstock.

    PubMed

    Misra, Namrata; Panda, Prasanna Kumar; Parida, Bikram Kumar; Mishra, Barada Kanta

    2016-01-01

    Microalgae have attracted wide attention as one of the most versatile renewable feedstocks for production of biofuel. To develop genetically engineered high lipid yielding algal strains, a thorough understanding of the lipid biosynthetic pathway and the underpinning enzymes is essential. In this work, we have systematically mined the genomes of fifteen diverse algal species belonging to Chlorophyta, Heterokontophyta, Rhodophyta, and Haptophyta, to identify and annotate the putative enzymes of lipid metabolic pathway. Consequently, we have also developed a database, dEMBF (Database of Enzymes of Microalgal Biofuel Feedstock), which catalogues the complete list of identified enzymes along with their computed annotation details including length, hydrophobicity, amino acid composition, subcellular location, gene ontology, KEGG pathway, orthologous group, Pfam domain, intron-exon organization, transmembrane topology, and secondary/tertiary structural data. Furthermore, to facilitate functional and evolutionary study of these enzymes, a collection of built-in applications for BLAST search, motif identification, sequence and phylogenetic analysis have been seamlessly integrated into the database. dEMBF is the first database that brings together all enzymes responsible for lipid synthesis from available algal genomes, and provides an integrative platform for enzyme inquiry and analysis. This database will be extremely useful for algal biofuel research. It can be accessed at http://bbprof.immt.res.in/embf.

  12. dEMBF: A Comprehensive Database of Enzymes of Microalgal Biofuel Feedstock

    PubMed Central

    Misra, Namrata; Panda, Prasanna Kumar; Parida, Bikram Kumar; Mishra, Barada Kanta

    2016-01-01

    Microalgae have attracted wide attention as one of the most versatile renewable feedstocks for production of biofuel. To develop genetically engineered high lipid yielding algal strains, a thorough understanding of the lipid biosynthetic pathway and the underpinning enzymes is essential. In this work, we have systematically mined the genomes of fifteen diverse algal species belonging to Chlorophyta, Heterokontophyta, Rhodophyta, and Haptophyta, to identify and annotate the putative enzymes of lipid metabolic pathway. Consequently, we have also developed a database, dEMBF (Database of Enzymes of Microalgal Biofuel Feedstock), which catalogues the complete list of identified enzymes along with their computed annotation details including length, hydrophobicity, amino acid composition, subcellular location, gene ontology, KEGG pathway, orthologous group, Pfam domain, intron-exon organization, transmembrane topology, and secondary/tertiary structural data. Furthermore, to facilitate functional and evolutionary study of these enzymes, a collection of built-in applications for BLAST search, motif identification, sequence and phylogenetic analysis have been seamlessly integrated into the database. dEMBF is the first database that brings together all enzymes responsible for lipid synthesis from available algal genomes, and provides an integrative platform for enzyme inquiry and analysis. This database will be extremely useful for algal biofuel research. It can be accessed at http://bbprof.immt.res.in/embf. PMID:26727469

  13. A Comprehensive Database for the Most Commonly Measured Paleoceanographic Proxies: Evaluating Global Carbon Burial Variations over the Last Deglaciation

    NASA Astrophysics Data System (ADS)

    Cartapanis, O. A.; Galbraith, E. D.; Bianchi, D.

    2014-12-01

    The paleoceanographic community has measured physical and chemical properties of thousands of sediment cores, in order to reconstruct past oceanic conditions across a wide range of timescales. Most of these datasets were archived in online data repositories in order to facilitate later analysis. However, the diversity of proxy types and heterogeneous reporting standards have hindered the analysis of globally distributed paleoceanographic time series, despite the tremendous potential utility of large datasets. In order for the existing archives of paleoceanographic data to contribute towards important unanswered issues, the datasets must be readily accessible in an organized structure, with an appropriate unified age model for each sediment site. Here we present a comprehensive database, built using MATLAB structure, for the most commonly measured proxies in marine sediment cores and surface sediment. These include the concentrations of carbonate, organic matter and opal, density of the sediment, the oxygen and carbon isotopic composition of benthic and planktonic foraminifera, sea surface temperature estimates, as well as chronological constrains. The records were downloaded from PANGAEA and NOAA databases and multiple records from the same sediment core were regrouped, allowing age models to be shared between records. The resulting database consists of more than 30,000 sites, facilitating rapid and accurate mapping of sediment composition for selected time periods, and retrieving and plotting time series. The greatest hurdle identified in building a unifi­ed dataproduct of this type is inconsistency in the reporting of depth measurements and chronological information. In our view, standardized reporting practices would greatly assist in developing global quantitative analysis, and requires community-wide action. We propose the development of a unified global sediment core registry, including metadata and chronological information for all marine sediment cores

  14. COMPREHENSIVE ADOLESCENT HEALTH PROGRAMS THAT INCLUDE SEXUAL AND REPRODUCTIVE HEALTH SERVICES: A SYSTEMATIC REVIEW

    PubMed Central

    Kågesten, Anna; Parekh, Jenita; Tunçalp, Özge; Turke, Shani; Blum, Robert Wm

    2015-01-01

    Objectives To describe the characteristics and evidence of comprehensive adolescent health (CAH) programs encompassing sexual and reproductive and other health services, educational and social support. Methods Systematic review of peer-reviewed and grey literature on CAH programs (1998-2013). Only programs with experimental, quasi-experimental or pre-post evaluations were included. Two independent reviewers screened 36,119 records and extracted articles using predefined criteria. Data was synthesised into descriptive characteristics; quality was assessed by evidence level (rigorous, strong, modest). Results Data was extracted on 46 programs (181 studies) of which 19 programs were defined as comprehensive; 9 were excluded due to insufficient evaluations. Ten CAH programs met all inclusion criteria. Most were US based; others were implemented in Egypt, Ethiopia and Mexico. Few programs (N=3) displayed rigorous evidence, 5 had strong and 2 had modest evidence. All programs with rigorous/strong evidence directly or indirectly influenced adolescent sexual and reproductive health. Common elements of these programs included: long-term commitment to adolescents, interpersonal connections, community mobilization, and skill-building elements. Conclusion The long term impact of many CAH programs cannot be proven due to insufficient evaluations. Evaluation approaches that take into account the complex operating conditions of many programs are needed to better understand mechanisms behind program effects. PMID:25320876

  15. WATCHDOG: A Comprehensive All-sky Database of Galactic Black Hole X-ray Binaries

    NASA Astrophysics Data System (ADS)

    Tetarenko, B. E.; Sivakoff, G. R.; Heinke, C. O.; Gladstone, J. C.

    2016-02-01

    With the advent of more sensitive all-sky instruments, the transient universe is being probed in greater depth than ever before. Taking advantage of available resources, we have established a comprehensive database of black hole (and black hole candidate) X-ray binary (BHXB) activity between 1996 and 2015 as revealed by all-sky instruments, scanning surveys, and select narrow-field X-ray instruments on board the INTErnational Gamma-Ray Astrophysics Laboratory, Monitor of All-Sky X-ray Image, Rossi X-ray Timing Explorer, and Swift telescopes; the Whole-sky Alberta Time-resolved Comprehensive black-Hole Database Of the Galaxy or WATCHDOG. Over the past two decades, we have detected 132 transient outbursts, tracked and classified behavior occurring in 47 transient and 10 persistently accreting BHs, and performed a statistical study on a number of outburst properties across the Galactic population. We find that outbursts undergone by BHXBs that do not reach the thermally dominant accretion state make up a substantial fraction (∼40%) of the Galactic transient BHXB outburst sample over the past ∼20 years. Our findings suggest that this “hard-only” behavior, observed in transient and persistently accreting BHXBs, is neither a rare nor recent phenomenon and may be indicative of an underlying physical process, relatively common among binary BHs, involving the mass-transfer rate onto the BH remaining at a low level rather than increasing as the outburst evolves. We discuss how the larger number of these “hard-only” outbursts and detected outbursts in general have significant implications for both the luminosity function and mass-transfer history of the Galactic BHXB population.

  16. Lignan contents of Dutch plant foods: a database including lariciresinol, pinoresinol, secoisolariciresinol and matairesinol.

    PubMed

    Milder, Ivon E J; Arts, Ilja C W; van de Putte, Betty; Venema, Dini P; Hollman, Peter C H

    2005-03-01

    Enterolignans (enterodiol and enterolactone) can potentially reduce the risk of certain cancers and cardiovascular diseases. Enterolignans are formed by the intestinal microflora after the consumption of plant lignans. Until recently, only secoisolariciresinol and matairesinol were considered enterolignan precursors, but now several new precursors have been identified, of which lariciresinol and pinoresinol have a high degree of conversion. Quantitative data on the contents in foods of these new enterolignan precursors are not available. Thus, the aim of this study was to compile a lignan database including all four major enterolignan precursors. Liquid chromatography-tandem mass spectrometry was used to quantify lariciresinol, pinoresinol, secoisolariciresinol and matairesinol in eighty-three solid foods and twenty-six beverages commonly consumed in The Netherlands. The richest source of lignans was flaxseed (301,129 microg/100 g), which contained mainly secoisolariciresinol. Also, lignan concentrations in sesame seeds (29,331 microg/100 g, mainly pinoresinol and lariciresinol) were relatively high. For grain products, which are known to be important sources of lignan, lignan concentrations ranged from 7 to 764 microg/100 g. However, many vegetables and fruits had similar concentrations, because of the contribution of lariciresinol and pinoresinol. Brassica vegetables contained unexpectedly high levels of lignans (185-2321 microg/100 g), mainly pinoresinol and lariciresinol. Lignan levels in beverages varied from 0 (cola) to 91 microg/100 ml (red wine). Only four of the 109 foods did not contain a measurable amount of lignans, and in most cases the amount of lariciresinol and pinoresinol was larger than that of secoisolariciresinol and matairesinol. Thus, available databases largely underestimate the amount of enterolignan precursors in foods.

  17. The Prion Disease Database: a comprehensive transcriptome resource for systems biology research in prion diseases

    PubMed Central

    Gehlenborg, Nils; Hwang, Daehee; Lee, Inyoul Y.; Yoo, Hyuntae; Baxter, David; Petritis, Brianne; Pitstick, Rose; Marzolf, Bruz; DeArmond, Stephen J.; Carlson, George A.; Hood, Leroy

    2009-01-01

    Prion diseases reflect conformational conversion of benign isoforms of prion protein (PrPC) to malignant PrPSc isoforms. Networks perturbed by PrPSc accumulation and their ties to pathological events are poorly understood. Time-course transcriptomic and phenotypic data in animal models are critical for understanding prion-perturbed networks in systems biology studies. Here, we present the Prion Disease Database (PDDB), the most comprehensive data resource on mouse prion diseases to date. The PDDB contains: (i) time-course mRNA measurements spanning the interval from prion inoculation through appearance of clinical signs in eight mouse strain-prion strain combinations and (ii) histoblots showing temporal PrPSc accumulation patterns in brains from each mouse–prion combination. To facilitate prion research, the PDDB also provides a suite of analytical tools for reconstructing dynamic networks via integration of temporal mRNA and interaction data and for analyzing these networks to generate hypotheses. Database URL: http://prion.systemsbiology.net PMID:20157484

  18. The Aspergillus Genome Database (AspGD): recent developments in comprehensive multispecies curation, comparative genomics and community resources.

    PubMed

    Arnaud, Martha B; Cerqueira, Gustavo C; Inglis, Diane O; Skrzypek, Marek S; Binkley, Jonathan; Chibucos, Marcus C; Crabtree, Jonathan; Howarth, Clinton; Orvis, Joshua; Shah, Prachi; Wymore, Farrell; Binkley, Gail; Miyasato, Stuart R; Simison, Matt; Sherlock, Gavin; Wortman, Jennifer R

    2012-01-01

    The Aspergillus Genome Database (AspGD; http://www.aspgd.org) is a freely available, web-based resource for researchers studying fungi of the genus Aspergillus, which includes organisms of clinical, agricultural and industrial importance. AspGD curators have now completed comprehensive review of the entire published literature about Aspergillus nidulans and Aspergillus fumigatus, and this annotation is provided with streamlined, ortholog-based navigation of the multispecies information. AspGD facilitates comparative genomics by providing a full-featured genomics viewer, as well as matched and standardized sets of genomic information for the sequenced aspergilli. AspGD also provides resources to foster interaction and dissemination of community information and resources. We welcome and encourage feedback at aspergillus-curator@lists.stanford.edu.

  19. The Chloroplast Function Database II: a comprehensive collection of homozygous mutants and their phenotypic/genotypic traits for nuclear-encoded chloroplast proteins.

    PubMed

    Myouga, Fumiyoshi; Akiyama, Kenji; Tomonaga, Yumi; Kato, Aya; Sato, Yuka; Kobayashi, Megumi; Nagata, Noriko; Sakurai, Tetsuya; Shinozaki, Kazuo

    2013-02-01

    The Chloroplast Function Database has so far offered phenotype information on mutants of the nuclear-encoded chloroplast proteins in Arabidopsis that pertains to >200 phenotypic data sets that were obtained from 1,722 transposon- or T-DNA-tagged lines. Here, we present the development of the second version of the database, which is named the Chloroplast Function Database II and was redesigned to increase the number of mutant characters and new user-friendly tools for data mining and integration. The upgraded database offers information on genome-wide mutant screens for any visible phenotype against 2,495 tagged lines to create a comprehensive homozygous mutant collection. The collection consists of 147 lines with seedling phenotypes and 185 lines for which we could not obtain homozygotes, as well as 1,740 homozygotes with wild-type phenotypes. Besides providing basic information about primer lists that were used for the PCR genotyping of T-DNA-tagged lines and explanations about the preparation of homozygous mutants and phenotype screening, the database includes access to a link between the gene locus and existing publicly available databases. This gives users access to a combined pool of data, enabling them to gain valuable insights into biological processes. In addition, high-resolution images of plastid morphologies of mutants with seedling-specific chloroplast defects as observed with transmission electron microscopy (TEM) are available in the current database. This database is used to compare the phenotypes of visually identifiable mutants with their plastid ultrastructures and to evaluate their potential significance from characteristic patterns of plastid morphology in vivo. Thus, the Chloroplast Function Database II is a useful and comprehensive information resource that can help researchers to connect individual Arabidopsis genes to plastid functions on the basis of phenotype analysis of our tagged mutant collection. It can be freely accessed at http://rarge.psc.riken.jp/chloroplast/.

  20. Feasibility of including green tea products for an analytically verified dietary supplement database

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The Dietary Supplement Ingredient Database (DSID) is a federally-funded, publically-accessible dietary supplement database that currently contains analytically derived information on micronutrients in selected adult and children’s multivitamin and mineral (MVM) supplements. Other constituents in di...

  1. An automated DICOM database capable of arbitrary data mining (including radiation dose indicators) for quality monitoring.

    PubMed

    Wang, Shanshan; Pavlicek, William; Roberts, Catherine C; Langer, Steve G; Zhang, Muhong; Hu, Mengqi; Morin, Richard L; Schueler, Beth A; Wellnitz, Clinton V; Wu, Teresa

    2011-04-01

    The U.S. National Press has brought to full public discussion concerns regarding the use of medical radiation, specifically x-ray computed tomography (CT), in diagnosis. A need exists for developing methods whereby assurance is given that all diagnostic medical radiation use is properly prescribed, and all patients' radiation exposure is monitored. The "DICOM Index Tracker©" (DIT) transparently captures desired digital imaging and communications in medicine (DICOM) tags from CT, nuclear imaging equipment, and other DICOM devices across an enterprise. Its initial use is recording, monitoring, and providing automatic alerts to medical professionals of excursions beyond internally determined trigger action levels of radiation. A flexible knowledge base, aware of equipment in use, enables automatic alerts to system administrators of newly identified equipment models or software versions so that DIT can be adapted to the new equipment or software. A dosimetry module accepts mammography breast organ dose, skin air kerma values from XA modalities, exposure indices from computed radiography, etc. upon receipt. The American Association of Physicists in Medicine recommended a methodology for effective dose calculations which are performed with CT units having DICOM structured dose reports. Web interface reporting is provided for accessing the database in real-time. DIT is DICOM-compliant and, thus, is standardized for international comparisons. Automatic alerts currently in use include: email, cell phone text message, and internal pager text messaging. This system extends the utility of DICOM for standardizing the capturing and computing of radiation dose as well as other quality measures. PMID:20824303

  2. An Expanded UV Irradiance Database from TOMS Including the Effects of Ozone, Clouds, and Aerosol Attenuation

    NASA Technical Reports Server (NTRS)

    Herman, J.; Krotkov, N.

    2003-01-01

    The TOMS UV irradiance database (1978 to 2003) has been expanded to include five new products (noon irradiance at 305,310,324, and 380 nm, and noon erythemal-weighted irradiance), in addition to the existing erythemal daily exposure, that permit direct comparisons with ground-based measurements from spectrometers and broadband instruments. The new data are available on http://toms.gsfc.nasa.gov/>http://toms.gsfc.nasa.gov. Comparisons of the TOMS estimated irradiances with ground-based instruments are given along with a review of the sources of known errors, especially the recent improvements in accounting for aerosol attenuation. Trend estimations from the new TOMS irradiances permit the clear separation of changes caused by ozone and those caused by aerosols and clouds. Systematic differences in cloud cover are shown to be the most important factor in determining regional differences in UV radiation reaching the ground for locations at the same latitude (e.g., the summertime differences between Australia and the US southwest).

  3. dbPEC: a comprehensive literature-based database for preeclampsia related genes and phenotypes

    PubMed Central

    Uzun, Alper; Triche, Elizabeth W.; Schuster, Jessica; Dewan, Andrew T.; Padbury, James F.

    2016-01-01

    Preeclampsia is one of the most common causes of fetal and maternal morbidity and mortality in the world. We built a Database for Preeclampsia (dbPEC) consisting of the clinical features, concurrent conditions, published literature and genes associated with Preeclampsia. We included gene sets associated with severity, concurrent conditions, tissue sources and networks. The published scientific literature is the primary repository for all information documenting human disease. We used semantic data mining to retrieve and extract the articles pertaining to preeclampsia-associated genes and performed manual curation. We deposited the articles, genes, preeclampsia phenotypes and other supporting information into the dbPEC. It is publicly available and freely accessible. Previously, we developed a database for preterm birth (dbPTB) using a similar approach. Using the gene sets in dbPTB, we were able to successfully analyze a genome-wide study of preterm birth including 4000 women and children. We identified important genes and pathways associated with preterm birth that were not otherwise demonstrable using genome-wide approaches. dbPEC serves not only as a resources for genes and articles associated with preeclampsia, it is a robust source of gene sets to analyze a wide range of high-throughput data for gene set enrichment analysis. Database URL: http://ptbdb.cs.brown.edu/dbpec/ PMID:26946289

  4. dbPEC: a comprehensive literature-based database for preeclampsia related genes and phenotypes.

    PubMed

    Uzun, Alper; Triche, Elizabeth W; Schuster, Jessica; Dewan, Andrew T; Padbury, James F

    2016-01-01

    Preeclampsia is one of the most common causes of fetal and maternal morbidity and mortality in the world. We built a Database for Preeclampsia (dbPEC) consisting of the clinical features, concurrent conditions, published literature and genes associated with Preeclampsia. We included gene sets associated with severity, concurrent conditions, tissue sources and networks. The published scientific literature is the primary repository for all information documenting human disease. We used semantic data mining to retrieve and extract the articles pertaining to preeclampsia-associated genes and performed manual curation. We deposited the articles, genes, preeclampsia phenotypes and other supporting information into the dbPEC. It is publicly available and freely accessible. Previously, we developed a database for preterm birth (dbPTB) using a similar approach. Using the gene sets in dbPTB, we were able to successfully analyze a genome-wide study of preterm birth including 4000 women and children. We identified important genes and pathways associated with preterm birth that were not otherwise demonstrable using genome-wide approaches. dbPEC serves not only as a resources for genes and articles associated with preeclampsia, it is a robust source of gene sets to analyze a wide range of high-throughput data for gene set enrichment analysis. Database URL: http://ptbdb.cs.brown.edu/dbpec/. PMID:26946289

  5. Toxic Substances Control Act (TSCA) test submissions database (TSCATS) - comprehensive update (on magnetic tape)

    SciTech Connect

    1996-04-01

    The Toxic Substances Control Act Test Submissions Database (TSCATS) was developed to make unpublished test data available to the public. The test data are submitted to the U.S. Environmental Protection Agency by industry under the Toxic Substances Control Act. Test data are broadly defined to include case reports, episodic incidents, such as spills, and formal test study presentations. The database allows searching of test submissions according to specific chemical identity or type of study when used with an appropriate search retrieval software program. Studies are indexed under three broad subject areas: health effects, environmental effects, and environmental fate. additional controlled vocabulary terms are assigned which describe the experimental protocol and test observations. Records identify reference information needed to locate the source document, the submitting organization, and reason for submission of the test data.

  6. Toxic Substances Control Act (TSCA) test submissions database (tscats) - comprehensive update (on magnetic tape). Data file

    SciTech Connect

    1995-12-31

    The Toxic Substances Control Act Test Submissions Database (TSCATS) was developed to make unpublished test data available to the public. The test data are submitted to the U.S. Environmental Protection Agency by industry under the Toxic Substances Control Act. Test data are broadly defined to include case reports, episodic incidents, such as spills, and formal test study presentations. The database allows searching of test submissions according to specific chemical identity or type of study when used with an appropriate search retrieval software program. Studies are indexed under three broad subject areas: health effects, environmental effects and environmental fate. Additional controlled vocabulary terms are assigned which describe the experimental protocol and test observations. Records identify reference information needed to locate the source document, as well as the submitting organization and reason for submission of the test data.

  7. Toxic Substances Control Act (TSCA) Test Submissions Database (TSCATS) - comprehensive update (on magnetic tape). Data file

    SciTech Connect

    1996-12-31

    The Toxic Substances Control Act Test Submissions Database (TSCATS) was developed to make unpublished test data available to the public. The test data are submitted to the U.S. Environmental Protection Agency by industry under the Toxic Substances Control Act. Test data are broadly defined to include case reports, episodic incidents, such as spills, and formal test study presentations. The database allows searching of test submissions according to specific chemical identity or type of study when used with an appropriate search retrieval software program. Studies are indexed under three broad subject areas: health effects, environmental effects and environmental fate. Additional controlled vocabulary terms are assigned which describe the experimental protocol and test observations. Records identify reference information needed to locate the source document, as well as the submitting organization and reason for submission of the test data.

  8. Toxic Substances Control Act (TSCA) test submissions database (TSCATS) - comprehensive update (on magnetic tape). Data file

    SciTech Connect

    1996-09-01

    The Toxic Substances Control Act Test Submissions Database (TSCATS) was developed to make unpublished test data available to the public. The test data are submitted to the U.S. Environmental Protection Agency by industry under the Toxic Substances Control Act. Test data are broadly defined to include case reports, episodic incidents, such as spills, and formal test study presentations. The database allows searching of test submissions according to specific chemical identity or type of study when used with an appropriate search retrieval software program. Studies are indexed under three broad subject areas: health effects, environmental effects, and environmental fate. Additional controlled vocabulary terms are assigned which describe the experimental protocol and test observations. Records identify reference information needed to locate the source document, as well as the submitting organization and reason for submission of the test data.

  9. PlantTFDB 2.0: update and improvement of the comprehensive plant transcription factor database.

    PubMed

    Zhang, He; Jin, Jinpu; Tang, Liang; Zhao, Yi; Gu, Xiaocheng; Gao, Ge; Luo, Jingchu

    2011-01-01

    We updated the plant transcription factor (TF) database to version 2.0 (PlantTFDB 2.0, http://planttfdb.cbi.pku.edu.cn) which contains 53,319 putative TFs predicted from 49 species. We made detailed annotation including general information, domain feature, gene ontology, expression pattern and ortholog groups, as well as cross references to various databases and literature citations for these TFs classified into 58 newly defined families with computational approach and manual inspection. Multiple sequence alignments and phylogenetic trees for each family can be shown as Weblogo pictures or downloaded as text files. We have redesigned the user interface in the new version. Users can search TFs with much more flexibility through the improved advanced search page, and the search results can be exported into various formats for further analysis. In addition, we now provide web service for advanced users to access PlantTFDB 2.0 more efficiently.

  10. Causal biological network database: a comprehensive platform of causal biological network models focused on the pulmonary and vascular systems

    PubMed Central

    Boué, Stéphanie; Talikka, Marja; Westra, Jurjen Willem; Hayes, William; Di Fabio, Anselmo; Park, Jennifer; Schlage, Walter K.; Sewer, Alain; Fields, Brett; Ansari, Sam; Martin, Florian; Veljkovic, Emilija; Kenney, Renee; Peitsch, Manuel C.; Hoeng, Julia

    2015-01-01

    With the wealth of publications and data available, powerful and transparent computational approaches are required to represent measured data and scientific knowledge in a computable and searchable format. We developed a set of biological network models, scripted in the Biological Expression Language, that reflect causal signaling pathways across a wide range of biological processes, including cell fate, cell stress, cell proliferation, inflammation, tissue repair and angiogenesis in the pulmonary and cardiovascular context. This comprehensive collection of networks is now freely available to the scientific community in a centralized web-based repository, the Causal Biological Network database, which is composed of over 120 manually curated and well annotated biological network models and can be accessed at http://causalbionet.com. The website accesses a MongoDB, which stores all versions of the networks as JSON objects and allows users to search for genes, proteins, biological processes, small molecules and keywords in the network descriptions to retrieve biological networks of interest. The content of the networks can be visualized and browsed. Nodes and edges can be filtered and all supporting evidence for the edges can be browsed and is linked to the original articles in PubMed. Moreover, networks may be downloaded for further visualization and evaluation. Database URL: http://causalbionet.com PMID:25887162

  11. Causal biological network database: a comprehensive platform of causal biological network models focused on the pulmonary and vascular systems.

    PubMed

    Boué, Stéphanie; Talikka, Marja; Westra, Jurjen Willem; Hayes, William; Di Fabio, Anselmo; Park, Jennifer; Schlage, Walter K; Sewer, Alain; Fields, Brett; Ansari, Sam; Martin, Florian; Veljkovic, Emilija; Kenney, Renee; Peitsch, Manuel C; Hoeng, Julia

    2015-01-01

    With the wealth of publications and data available, powerful and transparent computational approaches are required to represent measured data and scientific knowledge in a computable and searchable format. We developed a set of biological network models, scripted in the Biological Expression Language, that reflect causal signaling pathways across a wide range of biological processes, including cell fate, cell stress, cell proliferation, inflammation, tissue repair and angiogenesis in the pulmonary and cardiovascular context. This comprehensive collection of networks is now freely available to the scientific community in a centralized web-based repository, the Causal Biological Network database, which is composed of over 120 manually curated and well annotated biological network models and can be accessed at http://causalbionet.com. The website accesses a MongoDB, which stores all versions of the networks as JSON objects and allows users to search for genes, proteins, biological processes, small molecules and keywords in the network descriptions to retrieve biological networks of interest. The content of the networks can be visualized and browsed. Nodes and edges can be filtered and all supporting evidence for the edges can be browsed and is linked to the original articles in PubMed. Moreover, networks may be downloaded for further visualization and evaluation. Database URL: http://causalbionet.com

  12. Comprehensive coverage of cardiovascular disease data in the disease portals at the Rat Genome Database.

    PubMed

    Wang, Shur-Jen; Laulederkind, Stanley J F; Hayman, G Thomas; Petri, Victoria; Smith, Jennifer R; Tutaj, Marek; Nigam, Rajni; Dwinell, Melinda R; Shimoyama, Mary

    2016-08-01

    Cardiovascular diseases are complex diseases caused by a combination of genetic and environmental factors. To facilitate progress in complex disease research, the Rat Genome Database (RGD) provides the community with a disease portal where genome objects and biological data related to cardiovascular diseases are systematically organized. The purpose of this study is to present biocuration at RGD, including disease, genetic, and pathway data. The RGD curation team uses controlled vocabularies/ontologies to organize data curated from the published literature or imported from disease and pathway databases. These organized annotations are associated with genes, strains, and quantitative trait loci (QTLs), thus linking functional annotations to genome objects. Screen shots from the web pages are used to demonstrate the organization of annotations at RGD. The human cardiovascular disease genes identified by annotations were grouped according to data sources and their annotation profiles were compared by in-house tools and other enrichment tools available to the public. The analysis results show that the imported cardiovascular disease genes from ClinVar and OMIM are functionally different from the RGD manually curated genes in terms of pathway and Gene Ontology annotations. The inclusion of disease genes from other databases enriches the collection of disease genes not only in quantity but also in quality. PMID:27287925

  13. A User’s Guide to the Comprehensive Water Quality Database for Groundwater in the Vicinity of the Nevada Test Site, Rev. No.: 1

    SciTech Connect

    Farnham, Irene

    2006-09-01

    This water quality database (viz.GeochemXX.mdb) has been developed as part of the Underground Test Area (UGTA) Program with the cooperation of several agencies actively participating in ongoing evaluation and characterization activities under contract to the U.S. Department of Energy (DOE), National Nuclear Security Administration Nevada Site Office (NNSA/NSO). The database has been constructed to provide up-to-date, comprehensive, and quality controlled data in a uniform format for the support of current and future projects. This database provides a valuable tool for geochemical and hydrogeologic evaluations of the Nevada Test Site (NTS) and surrounding region. Chemistry data have been compiled for groundwater within the NTS and the surrounding region. These data include major ions, organic compounds, trace elements, radionuclides, various field parameters, and environmental isotopes. Colloid data are also included in the database. The GeochemXX.mdb database is distributed on an annual basis. The extension ''XX'' within the database title is replaced by the last two digits of the release year (e.g., Geochem06 for the version released during the 2006 fiscal year). The database is distributed via compact disc (CD) and is also uploaded to the Common Data Repository (CDR) in order to make it available to all agencies with DOE intranet access. This report provides an explanation of the database configuration and summarizes the general content and utility of the individual data tables. In addition to describing the data, subsequent sections of this report provide the data user with an explanation of the quality assurance/quality control (QA/QC) protocols for this database.

  14. EpiFactors: a comprehensive database of human epigenetic factors and complexes

    PubMed Central

    Medvedeva, Yulia A.; Lennartsson, Andreas; Ehsani, Rezvan; Kulakovskiy, Ivan V.; Vorontsov, Ilya E.; Panahandeh, Pouda; Khimulya, Grigory; Kasukawa, Takeya; Drabløs, Finn

    2015-01-01

    Epigenetics refers to stable and long-term alterations of cellular traits that are not caused by changes in the DNA sequence per se. Rather, covalent modifications of DNA and histones affect gene expression and genome stability via proteins that recognize and act upon such modifications. Many enzymes that catalyse epigenetic modifications or are critical for enzymatic complexes have been discovered, and this is encouraging investigators to study the role of these proteins in diverse normal and pathological processes. Rapidly growing knowledge in the area has resulted in the need for a resource that compiles, organizes and presents curated information to the researchers in an easily accessible and user-friendly form. Here we present EpiFactors, a manually curated database providing information about epigenetic regulators, their complexes, targets and products. EpiFactors contains information on 815 proteins, including 95 histones and protamines. For 789 of these genes, we include expressions values across several samples, in particular a collection of 458 human primary cell samples (for approximately 200 cell types, in many cases from three individual donors), covering most mammalian cell steady states, 255 different cancer cell lines (representing approximately 150 cancer subtypes) and 134 human postmortem tissues. Expression values were obtained by the FANTOM5 consortium using Cap Analysis of Gene Expression technique. EpiFactors also contains information on 69 protein complexes that are involved in epigenetic regulation. The resource is practical for a wide range of users, including biologists, pharmacologists and clinicians. Database URL: http://epifactors.autosome.ru PMID:26153137

  15. Chemical composition of ferromanganese crusts in the world ocean: a review and comprehensive database

    USGS Publications Warehouse

    Manheim, Frank T.; Lane-Bostwick, Candice M.

    1989-01-01

    A comprehensive database of chemical and mineralogical properties for ferromanganese crusts collected throughout the Atlantic, Pacific, and Indian Oceans, and has been assembled from published and unpublished sources which provide collection and analytical information for these samples. These crusts, their chemical compositions and natural distribution, have been a topic of interest to scientific research, as well as to industrial and military applications. Unlike abyssal ferromanganese nodules, which form in areas of low disturbance and high sediment accumulation, crusts have been found to contain three to five times more cobalt than these nodules, and can be found on harder, steeper substrates which can be too steep for permanent sediment accumulation. They have also been documented on seamounts and plateaus within the U.S. exclusive economic zone in both Pacific and Atlantic Oceans, and these are therefore of strategic importance to the United States Government, as well as to civilian mining and metallurgical industries. The data tables provided in this report have been digitized and previously uploaded to the National Oceanic and Atmospheric Administration National Geophysical Data Center in 1991 for online distribution, and were provided in plain text format. The 2014 update to the original U.S. Geological Survey open-file report published in 1989 provides these data tables in a slightly reformatted version to make them easier to ingest into geographic information system software, converted to shapefiles, and have completed metadata written and associated with them.

  16. Developing a comprehensive database management system for organization and evaluation of mammography datasets.

    PubMed

    Wu, Yirong; Rubin, Daniel L; Woods, Ryan W; Elezaby, Mai; Burnside, Elizabeth S

    2014-01-01

    We aimed to design and develop a comprehensive mammography database system (CMDB) to collect clinical datasets for outcome assessment and development of decision support tools. A Health Insurance Portability and Accountability Act (HIPAA) compliant CMDB was created to store multi-relational datasets of demographic risk factors and mammogram results using the Breast Imaging Reporting and Data System (BI-RADS) lexicon. The CMDB collected both biopsy pathology outcomes, in a breast pathology lexicon compiled by extending BI-RADS, and our institutional breast cancer registry. The audit results derived from the CMDB were in accordance with Mammography Quality Standards Act (MQSA) audits and national benchmarks. The CMDB has managed the challenges of multi-level organization demanded by the complexity of mammography practice and lexicon development in pathology. We foresee that the CMDB will be useful for efficient quality assurance audits and development of decision support tools to improve breast cancer diagnosis. Our procedure of developing the CMDB provides a framework to build a detailed data repository for breast imaging quality control and research, which has the potential to augment existing resources.

  17. Developing a Comprehensive Database Management System for Organization and Evaluation of Mammography Datasets

    PubMed Central

    Wu, Yirong; Rubin, Daniel L; Woods, Ryan W; Elezaby, Mai; Burnside, Elizabeth S

    2014-01-01

    We aimed to design and develop a comprehensive mammography database system (CMDB) to collect clinical datasets for outcome assessment and development of decision support tools. A Health Insurance Portability and Accountability Act (HIPAA) compliant CMDB was created to store multi-relational datasets of demographic risk factors and mammogram results using the Breast Imaging Reporting and Data System (BI-RADS) lexicon. The CMDB collected both biopsy pathology outcomes, in a breast pathology lexicon compiled by extending BI-RADS, and our institutional breast cancer registry. The audit results derived from the CMDB were in accordance with Mammography Quality Standards Act (MQSA) audits and national benchmarks. The CMDB has managed the challenges of multi-level organization demanded by the complexity of mammography practice and lexicon development in pathology. We foresee that the CMDB will be useful for efficient quality assurance audits and development of decision support tools to improve breast cancer diagnosis. Our procedure of developing the CMDB provides a framework to build a detailed data repository for breast imaging quality control and research, which has the potential to augment existing resources. PMID:25368510

  18. InDiaMed: A Comprehensive Database of Indian Medicinal plants for Diabetes

    PubMed Central

    Tota, Kumudini; Rayabarapu, Nihar; Moosa, Sowmya; Talla, Venu; Bhyravbhatla, Balaji; Rao, Srinivasa

    2013-01-01

    According to International Diabetes Federation (IDF), India has 62.4 million people with diabetes and by 2030 it is predicted that the number will rise to 100 million. Studies claim that there are around 410 experimentally proven Indian medicinal plants which have anti-diabetic activity, of which the mechanism of action of 109 plants has been elucidated or reported. So, the need of the hour is to explore the claims of Indian medicinal flora and open up the facets of many Indian plants which are being examined for their beneficial role in diabetes. So, we created a database (InDiaMed) of Indian medicinal plants that captures their role in anti-diabetic activity. InDiaMed's features include chemical, pharmacological, biochemical and geographical information of the medicinal plant, scientifically relevant information of the plant, and the coherent research done on it in the field of diabetes. The database also includes the list of poly-herbal formulations which are used for treatment of diabetes in India. Availability http://www.indiamed.info PMID:23750084

  19. The MiST2 database: a comprehensive genomics resource on microbial signal transduction

    PubMed Central

    Ulrich, Luke E.; Zhulin, Igor B.

    2010-01-01

    The MiST2 database (http://mistdb.com) identifies and catalogs the repertoire of signal transduction proteins in microbial genomes. Signal transduction systems regulate the majority of cellular activities including the metabolism, development, host-recognition, biofilm production, virulence, and antibiotic resistance of human pathogens. Thus, knowledge of the proteins and interactions that comprise these communication networks is an essential component to furthering biomedical discovery. These are identified by searching protein sequences for specific domain profiles that implicate a protein in signal transduction. Compared to the previous version of the database, MiST2 contains a host of new features and improvements including the following: draft genomes; extracytoplasmic function (ECF) sigma factor protein identification; enhanced classification of signaling proteins; novel, high-quality domain models for identifying histidine kinases and response regulators; neighboring two-component genes; gene cart; better search capabilities; enhanced taxonomy browser; advanced genome browser; and a modern, biologist-friendly web interface. MiST2 currently contains 966 complete and 157 draft bacterial and archaeal genomes, which collectively contain more than 245 000 signal transduction proteins. The majority (66%) of these are one-component systems, followed by two-component proteins (26%), chemotaxis (6%), and finally ECF factors (2%). PMID:19900966

  20. A comprehensive database of quality-rated fossil ages for Sahul's Quaternary vertebrates.

    PubMed

    Rodríguez-Rey, Marta; Herrando-Pérez, Salvador; Brook, Barry W; Saltré, Frédérik; Alroy, John; Beeton, Nicholas; Bird, Michael I; Cooper, Alan; Gillespie, Richard; Jacobs, Zenobia; Johnson, Christopher N; Miller, Gifford H; Prideaux, Gavin J; Roberts, Richard G; Turney, Chris S M; Bradshaw, Corey J A

    2016-07-19

    The study of palaeo-chronologies using fossil data provides evidence for past ecological and evolutionary processes, and is therefore useful for predicting patterns and impacts of future environmental change. However, the robustness of inferences made from fossil ages relies heavily on both the quantity and quality of available data. We compiled Quaternary non-human vertebrate fossil ages from Sahul published up to 2013. This, the FosSahul database, includes 9,302 fossil records from 363 deposits, for a total of 478 species within 215 genera, of which 27 are from extinct and extant megafaunal species (2,559 records). We also provide a rating of reliability of individual absolute age based on the dating protocols and association between the dated materials and the fossil remains. Our proposed rating system identified 2,422 records with high-quality ages (i.e., a reduction of 74%). There are many applications of the database, including disentangling the confounding influences of hypothetical extinction drivers, better spatial distribution estimates of species relative to palaeo-climates, and potentially identifying new areas for fossil discovery.

  1. A comprehensive database of quality-rated fossil ages for Sahul's Quaternary vertebrates.

    PubMed

    Rodríguez-Rey, Marta; Herrando-Pérez, Salvador; Brook, Barry W; Saltré, Frédérik; Alroy, John; Beeton, Nicholas; Bird, Michael I; Cooper, Alan; Gillespie, Richard; Jacobs, Zenobia; Johnson, Christopher N; Miller, Gifford H; Prideaux, Gavin J; Roberts, Richard G; Turney, Chris S M; Bradshaw, Corey J A

    2016-01-01

    The study of palaeo-chronologies using fossil data provides evidence for past ecological and evolutionary processes, and is therefore useful for predicting patterns and impacts of future environmental change. However, the robustness of inferences made from fossil ages relies heavily on both the quantity and quality of available data. We compiled Quaternary non-human vertebrate fossil ages from Sahul published up to 2013. This, the FosSahul database, includes 9,302 fossil records from 363 deposits, for a total of 478 species within 215 genera, of which 27 are from extinct and extant megafaunal species (2,559 records). We also provide a rating of reliability of individual absolute age based on the dating protocols and association between the dated materials and the fossil remains. Our proposed rating system identified 2,422 records with high-quality ages (i.e., a reduction of 74%). There are many applications of the database, including disentangling the confounding influences of hypothetical extinction drivers, better spatial distribution estimates of species relative to palaeo-climates, and potentially identifying new areas for fossil discovery. PMID:27434208

  2. A comprehensive database of quality-rated fossil ages for Sahul’s Quaternary vertebrates

    PubMed Central

    Rodríguez-Rey, Marta; Herrando-Pérez, Salvador; Brook, Barry W.; Saltré, Frédérik; Alroy, John; Beeton, Nicholas; Bird, Michael I.; Cooper, Alan; Gillespie, Richard; Jacobs, Zenobia; Johnson, Christopher N.; Miller, Gifford H.; Prideaux, Gavin J.; Roberts, Richard G.; Turney, Chris S.M.; Bradshaw, Corey J.A.

    2016-01-01

    The study of palaeo-chronologies using fossil data provides evidence for past ecological and evolutionary processes, and is therefore useful for predicting patterns and impacts of future environmental change. However, the robustness of inferences made from fossil ages relies heavily on both the quantity and quality of available data. We compiled Quaternary non-human vertebrate fossil ages from Sahul published up to 2013. This, the FosSahul database, includes 9,302 fossil records from 363 deposits, for a total of 478 species within 215 genera, of which 27 are from extinct and extant megafaunal species (2,559 records). We also provide a rating of reliability of individual absolute age based on the dating protocols and association between the dated materials and the fossil remains. Our proposed rating system identified 2,422 records with high-quality ages (i.e., a reduction of 74%). There are many applications of the database, including disentangling the confounding influences of hypothetical extinction drivers, better spatial distribution estimates of species relative to palaeo-climates, and potentially identifying new areas for fossil discovery. PMID:27434208

  3. The development of a comprehensive regional digital geospatial database for geothermal exploration in the West-Central Great Basin, U.S.A.

    SciTech Connect

    Nash, G.D.; Hernandez, M.W.

    1996-08-01

    The Earth Sciences and Resources Institute, Geomatics Laboratory, (EGL), Department of Civil and Environmental Engineering, University of Utah, has compiled and initiated the use of a comprehensive digital geospatial database for geothermal exploration in the west-central Great Basin, U.S.A. The database consists of hydrothermal alteration, lineament, recent earthquake, aeromagnetic, gravity, hot spring, mining district, heat flow, geochemical, and geologic unit digital maps in a GIS format. Several problems were encountered during the development of this geospatial database. These primarily included (1) data availability and coverage of the study area, (2) data accuracy (both spatial and thematic), (3) inconsistent data formats, and (4) a lack of metadata. These problems were all overcome satisfactorily. The data are currently being used to create a regional geothermal potential model for the study area.

  4. Feasibility of Including Green Tea Products for an Analytically Verified Dietary Supplement Database

    PubMed Central

    Saldanha, Leila; Dwyer, Johanna; Andrews, Karen; Betz, Joseph; Harnely, James; Pehrsson, Pamela; Rimmer, Catherine; Savarala, Sushma

    2015-01-01

    The Dietary Supplement Ingredient Database (DSID) is a federally funded, publicly accessible dietary supplement database that currently contains analytically-derived information on micronutrients in selected adult and children’s multivitamin and mineral (MVM) supplements. Other constituents in dietary supplement products such as botanicals are also of interest and thus are being considered for inclusion in the DSID. Thirty-eight constituents, mainly botanicals were identified and prioritized by a federal interagency committee. Green tea was selected from this list as the botanical for expansion of the DSID. This paper describes the process for prioritizing dietary ingredients in the DSID. It also discusses the criteria for inclusion of these ingredients, and the approach for selecting and testing products for the green tea pilot study. PMID:25817236

  5. Loops In Proteins (LIP)--a comprehensive loop database for homology modelling.

    PubMed

    Michalsky, E; Goede, A; Preissner, R

    2003-12-01

    One of the most important and challenging tasks in protein modelling is the prediction of loops, as can be seen in the large variety of existing approaches. Loops In Proteins (LIP) is a database that includes all protein segments of a length up to 15 residues contained in the Protein Data Bank (PDB). In this study, the applicability of LIP to loop prediction in the framework of homology modelling is investigated. Searching the database for loop candidates takes less than 1 s on a desktop PC, and ranking them takes a few minutes. This is an order of magnitude faster than most existing procedures. The measure of accuracy is the root mean square deviation (RMSD) with respect to the main-chain atoms after local superposition of target loop and predicted loop. Loops of up to nine residues length were modelled with a local RMSD <1 A and those of length up to 14 residues with an accuracy better than 2 A. The results were compared in detail with a thoroughly evaluated and tested ab initio method published recently and additionally with two further methods for a small loop test set. The LIP method produced very good predictions. In particular for longer loops it outperformed other methods.

  6. Development of a comprehensive Blast-Related Auditory Injury Database (BRAID).

    PubMed

    Joseph, Antony R; Horton, Jaime L; Clouser, Mary C; MacGregor, Andrew J; Louie, Michelle; Galarneau, Michael R

    2016-01-01

    The Department of Defense Hearing Conservation Program provides specific guidance for service components to prevent occupational hearing loss; however, it does not specifically contend with the unique noise exposures observed in the theater of war, such as blasts and explosions. In order to examine the effects of blast injury on hearing sensitivity, we developed a large database composed of demographic, audiometric, point of injury, and medical outcome data, with the primary aim of developing a long-standing and integrated capability for the surveillance, assessment, and investigation of blast-related hearing outcomes. Methods used to develop the dataset are described. Encompassing more than 16,500 Navy and Marine Corps personnel, the Blast-Related Auditory Injury Database (BRAID) includes individuals with a blast-related injury and nonblast control subjects. Using baseline and postdeployment hearing threshold data, a retrospective analysis of the cohort revealed that the rate of hearing loss for the injured servicemembers was 39%. The BRAID will be useful for studies that assess hearing patterns following deployment-related injury, such as blast exposures, that facilitate exploration of health outcomes and whether they are predictive of audiometric disposition and that help establish hearing loss prevention strategies and program policies for affected military commands and servicemembers.

  7. Development of a comprehensive Blast-Related Auditory Injury Database (BRAID).

    PubMed

    Joseph, Antony R; Horton, Jaime L; Clouser, Mary C; MacGregor, Andrew J; Louie, Michelle; Galarneau, Michael R

    2016-01-01

    The Department of Defense Hearing Conservation Program provides specific guidance for service components to prevent occupational hearing loss; however, it does not specifically contend with the unique noise exposures observed in the theater of war, such as blasts and explosions. In order to examine the effects of blast injury on hearing sensitivity, we developed a large database composed of demographic, audiometric, point of injury, and medical outcome data, with the primary aim of developing a long-standing and integrated capability for the surveillance, assessment, and investigation of blast-related hearing outcomes. Methods used to develop the dataset are described. Encompassing more than 16,500 Navy and Marine Corps personnel, the Blast-Related Auditory Injury Database (BRAID) includes individuals with a blast-related injury and nonblast control subjects. Using baseline and postdeployment hearing threshold data, a retrospective analysis of the cohort revealed that the rate of hearing loss for the injured servicemembers was 39%. The BRAID will be useful for studies that assess hearing patterns following deployment-related injury, such as blast exposures, that facilitate exploration of health outcomes and whether they are predictive of audiometric disposition and that help establish hearing loss prevention strategies and program policies for affected military commands and servicemembers. PMID:27273241

  8. The National Deep-Sea Coral and Sponge Database: A Comprehensive Resource for United States Deep-Sea Coral and Sponge Records

    NASA Astrophysics Data System (ADS)

    Dornback, M.; Hourigan, T.; Etnoyer, P.; McGuinn, R.; Cross, S. L.

    2014-12-01

    Research on deep-sea corals has expanded rapidly over the last two decades, as scientists began to realize their value as long-lived structural components of high biodiversity habitats and archives of environmental information. The NOAA Deep Sea Coral Research and Technology Program's National Database for Deep-Sea Corals and Sponges is a comprehensive resource for georeferenced data on these organisms in U.S. waters. The National Database currently includes more than 220,000 deep-sea coral records representing approximately 880 unique species. Database records from museum archives, commercial and scientific bycatch, and from journal publications provide baseline information with relatively coarse spatial resolution dating back as far as 1842. These data are complemented by modern, in-situ submersible observations with high spatial resolution, from surveys conducted by NOAA and NOAA partners. Management of high volumes of modern high-resolution observational data can be challenging. NOAA is working with our data partners to incorporate this occurrence data into the National Database, along with images and associated information related to geoposition, time, biology, taxonomy, environment, provenance, and accuracy. NOAA is also working to link associated datasets collected by our program's research, to properly archive them to the NOAA National Data Centers, to build a robust metadata record, and to establish a standard protocol to simplify the process. Access to the National Database is provided through an online mapping portal. The map displays point based records from the database. Records can be refined by taxon, region, time, and depth. The queries and extent used to view the map can also be used to download subsets of the database. The database, map, and website is already in use by NOAA, regional fishery management councils, and regional ocean planning bodies, but we envision it as a model that can expand to accommodate data on a global scale.

  9. A comprehensive database of the geographic spread of past human Ebola outbreaks

    PubMed Central

    Mylne, Adrian; Brady, Oliver J.; Huang, Zhi; Pigott, David M.; Golding, Nick; Kraemer, Moritz U.G.; Hay, Simon I.

    2014-01-01

    Ebola is a zoonotic filovirus that has the potential to cause outbreaks of variable magnitude in human populations. This database collates our existing knowledge of all known human outbreaks of Ebola for the first time by extracting details of their suspected zoonotic origin and subsequent human-to-human spread from a range of published and non-published sources. In total, 22 unique Ebola outbreaks were identified, composed of 117 unique geographic transmission clusters. Details of the index case and geographic spread of secondary and imported cases were recorded as well as summaries of patient numbers and case fatality rates. A brief text summary describing suspected routes and means of spread for each outbreak was also included. While we cannot yet include the ongoing Guinea and DRC outbreaks until they are over, these data and compiled maps can be used to gain an improved understanding of the initial spread of past Ebola outbreaks and help evaluate surveillance and control guidelines for limiting the spread of future epidemics. PMID:25984346

  10. Maine Exhibitions Assessment Project, September 2002-June 2004. Technical Criteria for Including Exhibition Assessments in Comprehensive Local Assessment Systems. Revised.

    ERIC Educational Resources Information Center

    Maine Department of Education, 2004

    2004-01-01

    In September 2002, the Maine Department of Education began organizing an Exhibition Assessment Advisory Committee whose purpose it was to consider meaningful ways to include exhibition assessments in comprehensive local assessment systems. School administrators were invited to participate and/or nominate practitioners in their districts to become…

  11. ECO: A Generic Eutrophication Model Including Comprehensive Sediment-Water Interaction

    PubMed Central

    Smits, Johannes G. C.; van Beek, Jan K. L.

    2013-01-01

    The content and calibration of the comprehensive generic 3D eutrophication model ECO for water and sediment quality is presented. Based on a computational grid for water and sediment, ECO is used as a tool for water quality management to simulate concentrations and mass fluxes of nutrients (N, P, Si), phytoplankton species, detrital organic matter, electron acceptors and related substances. ECO combines integral simulation of water and sediment quality with sediment diagenesis and closed mass balances. Its advanced process formulations for substances in the water column and the bed sediment were developed to allow for a much more dynamic calculation of the sediment-water exchange fluxes of nutrients as resulting from steep concentration gradients across the sediment-water interface than is possible with other eutrophication models. ECO is to more accurately calculate the accumulation of organic matter and nutrients in the sediment, and to allow for more accurate prediction of phytoplankton biomass and water quality in response to mitigative measures such as nutrient load reduction. ECO was calibrated for shallow Lake Veluwe (The Netherlands). Due to restoration measures this lake underwent a transition from hypertrophic conditions to moderately eutrophic conditions, leading to the extensive colonization by submerged macrophytes. ECO reproduces observed water quality well for the transition period of ten years. The values of its process coefficients are in line with ranges derived from literature. ECO’s calculation results underline the importance of redox processes and phosphate speciation for the nutrient return fluxes. Among other things, the results suggest that authigenic formation of a stable apatite-like mineral in the sediment can contribute significantly to oligotrophication of a lake after a phosphorus load reduction. PMID:23844160

  12. ECO: a generic eutrophication model including comprehensive sediment-water interaction.

    PubMed

    Smits, Johannes G C; van Beek, Jan K L

    2013-01-01

    The content and calibration of the comprehensive generic 3D eutrophication model ECO for water and sediment quality is presented. Based on a computational grid for water and sediment, ECO is used as a tool for water quality management to simulate concentrations and mass fluxes of nutrients (N, P, Si), phytoplankton species, detrital organic matter, electron acceptors and related substances. ECO combines integral simulation of water and sediment quality with sediment diagenesis and closed mass balances. Its advanced process formulations for substances in the water column and the bed sediment were developed to allow for a much more dynamic calculation of the sediment-water exchange fluxes of nutrients as resulting from steep concentration gradients across the sediment-water interface than is possible with other eutrophication models. ECO is to more accurately calculate the accumulation of organic matter and nutrients in the sediment, and to allow for more accurate prediction of phytoplankton biomass and water quality in response to mitigative measures such as nutrient load reduction. ECO was calibrated for shallow Lake Veluwe (The Netherlands). Due to restoration measures this lake underwent a transition from hypertrophic conditions to moderately eutrophic conditions, leading to the extensive colonization by submerged macrophytes. ECO reproduces observed water quality well for the transition period of ten years. The values of its process coefficients are in line with ranges derived from literature. ECO's calculation results underline the importance of redox processes and phosphate speciation for the nutrient return fluxes. Among other things, the results suggest that authigenic formation of a stable apatite-like mineral in the sediment can contribute significantly to oligotrophication of a lake after a phosphorus load reduction.

  13. Comprehensive evaluation of long-term trends in occupational exposure: Part 1. Description of the database

    PubMed Central

    Symanski, E.; Kupper, L. L.; Rappaport, S. M.

    1998-01-01

    OBJECTIVES: To conduct a comprehensive evaluation of long term changes in occupational exposure among a broad cross section of industries worldwide. METHODS: A review of the scientific literature identified studies that reported historical changes in exposure. About 700 sets of data from 119 published and several unpublished sources were compiled. Data were published over a 30 year period in 25 journals that spanned a range of disciplines. For each data set, the average exposure level was compiled for each period and details on the contaminant, the industry and location, changes in the threshold limit value (TLV), as well as the type of sampling method were recorded. Spearman rank correlation coefficients were used to identify monotonic changes in exposure over time and simple linear regression analyses were used to characterise trends in exposure. RESULTS: About 78% of the natural log transformed data showed linear trends towards lower exposure levels whereas 22% indicated increasing trends. (The Spearman rank correlation analyses produced a similar breakdown between exposures monotonically increasing or decreasing over time.) Although the rates of reduction for the data showing downward trends ranged from -1% to -62% per year, most exposures declined at rates between -4% and -14% per year (the interquartile range), with a median value of -8% per year. Exposures seemed to increase at rates that were slightly lower than those of exposures which have declined over time. Data sets that showed downward (versus upward) trends were influenced by several factors including type and carcinogenicity of the contaminant, type of monitoring, historical changes in the threshold limit values (TLVs), and period of sampling. CONCLUSIONS: This review supports the notion that occupational exposures are generally lower today than they were years or decades ago. However, such trends seem to have been affected by factors related to the contaminant, as well as to the period and type of

  14. ClonorESTdb: a comprehensive database for Clonorchis sinensis EST sequences

    PubMed Central

    2014-01-01

    Background Clonorchiasis, which is primarily caused by liver fluke (Platyhelminthes), is a fatal infectious disease that is mainly associated with bile duct malignancy and the subsequent development of cholangiocarcinoma. Thus, a genomic approach now represents an important step to further our knowledge of biology and the pathology of these parasites. The results of expressed sequence tags (ESTs) sequencing need to be well organized into databases to provide an integrated set of tools and functional information. Findings Here, the ClonorESTdb database represents a collection of Clonorchis sinensis ESTs that is intended as a resource for parasite functional genomics. A total of 55,736 successful EST sequences, which are cleaned and clustered into non-redundant 13,305 C. sinensis assembled EST sequences (6,497 clusters and 6,808 singletons), were obtained from three in-house prepared cDNA libraries of C. sinensis at different developmental stages. The assembled consensus sequences were annotated using the BLAST algorithm or/and hmm against NCBI NR, UniProt, KEGG and InterProScan. The ClonorESTdb database provides functional annotation, their expression profiles, tandem repeats and putative single nucleotide polymorphisms with utility tools such as local BLAST search and text retrieval. Conclusions This resource enables the researcher to identify and compare expression signatures under different biological stages and promotes ongoing parasite drug and vaccine development and biological research. Database URL: http://pathod.cdc.go.kr/clonorestdb/ PMID:24957044

  15. SU-E-T-153: Establish a Comprehensive Patient-Specific Plan QA Database for Instituitional Quality Control Program

    SciTech Connect

    Ding, X; Olszanski, A; Scheuermann, R; Bellerive, M; Solberg, T

    2014-06-01

    Purpose: Since the publication of TG-119, several new treatment techniques requiring new QA devices have been implemented in the clinic. To monitor and analyze our institutional QA performance, we have created a comprehensive QA database using Research Electronic Data Capture (REDCap). The database will also assist us in creating IMRT QA Analysis practice guidelines. Methods: Since Oct. 2013, 336 patient plan QA results were entered into the database. Plan parameters such as plan ID, treatment site, technique, energy, optimization constraints, modulation factor, leaf speed, leaf opening.etc were automatically extracted from the Varian Eclipse database to allow us to refine our evaluation and analysis method. Specific QA device, LINAC-related information, and measurement and analysis results were manually entered by the QA team. IMRT plans were measured using MapCHECK2 while RapidArc plans were measured using ArcCHECK. Distance-To-Agreement 3%/3mm without global maximum normalization was used. Results: The data indicates that different treatment techniques might benefit from a different site-specific action level(AL) depending on the complexity of the plan and optimization parameter used (e.g., breast IMRT QA= 97.8% and pelvis IMRT QA=93.1%). Different QA devices may also benefit from a different AL (MapCHECK2 = 94.1% while ArcCHECK = 83.0%). The relationship between the parameters and passing rate suggests that the complexity of each plan, characterized by leaf travel, leaf opening and modulation factor, affect the passing rate significantly. The database is reviewed regularly, and any abnormal point of the QA result or a trend of lower QA passing rate on a specific LINAC is further investigated. Conclusion: Establishing a comprehensive QA database provides an overview of the quality assurance program. It not only helps in answering the question “what is a reasonable and achievable standard for each institution”, but also saves time in monitoring and

  16. In-hospital mortality risk for total shoulder arthroplasty: A comprehensive review of the medicare database from 2005 to 2011

    PubMed Central

    McCormick, Frank; Nwachukwu, Benedict U.; Kiriakopoulos, Emmanouil B. S.; Schairer, William W.; Provencher, Matthew T.; Levy, Jonathan

    2015-01-01

    Introduction: The in-hospital mortality rate after total shoulder arthroplasty (TSA) is unknown. The purpose of this study is to quantify the in-patient mortality rates and associated demographic risk factors for patients undergoing a TSA from 2005 to 2011 using a comprehensive Medicare registry database. Materials and Methods: We conducted a retrospective review of the Medicare database within the PearlDiver database. The PearlDiver database is a publicly available Health Insurance Portability and Accountability Act-compliant national database that captures 100% of the Medicare hospital data for TSA between 2005 and 2011. Using International Classification of Diseases, Ninth Revision codes for TSA we identified a dataset of patients undergoing TSA as well as a subset of those for whom there was a death discharge (i.e., in-patient death). Risk for this outcome was further quantified by age, gender and year. Linear regression was performed to identify risk factors for the primary outcome. Results: A total of 101,323 patients underwent 125,813 TSAs between 2005 and 2011. There were 113 in-patient mortalities during this period. Thus the incidence of death was 0.09%. Increasing age was a significant risk factor for mortality (P = 0.03). Gender and year of procedure were not significant risk factors for mortality. Conclusion: The incidence of in-patient mortality for Medicare patients undergoing TSA between 2005 and 2011 was <1 in 1000 surgeries. Increased age is a significant predictor of mortality. Level 4: Retrospective analysis PMID:26622126

  17. Dental health services research utilizing comprehensive clinical databases and information technology.

    PubMed

    Hayden, W J

    1997-01-01

    Marketplace pressures for accountability in dentistry have made clear dental delivery systems' weaknesses in information generation, management, and analysis methods. Without this type of information, dentistry is unable to quantify and document the outcomes of the dental care services it provides. The Pew Health Professions Commission and the Institute of Medicine both suggest that dental schools should be among the leaders in the development and teaching of dental information capabilities, as well as the source of fundamental dental health services research. This paper argues that dental schools are the logical location for the development of valid, reliable, and acceptable health services research methods and databases. It describes the development of an insurance claims database to demonstrate the types of investigations possible, as well as the weaknesses and shortcomings of pure administrative data. PMID:9024342

  18. Characteristics of comprehensive Chemical Industry Database CD-NET : Centered around chemical product information file

    NASA Astrophysics Data System (ADS)

    Takano, Hideo

    This paper describes Chemical Product Information File of Chemical Industry Database, CD-NET provided by Chemical Data Service Inc.. It defines "information" first, then explains file organization and presents how Chemical product Information File is located in CD-NET. Mentioning its complementary relation with JICST's JOIS-F the author defines the File as chemical product information for business purpose. All of the information items in the File emphasize that it is exactly a type of business and practical database. To distinguish general items from important items by product, all of the information is categorized into II classes by general chemical product and by area. The scope and emphasized items under each class are described in detail.

  19. Comprehensive functional characterization of murine infantile Batten disease including Parkinson-like behavior and dopaminergic markers.

    PubMed

    Dearborn, Joshua T; Harmon, Steven K; Fowler, Stephen C; O'Malley, Karen L; Taylor, George T; Sands, Mark S; Wozniak, David F

    2015-01-01

    Infantile neuronal ceroid lipofuscinosis (INCL, Infantile Batten disease) is a neurodegenerative lysosomal storage disease caused by a deficiency in palmitoyl protein thioesterase-1 (PPT1). The PPT1-deficient mouse (Cln1(-/-)) is a useful phenocopy of human INCL. Cln1(-/-) mice display retinal dysfunction, seizures, motor deficits, and die at ~8 months of age. However, little is known about the cognitive and behavioral functions of Cln1(-/-) mice during disease progression. In the present study, younger (~1-2 months of age) Cln1(-/-) mice showed minor deficits in motor/sensorimotor functions while older (~5-6 months of age) Cln1(-/-) mice exhibited more severe impairments, including decreased locomotor activity, inferior cued water maze performance, decreased running wheel ability, and altered auditory cue conditioning. Unexpectedly, certain cognitive functions such as some learning and memory capabilities seemed intact in older Cln1(-/-) mice. Younger and older Cln1(-/-) mice presented with walking initiation defects, gait abnormalities, and slowed movements, which are analogous to some symptoms reported in INCL and parkinsonism. However, there was no evidence of alterations in dopaminergic markers in Cln1(-/-) mice. Results from this study demonstrate quantifiable changes in behavioral functions during progression of murine INCL and suggest that Parkinson-like motor/sensorimotor deficits in Cln1(-/-) mice are not mediated by dopamine deficiency. PMID:26238334

  20. CancerResource: a comprehensive database of cancer-relevant proteins and compound interactions supported by experimental knowledge.

    PubMed

    Ahmed, Jessica; Meinel, Thomas; Dunkel, Mathias; Murgueitio, Manuela S; Adams, Robert; Blasse, Corinna; Eckert, Andreas; Preissner, Saskia; Preissner, Robert

    2011-01-01

    During the development of methods for cancer diagnosis and treatment, a vast amount of information is generated. Novel cancer target proteins have been identified and many compounds that activate or inhibit cancer-relevant target genes have been developed. This knowledge is based on an immense number of experimentally validated compound-target interactions in the literature, and excerpts from literature text mining are spread over numerous data sources. Our own analysis shows that the overlap between important existing repositories such as Comparative Toxicogenomics Database (CTD), Therapeutic Target Database (TTD), Pharmacogenomics Knowledge Base (PharmGKB) and DrugBank as well as between our own literature mining for cancer-annotated entries is surprisingly small. In order to provide an easy overview of interaction data, it is essential to integrate this information into a single, comprehensive data repository. Here, we present CancerResource, a database that integrates cancer-relevant relationships of compounds and targets from (i) our own literature mining and (ii) external resources complemented with (iii) essential experimental and supporting information on genes and cellular effects. In order to facilitate an overview of existing and supporting information, a series of novel information connections have been established. CancerResource addresses the spectrum of research on compound-target interactions in natural sciences as well as in individualized medicine; CancerResource is available at: http://bioinformatics.charite.de/cancerresource/.

  1. Comprehensibility.

    ERIC Educational Resources Information Center

    Pettersson, Rune

    This paper addresses the difficulty involved in creating easily understood information. The act of communicating is not complete until the message has been both received and understood by the audience. Messages must always be comprehensible, otherwise they will have no effect. The readability, legibility, and reading value of a graphic message is…

  2. DenHunt - A Comprehensive Database of the Intricate Network of Dengue-Human Interactions

    PubMed Central

    Arjunan, Selvam; Sastri, Narayan P.; Chandra, Nagasuma

    2016-01-01

    Dengue virus (DENV) is a human pathogen and its etiology has been widely established. There are many interactions between DENV and human proteins that have been reported in literature. However, no publicly accessible resource for efficiently retrieving the information is yet available. In this study, we mined all publicly available dengue–human interactions that have been reported in the literature into a database called DenHunt. We retrieved 682 direct interactions of human proteins with dengue viral components, 382 indirect interactions and 4120 differentially expressed human genes in dengue infected cell lines and patients. We have illustrated the importance of DenHunt by mapping the dengue–human interactions on to the host interactome and observed that the virus targets multiple host functional complexes of important cellular processes such as metabolism, immune system and signaling pathways suggesting a potential role of these interactions in viral pathogenesis. We also observed that 7 percent of the dengue virus interacting human proteins are also associated with other infectious and non-infectious diseases. Finally, the understanding that comes from such analyses could be used to design better strategies to counteract the diseases caused by dengue virus. The whole dataset has been catalogued in a searchable database, called DenHunt (http://proline.biochem.iisc.ernet.in/DenHunt/). PMID:27618709

  3. DenHunt - A Comprehensive Database of the Intricate Network of Dengue-Human Interactions.

    PubMed

    Karyala, Prashanthi; Metri, Rahul; Bathula, Christopher; Yelamanchi, Syam K; Sahoo, Lipika; Arjunan, Selvam; Sastri, Narayan P; Chandra, Nagasuma

    2016-09-01

    Dengue virus (DENV) is a human pathogen and its etiology has been widely established. There are many interactions between DENV and human proteins that have been reported in literature. However, no publicly accessible resource for efficiently retrieving the information is yet available. In this study, we mined all publicly available dengue-human interactions that have been reported in the literature into a database called DenHunt. We retrieved 682 direct interactions of human proteins with dengue viral components, 382 indirect interactions and 4120 differentially expressed human genes in dengue infected cell lines and patients. We have illustrated the importance of DenHunt by mapping the dengue-human interactions on to the host interactome and observed that the virus targets multiple host functional complexes of important cellular processes such as metabolism, immune system and signaling pathways suggesting a potential role of these interactions in viral pathogenesis. We also observed that 7 percent of the dengue virus interacting human proteins are also associated with other infectious and non-infectious diseases. Finally, the understanding that comes from such analyses could be used to design better strategies to counteract the diseases caused by dengue virus. The whole dataset has been catalogued in a searchable database, called DenHunt (http://proline.biochem.iisc.ernet.in/DenHunt/). PMID:27618709

  4. AT_CHLORO, a Comprehensive Chloroplast Proteome Database with Subplastidial Localization and Curated Information on Envelope Proteins*

    PubMed Central

    Ferro, Myriam; Brugière, Sabine; Salvi, Daniel; Seigneurin-Berny, Daphné; Court, Magali; Moyet, Lucas; Ramus, Claire; Miras, Stéphane; Mellal, Mourad; Le Gall, Sophie; Kieffer-Jaquinod, Sylvie; Bruley, Christophe; Garin, Jérôme; Joyard, Jacques; Masselon, Christophe; Rolland, Norbert

    2010-01-01

    Recent advances in the proteomics field have allowed a series of high throughput experiments to be conducted on chloroplast samples, and the data are available in several public databases. However, the accurate localization of many chloroplast proteins often remains hypothetical. This is especially true for envelope proteins. We went a step further into the knowledge of the chloroplast proteome by focusing, in the same set of experiments, on the localization of proteins in the stroma, the thylakoids, and envelope membranes. LC-MS/MS-based analyses first allowed building the AT_CHLORO database (http://www.grenoble.prabi.fr/protehome/grenoble-plant-proteomics/), a comprehensive repertoire of the 1323 proteins, identified by 10,654 unique peptide sequences, present in highly purified chloroplasts and their subfractions prepared from Arabidopsis thaliana leaves. This database also provides extensive proteomics information (peptide sequences and molecular weight, chromatographic retention times, MS/MS spectra, and spectral count) for a unique chloroplast protein accurate mass and time tag database gathering identified peptides with their respective and precise analytical coordinates, molecular weight, and retention time. We assessed the partitioning of each protein in the three chloroplast compartments by using a semiquantitative proteomics approach (spectral count). These data together with an in-depth investigation of the literature were compiled to provide accurate subplastidial localization of previously known and newly identified proteins. A unique knowledge base containing extensive information on the proteins identified in envelope fractions was thus obtained, allowing new insights into this membrane system to be revealed. Altogether, the data we obtained provide unexpected information about plastidial or subplastidial localization of some proteins that were not suspected to be associated to this membrane system. The spectral counting-based strategy was further

  5. GenoBase: comprehensive resource database of Escherichia coli K-12.

    PubMed

    Otsuka, Yuta; Muto, Ai; Takeuchi, Rikiya; Okada, Chihiro; Ishikawa, Motokazu; Nakamura, Koichiro; Yamamoto, Natsuko; Dose, Hitomi; Nakahigashi, Kenji; Tanishima, Shigeki; Suharnan, Sivasundaram; Nomura, Wataru; Nakayashiki, Toru; Aref, Walid G; Bochner, Barry R; Conway, Tyrrell; Gribskov, Michael; Kihara, Daisuke; Rudd, Kenneth E; Tohsato, Yukako; Wanner, Barry L; Mori, Hirotada

    2015-01-01

    Comprehensive experimental resources, such as ORFeome clone libraries and deletion mutant collections, are fundamental tools for elucidation of gene function. Data sets by omics analysis using these resources provide key information for functional analysis, modeling and simulation both in individual and systematic approaches. With the long-term goal of complete understanding of a cell, we have over the past decade created a variety of clone and mutant sets for functional genomics studies of Escherichia coli K-12. We have made these experimental resources freely available to the academic community worldwide. Accordingly, these resources have now been used in numerous investigations of a multitude of cell processes. Quality control is extremely important for evaluating results generated by these resources. Because the annotation has been changed since 2005, which we originally used for the construction, we have updated these genomic resources accordingly. Here, we describe GenoBase (http://ecoli.naist.jp/GB/), which contains key information about comprehensive experimental resources of E. coli K-12, their quality control and several omics data sets generated using these resources.

  6. GenoBase: comprehensive resource database of Escherichia coli K-12.

    PubMed

    Otsuka, Yuta; Muto, Ai; Takeuchi, Rikiya; Okada, Chihiro; Ishikawa, Motokazu; Nakamura, Koichiro; Yamamoto, Natsuko; Dose, Hitomi; Nakahigashi, Kenji; Tanishima, Shigeki; Suharnan, Sivasundaram; Nomura, Wataru; Nakayashiki, Toru; Aref, Walid G; Bochner, Barry R; Conway, Tyrrell; Gribskov, Michael; Kihara, Daisuke; Rudd, Kenneth E; Tohsato, Yukako; Wanner, Barry L; Mori, Hirotada

    2015-01-01

    Comprehensive experimental resources, such as ORFeome clone libraries and deletion mutant collections, are fundamental tools for elucidation of gene function. Data sets by omics analysis using these resources provide key information for functional analysis, modeling and simulation both in individual and systematic approaches. With the long-term goal of complete understanding of a cell, we have over the past decade created a variety of clone and mutant sets for functional genomics studies of Escherichia coli K-12. We have made these experimental resources freely available to the academic community worldwide. Accordingly, these resources have now been used in numerous investigations of a multitude of cell processes. Quality control is extremely important for evaluating results generated by these resources. Because the annotation has been changed since 2005, which we originally used for the construction, we have updated these genomic resources accordingly. Here, we describe GenoBase (http://ecoli.naist.jp/GB/), which contains key information about comprehensive experimental resources of E. coli K-12, their quality control and several omics data sets generated using these resources. PMID:25399415

  7. Citrus sinensis annotation project (CAP): a comprehensive database for sweet orange genome.

    PubMed

    Wang, Jia; Chen, Dijun; Lei, Yang; Chang, Ji-Wei; Hao, Bao-Hai; Xing, Feng; Li, Sen; Xu, Qiang; Deng, Xiu-Xin; Chen, Ling-Ling

    2014-01-01

    Citrus is one of the most important and widely grown fruit crop with global production ranking firstly among all the fruit crops in the world. Sweet orange accounts for more than half of the Citrus production both in fresh fruit and processed juice. We have sequenced the draft genome of a double-haploid sweet orange (C. sinensis cv. Valencia), and constructed the Citrus sinensis annotation project (CAP) to store and visualize the sequenced genomic and transcriptome data. CAP provides GBrowse-based organization of sweet orange genomic data, which integrates ab initio gene prediction, EST, RNA-seq and RNA-paired end tag (RNA-PET) evidence-based gene annotation. Furthermore, we provide a user-friendly web interface to show the predicted protein-protein interactions (PPIs) and metabolic pathways in sweet orange. CAP provides comprehensive information beneficial to the researchers of sweet orange and other woody plants, which is freely available at http://citrus.hzau.edu.cn/.

  8. Human transporter database: comprehensive knowledge and discovery tools in the human transporter genes.

    PubMed

    Ye, Adam Y; Liu, Qing-Rong; Li, Chuan-Yun; Zhao, Min; Qu, Hong

    2014-01-01

    Transporters are essential in homeostatic exchange of endogenous and exogenous substances at the systematic, organic, cellular, and subcellular levels. Gene mutations of transporters are often related to pharmacogenetics traits. Recent developments in high throughput technologies on genomics, transcriptomics and proteomics allow in depth studies of transporter genes in normal cellular processes and diverse disease conditions. The flood of high throughput data have resulted in urgent need for an updated knowledgebase with curated, organized, and annotated human transporters in an easily accessible way. Using a pipeline with the combination of automated keywords query, sequence similarity search and manual curation on transporters, we collected 1,555 human non-redundant transporter genes to develop the Human Transporter Database (HTD) (http://htd.cbi.pku.edu.cn). Based on the extensive annotations, global properties of the transporter genes were illustrated, such as expression patterns and polymorphisms in relationships with their ligands. We noted that the human transporters were enriched in many fundamental biological processes such as oxidative phosphorylation and cardiac muscle contraction, and significantly associated with Mendelian and complex diseases such as epilepsy and sudden infant death syndrome. Overall, HTD provides a well-organized interface to facilitate research communities to search detailed molecular and genetic information of transporters for development of personalized medicine.

  9. Human Transporter Database: Comprehensive Knowledge and Discovery Tools in the Human Transporter Genes

    PubMed Central

    Ye, Adam Y.; Liu, Qing-Rong; Li, Chuan-Yun; Zhao, Min; Qu, Hong

    2014-01-01

    Transporters are essential in homeostatic exchange of endogenous and exogenous substances at the systematic, organic, cellular, and subcellular levels. Gene mutations of transporters are often related to pharmacogenetics traits. Recent developments in high throughput technologies on genomics, transcriptomics and proteomics allow in depth studies of transporter genes in normal cellular processes and diverse disease conditions. The flood of high throughput data have resulted in urgent need for an updated knowledgebase with curated, organized, and annotated human transporters in an easily accessible way. Using a pipeline with the combination of automated keywords query, sequence similarity search and manual curation on transporters, we collected 1,555 human non-redundant transporter genes to develop the Human Transporter Database (HTD) (http://htd.cbi.pku.edu.cn). Based on the extensive annotations, global properties of the transporter genes were illustrated, such as expression patterns and polymorphisms in relationships with their ligands. We noted that the human transporters were enriched in many fundamental biological processes such as oxidative phosphorylation and cardiac muscle contraction, and significantly associated with Mendelian and complex diseases such as epilepsy and sudden infant death syndrome. Overall, HTD provides a well-organized interface to facilitate research communities to search detailed molecular and genetic information of transporters for development of personalized medicine. PMID:24558441

  10. CoDNaS 2.0: a comprehensive database of protein conformational diversity in the native state

    PubMed Central

    Monzon, Alexander Miguel; Rohr, Cristian Oscar; Fornasari, María Silvina; Parisi, Gustavo

    2016-01-01

    CoDNaS (conformational diversity of the native state) is a protein conformational diversity database. Conformational diversity describes structural differences between conformers that define the native state of proteins. It is a key concept to understand protein function and biological processes related to protein functions. CoDNaS offers a well curated database that is experimentally driven, thoroughly linked, and annotated. CoDNaS facilitates the extraction of key information on small structural differences based on protein movements. CoDNaS enables users to easily relate the degree of conformational diversity with physical, chemical and biological properties derived from experiments on protein structure and biological characteristics. The new version of CoDNaS includes ∼70% of all available protein structures, and new tools have been added that run sequence searches, display structural flexibility profiles and allow users to browse the database for different structural classes. These tools facilitate the exploration of protein conformational diversity and its role in protein function. Database URL: http://ufq.unq.edu.ar/codnas PMID:27022160

  11. CoDNaS 2.0: a comprehensive database of protein conformational diversity in the native state.

    PubMed

    Monzon, Alexander Miguel; Rohr, Cristian Oscar; Fornasari, María Silvina; Parisi, Gustavo

    2016-01-01

    CoDNaS (conformational diversity of the native state) is a protein conformational diversity database. Conformational diversity describes structural differences between conformers that define the native state of proteins. It is a key concept to understand protein function and biological processes related to protein functions. CoDNaS offers a well curated database that is experimentally driven, thoroughly linked, and annotated. CoDNaS facilitates the extraction of key information on small structural differences based on protein movements. CoDNaS enables users to easily relate the degree of conformational diversity with physical, chemical and biological properties derived from experiments on protein structure and biological characteristics. The new version of CoDNaS includes ∼70% of all available protein structures, and new tools have been added that run sequence searches, display structural flexibility profiles and allow users to browse the database for different structural classes. These tools facilitate the exploration of protein conformational diversity and its role in protein function. Database URL:http://ufq.unq.edu.ar/codnas. PMID:27022160

  12. MalaCards: A Comprehensive Automatically-Mined Database of Human Diseases.

    PubMed

    Rappaport, Noa; Twik, Michal; Nativ, Noam; Stelzer, Gil; Bahir, Iris; Stein, Tsippi Iny; Safran, Marilyn; Lancet, Doron

    2014-09-08

    Systems medicine provides insights into mechanisms of human diseases, and expedites the development of better diagnostics and drugs. To facilitate such strategies, we initiated MalaCards, a compendium of human diseases and their annotations, integrating and often remodeling information from 64 data sources. MalaCards employs, among others, the proven automatic data-mining strategies established in the construction of GeneCards, our widely used compendium of human genes. The development of MalaCards poses many algorithmic challenges, such as disease name unification, integrated classification, gene-disease association, and disease-targeted expression analysis. MalaCards displays a Web card for each of >19,000 human diseases, with 17 sections, including textual summaries, related diseases, related genes, genetic variations and tests, and relevant publications. Also included are a powerful search engine and a variety of categorized disease lists. This unit describes two basic protocols to search and browse MalaCards effectively.

  13. Toxic Substances Control Act (TSCA) Test Submissions Database (TSCATS) - comprehensive update (on cd-rom). Data file

    SciTech Connect

    1997-03-01

    The Toxic Substances Control Act Test Submissions Database (TSCATS) was developed to make unpublished test data available to the public. The test data are submitted to the U.S. Environmental Protection Agency by industry under the Toxic Substances Control Act. Test data are broadly defined to include case reports, episodic incidents, such as spills, and formal test study presentations. The database allows searching of test submissions according to specific chemical identity or type of study when used with an appropriate search retrieval software program. Studies are indexed under three broad subject areas: health effects, environmental effects and environmental fate. Additional controlled vocabulary terms are assigned which describe the experimental protocol and test observations. Records identify reference information needed to locate the source document, as well as the submitting organization and reason for submission of the test data.

  14. Automated Identification of Medically Important Bacteria by 16S rRNA Gene Sequencing Using a Novel Comprehensive Database, 16SpathDB▿

    PubMed Central

    Woo, Patrick C. Y.; Teng, Jade L. L.; Yeung, Juilian M. Y.; Tse, Herman; Lau, Susanna K. P.; Yuen, Kwok-Yung

    2011-01-01

    Despite the increasing use of 16S rRNA gene sequencing, interpretation of 16S rRNA gene sequence results is one of the most difficult problems faced by clinical microbiologists and technicians. To overcome the problems we encountered in the existing databases during 16S rRNA gene sequence interpretation, we built a comprehensive database, 16SpathDB (http://147.8.74.24/16SpathDB) based on the 16S rRNA gene sequences of all medically important bacteria listed in the Manual of Clinical Microbiology and evaluated its use for automated identification of these bacteria. Among 91 nonduplicated bacterial isolates collected in our clinical microbiology laboratory, 71 (78%) were reported by 16SpathDB as a single bacterial species having >98.0% nucleotide identity with the query sequence, 19 (20.9%) were reported as more than one bacterial species having >98.0% nucleotide identity with the query sequence, and 1 (1.1%) was reported as no match. For the 71 bacterial isolates reported as a single bacterial species, all results were identical to their true identities as determined by a polyphasic approach. For the 19 bacterial isolates reported as more than one bacterial species, all results contained their true identities as determined by a polyphasic approach and all of them had their true identities as the “best match in 16SpathDB.” For the isolate (Gordonibacter pamelaeae) reported as no match, the bacterium has never been reported to be associated with human disease and was not included in the Manual of Clinical Microbiology. 16SpathDB is an automated, user-friendly, efficient, accurate, and regularly updated database for 16S rRNA gene sequence interpretation in clinical microbiology laboratories. PMID:21389154

  15. Comprehensive mollusk acute toxicity database improves the use of Interspecies Correlation Estimation (ICE) models to predict toxicity of untested freshwater and endangered mussel species

    EPA Science Inventory

    Interspecies correlation estimation (ICE) models extrapolate acute toxicity data from surrogate test species to untested taxa. A suite of ICE models developed from a comprehensive database is available on the US Environmental Protection Agency’s web-based application, Web-I...

  16. Subject and authorship of records related to the Organization for Tropical Studies (OTS) in BINABITROP, a comprehensive database about Costa Rican biology.

    PubMed

    Monge-Nájera, Julián; Nielsen-Muñoz, Vanessa; Azofeifa-Mora, Ana Beatriz

    2013-06-01

    BINABITROP is a bibliographical database of more than 38000 records about the ecosystems and organisms of Costa Rica. In contrast with commercial databases, such as Web of Knowledge and Scopus, which exclude most of the scientific journals published in tropical countries, BINABITROP is a comprehensive record of knowledge on the tropical ecosystems and organisms of Costa Rica. We analyzed its contents in three sites (La Selva, Palo Verde and Las Cruces) and recorded scientific field, taxonomic group and authorship. We found that most records dealt with ecology and systematics, and that most authors published only one article in the study period (1963-2011). Most research was published in four journals: Biotropica, Revista de Biología Tropical/ International Journal of Tropical Biology and Conservation, Zootaxa and Brenesia. This may be the first study of a such a comprehensive database for any case of tropical biology literature.

  17. Virus Pathogen Database and Analysis Resource (ViPR): A Comprehensive Bioinformatics Database and Analysis Resource for the Coronavirus Research Community

    PubMed Central

    Pickett, Brett E.; Greer, Douglas S.; Zhang, Yun; Stewart, Lucy; Zhou, Liwei; Sun, Guangyu; Gu, Zhiping; Kumar, Sanjeev; Zaremba, Sam; Larsen, Christopher N.; Jen, Wei; Klem, Edward B.; Scheuermann, Richard H.

    2012-01-01

    Several viruses within the Coronaviridae family have been categorized as either emerging or re-emerging human pathogens, with Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV) being the most well known. The NIAID-sponsored Virus Pathogen Database and Analysis Resource (ViPR, www.viprbrc.org) supports bioinformatics workflows for a broad range of human virus pathogens and other related viruses, including the entire Coronaviridae family. ViPR provides access to sequence records, gene and protein annotations, immune epitopes, 3D structures, host factor data, and other data types through an intuitive web-based search interface. Records returned from these queries can then be subjected to web-based analyses including: multiple sequence alignment, phylogenetic inference, sequence variation determination, BLAST comparison, and metadata-driven comparative genomics statistical analysis. Additional tools exist to display multiple sequence alignments, view phylogenetic trees, visualize 3D protein structures, transfer existing reference genome annotations to new genomes, and store or share results from any search or analysis within personal private ‘Workbench’ spaces for future access. All of the data and integrated analysis and visualization tools in ViPR are made available without charge as a service to the Coronaviridae research community to facilitate the research and development of diagnostics, prophylactics, vaccines and therapeutics against these human pathogens. PMID:23202522

  18. SUPERSITES INTEGRATED RELATIONAL DATABASE (SIRD)

    EPA Science Inventory

    As part of EPA's Particulate Matter (PM) Supersites Program (Program), the University of Maryland designed and developed the Supersites Integrated Relational Database (SIRD). Measurement data in SIRD include comprehensive air quality data from the 7 Supersite program locations f...

  19. Zinc Finger Database (ZiFDB) v2.0: a comprehensive database of C₂H₂ zinc fingers and engineered zinc finger arrays.

    PubMed

    Fu, Fengli; Voytas, Daniel F

    2013-01-01

    ZiFDB (Zinc Finger Database, http://zifdb.msi.umn.edu) is a web-accessible database that houses information on individual C(2)H(2) zinc fingers (ZFs) and engineered zinc finger arrays (ZFAs). ZiFDB serves as a resource for biologists interested in engineering ZFAs for use as sequence-specific DNA-binding reagents. Here, we describe four new features of ZiFDB: (i) the database allows users to input new ZFs and ZFAs; (ii) a shadow database temporarily stores user-submitted data, pending approval by the database curator and subsequent loading into the persistent database; (iii) ZiFDB contains 181 Context-Dependent Assembly (CoDA) ZFAs, which were generated by this newly described ZFA engineering platform; and (iv) the database also now contains 319 F1F2 CoDA units and 334 F2F3 CoDA units that can be used to construct CoDA arrays. In total, the new release of ZiFDB contains 1226 ZFs and 1123 ZFAs.

  20. Comprehensive mathematical simulation of functional magnetic resonance imaging time series including motion-related image distortion and spin saturation effect.

    PubMed

    Kim, Boklye; Yeo, Desmond T B; Bhagalia, Roshni

    2008-02-01

    There has been vast interest in determining the feasibility of functional magnetic resonance imaging (fMRI) as an accurate method of imaging brain function for patient evaluations. The assessment of fMRI as an accurate tool for activation localization largely depends on the software used to process the time series data. The performance evaluation of different analysis tools is not reliable unless truths in motion and activation are known. Lack of valid truths has been the limiting factor for comparisons of different algorithms. Until now, currently available phantom data do not include comprehensive accounts of head motion. While most fMRI studies assume no interslice motion during the time series acquisition in fMRI data acquired using a multislice and single-shot echo-planar imaging sequence, each slice is subject to a different set of motion parameters. In this study, in addition to known three-dimensional motion parameters applied to each slice, included in the time series computation are geometric distortion from field inhomogeneity and spin saturation effect as a result of out-of-plane head motion. We investigated the effect of these head motion-related artifacts and present a validation of the mapping slice-to-volume (MSV) algorithm for motion correction and activation detection against the known truths. MSV was evaluated, and showed better performance in comparison with other widely used fMRI data processing software, which corrects for head motion with a volume-to-volume realignment method. Furthermore, improvement in signal detection was observed with the implementation of the geometric distortion correction and spin saturation effect compensation features in MSV. PMID:17662548

  1. 24 CFR 968.320 - HUD review and approval of comprehensive plan (including five-year action plan).

    Code of Federal Regulations, 2012 CFR

    2012-04-01

    ...) Contradiction of local government certification or PHA resolution. (c) Effect of HUD approval of Comprehensive... of significant changes (such as changes in scope of work or whenever it moves items within...

  2. Potash: a global overview of evaporate-related potash resources, including spatial databases of deposits, occurrences, and permissive tracts: Chapter S in Global mineral resource assessment

    USGS Publications Warehouse

    Orris, Greta J.; Cocker, Mark D.; Dunlap, Pamela; Wynn, Jeff C.; Spanski, Gregory T.; Briggs, Deborah A.; Gass, Leila; Bliss, James D.; Bolm, Karen S.; Yang, Chao; Lipin, Bruce R.; Ludington, Stephen; Miller, Robert J.; Slowakiewicz, Miroslaw

    2014-01-01

    This report describes a global, evaporite-related potash deposits and occurrences database and a potash tracts database. Chapter 1 summarizes potash resource history and use. Chapter 2 describes a global potash deposits and occurrences database, which contains more than 900 site records. Chapter 3 describes a potash tracts database, which contains 84 tracts with geology permissive for the presence of evaporite-hosted potash resources, including areas with active evaporite-related potash production, areas with known mineralization that has not been quantified or exploited, and areas with potential for undiscovered potash resources. Chapter 4 describes geographic information system (GIS) data files that include (1) potash deposits and occurrences data, (2) potash tract data, (3) reference databases for potash deposit and tract data, and (4) representative graphics of geologic features related to potash tracts and deposits. Summary descriptive models for stratabound potash-bearing salt and halokinetic potash-bearing salt are included in appendixes A and B, respectively. A glossary of salt- and potash-related terms is contained in appendix C and a list of database abbreviations is given in appendix D. Appendix E describes GIS data files, and appendix F is a guide to using the geodatabase.

  3. Spanish public awareness regarding DNA profile databases in forensic genetics: what type of DNA profiles should be included?

    PubMed Central

    Gamero, Joaquín J; Romero, Jose‐Luis; Peralta, Juan‐Luis; Carvalho, Mónica; Corte‐Real, Francisco

    2007-01-01

    The importance of non‐codifying DNA polymorphism for the administration of justice is now well known. In Spain, however, this type of test has given rise to questions in recent years: (a) Should consent be obtained before biological samples are taken from an individual for DNA analysis? (b) Does society perceive these techniques and methods of analysis as being reliable? (c) There appears to be lack of knowledge concerning the basic norms that regulate databases containing private or personal information and the protection that information of this type must be given. This opinion survey and the subsequent analysis of the results in ethical terms may serve to reveal the criteria and the degree of information that society has with regard to DNA databases. In the study, 73.20% (SE 1.12%) of the population surveyed was in favour of specific legislation for computer files in which DNA analysis results for forensic purposes are stored. PMID:17906059

  4. Alaska Geochemical Database, Version 2.0 (AGDB2)--including “best value” data compilations for rock, sediment, soil, mineral, and concentrate sample media

    USGS Publications Warehouse

    Granitto, Matthew; Schmidt, Jeanine; Shew, Nora B.; Gamble, Bruce M.; Labay, Keith A.

    2013-01-01

    The Alaska Geochemical Database Version 2.0 (AGDB2) contains new geochemical data compilations in which each geologic material sample has one “best value” determination for each analyzed species, greatly improving speed and efficiency of use. Like the Alaska Geochemical Database (AGDB, http://pubs.usgs.gov/ds/637/) before it, the AGDB2 was created and designed to compile and integrate geochemical data from Alaska in order to facilitate geologic mapping, petrologic studies, mineral resource assessments, definition of geochemical baseline values and statistics, environmental impact assessments, and studies in medical geology. This relational database, created from the Alaska Geochemical Database (AGDB) that was released in 2011, serves as a data archive in support of present and future Alaskan geologic and geochemical projects, and contains data tables in several different formats describing historical and new quantitative and qualitative geochemical analyses. The analytical results were determined by 85 laboratory and field analytical methods on 264,095 rock, sediment, soil, mineral and heavy-mineral concentrate samples. Most samples were collected by U.S. Geological Survey personnel and analyzed in U.S. Geological Survey laboratories or, under contracts, in commercial analytical laboratories. These data represent analyses of samples collected as part of various U.S. Geological Survey programs and projects from 1962 through 2009. In addition, mineralogical data from 18,138 nonmagnetic heavy-mineral concentrate samples are included in this database. The AGDB2 includes historical geochemical data originally archived in the U.S. Geological Survey Rock Analysis Storage System (RASS) database, used from the mid-1960s through the late 1980s and the U.S. Geological Survey PLUTO database used from the mid-1970s through the mid-1990s. All of these data are currently maintained in the National Geochemical Database (NGDB). Retrievals from the NGDB were used to generate

  5. No identifiable Hb1Ac or lifestyle change after a comprehensive diabetes programme including motivational interviewing: A cluster randomised trial

    PubMed Central

    2013-01-01

    Abstract Objective To study the effectiveness of a comprehensive diabetes programme in general practice that integrates patient-centred lifestyle counselling into structured diabetes care. Design and setting Cluster randomised trial in general practices. Intervention Nurse-led structured diabetes care with a protocol, record keeping, reminders, and feedback, plus training in motivational interviewing and agenda setting. Subjects Primary care nurses in 58 general practices and their 940 type 2 diabetes patients with an HbA1c concentration above 7%, and a body mass index (BMI) above 25 kg/m2. Main outcome measures HbA1c, diet, and physical activity (medical records and patient questionnaires). Results Multilevel linear and logistic regression analyses adjusted for baseline outcomes showed that despite active nurse participation in the intervention, the comprehensive programme was no more effective than usual care after 14 months, as shown by HbA1c levels (difference between groups = 0.13; CI 20.8–0.35) and diet (fat (difference between groups = 0.19; CI 20.82–1.21); vegetables (difference between groups = 0.10; CI-0.21–0.41); fruit (difference between groups = 20.02; CI 20.26–0.22)), and physical activity (difference between groups = 21.15; CI 212.26–9.97), or any of the other measures of clinical parameters, patient's readiness to change, or quality of life. Conclusion A comprehensive programme that integrated lifestyle counselling based on motivational interviewing principles integrated into structured diabetes care did not alter HbA1c or the lifestyle related to diet and physical activity. We thus question the impact of motivational interviewing in terms of its ability to improve routine diabetes care in general practice. PMID:23659710

  6. Comprehensive analysis of the N-glycan biosynthetic pathway using bioinformatics to generate UniCorn: A theoretical N-glycan structure database.

    PubMed

    Akune, Yukie; Lin, Chi-Hung; Abrahams, Jodie L; Zhang, Jingyu; Packer, Nicolle H; Aoki-Kinoshita, Kiyoko F; Campbell, Matthew P

    2016-08-01

    Glycan structures attached to proteins are comprised of diverse monosaccharide sequences and linkages that are produced from precursor nucleotide-sugars by a series of glycosyltransferases. Databases of these structures are an essential resource for the interpretation of analytical data and the development of bioinformatics tools. However, with no template to predict what structures are possible the human glycan structure databases are incomplete and rely heavily on the curation of published, experimentally determined, glycan structure data. In this work, a library of 45 human glycosyltransferases was used to generate a theoretical database of N-glycan structures comprised of 15 or less monosaccharide residues. Enzyme specificities were sourced from major online databases including Kyoto Encyclopedia of Genes and Genomes (KEGG) Glycan, Consortium for Functional Glycomics (CFG), Carbohydrate-Active enZymes (CAZy), GlycoGene DataBase (GGDB) and BRENDA. Based on the known activities, more than 1.1 million theoretical structures and 4.7 million synthetic reactions were generated and stored in our database called UniCorn. Furthermore, we analyzed the differences between the predicted glycan structures in UniCorn and those contained in UniCarbKB (www.unicarbkb.org), a database which stores experimentally described glycan structures reported in the literature, and demonstrate that UniCorn can be used to aid in the assignment of ambiguous structures whilst also serving as a discovery database.

  7. Glia Open Access Database (GOAD): A comprehensive gene expression encyclopedia of glia cells in health and disease.

    PubMed

    Holtman, Inge R; Noback, Michiel; Bijlsma, Marieke; Duong, Kim N; van der Geest, Marije A; Ketelaars, Peer T; Brouwer, Nieske; Vainchtein, Ilia D; Eggen, Bart J L; Boddeke, Hendrikus W G M

    2015-09-01

    Recently, the number of genome-wide transcriptome profiles of pure populations of glia cells has drastically increased, resulting in an unprecedented amount of data that offer opportunities to study glia phenotypes and functions in health and disease. To make genome-wide transcriptome data easily accessible, we developed the Glia Open Access Database (GOAD), available via www.goad.education. GOAD contains a collection of previously published and unpublished transcriptome data, including datasets from isolated microglia, astrocytes and oligodendrocytes both at homeostatic and pathological conditions. It contains an intuitive web-based interface that consists of three features that enable searching, browsing, analyzing, and downloading of the data. The first feature is differential gene expression (DE) analysis that provides genes that are significantly up and down-regulated with the associated fold changes and p-values between two conditions of interest. In addition, an interactive Venn diagram is generated to illustrate the overlap and differences between several DE gene lists. The second feature is quantitative gene expression (QE) analysis, to investigate which genes are expressed in a particular glial cell type and to what degree. The third feature is a search utility, which can be used to find a gene of interest and depict its expression in all available expression data sets by generating a gene card. In addition, quality guidelines and relevant concepts for transcriptome analysis are discussed. Finally, GOAD is discussed in relation to several online transcriptome tools developed in neuroscience and immunology. In conclusion, GOAD is a unique platform to facilitate integration of bioinformatics in glia biology.

  8. New Comprehensive Cytogenetic Scoring System for Primary Myelodysplastic Syndromes (MDS) and Oligoblastic Acute Myeloid Leukemia After MDS Derived From an International Database Merge

    PubMed Central

    Schanz, Julie; Tüchler, Heinz; Solé, Francesc; Mallo, Mar; Luño, Elisa; Cervera, José; Granada, Isabel; Hildebrandt, Barbara; Slovak, Marilyn L.; Ohyashiki, Kazuma; Steidl, Christian; Fonatsch, Christa; Pfeilstöcker, Michael; Nösslinger, Thomas; Valent, Peter; Giagounidis, Aristoteles; Aul, Carlo; Lübbert, Michael; Stauder, Reinhard; Krieger, Otto; Garcia-Manero, Guillermo; Faderl, Stefan; Pierce, Sherry; Le Beau, Michelle M.; Bennett, John M.; Greenberg, Peter; Germing, Ulrich; Haase, Detlef

    2012-01-01

    Purpose The karyotype is a strong independent prognostic factor in myelodysplastic syndromes (MDS). Since the implementation of the International Prognostic Scoring System (IPSS) in 1997, knowledge concerning the prognostic impact of abnormalities has increased substantially. The present study proposes a new and comprehensive cytogenetic scoring system based on an international data collection of 2,902 patients. Patients and Methods Patients were included from the German-Austrian MDS Study Group (n = 1,193), the International MDS Risk Analysis Workshop (n = 816), the Spanish Hematological Cytogenetics Working Group (n = 849), and the International Working Group on MDS Cytogenetics (n = 44) databases. Patients with primary MDS and oligoblastic acute myeloid leukemia (AML) after MDS treated with supportive care only were evaluated for overall survival (OS) and AML evolution. Internal validation by bootstrap analysis and external validation in an independent patient cohort were performed to confirm the results. Results In total, 19 cytogenetic categories were defined, providing clear prognostic classification in 91% of all patients. The abnormalities were classified into five prognostic subgroups (P < .001): very good (median OS, 61 months; hazard ratio [HR], 0.5; n = 81); good (49 months; HR, 1.0 [reference category]; n = 1,809); intermediate (26 months; HR, 1.6; n = 529); poor (16 months; HR, 2.6; n = 148); and very poor (6 months; HR, 4.2; n = 187). The internal and external validations confirmed the results of the score. Conclusion In conclusion, these data should contribute to the ongoing efforts to update the IPSS by refining the cytogenetic risk categories. PMID:22331955

  9. Antiphospholipid antibody testing for the antiphospholipid syndrome: a comprehensive practical review including a synopsis of challenges and recent guidelines.

    PubMed

    Favaloro, Emmanuel J; Wong, Richard C W

    2014-10-01

    The antiphospholipid (antibody) syndrome (APS) is an autoimmune condition characterised by a wide range of clinical features, but primarily identified as thrombotic and/or obstetric related adverse events. APS is associated with the presence of antiphospholipid antibodies (aPL), including the so-called lupus anticoagulant (LA). These aPL are heterogeneous in nature, detected with varying sensitivity and specificity by a diverse range of laboratory tests. All these tests are unfortunately imperfect, suffer from poor assay reproducibility (inter-method and inter-laboratory) and a lack of standardisation and harmonisation. Clinicians and laboratory personnel may struggle to keep abreast of these factors, as well as the expanding range of available aPL tests, and consequent result interpretation. Therefore, APS remains a significant diagnostic challenge for many clinicians across a wide range of clinical specialities, due to these issues related to laboratory testing as well as the ever-expanding range of reported clinical manifestations. This review is primarily focussed on issues related to laboratory testing for APS in regards to the currently available assays, and summarises recent international consensus guidelines for aPL testing, both for the liquid phase functional LA assays and the solid phase assays (anticardiolipin and anti-beta-2-Glycoprotein-I).

  10. The CATH domain structure database: new protocols and classification levels give a more comprehensive resource for exploring evolution.

    PubMed

    Greene, Lesley H; Lewis, Tony E; Addou, Sarah; Cuff, Alison; Dallman, Tim; Dibley, Mark; Redfern, Oliver; Pearl, Frances; Nambudiry, Rekha; Reid, Adam; Sillitoe, Ian; Yeats, Corin; Thornton, Janet M; Orengo, Christine A

    2007-01-01

    We report the latest release (version 3.0) of the CATH protein domain database (http://www.cathdb.info). There has been a 20% increase in the number of structural domains classified in CATH, up to 86 151 domains. Release 3.0 comprises 1110 fold groups and 2147 homologous superfamilies. To cope with the increases in diverse structural homologues being determined by the structural genomics initiatives, more sensitive methods have been developed for identifying boundaries in multi-domain proteins and for recognising homologues. The CATH classification update is now being driven by an integrated pipeline that links these automated procedures with validation steps, that have been made easier by the provision of information rich web pages summarising comparison scores and relevant links to external sites for each domain being classified. An analysis of the population of domains in the CATH hierarchy and several domain characteristics are presented for version 3.0. We also report an update of the CATH Dictionary of homologous structures (CATH-DHS) which now contains multiple structural alignments, consensus information and functional annotations for 1459 well populated superfamilies in CATH. CATH is directly linked to the Gene3D database which is a projection of CATH structural data onto approximately 2 million sequences in completed genomes and UniProt.

  11. Prospective validation of a comprehensive in silico hERG model and its applications to commercial compound and drug databases.

    PubMed

    Doddareddy, Munikumar R; Klaasse, Elisabeth C; Shagufta; Ijzerman, Adriaan P; Bender, Andreas

    2010-05-01

    Ligand-based in silico hERG models were generated for 2 644 compounds using linear discriminant analysis (LDA) and support vector machines (SVM). As a result, the dataset used for the model generation is the largest publicly available (see Supporting Information). Extended connectivity fingerprints (ECFPs) and functional class fingerprints (FCFPs) were used to describe chemical space. All models showed area under curve (AUC) values ranging from 0.89 to 0.94 in a fivefold cross-validation, indicating high model consistency. Models correctly predicted 80 % of an additional, external test set; Y-scrambling was also performed to rule out chance correlation. Additionally models based on patch clamp data and radioligand binding data were generated separately to analyze their predictive ability when compared to combined models. To experimentally validate the models, 50 of the predicted hERG blockers from the Chembridge database and ten of the predicted non-hERG blockers from an in-house compound library were selected for biological evaluation. Out of those 50 predicted hERG blockers, tested at a concentration of 10 microM, 18 compounds showed more than 50 % displacement of [(3)H]astemizole binding to cell membranes expressing the hERG channel. K(i) values of four of the selected binders were determined to be in the micromolar and high nanomolar range (K(i) (VH01)=2.0 microM, K(i) (VH06)=0.15 microM, K(i) (VH19)=1.1 microM and K(i) (VH47)=18 microM). Of these four compounds, VH01 and VH47 showed also a second, even higher affinity binding site with K(i) values of 7.4 nM and 36 nM, respectively. In the case of non-hERG blockers, all ten compounds tested were found to be inactive, showing less than 50 % displacement of [(3)H]astemizole binding at 10 microM. These experimentally validated models were then used to virtually screen commercial compound databases to evaluate whether they contain hERG blockers. 109 784 (23 %) of Chembridge, 133 175 (38 %) of Chemdiv, 111 737 (31

  12. Isotopic reconstruction of ancient human migrations: A comprehensive Sr isotope reference database for France and the first case study at Tumulus de Sables, south-western France

    NASA Astrophysics Data System (ADS)

    Willmes, M.; Boel, C.; Grün, R.; Armstrong, R.; Chancerel, A.; Maureille, B.; Courtaud, P.

    2012-04-01

    Strontium isotope ratios (87Sr/86Sr) can be used for the reconstruction of human and animal migrations across geologically different terrains. Sr isotope ratios in rocks are a product of age and composition and thus vary between geologic units. From the eroding environment Sr is transported into the soils, plants and rivers of a region. Humans and animals incorporate Sr from their diet into their bones and teeth, where it substitutes for calcium. Tooth enamel contains Sr isotope signatures acquired during childhood and is most resistant to weathering and overprinting, while the dentine is often diagenetically altered towards the local Sr signature. For the reconstruction of human and animal migrations the tooth enamel 87Sr/86Sr ratio is compared to the Sr isotope signature in the vicinity of the burial site and the surrounding area. This study focuses on the establishment of a comprehensive reference map of bioavailable 87Sr/86Sr ratios for France. In a next step we will compare human and animal teeth from key archaeological sites to this reference map to investigate mobility. So far, we have analysed plant and soil samples from ~200 locations across France including the Aquitaine basin, the western and northern parts of the Paris basin, as well as three transects through the Pyrenees Mountains. The isotope data, geologic background information (BRGM 1:1M), field images, and detailed method descriptions are available through our online database iRhum (http://rses.anu.edu.au/research/ee). This database can also be used in forensic studies and food sciences. As an archaeological case study teeth from 16 adult and 8 juvenile individuals were investigated from an early Bell Beaker (2500-2000 BC) site at Le Tumulus des Sables, south-west France (Gironde). The teeth were analysed for Sr isotope ratios using laser ablation ICP-MS. Four teeth were also analysed using solution ICP-MS, which showed a significant offset to the laser ablation results. This requires further

  13. Expanding the MTM1 mutational spectrum: novel variants including the first multi-exonic duplication and development of a locus-specific database.

    PubMed

    Oliveira, Jorge; Oliveira, Márcia E; Kress, Wolfram; Taipa, Ricardo; Pires, Manuel Melo; Hilbert, Pascale; Baxter, Peter; Santos, Manuela; Buermans, Henk; den Dunnen, Johan T; Santos, Rosário

    2013-05-01

    Myotubular myopathy (MIM#310400), the X-linked form of Centronuclear myopathy (CNM) is mainly characterized by neonatal hypotonia and inability to maintain unassisted respiration. The MTM1 gene, responsible for this disease, encodes myotubularin - a lipidic phosphatase involved in vesicle trafficking regulation and maturation. Recently, it was shown that myotubularin interacts with desmin, being a major regulator of intermediate filaments. We report the development of a locus-specific database for MTM1 using the Leiden Open Variation database software (http://www.lovd.nl/MTM1), with data collated for 474 mutations identified in 472 patients (by June 2012). Among the entries are a total of 25 new mutations, including a large deletion encompassing introns 2-15. During database implementation it was noticed that no large duplications had been reported. We tested a group of eight uncharacterized CNM patients for this specific type of mutation, by multiple ligation-dependent probe amplification (MLPA) analysis. A large duplication spanning exons 1-5 was identified in a boy with a mild phenotype, with results pointing toward possible somatic mosaicism. Further characterization revealed that this duplication causes an in-frame deletion at the mRNA level (r.343_444del). Results obtained with a next generation sequencing approach suggested that the duplication extends into the neighboring MAMLD1 gene and subsequent cDNA analysis detected the presence of a MTM1/MAMLD1 fusion transcript. A complex rearrangement involving the duplication of exon 10 has since been reported, with detection also enabled by MLPA analysis. It is thus conceivable that large duplications in MTM1 may account for a number of CNM cases that have remained genetically unresolved.

  14. A Comprehensive Software and Database Management System for Glomerular Filtration Rate Estimation by Radionuclide Plasma Sampling and Serum Creatinine Methods.

    PubMed

    Jha, Ashish Kumar

    2015-01-01

    Glomerular filtration rate (GFR) estimation by plasma sampling method is considered as the gold standard. However, this method is not widely used because the complex technique and cumbersome calculations coupled with the lack of availability of user-friendly software. The routinely used Serum Creatinine method (SrCrM) of GFR estimation also requires the use of online calculators which cannot be used without internet access. We have developed user-friendly software "GFR estimation software" which gives the options to estimate GFR by plasma sampling method as well as SrCrM. We have used Microsoft Windows(®) as operating system and Visual Basic 6.0 as the front end and Microsoft Access(®) as database tool to develop this software. We have used Russell's formula for GFR calculation by plasma sampling method. GFR calculations using serum creatinine have been done using MIRD, Cockcroft-Gault method, Schwartz method, and Counahan-Barratt methods. The developed software is performing mathematical calculations correctly and is user-friendly. This software also enables storage and easy retrieval of the raw data, patient's information and calculated GFR for further processing and comparison. This is user-friendly software to calculate the GFR by various plasma sampling method and blood parameter. This software is also a good system for storing the raw and processed data for future analysis.

  15. A Comprehensive Software and Database Management System for Glomerular Filtration Rate Estimation by Radionuclide Plasma Sampling and Serum Creatinine Methods.

    PubMed

    Jha, Ashish Kumar

    2015-01-01

    Glomerular filtration rate (GFR) estimation by plasma sampling method is considered as the gold standard. However, this method is not widely used because the complex technique and cumbersome calculations coupled with the lack of availability of user-friendly software. The routinely used Serum Creatinine method (SrCrM) of GFR estimation also requires the use of online calculators which cannot be used without internet access. We have developed user-friendly software "GFR estimation software" which gives the options to estimate GFR by plasma sampling method as well as SrCrM. We have used Microsoft Windows(®) as operating system and Visual Basic 6.0 as the front end and Microsoft Access(®) as database tool to develop this software. We have used Russell's formula for GFR calculation by plasma sampling method. GFR calculations using serum creatinine have been done using MIRD, Cockcroft-Gault method, Schwartz method, and Counahan-Barratt methods. The developed software is performing mathematical calculations correctly and is user-friendly. This software also enables storage and easy retrieval of the raw data, patient's information and calculated GFR for further processing and comparison. This is user-friendly software to calculate the GFR by various plasma sampling method and blood parameter. This software is also a good system for storing the raw and processed data for future analysis. PMID:26097422

  16. A Comprehensive Software and Database Management System for Glomerular Filtration Rate Estimation by Radionuclide Plasma Sampling and Serum Creatinine Methods

    PubMed Central

    Jha, Ashish Kumar

    2015-01-01

    Glomerular filtration rate (GFR) estimation by plasma sampling method is considered as the gold standard. However, this method is not widely used because the complex technique and cumbersome calculations coupled with the lack of availability of user-friendly software. The routinely used Serum Creatinine method (SrCrM) of GFR estimation also requires the use of online calculators which cannot be used without internet access. We have developed user-friendly software “GFR estimation software” which gives the options to estimate GFR by plasma sampling method as well as SrCrM. We have used Microsoft Windows® as operating system and Visual Basic 6.0 as the front end and Microsoft Access® as database tool to develop this software. We have used Russell's formula for GFR calculation by plasma sampling method. GFR calculations using serum creatinine have been done using MIRD, Cockcroft-Gault method, Schwartz method, and Counahan-Barratt methods. The developed software is performing mathematical calculations correctly and is user-friendly. This software also enables storage and easy retrieval of the raw data, patient's information and calculated GFR for further processing and comparison. This is user-friendly software to calculate the GFR by various plasma sampling method and blood parameter. This software is also a good system for storing the raw and processed data for future analysis. PMID:26097422

  17. Nuclear Science References Database

    SciTech Connect

    Pritychenko, B.; Běták, E.; Singh, B.; Totans, J.

    2014-06-15

    The Nuclear Science References (NSR) database together with its associated Web interface, is the world's only comprehensive source of easily accessible low- and intermediate-energy nuclear physics bibliographic information for more than 210,000 articles since the beginning of nuclear science. The weekly-updated NSR database provides essential support for nuclear data evaluation, compilation and research activities. The principles of the database and Web application development and maintenance are described. Examples of nuclear structure, reaction and decay applications are specifically included. The complete NSR database is freely available at the websites of the National Nuclear Data Center (http://www.nndc.bnl.gov/nsr) and the International Atomic Energy Agency (http://www-nds.iaea.org/nsr)

  18. The CEBAF Element Database and Related Operational Software

    SciTech Connect

    Larrieu, Theodore; Slominski, Christopher; Keesee, Marie; Turner, Dennison; Joyce, Michele

    2015-09-01

    The newly commissioned 12GeV CEBAF accelerator relies on a flexible, scalable and comprehensive database to define the accelerator. This database delivers the configuration for CEBAF operational tools, including hardware checkout, the downloadable optics model, control screens, and much more. The presentation will describe the flexible design of the CEBAF Element Database (CED), its features and assorted use case examples.

  19. THE ECOTOX DATABASE

    EPA Science Inventory

    The database provides chemical-specific toxicity information for aquatic life, terrestrial plants, and terrestrial wildlife. ECOTOX is a comprehensive ecotoxicology database and is therefore essential for providing and suppoirting high quality models needed to estimate population...

  20. DEG 10, an update of the database of essential genes that includes both protein-coding genes and noncoding genomic elements.

    PubMed

    Luo, Hao; Lin, Yan; Gao, Feng; Zhang, Chun-Ting; Zhang, Ren

    2014-01-01

    The combination of high-density transposon-mediated mutagenesis and high-throughput sequencing has led to significant advancements in research on essential genes, resulting in a dramatic increase in the number of identified prokaryotic essential genes under diverse conditions and a revised essential-gene concept that includes all essential genomic elements, rather than focusing on protein-coding genes only. DEG 10, a new release of the Database of Essential Genes (available at http://www.essentialgene.org), has been developed to accommodate these quantitative and qualitative advancements. In addition to increasing the number of bacterial and archaeal essential genes determined by genome-wide gene essentiality screens, DEG 10 also harbors essential noncoding RNAs, promoters, regulatory sequences and replication origins. These essential genomic elements are determined not only in vitro, but also in vivo, under diverse conditions including those for survival, pathogenesis and antibiotic resistance. We have developed customizable BLAST tools that allow users to perform species- and experiment-specific BLAST searches for a single gene, a list of genes, annotated or unannotated genomes. Therefore, DEG 10 includes essential genomic elements under different conditions in three domains of life, with customizable BLAST tools.

  1. Characteristics and comprehensiveness of adult HIV care and treatment programmes in Asia-Pacific, sub-Saharan Africa and the Americas: results of a site assessment conducted by the International epidemiologic Databases to Evaluate AIDS (IeDEA) Collaboration

    PubMed Central

    Duda, Stephany N; Farr, Amanda M; Lindegren, Mary Lou; Blevins, Meridith; Wester, C William; Wools-Kaloustian, Kara; Ekouevi, Didier K; Egger, Matthias; Hemingway-Foday, Jennifer; Cooper, David A; Moore, Richard D; McGowan, Catherine C; Nash, Denis

    2014-01-01

    Introduction HIV care and treatment programmes worldwide are transforming as they push to deliver universal access to essential prevention, care and treatment services to persons living with HIV and their communities. The characteristics and capacity of these HIV programmes affect patient outcomes and quality of care. Despite the importance of ensuring optimal outcomes, few studies have addressed the capacity of HIV programmes to deliver comprehensive care. We sought to describe such capacity in HIV programmes in seven regions worldwide. Methods Staff from 128 sites in 41 countries participating in the International epidemiologic Databases to Evaluate AIDS completed a site survey from 2009 to 2010, including sites in the Asia-Pacific region (n=20), Latin America and the Caribbean (n=7), North America (n=7), Central Africa (n=12), East Africa (n=51), Southern Africa (n=16) and West Africa (n=15). We computed a measure of the comprehensiveness of care based on seven World Health Organization-recommended essential HIV services. Results Most sites reported serving urban (61%; region range (rr): 33–100%) and both adult and paediatric populations (77%; rr: 29–96%). Only 45% of HIV clinics that reported treating children had paediatricians on staff. As for the seven essential services, survey respondents reported that CD4+ cell count testing was available to all but one site, while tuberculosis (TB) screening and community outreach services were available in 80 and 72%, respectively. The remaining four essential services – nutritional support (82%), combination antiretroviral therapy adherence support (88%), prevention of mother-to-child transmission (PMTCT) (94%) and other prevention and clinical management services (97%) – were uniformly available. Approximately half (46%) of sites reported offering all seven services. Newer sites and sites in settings with low rankings on the UN Human Development Index (HDI), especially those in the President's Emergency Plan

  2. Color Comprehension and Color Categories among Blind Students: A Multi-Sensory Approach in Implementing Concrete Language to Include All Students in Advanced Writing Classes

    ERIC Educational Resources Information Center

    Antarasena, Salinee

    2009-01-01

    This study investigates teaching methods regarding color comprehension and color categorization among blind students, as compared to their non-blind peers and whether they understand and represent the same color comprehension and color categories. Then after digit codes for color comprehension teaching and assistive technology for the blind had…

  3. Risk of mortality (including sudden cardiac death) and major cardiovascular events in atypical and typical antipsychotic users: a study with the general practice research database.

    PubMed

    Murray-Thomas, Tarita; Jones, Meghan E; Patel, Deven; Brunner, Elizabeth; Shatapathy, Chetan C; Motsko, Stephen; Van Staa, Tjeerd P

    2013-01-01

    Objective. Antipsychotics have been associated with increased cardiac events including mortality. This study assessed cardiac events including mortality among antipsychotic users relative to nonusers. Methods. The General Practice Research Database (GPRD) was used to identify antipsychotic users, matched general population controls, and psychiatric diseased nonusers. Outcomes included cardiac mortality, sudden cardiac death (SCD), all-cause mortality (excluding suicide), coronary heart disease (CHD), and ventricular arrhythmias (VA). Sensitivity analyses were conducted for age, dose, duration, antipsychotic type, and psychiatric disease. Results. 183,392 antipsychotic users (115,491 typical and 67,901 atypical), 544,726 general population controls, and 193,920 psychiatric nonusers were identified. Nonusers with schizophrenia, dementia, or bipolar disorder had increased risks of all-cause mortality compared to general population controls, while nonusers with major depression had comparable risks. Relative to psychiatric nonusers, the adjusted relative ratios (aRR) of all-cause mortality in antipsychotic users was 1.75 (95% CI: 1.64-1.87); cardiac mortality 1.72 (95% CI: 1.42-2.07); SCD primary definition 5.76 (95% CI: 2.90-11.45); SCD secondary definition 2.15 (95% CI: 1.64-2.81); CHD 1.16 (95% CI: 0.94-1.44); and VA 1.16 (95% CI: 1.02-1.31). aRRs of the various outcomes were lower for atypical versus typical antipsychotics (all-cause mortality 0.83 (95% CI: 0.80-0.85); cardiac mortality 0.89 (95% CI: 0.82-0.97); and SCD secondary definition 0.76 (95% CI: 0.55-1.04). Conclusions. Antipsychotic users had an increased risk of cardiac mortality, all-cause mortality, and SCD compared to a psychiatric nonuser cohort.

  4. The Comprehensive, Powerful, Academic Database (CPAD): An Evaluative Study of a Predictive Tool Designed for Elementary School Personnel in Identifying At-Risk Students through Progress, Curriculum, and Performance Monitoring

    ERIC Educational Resources Information Center

    Chavez-Gibson, Sarah

    2013-01-01

    The purpose of this study is to exam in-depth, the Comprehensive, Powerful, Academic Database (CPAD), a data decision-making tool that determines and identifies students at-risk of dropping out of school, and how the CPAD assists administrators and teachers at an elementary campus to monitor progress, curriculum, and performance to improve student…

  5. Marine and Hydrokinetic Technology Database

    DOE Data Explorer

    DOE’s Marine and Hydrokinetic Technology Database provides up-to-date information on marine and hydrokinetic renewable energy, both in the U.S. and around the world. The database includes wave, tidal, current, and ocean thermal energy, and contains information on the various energy conversion technologies, companies active in the field, and development of projects in the water. Depending on the needs of the user, the database can present a snapshot of projects in a given region, assess the progress of a certain technology type, or provide a comprehensive view of the entire marine and hydrokinetic energy industry. Results are displayed as a list of technologies, companies, or projects. Data can be filtered by a number of criteria, including country/region, technology type, generation capacity, and technology or project stage. The database was updated in 2009 to include ocean thermal energy technologies, companies, and projects.

  6. Caenorhabditis elegans par2.1/mtssb-1 is essential for mitochondrial DNA replication and its defect causes comprehensive transcriptional alterations including a hypoxia response

    SciTech Connect

    Sugimoto, Tomoko; Mori, Chihiro; Takanami, Takako; Sasagawa, Yohei; Saito, Rumiko; Ichiishi, Eiichiro; Higashitani, Atsushi

    2008-01-01

    DNA polymerase {gamma} and mtSSB are key components of the mtDNA replication machinery. To study the biological influences of defects in mtDNA replication, we used RNAi to deplete the gene for a putative mtSSB, par2.1, in Caenorhabditis elegans. In previous systematic RNAi screens, downregulation of this gene has not caused any clearly defective phenotypes. Here, we continuously fed a dsRNA targeting par2.1 to C. elegans over generations. Seventy-nine percent of F1 progeny produced 60-72 h after feeding grew to adulthood but were completely sterile, with an arrest of germline cell proliferation. Analyses of mtDNA copy number and cell cytology indicated that the sterile hermaphrodites had fewer mitochondria. These results indicated that par2.1 essentially functions for germline cell proliferation through mtDNA replication; we therefore termed it mtssb-1. Comprehensive transcriptional alterations including hypoxia response induction dependent on and independent of hif-1 function, occurred by RNAi depletion of mtssb-1. Treatment with ethidium bromide, which impairs mtDNA replication and transcription, caused similar transcriptional alterations. In addition, the frequency of apoptosis in the germline cells was reduced in fertile progeny with a partial RNAi effect. These suggest that RNAi depletion of C. elegans mtssb-1 is useful as a model system of mitochondrial dysfunction.

  7. Human Mitochondrial Protein Database

    National Institute of Standards and Technology Data Gateway

    SRD 131 Human Mitochondrial Protein Database (Web, free access)   The Human Mitochondrial Protein Database (HMPDb) provides comprehensive data on mitochondrial and human nuclear encoded proteins involved in mitochondrial biogenesis and function. This database consolidates information from SwissProt, LocusLink, Protein Data Bank (PDB), GenBank, Genome Database (GDB), Online Mendelian Inheritance in Man (OMIM), Human Mitochondrial Genome Database (mtDB), MITOMAP, Neuromuscular Disease Center and Human 2-D PAGE Databases. This database is intended as a tool not only to aid in studying the mitochondrion but in studying the associated diseases.

  8. Assessment of the structural and functional impact of in-frame mutations of the DMD gene, using the tools included in the eDystrophin online database

    PubMed Central

    2012-01-01

    Background Dystrophin is a large essential protein of skeletal and heart muscle. It is a filamentous scaffolding protein with numerous binding domains. Mutations in the DMD gene, which encodes dystrophin, mostly result in the deletion of one or several exons and cause Duchenne (DMD) and Becker (BMD) muscular dystrophies. The most common DMD mutations are frameshift mutations resulting in an absence of dystrophin from tissues. In-frame DMD mutations are less frequent and result in a protein with partial wild-type dystrophin function. The aim of this study was to highlight structural and functional modifications of dystrophin caused by in-frame mutations. Methods and results We developed a dedicated database for dystrophin, the eDystrophin database. It contains 209 different non frame-shifting mutations found in 945 patients from a French cohort and previous studies. Bioinformatics tools provide models of the three-dimensional structure of the protein at deletion sites, making it possible to determine whether the mutated protein retains the typical filamentous structure of dystrophin. An analysis of the structure of mutated dystrophin molecules showed that hybrid repeats were reconstituted at the deletion site in some cases. These hybrid repeats harbored the typical triple coiled-coil structure of native repeats, which may be correlated with better function in muscle cells. Conclusion This new database focuses on the dystrophin protein and its modification due to in-frame deletions in BMD patients. The observation of hybrid repeat reconstitution in some cases provides insight into phenotype-genotype correlations in dystrophin diseases and possible strategies for gene therapy. The eDystrophin database is freely available: http://edystrophin.genouest.org/. PMID:22776072

  9. Leading-edge forensic DNA analyses and the necessity of including crime scene investigators, police officers and technicians in a DNA elimination database.

    PubMed

    Lapointe, Martine; Rogic, Anita; Bourgoin, Sarah; Jolicoeur, Christine; Séguin, Diane

    2015-11-01

    In recent years, sophisticated technology has significantly increased the sensitivity and analytical power of genetic analyses so that very little starting material may now produce viable genetic profiles. This sensitivity however, has also increased the risk of detecting unknown genetic profiles assumed to be that of the perpetrator, yet originate from extraneous sources such as from crime scene workers. These contaminants may mislead investigations, keeping criminal cases active and unresolved for long spans of time. Voluntary submission of DNA samples from crime scene workers is fairly low, therefore we have created a promotional method for our staff elimination database that has resulted in a significant increase in voluntary samples since 2011. Our database enforces privacy safeguards and allows for optional anonymity to all staff members. We also offer information sessions at various police precincts to advise crime scene workers of the importance and success of our staff elimination database. This study, a pioneer in its field, has obtained 327 voluntary submissions from crime scene workers to date, of which 46 individual profiles (14%) have been matched to 58 criminal cases. By implementing our methods and respect for individual privacy, forensic laboratories everywhere may see similar growth and success in explaining unidentified genetic profiles in stagnate criminal cases.

  10. Genome databases

    SciTech Connect

    Courteau, J.

    1991-10-11

    Since the Genome Project began several years ago, a plethora of databases have been developed or are in the works. They range from the massive Genome Data Base at Johns Hopkins University, the central repository of all gene mapping information, to small databases focusing on single chromosomes or organisms. Some are publicly available, others are essentially private electronic lab notebooks. Still others limit access to a consortium of researchers working on, say, a single human chromosome. An increasing number incorporate sophisticated search and analytical software, while others operate as little more than data lists. In consultation with numerous experts in the field, a list has been compiled of some key genome-related databases. The list was not limited to map and sequence databases but also included the tools investigators use to interpret and elucidate genetic data, such as protein sequence and protein structure databases. Because a major goal of the Genome Project is to map and sequence the genomes of several experimental animals, including E. coli, yeast, fruit fly, nematode, and mouse, the available databases for those organisms are listed as well. The author also includes several databases that are still under development - including some ambitious efforts that go beyond data compilation to create what are being called electronic research communities, enabling many users, rather than just one or a few curators, to add or edit the data and tag it as raw or confirmed.

  11. Aviation Safety Issues Database

    NASA Technical Reports Server (NTRS)

    Morello, Samuel A.; Ricks, Wendell R.

    2009-01-01

    The aviation safety issues database was instrumental in the refinement and substantiation of the National Aviation Safety Strategic Plan (NASSP). The issues database is a comprehensive set of issues from an extremely broad base of aviation functions, personnel, and vehicle categories, both nationally and internationally. Several aviation safety stakeholders such as the Commercial Aviation Safety Team (CAST) have already used the database. This broader interest was the genesis to making the database publically accessible and writing this report.

  12. Overlap in Bibliographic Databases.

    ERIC Educational Resources Information Center

    Hood, William W.; Wilson, Concepcion S.

    2003-01-01

    Examines the topic of Fuzzy Set Theory to determine the overlap of coverage in bibliographic databases. Highlights include examples of comparisons of database coverage; frequency distribution of the degree of overlap; records with maximum overlap; records unique to one database; intra-database duplicates; and overlap in the top ten databases.…

  13. Database Marketplace 2002: The Database Universe.

    ERIC Educational Resources Information Center

    Tenopir, Carol; Baker, Gayle; Robinson, William

    2002-01-01

    Reviews the database industry over the past year, including new companies and services, company closures, popular database formats, popular access methods, and changes in existing products and services. Lists 33 firms and their database services; 33 firms and their database products; and 61 company profiles. (LRW)

  14. gEVE: a genome-based endogenous viral element database provides comprehensive viral protein-coding sequences in mammalian genomes

    PubMed Central

    Nakagawa, So; Takahashi, Mahoko Ueda

    2016-01-01

    In mammals, approximately 10% of genome sequences correspond to endogenous viral elements (EVEs), which are derived from ancient viral infections of germ cells. Although most EVEs have been inactivated, some open reading frames (ORFs) of EVEs obtained functions in the hosts. However, EVE ORFs usually remain unannotated in the genomes, and no databases are available for EVE ORFs. To investigate the function and evolution of EVEs in mammalian genomes, we developed EVE ORF databases for 20 genomes of 19 mammalian species. A total of 736,771 non-overlapping EVE ORFs were identified and archived in a database named gEVE (http://geve.med.u-tokai.ac.jp). The gEVE database provides nucleotide and amino acid sequences, genomic loci and functional annotations of EVE ORFs for all 20 genomes. In analyzing RNA-seq data with the gEVE database, we successfully identified the expressed EVE genes, suggesting that the gEVE database facilitates studies of the genomic analyses of various mammalian species. Database URL: http://geve.med.u-tokai.ac.jp PMID:27242033

  15. gEVE: a genome-based endogenous viral element database provides comprehensive viral protein-coding sequences in mammalian genomes.

    PubMed

    Nakagawa, So; Takahashi, Mahoko Ueda

    2016-01-01

    In mammals, approximately 10% of genome sequences correspond to endogenous viral elements (EVEs), which are derived from ancient viral infections of germ cells. Although most EVEs have been inactivated, some open reading frames (ORFs) of EVEs obtained functions in the hosts. However, EVE ORFs usually remain unannotated in the genomes, and no databases are available for EVE ORFs. To investigate the function and evolution of EVEs in mammalian genomes, we developed EVE ORF databases for 20 genomes of 19 mammalian species. A total of 736,771 non-overlapping EVE ORFs were identified and archived in a database named gEVE (http://geve.med.u-tokai.ac.jp). The gEVE database provides nucleotide and amino acid sequences, genomic loci and functional annotations of EVE ORFs for all 20 genomes. In analyzing RNA-seq data with the gEVE database, we successfully identified the expressed EVE genes, suggesting that the gEVE database facilitates studies of the genomic analyses of various mammalian species.Database URL: http://geve.med.u-tokai.ac.jp.

  16. ADMET evaluation in drug discovery. 11. PharmacoKinetics Knowledge Base (PKKB): a comprehensive database of pharmacokinetic and toxic properties for drugs.

    PubMed

    Cao, Dongyue; Wang, Junmei; Zhou, Rui; Li, Youyong; Yu, Huidong; Hou, Tingjun

    2012-05-25

    Good and extensive experimental ADMET (absorption, distribution, metabolism, excretion, and toxicity) data is critical for developing reliable in silico ADMET models. Here we develop a PharmacoKinetics Knowledge Base (PKKB) to compile comprehensive information about ADMET properties into a single electronic repository. We incorporate more than 10 000 experimental ADMET measurements of 1685 drugs into the PKKB. The ADMET properties in the PKKB include octanol/water partition coefficient, solubility, dissociation constant, intestinal absorption, Caco-2 permeability, human bioavailability, plasma protein binding, blood-plasma partitioning ratio, volume of distribution, metabolism, half-life, excretion, urinary excretion, clearance, toxicity, half lethal dose in rat or mouse, etc. The PKKB provides the most extensive collection of freely available data for ADMET properties up to date. All these ADMET properties, as well as the pharmacological information and the calculated physiochemical properties are integrated into a web-based information system. Eleven separated data sets for octanol/water partition coefficient, solubility, blood-brain partitioning, intestinal absorption, Caco-2 permeability, human oral bioavailability, and P-glycoprotein inhibitors have been provided for free download and can be used directly for ADMET modeling. The PKKB is available online at http://cadd.suda.edu.cn/admet.

  17. The androgen receptor gene mutations database.

    PubMed

    Patterson, M N; Hughes, I A; Gottlieb, B; Pinsky, L

    1994-09-01

    The androgen receptor gene mutations database is a comprehensive listing of mutations published in journals and meetings proceedings. The majority of mutations are point mutations identified in patients with androgen insensitivity syndrome. Information is included regarding the phenotype, the nature and location of the mutations, as well as the effects of the mutations on the androgen binding activity of the receptor. The current version of the database contains 149 entries, of which 114 are unique mutations. The database is available from EMBL (NetServ@EMBL-Heidelberg.DE) or as a Macintosh Filemaker file (mc33001@musica.mcgill.ca).

  18. Tautomerism in large databases

    PubMed Central

    Sitzmann, Markus; Ihlenfeldt, Wolf-Dietrich

    2010-01-01

    We have used the Chemical Structure DataBase (CSDB) of the NCI CADD Group, an aggregated collection of over 150 small-molecule databases totaling 103.5 million structure records, to conduct tautomerism analyses on one of the largest currently existing sets of real (i.e. not computer-generated) compounds. This analysis was carried out using calculable chemical structure identifiers developed by the NCI CADD Group, based on hash codes available in the chemoinformatics toolkit CACTVS and a newly developed scoring scheme to define a canonical tautomer for any encountered structure. CACTVS’s tautomerism definition, a set of 21 transform rules expressed in SMIRKS line notation, was used, which takes a comprehensive stance as to the possible types of tautomeric interconversion included. Tautomerism was found to be possible for more than 2/3 of the unique structures in the CSDB. A total of 680 million tautomers were calculated from, and including, the original structure records. Tautomerism overlap within the same individual database (i.e. at least one other entry was present that was really only a different tautomeric representation of the same compound) was found at an average rate of 0.3% of the original structure records, with values as high as nearly 2% for some of the databases in CSDB. Projected onto the set of unique structures (by FICuS identifier), this still occurred in about 1.5% of the cases. Tautomeric overlap across all constituent databases in CSDB was found for nearly 10% of the records in the collection. PMID:20512400

  19. Solubility Database

    National Institute of Standards and Technology Data Gateway

    SRD 106 IUPAC-NIST Solubility Database (Web, free access)   These solubilities are compiled from 18 volumes (Click here for List) of the International Union for Pure and Applied Chemistry(IUPAC)-NIST Solubility Data Series. The database includes liquid-liquid, solid-liquid, and gas-liquid systems. Typical solvents and solutes include water, seawater, heavy water, inorganic compounds, and a variety of organic compounds such as hydrocarbons, halogenated hydrocarbons, alcohols, acids, esters and nitrogen compounds. There are over 67,500 solubility measurements and over 1800 references.

  20. Databases: Beyond the Basics.

    ERIC Educational Resources Information Center

    Whittaker, Robert

    This presented paper offers an elementary description of database characteristics and then provides a survey of databases that may be useful to the teacher and researcher in Slavic and East European languages and literatures. The survey focuses on commercial databases that are available, usable, and needed. Individual databases discussed include:…

  1. ECMDB: the E. coli Metabolome Database.

    PubMed

    Guo, An Chi; Jewison, Timothy; Wilson, Michael; Liu, Yifeng; Knox, Craig; Djoumbou, Yannick; Lo, Patrick; Mandal, Rupasri; Krishnamurthy, Ram; Wishart, David S

    2013-01-01

    The Escherichia coli Metabolome Database (ECMDB, http://www.ecmdb.ca) is a comprehensively annotated metabolomic database containing detailed information about the metabolome of E. coli (K-12). Modelled closely on the Human and Yeast Metabolome Databases, the ECMDB contains >2600 metabolites with links to ∼1500 different genes and proteins, including enzymes and transporters. The information in the ECMDB has been collected from dozens of textbooks, journal articles and electronic databases. Each metabolite entry in the ECMDB contains an average of 75 separate data fields, including comprehensive compound descriptions, names and synonyms, chemical taxonomy, compound structural and physicochemical data, bacterial growth conditions and substrates, reactions, pathway information, enzyme data, gene/protein sequence data and numerous hyperlinks to images, references and other public databases. The ECMDB also includes an extensive collection of intracellular metabolite concentration data compiled from our own work as well as other published metabolomic studies. This information is further supplemented with thousands of fully assigned reference nuclear magnetic resonance and mass spectrometry spectra obtained from pure E. coli metabolites that we (and others) have collected. Extensive searching, relational querying and data browsing tools are also provided that support text, chemical structure, spectral, molecular weight and gene/protein sequence queries. Because of E. coli's importance as a model organism for biologists and as a biofactory for industry, we believe this kind of database could have considerable appeal not only to metabolomics researchers but also to molecular biologists, systems biologists and individuals in the biotechnology industry.

  2. HMDB: the Human Metabolome Database

    PubMed Central

    Wishart, David S.; Tzur, Dan; Knox, Craig; Eisner, Roman; Guo, An Chi; Young, Nelson; Cheng, Dean; Jewell, Kevin; Arndt, David; Sawhney, Summit; Fung, Chris; Nikolai, Lisa; Lewis, Mike; Coutouly, Marie-Aude; Forsythe, Ian; Tang, Peter; Shrivastava, Savita; Jeroncic, Kevin; Stothard, Paul; Amegbey, Godwin; Block, David; Hau, David. D.; Wagner, James; Miniaci, Jessica; Clements, Melisa; Gebremedhin, Mulu; Guo, Natalie; Zhang, Ying; Duggan, Gavin E.; MacInnis, Glen D.; Weljie, Alim M.; Dowlatabadi, Reza; Bamforth, Fiona; Clive, Derrick; Greiner, Russ; Li, Liang; Marrie, Tom; Sykes, Brian D.; Vogel, Hans J.; Querengesser, Lori

    2007-01-01

    The Human Metabolome Database (HMDB) is currently the most complete and comprehensive curated collection of human metabolite and human metabolism data in the world. It contains records for more than 2180 endogenous metabolites with information gathered from thousands of books, journal articles and electronic databases. In addition to its comprehensive literature-derived data, the HMDB also contains an extensive collection of experimental metabolite concentration data compiled from hundreds of mass spectra (MS) and Nuclear Magnetic resonance (NMR) metabolomic analyses performed on urine, blood and cerebrospinal fluid samples. This is further supplemented with thousands of NMR and MS spectra collected on purified, reference metabolites. Each metabolite entry in the HMDB contains an average of 90 separate data fields including a comprehensive compound description, names and synonyms, structural information, physico-chemical data, reference NMR and MS spectra, biofluid concentrations, disease associations, pathway information, enzyme data, gene sequence data, SNP and mutation data as well as extensive links to images, references and other public databases. Extensive searching, relational querying and data browsing tools are also provided. The HMDB is designed to address the broad needs of biochemists, clinical chemists, physicians, medical geneticists, nutritionists and members of the metabolomics community. The HMDB is available at: PMID:17202168

  3. Essential proteins and possible therapeutic targets of Wolbachia endosymbiont and development of FiloBase-a comprehensive drug target database for Lymphatic filariasis

    PubMed Central

    Sharma, Om Prakash; Kumar, Muthuvel Suresh

    2016-01-01

    Lymphatic filariasis (Lf) is one of the oldest and most debilitating tropical diseases. Millions of people are suffering from this prevalent disease. It is estimated to infect over 120 million people in at least 80 nations of the world through the tropical and subtropical regions. More than one billion people are in danger of getting affected with this life-threatening disease. Several studies were suggested its emerging limitations and resistance towards the available drugs and therapeutic targets for Lf. Therefore, better medicine and drug targets are in demand. We took an initiative to identify the essential proteins of Wolbachia endosymbiont of Brugia malayi, which are indispensable for their survival and non-homologous to human host proteins. In this current study, we have used proteome subtractive approach to screen the possible therapeutic targets for wBm. In addition, numerous literatures were mined in the hunt for potential drug targets, drugs, epitopes, crystal structures, and expressed sequence tag (EST) sequences for filarial causing nematodes. Data obtained from our study were presented in a user friendly database named FiloBase. We hope that information stored in this database may be used for further research and drug development process against filariasis. URL: http://filobase.bicpu.edu.in. PMID:26806463

  4. Comprehensibility maximization and humanly comprehensible representations

    NASA Astrophysics Data System (ADS)

    Kamimura, Ryotaro

    2012-04-01

    In this paper, we propose a new information-theoretic method to measure the comprehensibility of network configurations in competitive learning. Comprehensibility is supposed to be measured by information contained in components in competitive networks. Thus, the increase in information corresponds to the increase in comprehensibility of network configurations. One of the most important characteristics of the method is that parameters can be explicitly determined so as to produce a state where the different types of comprehensibility can be mutually increased. We applied the method to two problems, namely an artificial data set and the ionosphere data from the well-known machine learning database. In both problems, we showed that improved performance could be obtained in terms of all types of comprehensibility and quantization errors. For the topographic errors, we found that updating connection weights prevented them from increasing. Then, the optimal values of comprehensibility could be explicitly determined, and clearer class boundaries were generated.

  5. Trace elements in sediments, blue spotted tilapia Oreochromis leucostictus (Trewavas, 1933) and its parasite Contracaecum multipapillatum from Lake Naivasha, Kenya, including a comprehensive health risk analysis.

    PubMed

    Otachi, Elick O; Körner, Wilfried; Avenant-Oldewage, Annemariè; Fellner-Frank, Christine; Jirsa, Franz

    2014-06-01

    This study presents the distribution of 15 major and trace elements in sediments and fish and their pericardial parasites from Lake Naivasha, Kenya. The lake is one of the few freshwater lakes in the Great Rift Valley and is under strong anthropogenic pressure mainly due to agricultural activities. Its fish provide a valuable protein source for approximately 100,000 people in the area. Fish and their parasites have been acknowledged as indicators of environmental quality due to their accumulation potential for both essential and nonessential trace elements. A total of 34 specimens of the blue spotted tilapia Oreochromis leucostictus and pooled samples of their pericardial parasite, the anisakid nematode Contracaecum multipapillatum (larvae 3), were examined. Element concentrations were determined by inductively coupled plasma-optical emission spectroscopy (ICP-OES) and graphite furnace atomic absorption spectrometry (GF-AAS). The concentrations of elements in the sediments reflected the geology of the area and did not point to pollution: none of the investigated trace elements, including Pb, Cd, Cu, and Zn, showed elevated values. In contrast, concentrations in the fish muscle were elevated for Li, Sr, Cd, and Zn, with high target hazard quotients (THQ > 0.1) indicating a potential health risk to the consumers of this fish. Fish liver showed significantly higher concentrations of the trace elements Fe, Mn, Cd, and Cu compared to the muscle and C. multipapillatum. In the parasite, Zn had the highest concentration, but the worms only minimally accumulated trace elements in relation to their fish host. PMID:24573461

  6. Comprehensive Genomic Analyses of the OM43 Clade, Including a Novel Species from the Red Sea, Indicate Ecotype Differentiation among Marine Methylotrophs

    PubMed Central

    Jimenez-Infante, Francy; Ngugi, David Kamanda; Vinu, Manikandan; Alam, Intikhab; Kamau, Allan Anthony; Blom, Jochen; Bajic, Vladimir B.

    2015-01-01

    The OM43 clade within the family Methylophilaceae of Betaproteobacteria represents a group of methylotrophs that play important roles in the metabolism of C1 compounds in marine environments and other aquatic environments around the globe. Using dilution-to-extinction cultivation techniques, we successfully isolated a novel species of this clade (here designated MBRS-H7) from the ultraoligotrophic open ocean waters of the central Red Sea. Phylogenomic analyses indicate that MBRS-H7 is a novel species that forms a distinct cluster together with isolate KB13 from Hawaii (Hawaii-Red Sea [H-RS] cluster) that is separate from the cluster represented by strain HTCC2181 (from the Oregon coast). Phylogenetic analyses using the robust 16S-23S internal transcribed spacer revealed a potential ecotype separation of the marine OM43 clade members, which was further confirmed by metagenomic fragment recruitment analyses that showed trends of higher abundance in low-chlorophyll and/or high-temperature provinces for the H-RS cluster but a preference for colder, highly productive waters for the HTCC2181 cluster. This potential environmentally driven niche differentiation is also reflected in the metabolic gene inventories, which in the case of the H-RS cluster include those conferring resistance to high levels of UV irradiation, temperature, and salinity. Interestingly, we also found different energy conservation modules between these OM43 subclades, namely, the existence of the NADH:quinone oxidoreductase complex I (NUO) system in the H-RS cluster and the nonhomologous NADH:quinone oxidoreductase (NQR) system in the HTCC2181 cluster, which might have implications for their overall energetic yields. PMID:26655752

  7. A comprehensive DNA barcode database for Central European beetles with a focus on Germany: adding more than 3500 identified species to BOLD.

    PubMed

    Hendrich, Lars; Morinière, Jérôme; Haszprunar, Gerhard; Hebert, Paul D N; Hausmann, Axel; Köhler, Frank; Balke, Michael

    2015-07-01

    Beetles are the most diverse group of animals and are crucial for ecosystem functioning. In many countries, they are well established for environmental impact assessment, but even in the well-studied Central European fauna, species identification can be very difficult. A comprehensive and taxonomically well-curated DNA barcode library could remedy this deficit and could also link hundreds of years of traditional knowledge with next generation sequencing technology. However, such a beetle library is missing to date. This study provides the globally largest DNA barcode reference library for Coleoptera for 15 948 individuals belonging to 3514 well-identified species (53% of the German fauna) with representatives from 97 of 103 families (94%). This study is the first comprehensive regional test of the efficiency of DNA barcoding for beetles with a focus on Germany. Sequences ≥500 bp were recovered from 63% of the specimens analysed (15 948 of 25 294) with short sequences from another 997 specimens. Whereas most specimens (92.2%) could be unambiguously assigned to a single known species by sequence diversity at CO1, 1089 specimens (6.8%) were assigned to more than one Barcode Index Number (BIN), creating 395 BINs which need further study to ascertain if they represent cryptic species, mitochondrial introgression, or simply regional variation in widespread species. We found 409 specimens (2.6%) that shared a BIN assignment with another species, most involving a pair of closely allied species as 43 BINs were involved. Most of these taxa were separated by barcodes although sequence divergences were low. Only 155 specimens (0.97%) show identical or overlapping clusters.

  8. Databases and in silico tools for vaccine design.

    PubMed

    He, Yongqun; Xiang, Zuoshuang

    2013-01-01

    In vaccine design, databases and in silico tools play different but complementary roles. Databases collect experimentally verified vaccines and vaccine components, and in silico tools provide computational methods to predict and design new vaccines and vaccine components. Vaccine-related databases include databases of vaccines and vaccine components. In the USA, the Food and Drug Administration (FDA) maintains a database of licensed human vaccines, and the US Department of Agriculture keeps a database of licensed animal vaccines. Databases of vaccine clinical trials and vaccines in research also exist. The important vaccine components include vaccine antigens, vaccine adjuvants, vaccine vectors, and -vaccine preservatives. The vaccine antigens can be whole proteins or immune epitopes. Various in silico vaccine design tools are also available. The Vaccine Investigation and Online Information Network (VIOLIN; http://www.violinet.org ) is a comprehensive vaccine database and analysis system. The VIOLIN database includes various types of vaccines and vaccine components. VIOLIN also includes Vaxign, a Web-based in silico vaccine design program based on the reverse vaccinology strategy. Vaccine information and resources can be integrated with Vaccine Ontology (VO). This chapter introduces databases and in silico tools that facilitate vaccine design, especially those in the VIOLIN system.

  9. Building a Comprehensive Mill-Level Database for the Industrial Sectors Integrated Solutions (ISIS) Model of the U.S. Pulp and Paper Sector

    PubMed Central

    Modak, Nabanita; Spence, Kelley; Sood, Saloni; Rosati, Jacky Ann

    2015-01-01

    Air emissions from the U.S. pulp and paper sector have been federally regulated since 1978; however, regulations are periodically reviewed and revised to improve efficiency and effectiveness of existing emission standards. The Industrial Sectors Integrated Solutions (ISIS) model for the pulp and paper sector is currently under development at the U.S. Environmental Protection Agency (EPA), and can be utilized to facilitate multi-pollutant, sector-based analyses that are performed in conjunction with regulatory development. The model utilizes a multi-sector, multi-product dynamic linear modeling framework that evaluates the economic impact of emission reduction strategies for multiple air pollutants. The ISIS model considers facility-level economic, environmental, and technical parameters, as well as sector-level market data, to estimate the impacts of environmental regulations on the pulp and paper industry. Specifically, the model can be used to estimate U.S. and global market impacts of new or more stringent air regulations, such as impacts on product price, exports and imports, market demands, capital investment, and mill closures. One major challenge to developing a representative model is the need for an extensive amount of data. This article discusses the collection and processing of data for use in the model, as well as the methods used for building the ISIS pulp and paper database that facilitates the required analyses to support the air quality management of the pulp and paper sector. PMID:25806516

  10. Building a comprehensive mill-level database for the Industrial Sectors Integrated Solutions (ISIS) model of the U.S. pulp and paper sector.

    PubMed

    Modak, Nabanita; Spence, Kelley; Sood, Saloni; Rosati, Jacky Ann

    2015-01-01

    Air emissions from the U.S. pulp and paper sector have been federally regulated since 1978; however, regulations are periodically reviewed and revised to improve efficiency and effectiveness of existing emission standards. The Industrial Sectors Integrated Solutions (ISIS) model for the pulp and paper sector is currently under development at the U.S. Environmental Protection Agency (EPA), and can be utilized to facilitate multi-pollutant, sector-based analyses that are performed in conjunction with regulatory development. The model utilizes a multi-sector, multi-product dynamic linear modeling framework that evaluates the economic impact of emission reduction strategies for multiple air pollutants. The ISIS model considers facility-level economic, environmental, and technical parameters, as well as sector-level market data, to estimate the impacts of environmental regulations on the pulp and paper industry. Specifically, the model can be used to estimate U.S. and global market impacts of new or more stringent air regulations, such as impacts on product price, exports and imports, market demands, capital investment, and mill closures. One major challenge to developing a representative model is the need for an extensive amount of data. This article discusses the collection and processing of data for use in the model, as well as the methods used for building the ISIS pulp and paper database that facilitates the required analyses to support the air quality management of the pulp and paper sector.

  11. Qualitative Alterations of Bacterial Metabolome after Exposure to Metal Nanoparticles with Bactericidal Properties: A Comprehensive Workflow Based on (1)H NMR, UHPLC-HRMS, and Metabolic Databases.

    PubMed

    Chatzimitakos, Theodoros G; Stalikas, Constantine D

    2016-09-01

    Metal nanoparticles (NPs) have proven to be more toxic than bulk analogues of the same chemical composition due to their unique physical properties. The NPs, lately, have drawn the attention of researchers because of their antibacterial and biocidal properties. In an effort to shed light on the mechanism through which the bacteria elimination is achieved and the metabolic changes they undergo, an untargeted metabolomic fingerprint study was carried out on Gram-positive (Staphylococcus aureus) and Gram-negative (Escherichia coli) bacteria species. The (1)H NMR spectroscopy, in conjunction with high resolution mass-spectrometry (HRMS) and an unsophisticated data processing workflow were implemented. The combined NMR/HRMS data, supported by an open-access metabolomic database, proved to be efficacious in the process of assigning a putative annotation to a wide range of metabolite signals and is a useful tool to appraise the metabolome alterations, as a consequence of bacterial response to NPs. Interestingly, not all the NPs diminished the intracellular metabolites; bacteria treated with iron NPs produced metabolites not present in the nonexposed bacteria sample, implying the activation of previously inactive metabolic pathways. In contrast, copper and iron-copper NPs reduced the annotated metabolites, alluding to the conclusion that the metabolic pathways (mainly alanine, aspartate, and glutamate metabolism, beta-alanine metabolism, glutathione metabolism, and arginine and proline metabolism) were hindered by the interactions of NPs with the intracellular metabolites. PMID:27432757

  12. The Transporter Classification Database (TCDB): recent advances

    PubMed Central

    Saier, Milton H.; Reddy, Vamsee S.; Tsu, Brian V.; Ahmed, Muhammad Saad; Li, Chun; Moreno-Hagelsieb, Gabriel

    2016-01-01

    The Transporter Classification Database (TCDB; http://www.tcdb.org) is a freely accessible reference database for transport protein research, which provides structural, functional, mechanistic, evolutionary and disease/medical information about transporters from organisms of all types. TCDB is the only transport protein classification database adopted by the International Union of Biochemistry and Molecular Biology (IUBMB). It consists of more than 10 000 non-redundant transport systems with more than 11 000 reference citations, classified into over 1000 transporter families. Transporters in TCDB can be single or multi-component systems, categorized in a functional/phylogenetic hierarchical system of classes, subclasses, families, subfamilies and transport systems. TCDB also includes updated software designed to analyze the distinctive features of transport proteins, extending its usefulness. Here we present a comprehensive update of the database contents and features and summarize recent discoveries recorded in TCDB. PMID:26546518

  13. Scopus database: a review

    PubMed Central

    Burnham, Judy F

    2006-01-01

    The Scopus database provides access to STM journal articles and the references included in those articles, allowing the searcher to search both forward and backward in time. The database can be used for collection development as well as for research. This review provides information on the key points of the database and compares it to Web of Science. Neither database is inclusive, but complements each other. If a library can only afford one, choice must be based in institutional needs. PMID:16522216

  14. Scopus database: a review.

    PubMed

    Burnham, Judy F

    2006-03-08

    The Scopus database provides access to STM journal articles and the references included in those articles, allowing the searcher to search both forward and backward in time. The database can be used for collection development as well as for research. This review provides information on the key points of the database and compares it to Web of Science. Neither database is inclusive, but complements each other. If a library can only afford one, choice must be based in institutional needs.

  15. Aero/fluids database system

    NASA Technical Reports Server (NTRS)

    Reardon, John E.; Violett, Duane L., Jr.

    1991-01-01

    The AFAS Database System was developed to provide the basic structure of a comprehensive database system for the Marshall Space Flight Center (MSFC) Structures and Dynamics Laboratory Aerophysics Division. The system is intended to handle all of the Aerophysics Division Test Facilities as well as data from other sources. The system was written for the DEC VAX family of computers in FORTRAN-77 and utilizes the VMS indexed file system and screen management routines. Various aspects of the system are covered, including a description of the user interface, lists of all code structure elements, descriptions of the file structures, a description of the security system operation, a detailed description of the data retrieval tasks, a description of the session log, and a description of the archival system.

  16. Comprehension of Connected Discourse.

    ERIC Educational Resources Information Center

    Mosberg, Ludwig; Shima, Fred

    A rationale was developed for researching reading comprehension based on information gain. Previous definitions of comprehension which were reviewed included operational vs. nonoperational and skills vs. processes. Comprehension was viewed as an informational processing event which includes a constellation of cognitive and learning processes. Two…

  17. A snapshot in time - the new TRUplanner database

    SciTech Connect

    Estill, Wesley Gordon; Crawford, Beverly; Van Soest, Greg

    2010-01-01

    A new database was developed to augment transuranic (TRU) waste inventory information already captured within the Comprehensive Inventory Database (CID). This new database, known as the TRUplanner, includes information focused on site capability for TRU waste certification and availability to ship to either the Waste Isolation Pilot Plant (WIPP) or Idaho National Laboratory for final characterization and certification before coming to WIPP. Waste stream specific data is imported from the CID to the TRUplanner, and additional data is collected and entered as a result of interviews and site visits. This data is continuously updated as sites are visited.

  18. AmphibiaChina: an online database of Chinese Amphibians.

    PubMed

    Che, Jing; Wang, Kai

    2016-01-18

    AmphibiaChina, an open-access, web-based database, is designed to provide comprehensive and up-to-date information on Chinese amphibians. It offers an integrated module with six major sections. Compared to other known databases including AmphibiaWeb and Amphibian Species of the World, AmphibiaChina has the following new functions: (1) online species identification based on DNA barcode sequences; (2) comparisons and discussions of different major taxonomic systems; and (3) phylogenetic progress on Chinese amphibians. This database offers a window for the world to access available information of Chinese amphibians. AmphibiaChina with its Chinese version can be accessed at http://www.amphibiachina.org.

  19. A Chronostratigraphic Relational Database Ontology

    NASA Astrophysics Data System (ADS)

    Platon, E.; Gary, A.; Sikora, P.

    2005-12-01

    A chronostratigraphic research database was donated by British Petroleum to the Stratigraphy Group at the Energy and Geoscience Institute (EGI), University of Utah. These data consists of over 2,000 measured sections representing over three decades of research into the application of the graphic correlation method. The data are global and includes both microfossil (foraminifera, calcareous nannoplankton, spores, pollen, dinoflagellate cysts, etc) and macrofossil data. The objective of the donation was to make the research data available to the public in order to encourage additional chronostratigraphy studies, specifically regarding graphic correlation. As part of the National Science Foundation's Cyberinfrastructure for the Geosciences (GEON) initiative these data have been made available to the public at http://css.egi.utah.edu. To encourage further research using the graphic correlation method, EGI has developed a software package, StrataPlot that will soon be publicly available from the GEON website as a standalone software download. The EGI chronostratigraphy research database, although relatively large, has many data holes relative to some paleontological disciplines and geographical areas, so the challenge becomes how do we expand the data available for chronostratigrahic studies using graphic correlation. There are several public or soon-to-be public databases available to chronostratigraphic research, but they have their own data structures and modes of presentation. The heterogeneous nature of these database schemas hinders their integration and makes it difficult for the user to retrieve and consolidate potentially valuable chronostratigraphic data. The integration of these data sources would facilitate rapid and comprehensive data searches, thus helping advance studies in chronostratigraphy. The GEON project will host a number of databases within the geology domain, some of which contain biostratigraphic data. Ontologies are being developed to provide

  20. Inorganic Crystal Structure Database (ICSD)

    National Institute of Standards and Technology Data Gateway

    SRD 84 FIZ/NIST Inorganic Crystal Structure Database (ICSD) (PC database for purchase)   The Inorganic Crystal Structure Database (ICSD) is produced cooperatively by the Fachinformationszentrum Karlsruhe(FIZ) and the National Institute of Standards and Technology (NIST). The ICSD is a comprehensive collection of crystal structure data of inorganic compounds containing more than 140,000 entries and covering the literature from 1915 to the present.

  1. Central Asia Active Fault Database

    NASA Astrophysics Data System (ADS)

    Mohadjer, Solmaz; Ehlers, Todd A.; Kakar, Najibullah

    2014-05-01

    The ongoing collision of the Indian subcontinent with Asia controls active tectonics and seismicity in Central Asia. This motion is accommodated by faults that have historically caused devastating earthquakes and continue to pose serious threats to the population at risk. Despite international and regional efforts to assess seismic hazards in Central Asia, little attention has been given to development of a comprehensive database for active faults in the region. To address this issue and to better understand the distribution and level of seismic hazard in Central Asia, we are developing a publically available database for active faults of Central Asia (including but not limited to Afghanistan, Tajikistan, Kyrgyzstan, northern Pakistan and western China) using ArcGIS. The database is designed to allow users to store, map and query important fault parameters such as fault location, displacement history, rate of movement, and other data relevant to seismic hazard studies including fault trench locations, geochronology constraints, and seismic studies. Data sources integrated into the database include previously published maps and scientific investigations as well as strain rate measurements and historic and recent seismicity. In addition, high resolution Quickbird, Spot, and Aster imagery are used for selected features to locate and measure offset of landforms associated with Quaternary faulting. These features are individually digitized and linked to attribute tables that provide a description for each feature. Preliminary observations include inconsistent and sometimes inaccurate information for faults documented in different studies. For example, the Darvaz-Karakul fault which roughly defines the western margin of the Pamir, has been mapped with differences in location of up to 12 kilometers. The sense of motion for this fault ranges from unknown to thrust and strike-slip in three different studies despite documented left-lateral displacements of Holocene and late

  2. The NCBI Taxonomy database.

    PubMed

    Federhen, Scott

    2012-01-01

    The NCBI Taxonomy database (http://www.ncbi.nlm.nih.gov/taxonomy) is the standard nomenclature and classification repository for the International Nucleotide Sequence Database Collaboration (INSDC), comprising the GenBank, ENA (EMBL) and DDBJ databases. It includes organism names and taxonomic lineages for each of the sequences represented in the INSDC's nucleotide and protein sequence databases. The taxonomy database is manually curated by a small group of scientists at the NCBI who use the current taxonomic literature to maintain a phylogenetic taxonomy for the source organisms represented in the sequence databases. The taxonomy database is a central organizing hub for many of the resources at the NCBI, and provides a means for clustering elements within other domains of NCBI web site, for internal linking between domains of the Entrez system and for linking out to taxon-specific external resources on the web. Our primary purpose is to index the domain of sequences as conveniently as possible for our user community.

  3. Common hyperspectral image database design

    NASA Astrophysics Data System (ADS)

    Tian, Lixun; Liao, Ningfang; Chai, Ali

    2009-11-01

    This paper is to introduce Common hyperspectral image database with a demand-oriented Database design method (CHIDB), which comprehensively set ground-based spectra, standardized hyperspectral cube, spectral analysis together to meet some applications. The paper presents an integrated approach to retrieving spectral and spatial patterns from remotely sensed imagery using state-of-the-art data mining and advanced database technologies, some data mining ideas and functions were associated into CHIDB to make it more suitable to serve in agriculture, geological and environmental areas. A broad range of data from multiple regions of the electromagnetic spectrum is supported, including ultraviolet, visible, near-infrared, thermal infrared, and fluorescence. CHIDB is based on dotnet framework and designed by MVC architecture including five main functional modules: Data importer/exporter, Image/spectrum Viewer, Data Processor, Parameter Extractor, and On-line Analyzer. The original data were all stored in SQL server2008 for efficient search, query and update, and some advance Spectral image data Processing technology are used such as Parallel processing in C#; Finally an application case is presented in agricultural disease detecting area.

  4. Biofuel Database

    National Institute of Standards and Technology Data Gateway

    Biofuel Database (Web, free access)   This database brings together structural, biological, and thermodynamic data for enzymes that are either in current use or are being considered for use in the production of biofuels.

  5. Navigating public microarray databases.

    PubMed

    Penkett, Christopher J; Bähler, Jürg

    2004-01-01

    With the ever-escalating amount of data being produced by genome-wide microarray studies, it is of increasing importance that these data are captured in public databases so that researchers can use this information to complement and enhance their own studies. Many groups have set up databases of expression data, ranging from large repositories, which are designed to comprehensively capture all published data, through to more specialized databases. The public repositories, such as ArrayExpress at the European Bioinformatics Institute contain complete datasets in raw format in addition to processed data, whilst the specialist databases tend to provide downstream analysis of normalized data from more focused studies and data sources. Here we provide a guide to the use of these public microarray resources.

  6. National Stream Survey data-base guide

    SciTech Connect

    Mitch, M.E.; Kaufmann, P.R.; Herlihy, A.T.; Overton, W.S.; Sale, M.J.

    1990-07-01

    The National Stream Survey (NSS), conducted in the spring of 1985 and 1986, is one component of the U.S. Environmental Protection Agency's National Surface Water Survey. This effort is in support of the National Acid Precipitation Assessment Program. The NSS was a synoptic, spring survey of 500 streams in regions of the Southeastern and Mid-Atlantic United States expected to contain larger numbers of low alkalinity streams. The NSS is based on a probability sample from an explicitly defined population of surface waters. In the NSS, 500 streams were sampled, representing a regional population of 64,700 stream reaches. The NSS database includes stream and watershed physical characteristics, in situ measurements, and water chemistry data. Accompanying the database is a comprehensive user's guide that provides an overview of the NSS design, database structure, and transfer media.

  7. Database Administrator

    ERIC Educational Resources Information Center

    Moore, Pam

    2010-01-01

    The Internet and electronic commerce (e-commerce) generate lots of data. Data must be stored, organized, and managed. Database administrators, or DBAs, work with database software to find ways to do this. They identify user needs, set up computer databases, and test systems. They ensure that systems perform as they should and add people to the…

  8. Geologic map and map database of northeastern San Francisco Bay region, California, [including] most of Solano County and parts of Napa, Marin, Contra Costa, San Joaquin, Sacramento, Yolo, and Sonoma Counties

    USGS Publications Warehouse

    Graymer, Russell Walter; Jones, David Lawrence; Brabb, Earl E.

    2002-01-01

    This digital map database, compiled from previously published and unpublished data, and new mapping by the authors, represents the general distribution of bedrock and surficial deposits in the mapped area. Together with the accompanying text file (nesfmf.ps, nesfmf.pdf, nesfmf.txt), it provides current information on the geologic structure and stratigraphy of the area covered. The database delineates map units that are identified by general age and lithology following the stratigraphic nomenclature of the U.S. Geological Survey. The scale of the source maps limits the spatial resolution (scale) of the database to 1:62,500 or smaller.

  9. 2010 Worldwide Gasification Database

    DOE Data Explorer

    The 2010 Worldwide Gasification Database describes the current world gasification industry and identifies near-term planned capacity additions. The database lists gasification projects and includes information (e.g., plant location, number and type of gasifiers, syngas capacity, feedstock, and products). The database reveals that the worldwide gasification capacity has continued to grow for the past several decades and is now at 70,817 megawatts thermal (MWth) of syngas output at 144 operating plants with a total of 412 gasifiers.

  10. DPTEdb, an integrative database of transposable elements in dioecious plants.

    PubMed

    Li, Shu-Fen; Zhang, Guo-Jun; Zhang, Xue-Jin; Yuan, Jin-Hong; Deng, Chuan-Liang; Gu, Lian-Feng; Gao, Wu-Jun

    2016-01-01

    Dioecious plants usually harbor 'young' sex chromosomes, providing an opportunity to study the early stages of sex chromosome evolution. Transposable elements (TEs) are mobile DNA elements frequently found in plants and are suggested to play important roles in plant sex chromosome evolution. The genomes of several dioecious plants have been sequenced, offering an opportunity to annotate and mine the TE data. However, comprehensive and unified annotation of TEs in these dioecious plants is still lacking. In this study, we constructed a dioecious plant transposable element database (DPTEdb). DPTEdb is a specific, comprehensive and unified relational database and web interface. We used a combination of de novo, structure-based and homology-based approaches to identify TEs from the genome assemblies of previously published data, as well as our own. The database currently integrates eight dioecious plant species and a total of 31 340 TEs along with classification information. DPTEdb provides user-friendly web interfaces to browse, search and download the TE sequences in the database. Users can also use tools, including BLAST, GetORF, HMMER, Cut sequence and JBrowse, to analyze TE data. Given the role of TEs in plant sex chromosome evolution, the database will contribute to the investigation of TEs in structural, functional and evolutionary dynamics of the genome of dioecious plants. In addition, the database will supplement the research of sex diversification and sex chromosome evolution of dioecious plants.Database URL: http://genedenovoweb.ticp.net:81/DPTEdb/index.php.

  11. DPTEdb, an integrative database of transposable elements in dioecious plants.

    PubMed

    Li, Shu-Fen; Zhang, Guo-Jun; Zhang, Xue-Jin; Yuan, Jin-Hong; Deng, Chuan-Liang; Gu, Lian-Feng; Gao, Wu-Jun

    2016-01-01

    Dioecious plants usually harbor 'young' sex chromosomes, providing an opportunity to study the early stages of sex chromosome evolution. Transposable elements (TEs) are mobile DNA elements frequently found in plants and are suggested to play important roles in plant sex chromosome evolution. The genomes of several dioecious plants have been sequenced, offering an opportunity to annotate and mine the TE data. However, comprehensive and unified annotation of TEs in these dioecious plants is still lacking. In this study, we constructed a dioecious plant transposable element database (DPTEdb). DPTEdb is a specific, comprehensive and unified relational database and web interface. We used a combination of de novo, structure-based and homology-based approaches to identify TEs from the genome assemblies of previously published data, as well as our own. The database currently integrates eight dioecious plant species and a total of 31 340 TEs along with classification information. DPTEdb provides user-friendly web interfaces to browse, search and download the TE sequences in the database. Users can also use tools, including BLAST, GetORF, HMMER, Cut sequence and JBrowse, to analyze TE data. Given the role of TEs in plant sex chromosome evolution, the database will contribute to the investigation of TEs in structural, functional and evolutionary dynamics of the genome of dioecious plants. In addition, the database will supplement the research of sex diversification and sex chromosome evolution of dioecious plants.Database URL: http://genedenovoweb.ticp.net:81/DPTEdb/index.php. PMID:27173524

  12. DPTEdb, an integrative database of transposable elements in dioecious plants

    PubMed Central

    Li, Shu-Fen; Zhang, Guo-Jun; Zhang, Xue-Jin; Yuan, Jin-Hong; Deng, Chuan-Liang; Gu, Lian-Feng; Gao, Wu-Jun

    2016-01-01

    Dioecious plants usually harbor ‘young’ sex chromosomes, providing an opportunity to study the early stages of sex chromosome evolution. Transposable elements (TEs) are mobile DNA elements frequently found in plants and are suggested to play important roles in plant sex chromosome evolution. The genomes of several dioecious plants have been sequenced, offering an opportunity to annotate and mine the TE data. However, comprehensive and unified annotation of TEs in these dioecious plants is still lacking. In this study, we constructed a dioecious plant transposable element database (DPTEdb). DPTEdb is a specific, comprehensive and unified relational database and web interface. We used a combination of de novo, structure-based and homology-based approaches to identify TEs from the genome assemblies of previously published data, as well as our own. The database currently integrates eight dioecious plant species and a total of 31 340 TEs along with classification information. DPTEdb provides user-friendly web interfaces to browse, search and download the TE sequences in the database. Users can also use tools, including BLAST, GetORF, HMMER, Cut sequence and JBrowse, to analyze TE data. Given the role of TEs in plant sex chromosome evolution, the database will contribute to the investigation of TEs in structural, functional and evolutionary dynamics of the genome of dioecious plants. In addition, the database will supplement the research of sex diversification and sex chromosome evolution of dioecious plants. Database URL: http://genedenovoweb.ticp.net:81/DPTEdb/index.php PMID:27173524

  13. Advanced transportation system studies. Alternate propulsion subsystem concepts: Propulsion database

    NASA Technical Reports Server (NTRS)

    Levack, Daniel

    1993-01-01

    The Advanced Transportation System Studies alternate propulsion subsystem concepts propulsion database interim report is presented. The objective of the database development task is to produce a propulsion database which is easy to use and modify while also being comprehensive in the level of detail available. The database is to be available on the Macintosh computer system. The task is to extend across all three years of the contract. Consequently, a significant fraction of the effort in this first year of the task was devoted to the development of the database structure to ensure a robust base for the following years' efforts. Nonetheless, significant point design propulsion system descriptions and parametric models were also produced. Each of the two propulsion databases, parametric propulsion database and propulsion system database, are described. The descriptions include a user's guide to each code, write-ups for models used, and sample output. The parametric database has models for LOX/H2 and LOX/RP liquid engines, solid rocket boosters using three different propellants, a hybrid rocket booster, and a NERVA derived nuclear thermal rocket engine.

  14. BSD: the Biodegradative Strain Database.

    PubMed

    Urbance, John W; Cole, James; Saxman, Paul; Tiedje, James M

    2003-01-01

    The Biodegradative Strain Database (BSD) is a freely-accessible, web-based database providing detailed information on degradative bacteria and the hazardous substances that they degrade, including corresponding literature citations, relevant patents and links to additional web-based biological and chemical data. The BSD (http://bsd.cme.msu.edu) is being developed within the phylogenetic framework of the Ribosomal Database Project II (RDPII: http://rdp.cme.msu.edu/html) to provide a biological complement to the chemical and degradative pathway data of the University of Minnesota Biocatalysis/Biodegradation Database (UM-BBD: http://umbbd.ahc.umn.edu). Data is accessible through a series of strain, chemical and reference lists or by keyword search. The web site also includes on-line data submission and user survey forms to solicit user contributions and suggestions. The current release contains information on over 250 degradative bacterial strains and 150 hazardous substances. The transformation of xenobiotics and other environmentally toxic compounds by microorganisms is central to strategies for biocatalysis and the bioremediation of contaminated environments. However, practical, comprehensive, strain-level information on biocatalytic/biodegradative microbes is not readily available and is often difficult to compile. Similarly, for any given environmental contaminant, there is no single resource that can provide comparative information on the array of identified microbes capable of degrading the chemical. A web site that consolidates and cross-references strain, chemical and reference data related to biocatalysis, biotransformation, biodegradation and bioremediation would be an invaluable tool for academic and industrial researchers and environmental engineers.

  15. Compact variant-rich customized sequence database and a fast and sensitive database search for efficient proteogenomic analyses.

    PubMed

    Park, Heejin; Bae, Junwoo; Kim, Hyunwoo; Kim, Sangok; Kim, Hokeun; Mun, Dong-Gi; Joh, Yoonsung; Lee, Wonyeop; Chae, Sehyun; Lee, Sanghyuk; Kim, Hark Kyun; Hwang, Daehee; Lee, Sang-Won; Paek, Eunok

    2014-12-01

    In proteogenomic analysis, construction of a compact, customized database from mRNA-seq data and a sensitive search of both reference and customized databases are essential to accurately determine protein abundances and structural variations at the protein level. However, these tasks have not been systematically explored, but rather performed in an ad-hoc fashion. Here, we present an effective method for constructing a compact database containing comprehensive sequences of sample-specific variants--single nucleotide variants, insertions/deletions, and stop-codon mutations derived from Exome-seq and RNA-seq data. It, however, occupies less space by storing variant peptides, not variant proteins. We also present an efficient search method for both customized and reference databases. The separate searches of the two databases increase the search time, and a unified search is less sensitive to identify variant peptides due to the smaller size of the customized database, compared to the reference database, in the target-decoy setting. Our method searches the unified database once, but performs target-decoy validations separately. Experimental results show that our approach is as fast as the unified search and as sensitive as the separate searches. Our customized database includes mutation information in the headers of variant peptides, thereby facilitating the inspection of peptide-spectrum matches.

  16. Dictionary as Database.

    ERIC Educational Resources Information Center

    Painter, Derrick

    1996-01-01

    Discussion of dictionaries as databases focuses on the digitizing of The Oxford English dictionary (OED) and the use of Standard Generalized Mark-Up Language (SGML). Topics include the creation of a consortium to digitize the OED, document structure, relational databases, text forms, sequence, and discourse. (LRW)

  17. A Quality System Database

    NASA Technical Reports Server (NTRS)

    Snell, William H.; Turner, Anne M.; Gifford, Luther; Stites, William

    2010-01-01

    A quality system database (QSD), and software to administer the database, were developed to support recording of administrative nonconformance activities that involve requirements for documentation of corrective and/or preventive actions, which can include ISO 9000 internal quality audits and customer complaints.

  18. The Halophile protein database.

    PubMed

    Sharma, Naveen; Farooqi, Mohammad Samir; Chaturvedi, Krishna Kumar; Lal, Shashi Bhushan; Grover, Monendra; Rai, Anil; Pandey, Pankaj

    2014-01-01

    Halophilic archaea/bacteria adapt to different salt concentration, namely extreme, moderate and low. These type of adaptations may occur as a result of modification of protein structure and other changes in different cell organelles. Thus proteins may play an important role in the adaptation of halophilic archaea/bacteria to saline conditions. The Halophile protein database (HProtDB) is a systematic attempt to document the biochemical and biophysical properties of proteins from halophilic archaea/bacteria which may be involved in adaptation of these organisms to saline conditions. In this database, various physicochemical properties such as molecular weight, theoretical pI, amino acid composition, atomic composition, estimated half-life, instability index, aliphatic index and grand average of hydropathicity (Gravy) have been listed. These physicochemical properties play an important role in identifying the protein structure, bonding pattern and function of the specific proteins. This database is comprehensive, manually curated, non-redundant catalogue of proteins. The database currently contains 59 897 proteins properties extracted from 21 different strains of halophilic archaea/bacteria. The database can be accessed through link. Database URL: http://webapp.cabgrid.res.in/protein/

  19. Specialist Bibliographic Databases

    PubMed Central

    2016-01-01

    Specialist bibliographic databases offer essential online tools for researchers and authors who work on specific subjects and perform comprehensive and systematic syntheses of evidence. This article presents examples of the established specialist databases, which may be of interest to those engaged in multidisciplinary science communication. Access to most specialist databases is through subscription schemes and membership in professional associations. Several aggregators of information and database vendors, such as EBSCOhost and ProQuest, facilitate advanced searches supported by specialist keyword thesauri. Searches of items through specialist databases are complementary to those through multidisciplinary research platforms, such as PubMed, Web of Science, and Google Scholar. Familiarizing with the functional characteristics of biomedical and nonbiomedical bibliographic search tools is mandatory for researchers, authors, editors, and publishers. The database users are offered updates of the indexed journal lists, abstracts, author profiles, and links to other metadata. Editors and publishers may find particularly useful source selection criteria and apply for coverage of their peer-reviewed journals and grey literature sources. These criteria are aimed at accepting relevant sources with established editorial policies and quality controls. PMID:27134485

  20. Specialist Bibliographic Databases.

    PubMed

    Gasparyan, Armen Yuri; Yessirkepov, Marlen; Voronov, Alexander A; Trukhachev, Vladimir I; Kostyukova, Elena I; Gerasimov, Alexey N; Kitas, George D

    2016-05-01

    Specialist bibliographic databases offer essential online tools for researchers and authors who work on specific subjects and perform comprehensive and systematic syntheses of evidence. This article presents examples of the established specialist databases, which may be of interest to those engaged in multidisciplinary science communication. Access to most specialist databases is through subscription schemes and membership in professional associations. Several aggregators of information and database vendors, such as EBSCOhost and ProQuest, facilitate advanced searches supported by specialist keyword thesauri. Searches of items through specialist databases are complementary to those through multidisciplinary research platforms, such as PubMed, Web of Science, and Google Scholar. Familiarizing with the functional characteristics of biomedical and nonbiomedical bibliographic search tools is mandatory for researchers, authors, editors, and publishers. The database users are offered updates of the indexed journal lists, abstracts, author profiles, and links to other metadata. Editors and publishers may find particularly useful source selection criteria and apply for coverage of their peer-reviewed journals and grey literature sources. These criteria are aimed at accepting relevant sources with established editorial policies and quality controls. PMID:27134485

  1. Specialist Bibliographic Databases.

    PubMed

    Gasparyan, Armen Yuri; Yessirkepov, Marlen; Voronov, Alexander A; Trukhachev, Vladimir I; Kostyukova, Elena I; Gerasimov, Alexey N; Kitas, George D

    2016-05-01

    Specialist bibliographic databases offer essential online tools for researchers and authors who work on specific subjects and perform comprehensive and systematic syntheses of evidence. This article presents examples of the established specialist databases, which may be of interest to those engaged in multidisciplinary science communication. Access to most specialist databases is through subscription schemes and membership in professional associations. Several aggregators of information and database vendors, such as EBSCOhost and ProQuest, facilitate advanced searches supported by specialist keyword thesauri. Searches of items through specialist databases are complementary to those through multidisciplinary research platforms, such as PubMed, Web of Science, and Google Scholar. Familiarizing with the functional characteristics of biomedical and nonbiomedical bibliographic search tools is mandatory for researchers, authors, editors, and publishers. The database users are offered updates of the indexed journal lists, abstracts, author profiles, and links to other metadata. Editors and publishers may find particularly useful source selection criteria and apply for coverage of their peer-reviewed journals and grey literature sources. These criteria are aimed at accepting relevant sources with established editorial policies and quality controls.

  2. Cascadia Tsunami Deposit Database

    USGS Publications Warehouse

    Peters, Robert; Jaffe, Bruce; Gelfenbaum, Guy; Peterson, Curt

    2003-01-01

    The Cascadia Tsunami Deposit Database contains data on the location and sedimentological properties of tsunami deposits found along the Cascadia margin. Data have been compiled from 52 studies, documenting 59 sites from northern California to Vancouver Island, British Columbia that contain known or potential tsunami deposits. Bibliographical references are provided for all sites included in the database. Cascadia tsunami deposits are usually seen as anomalous sand layers in coastal marsh or lake sediments. The studies cited in the database use numerous criteria based on sedimentary characteristics to distinguish tsunami deposits from sand layers deposited by other processes, such as river flooding and storm surges. Several studies cited in the database contain evidence for more than one tsunami at a site. Data categories include age, thickness, layering, grainsize, and other sedimentological characteristics of Cascadia tsunami deposits. The database documents the variability observed in tsunami deposits found along the Cascadia margin.

  3. Database Manager

    ERIC Educational Resources Information Center

    Martin, Andrew

    2010-01-01

    It is normal practice today for organizations to store large quantities of records of related information as computer-based files or databases. Purposeful information is retrieved by performing queries on the data sets. The purpose of DATABASE MANAGER is to communicate to students the method by which the computer performs these queries. This…

  4. Maize databases

    Technology Transfer Automated Retrieval System (TEKTRAN)

    This chapter is a succinct overview of maize data held in the species-specific database MaizeGDB (the Maize Genomics and Genetics Database), and selected multi-species data repositories, such as Gramene/Ensembl Plants, Phytozome, UniProt and the National Center for Biotechnology Information (NCBI), ...

  5. GMDD: a database of GMO detection methods

    PubMed Central

    Dong, Wei; Yang, Litao; Shen, Kailin; Kim, Banghyun; Kleter, Gijs A; Marvin, Hans JP; Guo, Rong; Liang, Wanqi; Zhang, Dabing

    2008-01-01

    Background Since more than one hundred events of genetically modified organisms (GMOs) have been developed and approved for commercialization in global area, the GMO analysis methods are essential for the enforcement of GMO labelling regulations. Protein and nucleic acid-based detection techniques have been developed and utilized for GMOs identification and quantification. However, the information for harmonization and standardization of GMO analysis methods at global level is needed. Results GMO Detection method Database (GMDD) has collected almost all the previous developed and reported GMOs detection methods, which have been grouped by different strategies (screen-, gene-, construct-, and event-specific), and also provide a user-friendly search service of the detection methods by GMO event name, exogenous gene, or protein information, etc. In this database, users can obtain the sequences of exogenous integration, which will facilitate PCR primers and probes design. Also the information on endogenous genes, certified reference materials, reference molecules, and the validation status of developed methods is included in this database. Furthermore, registered users can also submit new detection methods and sequences to this database, and the newly submitted information will be released soon after being checked. Conclusion GMDD contains comprehensive information of GMO detection methods. The database will make the GMOs analysis much easier. PMID:18522755

  6. Phase Equilibria Diagrams Database

    National Institute of Standards and Technology Data Gateway

    SRD 31 NIST/ACerS Phase Equilibria Diagrams Database (PC database for purchase)   The Phase Equilibria Diagrams Database contains commentaries and more than 21,000 diagrams for non-organic systems, including those published in all 21 hard-copy volumes produced as part of the ACerS-NIST Phase Equilibria Diagrams Program (formerly titled Phase Diagrams for Ceramists): Volumes I through XIV (blue books); Annuals 91, 92, 93; High Tc Superconductors I & II; Zirconium & Zirconia Systems; and Electronic Ceramics I. Materials covered include oxides as well as non-oxide systems such as chalcogenides and pnictides, phosphates, salt systems, and mixed systems of these classes.

  7. The Molecular Biology Database Collection: 2008 update

    PubMed Central

    Galperin, Michael Y.

    2008-01-01

    The Nucleic Acids Research online Molecular Biology Database Collection is a public repository that lists more than 1000 databases described in this and previous Nucleic Acids Research annual database issues, as well as a selection of molecular biology databases described in other journals. All databases included in this Collection are freely available to the public. The 2008 update includes 1078 databases, 110 more than the previous one. The links to more than 80 databases have been updated and 25 obsolete databases have been removed from the list. The complete database list and summaries are available online at the Nucleic Acids Research web site, http://nar.oxfordjournals.org/. PMID:18025043

  8. DRAG: a database for recognition and analasys of gait

    NASA Astrophysics Data System (ADS)

    Kuchi, Prem; Hiremagalur, Raghu Ram V.; Huang, Helen; Carhart, Michael; He, Jiping; Panchanathan, Sethuraman

    2003-11-01

    A novel approach is proposed for creating a standardized and comprehensive database for gait analysis. The field of gait analysis is gaining increasing attention for applications such as visual surveillance, human-computer interfaces, and gait recognition and rehabilitation. Numerous algorithms have been developed for analyzing and processing gait data; however, a standard database for their systematic evaluation does not exist. Instead, existing gait databases consist of subsets of kinematic, kinetic, and electromyographic activity recordings by different investigators, at separate laboratories, and under varying conditions. Thus, the existing databases are neither homogenous nor sufficiently populated to statistically validate the algorithms. In this paper, a methodology for creating a database is presented, which can be used as a common ground to test the performance of algorithms that rely upon external marker data, ground reaction loading data, and/or video images. The database consists of: (1) synchronized motion-capture data (3D marker data) obtained using external markers, (2) computed joint angles, and (3) ground reaction loading acquired with plantar pressure insoles. This database could be easily expanded to include synchronized video, which will facilitate further development of video-based algorithms for motion tracking. This eventually could lead to the realization of markerless gait tracking. Such a system would have extensive applications in gait recognition, as well as gait rehabilitation. The entire database (marker, angle, and force data) will be placed in the public domain, and made available for downloads over the World Wide Web.

  9. Who's Gonna Pay the Piper for Free Online Databases?

    ERIC Educational Resources Information Center

    Jacso, Peter

    1996-01-01

    Discusses new pricing models for some online services and considers the possibilities for the traditional online database market. Topics include multimedia music databases, including copyright implications; other retail-oriented databases; and paying for free databases with advertising. (LRW)

  10. Version 3 of the GEOMAGIA50 Paleomagnetic Database

    NASA Astrophysics Data System (ADS)

    Brown, M. C.; Donadini, F.; Nilsson, A.; Korte, M. C.; Panovska, S.; Frank, U.; Korhonen, K.; Lodge, A.; Lengyel, S. N.; Schuberth, M.; Constable, C.

    2015-12-01

    Version 3 of GEOMAGIA50 is the product of an international collaboration to develop a comprehensive online database that allows easy access to published paleomagnetic and chronological data from the past 50 ka. The overarching scientific goal is to improve our understanding of changes in the geomagnetic field, with applications ranging from the deep Earth, to past environments and climate, to the interaction of the geomagnetic field with the space environment. We present the large range of modifications made to the database since its initial release in 2006. The most significant is the addition of paleomagnetic, rock magnetic and chronological data obtained from lake and marine sediments. An emphasis has been placed on geochronological data from sediments and the database includes comprehensive radiocarbon age data, oxygen isotope data, varve ages and tephra ages. In the archeomagnetic and volcanic database, the amount of declination, inclination and paleointensity data has been increased greatly and now exceeds 15,000. In addition, a number of new metadata fields have been added, e.g., more specific information about the archaeological or volcanic sites the data were obtained from. Finally, the web interface has been updated. This includes a new query form for the sediment database and improved functionality of the archeomagnetic and volcanic query form. The webpage can be found at found at http://geomagia.gfz-potsdam.de.

  11. RNAcentral: an international database of ncRNA sequences

    DOE PAGES

    Williams, Kelly Porter

    2014-10-28

    The field of non-coding RNA biology has been hampered by the lack of availability of a comprehensive, up-to-date collection of accessioned RNA sequences. Here we present the first release of RNAcentral, a database that collates and integrates information from an international consortium of established RNA sequence databases. The initial release contains over 8.1 million sequences, including representatives of all major functional classes. A web portal (http://rnacentral.org) provides free access to data, search functionality, cross-references, source code and an integrated genome browser for selected species.

  12. Expert systems identify fossils and manage large paleontological databases

    SciTech Connect

    Beightol, D.S. ); Conrad, M.A.

    1988-02-01

    EXPAL is a computer program permitting creation and maintenance of comprehensive databases in marine paleontology. It is designed to assist specialists and non-specialists. EXPAL includes a powerful expert system based on the morphological descriptors specific to a given group of fossils. The expert system may be used, for example, to describe and automatically identify an unknown specimen. EXPAL was first applied to Dasycladales (Calcareous green algae). Projects are under way for corresponding expert systems and databases on planktonic foraminifers and calpionellids. EXPAL runs on an IBM XT or compatible microcomputer.

  13. SAMMD: Staphylococcus aureus Microarray Meta-Database

    PubMed Central

    Nagarajan, Vijayaraj; Elasri, Mohamed O

    2007-01-01

    Background Staphylococcus aureus is an important human pathogen, causing a wide variety of diseases ranging from superficial skin infections to severe life threatening infections. S. aureus is one of the leading causes of nosocomial infections. Its ability to resist multiple antibiotics poses a growing public health problem. In order to understand the mechanism of pathogenesis of S. aureus, several global expression profiles have been developed. These transcriptional profiles included regulatory mutants of S. aureus and growth of wild type under different growth conditions. The abundance of these profiles has generated a large amount of data without a uniform annotation system to comprehensively examine them. We report the development of the Staphylococcus aureus Microarray meta-database (SAMMD) which includes data from all the published transcriptional profiles. SAMMD is a web-accessible database that helps users to perform a variety of analysis against and within the existing transcriptional profiles. Description SAMMD is a relational database that uses MySQL as the back end and PHP/JavaScript/DHTML as the front end. The database is normalized and consists of five tables, which holds information about gene annotations, regulated gene lists, experimental details, references, and other details. SAMMD data is collected from the peer-reviewed published articles. Data extraction and conversion was done using perl scripts while data entry was done through phpMyAdmin tool. The database is accessible via a web interface that contains several features such as a simple search by ORF ID, gene name, gene product name, advanced search using gene lists, comparing among datasets, browsing, downloading, statistics, and help. The database is licensed under General Public License (GPL). Conclusion SAMMD is hosted and available at . Currently there are over 9500 entries for regulated genes, from 67 microarray experiments. SAMMD will help staphylococcal scientists to analyze their

  14. Splatalogue: Database for Astronomical Spectroscopy

    NASA Astrophysics Data System (ADS)

    Remijan, Anthony J.; Markwick-Kemper, A.; ALMA Working Group on Spectral Line Frequencies

    2007-12-01

    The next generation of powerful radio and millimeter/submillimeter observatories (e.g. EVLA, ALMA, & Herschel) require extensive resources to help identify spectral line transitions. We describe the compilation of the most complete spectral line database currently assembled for this purpose. The Splatalogue is a comprehensive transition-resolved compilation of observed, measured and calculated spectral lines. In addition to the JPL and CDMS spectral line lists, 229,221 new/updated lines from the Spectral Line Atlas of Interstellar Molecules (SLAIM) were included. Of that, 12,332 lines (or an addition of 2000 lines) were added to the Lovas/NIST Recommended Rest Frequencies of known astronomical transitions. To these added lines, we have run diagnostics on the 4 lists for overlaps on transitions, frequencies, formulae and chemical names and have come up with a common way to display and designate each individual species. Splatalogue also contains atomic and recombination lines, template spectra, and is completely VO-compliant, queryable under the IVOA SLAP standard. The details of the database and how it will be used for the ALMA archive, observing tool and data reduction packages will be discussed.

  15. Online interactive U.S. Reservoir Sedimentation Survey Database

    USGS Publications Warehouse

    Gray, J.B.; Bernard, J.M.; Schwarz, G.E.; Stewart, D.W.; Ray, K.T.

    2009-01-01

    In April 2009, the U.S. Geological Survey and the Natural Resources Conservation Service (prior to 1994, the Soil Conservation Service) created the Reservoir Sedimentation Survey Database (RESSED) and Web site, the most comprehensive compilation of data from reservoir bathymetric and dry basin surveys in the United States. RESSED data can be useful for a number of purposes, including calculating changes in reservoir storage characteristics, quantifying rates of sediment delivery to reservoirs, and estimating erosion rates in a reservoir's watershed.

  16. 75 FR 65611 - Native American Tribal Insignia Database

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-10-26

    ... Patent and Trademark Office Native American Tribal Insignia Database ACTION: Proposed collection; comment... recommendations made in the report was that the USPTO create and maintain an accurate and comprehensive database... database. The USPTO database of official tribal insignias assists trademark attorneys in their...

  17. BGDB: a database of bivalent genes.

    PubMed

    Li, Qingyan; Lian, Shuabin; Dai, Zhiming; Xiang, Qian; Dai, Xianhua

    2013-01-01

    Bivalent gene is a gene marked with both H3K4me3 and H3K27me3 epigenetic modification in the same area, and is proposed to play a pivotal role related to pluripotency in embryonic stem (ES) cells. Identification of these bivalent genes and understanding their functions are important for further research of lineage specification and embryo development. So far, lots of genome-wide histone modification data were generated in mouse and human ES cells. These valuable data make it possible to identify bivalent genes, but no comprehensive data repositories or analysis tools are available for bivalent genes currently. In this work, we develop BGDB, the database of bivalent genes. The database contains 6897 bivalent genes in human and mouse ES cells, which are manually collected from scientific literature. Each entry contains curated information, including genomic context, sequences, gene ontology and other relevant information. The web services of BGDB database were implemented with PHP + MySQL + JavaScript, and provide diverse query functions. Database URL: http://dailab.sysu.edu.cn/bgdb/

  18. ECOTOX database; new additions and future direction

    EPA Science Inventory

    The ECOTOXicology database (ECOTOX) is a comprehensive, publicly available knowledgebase developed and maintained by ORD/NHEERL. It is used for environmental toxicity data on aquatic life, terrestrial plants and wildlife. Publications are identified for potential applicability af...

  19. Chemical Kinetics Database

    National Institute of Standards and Technology Data Gateway

    SRD 17 NIST Chemical Kinetics Database (Web, free access)   The NIST Chemical Kinetics Database includes essentially all reported kinetics results for thermal gas-phase chemical reactions. The database is designed to be searched for kinetics data based on the specific reactants involved, for reactions resulting in specified products, for all the reactions of a particular species, or for various combinations of these. In addition, the bibliography can be searched by author name or combination of names. The database contains in excess of 38,000 separate reaction records for over 11,700 distinct reactant pairs. These data have been abstracted from over 12,000 papers with literature coverage through early 2000.

  20. ECRbase: Database of Evolutionary Conserved Regions, Promoters, and Transcription Factor Binding Sites in Vertebrate Genomes

    DOE Data Explorer

    Loots, Gabriela G. [LLNL; Ovcharenko, I. [LLNL

    Evolutionary conservation of DNA sequences provides a tool for the identification of functional elements in genomes. This database of evolutionary conserved regions (ECRs) in vertebrate genomes features a database of syntenic blocks that recapitulate the evolution of rearrangements in vertebrates and a comprehensive collection of promoters in all vertebrate genomes generated using multiple sources of gene annotation. The database also contains a collection of annotated transcription factor binding sites (TFBSs) in evolutionary conserved and promoter elements. ECRbase currently includes human, rhesus macaque, dog, opossum, rat, mouse, chicken, frog, zebrafish, and fugu genomes. (taken from paper in Journal: Bioinformatics, November 7, 2006, pp. 122-124

  1. Comprehensive Care

    MedlinePlus

    ... Text Larger Text Print In this article A complex disease requires a comprehensive approach Today multiple sclerosis ( ... Your Whole Health, Your Whole Team: Managing Your Complex MS Symptoms Webinar/telelearning presented by Roz Kalb, ...

  2. Drinking Water Treatability Database (Database)

    EPA Science Inventory

    The drinking Water Treatability Database (TDB) will provide data taken from the literature on the control of contaminants in drinking water, and will be housed on an interactive, publicly-available USEPA web site. It can be used for identifying effective treatment processes, rec...

  3. The Institute for Rock Magnetism Facility Database

    NASA Astrophysics Data System (ADS)

    Jackson, M. J.; Sølheid, P.; Bowles, J. A.; Moskowitz, B. M.; Feinberg, J. M.

    2012-12-01

    The Institute for Rock Magnetism (IRM) is one of 19 National Multi-User Facilities supported by the Instruments and Facilities program of NSF for geoscience research that requires complex, expensive and advanced instrumentation. Visiting and in-house researchers at the IRM have access to sensitive laboratory instruments for magnetometry, magnetic microscopy and Mössbauer spectroscopy, for carrying out a wide variety of experiments under a range of applied field and temperature conditions. Results are used to gain insight into a very diverse assortment of natural materials and phenomena including biomagnetism, environmental magnetism, petrofabrics, nanophase materials, shocked materials, and paleomagnetism of terrestrial and extraterrestrial materials. A comprehensive laboratory database has been in operation since 2004, storing detailed experimental data and metadata for more than 250 facility users, with measurements on over 50,000 specimens, including over one million remanence measurements and 45,000 hysteresis loops. Custom software tools provide consistent and reliable handling of basic data processing (e.g., mass normalization and unit conversion), as well as more advanced interactive analysis (e.g., deconvolution of u-channel paleomagnetic data; filtering and statistical tests for high-field nonlinearity in calculating hysteresis loop parameters; thermal fluctuation tomography using T-dependent switching-field distributions from backfield remanence measurements or hysteresis loops). Users are also able to access their data and the custom software tools remotely once they leave the IRM for their home institutions. A key advantage of an integrated database/software system for a facility like the IRM is that it provides a rapid and automatic means of combining different kinds of data measured on different instruments. An important design consideration in the development of the facility database has been structural compatibility with the community-wide Mag

  4. Comprehensive Epidemiologic Data Resource (CEDR) (Poster)

    SciTech Connect

    Oak Ridge Institute for Science and Education

    2012-12-12

    This poster introduces the Comprehensive Epidemiologic Data Resource (CEDR), an electronic database with demographic, health outcome, and exposure information for over a million DOE nuclear plant and laboratory workers.

  5. FishTraits Database

    USGS Publications Warehouse

    Angermeier, Paul L.; Frimpong, Emmanuel A.

    2009-01-01

    The need for integrated and widely accessible sources of species traits data to facilitate studies of ecology, conservation, and management has motivated development of traits databases for various taxa. In spite of the increasing number of traits-based analyses of freshwater fishes in the United States, no consolidated database of traits of this group exists publicly, and much useful information on these species is documented only in obscure sources. The largely inaccessible and unconsolidated traits information makes large-scale analysis involving many fishes and/or traits particularly challenging. FishTraits is a database of >100 traits for 809 (731 native and 78 exotic) fish species found in freshwaters of the conterminous United States, including 37 native families and 145 native genera. The database contains information on four major categories of traits: (1) trophic ecology, (2) body size and reproductive ecology (life history), (3) habitat associations, and (4) salinity and temperature tolerances. Information on geographic distribution and conservation status is also included. Together, we refer to the traits, distribution, and conservation status information as attributes. Descriptions of attributes are available here. Many sources were consulted to compile attributes, including state and regional species accounts and other databases.

  6. Drinking Water Database

    NASA Technical Reports Server (NTRS)

    Murray, ShaTerea R.

    2004-01-01

    This summer I had the opportunity to work in the Environmental Management Office (EMO) under the Chemical Sampling and Analysis Team or CS&AT. This team s mission is to support Glenn Research Center (GRC) and EM0 by providing chemical sampling and analysis services and expert consulting. Services include sampling and chemical analysis of water, soil, fbels, oils, paint, insulation materials, etc. One of this team s major projects is the Drinking Water Project. This is a project that is done on Glenn s water coolers and ten percent of its sink every two years. For the past two summers an intern had been putting together a database for this team to record the test they had perform. She had successfully created a database but hadn't worked out all the quirks. So this summer William Wilder (an intern from Cleveland State University) and I worked together to perfect her database. We began be finding out exactly what every member of the team thought about the database and what they would change if any. After collecting this data we both had to take some courses in Microsoft Access in order to fix the problems. Next we began looking at what exactly how the database worked from the outside inward. Then we began trying to change the database but we quickly found out that this would be virtually impossible.

  7. Enabling heterogenous multi-scale database for emergency service functions through geoinformation technologies

    NASA Astrophysics Data System (ADS)

    Bhanumurthy, V.; Venugopala Rao, K.; Srinivasa Rao, S.; Ram Mohan Rao, K.; Chandra, P. Satya; Vidhyasagar, J.; Diwakar, P. G.; Dadhwal, V. K.

    2014-11-01

    Geographical Information Science (GIS) is now graduated from traditional desktop system to Internet system. Internet GIS is emerging as one of the most promising technologies for addressing Emergency Management. Web services with different privileges are playing an important role in dissemination of the emergency services to the decision makers. Spatial database is one of the most important components in the successful implementation of Emergency Management. It contains spatial data in the form of raster, vector, linked with non-spatial information. Comprehensive data is required to handle emergency situation in different phases. These database elements comprise core data, hazard specific data, corresponding attribute data, and live data coming from the remote locations. Core data sets are minimum required data including base, thematic, infrastructure layers to handle disasters. Disaster specific information is required to handle a particular disaster situation like flood, cyclone, forest fire, earth quake, land slide, drought. In addition to this Emergency Management require many types of data with spatial and temporal attributes that should be made available to the key players in the right format at right time. The vector database needs to be complemented with required resolution satellite imagery for visualisation and analysis in disaster management. Therefore, the database is interconnected and comprehensive to meet the requirement of an Emergency Management. This kind of integrated, comprehensive and structured database with appropriate information is required to obtain right information at right time for the right people. However, building spatial database for Emergency Management is a challenging task because of the key issues such as availability of data, sharing policies, compatible geospatial standards, data interoperability etc. Therefore, to facilitate using, sharing, and integrating the spatial data, there is a need to define standards to build

  8. Benefits of an International Database for UF6 Cylinders

    SciTech Connect

    Babcock, R A; Whitaker, J M; Murphy, J; Oakberg, J

    2008-06-30

    A reasonable expectation regarding the nuclear energy renaissance is that the location of fuel cycle nuclear materials throughout the world will be known. We ask--would an international system for uranium hexafluoride (UF{sub 6}) cylinders provide the effective assurances expected for international fuel supply and of the international fuel centers? This paper introduces the question and discusses the potential benefits of tracking UF{sub 6} cylinders through the development of an international database. The nonproliferation benefits of an international database for UF{sub 6} cylinders being used in the fuel cycle include an enhanced capability to reconcile nuclear material imports and exports. Currently, import and export declarations only require the reporting of total 'rolled up' quantities of nuclear materials contained in all items--not the quantities of materials in individual items like individual UF{sub 6} cylinders. The database could provide supplier countries with more assurance on the location of the UF{sub 6} cylinders they export. Additionally, a comprehensive database on all declared cylinders would be a valuable resource in detecting and recognizing undeclared cylinders. The database could potentially be administered by the IAEA and be accessible to authorized countries around the world. During the nuclear renaissance, the general public, as well as the participants will expect transparency and quality information about movement of nuclear fuel cycle nuclear materials. We will discuss the potential benefits of such a database for the suppliers, inspectorates, and general public.

  9. The Validity of Reading Comprehension Rate: Reading Speed, Comprehension, and Comprehension Rates

    ERIC Educational Resources Information Center

    Skinner, Christopher H.; Williams, Jacqueline L.; Morrow, Jennifer Ann; Hale, Andre D.; Neddenriep, Christine E.; Hawkins, Renee O.

    2009-01-01

    This article describes a secondary analysis of a brief reading comprehension rate measure, percent comprehension questions correct per minute spent reading (%C/M). This measure includes reading speed (seconds to read) in the denominator and percentage of comprehension questions answered correctly in the numerator. Participants were 22 4th-, 29…

  10. The RIKEN integrated database of mammals

    PubMed Central

    Masuya, Hiroshi; Makita, Yuko; Kobayashi, Norio; Nishikata, Koro; Yoshida, Yuko; Mochizuki, Yoshiki; Doi, Koji; Takatsuki, Terue; Waki, Kazunori; Tanaka, Nobuhiko; Ishii, Manabu; Matsushima, Akihiro; Takahashi, Satoshi; Hijikata, Atsushi; Kozaki, Kouji; Furuichi, Teiichi; Kawaji, Hideya; Wakana, Shigeharu; Nakamura, Yukio; Yoshiki, Atsushi; Murata, Takehide; Fukami-Kobayashi, Kaoru; Mohan, Sujatha; Ohara, Osamu; Hayashizaki, Yoshihide; Mizoguchi, Riichiro; Obata, Yuichi; Toyoda, Tetsuro

    2011-01-01

    The RIKEN integrated database of mammals (http://scinets.org/db/mammal) is the official undertaking to integrate its mammalian databases produced from multiple large-scale programs that have been promoted by the institute. The database integrates not only RIKEN’s original databases, such as FANTOM, the ENU mutagenesis program, the RIKEN Cerebellar Development Transcriptome Database and the Bioresource Database, but also imported data from public databases, such as Ensembl, MGI and biomedical ontologies. Our integrated database has been implemented on the infrastructure of publication medium for databases, termed SciNetS/SciNeS, or the Scientists’ Networking System, where the data and metadata are structured as a semantic web and are downloadable in various standardized formats. The top-level ontology-based implementation of mammal-related data directly integrates the representative knowledge and individual data records in existing databases to ensure advanced cross-database searches and reduced unevenness of the data management operations. Through the development of this database, we propose a novel methodology for the development of standardized comprehensive management of heterogeneous data sets in multiple databases to improve the sustainability, accessibility, utility and publicity of the data of biomedical information. PMID:21076152

  11. The RIKEN integrated database of mammals.

    PubMed

    Masuya, Hiroshi; Makita, Yuko; Kobayashi, Norio; Nishikata, Koro; Yoshida, Yuko; Mochizuki, Yoshiki; Doi, Koji; Takatsuki, Terue; Waki, Kazunori; Tanaka, Nobuhiko; Ishii, Manabu; Matsushima, Akihiro; Takahashi, Satoshi; Hijikata, Atsushi; Kozaki, Kouji; Furuichi, Teiichi; Kawaji, Hideya; Wakana, Shigeharu; Nakamura, Yukio; Yoshiki, Atsushi; Murata, Takehide; Fukami-Kobayashi, Kaoru; Mohan, Sujatha; Ohara, Osamu; Hayashizaki, Yoshihide; Mizoguchi, Riichiro; Obata, Yuichi; Toyoda, Tetsuro

    2011-01-01

    The RIKEN integrated database of mammals (http://scinets.org/db/mammal) is the official undertaking to integrate its mammalian databases produced from multiple large-scale programs that have been promoted by the institute. The database integrates not only RIKEN's original databases, such as FANTOM, the ENU mutagenesis program, the RIKEN Cerebellar Development Transcriptome Database and the Bioresource Database, but also imported data from public databases, such as Ensembl, MGI and biomedical ontologies. Our integrated database has been implemented on the infrastructure of publication medium for databases, termed SciNetS/SciNeS, or the Scientists' Networking System, where the data and metadata are structured as a semantic web and are downloadable in various standardized formats. The top-level ontology-based implementation of mammal-related data directly integrates the representative knowledge and individual data records in existing databases to ensure advanced cross-database searches and reduced unevenness of the data management operations. Through the development of this database, we propose a novel methodology for the development of standardized comprehensive management of heterogeneous data sets in multiple databases to improve the sustainability, accessibility, utility and publicity of the data of biomedical information.

  12. On Comprehension.

    ERIC Educational Resources Information Center

    Kintsch, Walter

    Attempts have been made to develop a model of the process of reading comprehension as a whole that would indicate under what conditions and for what reasons a long sentence might be more effective than several short ones, when repetition is helpful and when distracting, when content is better left implicit in a text, and how overexplicitness might…

  13. RESIS-II: An Updated Version of the Original Reservoir Sedimentation Survey Information System (RESIS) Database

    USGS Publications Warehouse

    Ackerman, Katherine V.; Mixon, David M.; Sundquist, Eric T.; Stallard, Robert F.; Schwarz, Gregory E.; Stewart, David W.

    2009-01-01

    The Reservoir Sedimentation Survey Information System (RESIS) database, originally compiled by the Soil Conservation Service (now the Natural Resources Conservation Service) in collaboration with the Texas Agricultural Experiment Station, is the most comprehensive compilation of data from reservoir sedimentation surveys throughout the conterminous United States (U.S.). The database is a cumulative historical archive that includes data from as early as 1755 and as late as 1993. The 1,823 reservoirs included in the database range in size from farm ponds to the largest U.S. reservoirs (such as Lake Mead). Results from 6,617 bathymetric surveys are available in the database. This Data Series provides an improved version of the original RESIS database, termed RESIS-II, and a report describing RESIS-II. The RESIS-II relational database is stored in Microsoft Access and includes more precise location coordinates for most of the reservoirs than the original database but excludes information on reservoir ownership. RESIS-II is anticipated to be a template for further improvements in the database.

  14. Patent Family Databases.

    ERIC Educational Resources Information Center

    Simmons, Edlyn S.

    1985-01-01

    Reports on retrieval of patent information online and includes definition of patent family, basic and equivalent patents, "parents and children" applications, designated states, patent family databases--International Patent Documentation Center, World Patents Index, APIPAT (American Petroleum Institute), CLAIMS (IFI/Plenum). A table noting country…

  15. The Ribosomal Database Project.

    PubMed Central

    Maidak, B L; Larsen, N; McCaughey, M J; Overbeek, R; Olsen, G J; Fogel, K; Blandy, J; Woese, C R

    1994-01-01

    The Ribosomal Database Project (RDP) is a curated database that offers ribosome-related data, analysis services, and associated computer programs. The offerings include phylogenetically ordered alignments of ribosomal RNA (rRNA) sequences, derived phylogenetic trees, rRNA secondary structure diagrams, and various software for handling, analyzing and displaying alignments and trees. The data are available via anonymous ftp (rdp.life.uiuc.edu), electronic mail (server/rdp.life.uiuc.edu) and gopher (rdpgopher.life.uiuc.edu). The electronic mail server also provides ribosomal probe checking, approximate phylogenetic placement of user-submitted sequences, screening for chimeric nature of newly sequenced rRNAs, and automated alignment. PMID:7524021

  16. ARTI Refrigerant Database

    SciTech Connect

    Calm, J.M.

    1992-04-30

    The Refrigerant Database consolidates and facilitates access to information to assist industry in developing equipment using alternative refrigerants. The underlying purpose is to accelerate phase out of chemical compounds of environmental concern. The database provides bibliographic citations and abstracts for publications that may be useful in research and design of air- conditioning and refrigeration equipment. The complete documents are not included, though some may be added at a later date. The database identifies sources of specific information on R-32, R-123, R-124, R- 125, R-134a, R-141b, R142b, R-143a, R-152a, R-290 (propane), R-717 (ammonia), ethers, and others as well as azeotropic and zeotropic blends of these fluids. It addresses polyalkylene glycol (PAG), ester, and other lubricants. It also references documents addressing compatibility of refrigerants and lubricants with metals, plastics, elastomers, motor insulation, and other materials used in refrigerant circuits.

  17. Mouse genome database 2016.

    PubMed

    Bult, Carol J; Eppig, Janan T; Blake, Judith A; Kadin, James A; Richardson, Joel E

    2016-01-01

    The Mouse Genome Database (MGD; http://www.informatics.jax.org) is the primary community model organism database for the laboratory mouse and serves as the source for key biological reference data related to mouse genes, gene functions, phenotypes and disease models with a strong emphasis on the relationship of these data to human biology and disease. As the cost of genome-scale sequencing continues to decrease and new technologies for genome editing become widely adopted, the laboratory mouse is more important than ever as a model system for understanding the biological significance of human genetic variation and for advancing the basic research needed to support the emergence of genome-guided precision medicine. Recent enhancements to MGD include new graphical summaries of biological annotations for mouse genes, support for mobile access to the database, tools to support the annotation and analysis of sets of genes, and expanded support for comparative biology through the expansion of homology data.

  18. Mouse genome database 2016

    PubMed Central

    Bult, Carol J.; Eppig, Janan T.; Blake, Judith A.; Kadin, James A.; Richardson, Joel E.

    2016-01-01

    The Mouse Genome Database (MGD; http://www.informatics.jax.org) is the primary community model organism database for the laboratory mouse and serves as the source for key biological reference data related to mouse genes, gene functions, phenotypes and disease models with a strong emphasis on the relationship of these data to human biology and disease. As the cost of genome-scale sequencing continues to decrease and new technologies for genome editing become widely adopted, the laboratory mouse is more important than ever as a model system for understanding the biological significance of human genetic variation and for advancing the basic research needed to support the emergence of genome-guided precision medicine. Recent enhancements to MGD include new graphical summaries of biological annotations for mouse genes, support for mobile access to the database, tools to support the annotation and analysis of sets of genes, and expanded support for comparative biology through the expansion of homology data. PMID:26578600

  19. Mouse genome database 2016.

    PubMed

    Bult, Carol J; Eppig, Janan T; Blake, Judith A; Kadin, James A; Richardson, Joel E

    2016-01-01

    The Mouse Genome Database (MGD; http://www.informatics.jax.org) is the primary community model organism database for the laboratory mouse and serves as the source for key biological reference data related to mouse genes, gene functions, phenotypes and disease models with a strong emphasis on the relationship of these data to human biology and disease. As the cost of genome-scale sequencing continues to decrease and new technologies for genome editing become widely adopted, the laboratory mouse is more important than ever as a model system for understanding the biological significance of human genetic variation and for advancing the basic research needed to support the emergence of genome-guided precision medicine. Recent enhancements to MGD include new graphical summaries of biological annotations for mouse genes, support for mobile access to the database, tools to support the annotation and analysis of sets of genes, and expanded support for comparative biology through the expansion of homology data. PMID:26578600

  20. National Ambient Radiation Database

    SciTech Connect

    Dziuban, J.; Sears, R.

    2003-02-25

    The U.S. Environmental Protection Agency (EPA) recently developed a searchable database and website for the Environmental Radiation Ambient Monitoring System (ERAMS) data. This site contains nationwide radiation monitoring data for air particulates, precipitation, drinking water, surface water and pasteurized milk. This site provides location-specific as well as national information on environmental radioactivity across several media. It provides high quality data for assessing public exposure and environmental impacts resulting from nuclear emergencies and provides baseline data during routine conditions. The database and website are accessible at www.epa.gov/enviro/. This site contains (1) a query for the general public which is easy to use--limits the amount of information provided, but includes the ability to graph the data with risk benchmarks and (2) a query for a more technical user which allows access to all of the data in the database, (3) background information on ER AMS.

  1. Extracting information from the molten salt database

    NASA Astrophysics Data System (ADS)

    Gadzuric, Slobodan; Suh, Changwon; Gaune-Escard, Marcelle; Rajan, Krishna

    2006-12-01

    Molten salt technology is a catchall phrase that includes some very diverse technologies; electrochemistry, heat transfer, chemical oxidation/reduction baths, and nuclear reactors. All of these technologies are linked by the general characteristics of molten salts that can function as solvents, have good heat-transfer characteristics, function like a fluid, can attain very high temperatures, can conduct electricity, and also may have chemical catalytic properties. The Janz molten salt database is the most comprehensive compilation of property data about molten salts available today and is widely used for both fundamental and applied purposes. Databases are traditionally viewed as “static” documents that are used in a “search and retrieval” mode. These static data can be transformed by informatics and data mining tools into a dynamic dataset for analysis of the properties of the, materials and for making predictions. While this approch has been successful in the chemical and biochemical sciences in searching for and establishing structure-property relationships, it is not widely used in the materials science community. Because the design of the original molten salt database was not oriented toward this informatics goal, it was essential to evaluate this dataset in terms of data mining standards. Two techniques were used—a projection (principal components analysis (PCA)) and a predictive method (partial least squares (PLS))—in conjunction with fundamental knowledge acquired from the long-term practice of molten salt chemistry.

  2. ADANS database specification

    SciTech Connect

    1997-01-16

    The purpose of the Air Mobility Command (AMC) Deployment Analysis System (ADANS) Database Specification (DS) is to describe the database organization and storage allocation and to provide the detailed data model of the physical design and information necessary for the construction of the parts of the database (e.g., tables, indexes, rules, defaults). The DS includes entity relationship diagrams, table and field definitions, reports on other database objects, and a description of the ADANS data dictionary. ADANS is the automated system used by Headquarters AMC and the Tanker Airlift Control Center (TACC) for airlift planning and scheduling of peacetime and contingency operations as well as for deliberate planning. ADANS also supports planning and scheduling of Air Refueling Events by the TACC and the unit-level tanker schedulers. ADANS receives input in the form of movement requirements and air refueling requests. It provides a suite of tools for planners to manipulate these requirements/requests against mobility assets and to develop, analyze, and distribute schedules. Analysis tools are provided for assessing the products of the scheduling subsystems, and editing capabilities support the refinement of schedules. A reporting capability provides formatted screen, print, and/or file outputs of various standard reports. An interface subsystem handles message traffic to and from external systems. The database is an integral part of the functionality summarized above.

  3. Shuttle Hypervelocity Impact Database

    NASA Technical Reports Server (NTRS)

    Hyde, James L.; Christiansen, Eric L.; Lear, Dana M.

    2011-01-01

    With three missions outstanding, the Shuttle Hypervelocity Impact Database has nearly 3000 entries. The data is divided into tables for crew module windows, payload bay door radiators and thermal protection system regions, with window impacts compromising just over half the records. In general, the database provides dimensions of hypervelocity impact damage, a component level location (i.e., window number or radiator panel number) and the orbiter mission when the impact occurred. Additional detail on the type of particle that produced the damage site is provided when sampling data and definitive analysis results are available. Details and insights on the contents of the database including examples of descriptive statistics will be provided. Post flight impact damage inspection and sampling techniques that were employed during the different observation campaigns will also be discussed. Potential enhancements to the database structure and availability of the data for other researchers will be addressed in the Future Work section. A related database of returned surfaces from the International Space Station will also be introduced.

  4. Shuttle Hypervelocity Impact Database

    NASA Technical Reports Server (NTRS)

    Hyde, James I.; Christiansen, Eric I.; Lear, Dana M.

    2011-01-01

    With three flights remaining on the manifest, the shuttle impact hypervelocity database has over 2800 entries. The data is currently divided into tables for crew module windows, payload bay door radiators and thermal protection system regions, with window impacts compromising just over half the records. In general, the database provides dimensions of hypervelocity impact damage, a component level location (i.e., window number or radiator panel number) and the orbiter mission when the impact occurred. Additional detail on the type of particle that produced the damage site is provided when sampling data and definitive analysis results are available. The paper will provide details and insights on the contents of the database including examples of descriptive statistics using the impact data. A discussion of post flight impact damage inspection and sampling techniques that were employed during the different observation campaigns will be presented. Future work to be discussed will be possible enhancements to the database structure and availability of the data for other researchers. A related database of ISS returned surfaces that are under development will also be introduced.

  5. Stackfile Database

    NASA Technical Reports Server (NTRS)

    deVarvalho, Robert; Desai, Shailen D.; Haines, Bruce J.; Kruizinga, Gerhard L.; Gilmer, Christopher

    2013-01-01

    This software provides storage retrieval and analysis functionality for managing satellite altimetry data. It improves the efficiency and analysis capabilities of existing database software with improved flexibility and documentation. It offers flexibility in the type of data that can be stored. There is efficient retrieval either across the spatial domain or the time domain. Built-in analysis tools are provided for frequently performed altimetry tasks. This software package is used for storing and manipulating satellite measurement data. It was developed with a focus on handling the requirements of repeat-track altimetry missions such as Topex and Jason. It was, however, designed to work with a wide variety of satellite measurement data [e.g., Gravity Recovery And Climate Experiment -- GRACE). The software consists of several command-line tools for importing, retrieving, and analyzing satellite measurement data.

  6. Analysis of Landslide Hazard Impact Using the Landslide Database for Germany

    NASA Astrophysics Data System (ADS)

    Klose, M.; Damm, B.

    2014-12-01

    The Federal Republic of Germany has long been among the few European countries that lack a national landslide database. Systematic collection and inventory of landslide data still shows a comprehensive research history in Germany, but only one focused on development of databases with local or regional coverage. This has changed in recent years with the launch of a database initiative aimed at closing the data gap existing at national level. The present contribution reports on this project that is based on a landslide database which evolved over the last 15 years to a database covering large parts of Germany. A strategy of systematic retrieval, extraction, and fusion of landslide data is at the heart of the methodology, providing the basis for a database with a broad potential of application. The database offers a data pool of more than 4,200 landslide data sets with over 13,000 single data files and dates back to 12th century. All types of landslides are covered by the database, which stores not only core attributes, but also various complementary data, including data on landslide causes, impacts, and mitigation. The current database migration to PostgreSQL/PostGIS is focused on unlocking the full scientific potential of the database, while enabling data sharing and knowledge transfer via a web GIS platform. In this contribution, the goals and the research strategy of the database project are highlighted at first, with a summary of best practices in database development providing perspective. Next, the focus is on key aspects of the methodology, which is followed by the results of different case studies in the German Central Uplands. The case study results exemplify database application in analysis of vulnerability to landslides, impact statistics, and hazard or cost modeling.

  7. Comprehensive rotorcraft analysis methods

    NASA Technical Reports Server (NTRS)

    Stephens, Wendell B.; Austin, Edward E.

    1988-01-01

    The development and application of comprehensive rotorcraft analysis methods in the field of rotorcraft technology are described. These large scale analyses and the resulting computer programs are intended to treat the complex aeromechanical phenomena that describe the behavior of rotorcraft. They may be used to predict rotor aerodynamics, acoustic, performance, stability and control, handling qualities, loads and vibrations, structures, dynamics, and aeroelastic stability characteristics for a variety of applications including research, preliminary and detail design, and evaluation and treatment of field problems. The principal comprehensive methods developed or under development in recent years and generally available to the rotorcraft community because of US Army Aviation Research and Technology Activity (ARTA) sponsorship of all or part of the software systems are the Rotorcraft Flight Simulation (C81), Dynamic System Coupler (DYSCO), Coupled Rotor/Airframe Vibration Analysis Program (SIMVIB), Comprehensive Analytical Model of Rotorcraft Aerodynamics and Dynamics (CAMRAD), General Rotorcraft Aeromechanical Stability Program (GRASP), and Second Generation Comprehensive Helicopter Analysis System (2GCHAS).

  8. The Neotoma Paleoecology Database

    NASA Astrophysics Data System (ADS)

    Grimm, E. C.; Ashworth, A. C.; Barnosky, A. D.; Betancourt, J. L.; Bills, B.; Booth, R.; Blois, J.; Charles, D. F.; Graham, R. W.; Goring, S. J.; Hausmann, S.; Smith, A. J.; Williams, J. W.; Buckland, P.

    2015-12-01

    The Neotoma Paleoecology Database (www.neotomadb.org) is a multiproxy, open-access, relational database that includes fossil data for the past 5 million years (the late Neogene and Quaternary Periods). Modern distributional data for various organisms are also being made available for calibration and paleoecological analyses. The project is a collaborative effort among individuals from more than 20 institutions worldwide, including domain scientists representing a spectrum of Pliocene-Quaternary fossil data types, as well as experts in information technology. Working groups are active for diatoms, insects, ostracodes, pollen and plant macroscopic remains, testate amoebae, rodent middens, vertebrates, age models, geochemistry and taphonomy. Groups are also active in developing online tools for data analyses and for developing modules for teaching at different levels. A key design concept of NeotomaDB is that stewards for various data types are able to remotely upload and manage data. Cooperatives for different kinds of paleo data, or from different regions, can appoint their own stewards. Over the past year, much progress has been made on development of the steward software-interface that will enable this capability. The steward interface uses web services that provide access to the database. More generally, these web services enable remote programmatic access to the database, which both desktop and web applications can use and which provide real-time access to the most current data. Use of these services can alleviate the need to download the entire database, which can be out-of-date as soon as new data are entered. In general, the Neotoma web services deliver data either from an entire table or from the results of a view. Upon request, new web services can be quickly generated. Future developments will likely expand the spatial and temporal dimensions of the database. NeotomaDB is open to receiving new datasets and stewards from the global Quaternary community

  9. DBGC: A Database of Human Gastric Cancer.

    PubMed

    Wang, Chao; Zhang, Jun; Cai, Mingdeng; Zhu, Zhenggang; Gu, Wenjie; Yu, Yingyan; Zhang, Xiaoyan

    2015-01-01

    The Database of Human Gastric Cancer (DBGC) is a comprehensive database that integrates various human gastric cancer-related data resources. Human gastric cancer-related transcriptomics projects, proteomics projects, mutations, biomarkers and drug-sensitive genes from different sources were collected and unified in this database. Moreover, epidemiological statistics of gastric cancer patients in China and clinicopathological information annotated with gastric cancer cases were also integrated into the DBGC. We believe that this database will greatly facilitate research regarding human gastric cancer in many fields. DBGC is freely available at http://bminfor.tongji.edu.cn/dbgc/index.do. PMID:26566288

  10. DBGC: A Database of Human Gastric Cancer.

    PubMed

    Wang, Chao; Zhang, Jun; Cai, Mingdeng; Zhu, Zhenggang; Gu, Wenjie; Yu, Yingyan; Zhang, Xiaoyan

    2015-01-01

    The Database of Human Gastric Cancer (DBGC) is a comprehensive database that integrates various human gastric cancer-related data resources. Human gastric cancer-related transcriptomics projects, proteomics projects, mutations, biomarkers and drug-sensitive genes from different sources were collected and unified in this database. Moreover, epidemiological statistics of gastric cancer patients in China and clinicopathological information annotated with gastric cancer cases were also integrated into the DBGC. We believe that this database will greatly facilitate research regarding human gastric cancer in many fields. DBGC is freely available at http://bminfor.tongji.edu.cn/dbgc/index.do.

  11. Databases for K-8 Students

    ERIC Educational Resources Information Center

    Young, Terrence E., Jr.

    2004-01-01

    Today's elementary school students have been exposed to computers since birth, so it is not surprising that they are so proficient at using them. As a result, they are ready to search databases that include topics and information appropriate for their age level. Subscription databases are digital copies of magazines, newspapers, journals,…

  12. Wind turbine reliability database update.

    SciTech Connect

    Peters, Valerie A.; Hill, Roger Ray; Stinebaugh, Jennifer A.; Veers, Paul S.

    2009-03-01

    This report documents the status of the Sandia National Laboratories' Wind Plant Reliability Database. Included in this report are updates on the form and contents of the Database, which stems from a fivestep process of data partnerships, data definition and transfer, data formatting and normalization, analysis, and reporting. Selected observations are also reported.

  13. Development of a national, dynamic reservoir-sedimentation database

    USGS Publications Warehouse

    Gray, J.R.; Bernard, J.M.; Stewart, D.W.; McFaul, E.J.; Laurent, K.W.; Schwarz, G.E.; Stinson, J.T.; Jonas, M.M.; Randle, T.J.; Webb, J.W.

    2010-01-01

    The importance of dependable, long-term water supplies, coupled with the need to quantify rates of capacity loss of the Nation’s re servoirs due to sediment deposition, were the most compelling reasons for developing the REServoir- SEDimentation survey information (RESSED) database and website. Created under the auspices of the Advisory Committee on Water Information’s Subcommittee on Sedimenta ion by the U.S. Geological Survey and the Natural Resources Conservation Service, the RESSED database is the most comprehensive compilation of data from reservoir bathymetric and dry-basin surveys in the United States. As of March 2010, the database, which contains data compiled on the 1950s vintage Soil Conservation Service’s Form SCS-34 data sheets, contained results from 6,616 surveys on 1,823 reservoirs in the United States and two surveys on one reservoir in Puerto Rico. The data span the period 1755–1997, with 95 percent of the surveys performed from 1930–1990. The reservoir surface areas range from sub-hectare-scale farm ponds to 658 km2 Lake Powell. The data in the RESSED database can be useful for a number of purposes, including calculating changes in reservoir-storage characteristics, quantifying sediment budgets, and estimating erosion rates in a reservoir’s watershed. The March 2010 version of the RESSED database has a number of deficiencies, including a cryptic and out-of-date database architecture; some geospatial inaccuracies (although most have been corrected); other data errors; an inability to store all data in a readily retrievable manner; and an inability to store all data types that currently exist. Perhaps most importantly, the March 2010 version of RESSED database provides no publically available means to submit new data and corrections to existing data. To address these and other deficiencies, the Subcommittee on Sedimentation, through the U.S. Geological Survey and the U.S. Army Corps of Engineers, began a collaborative project in

  14. Database of recent tsunami deposits

    USGS Publications Warehouse

    Peters, Robert; Jaffe, Bruce E.

    2010-01-01

    This report describes a database of sedimentary characteristics of tsunami deposits derived from published accounts of tsunami deposit investigations conducted shortly after the occurrence of a tsunami. The database contains 228 entries, each entry containing data from up to 71 categories. It includes data from 51 publications covering 15 tsunamis distributed between 16 countries. The database encompasses a wide range of depositional settings including tropical islands, beaches, coastal plains, river banks, agricultural fields, and urban environments. It includes data from both local tsunamis and teletsunamis. The data are valuable for interpreting prehistorical, historical, and modern tsunami deposits, and for the development of criteria to identify tsunami deposits in the geologic record.

  15. Information Literacy Skills: Comparing and Evaluating Databases

    ERIC Educational Resources Information Center

    Grismore, Brian A.

    2012-01-01

    The purpose of this database comparison is to express the importance of teaching information literacy skills and to apply those skills to commonly used Internet-based research tools. This paper includes a comparison and evaluation of three databases (ProQuest, ERIC, and Google Scholar). It includes strengths and weaknesses of each database based…

  16. ePlantLIBRA: A composition and biological activity database for bioactive compounds in plant food supplements.

    PubMed

    Plumb, J; Lyons, J; Nørby, K; Thomas, M; Nørby, E; Poms, R; Bucchini, L; Restani, P; Kiely, M; Finglas, P

    2016-02-15

    The newly developed ePlantLIBRA database is a comprehensive and searchable database, with up-to-date coherent and validated scientific information on plant food supplement (PFS) bioactive compounds, with putative health benefits as well as adverse effects, and contaminants and residues. It is the only web-based database available compiling peer reviewed publications and case studies on PFS. A user-friendly, efficient and flexible interface has been developed for searching, extracting, and exporting the data, including links to the original references. Data from over 570 publications have been quality evaluated and entered covering 70 PFS or their botanical ingredients.

  17. ePlantLIBRA: A composition and biological activity database for bioactive compounds in plant food supplements.

    PubMed

    Plumb, J; Lyons, J; Nørby, K; Thomas, M; Nørby, E; Poms, R; Bucchini, L; Restani, P; Kiely, M; Finglas, P

    2016-02-15

    The newly developed ePlantLIBRA database is a comprehensive and searchable database, with up-to-date coherent and validated scientific information on plant food supplement (PFS) bioactive compounds, with putative health benefits as well as adverse effects, and contaminants and residues. It is the only web-based database available compiling peer reviewed publications and case studies on PFS. A user-friendly, efficient and flexible interface has been developed for searching, extracting, and exporting the data, including links to the original references. Data from over 570 publications have been quality evaluated and entered covering 70 PFS or their botanical ingredients. PMID:26433297

  18. The Establishment of K-CaP (the Multicenter Korean Prostate Cancer Database)

    PubMed Central

    Lee, Dong Hoon; Lee, Seung Hwan; Rha, Koon Ho; Choi, In Young; Lee, Ji Youl; Kim, Sae Woong; Lee, Sangchul; Hong, Sung Kyu; Byun, Seok-Soo; Jeong, In Gab; Hong, Jun Hyuk; Kim, Choung-Soo; Jeon, Hwang Gyun; Lee, Hyun Moo

    2013-01-01

    Purpose The purpose of this article was to announce the establishment of the multicenter Korean Prostate Cancer Database (K-CaP) and to provide urologists with details about K-CaP's methodology. Materials and Methods The initial participating K-CaP institutions include five medical centers in Korea. First, we registered prostate cancer patients who underwent radical prostatectomy as the basic background data. K-CaP is poised to combine these initial observational longitudinal studies with those of other eligible institutions as the database grows. All current prostate cancer patients in Korea are able to be registered into the Web-based database system and thereby have a role in several observational studies. The structure of the database for K-CaP was developed by matching it with the respective data from different studies. The operability of the K-CaP database system was verified by using the existing databases from three participating institutions. Results The analysis of clinicopathologic characteristics of patients with the use of the Web-based database was successfully conducted. We confirmed the accurate operation of the Web-based database system without any difficulties. Conclusions We are announcing the establishment of K-CaP the first database of comprehensive observational longitudinal studies about prostate cancer in Korea. The database will be successfully maintained by sufficiently and continuously updating all patient data covering several treatments. Complete statistical results for registered prostate cancer patients are forthcoming for the basic background data to establish the database. Even though much trial and error are expected during the development process, we expect that K-CaP will eventually become one of the most powerful longitudinal observation databases. PMID:23614058

  19. Clinical Genomic Database

    PubMed Central

    Solomon, Benjamin D.; Nguyen, Anh-Dao; Bear, Kelly A.; Wolfsberg, Tyra G.

    2013-01-01

    Technological advances have greatly increased the availability of human genomic sequencing. However, the capacity to analyze genomic data in a clinically meaningful way lags behind the ability to generate such data. To help address this obstacle, we reviewed all conditions with genetic causes and constructed the Clinical Genomic Database (CGD) (http://research.nhgri.nih.gov/CGD/), a searchable, freely Web-accessible database of conditions based on the clinical utility of genetic diagnosis and the availability of specific medical interventions. The CGD currently includes a total of 2,616 genes organized clinically by affected organ systems and interventions (including preventive measures, disease surveillance, and medical or surgical interventions) that could be reasonably warranted by the identification of pathogenic mutations. To aid independent analysis and optimize new data incorporation, the CGD also includes all genetic conditions for which genetic knowledge may affect the selection of supportive care, informed medical decision-making, prognostic considerations, reproductive decisions, and allow avoidance of unnecessary testing, but for which specific interventions are not otherwise currently available. For each entry, the CGD includes the gene symbol, conditions, allelic conditions, clinical categorization (for both manifestations and interventions), mode of inheritance, affected age group, description of interventions/rationale, links to other complementary databases, including databases of variants and presumed pathogenic mutations, and links to PubMed references (>20,000). The CGD will be regularly maintained and updated to keep pace with scientific discovery. Further content-based expert opinions are actively solicited. Eventually, the CGD may assist the rapid curation of individual genomes as part of active medical care. PMID:23696674

  20. IMGT, the international ImMunoGeneTics database

    PubMed Central

    Ruiz, Manuel; Giudicelli, Véronique; Ginestoux, Chantal; Stoehr, Peter; Robinson, James; Bodmer, Julia; Marsh, Steven G. E.; Bontrop, Ronald; Lemaitre, Marc; Lefranc, Gérard; Chaume, Denys; Lefranc, Marie-Paule

    2000-01-01

    IMGT, the international ImMunoGeneTics database (http://imgt.cines.fr:8104 ), is a high-quality integrated database specialising in Immunoglobulins (Ig), T cell Receptors (TcR) and Major Histocompatibility Complex (MHC) molecules of all vertebrate species, created in 1989 by Marie-Paule Lefranc, Université Montpellier II, CNRS, Montpellier, France (lefranc@ ligm.igh.cnrs.fr ). At present, IMGT includes two databases: IMGT/LIGM-DB, a comprehensive database of Ig and TcR from human and other vertebrates, with translation for fully annotated sequences, and IMGT/HLA-DB, a database of the human MHC referred to as HLA (Human Leucocyte Antigens). The IMGT server provides a common access to expertized genomic, proteomic, structural and polymorphic data of Ig and TcR molecules of all vertebrates. By its high quality and its easy data distribution, IMGT has important implications in medical research (repertoire in autoimmune diseases, AIDS, leukemias, lymphomas), therapeutic approaches (antibody engineering), genome diversity and genome evolution studies. IMGT is freely available at http://imgt.cines.fr:8104 . The IMGT Index is provided at the IMGT Marie-Paule page (http://imgt.cines.fr:8104/textes/IMGTindex.html ). PMID:10592230

  1. DIDA: A curated and annotated digenic diseases database.

    PubMed

    Gazzo, Andrea M; Daneels, Dorien; Cilia, Elisa; Bonduelle, Maryse; Abramowicz, Marc; Van Dooren, Sonia; Smits, Guillaume; Lenaerts, Tom

    2016-01-01

    DIDA (DIgenic diseases DAtabase) is a novel database that provides for the first time detailed information on genes and associated genetic variants involved in digenic diseases, the simplest form of oligogenic inheritance. The database is accessible via http://dida.ibsquare.be and currently includes 213 digenic combinations involved in 44 different digenic diseases. These combinations are composed of 364 distinct variants, which are distributed over 136 distinct genes. The web interface provides browsing and search functionalities, as well as documentation and help pages, general database statistics and references to the original publications from which the data have been collected. The possibility to submit novel digenic data to DIDA is also provided. Creating this new repository was essential as current databases do not allow one to retrieve detailed records regarding digenic combinations. Genes, variants, diseases and digenic combinations in DIDA are annotated with manually curated information and information mined from other online resources. Next to providing a unique resource for the development of new analysis methods, DIDA gives clinical and molecular geneticists a tool to find the most comprehensive information on the digenic nature of their diseases of interest. PMID:26481352

  2. Open TG-GATEs: a large-scale toxicogenomics database

    PubMed Central

    Igarashi, Yoshinobu; Nakatsu, Noriyuki; Yamashita, Tomoya; Ono, Atsushi; Ohno, Yasuo; Urushidani, Tetsuro; Yamada, Hiroshi

    2015-01-01

    Toxicogenomics focuses on assessing the safety of compounds using gene expression profiles. Gene expression signatures from large toxicogenomics databases are expected to perform better than small databases in identifying biomarkers for the prediction and evaluation of drug safety based on a compound's toxicological mechanisms in animal target organs. Over the past 10 years, the Japanese Toxicogenomics Project consortium (TGP) has been developing a large-scale toxicogenomics database consisting of data from 170 compounds (mostly drugs) with the aim of improving and enhancing drug safety assessment. Most of the data generated by the project (e.g. gene expression, pathology, lot number) are freely available to the public via Open TG-GATEs (Toxicogenomics Project-Genomics Assisted Toxicity Evaluation System). Here, we provide a comprehensive overview of the database, including both gene expression data and metadata, with a description of experimental conditions and procedures used to generate the database. Open TG-GATEs is available from http://toxico.nibio.go.jp/english/index.html. PMID:25313160

  3. ARTI refrigerant database

    SciTech Connect

    Calm, J.M.

    1998-08-01

    The Refrigerant Database is an information system on alternative refrigerants, associated lubricants, and their use in air conditioning and refrigeration. It consolidates and facilitates access to property, compatibility, environmental, safety, application and other information. It provides corresponding information on older refrigerants, to assist manufactures and those using alternative refrigerants, to make comparisons and determine differences. The underlying purpose is to accelerate phase out of chemical compounds of environmental concern. The database provides bibliographic citations and abstracts for publications that may be useful in research and design of air-conditioning and refrigeration equipment. The complete documents are not included, though some may be added at a later date. The database identifies sources of specific information on many refrigerants including propane, ammonia, water, carbon dioxide, propylene, ethers, and others as well as azeotropic and zeotropic blends of these fluids. It addresses lubricants including alkylbenzene, polyalkylene glycol, polyolester, and other synthetics as well as mineral oils. It also references documents addressing compatibility of refrigerants and lubricants with metals, plastics, elastomers, motor insulation, and other materials used in refrigerant circuits. Incomplete citations or abstracts are provided for some documents. They are included to accelerate availability of the information and will be completed or replaced in future updates.

  4. The PIR-International Protein Sequence Database.

    PubMed

    Barker, W C; Garavelli, J S; McGarvey, P B; Marzec, C R; Orcutt, B C; Srinivasarao, G Y; Yeh, L S; Ledley, R S; Mewes, H W; Pfeiffer, F; Tsugita, A; Wu, C

    1999-01-01

    The Protein Information Resource (PIR; http://www-nbrf.georgetown. edu/pir/) supports research on molecular evolution, functional genomics, and computational biology by maintaining a comprehensive, non-redundant, well-organized and freely available protein sequence database. Since 1988 the database has been maintained collaboratively by PIR-International, an international association of data collection centers cooperating to develop this resource during a period of explosive growth in new sequence data and new computer technologies. The PIR Protein Sequence Database entries are classified into superfamilies, families and homology domains, for which sequence alignments are available. Full-scale family classification supports comparative genomics research, aids sequence annotation, assists database organization and improves database integrity. The PIR WWW server supports direct on-line sequence similarity searches, information retrieval, and knowledge discovery by providing the Protein Sequence Database and other supplementary databases. Sequence entries are extensively cross-referenced and hypertext-linked to major nucleic acid, literature, genome, structure, sequence alignment and family databases. The weekly release of the Protein Sequence Database can be accessed through the PIR Web site. The quarterly release of the database is freely available from our anonymous FTP server and is also available on CD-ROM with the accompanying ATLAS database search program.

  5. AMYPdb: A database dedicated to amyloid precursor proteins

    PubMed Central

    Pawlicki, Sandrine; Le Béchec, Antony; Delamarche, Christian

    2008-01-01

    Background Misfolding and aggregation of proteins into ordered fibrillar structures is associated with a number of severe pathologies, including Alzheimer's disease, prion diseases, and type II diabetes. The rapid accumulation of knowledge about the sequences and structures of these proteins allows using of in silico methods to investigate the molecular mechanisms of their abnormal conformational changes and assembly. However, such an approach requires the collection of accurate data, which are inconveniently dispersed among several generalist databases. Results We therefore created a free online knowledge database (AMYPdb) dedicated to amyloid precursor proteins and we have performed large scale sequence analysis of the included data. Currently, AMYPdb integrates data on 31 families, including 1,705 proteins from nearly 600 organisms. It displays links to more than 2,300 bibliographic references and 1,200 3D-structures. A Wiki system is available to insert data into the database, providing a sharing and collaboration environment. We generated and analyzed 3,621 amino acid sequence patterns, reporting highly specific patterns for each amyloid family, along with patterns likely to be involved in protein misfolding and aggregation. Conclusion AMYPdb is a comprehensive online database aiming at the centralization of bioinformatic data regarding all amyloid proteins and their precursors. Our sequence pattern discovery and analysis approach unveiled protein regions of significant interest. AMYPdb is freely accessible [1]. PMID:18544157

  6. Rice Annotation Project Database (RAP-DB): an integrative and interactive database for rice genomics.

    PubMed

    Sakai, Hiroaki; Lee, Sung Shin; Tanaka, Tsuyoshi; Numa, Hisataka; Kim, Jungsok; Kawahara, Yoshihiro; Wakimoto, Hironobu; Yang, Ching-chia; Iwamoto, Masao; Abe, Takashi; Yamada, Yuko; Muto, Akira; Inokuchi, Hachiro; Ikemura, Toshimichi; Matsumoto, Takashi; Sasaki, Takuji; Itoh, Takeshi

    2013-02-01

    The Rice Annotation Project Database (RAP-DB, http://rapdb.dna.affrc.go.jp/) has been providing a comprehensive set of gene annotations for the genome sequence of rice, Oryza sativa (japonica group) cv. Nipponbare. Since the first release in 2005, RAP-DB has been updated several times along with the genome assembly updates. Here, we present our newest RAP-DB based on the latest genome assembly, Os-Nipponbare-Reference-IRGSP-1.0 (IRGSP-1.0), which was released in 2011. We detected 37,869 loci by mapping transcript and protein sequences of 150 monocot species. To provide plant researchers with highly reliable and up to date rice gene annotations, we have been incorporating literature-based manually curated data, and 1,626 loci currently incorporate literature-based annotation data, including commonly used gene names or gene symbols. Transcriptional activities are shown at the nucleotide level by mapping RNA-Seq reads derived from 27 samples. We also mapped the Illumina reads of a Japanese leading japonica cultivar, Koshihikari, and a Chinese indica cultivar, Guangluai-4, to the genome and show alignments together with the single nucleotide polymorphisms (SNPs) and gene functional annotations through a newly developed browser, Short-Read Assembly Browser (S-RAB). We have developed two satellite databases, Plant Gene Family Database (PGFD) and Integrative Database of Cereal Gene Phylogeny (IDCGP), which display gene family and homologous gene relationships among diverse plant species. RAP-DB and the satellite databases offer simple and user-friendly web interfaces, enabling plant and genome researchers to access the data easily and facilitating a broad range of plant research topics.

  7. ARTI Refrigerant Database

    SciTech Connect

    Calm, J.M.

    1992-11-09

    The database provides bibliographic citations and abstracts for publications that may be useful in research and design of air- conditioning and refrigeration equipment. The database identifies sources of specific information on R-32, R-123, R-124, R-125, R-134, R-134a, R-141b, R-142b, R-143a, R-152a, R-245ca, R-290 (propane), R- 717 (ammonia), ethers, and others as well as azeotropic and zeotropic and zeotropic blends of these fluids. It addresses lubricants including alkylbenzene, polyalkylene glycol, ester, and other synthetics as well as mineral oils. It also references documents on compatibility of refrigerants and lubricants with metals, plastics, elastomers, motor insulation, and other materials used in refrigerant circuits. A computerized version is available that includes retrieval software.

  8. InterAction Database (IADB)

    Cancer.gov

    The InterAction Database includes demographic and prescription information for more than 500,000 patients in the northern and middle Netherlands and has been integrated with other systems to enhance data collection and analysis.

  9. High Performance Buildings Database

    DOE Data Explorer

    The High Performance Buildings Database is a shared resource for the building industry, a unique central repository of in-depth information and data on high-performance, green building projects across the United States and abroad. The database includes information on the energy use, environmental performance, design process, finances, and other aspects of each project. Members of the design and construction teams are listed, as are sources for additional information. In total, up to twelve screens of detailed information are provided for each project profile. Projects range in size from small single-family homes or tenant fit-outs within buildings to large commercial and institutional buildings and even entire campuses. The database is a data repository as well. A series of Web-based data-entry templates allows anyone to enter information about a building project into the database. Once a project has been submitted, each of the partner organizations can review the entry and choose whether or not to publish that particular project on its own Web site.

  10. LQTS gene LOVD database.

    PubMed

    Zhang, Tao; Moss, Arthur; Cong, Peikuan; Pan, Min; Chang, Bingxi; Zheng, Liangrong; Fang, Quan; Zareba, Wojciech; Robinson, Jennifer; Lin, Changsong; Li, Zhongxiang; Wei, Junfang; Zeng, Qiang; Qi, Ming

    2010-11-01

    The Long QT Syndrome (LQTS) is a group of genetically heterogeneous disorders that predisposes young individuals to ventricular arrhythmias and sudden death. LQTS is mainly caused by mutations in genes encoding subunits of cardiac ion channels (KCNQ1, KCNH2,SCN5A, KCNE1, and KCNE2). Many other genes involved in LQTS have been described recently(KCNJ2, AKAP9, ANK2, CACNA1C, SCNA4B, SNTA1, and CAV3). We created an online database(http://www.genomed.org/LOVD/introduction.html) that provides information on variants in LQTS-associated genes. As of February 2010, the database contains 1738 unique variants in 12 genes. A total of 950 variants are considered pathogenic, 265 are possible pathogenic, 131 are unknown/unclassified, and 292 have no known pathogenicity. In addition to these mutations collected from published literature, we also submitted information on gene variants, including one possible novel pathogenic mutation in the KCNH2 splice site found in ten Chinese families with documented arrhythmias. The remote user is able to search the data and is encouraged to submit new mutations into the database. The LQTS database will become a powerful tool for both researchers and clinicians. PMID:20809527

  11. LQTS gene LOVD database.

    PubMed

    Zhang, Tao; Moss, Arthur; Cong, Peikuan; Pan, Min; Chang, Bingxi; Zheng, Liangrong; Fang, Quan; Zareba, Wojciech; Robinson, Jennifer; Lin, Changsong; Li, Zhongxiang; Wei, Junfang; Zeng, Qiang; Qi, Ming

    2010-11-01

    The Long QT Syndrome (LQTS) is a group of genetically heterogeneous disorders that predisposes young individuals to ventricular arrhythmias and sudden death. LQTS is mainly caused by mutations in genes encoding subunits of cardiac ion channels (KCNQ1, KCNH2,SCN5A, KCNE1, and KCNE2). Many other genes involved in LQTS have been described recently(KCNJ2, AKAP9, ANK2, CACNA1C, SCNA4B, SNTA1, and CAV3). We created an online database(http://www.genomed.org/LOVD/introduction.html) that provides information on variants in LQTS-associated genes. As of February 2010, the database contains 1738 unique variants in 12 genes. A total of 950 variants are considered pathogenic, 265 are possible pathogenic, 131 are unknown/unclassified, and 292 have no known pathogenicity. In addition to these mutations collected from published literature, we also submitted information on gene variants, including one possible novel pathogenic mutation in the KCNH2 splice site found in ten Chinese families with documented arrhythmias. The remote user is able to search the data and is encouraged to submit new mutations into the database. The LQTS database will become a powerful tool for both researchers and clinicians.

  12. Comprehensive Trail Making Test

    ERIC Educational Resources Information Center

    Gray, Rebecca

    2006-01-01

    The Comprehensive Trail Making Test (CTMT) is designed to be used in neuropsychological assessment for the purposes of detecting effects of brain defects and deficits and in tracking progress in rehabilitation. More specific purposes include the detection of frontal lobe deficits, problems with psychomotor speed, visual search and sequencing,…

  13. Comprehensive stormwater management study

    SciTech Connect

    Morrison, T. ); Alter, M. ); Wassum, R.H. )

    1994-02-01

    This article examines Tucson, Arizona's approach to stormwater management. The topics of the article include the quantity and quality of stormwater, developing the stormwater master plan, meeting environmental and regulatory constraints. Tucson's comprehensive, watershed by watershed approach to public works planning and stormwater program development is described.

  14. Designing a Comprehensive Curriculum

    ERIC Educational Resources Information Center

    Faulkner, T. L.

    1970-01-01

    A comprehensive rural "agribusiness industry" curriculum might include: (1) The World of Work (Grade 7 or 8), (2) Vocational Orientation (Grade 9), (3) Basic Agriculture and Industry (Grade 10), (4) Specialized Agribusiness Industry (Grade 11), and (5) Advanced Agribusiness Industry (Grade 12). (DM)

  15. Math Sense: Comprehensive Review.

    ERIC Educational Resources Information Center

    Hoyt, Cathy Fillmore

    This book features a comprehensive review of the Math Sense series and is designed to help students gain the range of math skills they need to succeed in life, work, and on standardized tests; overcome math anxiety; discover math as interesting and purposeful; and develop good number sense. Topics covered in this book include whole numbers;…

  16. An Index to PGE-Ni-Cr Deposits and Occurrences in Selected Mineral-Occurrence Databases

    USGS Publications Warehouse

    Causey, J. Douglas; Galloway, John P.; Zientek, Michael L.

    2009-01-01

    Databases of mineral deposits and occurrences are essential to conducting assessments of undiscovered mineral resources. In the USGS's (U.S. Geological Survey) global assessment of undiscovered resources of copper, potash, and the platinum-group elements (PGE), only a few mineral deposit types will be evaluated. For example, only porphyry-copper and sediment-hosted copper deposits will be considered for the copper assessment. To support the global assessment, the USGS prepared comprehensive compilations of the occurrences of these two deposit types in order to develop grade and tonnage models and delineate permissive areas for undiscovered deposits of those types. This publication identifies previously published databases and database records that describe PGE, nickel, and chromium deposits and occurrences. Nickel and chromium were included in this overview because of the close association of PGE with nickel and chromium mineralization. Users of this database will need to refer to the original databases for detailed information about the deposits and occurrences. This information will be used to develop a current and comprehensive global database of PGE deposits and occurrences.

  17. The UCSC Genome Browser database: 2015 update.

    PubMed

    Rosenbloom, Kate R; Armstrong, Joel; Barber, Galt P; Casper, Jonathan; Clawson, Hiram; Diekhans, Mark; Dreszer, Timothy R; Fujita, Pauline A; Guruvadoo, Luvina; Haeussler, Maximilian; Harte, Rachel A; Heitner, Steve; Hickey, Glenn; Hinrichs, Angie S; Hubley, Robert; Karolchik, Donna; Learned, Katrina; Lee, Brian T; Li, Chin H; Miga, Karen H; Nguyen, Ngan; Paten, Benedict; Raney, Brian J; Smit, Arian F A; Speir, Matthew L; Zweig, Ann S; Haussler, David; Kuhn, Robert M; Kent, W James

    2015-01-01

    Launched in 2001 to showcase the draft human genome assembly, the UCSC Genome Browser database (http://genome.ucsc.edu) and associated tools continue to grow, providing a comprehensive resource of genome assemblies and annotations to scientists and students worldwide. Highlights of the past year include the release of a browser for the first new human genome reference assembly in 4 years in December 2013 (GRCh38, UCSC hg38), a watershed comparative genomics annotation (100-species multiple alignment and conservation) and a novel distribution mechanism for the browser (GBiB: Genome Browser in a Box). We created browsers for new species (Chinese hamster, elephant shark, minke whale), 'mined the web' for DNA sequences and expanded the browser display with stacked color graphs and region highlighting. As our user community increasingly adopts the UCSC track hub and assembly hub representations for sharing large-scale genomic annotation data sets and genome sequencing projects, our menu of public data hubs has tripled.

  18. ARTI refrigerant database

    SciTech Connect

    Calm, J.M.

    1996-04-15

    The Refrigerant Database is an information system on alternative refrigerants, associated lubricants, and their use in air conditioning and refrigeration. It consolidates and facilitates access to property, compatibility, environmental, safety, application and other information. It provides corresponding information on older refrigerants, to assist manufacturers and those using alternative refrigerants, to make comparisons and determine differences. The underlying purpose is to accelerate phase out of chemical compounds of environmental concern. The database provides bibliographic citations and abstracts for publications that may be useful in research and design of air-conditioning and refrigeration equipment. The complete documents are not included, though some may be added at a later date. The database identifies sources of specific information on refrigerants. It addresses lubricants including alkylbenzene, polyalkylene glycol, polyolester, and other synthetics as well as mineral oils. It also references documents addressing compatibility of refrigerants and lubricants with metals, plastics, elastomers, motor insulation, and other materials used in refrigerant circuits. Incomplete citations or abstracts are provided for some documents. They are included to accelerate availability of the information and will be completed or replaced in future updates. Citations in this report are divided into the following topics: thermophysical properties; materials compatibility; lubricants and tribology; application data; safety; test and analysis methods; impacts; regulatory actions; substitute refrigerants; identification; absorption and adsorption; research programs; and miscellaneous documents. Information is also presented on ordering instructions for the computerized version.

  19. ARTI refrigerant database

    SciTech Connect

    Calm, J.M.

    1997-02-01

    The Refrigerant Database is an information system on alternative refrigerants, associated lubricants, and their use in air conditioning and refrigeration. It consolidates and facilitates access to property, compatibility, environmental, safety, application and other information. It provides corresponding information on older refrigerants, to assist manufacturers and those using alterative refrigerants, to make comparisons and determine differences. The underlying purpose is to accelerate phase out of chemical compounds of environmental concern. The database provides bibliographic citations and abstracts for publications that may be useful in research and design of air-conditioning and refrigeration equipment. The complete documents are not included, though some may be added at a later date. The database identifies sources of specific information on various refrigerants. It addresses lubricants including alkylbenzene, polyalkylene glycol, polyolester, and other synthetics as well as mineral oils. It also references documents addressing compatibility of refrigerants and lubricants with metals, plastics, elastomers, motor insulation, and other materials used in refrigerant circuits. Incomplete citations or abstracts are provided for some documents. They are included to accelerate availability of the information and will be completed or replaced in future updates.

  20. Nucleic Acids Research annual Database Issue and the NAR online Molecular Biology Database Collection in 2009

    PubMed Central

    Galperin, Michael Y.; Cochrane, Guy R.

    2009-01-01

    The current issue of Nucleic Acids Research includes descriptions of 179 databases, of which 95 are new. These databases (along with several molecular biology databases described in other journals) have been included in the Nucleic Acids Research online Molecular Biology Database Collection, bringing the total number of databases in the collection to 1170. In this introductory comment, we briefly describe some of these new databases and review the principles guiding the selection of databases for inclusion in the Nucleic Acids Research annual Database Issue and the Nucleic Acids Research online Molecular Biology Database Collection. The complete database list and summaries are available online at the Nucleic Acids Research web site (http://nar.oxfordjournals.org/). PMID:19033364

  1. Environmental Sustainability - Including Land and Water Use

    EPA Science Inventory

    Assessments of environmental sustainability can be conducted in many ways with one of the most quantitative methods including Life Cycle Impact Assessment (LCIA). While historically LCIA has included a comprehensive list of impact categories including: ozone depletion, global c...

  2. Validated MicroRNA Target Databases: An Evaluation.

    PubMed

    Lee, Yun Ji Diana; Kim, Veronica; Muth, Dillon C; Witwer, Kenneth W

    2015-11-01

    Preclinical Research Positive findings from preclinical and clinical studies involving depletion or supplementation of microRNA (miRNA) engender optimism about miRNA-based therapeutics. However, off-target effects must be considered. Predicting these effects is complicated. Each miRNA may target many gene transcripts, and the rules governing imperfectly complementary miRNA: target interactions are incompletely understood. Several databases provide lists of the relatively small number of experimentally confirmed miRNA: target pairs. Although incomplete, this information might allow assessment of at least some of the off-target effects. We evaluated the performance of four databases of experimentally validated miRNA: target interactions (miRWalk 2.0, miRTarBase, miRecords, and TarBase 7.0) using a list of 50 alphabetically consecutive genes. We examined the provided citations to determine the degree to which each interaction was experimentally supported. To assess stability, we tested at the beginning and end of a five-month period. Results varied widely by database. Two of the databases changed significantly over the course of 5 months. Most reported evidence for miRNA: target interactions were indirect or otherwise weak, and relatively few interactions were supported by more than one publication. Some returned results appear to arise from simplistic text searches that offer no insight into the relationship of the search terms, may not even include the reported gene or miRNA, and may thus, be invalid. We conclude that validation databases provide important information, but not all information in all extant databases is up-to-date or accurate. Nevertheless, the more comprehensive validation databases may provide useful starting points for investigation of off-target effects of proposed small RNA therapies.

  3. YCRD: Yeast Combinatorial Regulation Database

    PubMed Central

    Wu, Wei-Sheng; Hsieh, Yen-Chen; Lai, Fu-Jou

    2016-01-01

    In eukaryotes, the precise transcriptional control of gene expression is typically achieved through combinatorial regulation using cooperative transcription factors (TFs). Therefore, a database which provides regulatory associations between cooperative TFs and their target genes is helpful for biologists to study the molecular mechanisms of transcriptional regulation of gene expression. Because there is no such kind of databases in the public domain, this prompts us to construct a database, called Yeast Combinatorial Regulation Database (YCRD), which deposits 434,197 regulatory associations between 2535 cooperative TF pairs and 6243 genes. The comprehensive collection of more than 2500 cooperative TF pairs was retrieved from 17 existing algorithms in the literature. The target genes of a cooperative TF pair (e.g. TF1-TF2) are defined as the common target genes of TF1 and TF2, where a TF’s experimentally validated target genes were downloaded from YEASTRACT database. In YCRD, users can (i) search the target genes of a cooperative TF pair of interest, (ii) search the cooperative TF pairs which regulate a gene of interest and (iii) identify important cooperative TF pairs which regulate a given set of genes. We believe that YCRD will be a valuable resource for yeast biologists to study combinatorial regulation of gene expression. YCRD is available at http://cosbi.ee.ncku.edu.tw/YCRD/ or http://cosbi2.ee.ncku.edu.tw/YCRD/. PMID:27392072

  4. YCRD: Yeast Combinatorial Regulation Database.

    PubMed

    Wu, Wei-Sheng; Hsieh, Yen-Chen; Lai, Fu-Jou

    2016-01-01

    In eukaryotes, the precise transcriptional control of gene expression is typically achieved through combinatorial regulation using cooperative transcription factors (TFs). Therefore, a database which provides regulatory associations between cooperative TFs and their target genes is helpful for biologists to study the molecular mechanisms of transcriptional regulation of gene expression. Because there is no such kind of databases in the public domain, this prompts us to construct a database, called Yeast Combinatorial Regulation Database (YCRD), which deposits 434,197 regulatory associations between 2535 cooperative TF pairs and 6243 genes. The comprehensive collection of more than 2500 cooperative TF pairs was retrieved from 17 existing algorithms in the literature. The target genes of a cooperative TF pair (e.g. TF1-TF2) are defined as the common target genes of TF1 and TF2, where a TF's experimentally validated target genes were downloaded from YEASTRACT database. In YCRD, users can (i) search the target genes of a cooperative TF pair of interest, (ii) search the cooperative TF pairs which regulate a gene of interest and (iii) identify important cooperative TF pairs which regulate a given set of genes. We believe that YCRD will be a valuable resource for yeast biologists to study combinatorial regulation of gene expression. YCRD is available at http://cosbi.ee.ncku.edu.tw/YCRD/ or http://cosbi2.ee.ncku.edu.tw/YCRD/. PMID:27392072

  5. The AMMA database

    NASA Astrophysics Data System (ADS)

    Boichard, Jean-Luc; Brissebrat, Guillaume; Cloche, Sophie; Eymard, Laurence; Fleury, Laurence; Mastrorillo, Laurence; Moulaye, Oumarou; Ramage, Karim

    2010-05-01

    The AMMA project includes aircraft, ground-based and ocean measurements, an intensive use of satellite data and diverse modelling studies. Therefore, the AMMA database aims at storing a great amount and a large variety of data, and at providing the data as rapidly and safely as possible to the AMMA research community. In order to stimulate the exchange of information and collaboration between researchers from different disciplines or using different tools, the database provides a detailed description of the products and uses standardized formats. The AMMA database contains: - AMMA field campaigns datasets; - historical data in West Africa from 1850 (operational networks and previous scientific programs); - satellite products from past and future satellites, (re-)mapped on a regular latitude/longitude grid and stored in NetCDF format (CF Convention); - model outputs from atmosphere or ocean operational (re-)analysis and forecasts, and from research simulations. The outputs are processed as the satellite products are. Before accessing the data, any user has to sign the AMMA data and publication policy. This chart only covers the use of data in the framework of scientific objectives and categorically excludes the redistribution of data to third parties and the usage for commercial applications. Some collaboration between data producers and users, and the mention of the AMMA project in any publication is also required. The AMMA database and the associated on-line tools have been fully developed and are managed by two teams in France (IPSL Database Centre, Paris and OMP, Toulouse). Users can access data of both data centres using an unique web portal. This website is composed of different modules : - Registration: forms to register, read and sign the data use chart when an user visits for the first time - Data access interface: friendly tool allowing to build a data extraction request by selecting various criteria like location, time, parameters... The request can

  6. The Histone Database: an integrated resource for histones and histone fold-containing proteins.

    PubMed

    Mariño-Ramírez, Leonardo; Levine, Kevin M; Morales, Mario; Zhang, Suiyuan; Moreland, R Travis; Baxevanis, Andreas D; Landsman, David

    2011-01-01

    Eukaryotic chromatin is composed of DNA and protein components-core histones-that act to compactly pack the DNA into nucleosomes, the fundamental building blocks of chromatin. These nucleosomes are connected to adjacent nucleosomes by linker histones. Nucleosomes are highly dynamic and, through various core histone post-translational modifications and incorporation of diverse histone variants, can serve as epigenetic marks to control processes such as gene expression and recombination. The Histone Sequence Database is a curated collection of sequences and structures of histones and non-histone proteins containing histone folds, assembled from major public databases. Here, we report a substantial increase in the number of sequences and taxonomic coverage for histone and histone fold-containing proteins available in the database. Additionally, the database now contains an expanded dataset that includes archaeal histone sequences. The database also provides comprehensive multiple sequence alignments for each of the four core histones (H2A, H2B, H3 and H4), the linker histones (H1/H5) and the archaeal histones. The database also includes current information on solved histone fold-containing structures. The Histone Sequence Database is an inclusive resource for the analysis of chromatin structure and function focused on histones and histone fold-containing proteins.

  7. Energy science and technology database (on the internet). Online data

    SciTech Connect

    1997-12-01

    The Energy Science and Technology Database (EDB) is a multidisciplinary file containing worldwide references to basic and applied scientific and technical research literature. The information is collected for use by government managers, researchers at the national laboratories, and other research efforts sponsored by the U.S. Department of Energy, and the results of this research are transferred to the public. Abstracts are included for records from 1976 to the present. The EDB also contains the Nuclear Science Abstracts which is a comprehensive abstract and index collection to the international nuclear science and technology literature for the period 1948 through 1976. Included are scientific and technical reports of the U.S. Atomic Energy Commission, U.S. Energy Research and Development Administration and its contractors, other agencies, universities, and industrial and research organizations. Approximately 25% of the records in the file contain abstracts. Nuclear Science Abstracts contains over 900,000 bibliographic records. The entire Energy Science and Technology Database contains over 3 million bibliographic records. This database is now available for searching through the GOV. Research-Center (GRC) service. GRC is a single online web-based search service to well known Government databases. Featuring powerful search and retrieval software, GRC is an important research tool. The GRC web site is at http://grc.ntis.gov.

  8. Windshear database for forward-looking systems certification

    NASA Technical Reports Server (NTRS)

    Switzer, G. F.; Proctor, F. H.; Hinton, D. A.; Aanstoos, J. V.

    1993-01-01

    This document contains a description of a comprehensive database that is to be used for certification testing of airborne forward-look windshear detection systems. The database was developed by NASA Langley Research Center, at the request of the Federal Aviation Administration (FAA), to support the industry initiative to certify and produce forward-look windshear detection equipment. The database contains high resolution, three dimensional fields for meteorological variables that may be sensed by forward-looking systems. The database is made up of seven case studies which have been generated by the Terminal Area Simulation System, a state-of-the-art numerical system for the realistic modeling of windshear phenomena. The selected cases represent a wide spectrum of windshear events. General descriptions and figures from each of the case studies are included, as well as equations for F-factor, radar-reflectivity factor, and rainfall rate. The document also describes scenarios and paths through the data sets, jointly developed by NASA and the FAA, to meet FAA certification testing objectives. Instructions for reading and verifying the data from tape are included.

  9. Establishing a database of radionuclide transfer parameters for freshwater wildlife.

    PubMed

    Yankovich, T; Beresford, N A; Fesenko, S; Fesenko, J; Phaneuf, M; Dagher, E; Outola, I; Andersson, P; Thiessen, K; Ryan, J; Wood, M D; Bollhöfer, A; Barnett, C L; Copplestone, D

    2013-12-01

    Environmental assessments to evaluate potentials risks to humans and wildlife often involve modelling to predict contaminant exposure through key pathways. Such models require input of parameter values, including concentration ratios, to estimate contaminant concentrations in biota based on measurements or estimates of concentrations in environmental media, such as water. Due to the diversity of species and the range in physicochemical conditions in natural ecosystems, concentration ratios can vary by orders of magnitude, even within similar species. Therefore, to improve model input parameter values for application in aquatic systems, freshwater concentration ratios were collated or calculated from national grey literature, Russian language publications, and refereed papers. Collated data were then input into an international database that is being established by the International Atomic Energy Agency. The freshwater database enables entry of information for all radionuclides listed in ICRP (1983), in addition to the corresponding stable elements, and comprises a total of more than 16,500 concentration ratio (CRwo-water) values. Although data were available for all broad wildlife groups (with the exception of birds), data were sparse for many organism types. For example, zooplankton, crustaceans, insects and insect larvae, amphibians, and mammals, for which there were CRwo-water values for less than eight elements. Coverage was most comprehensive for fish, vascular plants, and molluscs. To our knowledge, the freshwater database that has now been established represents the most comprehensive set of CRwo-water values for freshwater species currently available for use in radiological environmental assessments. PMID:23103210

  10. Annual Review of Database Developments 1991.

    ERIC Educational Resources Information Center

    Basch, Reva

    1991-01-01

    Review of developments in databases highlights a new emphasis on accessibility. Topics discussed include the internationalization of databases; databases that deal with finance, drugs, and toxic waste; access to public records, both personal and corporate; media online; reducing large files of data to smaller, more manageable files; and…

  11. ARTI Refrigerant Database

    SciTech Connect

    Cain, J.M. , Great Falls, VA )

    1993-04-30

    The Refrigerant Database consolidates and facilitates access to information to assist industry in developing equipment using alternative refrigerants. The underlying purpose is to accelerate phase out of chemical compounds of environmental concern. The database provides bibliographic citations and abstracts for publications that may be useful in research and design of air-conditioning and refrigeration equipment. The complete documents are not included. The database identifies sources of specific information on R-32, R-123, R-124, R-125, R-134, R-134a, R-141b, R-142b, R-143a, R-152a, R-245ca, R-290 (propane), R-717 (ammonia), ethers, and others as well as azeotropic and zeotropic blends of these fluids. It addresses lubricants including alkylbenzene, polyalkylene glycol, ester, and other synthetics as well as mineral oils. It also references documents addressing compatibility of refrigerants and lubricants with metals, plastics, elastomers, motor insulation, and other materials used in refrigerant circuits. Incomplete citations or abstracts are provided for some documents to accelerate availability of the information and will be completed or replaced in future updates.

  12. Tools and databases of the KOMICS web portal for preprocessing, mining, and dissemination of metabolomics data.

    PubMed

    Sakurai, Nozomu; Ara, Takeshi; Enomoto, Mitsuo; Motegi, Takeshi; Morishita, Yoshihiko; Kurabayashi, Atsushi; Iijima, Yoko; Ogata, Yoshiyuki; Nakajima, Daisuke; Suzuki, Hideyuki; Shibata, Daisuke

    2014-01-01

    A metabolome--the collection of comprehensive quantitative data on metabolites in an organism--has been increasingly utilized for applications such as data-intensive systems biology, disease diagnostics, biomarker discovery, and assessment of food quality. A considerable number of tools and databases have been developed to date for the analysis of data generated by various combinations of chromatography and mass spectrometry. We report here a web portal named KOMICS (The Kazusa Metabolomics Portal), where the tools and databases that we developed are available for free to academic users. KOMICS includes the tools and databases for preprocessing, mining, visualization, and publication of metabolomics data. Improvements in the annotation of unknown metabolites and dissemination of comprehensive metabolomic data are the primary aims behind the development of this portal. For this purpose, PowerGet and FragmentAlign include a manual curation function for the results of metabolite feature alignments. A metadata-specific wiki-based database, Metabolonote, functions as a hub of web resources related to the submitters' work. This feature is expected to increase citation of the submitters' work, thereby promoting data publication. As an example of the practical use of KOMICS, a workflow for a study on Jatropha curcas is presented. The tools and databases available at KOMICS should contribute to enhanced production, interpretation, and utilization of metabolomic Big Data. PMID:24949426

  13. Tools and databases of the KOMICS web portal for preprocessing, mining, and dissemination of metabolomics data.

    PubMed

    Sakurai, Nozomu; Ara, Takeshi; Enomoto, Mitsuo; Motegi, Takeshi; Morishita, Yoshihiko; Kurabayashi, Atsushi; Iijima, Yoko; Ogata, Yoshiyuki; Nakajima, Daisuke; Suzuki, Hideyuki; Shibata, Daisuke

    2014-01-01

    A metabolome--the collection of comprehensive quantitative data on metabolites in an organism--has been increasingly utilized for applications such as data-intensive systems biology, disease diagnostics, biomarker discovery, and assessment of food quality. A considerable number of tools and databases have been developed to date for the analysis of data generated by various combinations of chromatography and mass spectrometry. We report here a web portal named KOMICS (The Kazusa Metabolomics Portal), where the tools and databases that we developed are available for free to academic users. KOMICS includes the tools and databases for preprocessing, mining, visualization, and publication of metabolomics data. Improvements in the annotation of unknown metabolites and dissemination of comprehensive metabolomic data are the primary aims behind the development of this portal. For this purpose, PowerGet and FragmentAlign include a manual curation function for the results of metabolite feature alignments. A metadata-specific wiki-based database, Metabolonote, functions as a hub of web resources related to the submitters' work. This feature is expected to increase citation of the submitters' work, thereby promoting data publication. As an example of the practical use of KOMICS, a workflow for a study on Jatropha curcas is presented. The tools and databases available at KOMICS should contribute to enhanced production, interpretation, and utilization of metabolomic Big Data.

  14. The apoptosis database.

    PubMed

    Doctor, K S; Reed, J C; Godzik, A; Bourne, P E

    2003-06-01

    The apoptosis database is a public resource for researchers and students interested in the molecular biology of apoptosis. The resource provides functional annotation, literature references, diagrams/images, and alternative nomenclatures on a set of proteins having 'apoptotic domains'. These are the distinctive domains that are often, if not exclusively, found in proteins involved in apoptosis. The initial choice of proteins to be included is defined by apoptosis experts and bioinformatics tools. Users can browse through the web accessible lists of domains, proteins containing these domains and their associated homologs. The database can also be searched by sequence homology using basic local alignment search tool, text word matches of the annotation, and identifiers for specific records. The resource is available at http://www.apoptosis-db.org and is updated on a regular basis.

  15. Comprehension Instruction: Perspectives and Suggestions.

    ERIC Educational Resources Information Center

    Duffy, Gerald G., Ed.; And Others

    Prepared by members of the Institute for Research on Teaching and the Center for the Study of Reading, the papers in this collection examine reading comprehension as an interactive process and explore how comprehension instruction occurs in the complex workplace of the classroom. Specific topics discussed in 20 papers include the following: (1)…

  16. REBASE--a database for DNA restriction and modification: enzymes, genes and genomes.

    PubMed

    Roberts, Richard J; Vincze, Tamas; Posfai, Janos; Macelis, Dana

    2010-01-01

    REBASE is a comprehensive database of information about restriction enzymes, DNA methyltransferases and related proteins involved in the biological process of restriction-modification (R-M). It contains fully referenced information about recognition and cleavage sites, isoschizomers, neoschizomers, commercial availability, methylation sensitivity, crystal and sequence data. Experimentally characterized homing endonucleases are also included. The fastest growing segment of REBASE contains the putative R-M systems found in the sequence databases. Comprehensive descriptions of the R-M content of all fully sequenced genomes are available including summary schematics. The contents of REBASE may be browsed from the web (http://rebase.neb.com) and selected compilations can be downloaded by ftp (ftp.neb.com). Additionally, monthly updates can be requested via email.

  17. SABIO-RK--database for biochemical reaction kinetics.

    PubMed

    Wittig, Ulrike; Kania, Renate; Golebiewski, Martin; Rey, Maja; Shi, Lei; Jong, Lenneke; Algaa, Enkhjargal; Weidemann, Andreas; Sauer-Danzwith, Heidrun; Mir, Saqib; Krebs, Olga; Bittkowski, Meik; Wetsch, Elina; Rojas, Isabel; Müller, Wolfgang

    2012-01-01

    SABIO-RK (http://sabio.h-its.org/) is a web-accessible database storing comprehensive information about biochemical reactions and their kinetic properties. SABIO-RK offers standardized data manually extracted from the literature and data directly submitted from lab experiments. The database content includes kinetic parameters in relation to biochemical reactions and their biological sources with no restriction on any particular set of organisms. Additionally, kinetic rate laws and corresponding equations as well as experimental conditions are represented. All the data are manually curated and annotated by biological experts, supported by automated consistency checks. SABIO-RK can be accessed via web-based user interfaces or automatically via web services that allow direct data access by other tools. Both interfaces support the export of the data together with its annotations in SBML (Systems Biology Markup Language), e.g. for import in modelling tools.

  18. New geothermal database for Utah

    USGS Publications Warehouse

    Blackett, Robert E.; ,

    1993-01-01

    The Utah Geological Survey complied a preliminary database consisting of over 800 records on thermal wells and springs in Utah with temperatures of 20??C or greater. Each record consists of 35 fields, including location of the well or spring, temperature, depth, flow-rate, and chemical analyses of water samples. Developed for applications on personal computers, the database will be useful for geochemical, statistical, and other geothermal related studies. A preliminary map of thermal wells and springs in Utah, which accompanies the database, could eventually incorporate heat-flow information, bottom-hole temperatures from oil and gas wells, traces of Quaternary faults, and locations of young volcanic centers.

  19. The GLIMS Glacier Database

    NASA Astrophysics Data System (ADS)

    Raup, B. H.; Khalsa, S. S.; Armstrong, R.

    2007-12-01

    Info, GML (Geography Markup Language) and GMT (Generic Mapping Tools). This "clip-and-ship" function allows users to download only the data they are interested in. Our flexible web interfaces to the database, which includes various support layers (e.g. a layer to help collaborators identify satellite imagery over their region of expertise) will facilitate enhanced analysis to be undertaken on glacier systems, their distribution, and their impacts on other Earth systems.

  20. Open geochemical database

    NASA Astrophysics Data System (ADS)

    Zhilin, Denis; Ilyin, Vladimir; Bashev, Anton

    2010-05-01

    We regard "geochemical data" as data on chemical parameters of the environment, linked with the geographical position of the corresponding point. Boosting development of global positioning system (GPS) and measuring instruments allows fast collecting of huge amounts of geochemical data. Presently they are published in scientific journals in text format, that hampers searching for information about particular places and meta-analysis of the data, collected by different researchers. Part of the information is never published. To make the data available and easy to find, it seems reasonable to elaborate an open database of geochemical information, accessible via Internet. It also seems reasonable to link the data with maps or space images, for example, from GoogleEarth service. For this purpose an open geochemical database is being elaborating (http://maps.sch192.ru). Any user after registration can upload geochemical data (position, type of parameter and value of the parameter) and edit them. Every user (including unregistered) can (a) extract the values of parameters, fulfilling desired conditions and (b) see the points, linked to GoogleEarth space image, colored according to a value of selected parameter. Then he can treat extracted values any way he likes. There are the following data types in the database: authors, points, seasons and parameters. Author is a person, who publishes the data. Every author can declare his own profile. A point is characterized by its geographical position and type of the object (i.e. river, lake etc). Value of parameters are linked to a point, an author and a season, when they were obtained. A user can choose a parameter to place on GoogleEarth space image and a scale to color the points on the image according to the value of a parameter. Currently (December, 2009) the database is under construction, but several functions (uploading data on pH and electrical conductivity and placing colored points onto GoogleEarth space image) are

  1. DASHR: database of small human noncoding RNAs.

    PubMed

    Leung, Yuk Yee; Kuksa, Pavel P; Amlie-Wolf, Alexandre; Valladares, Otto; Ungar, Lyle H; Kannan, Sampath; Gregory, Brian D; Wang, Li-San

    2016-01-01

    Small non-coding RNAs (sncRNAs) are highly abundant RNAs, typically <100 nucleotides long, that act as key regulators of diverse cellular processes. Although thousands of sncRNA genes are known to exist in the human genome, no single database provides searchable, unified annotation, and expression information for full sncRNA transcripts and mature RNA products derived from these larger RNAs. Here, we present the Database of small human noncoding RNAs (DASHR). DASHR contains the most comprehensive information to date on human sncRNA genes and mature sncRNA products. DASHR provides a simple user interface for researchers to view sequence and secondary structure, compare expression levels, and evidence of specific processing across all sncRNA genes and mature sncRNA products in various human tissues. DASHR annotation and expression data covers all major classes of sncRNAs including microRNAs (miRNAs), Piwi-interacting (piRNAs), small nuclear, nucleolar, cytoplasmic (sn-, sno-, scRNAs, respectively), transfer (tRNAs), and ribosomal RNAs (rRNAs). Currently, DASHR (v1.0) integrates 187 smRNA high-throughput sequencing (smRNA-seq) datasets with over 2.5 billion reads and annotation data from multiple public sources. DASHR contains annotations for ∼ 48,000 human sncRNA genes and mature sncRNA products, 82% of which are expressed in one or more of the curated tissues. DASHR is available at http://lisanwanglab.org/DASHR.

  2. MGFD: the maize gene families database

    PubMed Central

    Sheng, Lei; Jiang, Haiyang; Yan, Hanwei; Li, Xiaoyu; Lin, Yongxiang; Ye, Hui; Cheng, Beijiu

    2016-01-01

    Most gene families are transcription factor (TF) families, which have fundamental roles in almost all biological processes (development, growth and response to environmental factors) and have been employed to manipulate various types of metabolic, developmental and stress response pathways in plants. Maize (Zea mays) is one of the most important cereal crops in the world due its importance to human nutrition and health. Thus, identifying and annotating all the gene families in maize is an important primary step in defining their functions and understanding their roles in the regulation of diverse biological processes. In this study, we identified 96 predicted maize gene families and systematically characterized all 5826 of the genes in those families. We have also developed a comprehensive database of maize gene families (the MGFD). To further explore the functions of these gene families, we extensively annotated the genes, including such basic information as protein sequence features, gene structure, Gene Ontology classifications, phylogenetic relationships and expression profiles. The MGFD has a user-friendly web interface with multiple browse and search functions, as well as data downloading. The MGFD is freely available to users at http://mgfd.ahau.edu.cn/. Database URL: http://mgfd.ahau.edu.cn/ PMID:26896848

  3. The Dfam database of repetitive DNA families

    PubMed Central

    Hubley, Robert; Finn, Robert D.; Clements, Jody; Eddy, Sean R.; Jones, Thomas A.; Bao, Weidong; Smit, Arian F.A.; Wheeler, Travis J.

    2016-01-01

    Repetitive DNA, especially that due to transposable elements (TEs), makes up a large fraction of many genomes. Dfam is an open access database of families of repetitive DNA elements, in which each family is represented by a multiple sequence alignment and a profile hidden Markov model (HMM). The initial release of Dfam, featured in the 2013 NAR Database Issue, contained 1143 families of repetitive elements found in humans, and was used to produce more than 100 Mb of additional annotation of TE-derived regions in the human genome, with improved speed. Here, we describe recent advances, most notably expansion to 4150 total families including a comprehensive set of known repeat families from four new organisms (mouse, zebrafish, fly and nematode). We describe improvements to coverage, and to our methods for identifying and reducing false annotation. We also describe updates to the website interface. The Dfam website has moved to http://dfam.org. Seed alignments, profile HMMs, hit lists and other underlying data are available for download. PMID:26612867

  4. The Dfam database of repetitive DNA families.

    PubMed

    Hubley, Robert; Finn, Robert D; Clements, Jody; Eddy, Sean R; Jones, Thomas A; Bao, Weidong; Smit, Arian F A; Wheeler, Travis J

    2016-01-01

    Repetitive DNA, especially that due to transposable elements (TEs), makes up a large fraction of many genomes. Dfam is an open access database of families of repetitive DNA elements, in which each family is represented by a multiple sequence alignment and a profile hidden Markov model (HMM). The initial release of Dfam, featured in the 2013 NAR Database Issue, contained 1143 families of repetitive elements found in humans, and was used to produce more than 100 Mb of additional annotation of TE-derived regions in the human genome, with improved speed. Here, we describe recent advances, most notably expansion to 4150 total families including a comprehensive set of known repeat families from four new organisms (mouse, zebrafish, fly and nematode). We describe improvements to coverage, and to our methods for identifying and reducing false annotation. We also describe updates to the website interface. The Dfam website has moved to http://dfam.org. Seed alignments, profile HMMs, hit lists and other underlying data are available for download.

  5. The Dfam database of repetitive DNA families.

    PubMed

    Hubley, Robert; Finn, Robert D; Clements, Jody; Eddy, Sean R; Jones, Thomas A; Bao, Weidong; Smit, Arian F A; Wheeler, Travis J

    2016-01-01

    Repetitive DNA, especially that due to transposable elements (TEs), makes up a large fraction of many genomes. Dfam is an open access database of families of repetitive DNA elements, in which each family is represented by a multiple sequence alignment and a profile hidden Markov model (HMM). The initial release of Dfam, featured in the 2013 NAR Database Issue, contained 1143 families of repetitive elements found in humans, and was used to produce more than 100 Mb of additional annotation of TE-derived regions in the human genome, with improved speed. Here, we describe recent advances, most notably expansion to 4150 total families including a comprehensive set of known repeat families from four new organisms (mouse, zebrafish, fly and nematode). We describe improvements to coverage, and to our methods for identifying and reducing false annotation. We also describe updates to the website interface. The Dfam website has moved to http://dfam.org. Seed alignments, profile HMMs, hit lists and other underlying data are available for download. PMID:26612867

  6. Semantic description of signal and image databases

    NASA Astrophysics Data System (ADS)

    Contreras, Pedro; Kokuer, Munevver; Louys, Mireille; Murtagh, Fionn D.

    2003-03-01

    Information navigation and search on the part of a user requires thorough description of the information content of signal and image datasets and archives. Large signal and image databases need comprehensive metadata to facilitate user access. There is no unique way to describe the semantics of images and signals. Therefore a conceptual model serves as an initial platform. From the conceptual model, a database design can be derived, or a definition of metadata. The different steps from model to description can benefit from tools such as the Unified Modeling Language (UML) for the conceptual model, standard Entity/Relationship (ER) models for database design, and eXtensible Markup Language (XML) for metadata description. As examples of the process of conceptual design and semantic description, we consider the case of a signal database, and the case of astronomical image databases.

  7. Lessons Learned From Developing Reactor Pressure Vessel Steel Embrittlement Database

    SciTech Connect

    Wang, Jy-An John

    2010-08-01

    Materials behaviors caused by neutron irradiation under fission and/or fusion environments can be little understood without practical examination. Easily accessible material information system with large material database using effective computers is necessary for design of nuclear materials and analyses or simulations of the phenomena. The developed Embrittlement Data Base (EDB) at ORNL is this comprehensive collection of data. EDB database contains power reactor pressure vessel surveillance data, the material test reactor data, foreign reactor data (through bilateral agreements authorized by NRC), and the fracture toughness data. The lessons learned from building EDB program and the associated database management activity regarding Material Database Design Methodology, Architecture and the Embedded QA Protocol are described in this report. The development of IAEA International Database on Reactor Pressure Vessel Materials (IDRPVM) and the comparison of EDB database and IAEA IDRPVM database are provided in the report. The recommended database QA protocol and database infrastructure are also stated in the report.

  8. The Cambridge Structural Database.

    PubMed

    Groom, Colin R; Bruno, Ian J; Lightfoot, Matthew P; Ward, Suzanna C

    2016-04-01

    The Cambridge Structural Database (CSD) contains a complete record of all published organic and metal-organic small-molecule crystal structures. The database has been in operation for over 50 years and continues to be the primary means of sharing structural chemistry data and knowledge across disciplines. As well as structures that are made public to support scientific articles, it includes many structures published directly as CSD Communications. All structures are processed both computationally and by expert structural chemistry editors prior to entering the database. A key component of this processing is the reliable association of the chemical identity of the structure studied with the experimental data. This important step helps ensure that data is widely discoverable and readily reusable. Content is further enriched through selective inclusion of additional experimental data. Entries are available to anyone through free CSD community web services. Linking services developed and maintained by the CCDC, combined with the use of standard identifiers, facilitate discovery from other resources. Data can also be accessed through CCDC and third party software applications and through an application programming interface.

  9. The Cambridge Structural Database

    PubMed Central

    Groom, Colin R.; Bruno, Ian J.; Lightfoot, Matthew P.; Ward, Suzanna C.

    2016-01-01

    The Cambridge Structural Database (CSD) contains a complete record of all published organic and metal–organic small-molecule crystal structures. The database has been in operation for over 50 years and continues to be the primary means of sharing structural chemistry data and knowledge across disciplines. As well as structures that are made public to support scientific articles, it includes many structures published directly as CSD Communications. All structures are processed both computationally and by expert structural chemistry editors prior to entering the database. A key component of this processing is the reliable association of the chemical identity of the structure studied with the experimental data. This important step helps ensure that data is widely discoverable and readily reusable. Content is further enriched through selective inclusion of additional experimental data. Entries are available to anyone through free CSD community web services. Linking services developed and maintained by the CCDC, combined with the use of standard identifiers, facilitate discovery from other resources. Data can also be accessed through CCDC and third party software applications and through an application programming interface. PMID:27048719

  10. The Cambridge Structural Database.

    PubMed

    Groom, Colin R; Bruno, Ian J; Lightfoot, Matthew P; Ward, Suzanna C

    2016-04-01

    The Cambridge Structural Database (CSD) contains a complete record of all published organic and metal-organic small-molecule crystal structures. The database has been in operation for over 50 years and continues to be the primary means of sharing structural chemistry data and knowledge across disciplines. As well as structures that are made public to support scientific articles, it includes many structures published directly as CSD Communications. All structures are processed both computationally and by expert structural chemistry editors prior to entering the database. A key component of this processing is the reliable association of the chemical identity of the structure studied with the experimental data. This important step helps ensure that data is widely discoverable and readily reusable. Content is further enriched through selective inclusion of additional experimental data. Entries are available to anyone through free CSD community web services. Linking services developed and maintained by the CCDC, combined with the use of standard identifiers, facilitate discovery from other resources. Data can also be accessed through CCDC and third party software applications and through an application programming interface. PMID:27048719

  11. The Hawaiian Algal Database: a laboratory LIMS and online resource for biodiversity data

    PubMed Central

    Wang, Norman; Sherwood, Alison R; Kurihara, Akira; Conklin, Kimberly Y; Sauvage, Thomas; Presting, Gernot G

    2009-01-01

    Background Organization and presentation of biodiversity data is greatly facilitated by databases that are specially designed to allow easy data entry and organized data display. Such databases also have the capacity to serve as Laboratory Information Management Systems (LIMS). The Hawaiian Algal Database was designed to showcase specimens collected from the Hawaiian Archipelago, enabling users around the world to compare their specimens with our photographs and DNA sequence data, and to provide lab personnel with an organizational tool for storing various biodiversity data types. Description We describe the Hawaiian Algal Database, a comprehensive and searchable database containing photographs and micrographs, geo-referenced collecting information, taxonomic checklists and standardized DNA sequence data. All data for individual samples are linked through unique accession numbers. Users can search online for sample information by accession number, numerous levels of taxonomy, or collection site. At the present time the database contains data representing over 2,000 samples of marine, freshwater and terrestrial algae from the Hawaiian Archipelago. These samples are primarily red algae, although other taxa are being added. Conclusion The Hawaiian Algal Database is a digital repository for Hawaiian algal samples and acts as a LIMS for the laboratory. Users can make use of the online search tool to view and download specimen photographs and micrographs, DNA sequences and relevant habitat data, including georeferenced collecting locations. It is publicly available at . PMID:19728892

  12. Reflective Database Access Control

    ERIC Educational Resources Information Center

    Olson, Lars E.

    2009-01-01

    "Reflective Database Access Control" (RDBAC) is a model in which a database privilege is expressed as a database query itself, rather than as a static privilege contained in an access control list. RDBAC aids the management of database access controls by improving the expressiveness of policies. However, such policies introduce new interactions…

  13. Database tomography for commercial application

    NASA Technical Reports Server (NTRS)

    Kostoff, Ronald N.; Eberhart, Henry J.

    1994-01-01

    Database tomography is a method for extracting themes and their relationships from text. The algorithms, employed begin with word frequency and word proximity analysis and build upon these results. When the word 'database' is used, think of medical or police records, patents, journals, or papers, etc. (any text information that can be computer stored). Database tomography features a full text, user interactive technique enabling the user to identify areas of interest, establish relationships, and map trends for a deeper understanding of an area of interest. Database tomography concepts and applications have been reported in journals and presented at conferences. One important feature of the database tomography algorithm is that it can be used on a database of any size, and will facilitate the users ability to understand the volume of content therein. While employing the process to identify research opportunities it became obvious that this promising technology has potential applications for business, science, engineering, law, and academe. Examples include evaluating marketing trends, strategies, relationships and associations. Also, the database tomography process would be a powerful component in the area of competitive intelligence, national security intelligence and patent analysis. User interests and involvement cannot be overemphasized.

  14. Database of Properties of Meteors

    NASA Technical Reports Server (NTRS)

    Suggs, Rob; Anthea, Coster

    2006-01-01

    A database of properties of meteors, and software that provides access to the database, are being developed as a contribution to continuing efforts to model the characteristics of meteors with increasing accuracy. Such modeling is necessary for evaluation of the risk of penetration of spacecraft by meteors. For each meteor in the database, the record will include an identification, date and time, radiant properties, ballistic coefficient, radar cross section, size, density, and orbital elements. The property of primary interest in the present case is density, and one of the primary goals in this case is to derive densities of meteors from their atmospheric decelerations. The database and software are expected to be valid anywhere in the solar system. The database will incorporate new data plus results of meteoroid analyses that, heretofore, have not been readily available to the aerospace community. Taken together, the database and software constitute a model that is expected to provide improved estimates of densities and to result in improved risk analyses for interplanetary spacecraft. It is planned to distribute the database and software on a compact disk.

  15. The European Narcolepsy Network (EU-NN) database.

    PubMed

    Khatami, Ramin; Luca, Gianina; Baumann, Christian R; Bassetti, Claudio L; Bruni, Oliviero; Canellas, Francesca; Dauvilliers, Yves; Del Rio-Villegas, Rafael; Feketeova, Eva; Ferri, Raffaele; Geisler, Peter; Högl, Birgit; Jennum, Poul; Kornum, Birgitte R; Lecendreux, Michel; Martins-da-Silva, Antonio; Mathis, Johannes; Mayer, Geert; Paiva, Teresa; Partinen, Markku; Peraita-Adrados, Rosa; Plazzi, Guiseppe; Santamaria, Joan; Sonka, Karel; Riha, Renata; Tafti, Mehdi; Wierzbicka, Aleksandra; Young, Peter; Lammers, Gert Jan; Overeem, Sebastiaan

    2016-06-01

    Narcolepsy with cataplexy is a rare disease with an estimated prevalence of 0.02% in European populations. Narcolepsy shares many features of rare disorders, in particular the lack of awareness of the disease with serious consequences for healthcare supply. Similar to other rare diseases, only a few European countries have registered narcolepsy cases in databases of the International Classification of Diseases or in registries of the European health authorities. A promising approach to identify disease-specific adverse health effects and needs in healthcare delivery in the field of rare diseases is to establish a distributed expert network. A first and important step is to create a database that allows collection, storage and dissemination of data on narcolepsy in a comprehensive and systematic way. Here, the first prospective web-based European narcolepsy database hosted by the European Narcolepsy Network is introduced. The database structure, standardization of data acquisition and quality control procedures are described, and an overview provided of the first 1079 patients from 18 European specialized centres. Due to its standardization this continuously increasing data pool is most promising to provide a better insight into many unsolved aspects of narcolepsy and related disorders, including clear phenotype characterization of subtypes of narcolepsy, more precise epidemiological data and knowledge on the natural history of narcolepsy, expectations about treatment effects, identification of post-marketing medication side-effects, and will contribute to improve clinical trial designs and provide facilities to further develop phase III trials. PMID:26809504

  16. The European Narcolepsy Network (EU-NN) database.

    PubMed

    Khatami, Ramin; Luca, Gianina; Baumann, Christian R; Bassetti, Claudio L; Bruni, Oliviero; Canellas, Francesca; Dauvilliers, Yves; Del Rio-Villegas, Rafael; Feketeova, Eva; Ferri, Raffaele; Geisler, Peter; Högl, Birgit; Jennum, Poul; Kornum, Birgitte R; Lecendreux, Michel; Martins-da-Silva, Antonio; Mathis, Johannes; Mayer, Geert; Paiva, Teresa; Partinen, Markku; Peraita-Adrados, Rosa; Plazzi, Guiseppe; Santamaria, Joan; Sonka, Karel; Riha, Renata; Tafti, Mehdi; Wierzbicka, Aleksandra; Young, Peter; Lammers, Gert Jan; Overeem, Sebastiaan

    2016-06-01

    Narcolepsy with cataplexy is a rare disease with an estimated prevalence of 0.02% in European populations. Narcolepsy shares many features of rare disorders, in particular the lack of awareness of the disease with serious consequences for healthcare supply. Similar to other rare diseases, only a few European countries have registered narcolepsy cases in databases of the International Classification of Diseases or in registries of the European health authorities. A promising approach to identify disease-specific adverse health effects and needs in healthcare delivery in the field of rare diseases is to establish a distributed expert network. A first and important step is to create a database that allows collection, storage and dissemination of data on narcolepsy in a comprehensive and systematic way. Here, the first prospective web-based European narcolepsy database hosted by the European Narcolepsy Network is introduced. The database structure, standardization of data acquisition and quality control procedures are described, and an overview provided of the first 1079 patients from 18 European specialized centres. Due to its standardization this continuously increasing data pool is most promising to provide a better insight into many unsolved aspects of narcolepsy and related disorders, including clear phenotype characterization of subtypes of narcolepsy, more precise epidemiological data and knowledge on the natural history of narcolepsy, expectations about treatment effects, identification of post-marketing medication side-effects, and will contribute to improve clinical trial designs and provide facilities to further develop phase III trials.

  17. The ConsensusPathDB interaction database: 2013 update.

    PubMed

    Kamburov, Atanas; Stelzl, Ulrich; Lehrach, Hans; Herwig, Ralf

    2013-01-01

    Knowledge of the various interactions between molecules in the cell is crucial for understanding cellular processes in health and disease. Currently available interaction databases, being largely complementary to each other, must be integrated to obtain a comprehensive global map of the different types of interactions. We have previously reported the development of an integrative interaction database called ConsensusPathDB (http://ConsensusPathDB.org) that aims to fulfill this task. In this update article, we report its significant progress in terms of interaction content and web interface tools. ConsensusPathDB has grown mainly due to the integration of 12 further databases; it now contains 215 541 unique interactions and 4601 pathways from overall 30 databases. Binary protein interactions are scored with our confidence assessment tool, IntScore. The ConsensusPathDB web interface allows users to take advantage of these integrated interaction and pathway data in different contexts. Recent developments include pathway analysis of metabolite lists, visualization of functional gene/metabolite sets as overlap graphs, gene set analysis based on protein complexes and induced network modules analysis that connects a list of genes through various interaction types. To facilitate the interactive, visual interpretation of interaction and pathway data, we have re-implemented the graph visualization feature of ConsensusPathDB using the Cytoscape.js library.

  18. Comprehensive care in hemophilia.

    PubMed

    Ruiz-Sáez, Arlette

    2012-04-01

    Hemophilia is a chronic and inherited X-linked bleeding disorder that requires life-long medical care. Hemophilia treatment is costly and complex partly because of the cost of the factor concentrates used in replacement therapy. However, the management of hemophilia is not based solely on achieving access to better treatment with safe factor concentrates; it also includes accurately diagnosing the disorder and providing specialized comprehensive care by a multidisciplinary team of specialists trained in hemophilia management. Comprehensive care for the person with hemophilia is defined as the continuous supervision of all medical and psychological aspects affecting the patient and his family and it demands the establishment of specialized centers, called Hemophilia Treatment Centers. The services that should be offered by a comprehensive hemophilia healthcare center are diverse and the multidisciplinary team should be coordinated preferably by a hematologist with the participation of other health professionals. It has been demonstrated that the benefits of establishing hemophilia centers are observed even in developing countries and that changes can be achieved when resources are re-organized, especially when education and training are provided at all levels. To reach these objectives, it is essential to have the participation of the patient and family members, and to strive to obtain the financial and legislative support from the State or Government in order to achieve a national comprehensive care program contemplating all the aspects needed for improving the quality of life for the community of patients with hemophilia and other bleeding disorders.

  19. A database for The Geysers geothermal field

    SciTech Connect

    Bodvarsson, G.S.; Cox, B.L.; Fuller, P.; Ripperda, M.; Tulinius, H.; Witherspoon, P.A.; Goldstein, N.; Flexser, S.; Pruess, K. ); Truesdell, A. )

    1989-09-01

    In Fiscal Year 1985-1986 the Earth Sciences Division of Lawrence Berkeley Laboratory (LBL) began a multi-year project for SLC to organize and analyze the field data from The Geysers. In the first year, most of the work concentrated on the development of a comprehensive database for The Geysers, and conventional reservoir engineering analysis of the data. Essentially, all non-proprietary data for wells at The Geysers have been incorporated into the database, as well as proprietary data from wells located on State leases. In following years, a more detailed analysis of The Geysers data has been carried out. This report is a summary of the non- proprietary work performed in FY 1985--1986. It describes various aspects of the database and also includes: review sections on Field Development, Geology, Geophysics, Geochemistry and Reservoir Engineering. It should be emphasized that these background chapters were written in 1986, and therefore only summarize the information available at that time. The appendices contain individual plots of wellhead pressures, degree of superheat, steam flow rates, cumulative mass flows, injection rates and cumulative injection through 1988 for approximately 250 wells. All of the data contained in this report are non-proprietary, from State and non-State leases. The production/injection and heat flow data from the wells were obtained from the California State Division of Oil and gas (DOG) (courtesy of Dick Thomas). Most of the other data were obtained from SLC files in Sacramento (courtesy of Charles Priddy), or DOG files in Santa Rosa (courtesy of Ken Stelling). 159 refs., 23 figs., 3 tabs.

  20. Cinnarizine: Comprehensive Profile.

    PubMed

    Haress, Nadia G

    2015-01-01

    Cinnarizine is a piperazine derivative with antihistaminic, antiserotonergic, antidopaminergic, and calcium channel-blocking activities. A comprehensive profile was performed on cinnarizine including its description and the different methods of analysis. The 1H NMR and 13C one- and two-dimensional NMR methods were used. In addition, infrared and mass spectral analyses were performed which all confirmed the structure of cinnarizine. PMID:26051684

  1. SynProt: A Database for Proteins of Detergent-Resistant Synaptic Protein Preparations

    PubMed Central

    Pielot, Rainer; Smalla, Karl-Heinz; Müller, Anke; Landgraf, Peter; Lehmann, Anne-Christin; Eisenschmidt, Elke; Haus, Utz-Uwe; Weismantel, Robert; Gundelfinger, Eckart D.; Dieterich, Daniela C.

    2012-01-01

    Chemical synapses are highly specialized cell–cell contacts for communication between neurons in the CNS characterized by complex and dynamic protein networks at both synaptic membranes. The cytomatrix at the active zone (CAZ) organizes the apparatus for the regulated release of transmitters from the presynapse. At the postsynaptic side, the postsynaptic density constitutes the machinery for detection, integration, and transduction of the transmitter signal. Both pre- and postsynaptic protein networks represent the molecular substrates for synaptic plasticity. Their function can be altered both by regulating their composition and by post-translational modification of their components. For a comprehensive understanding of synaptic networks the entire ensemble of synaptic proteins has to be considered. To support this, we established a comprehensive database for synaptic junction proteins (SynProt database) primarily based on proteomics data obtained from biochemical preparations of detergent-resistant synaptic junctions. The database currently contains 2,788 non-redundant entries of rat, mouse, and some human proteins, which mainly have been manually extracted from 12 proteomic studies and annotated for synaptic subcellular localization. Each dataset is completed with manually added information including protein classifiers as well as automatically retrieved and updated information from public databases (UniProt and PubMed). We intend that the database will be used to support modeling of synaptic protein networks and rational experimental design. PMID:22737123

  2. Teach with Databases: Toxics Release Inventory. [Multimedia].

    ERIC Educational Resources Information Center

    Barracato, Jay; Spooner, Barbara

    This curriculum unit provides students with real world applications of science as it pertains to toxic releases into the environment. This boxed package contains the Toxics Release Inventory (TRI) Teacher's Guide, TRI Database Basics guide, comprehensive TRI compact disk with user's guide, "Getting Started: A Guide to Bringing Environmental…

  3. Multi-Database Searching in Forensic Psychology.

    ERIC Educational Resources Information Center

    Piotrowski, Chris; Perdue, Robert W.

    Traditional library skills have been augmented since the introduction of online computerized database services. Because of the complexity of the field, forensic psychology can benefit enormously from the application of comprehensive bibliographic search strategies. The study reported here demonstrated the bibliographic results obtained when a…

  4. Databases of the marine metagenomics.

    PubMed

    Mineta, Katsuhiko; Gojobori, Takashi

    2016-02-01

    The metagenomic data obtained from marine environments is significantly useful for understanding marine microbial communities. In comparison with the conventional amplicon-based approach of metagenomics, the recent shotgun sequencing-based approach has become a powerful tool that provides an efficient way of grasping a diversity of the entire microbial community at a sampling point in the sea. However, this approach accelerates accumulation of the metagenome data as well as increase of data complexity. Moreover, when metagenomic approach is used for monitoring a time change of marine environments at multiple locations of the seawater, accumulation of metagenomics data will become tremendous with an enormous speed. Because this kind of situation has started becoming of reality at many marine research institutions and stations all over the world, it looks obvious that the data management and analysis will be confronted by the so-called Big Data issues such as how the database can be constructed in an efficient way and how useful knowledge should be extracted from a vast amount of the data. In this review, we summarize the outline of all the major databases of marine metagenome that are currently publically available, noting that database exclusively on marine metagenome is none but the number of metagenome databases including marine metagenome data are six, unexpectedly still small. We also extend our explanation to the databases, as reference database we call, that will be useful for constructing a marine metagenome database as well as complementing important information with the database. Then, we would point out a number of challenges to be conquered in constructing the marine metagenome database.

  5. Overview of selected molecular biological databases

    SciTech Connect

    Rayl, K.D.; Gaasterland, T.

    1994-11-01

    This paper presents an overview of the purpose, content, and design of a subset of the currently available biological databases, with an emphasis on protein databases. Databases included in this summary are 3D-ALI, Berlin RNA databank, Blocks, DSSP, EMBL Nucleotide Database, EMP, ENZYME, FSSP, GDB, GenBank, HSSP, LiMB, PDB, PIR, PKCDD, ProSite, and SWISS-PROT. The goal is to provide a starting point for researchers who wish to take advantage of the myriad available databases. Rather than providing a complete explanation of each database, we present its content and form by explaining the details of typical entries. Pointers to more complete ``user guides`` are included, along with general information on where to search for a new database.

  6. The Molecular Biology Database Collection: 2002 update

    PubMed Central

    Baxevanis, Andreas D.

    2002-01-01

    The Molecular Biology Database Collection is an online resource listing key databases of value to the biological community. This Collection is intended to bring fellow scientists’ attention to high-quality databases that are available throughout the world, rather than just be a lengthy listing of all available databases. As such, this up-to-date listing is intended to serve as the initial point from which to find specialized databases that may be of use in biological research. The databases included in this Collection provide new value to the underlying data by virtue of curation, new data connections or other innovative approaches. Short, searchable summaries and updates for each of the databases included in the Collection are available through the Nucleic Acids Research Web site at http://nar.oupjournals.org. PMID:11752241

  7. The Molecular Biology Database Collection: 2003 update

    PubMed Central

    Baxevanis, Andreas D.

    2003-01-01

    The Molecular Biology Database Collection is an online resource listing key databases of value to the biological community. This Collection is intended to bring fellow scientists' attention to high-quality databases that are available throughout the world, rather than just be a lengthy listing of all available databases. As such, this up-to-date listing is intended to serve as the jumping-off point from which to find specialized databases that may be of use in advancing biological research. The databases included in this Collection provide new value to the underlying data by virtue of curation, new data connections or other innovative approaches. Short, searchable summaries and updates for each of the databases included in this Collection are available through the Nucleic Acids Research Web site at http://nar.oupjournals.org. PMID:12519937

  8. Rice Glycosyltransferase (GT) Phylogenomic Database

    DOE Data Explorer

    Ronald, Pamela

    The Ronald Laboratory staff at the University of California-Davis has a primary research focus on the genes of the rice plant. They study the role that genetics plays in the way rice plants respond to their environment. They created the Rice GT Database in order to integrate functional genomic information for putative rice Glycosyltransferases (GTs). This database contains information on nearly 800 putative rice GTs (gene models) identified by sequence similarity searches based on the Carbohydrate Active enZymes (CAZy) database. The Rice GT Database provides a platform to display user-selected functional genomic data on a phylogenetic tree. This includes sequence information, mutant line information, expression data, etc. An interactive chromosomal map shows the position of all rice GTs, and links to rice annotation databases are included. The format is intended to "facilitate the comparison of closely related GTs within different families, as well as perform global comparisons between sets of related families." [From http://ricephylogenomics.ucdavis.edu/cellwalls/gt/genInfo.shtml] See also the primary paper discussing this work: Peijian Cao, Laura E. Bartley, Ki-Hong Jung and Pamela C. Ronalda. Construction of a Rice Glycosyltransferase Phylogenomic Database and Identification of Rice-Diverged Glycosyltransferases. Molecular Plant, 2008, 1(5): 858-877.

  9. Creating databases for biological information: an introduction.

    PubMed

    Stein, Lincoln

    2013-06-01

    The essence of bioinformatics is dealing with large quantities of information. Whether it be sequencing data, microarray data files, mass spectrometric data (e.g., fingerprints), the catalog of strains arising from an insertional mutagenesis project, or even large numbers of PDF files, there inevitably comes a time when the information can simply no longer be managed with files and directories. This is where databases come into play. This unit briefly reviews the characteristics of several database management systems, including flat file, indexed file, relational databases, and NoSQL databases. It compares their strengths and weaknesses and offers some general guidelines for selecting an appropriate database management system.

  10. Creating databases for biological information: an introduction.

    PubMed

    Stein, Lincoln

    2013-06-01

    The essence of bioinformatics is dealing with large quantities of information. Whether it be sequencing data, microarray data files, mass spectrometric data (e.g., fingerprints), the catalog of strains arising from an insertional mutagenesis project, or even large numbers of PDF files, there inevitably comes a time when the information can simply no longer be managed with files and directories. This is where databases come into play. This unit briefly reviews the characteristics of several database management systems, including flat file, indexed file, relational databases, and NoSQL databases. It compares their strengths and weaknesses and offers some general guidelines for selecting an appropriate database management system. PMID:23749755

  11. NOAA Data Rescue of Key Solar Databases and Digitization of Historical Solar Images

    NASA Astrophysics Data System (ADS)

    Coffey, H. E.

    2006-08-01

    Over a number of years, the staff at NOAA National Geophysical Data Center (NGDC) has worked to rescue key solar databases by converting them to digital format and making them available via the World Wide Web. NOAA has had several data rescue programs where staff compete for funds to rescue important and critical historical data that are languishing in archives and at risk of being lost due to deteriorating condition, loss of any metadata or descriptive text that describe the databases, lack of interest or funding in maintaining databases, etc. The Solar-Terrestrial Physics Division at NGDC was able to obtain funds to key in some critical historical tabular databases. Recently the NOAA Climate Database Modernization Program (CDMP) funded a project to digitize historical solar images, producing a large online database of historical daily full disk solar images. The images include the wavelengths Calcium K, Hydrogen Alpha, and white light photos, as well as sunspot drawings and the comprehensive drawings of a multitude of solar phenomena on one daily map (Fraunhofer maps and Wendelstein drawings). Included in the digitization are high resolution solar H-alpha images taken at the Boulder Solar Observatory 1967-1984. The scanned daily images document many phases of solar activity, from decadal variation to rotational variation to daily changes. Smaller versions are available online. Larger versions are available by request. See http://www.ngdc.noaa.gov/stp/SOLAR/ftpsolarimages.html. The tabular listings and solar imagery will be discussed.

  12. Frame-Based Approach To Database Management

    NASA Astrophysics Data System (ADS)

    Voros, Robert S.; Hillman, Donald J.; Decker, D. Richard; Blank, Glenn D.

    1989-03-01

    Practical knowledge-based systems need to reason in terms of knowledge that is already available in databases. This type of knowledge is usually represented as tables acquired from external databases and published reports. Knowledge based systems provide a means for reasoning about entities at a higher level of abstraction. What is needed in many of today's expert systems is a link between the knowledge base and external databases. One such approach is a frame-based database management system. Package Expert (PEx) designs packages for integrated circuits. The thrust of our work is to bring together diverse technologies, data and design knowledge in a coherent system. PEx uses design rules to reason about properties of chips and potential packages, including dimensions, possible materials and packaging requirements. This information is available in existing databases. PEx needs to deal with the following types of information consistently: material databases which are in several formats; technology databases, also in several formats; and parts files which contain dimensional information. It is inefficient and inelegant to have rules access the database directly. Instead, PEx uses a frame-based hierarchical knowledge management approach to databases. Frames serve as the interface between rule-based knowledge and databases. We describe PEx and the use of frames in database retrieval. We first give an overview and the design evolution of the expert system. Next, we describe the system implementation. Finally, we describe how the rules in the expert system access the databases via frames.

  13. SmallSat Database

    NASA Technical Reports Server (NTRS)

    Petropulos, Dolores; Bittner, David; Murawski, Robert; Golden, Bert

    2015-01-01

    The SmallSat has an unrealized potential in both the private industry and in the federal government. Currently over 70 companies, 50 universities and 17 governmental agencies are involved in SmallSat research and development. In 1994, the U.S. Army Missile and Defense mapped the moon using smallSat imagery. Since then Smart Phones have introduced this imagery to the people of the world as diverse industries watched this trend. The deployment cost of smallSats is also greatly reduced compared to traditional satellites due to the fact that multiple units can be deployed in a single mission. Imaging payloads have become more sophisticated, smaller and lighter. In addition, the growth of small technology obtained from private industries has led to the more widespread use of smallSats. This includes greater revisit rates in imagery, significantly lower costs, the ability to update technology more frequently and the ability to decrease vulnerability of enemy attacks. The popularity of smallSats show a changing mentality in this fast paced world of tomorrow. What impact has this created on the NASA communication networks now and in future years? In this project, we are developing the SmallSat Relational Database which can support a simulation of smallSats within the NASA SCaN Compatability Environment for Networks and Integrated Communications (SCENIC) Modeling and Simulation Lab. The NASA Space Communications and Networks (SCaN) Program can use this modeling to project required network support needs in the next 10 to 15 years. The SmallSat Rational Database could model smallSats just as the other SCaN databases model the more traditional larger satellites, with a few exceptions. One being that the smallSat Database is designed to be built-to-order. The SmallSat database holds various hardware configurations that can be used to model a smallSat. It will require significant effort to develop as the research material can only be populated by hand to obtain the unique data

  14. Nuclear science abstracts (NSA) database 1948--1974 (on the Internet)

    SciTech Connect

    1999-02-01

    Nuclear Science Abstracts (NSA) is a comprehensive abstract and index collection of the International Nuclear Science and Technology literature for the period 1948 through 1976. Included are scientific and technical reports of the US Atomic Energy Commission, US Energy Research and Development Administration and its contractors, other agencies, universities, and industrial and research organizations. Coverage of the literature since 1976 is provided by Energy Science and Technology Database. Approximately 25% of the records in the file contain abstracts. These are from the following volumes of the print Nuclear Science Abstracts: Volumes 12--18, Volume 29, and Volume 33. The database contains over 900,000 bibliographic records. All aspects of nuclear science and technology are covered, including: Biomedical Sciences; Metals, Ceramics, and Other Materials; Chemistry; Nuclear Materials and Waste Management; Environmental and Earth Sciences; Particle Accelerators; Engineering; Physics; Fusion Energy; Radiation Effects; Instrumentation; Reactor Technology; Isotope and Radiation Source Technology. The database includes all records contained in Volume 1 (1948) through Volume 33 (1976) of the printed version of Nuclear Science Abstracts (NSA). This worldwide coverage includes books, conference proceedings, papers, patents, dissertations, engineering drawings, and journal literature. This database is now available for searching through the GOV. Research Center (GRC) service. GRC is a single online web-based search service to well known Government databases. Featuring powerful search and retrieval software, GRC is an important research tool. The GRC web site is at http://grc.ntis.gov.

  15. Content Evaluation of Textual CD-ROM and Web Databases. Database Searching Series.

    ERIC Educational Resources Information Center

    Jacso, Peter

    This book provides guidelines for evaluating a variety of database types, including abstracting and indexing, directory, full-text, and page-image databases available in online and/or CD-ROM formats. The book discusses the purpose and techniques of comparing and evaluating the most important characteristics of textual databases, such as their…

  16. The role of databases in areawide pest management

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A database is a comprehensive collection of related data organized for convenient access, generally in a computer. The evolution of computer software and the need to distinguish the specialized computer systems for storing and manipulating data, stimulated development of database management systems...

  17. Accessing the BIOSIS Previews Database in Clinical Psychology.

    ERIC Educational Resources Information Center

    Piotrowski, Chris; Perdue, Bob

    The efficacy of using the BIOSIS Previews database as an online information retrieval tool in clinical psychology was investigated in a study conducted at the University of West Florida. Recognizing the importance of multi-database searching strategies when seeking comprehensive results, this study compared the citation output of this…

  18. EMU Lessons Learned Database

    NASA Technical Reports Server (NTRS)

    Matthews, Kevin M., Jr.; Crocker, Lori; Cupples, J. Scott

    2011-01-01

    As manned space exploration takes on the task of traveling beyond low Earth orbit, many problems arise that must be solved in order to make the journey possible. One major task is protecting humans from the harsh space environment. The current method of protecting astronauts during Extravehicular Activity (EVA) is through use of the specially designed Extravehicular Mobility Unit (EMU). As more rigorous EVA conditions need to be endured at new destinations, the suit will need to be tailored and improved in order to accommodate the astronaut. The Objective behind the EMU Lessons Learned Database(LLD) is to be able to create a tool which will assist in the development of next-generation EMUs, along with maintenance and improvement of the current EMU, by compiling data from Failure Investigation and Analysis Reports (FIARs) which have information on past suit failures. FIARs use a system of codes that give more information on the aspects of the failure, but if one is unfamiliar with the EMU they will be unable to decipher the information. A goal of the EMU LLD is to not only compile the information, but to present it in a user-friendly, organized, searchable database accessible to all familiarity levels with the EMU; both newcomers and veterans alike. The EMU LLD originally started as an Excel database, which allowed easy navigation and analysis of the data through pivot charts. Creating an entry requires access to the Problem Reporting And Corrective Action database (PRACA), which contains the original FIAR data for all hardware. FIAR data are then transferred to, defined, and formatted in the LLD. Work is being done to create a web-based version of the LLD in order to increase accessibility to all of Johnson Space Center (JSC), which includes converting entries from Excel to the HTML format. FIARs related to the EMU have been completed in the Excel version, and now focus has shifted to expanding FIAR data in the LLD to include EVA tools and support hardware such as

  19. Gene3D: comprehensive structural and functional annotation of genomes.

    PubMed

    Yeats, Corin; Lees, Jonathan; Reid, Adam; Kellam, Paul; Martin, Nigel; Liu, Xinhui; Orengo, Christine

    2008-01-01

    Gene3D provides comprehensive structural and functional annotation of most available protein sequences, including the UniProt, RefSeq and Integr8 resources. The main structural annotation is generated through scanning these sequences against the CATH structural domain database profile-HMM library. CATH is a database of manually derived PDB-based structural domains, placed within a hierarchy reflecting topology, homology and conservation and is able to infer more ancient and divergent homology relationships than sequence-based approaches. This data is supplemented with Pfam-A, other non-domain structural predictions (i.e. coiled coils) and experimental data from UniProt. In order to enhance the investigations possible with this data, we have also incorporated a variety of protein annotation resources, including protein-protein interaction data, GO functional assignments, KEGG pathways, FUNCAT functional descriptions and links to microarray expression data. All of this data can be accessed through a newly re-designed website that has a focus on flexibility and clarity, with searches that can be restricted to a single genome or across the entire sequence database. Currently Gene3D contains over 3.5 million domain assignments for nearly 5 million proteins including 527 completed genomes. This is available at: http://gene3d.biochem.ucl.ac.uk/ PMID:18032434

  20. The new international GLE database

    NASA Astrophysics Data System (ADS)

    Duldig, M. L.; Watts, D. J.

    2001-08-01

    The Australian Antarctic Division has agreed to host the international GLE database. Access to the database is via a world-wide-web interface and initially covers all GLEs since the start of the 22nd solar cycle. Access restriction for recent events is controlled by password protection and these data are available only to those groups contributing data to the database. The restrictions to data will be automatically removed for events older than 2 years, in accordance with the data exchange provisions of the Antarctic Treaty. Use of the data requires acknowledgment of the database as the source of the data and acknowledgment of the specific groups that provided the data used. Furthermore, some groups that provide data to the database have specific acknowledgment requirements or wording. A new submission format has been developed that will allow easier exchange of data, although the old format will be acceptable for some time. Data download options include direct web based download and email. Data may also be viewed as listings or plots with web browsers. Search options have also been incorporated. Development of the database will be ongoing with extension to viewing and delivery options, addition of earlier data and the development of mirror sites. It is expected that two mirror sites, one in North America and one in Europe, will be developed to enable fast access for the whole cosmic ray community.

  1. A comprehensive company database analysis of biological assay variability.

    PubMed

    Kramer, Christian; Dahl, Göran; Tyrchan, Christian; Ulander, Johan

    2016-08-01

    Analysis of data from various compounds measured in diverse biological assays is a central part of drug discovery research projects. However, no systematic overview of the variability in biological assays has been published and judgments on assay quality and robustness of data are often based on personal belief and experience within the drug discovery community. To address this we performed a reproducibility analysis of all biological assays at AstraZeneca between 2005 and 2014. We found an average experimental uncertainty of less than a twofold difference and no technologies or assay types had higher variability than others. This work suggests that robust data can be obtained from the most commonly applied biological assays.

  2. The 2014 Nucleic Acids Research Database Issue and an updated NAR online Molecular Biology Database Collection

    PubMed Central

    Fernández-Suárez, Xosé M.; Rigden, Daniel J.; Galperin, Michael Y.

    2014-01-01

    The 2014 Nucleic Acids Research Database Issue includes descriptions of 58 new molecular biology databases and recent updates to 123 databases previously featured in NAR or other journals. For convenience, the issue is now divided into eight sections that reflect major subject categories. Among the highlights of this issue are six databases of the transcription factor binding sites in various organisms and updates on such popular databases as CAZy, Database of Genomic Variants (DGV), dbGaP, DrugBank, KEGG, miRBase, Pfam, Reactome, SEED, TCDB and UniProt. There is a strong block of structural databases, which includes, among others, the new RNA Bricks database, updates on PDBe, PDBsum, ArchDB, Gene3D, ModBase, Nucleic Acid Database and the recently revived iPfam database. An update on the NCBI’s MMDB describes VAST+, an improved tool for protein structure comparison. Two articles highlight the development of the Structural Classification of Proteins (SCOP) database: one describes SCOPe, which automates assignment of new structures to the existing SCOP hierarchy; the other one describes the first version of SCOP2, with its more flexible approach to classifying protein structures. This issue also includes a collection of articles on bacterial taxonomy and metagenomics, which includes updates on the List of Prokaryotic Names with Standing in Nomenclature (LPSN), Ribosomal Database Project (RDP), the Silva/LTP project and several new metagenomics resources. The NAR online Molecular Biology Database Collection, http://www.oxfordjournals.org/nar/database/c/, has been expanded to 1552 databases. The entire Database Issue is freely available online on the Nucleic Acids Research website (http://nar.oxfordjournals.org/). PMID:24316579

  3. The 2014 Nucleic Acids Research Database Issue and an updated NAR online Molecular Biology Database Collection.

    PubMed

    Fernández-Suárez, Xosé M; Rigden, Daniel J; Galperin, Michael Y

    2014-01-01

    The 2014 Nucleic Acids Research Database Issue includes descriptions of 58 new molecular biology databases and recent updates to 123 databases previously featured in NAR or other journals. For convenience, the issue is now divided into eight sections that reflect major subject categories. Among the highlights of this issue are six databases of the transcription factor binding sites in various organisms and updates on such popular databases as CAZy, Database of Genomic Variants (DGV), dbGaP, DrugBank, KEGG, miRBase, Pfam, Reactome, SEED, TCDB and UniProt. There is a strong block of structural databases, which includes, among others, the new RNA Bricks database, updates on PDBe, PDBsum, ArchDB, Gene3D, ModBase, Nucleic Acid Database and the recently revived iPfam database. An update on the NCBI's MMDB describes VAST+, an improved tool for protein structure comparison. Two articles highlight the development of the Structural Classification of Proteins (SCOP) database: one describes SCOPe, which automates assignment of new structures to the existing SCOP hierarchy; the other one describes the first version of SCOP2, with its more flexible approach to classifying protein structures. This issue also includes a collection of articles on bacterial taxonomy and metagenomics, which includes updates on the List of Prokaryotic Names with Standing in Nomenclature (LPSN), Ribosomal Database Project (RDP), the Silva/LTP project and several new metagenomics resources. The NAR online Molecular Biology Database Collection, http://www.oxfordjournals.org/nar/database/c/, has been expanded to 1552 databases. The entire Database Issue is freely available online on the Nucleic Acids Research website (http://nar.oxfordjournals.org/).

  4. Ribosomal Database Project II

    DOE Data Explorer

    The Ribosomal Database Project (RDP) provides ribosome related data and services to the scientific community, including online data analysis and aligned and annotated Bacterial small-subunit 16S rRNA sequences. As of March 2008, RDP Release 10 is available and currently (August 2009) contains 1,074,075 aligned 16S rRNA sequences. Data that can be downloaded include zipped GenBank and FASTA alignment files, a histogram (in Excel) of the number of RDP sequences spanning each base position, data in the Functional Gene Pipeline Repository, and various user submitted data. The RDP-II website also provides numerous analysis tools.[From the RDP-II home page at http://rdp.cme.msu.edu/index.jsp

  5. View generated database

    NASA Technical Reports Server (NTRS)

    Downward, James G.

    1992-01-01

    This document represents the final report for the View Generated Database (VGD) project, NAS7-1066. It documents the work done on the project up to the point at which all project work was terminated due to lack of project funds. The VGD was to provide the capability to accurately represent any real-world object or scene as a computer model. Such models include both an accurate spatial/geometric representation of surfaces of the object or scene, as well as any surface detail present on the object. Applications of such models are numerous, including acquisition and maintenance of work models for tele-autonomous systems, generation of accurate 3-D geometric/photometric models for various 3-D vision systems, and graphical models for realistic rendering of 3-D scenes via computer graphics.

  6. Annual Review of Database Development: 1992.

    ERIC Educational Resources Information Center

    Basch, Reva

    1992-01-01

    Reviews recent trends in databases and online systems. Topics discussed include new access points for established databases; acquisitions, consolidations, and competition between vendors; European coverage; international services; online reference materials, including telephone directories; political and legal materials and public records;…

  7. CMPD: cancer mutant proteome database.

    PubMed

    Huang, Po-Jung; Lee, Chi-Ching; Tan, Bertrand Chin-Ming; Yeh, Yuan-Ming; Julie Chu, Lichieh; Chen, Ting-Wen; Chang, Kai-Ping; Lee, Cheng-Yang; Gan, Ruei-Chi; Liu, Hsuan; Tang, Petrus

    2015-01-01

    Whole-exome sequencing, which centres on the protein coding regions of disease/cancer associated genes, represents the most cost-effective method to-date for deciphering the association between genetic alterations and diseases. Large-scale whole exome/genome sequencing projects have been launched by various institutions, such as NCI, Broad Institute and TCGA, to provide a comprehensive catalogue of coding variants in diverse tissue samples and cell lines. Further functional and clinical interrogation of these sequence variations must rely on extensive cross-platforms integration of sequencing information and a proteome database that explicitly and comprehensively archives the corresponding mutated peptide sequences. While such data resource is a critical for the mass spectrometry-based proteomic analysis of exomic variants, no database is currently available for the collection of mutant protein sequences that correspond to recent large-scale genomic data. To address this issue and serve as bridge to integrate genomic and proteomics datasets, CMPD (http://cgbc.cgu.edu.tw/cmpd) collected over 2 millions genetic alterations, which not only facilitates the confirmation and examination of potential cancer biomarkers but also provides an invaluable resource for translational medicine research and opportunities to identify mutated proteins encoded by mutated genes.

  8. CMPD: cancer mutant proteome database

    PubMed Central

    Huang, Po-Jung; Lee, Chi-Ching; Tan, Bertrand Chin-Ming; Yeh, Yuan-Ming; Julie Chu, Lichieh; Chen, Ting-Wen; Chang, Kai-Ping; Lee, Cheng-Yang; Gan, Ruei-Chi; Liu, Hsuan; Tang, Petrus

    2015-01-01

    Whole-exome sequencing, which centres on the protein coding regions of disease/cancer associated genes, represents the most cost-effective method to-date for deciphering the association between genetic alterations and diseases. Large-scale whole exome/genome sequencing projects have been launched by various institutions, such as NCI, Broad Institute and TCGA, to provide a comprehensive catalogue of coding variants in diverse tissue samples and cell lines. Further functional and clinical interrogation of these sequence variations must rely on extensive cross-platforms integration of sequencing information and a proteome database that explicitly and comprehensively archives the corresponding mutated peptide sequences. While such data resource is a critical for the mass spectrometry-based proteomic analysis of exomic variants, no database is currently available for the collection of mutant protein sequences that correspond to recent large-scale genomic data. To address this issue and serve as bridge to integrate genomic and proteomics datasets, CMPD (http://cgbc.cgu.edu.tw/cmpd) collected over 2 millions genetic alterations, which not only facilitates the confirmation and examination of potential cancer biomarkers but also provides an invaluable resource for translational medicine research and opportunities to identify mutated proteins encoded by mutated genes. PMID:25398898

  9. The mouse genome informatics and the mouse genome database

    SciTech Connect

    Maltais, L.J.; Blackburn, R.E.; Bradt, D.W.

    1994-09-01

    The Mouse Genome Database (MGD) is a centralized, comprehensive database of the mouse genome that includes genetic mapping data, comparative mapping data, gene descriptions, mutant phenotype descriptions, strains and allelic polymorphism data, inbred strain characteristics, physical mapping data, and molecular probes and clones data. Data in MGD are obtained from the published literature and by electronic transfer from laboratories working on large backcross panels of mice. MGD provides tools that enable the user to search the database, retrieve data, generate reports, analyze data, annotate records, and build genetic maps. The Encyclopedia of the Mouse Genome provides a graphic user interface to mouse genome data. It consists of software tools including: LinkMap, a graphic display of genetic linkage maps with the ability to magnify regions of high locus density: CytoMap, a graphic display of cytogenetic maps showing banded chromosomes with cytogenetic locations of genes and chromosomal aberrations; CATS, a catalog searching tool for text retrieval of mouse locus descriptions. These software tools provide access to the following data sets: Chromosome Committee Reports, MIT Genome Center data, GBASE reports, Mouse Locus Catalog (MLC), and Mouse Cytogenetic Mapping Data. The MGD is available to the scientific community through the World Wide Web (WWW) and Gopher. In addition GBASE can be accessed via the Internet.

  10. Physiological Information Database (PID)

    EPA Science Inventory

    EPA has developed a physiological information database (created using Microsoft ACCESS) intended to be used in PBPK modeling. The database contains physiological parameter values for humans from early childhood through senescence as well as similar data for laboratory animal spec...

  11. Network II Database

    1994-11-07

    The Oak Ridge National Laboratory (ORNL) Rail and Barge Network II Database is a representation of the rail and barge system of the United States. The network is derived from the Federal Rail Administration (FRA) rail database.

  12. Household Products Database: Pesticides

    MedlinePlus

    ... Names Types of Products Manufacturers Ingredients About the Database FAQ Product Recalls Help Glossary Contact Us More ... holders. Information is extracted from Consumer Product Information Database ©2001-2015 by DeLima Associates. All rights reserved. ...

  13. Comprehensive Repertoire of Foldable Regions within Whole Genomes

    PubMed Central

    Faure, Guilhem; Callebaut, Isabelle

    2013-01-01

    In order to get a comprehensive repertoire of foldable domains within whole proteomes, including orphan domains, we developed a novel procedure, called SEG-HCA. From only the information of a single amino acid sequence, SEG-HCA automatically delineates segments possessing high densities in hydrophobic clusters, as defined by Hydrophobic Cluster Analysis (HCA). These hydrophobic clusters mainly correspond to regular secondary structures, which together form structured or foldable regions. Genome-wide analyses revealed that SEG-HCA is opposite of disorder predictors, both addressing distinct structural states. Interestingly, there is however an overlap between the two predictions, including small segments of disordered sequences, which undergo coupled folding and binding. SEG-HCA thus gives access to these specific domains, which are generally poorly represented in domain databases. Comparison of the whole set of SEG-HCA predictions with the Conserved Domain Database (CDD) also highlighted a wide proportion of predicted large (length >50 amino acids) segments, which are CDD orphan. These orphan sequences may either correspond to highly divergent members of already known families or belong to new families of domains. Their comprehensive description thus opens new avenues to investigate new functional and/or structural features, which remained so far uncovered. Altogether, the data described here provide new insights into the protein architecture and organization throughout the three kingdoms of life. PMID:24204229

  14. Mars Global Digital Dune Database: MC2-MC29

    USGS Publications Warehouse

    Hayward, Rosalyn K.; Mullins, Kevin F.; Fenton, L.K.; Hare, T.M.; Titus, T.N.; Bourke, M.C.; Colaprete, Anthony; Christensen, P.R.

    2007-01-01

    Introduction The Mars Global Digital Dune Database presents data and describes the methodology used in creating the database. The database provides a comprehensive and quantitative view of the geographic distribution of moderate- to large-size dune fields from 65? N to 65? S latitude and encompasses ~ 550 dune fields. The database will be expanded to cover the entire planet in later versions. Although we have attempted to include all dune fields between 65? N and 65? S, some have likely been excluded for two reasons: 1) incomplete THEMIS IR (daytime) coverage may have caused us to exclude some moderate- to large-size dune fields or 2) resolution of THEMIS IR coverage (100m/pixel) certainly caused us to exclude smaller dune fields. The smallest dune fields in the database are ~ 1 km2 in area. While the moderate to large dune fields are likely to constitute the largest compilation of sediment on the planet, smaller stores of sediment of dunes are likely to be found elsewhere via higher resolution data. Thus, it should be noted that our database excludes all small dune fields and some moderate to large dune fields as well. Therefore the absence of mapped dune fields does not mean that such dune fields do not exist and is not intended to imply a lack of saltating sand in other areas. Where availability and quality of THEMIS visible (VIS) or Mars Orbiter Camera narrow angle (MOC NA) images allowed, we classifed dunes and included dune slipface measurements, which were derived from gross dune morphology and represent the prevailing wind direction at the last time of significant dune modification. For dunes located within craters, the azimuth from crater centroid to dune field centroid was calculated. Output from a general circulation model (GCM) is also included. In addition to polygons locating dune fields, the database includes over 1800 selected Thermal Emission Imaging System (THEMIS) infrared (IR), THEMIS visible (VIS) and Mars Orbiter Camera Narrow Angle (MOC NA

  15. The 2015 Nucleic Acids Research Database Issue and molecular biology database collection.

    PubMed

    Galperin, Michael Y; Rigden, Daniel J; Fernández-Suárez, Xosé M

    2015-01-01

    The 2015 Nucleic Acids Research Database Issue contains 172 papers that include descriptions of 56 new molecular biology databases, and updates on 115 databases whose descriptions have been previously published in NAR or other journals. Following the classification that has been introduced last year in order to simplify navigation of the entire issue, these articles are divided into eight subject categories. This year's highlights include RNAcentral, an international community portal to various databases on noncoding RNA; ValidatorDB, a validation database for protein structures and their ligands; SASBDB, a primary repository for small-angle scattering data of various macromolecular complexes; MoonProt, a database of 'moonlighting' proteins, and two new databases of protein-protein and other macromolecular complexes, ComPPI and the Complex Portal. This issue also includes an unusually high number of cancer-related databases and other databases dedicated to genomic basics of disease and potential drugs and drug targets. The size of NAR online Molecular Biology Database Collection, http://www.oxfordjournals.org/nar/database/a/, remained approximately the same, following the addition of 74 new resources and removal of 77 obsolete web sites. The entire Database Issue is freely available online on the Nucleic Acids Research web site (http://nar.oxfordjournals.org/).

  16. Marine casualty and pollution database (raw data only) (on CD-ROM). Data file

    SciTech Connect

    1998-06-01

    The Marine Casualty and Pollution Database provides details about marine casualty and pollution incidents investigated by the US Coast Guard Marine Safety Officer. The database is an invaluable source of information to understand particulars and circumstances of marine accidents and pollution incidents. The CD-ROM contains forty data tables derived from marine casualty and pollution investigations conducted by investigators at US Coast Guard Marine Safety Offices throughout the United States. The data collection period began in 1973 and is ongoing. Also included on the CD-ROM are vessel and facility tables with specific information on vessel and facility constitution and operating details. The Coast Guard maintains a comprehensive database on approximately 460,000 US and foreign vessels and 32,000 facilities. Entity and attribute descriptions and suggested solutions to general marine pollution, vessel casualty, and personnel injury and death questions are outlined in the documentation.

  17. Sharing the Load: A Planning Checklist for Consortial Database Use.

    ERIC Educational Resources Information Center

    Klingler, Tom

    1999-01-01

    Provides a checklist of concerns for libraries planning to implement shared research databases in a consortium environment, based on experiences at OhioLINK. Highlights include database selection criteria, licensing, pricing, funding, vendor considerations, database profiling, custom data processing, hardware selection, database access,…

  18. Mission and Assets Database

    NASA Technical Reports Server (NTRS)

    Baldwin, John; Zendejas, Silvino; Gutheinz, Sandy; Borden, Chester; Wang, Yeou-Fang

    2009-01-01

    Mission and Assets Database (MADB) Version 1.0 is an SQL database system with a Web user interface to centralize information. The database stores flight project support resource requirements, view periods, antenna information, schedule, and forecast results for use in mid-range and long-term planning of Deep Space Network (DSN) assets.

  19. The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases.

    PubMed

    Caspi, Ron; Billington, Richard; Ferrer, Luciana; Foerster, Hartmut; Fulcher, Carol A; Keseler, Ingrid M; Kothari, Anamika; Krummenacker, Markus; Latendresse, Mario; Mueller, Lukas A; Ong, Quang; Paley, Suzanne; Subhraveti, Pallavi; Weaver, Daniel S; Karp, Peter D

    2016-01-01

    The MetaCyc database (MetaCyc.org) is a freely accessible comprehensive database describing metabolic pathways and enzymes from all domains of life. The majority of MetaCyc pathways are small-molecule metabolic pathways that have been experimentally determined. MetaCyc contains more than 2400 pathways derived from >46,000 publications, and is the largest curated collection of metabolic pathways. BioCyc (BioCyc.org) is a collection of 5700 organism-specific Pathway/Genome Databases (PGDBs), each containing the full genome and predicted metabolic network of one organism, including metabolites, enzymes, reactions, metabolic pathways, predicted operons, transport systems, and pathway-hole fillers. The BioCyc website offers a variety of tools for querying and analyzing PGDBs, including Omics Viewers and tools for comparative analysis. This article provides an update of new developments in MetaCyc and BioCyc during the last two years, including addition of Gibbs free energy values for compounds and reactions; redesign of the primary gene/protein page; addition of a tool for creating diagrams containing multiple linked pathways; several new search capabilities, including searching for genes based on sequence patterns, searching for databases based on an organism's phenotypes, and a cross-organism search; and a metabolite identifier translation service.

  20. The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases

    PubMed Central

    Caspi, Ron; Billington, Richard; Ferrer, Luciana; Foerster, Hartmut; Fulcher, Carol A.; Keseler, Ingrid M.; Kothari, Anamika; Krummenacker, Markus; Latendresse, Mario; Mueller, Lukas A.; Ong, Quang; Paley, Suzanne; Subhraveti, Pallavi; Weaver, Daniel S.; Karp, Peter D.

    2016-01-01

    The MetaCyc database (MetaCyc.org) is a freely accessible comprehensive database describing metabolic pathways and enzymes from all domains of life. The majority of MetaCyc pathways are small-molecule metabolic pathways that have been experimentally determined. MetaCyc contains more than 2400 pathways derived from >46 000 publications, and is the largest curated collection of metabolic pathways. BioCyc (BioCyc.org) is a collection of 5700 organism-specific Pathway/Genome Databases (PGDBs), each containing the full genome and predicted metabolic network of one organism, including metabolites, enzymes, reactions, metabolic pathways, predicted operons, transport systems, and pathway-hole fillers. The BioCyc website offers a variety of tools for querying and analyzing PGDBs, including Omics Viewers and tools for comparative analysis. This article provides an update of new developments in MetaCyc and BioCyc during the last two years, including addition of Gibbs free energy values for compounds and reactions; redesign of the primary gene/protein page; addition of a tool for creating diagrams containing multiple linked pathways; several new search capabilities, including searching for genes based on sequence patterns, searching for databases based on an organism's phenotypes, and a cross-organism search; and a metabolite identifier translation service. PMID:26527732

  1. Asbestos Exposure Assessment Database

    NASA Technical Reports Server (NTRS)

    Arcot, Divya K.

    2010-01-01

    Exposure to particular hazardous materials in a work environment is dangerous to the employees who work directly with or around the materials as well as those who come in contact with them indirectly. In order to maintain a national standard for safe working environments and protect worker health, the Occupational Safety and Health Administration (OSHA) has set forth numerous precautionary regulations. NASA has been proactive in adhering to these regulations by implementing standards which are often stricter than regulation limits and administering frequent health risk assessments. The primary objective of this project is to create the infrastructure for an Asbestos Exposure Assessment Database specific to NASA Johnson Space Center (JSC) which will compile all of the exposure assessment data into a well-organized, navigable format. The data includes Sample Types, Samples Durations, Crafts of those from whom samples were collected, Job Performance Requirements (JPR) numbers, Phased Contrast Microscopy (PCM) and Transmission Electron Microscopy (TEM) results and qualifiers, Personal Protective Equipment (PPE), and names of industrial hygienists who performed the monitoring. This database will allow NASA to provide OSHA with specific information demonstrating that JSC s work procedures are protective enough to minimize the risk of future disease from the exposures. The data has been collected by the NASA contractors Computer Sciences Corporation (CSC) and Wyle Laboratories. The personal exposure samples were collected from devices worn by laborers working at JSC and by building occupants located in asbestos-containing buildings.

  2. CyanOmics: an integrated database of omics for the model cyanobacterium Synechococcus sp. PCC 7002.

    PubMed

    Yang, Yaohua; Feng, Jie; Li, Tao; Ge, Feng; Zhao, Jindong

    2015-01-01

    Cyanobacteria are an important group of organisms that carry out oxygenic photosynthesis and play vital roles in both the carbon and nitrogen cycles of the Earth. The annotated genome of Synechococcus sp. PCC 7002, as an ideal model cyanobacterium, is available. A series of transcriptomic and proteomic studies of Synechococcus sp. PCC 7002 cells grown under different conditions have been reported. However, no database of such integrated omics studies has been constructed. Here we present CyanOmics, a database based on the results of Synechococcus sp. PCC 7002 omics studies. CyanOmics comprises one genomic dataset, 29 transcriptomic datasets and one proteomic dataset and should prove useful for systematic and comprehensive analysis of all those data. Powerful browsing and searching tools are integrated to help users directly access information of interest with enhanced visualization of the analytical results. Furthermore, Blast is included for sequence-based similarity searching and Cluster 3.0, as well as the R hclust function is provided for cluster analyses, to increase CyanOmics's usefulness. To the best of our knowledge, it is the first integrated omics analysis database for cyanobacteria. This database should further understanding of the transcriptional patterns, and proteomic profiling of Synechococcus sp. PCC 7002 and other cyanobacteria. Additionally, the entire database framework is applicable to any sequenced prokaryotic genome and could be applied to other integrated omics analysis projects. Database URL: http://lag.ihb.ac.cn/cyanomics.

  3. Cross-border use of food databases: equivalence of US and Australian databases for macronutrients.

    PubMed

    Summer, Suzanne S; Ollberding, Nicholas J; Guy, Trish; Setchell, Kenneth D R; Brown, Nadine; Kalkwarf, Heidi J

    2013-10-01

    When estimating dietary intake across multiple countries, the lack of a single comprehensive dietary database may lead researchers to modify one database to analyze intakes for all participants. This approach may yield results different from those using the country-specific database and introduce measurement error. We examined whether nutrient intakes of Australians calculated with a modified US database would be similar to those calculated with an Australian database. We analyzed 3-day food records of 68 Australian adults using the US-based Nutrition Data System for Research, modified to reflect food items consumed in Australia. Modification entailed identifying a substitute food whose energy and macronutrient content were within 10% of the Australian food or by adding a new food to the database. Paired Wilcoxon signed rank tests were used to compare differences in nutrient intakes estimated by both databases, and Pearson and intraclass correlation coefficients measured degree of association and agreement between intake estimates for individuals. Median intakes of energy, carbohydrate, protein, and fiber differed by <5% at the group level. Larger discrepancies were seen for fat (11%; P<0.0001) and most micronutrients. Despite strong correlations, nutrient intakes differed by >10% for an appreciable percentage of participants (35% for energy to 69% for total fat). Adding country-specific food items to an existing database resulted in similar overall macronutrient intake estimates but was insufficient for estimating individual intakes. When analyzing nutrient intakes in multinational studies, greater standardization and modification of databases may be required to more accurately estimate intake of individuals.

  4. Database Research for Pediatric Infectious Diseases.

    PubMed

    Kronman, Matthew P; Gerber, Jeffrey S; Newland, Jason G; Hersh, Adam L

    2015-06-01

    Multiple electronic and administrative databases are available for the study of pediatric infectious diseases. In this review, we identify research questions well suited to investigations using these databases and highlight their advantages, including their relatively low cost, efficiency, and ability to detect rare outcomes. We discuss important limitations, including those inherent in observational study designs and the potential for misclassification of exposures and outcomes, and identify strategies for addressing these limitations. We provide examples of commonly used databases and discuss methodologic considerations in undertaking studies using large databases. Last, we propose a checklist for use in planning or evaluating studies of pediatric infectious diseases that employ electronic databases, and we outline additional practical considerations regarding the cost of and how to access commonly used databases. PMID:26407414

  5. bioDBnet: the biological database network

    PubMed Central

    Mudunuri, Uma; Che, Anney; Yi, Ming; Stephens, Robert M.

    2009-01-01

    Summary: bioDBnet is an online web resource that provides interconnected access to many types of biological databases. It has integrated many of the most commonly used biological databases and in its current state has 153 database identifiers (nodes) covering all aspects of biology including genes, proteins, pathways and other biological concepts. bioDBnet offers various ways to work with these databases including conversions, extensive database reports, custom navigation and has various tools to enhance the quality of the results. Importantly, the access to bioDBnet is updated regularly, providing access to the most recent releases of each individual database. Availability: http://biodbnet.abcc.ncifcrf.gov Contact: stephensr@mail.nih.gov Supplementary information: Supplementary data are available at Bioinformatics online PMID:19129209

  6. TeloPIN: a database of telomeric proteins interaction network in mammalian cells

    PubMed Central

    Luo, Zhenhua; Dai, Zhiming; Xie, Xiaowei; Feng, Xuyang; Liu, Dan; Songyang, Zhou; Xiong, Yuanyan

    2015-01-01

    Interaction network surrounding telomeres has been intensively studied during the past two decades. However, no specific resource by integrating telomere interaction information data is currently available. To facilitate the understanding of the molecular interaction network by which telomeres are associated with biological process and diseases, we have developed TeloPIN (Telomeric Proteins Interaction Network) database (http://songyanglab.sysu.edu.cn/telopin/), a novel database that points to provide comprehensive information on protein–protein, protein–DNA and protein–RNA interaction of telomeres. TeloPIN database contains four types of interaction data, including (i) protein–protein interaction (PPI) data, (ii) telomeric proteins ChIP-seq data, (iii) telomere-associated proteins data and (iv) telomeric repeat-containing RNAs (TERRA)-interacting proteins data. By analyzing these four types of interaction data, we found that 358 and 199 proteins have more than one type of interaction information in human and mouse cells, respectively. We also developed table browser and TeloChIP genome browser to help researchers with better integrated visualization of interaction data from different studies. The current release of TeloPIN database includes 1111 PPI, eight telomeric protein ChIP-seq data sets, 1391 telomere-associated proteins and 183 TERRA-interacting proteins from 92 independent studies in mammalian cells. The interaction information provided by TeloPIN database will greatly expand our knowledge of telomeric proteins interaction network. Database URL: TeloPIN database address is http://songyanglab.sysu.edu.cn/telopin. TeloPIN database is freely available to non-commercial use. PMID:25792605

  7. An automated system for terrain database construction

    NASA Technical Reports Server (NTRS)

    Johnson, L. F.; Fretz, R. K.; Logan, T. L.; Bryant, N. A.

    1987-01-01

    An automated Terrain Database Preparation System (TDPS) for the construction and editing of terrain databases used in computerized wargaming simulation exercises has been developed. The TDPS system operates under the TAE executive, and it integrates VICAR/IBIS image processing and Geographic Information System software with CAD/CAM data capture and editing capabilities. The terrain database includes such features as roads, rivers, vegetation, and terrain roughness.

  8. Safeguarding Databases Basic Concepts Revisited.

    ERIC Educational Resources Information Center

    Cardinali, Richard

    1995-01-01

    Discusses issues of database security and integrity, including computer crime and vandalism, human error, computer viruses, employee and user access, and personnel policies. Suggests some precautions to minimize system vulnerability such as careful personnel screening, audit systems, passwords, and building and software security systems. (JKP)

  9. When Is the Same Database Not the Same? Database Differences among Systems.

    ERIC Educational Resources Information Center

    Tenopir, Carol; Hover, Katie

    1993-01-01

    Examines how online versions of databases vary on different systems. Differences are grouped into three categories: obvious differences, including easily available information such as updating, dates covered, and price; database choice, including subfile structure, field structure, and content; and system choice, including support and system…

  10. A comparison of biomedical databases.

    PubMed Central

    Mychko-Megrin, A Y

    1991-01-01

    Various published bibliographic and abstract services covering the period 1970-1988 were compared to analyze scope and coverage. A total of 7,281 articles and book titles (1,655 Soviet and 5,626 foreign) were selected on forty-one topics in different medical fields. The titles originated from three different samples but included all Soviet medical literature on the subjects. A distribution of biomedical serials from five databases is given by country, and twelve indices to assess the quality of biomedical databases are suggested. PMID:1884085

  11. 34 CFR 303.15 - Include; including.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 34 Education 2 2010-07-01 2010-07-01 false Include; including. 303.15 Section 303.15 Education Regulations of the Offices of the Department of Education (Continued) OFFICE OF SPECIAL EDUCATION AND REHABILITATIVE SERVICES, DEPARTMENT OF EDUCATION EARLY INTERVENTION PROGRAM FOR INFANTS AND TODDLERS...

  12. An Introduction to Database Structure and Database Machines.

    ERIC Educational Resources Information Center

    Detweiler, Karen

    1984-01-01

    Enumerates principal management objectives of database management systems (data independence, quality, security, multiuser access, central control) and criteria for comparison (response time, size, flexibility, other features). Conventional database management systems, relational databases, and database machines used for backend processing are…

  13. PeptiSite: a structural database of peptide binding sites in 4D.

    PubMed

    Acharya, Chayan; Kufareva, Irina; Ilatovskiy, Andrey V; Abagyan, Ruben

    2014-03-21

    We developed PeptiSite, a comprehensive and reliable database of biologically and structurally characterized peptide-binding sites, in which each site is represented by an ensemble of its complexes with protein, peptide and small molecule partners. The unique features of the database include: (1) the ensemble site representation that provides a fourth dimension to the otherwise three dimensional data, (2) comprehensive characterization of the binding site architecture that may consist of a multimeric protein assembly with cofactors and metal ions and (3) analysis of consensus interaction motifs within the ensembles and identification of conserved determinants of these interactions. Currently the database contains 585 proteins with 650 peptide-binding sites. http://peptisite.ucsd.edu/ link allows searching for the sites of interest and interactive visualization of the ensembles using the ActiveICM web-browser plugin. This structural database for protein-peptide interactions enables understanding of structural principles of these interactions and may assist the development of an efficient peptide docking benchmark. PMID:24406170

  14. DOE Global Energy Storage Database

    DOE Data Explorer

    The DOE International Energy Storage Database has more than 400 documented energy storage projects from 34 countries around the world. The database provides free, up-to-date information on grid-connected energy storage projects and relevant state and federal policies. More than 50 energy storage technologies are represented worldwide, including multiple battery technologies, compressed air energy storage, flywheels, gravel energy storage, hydrogen energy storage, pumped hydroelectric, superconducting magnetic energy storage, and thermal energy storage. The policy section of the database shows 18 federal and state policies addressing grid-connected energy storage, from rules and regulations to tariffs and other financial incentives. It is funded through DOE’s Sandia National Laboratories, and has been operating since January 2012.

  15. The Giardia genome project database.

    PubMed

    McArthur, A G; Morrison, H G; Nixon, J E; Passamaneck, N Q; Kim, U; Hinkle, G; Crocker, M K; Holder, M E; Farr, R; Reich, C I; Olsen, G E; Aley, S B; Adam, R D; Gillin, F D; Sogin, M L

    2000-08-15

    The Giardia genome project database provides an online resource for Giardia lamblia (WB strain, clone C6) genome sequence information. The database includes edited single-pass reads, the results of BLASTX searches, and details of progress towards sequencing the entire 12 million-bp Giardia genome. Pre-sorted BLASTX results can be retrieved based on keyword searches and BLAST searches of the high throughput Giardia data can be initiated from the web site or through NCBI. Descriptions of the genomic DNA libraries, project protocols and summary statistics are also available. Although the Giardia genome project is ongoing, new sequences are made available on a bi-monthly basis to ensure that researchers have access to information that may assist them in the search for genes and their biological function. The current URL of the Giardia genome project database is www.mbl.edu/Giardia.

  16. Polymorphix: a sequence polymorphism database.

    PubMed

    Bazin, Eric; Duret, Laurent; Penel, Simon; Galtier, Nicolas

    2005-01-01

    Within-species sequence variation data are of special interest since they contain information about recent population/species history, and the molecular evolutionary forces currently in action in natural populations. These data, however, are presently dispersed within generalist databases, and are difficult to access. To solve this problem, we have developed Polymorphix, a database dedicated to sequence polymorphism. It contains within-species homologous sequence families built using EMBL/GenBank under suitable similarity and bibliographic criteria. Polymorphix is an ACNUC structured database allowing both simple and complex queries for population genomic studies. Alignments within families as well as phylogenetic trees can be download. When available, outgroups are included in the alignment. Polymorphix contains sequences from the nuclear, mitochondrial and chloroplastic genomes of every eukaryote species represented in EMBL. It can be accessed by a web interface (http://pbil.univ-lyon1.fr/polymorphix/query.php).

  17. Comprehension of Discourse Markers and Reading Comprehension

    ERIC Educational Resources Information Center

    Khatib, Mohamad

    2011-01-01

    According to many research findings, the presence of discourse markers (DMs) enhances readers' comprehension of the texts they read. However, there is a paucity of research on the relationship between knowledge of DMs and reading comprehension (RC) and the present study explores the relationship between them. Knowledge of DMs is measured through…

  18. Petrophysical database of Uganda

    NASA Astrophysics Data System (ADS)

    Ruotoistenmäki, Tapio; Birungi, Nelson R.

    2015-06-01

    The petrophysical database of Uganda contains data on ca. 5800 rock samples collected and analyzed during 2009-2012 in international geological and geophysical projects covering the main part of the land area of Uganda. The parameters included are the susceptibilities and densities of all available field samples. Susceptibilities were measured from the samples from three directions. Using these parameters, we also calculated the ratios of susceptibility maxima/minima reflecting direction homogeneity of magnetic minerals, and estimated the iron content of paramagnetic samples and the magnetite content of ferrimagnetic samples. Statistical and visual analysis of the petrophysical data of Uganda demonstrated their wide variation, thus emphasizing their importance in analyzing the bedrock variations in three dimensions. Using the density-susceptibility diagram, the data can be classified into six main groups: 1. A low density and susceptibility group, consisting of sedimentary and altered rocks. 2. Low-susceptibility, felsic rocks (e.g. quartzites and metasandstones). 3. Paramagnetic, felsic rocks (e.g. granites). 4. Ferrimagnetic, magnetite-containing felsic rocks (e.g. granites). 5. Paramagnetic mafic rocks (e.g. amphibolites and dolerites). 6. Ferrimagnetic, mafic rocks containing magnetite and high-density mafic minerals (mainly dolerites). Moreover, analysis revealed that the parameter distributions of even a single rock type (e.g. granites) can be very variable, forming separate clusters. This demonstrates that the simple calculation of density or susceptibility averages of rock types can be highly erratic. For example, the average can lie between two groups, where only few, if any, samples exist. Therefore, estimation of the representative density and susceptibility must be visually verified from these diagrams. The areal distribution of parameters and their calculated derivatives generally correlate well with the regional distribution of lithological and

  19. OrthoDisease: a database of human disease orthologs.

    PubMed

    O'Brien, Kevin P; Westerlund, Isabelle; Sonnhammer, Erik L L

    2004-08-01

    One of the greatest promises of genome sequencing projects is to further the understanding of human diseases and to develop new therapies. Model organism genomes have been sequenced in parallel to human genomes to provide effective tools for the investigation of human gene function. Many of their genes share a common ancestry and function with human genes, and this is particularly true for orthologous genes. Here we present OrthoDisease, a comprehensive database of model organism genes that are orthologous to human disease genes. OrthoDisease was constructed by applying the Inparanoid ortholog detection algorithm to disease genes derived from the Online Mendelian Inheritance in Man database (OMIM). Pairwise whole genome/proteome comparisons between Homo sapiens and six other organisms were performed to identify ortholog clusters. OMIM numbers were extracted from the OMIM Morbid Map and were converted to gene sequences using the Locuslink mim2loc and loc2acc tables. These were mapped to Inparanoid ortholog clusters using Blast. The number of ortholog clusters in OrthoDisease with each respective species is currently: M. musculus, 1,354; D. melanogaster, 724; C. elegans, 533; A. thaliana, 398; S. cerevisiae, 290; and E. coli, 153. The database is accessible online at http://orthodisease.cgb.ki.se, and can be searched with disease or protein names. The web interface presents all ortholog clusters that include a selected disease gene. A capability to download the entire dataset is also provided.

  20. Reference ballistic imaging database performance.

    PubMed

    De Kinder, Jan; Tulleners, Frederic; Thiebaut, Hugues

    2004-03-10

    Ballistic imaging databases allow law enforcement to link recovered cartridge cases to other crime scenes and to firearms. The success of these databases has led many to propose that all firearms in circulation be entered into a reference ballistic image database (RBID). To assess the performance of an RBID, we fired 4200 cartridge cases from 600 9mm Para Sig Sauer model P226 series pistols. Each pistol fired two Remington cartridges, one of which was imaged in the RBID, and five additional cartridges, consisting of Federal, Speer, Winchester, Wolf, and CCI brands. Randomly selected samples from the second series of Remington cartridge cases and from the five additional brands were then correlated against the RBID. Of the 32 cartridges of the same make correlated against the RBID, 72% ranked in the top 10 positions. Likewise, of the 160 cartridges of the five different brands correlated against the database, 21% ranked in the top 10 positions. Generally, the ranking position increased as the size of the RBID increased. We obtained similar results when we expanded the RBID to include firearms with the same class characteristics for breech face marks, firing pin impressions, and extractor marks. The results of our six queries against the RBID indicate that a reference ballistics image database of new guns is currently fraught with too many difficulties to be an effective and efficient law enforcement tool.

  1. NeuroPep: a comprehensive resource of neuropeptides.

    PubMed

    Wang, Yan; Wang, Mingxia; Yin, Sanwen; Jang, Richard; Wang, Jian; Xue, Zhidong; Xu, Tao

    2015-01-01

    Neuropeptides play a variety of roles in many physiological processes and serve as potential therapeutic targets for the treatment of some nervous-system disorders. In recent years, there has been a tremendous increase in the number of identified neuropeptides. Therefore, we have developed NeuroPep, a comprehensive resource of neuropeptides, which holds 5949 non-redundant neuropeptide entries originating from 493 organisms belonging to 65 neuropeptide families. In NeuroPep, the number of neuropeptides in invertebrates and vertebrates is 3455 and 2406, respectively. It is currently the most complete neuropeptide database. We extracted entries deposited in UniProt, the database (www.neuropeptides.nl) and NeuroPedia, and used text mining methods to retrieve entries from the MEDLINE abstracts and full text articles. All the entries in NeuroPep have been manually checked. 2069 of the 5949 (35%) neuropeptide sequences were collected from the scientific literature. Moreover, NeuroPep contains detailed annotations for each entry, including source organisms, tissue specificity, families, names, post-translational modifications, 3D structures (if available) and literature references. Information derived from these peptide sequences such as amino acid compositions, isoelectric points, molecular weight and other physicochemical properties of peptides are also provided. A quick search feature allows users to search the database with keywords such as sequence, name, family, etc., and an advanced search page helps users to combine queries with logical operators like AND/OR. In addition, user-friendly web tools like browsing, sequence alignment and mapping are also integrated into the NeuroPep database. Database URL: http://isyslab.info/NeuroPep

  2. NeuroPep: a comprehensive resource of neuropeptides

    PubMed Central

    Wang, Yan; Wang, Mingxia; Yin, Sanwen; Jang, Richard; Wang, Jian; Xue, Zhidong; Xu, Tao

    2015-01-01

    Neuropeptides play a variety of roles in many physiological processes and serve as potential therapeutic targets for the treatment of some nervous-system disorders. In recent years, there has been a tremendous increase in the number of identified neuropeptides. Therefore, we have developed NeuroPep, a comprehensive resource of neuropeptides, which holds 5949 non-redundant neuropeptide entries originating from 493 organisms belonging to 65 neuropeptide families. In NeuroPep, the number of neuropeptides in invertebrates and vertebrates is 3455 and 2406, respectively. It is currently the most complete neuropeptide database. We extracted entries deposited in UniProt, the database (www.neuropeptides.nl) and NeuroPedia, and used text mining methods to retrieve entries from the MEDLINE abstracts and full text articles. All the entries in NeuroPep have been manually checked. 2069 of the 5949 (35%) neuropeptide sequences were collected from the scientific literature. Moreover, NeuroPep contains detailed annotations for each entry, including source organisms, tissue specificity, families, names, post-translational modifications, 3D structures (if available) and literature references. Information derived from these peptide sequences such as amino acid compositions, isoelectric points, molecular weight and other physicochemical properties of peptides are also provided. A quick search feature allows users to search the database with keywords such as sequence, name, family, etc., and an advanced search page helps users to combine queries with logical operators like AND/OR. In addition, user-friendly web tools like browsing, sequence alignment and mapping are also integrated into the NeuroPep database. Database URL: http://isyslab.info/NeuroPep PMID:25931458

  3. Assessment System for Aircraft Noise (ASAN) citation database. Volume 2: Database update manual

    NASA Astrophysics Data System (ADS)

    Reddingius, Nicolaas

    1989-12-01

    The Assessment System for Aircraft Noise (ASAN) includes a database of several thousand references to the literature on the impact of noise and sonic booms on humans, animals and structures. Bibliographic data, abstracts and critical reviews of key documents can be retrieved. The manual for the database maintenance module is presented. It is only intended for use by the critical maintenance organization to prepare new releases of the database. Several programs used to add, delete and update the database are discussed are needed together with Vol. 2 to properly maintain the database.

  4. PvTFDB: a Phaseolus vulgaris transcription factors database for expediting functional genomics in legumes.

    PubMed

    Bhawna; Bonthala, V S; Gajula, Mnv Prasad

    2016-01-01

    The common bean [Phaseolus vulgaris (L.)] is one of the essential proteinaceous vegetables grown in developing countries. However, its production is challenged by low yields caused by numerous biotic and abiotic stress conditions. Regulatory transcription factors (TFs) symbolize a key component of the genome and are the most significant targets for producing stress tolerant crop and hence functional genomic studies of these TFs are important. Therefore, here we have constructed a web-accessible TFs database for P. vulgaris, called PvTFDB, which contains 2370 putative TF gene models in 49 TF families. This database provides a comprehensive information for each of the identified TF that includes sequence data, functional annotation, SSRs with their primer sets, protein physical properties, chromosomal location, phylogeny, tissue-specific gene expression data, orthologues, cis-regulatory elements and gene ontology (GO) assignment. Altogether, this information would be used in expediting the functional genomic studies of a specific TF(s) of interest. The objectives of this database are to understand functional genomics study of common bean TFs and recognize the regulatory mechanisms underlying various stress responses to ease breeding strategy for variety production through a couple of search interfaces including gene ID, functional annotation and browsing interfaces including by family and by chromosome. This database will also serve as a promising central repository for researchers as well as breeders who are working towards crop improvement of legume crops. In addition, this database provide the user unrestricted public access and the user can download entire data present in the database freely.Database URL: http://www.multiomics.in/PvTFDB/.

  5. PvTFDB: a Phaseolus vulgaris transcription factors database for expediting functional genomics in legumes

    PubMed Central

    Bhawna; Bonthala, V.S.; Gajula, MNV Prasad

    2016-01-01

    The common bean [Phaseolus vulgaris (L.)] is one of the essential proteinaceous vegetables grown in developing countries. However, its production is challenged by low yields caused by numerous biotic and abiotic stress conditions. Regulatory transcription factors (TFs) symbolize a key component of the genome and are the most significant targets for producing stress tolerant crop and hence functional genomic studies of these TFs are important. Therefore, here we have constructed a web-accessible TFs database for P. vulgaris, called PvTFDB, which contains 2370 putative TF gene models in 49 TF families. This database provides a comprehensive information for each of the identified TF that includes sequence data, functional annotation, SSRs with their primer sets, protein physical properties, chromosomal location, phylogeny, tissue-specific gene expression data, orthologues, cis-regulatory elements and gene ontology (GO) assignment. Altogether, this information would be used in expediting the functional genomic studies of a specific TF(s) of interest. The objectives of this database are to understand functional genomics study of common bean TFs and recognize the regulatory mechanisms underlying various stress responses to ease breeding strategy for variety production through a couple of search interfaces including gene ID, functional annotation and browsing interfaces including by family and by chromosome. This database will also serve as a promising central repository for researchers as well as breeders who are working towards crop improvement of legume crops. In addition, this database provide the user unrestricted public access and the user can download entire data present in the database freely. Database URL: http://www.multiomics.in/PvTFDB/ PMID:27465131

  6. The Hawaiian Freshwater Algal Database (HfwADB): a laboratory LIMS and online biodiversity resource

    PubMed Central

    2012-01-01

    Background Biodiversity databases serve the important role of highlighting species-level diversity from defined geographical regions. Databases that are specially designed to accommodate the types of data gathered during regional surveys are valuable in allowing full data access and display to researchers not directly involved with the project, while serving as a Laboratory Information Management System (LIMS). The Hawaiian Freshwater Algal Database, or HfwADB, was modified from the Hawaiian Algal Database to showcase non-marine algal specimens collected from the Hawaiian Archipelago by accommodating the additional level of organization required for samples including multiple species. Description The Hawaiian Freshwater Algal Database is a comprehensive and searchable database containing photographs and micrographs of samples and collection sites, geo-referenced collecting information, taxonomic data and standardized DNA sequence data. All data for individual samples are linked through unique 10-digit accession numbers (“Isolate Accession”), the first five of which correspond to the collection site (“Environmental Accession”). Users can search online for sample information by accession number, various levels of taxonomy, habitat or collection site. HfwADB is hosted at the University of Hawaii, and was made publicly accessible in October 2011. At the present time the database houses data for over 2,825 samples of non-marine algae from 1,786 collection sites from the Hawaiian Archipelago. These samples include cyanobacteria, red and green algae and diatoms, as well as lesser representation from some other algal lineages. Conclusions HfwADB is a digital repository that acts as a Laboratory Information Management System for Hawaiian non-marine algal data. Users can interact with the repository through the web to view relevant habitat data (including geo-referenced collection locations) and download images of collection sites, specimen photographs and

  7. PvTFDB: a Phaseolus vulgaris transcription factors database for expediting functional genomics in legumes.

    PubMed

    Bhawna; Bonthala, V S; Gajula, Mnv Prasad

    2016-01-01

    The common bean [Phaseolus vulgaris (L.)] is one of the essential proteinaceous vegetables grown in developing countries. However, its production is challenged by low yields caused by numerous biotic and abiotic stress conditions. Regulatory transcription factors (TFs) symbolize a key component of the genome and are the most significant targets for producing stress tolerant crop and hence functional genomic studies of these TFs are important. Therefore, here we have constructed a web-accessible TFs database for P. vulgaris, called PvTFDB, which contains 2370 putative TF gene models in 49 TF families. This database provides a comprehensive information for each of the identified TF that includes sequence data, functional annotation, SSRs with their primer sets, protein physical properties, chromosomal location, phylogeny, tissue-specific gene expression data, orthologues, cis-regulatory elements and gene ontology (GO) assignment. Altogether, this information would be used in expediting the functional genomic studies of a specific TF(s) of interest. The objectives of this database are to understand functional genomics study of common bean TFs and recognize the regulatory mechanisms underlying various stress responses to ease breeding strategy for variety production through a couple of search interfaces including gene ID, functional annotation and browsing interfaces including by family and by chromosome. This database will also serve as a promising central repository for researchers as well as breeders who are working towards crop improvement of legume crops. In addition, this database provide the user unrestricted public access and the user can download entire data present in the database freely.Database URL: http://www.multiomics.in/PvTFDB/. PMID:27465131

  8. Construction of Database for Pulsating Variable Stars

    NASA Astrophysics Data System (ADS)

    Chen, B. Q.; Yang, M.; Jiang, B. W.

    2011-07-01

    A database for the pulsating variable stars is constructed for Chinese astronomers to study the variable stars conveniently. The database includes about 230000 variable stars in the Galactic bulge, LMC and SMC observed by the MACHO (MAssive Compact Halo Objects) and OGLE (Optical Gravitational Lensing Experiment) projects at present. The software used for the construction is LAMP, i.e., Linux+Apache+MySQL+PHP. A web page is provided to search the photometric data and the light curve in the database through the right ascension and declination of the object. More data will be incorporated into the database.

  9. Applicability of large databases in outcomes research.

    PubMed

    Malay, Sunitha; Shauver, Melissa J; Chung, Kevin C

    2012-07-01

    Outcomes research serves as a mechanism to assess the quality of care, cost effectiveness of treatment, and other aspects of health care. The use of administrative databases in outcomes research is increasing in all medical specialties, including hand surgery. However, the real value of databases can be maximized with a thorough understanding of their contents, advantages, and limitations. We performed a literature review pertaining to databases in medical, surgical, and epidemiologic research, with special emphasis on orthopedic and hand surgery. This article provides an overview of the available database resources for outcomes research, their potential value to hand surgeons, and suggestions to improve their effective use. PMID:22522104

  10. Applicability of large databases in outcomes research.

    PubMed

    Malay, Sunitha; Shauver, Melissa J; Chung, Kevin C

    2012-07-01

    Outcomes research serves as a mechanism to assess the quality of care, cost effectiveness of treatment, and other aspects of health care. The use of administrative databases in outcomes research is increasing in all medical specialties, including hand surgery. However, the real value of databases can be maximized with a thorough understanding of their contents, advantages, and limitations. We performed a literature review pertaining to databases in medical, surgical, and epidemiologic research, with special emphasis on orthopedic and hand surgery. This article provides an overview of the available database resources for outcomes research, their potential value to hand surgeons, and suggestions to improve their effective use.

  11. National Geochronological Database

    USGS Publications Warehouse

    Revised by Sloan, Jan; Henry, Christopher D.; Hopkins, Melanie; Ludington, Steve; Original database by Zartman, Robert E.; Bush, Charles A.; Abston, Carl

    2003-01-01

    The National Geochronological Data Base (NGDB) was established by the United States Geological Survey (USGS) to collect and organize published isotopic (also known as radiometric) ages of rocks in the United States. The NGDB (originally known as the Radioactive Age Data Base, RADB) was started in 1974. A committee appointed by the Director of the USGS was given the mission to investigate the feasibility of compiling the published radiometric ages for the United States into a computerized data bank for ready access by the user community. A successful pilot program, which was conducted in 1975 and 1976 for the State of Wyoming, led to a decision to proceed with the compilation of the entire United States. For each dated rock sample reported in published literature, a record containing information on sample location, rock description, analytical data, age, interpretation, and literature citation was constructed and included in the NGDB. The NGDB was originally constructed and maintained on a mainframe computer, and later converted to a Helix Express relational database maintained on an Apple Macintosh desktop computer. The NGDB and a program to search the data files were published and distributed on Compact Disc-Read Only Memory (CD-ROM) in standard ISO 9660 format as USGS Digital Data Series DDS-14 (Zartman and others, 1995). As of May 1994, the NGDB consisted of more than 18,000 records containing over 30,000 individual ages, which is believed to represent approximately one-half the number of ages published for the United States through 1991. Because the organizational unit responsible for maintaining the database was abolished in 1996, and because we wanted to provide the data in more usable formats, we have reformatted the data, checked and edited the information in some records, and provided this online version of the NGDB. This report describes the changes made to the data and formats, and provides instructions for the use of the database in geographic

  12. Developing Database Files for Student Use.

    ERIC Educational Resources Information Center

    Warner, Michael

    1988-01-01

    Presents guidelines for creating student database files that supplement classroom teaching. Highlights include determining educational objectives, planning the database with computer specialists and subject area specialists, data entry, and creating student worksheets. Specific examples concerning elements of the periodic table and…

  13. Diet History Questionnaire II: Database Utility Program

    Cancer.gov

    If you need to modify the standard nutrient database, a single nutrient value must be provided by gender and portion size. If you have modified the database to have fewer or greater demographic groups, nutrient values must be included for each group.

  14. Diet History Questionnaire: Database Utility Program

    Cancer.gov

    If you need to modify the standard nutrient database, a single nutrient value must be provided by gender and portion size. If you have modified the database to have fewer or greater demographic groups, nutrient values must be included for each group.

  15. Designing Corporate Databases to Support Technology Innovation

    ERIC Educational Resources Information Center

    Gultz, Michael Jarett

    2012-01-01

    Based on a review of the existing literature on database design, this study proposed a unified database model to support corporate technology innovation. This study assessed potential support for the model based on the opinions of 200 technology industry executives, including Chief Information Officers, Chief Knowledge Officers and Chief Learning…

  16. Library Instruction and Online Database Searching.

    ERIC Educational Resources Information Center

    Mercado, Heidi

    1999-01-01

    Reviews changes in online database searching in academic libraries. Topics include librarians conducting all searches; the advent of end-user searching and the need for user instruction; compact disk technology; online public catalogs; the Internet; full text databases; electronic information literacy; user education and the remote library user;…

  17. Development of a Water Infrastructure Knowledge Database

    EPA Science Inventory

    This paper presents a methodology for developing a national database, as applied to water infrastructure systems, which includes both drinking water and wastewater. The database is branded as "WATERiD" and can be accessed at www.waterid.org. Water infrastructure in the U.S. is ag...

  18. RLIN Special Databases: Serving the Humanist.

    ERIC Educational Resources Information Center

    Muratori, Fred

    1990-01-01

    Describes online databases available through the Research Libraries Information Network (RLIN) that focus on research in the humanities. A survey of five databases (AVERY, ESTC, RIPD, SCIPIO, and AMC) is presented that includes contents, search capabilities, and record formats; and future plans are discussed. (12 references) (LRW)

  19. The Status of Statewide Subscription Databases

    ERIC Educational Resources Information Center

    Krueger, Karla S.

    2012-01-01

    This qualitative content analysis presents subscription databases available to school libraries through statewide purchases. The results may help school librarians evaluate grade and subject-area coverage, make comparisons to recommended databases, and note potential suggestions for their states to include in future contracts or for local…

  20. U.S. EPA'S ECOTOX DATABASE

    EPA Science Inventory

    In formulating hypothesis related to extrapolations across species and/or chemicals, the ECOTOX database provides researchers a means of locating high quality ecological effects data for a wide-range of terrestrial and aquatic receptors. Currently the database includes more than ...

  1. US-LHC MAGNET DATABASE AND CONVENTIONS.

    SciTech Connect

    WEI,J.; MCCHESNEY,D.; JAIN,A.; PEGGS,S.; PILAT,F.; BOTTURA,L.; SABBI,G.

    1999-03-29

    The US-LHC Magnet Database is designed for production-magnet quality assurance, field and alignment error impact analysis, cryostat assembly assistance, and ring installation assistance. The database consists of tables designed to store magnet field and alignment measurements data and quench data. This information will also be essential for future machine operations including local IR corrections.

  2. ITS-90 Thermocouple Database

    National Institute of Standards and Technology Data Gateway

    SRD 60 NIST ITS-90 Thermocouple Database (Web, free access)   Web version of Standard Reference Database 60 and NIST Monograph 175. The database gives temperature -- electromotive force (emf) reference functions and tables for the letter-designated thermocouple types B, E, J, K, N, R, S and T. These reference functions have been adopted as standards by the American Society for Testing and Materials (ASTM) and the International Electrotechnical Commission (IEC).

  3. Databases for Microbiologists

    DOE PAGES

    Zhulin, Igor B.

    2015-05-26

    Databases play an increasingly important role in biology. They archive, store, maintain, and share information on genes, genomes, expression data, protein sequences and structures, metabolites and reactions, interactions, and pathways. All these data are critically important to microbiologists. Furthermore, microbiology has its own databases that deal with model microorganisms, microbial diversity, physiology, and pathogenesis. Thousands of biological databases are currently available, and it becomes increasingly difficult to keep up with their development. Finally, the purpose of this minireview is to provide a brief survey of current databases that are of interest to microbiologists.

  4. Veterans Administration Databases

    Cancer.gov

    The Veterans Administration Information Resource Center provides database and informatics experts, customer service, expert advice, information products, and web technology to VA researchers and others.

  5. Databases for Microbiologists

    PubMed Central

    2015-01-01

    Databases play an increasingly important role in biology. They archive, store, maintain, and share information on genes, genomes, expression data, protein sequences and structures, metabolites and reactions, interactions, and pathways. All these data are critically important to microbiologists. Furthermore, microbiology has its own databases that deal with model microorganisms, microbial diversity, physiology, and pathogenesis. Thousands of biological databases are currently available, and it becomes increasingly difficult to keep up with their development. The purpose of this minireview is to provide a brief survey of current databases that are of interest to microbiologists. PMID:26013493

  6. Databases for LDEF results

    NASA Technical Reports Server (NTRS)

    Bohnhoff-Hlavacek, Gail

    1992-01-01

    One of the objectives of the team supporting the LDEF Systems and Materials Special Investigative Groups is to develop databases of experimental findings. These databases identify the hardware flown, summarize results and conclusions, and provide a system for acknowledging investigators, tracing sources of data, and future design suggestions. To date, databases covering the optical experiments, and thermal control materials (chromic acid anodized aluminum, silverized Teflon blankets, and paints) have been developed at Boeing. We used the Filemaker Pro software, the database manager for the Macintosh computer produced by the Claris Corporation. It is a flat, text-retrievable database that provides access to the data via an intuitive user interface, without tedious programming. Though this software is available only for the Macintosh computer at this time, copies of the databases can be saved to a format that is readable on a personal computer as well. Further, the data can be exported to more powerful relational databases, capabilities, and use of the LDEF databases and describe how to get copies of the database for your own research.

  7. Integrating Variances into an Analytical Database

    NASA Technical Reports Server (NTRS)

    Sanchez, Carlos

    2010-01-01

    For this project, I enrolled in numerous SATERN courses that taught the basics of database programming. These include: Basic Access 2007 Forms, Introduction to Database Systems, Overview of Database Design, and others. My main job was to create an analytical database that can handle many stored forms and make it easy to interpret and organize. Additionally, I helped improve an existing database and populate it with information. These databases were designed to be used with data from Safety Variances and DCR forms. The research consisted of analyzing the database and comparing the data to find out which entries were repeated the most. If an entry happened to be repeated several times in the database, that would mean that the rule or requirement targeted by that variance has been bypassed many times already and so the requirement may not really be needed, but rather should be changed to allow the variance's conditions permanently. This project did not only restrict itself to the design and development of the database system, but also worked on exporting the data from the database to a different format (e.g. Excel or Word) so it could be analyzed in a simpler fashion. Thanks to the change in format, the data was organized in a spreadsheet that made it possible to sort the data by categories or types and helped speed up searches. Once my work with the database was done, the records of variances could be arranged so that they were displayed in numerical order, or one could search for a specific document targeted by the variances and restrict the search to only include variances that modified a specific requirement. A great part that contributed to my learning was SATERN, NASA's resource for education. Thanks to the SATERN online courses I took over the summer, I was able to learn many new things about computers and databases and also go more in depth into topics I already knew about.

  8. The Plant Organelles Database 3 (PODB3) update 2014: integrating electron micrographs and new options for plant organelle research.

    PubMed

    Mano, Shoji; Nakamura, Takanori; Kondo, Maki; Miwa, Tomoki; Nishikawa, Shuh-ichi; Mimura, Tetsuro; Nagatani, Akira; Nishimura, Mikio

    2014-01-01

    The Plant Organelles Database 2 (PODB2), which was first launched in 2006 as PODB, provides static image and movie data of plant organelles, protocols for plant organelle research and external links to relevant websites. PODB2 has facilitated plant organellar research and the understanding of plant organelle dynamics. To provide comprehensive information on plant organelles in more detail, PODB2 was updated to PODB3 (http://podb.nibb.ac.jp/Organellome/). PODB3 contains two additional components: the electron micrograph database and the perceptive organelles database. Through the electron micrograph database, users can examine the subcellular and/or suborganellar structures in various organs of wild-type and mutant plants. The perceptive organelles database provides information on organelle dynamics in response to external stimuli. In addition to the extra components, the user interface for access has been enhanced in PODB3. The data in PODB3 are directly submitted by plant researchers and can be freely downloaded for use in further analysis. PODB3 contains all the information included in PODB2, and the volume of data and protocols deposited in PODB3 continue to grow steadily. We welcome contributions of data from all plant researchers to enhance the utility and comprehensiveness of PODB3. PMID:24092884

  9. ONRLDB--manually curated database of experimentally validated ligands for orphan nuclear receptors: insights into new drug discovery.

    PubMed

    Nanduri, Ravikanth; Bhutani, Isha; Somavarapu, Arun Kumar; Mahajan, Sahil; Parkesh, Raman; Gupta, Pawan

    2015-01-01

    Orphan nuclear receptors are potential therapeutic targets. The Orphan Nuclear Receptor Ligand Binding Database (ONRLDB) is an interactive, comprehensive and manually curated database of small molecule ligands targeting orphan nuclear receptors. Currently, ONRLDB consists of ∼11,000 ligands, of which ∼6500 are unique. All entries include information for the ligand, such as EC50 and IC50, number of aromatic rings and rotatable bonds, XlogP, hydrogen donor and acceptor count, molecular weight (MW) and structure. ONRLDB is a cross-platform database, where either the cognate small molecule modulators of a receptor or the cognate receptors to a ligand can be searched. The database can be searched using three methods: text search, advanced search or similarity search. Substructure search, cataloguing tools, and clustering tools can be used to perform advanced analysis of the ligand based on chemical similarity fingerprints, hierarchical clustering, binning partition and multidimensional scaling. These tools, together with the Tree function provided, deliver an interactive platform and a comprehensive resource for identification of common and unique scaffolds. As demonstrated, ONRLDB is designed to allow selection of ligands based on various properties and for designing novel ligands or to improve the existing ones. Database URL: http://www.onrldb.org/.

  10. ONRLDB—manually curated database of experimentally validated ligands for orphan nuclear receptors: insights into new drug discovery

    PubMed Central

    Nanduri, Ravikanth; Bhutani, Isha; Somavarapu, Arun Kumar; Mahajan, Sahil; Parkesh, Raman; Gupta, Pawan

    2015-01-01

    Orphan nuclear receptors are potential therapeutic targets. The Orphan Nuclear Receptor Ligand Binding Database (ONRLDB) is an interactive, comprehensive and manually curated database of small molecule ligands targeting orphan nuclear receptors. Currently, ONRLDB consists of ∼11 000 ligands, of which ∼6500 are unique. All entries include information for the ligand, such as EC50 and IC50, number of aromatic rings and rotatable bonds, XlogP, hydrogen donor and acceptor count, molecular weight (MW) and structure. ONRLDB is a cross-platform database, where either the cognate small molecule modulators of a receptor or the cognate receptors to a ligand can be searched. The database can be searched using three methods: text search, advanced search or similarity search. Substructure search, cataloguing tools, and clustering tools can be used to perform advanced analysis of the ligand based on chemical similarity fingerprints, hierarchical clustering, binning partition and multidimensional scaling. These tools, together with the Tree function provided, deliver an interactive platform and a comprehensive resource for identification of common and unique scaffolds. As demonstrated, ONRLDB is designed to allow selection of ligands based on various properties and for designing novel ligands or to improve the existing ones. Database URL: http://www.onrldb.org/ PMID:26637529

  11. Interactive bibliographical database on color

    NASA Astrophysics Data System (ADS)

    Caivano, Jose L.

    2002-06-01

    The paper describes the methodology and results of a project under development, aimed at the elaboration of an interactive bibliographical database on color in all fields of application: philosophy, psychology, semiotics, education, anthropology, physical and natural sciences, biology, medicine, technology, industry, architecture and design, arts, linguistics, geography, history. The project is initially based upon an already developed bibliography, published in different journals, updated in various opportunities, and now available at the Internet, with more than 2,000 entries. The interactive database will amplify that bibliography, incorporating hyperlinks and contents (indexes, abstracts, keywords, introductions, or eventually the complete document), and devising mechanisms for information retrieval. The sources to be included are: books, doctoral dissertations, multimedia publications, reference works. The main arrangement will be chronological, but the design of the database will allow rearrangements or selections by different fields: subject, Decimal Classification System, author, language, country, publisher, etc. A further project is to develop another database, including color-specialized journals or newsletters, and articles on color published in international journals, arranged in this case by journal name and date of publication, but allowing also rearrangements or selections by author, subject and keywords.

  12. The Effect of Explicit Teaching of Comprehension Strategies on Reading Comprehension in Elementary School

    ERIC Educational Resources Information Center

    Atkins, James Grant

    2013-01-01

    This study investigated the effect of explicit teaching of comprehension strategies on the comprehension performance of elementary school students. Two schools with similar demographics, including a significant proportion of students at risk for reading failure, participated in the study. One school utilized an explicit comprehension strategy…

  13. SymbioGenomesDB: a database for the integration and access to knowledge on host-symbiont relationships.

    PubMed

    Reyes-Prieto, Mariana; Vargas-Chávez, Carlos; Latorre, Amparo; Moya, Andrés

    2015-01-01

    Symbiotic relationships occur naturally throughout the tree of life, either in a commensal, mutualistic or pathogenic manner. The genomes of multiple organisms involved in symbiosis are rapidly being sequenced and becoming available, especially those from the microbial world. Currently, there are numerous databases that offer information on specific organisms or models, but none offer a global understanding on relationships between organisms, their interactions and capabilities within their niche, as well as their role as part of a system, in this case, their role in symbiosis. We have developed the SymbioGenomesDB as a community database resource for laboratories which intend to investigate and use information on the genetics and the genomics of organisms involved in these relationships. The ultimate goal of SymbioGenomesDB is to host and support the growing and vast symbiotic-host relationship information, to uncover the genetic basis of such associations. SymbioGenomesDB maintains a comprehensive organization of information on genomes of symbionts from diverse hosts throughout the Tree of Life, including their sequences, their metadata and their genomic features. This catalog of relationships was generated using computational tools, custom R scripts and manual integration of data available in public literature. As a highly curated and comprehensive systems database, SymbioGenomesDB provides web access to all the information of symbiotic organisms, their features and links to the central database NCBI. Three different tools can be found within the database to explore symbiosis-related organisms, their genes and their genomes. Also, we offer an orthology search for one or multiple genes in one or multiple organisms within symbiotic relationships, and every table, graph and output file is downloadable and easy to parse for further analysis. The robust SymbioGenomesDB will be constantly updated to cope with all the data being generated and included in major

  14. SymbioGenomesDB: a database for the integration and access to knowledge on host–symbiont relationships

    PubMed Central

    Reyes-Prieto, Mariana; Vargas-Chávez, Carlos; Latorre, Amparo; Moya, Andrés

    2015-01-01

    Symbiotic relationships occur naturally throughout the tree of life, either in a commensal, mutualistic or pathogenic manner. The genomes of multiple organisms involved in symbiosis are rapidly being sequenced and becoming available, especially those from the microbial world. Currently, there are numerous databases that offer information on specific organisms or models, but none offer a global understanding on relationships between organisms, their interactions and capabilities within their niche, as well as their role as part of a system, in this case, their role in symbiosis. We have developed the SymbioGenomesDB as a community database resource for laboratories which intend to investigate and use information on the genetics and the genomics of organisms involved in these relationships. The ultimate goal of SymbioGenomesDB is to host and support the growing and vast symbiotic–host relationship information, to uncover the genetic basis of such associations. SymbioGenomesDB maintains a comprehensive organization of information on genomes of symbionts from diverse hosts throughout the Tree of Life, including their sequences, their metadata and their genomic features. This catalog of relationships was generated using computational tools, custom R scripts and manual integration of data available in public literature. As a highly curated and comprehensive systems database, SymbioGenomesDB provides web access to all the information of symbiotic organisms, their features and links to the central database NCBI. Three different tools can be found within the database to explore symbiosis-related organisms, their genes and their genomes. Also, we offer an orthology search for one or multiple genes in one or multiple organisms within symbiotic relationships, and every table, graph and output file is downloadable and easy to parse for further analysis. The robust SymbioGenomesDB will be constantly updated to cope with all the data being generated and included in major

  15. SymbioGenomesDB: a database for the integration and access to knowledge on host-symbiont relationships.

    PubMed

    Reyes-Prieto, Mariana; Vargas-Chávez, Carlos; Latorre, Amparo; Moya, Andrés

    2015-01-01

    Symbiotic relationships occur naturally throughout the tree of life, either in a commensal, mutualistic or pathogenic manner. The genomes of multiple organisms involved in symbiosis are rapidly being sequenced and becoming available, especially those from the microbial world. Currently, there are numerous databases that offer information on specific organisms or models, but none offer a global understanding on relationships between organisms, their interactions and capabilities within their niche, as well as their role as part of a system, in this case, their role in symbiosis. We have developed the SymbioGenomesDB as a community database resource for laboratories which intend to investigate and use information on the genetics and the genomics of organisms involved in these relationships. The ultimate goal of SymbioGenomesDB is to host and support the growing and vast symbiotic-host relationship information, to uncover the genetic basis of such associations. SymbioGenomesDB maintains a comprehensive organization of information on genomes of symbionts from diverse hosts throughout the Tree of Life, including their sequences, their metadata and their genomic features. This catalog of relationships was generated using computational tools, custom R scripts and manual integration of data available in public literature. As a highly curated and comprehensive systems database, SymbioGenomesDB provides web access to all the information of symbiotic organisms, their features and links to the central database NCBI. Three different tools can be found within the database to explore symbiosis-related organisms, their genes and their genomes. Also, we offer an orthology search for one or multiple genes in one or multiple organisms within symbiotic relationships, and every table, graph and output file is downloadable and easy to parse for further analysis. The robust SymbioGenomesDB will be constantly updated to cope with all the data being generated and included in major

  16. A Web-based database of genetic association studies in cutaneous melanoma enhanced with network-driven data exploration tools

    PubMed Central

    Athanasiadis, Emmanouil I.; Antonopoulou, Kyriaki; Chatzinasiou, Foteini; Lill, Christina M.; Bourdakou, Marilena M.; Sakellariou, Argiris; Kypreou, Katerina; Stefanaki, Irene; Evangelou, Evangelos; Ioannidis, John P.A.; Bertram, Lars; Stratigos, Alexander J.; Spyrou, George M.

    2014-01-01

    The publicly available online database MelGene provides a comprehensive, regularly updated, collection of data from genetic association studies in cutaneous melanoma (CM), including random-effects meta-analysis results of all eligible polymorphisms. The updated database version includes data from 192 publications with information on 1114 significantly associated polymorphisms across 280 genes, along with new front-end and back-end capabilities. Various types of relationships between data are calculated and visualized as networks. We constructed 13 different networks containing the polymorphisms and the genes included in MelGene. We explored the derived network representations under the following questions: (i) are there nodes that deserve consideration regarding their network connectivity characteristics? (ii) What is the relation of either the genome-wide or nominally significant CM polymorphisms/genes with the ones highlighted by the network representation? We show that our network approach using the MelGene data reveals connections between statistically significant genes/ polymorphisms and other genes/polymorphisms acting as ‘hubs’ in the reconstructed networks. To the best of our knowledge, this is the first database containing data from a comprehensive field synopsis and systematic meta-analyses of genetic polymorphisms in CM that provides user-friendly tools for in-depth molecular network visualization and exploration. The proposed network connections highlight potentially new loci requiring further investigation of their relation to melanoma risk. Database URL: http://www.melgene.org. PMID:25380778

  17. A Web-based database of genetic association studies in cutaneous melanoma enhanced with network-driven data exploration tools.

    PubMed

    Athanasiadis, Emmanouil I; Antonopoulou, Kyriaki; Chatzinasiou, Foteini; Lill, Christina M; Bourdakou, Marilena M; Sakellariou, Argiris; Kypreou, Katerina; Stefanaki, Irene; Evangelou, Evangelos; Ioannidis, John P A; Bertram, Lars; Stratigos, Alexander J; Spyrou, George M

    2014-01-01

    The publicly available online database MelGene provides a comprehensive, regularly updated, collection of data from genetic association studies in cutaneous melanoma (CM), including random-effects meta-analysis results of all eligible polymorphisms. The updated database version includes data from 192 publications with information on 1114 significantly associated polymorphisms across 280 genes, along with new front-end and back-end capabilities. Various types of relationships between data are calculated and visualized as networks. We constructed 13 different networks containing the polymorphisms and the genes included in MelGene. We explored the derived network representations under the following questions: (i) are there nodes that deserve consideration regarding their network connectivity characteristics? (ii) What is the relation of either the genome-wide or nominally significant CM polymorphisms/genes with the ones highlighted by the network representation? We show that our network approach using the MelGene data reveals connections between statistically significant genes/ polymorphisms and other genes/polymorphisms acting as 'hubs' in the reconstructed networks. To the best of our knowledge, this is the first database containing data from a comprehensive field synopsis and systematic meta-analyses of genetic polymorphisms in CM that provides user-friendly tools for in-depth molecular network visualization and exploration. The proposed network connections highlight potentially new loci requiring further investigation of their relation to melanoma risk. Database URL: http://www.melgene.org. PMID:25380778

  18. PAGES-Powell North America 2k database

    NASA Astrophysics Data System (ADS)

    McKay, N.

    2014-12-01

    Syntheses of paleoclimate data in North America are essential for understanding long-term spatiotemporal variability in climate and for properly assessing risk on decadal and longer timescales. Existing reconstructions of the past 2,000 years rely almost exclusively on tree-ring records, which can underestimate low-frequency variability and rarely extend beyond the last millennium. Meanwhile, many records from the full spectrum of paleoclimate archives are available and hold the potential of enhancing our understanding of past climate across North America over the past 2000 years. The second phase of the Past Global Changes (PAGES) North America 2k project began in 2014, with a primary goal of assembling these disparate paleoclimate records into a unified database. This effort is currently supported by the USGS Powell Center together with PAGES. Its success requires grassroots support from the community of researchers developing and interpreting paleoclimatic evidence relevant to the past 2000 years. Most likely, fewer than half of the published records appropriate for this database are publicly archived, and far fewer include the data needed to quantify geochronologic uncertainty, or to concisely describe how best to interpret the data in context of a large-scale paleoclimatic synthesis. The current version of the database includes records that (1) have been published in a peer-reviewed journal (including evidence of the record's relationship to climate), (2) cover a substantial portion of the past 2000 yr (>300 yr for annual records, >500 yr for lower frequency records) at relatively high resolution (<50 yr/observation), and (3) have reasonably small and quantifiable age uncertainty. Presently, the database includes records from boreholes, ice cores, lake and marine sediments, speleothems, and tree rings. This poster presentation will display the site locations and basic metadata of the records currently in the database. We invite anyone with interest in

  19. Reference Fluid Thermodynamic and Transport Properties Database (REFPROP)

    National Institute of Standards and Technology Data Gateway

    SRD 23 NIST Reference Fluid Thermodynamic and Transport Properties Database (REFPROP) (PC database for purchase)   NIST 23 contains revised data in a Windows version of the database, including 105 pure fluids and allowing mixtures of up to 20 components. The fluids include the environmentally acceptable HFCs, traditional HFCs and CFCs and 'natural' refrigerants like ammonia

  20. The CATDAT damaging earthquakes database

    NASA Astrophysics Data System (ADS)

    Daniell, J. E.; Khazai, B.; Wenzel, F.; Vervaeck, A.

    2011-08-01

    The global CATDAT damaging earthquakes and secondary effects (tsunami, fire, landslides, liquefaction and fault rupture) database was developed to validate, remove discrepancies, and expand greatly upon existing global databases; and to better understand the trends in vulnerability, exposure, and possible future impacts of such historic earthquakes. Lack of consistency and errors in other earthquake loss databases frequently cited and used in analyses was a major shortcoming in the view of the authors which needed to be improved upon. Over 17 000 sources of information have been utilised, primarily in the last few years, to present data from over 12 200 damaging earthquakes historically, with over 7000 earthquakes since 1900 examined and validated before insertion into the database. Each validated earthquake includes seismological information, building damage, ranges of social losses to account for varying sources (deaths, injuries, homeless, and affected), and economic losses (direct, indirect, aid, and insured). Globally, a slightly increasing trend in economic damage due to earthquakes is not consistent with the greatly increasing exposure. The 1923 Great Kanto (214 billion USD damage; 2011 HNDECI-adjusted dollars) compared to the 2011 Tohoku (>300 billion USD at time of writing), 2008 Sichuan and 1995 Kobe earthquakes show the increasing concern for economic loss in urban areas as the trend should be expected to increase. Many economic and social loss values not reported in existing databases have been collected. Historical GDP (Gross Domestic Product), exchange rate, wage information, population, HDI (Human Development Index), and insurance information have been collected globally to form comparisons. This catalogue is the largest known cross-checked global historic damaging earthquake database and should have far-reaching consequences for earthquake loss estimation, socio-economic analysis, and the global reinsurance field.

  1. The 2016 database issue of Nucleic Acids Research and an updated molecular biology database collection.

    PubMed

    Rigden, Daniel J; Fernández-Suárez, Xosé M; Galperin, Michael Y

    2016-01-01

    The 2016 Database Issue of Nucleic Acids Research starts with overviews of the resources provided by three major bioinformatics centers, the U.S. National Center for Biotechnology Information (NCBI), the European Bioinformatics Institute (EMBL-EBI) and Swiss Institute for Bioinformatics (SIB). Also included are descriptions of 62 new databases and updates on 95 databases that have been previously featured in NAR plus 17 previously described elsewhere. A number of papers in this issue deal with resources on nucleic acids, including various kinds of non-coding RNAs and their interactions, molecular dynamics simulations of nucleic acid structure, and two databases of super-enhancers. The protein database section features important updates on the EBI's Pfam, PDBe and PRIDE databases, as well as a variety of resources on pathways, metabolomics and metabolic modeling. This issue also includes updates on popular metagenomics resources, such as MG-RAST, EBI Metagenomics, and probeBASE, as well as a newly compiled Human Pan-Microbe Communities database. A significant fraction of the new and updated databases are dedicated to the genetic basis of disease, primarily cancer, and various aspects of drug research, including resources for patented drugs, their side effects, withdrawn drugs, and potential drug targets. A further six papers present updated databases of various antimicrobial and anticancer peptides. The entire Database Issue is freely available online on the Nucleic Acids Research website (http://nar.oxfordjournals.org/). The NAR online Molecular Biology Database Collection, http://www.oxfordjournals.org/nar/database/c/, has been updated with the addition of 88 new resources and removal of 23 obsolete websites, which brought the current listing to 1685 databases. PMID:26740669

  2. The 2016 database issue of Nucleic Acids Research and an updated molecular biology database collection

    PubMed Central

    Rigden, Daniel J.; Fernández-Suárez, Xosé M.; Galperin, Michael Y.

    2016-01-01

    The 2016 Database Issue of Nucleic Acids Research starts with overviews of the resources provided by three major bioinformatics centers, the U.S. National Center for Biotechnology Information (NCBI), the European Bioinformatics Institute (EMBL-EBI) and Swiss Institute for Bioinformatics (SIB). Also included are descriptions of 62 new databases and updates on 95 databases that have been previously featured in NAR plus 17 previously described elsewhere. A number of papers in this issue deal with resources on nucleic acids, including various kinds of non-coding RNAs and their interactions, molecular dynamics simulations of nucleic acid structure, and two databases of super-enhancers. The protein database section features important updates on the EBI's Pfam, PDBe and PRIDE databases, as well as a variety of resources on pathways, metabolomics and metabolic modeling. This issue also includes updates on popular metagenomics resources, such as MG-RAST, EBI Metagenomics, and probeBASE, as well as a newly compiled Human Pan-Microbe Communities database. A significant fraction of the new and updated databases are dedicated to the genetic basis of disease, primarily cancer, and various aspects of drug research, including resources for patented drugs, their side effects, withdrawn drugs, and potential drug targets. A further six papers present updated databases of various antimicrobial and anticancer peptides. The entire Database Issue is freely available online on the Nucleic Acids Research website (http://nar.oxfordjournals.org/). The NAR online Molecular Biology Database Collection, http://www.oxfordjournals.org/nar/database/c/, has been updated with the addition of 88 new resources and removal of 23 obsolete websites, which brought the current listing to 1685 databases. PMID:26740669

  3. The 2016 database issue of Nucleic Acids Research and an updated molecular biology database collection.

    PubMed

    Rigden, Daniel J; Fernández-Suárez, Xosé M; Galperin, Michael Y

    2016-01-01

    The 2016 Database Issue of Nucleic Acids Research starts with overviews of the resources provided by three major bioinformatics centers, the U.S. National Center for Biotechnology Information (NCBI), the European Bioinformatics Institute (EMBL-EBI) and Swiss Institute for Bioinformatics (SIB). Also included are descriptions of 62 new databases and updates on 95 databases that have been previously featured in NAR plus 17 previously described elsewhere. A number of papers in this issue deal with resources on nucleic acids, including various kinds of non-coding RNAs and their interactions, molecular dynamics simulations of nucleic acid structure, and two databases of super-enhancers. The protein database section features important updates on the EBI's Pfam, PDBe and PRIDE databases, as well as a variety of resources on pathways, metabolomics and metabolic modeling. This issue also includes updates on popular metagenomics resources, such as MG-RAST, EBI Metagenomics, and probeBASE, as well as a newly compiled Human Pan-Microbe Communities database. A significant fraction of the new and updated databases are dedicated to the genetic basis of disease, primarily cancer, and various aspects of drug research, including resources for patented drugs, their side effects, withdrawn drugs, and potential drug targets. A further six papers present updated databases of various antimicrobial and anticancer peptides. The entire Database Issue is freely available online on the Nucleic Acids Research website (http://nar.oxfordjournals.org/). The NAR online Molecular Biology Database Collection, http://www.oxfordjournals.org/nar/database/c/, has been updated with the addition of 88 new resources and removal of 23 obsolete websites, which brought the current listing to 1685 databases.

  4. ORNL RAIL & BARGE DB. Network Database

    SciTech Connect

    Johnson, P.

    1991-07-01

    The Oak Ridge National Laboratory (ORNL) Rail and Barge Network Database is a representation of the rail and barge system of the United States. The network is derived from the Federal Rail Administration (FRA) rail database. The database consists of 96 subnetworks. Each of the subnetworks represent an individual railroad, a waterway system, or a composite group of small railroads. Two subnetworks represent waterways; one being barge/intercoastal, and the other coastal merchant marine with access through the Great Lakes/Saint Lawrence Seaway, Atlantic and Gulf Coasts, the Panama Canal, and Pacific Coast. Two other subnetworks represent small shortline railroads and terminal railroad operations. One subnetwork is maintained for the representation of Amtrak operations. The remaining 91 subnetworks represent individual or corporate groups of railroads. Coordinate locations are included as part of the database. The rail portion of the database is similar to the original FRA rail network. The waterway coordinates are greatly enhanced in the current release. Inland waterway representation was extracted from the 1:2,000,000 United States Geological Survey data. An important aspect of the database is the transfer file. This file identifies where two railroads interline traffic between their systems. Also included are locations where rail/waterway intermodal transfers could occur. Other files in the database include a translation table between Association of American Railroad (AAR) codes to the 96 subnetworks in the database, a list of names of the 96 subnetworks, and a file of names for a large proportion of the nodes in the network.

  5. ORNL RAIL & BARGE DB. Network Database

    SciTech Connect

    Johnson, P.

    1992-03-16

    The Oak Ridge National Laboratory (ORNL) Rail and Barge Network Database is a representation of the rail and barge system of the United States. The network is derived from the Federal Rail Administration (FRA) rail database. The database consists of 96 subnetworks. Each of the subnetworks represent an individual railroad, a waterway system, or a composite group of small railroads. Two subnetworks represent waterways; one being barge/intercoastal, and the other coastal merchant marine with access through the Great Lakes/Saint Lawrence Seaway, Atlantic and Gulf Coasts, the Panama Canal, and Pacific Coast. Two other subnetworks represent small shortline railroads and terminal railroad operations. One subnetwork is maintained for the representation of Amtrak operations. The remaining 91 subnetworks represent individual or corporate groups of railroads. Coordinate locations are included as part of the database. The rail portion of the database is similar to the original FRA rail network. The waterway coordinates are greatly enhanced in the current release. Inland waterway representation was extracted from the 1:2,000,000 United States Geological Survey data. An important aspect of the database is the transfer file. This file identifies where two railroads interline traffic between their systems. Also included are locations where rail/waterway intermodal transfers could occur. Other files in the database include a translation table between Association of American Railroad (AAR) codes to the 96 subnetworks in the database, a list of names of the 96 subnetworks, and a file of names for a large proportion of the nodes in the network.

  6. The UCSC Genome Browser database: 2015 update

    PubMed Central

    Rosenbloom, Kate R.; Armstrong, Joel; Barber, Galt P.; Casper, Jonathan; Clawson, Hiram; Diekhans, Mark; Dreszer, Timothy R.; Fujita, Pauline A.; Guruvadoo, Luvina; Haeussler, Maximilian; Harte, Rachel A.; Heitner, Steve; Hickey, Glenn; Hinrichs, Angie S.; Hubley, Robert; Karolchik, Donna; Learned, Katrina; Lee, Brian T.; Li, Chin H.; Miga, Karen H.; Nguyen, Ngan; Paten, Benedict; Raney, Brian J.; Smit, Arian F. A.; Speir, Matthew L.; Zweig, Ann S.; Haussler, David; Kuhn, Robert M.; Kent, W. James

    2015-01-01

    Launched in 2001 to showcase the draft human genome assembly, the UCSC Genome Browser database (http://genome.ucsc.edu) and associated tools continue to grow, providing a comprehensive resource of genome assemblies and annotations to scientists and students worldwide. Highlights of the past year include the release of a browser for the first new human genome reference assembly in 4 years in December 2013 (GRCh38, UCSC hg38), a watershed comparative genomics annotation (100-species multiple alignment and conservation) and a novel distribution mechanism for the browser (GBiB: Genome Browser in a Box). We created browsers for new species (Chinese hamster, elephant shark, minke whale), ‘mined the web’ for DNA sequences and expanded the browser display with stacked color graphs and region highlighting. As our user community increasingly adopts the UCSC track hub and assembly hub representations for sharing large-scale genomic annotation data sets and genome sequencing projects, our menu of public data hubs has tripled. PMID:25428374

  7. The worldwide collapse caldera database (CCDB): A tool for studying and understanding caldera processes

    NASA Astrophysics Data System (ADS)

    Geyer, Adelina; Marti, Joan

    2015-04-01

    Collapse calderas are one of the most important volcanic structures not only because of their hazard implications, but also because of their high geothermal energy potential and their association with mineral deposits of economic interest. In 2008 we presented a new general worldwide Collapse Caldera DataBase (CCDB), in order to provide a useful and accessible tool for studying and understanding caldera collapse processes. The principal aim of the CCDB is to update the current field based knowledge on calderas, merging together the existing databases and complementing them with new examples found in the bibliography, and leaving it open for the incorporation of new data from future studies. Currently, the database includes over 450 documented calderas around the world, trying to be representative enough to promote further studies and analyses. We have performed a comprehensive compilation of published field studies of collapse calderas including more than 500 references, and their information has been summarized in a database linked to a Geographical Information System (GIS) application. Thus, it is possible to visualize the selected calderas on a world map and to filter them according to different features recorded in the database (e.g. age, structure). The information recorded in the CCDB can be grouped in seven main information classes: caldera features, properties of the caldera-forming deposits, magmatic system, geodynamic setting, pre-caldera volcanism,caldera-forming eruption sequence and post-caldera activity. Additionally, we have added two extra classes. The first records the references consulted for each caldera. The second allows users to introduce comments on the caldera sample such as possible controversies concerning the caldera origin. During the last seven years, the database has been available on-line at http://www.gvb-csic.es/CCDB.htm previous registration. This year, the CCDB webpage will be updated and improved so the database content can be

  8. Comprehension for All

    ERIC Educational Resources Information Center

    Snowball, Diane

    2006-01-01

    Teaching and learning about comprehension is of paramount importance, from the beginning of reading instruction right through all years of school, in all curriculum areas. Research has shown that comprehension instruction can improve the reading comprehension of all readers, even beginning readers and struggling older readers. This article offers…

  9. Teaching Comprehension Skills.

    ERIC Educational Resources Information Center

    Georgia Association of School Superintendents.

    Materials used in a one-day conference on teaching reading comprehension skills are summarized in this publication. Contents consist of three articles on teaching the comprehension skills, informal reading inventories in science and in geography, Lincoln's Gettysburg Address with comprehension questions, a checklist for the evaluation of teaching…

  10. Selective Comprehensives: The Social Composition of Top Comprehensive Schools

    ERIC Educational Resources Information Center

    Sutton Trust, 2013

    2013-01-01

    This study looks at publicly available data on the proportion of pupils eligible and claiming for free school meals (FSM) in the top 500 comprehensive state schools and at how representative they are of their localities and of their school type. We have looked at the top 500 when measured by five good GCSEs including English and Maths and at the…

  11. IPShocks: Database of Interplanetary Shock Waves

    NASA Astrophysics Data System (ADS)

    Isavnin, Alexey; Lumme, Erkka; Kilpua, Emilia; Lotti, Mikko; Andreeova, Katerina; Koskinen, Hannu; Nikbakhsh, Shabnam

    2016-04-01

    Fast collisionless shocks are one of the key interplanetary structures, which have also paramount role for solar-terrestrial physics. In particular, coronal mass ejection driven shocks accelerate particles to high energies and turbulent post-shock flows may drive intense geomagnetic storms. We present comprehensive Heliospheric Shock Database (ipshocks.fi) developed and hosted at University of Helsinki. The database contains currently over 2000 fast forward and fast reverse shocks observed by Wind, ACE, STEREO, Helios, Ulysses and Cluster spacecraft. In addition, the database has search and sort tools based on the spacecraft, time range, and several key shock parameters (e.g., shock type, shock strength, shock angle), data plots for each shock and data download options. These features allow easy access to shocks and quick statistical analyses. All current shocks are identified visually and analysed using the same procedure.

  12. SCOP: a structural classification of proteins database.

    PubMed

    Hubbard, T J; Murzin, A G; Brenner, S E; Chothia, C

    1997-01-01

    The Structural Classification of Proteins (SCOP) database provides a detailed and comprehensive description of the relationships of all known proteins structures. The classification is on hierarchical levels: the first two levels, family and superfamily, describe near and far evolutionary relationships; the third, fold, describes geometrical relationships. The distinction between evolutionary relationships and those that arise from the physics and chemistry of proteins is a feature that is unique to this database, so far. SCOP also provides for each structure links to atomic co-ordinates, images of the structures, interactive viewers, sequence data, data on any conformational changes related to function and literature references. The database is freely accessible on the World Wide Web (WWW) with an entry point at URL http://scop.mrc-lmb.cam.ac.uk/scop/

  13. The PIR-International Protein Sequence Database.

    PubMed

    Barker, W C; Garavelli, J S; Haft, D H; Hunt, L T; Marzec, C R; Orcutt, B C; Srinivasarao, G Y; Yeh, L S; Ledley, R S; Mewes, H W; Pfeiffer, F; Tsugita, A

    1998-01-01

    From its origin the Protein Information Resource (http://www-nbrf. georgetown.edu/pir/) has supported research on evolution and computational biology by designing and compiling a comprehensive, quality controlled, and well-organized protein sequence database. The database has been produced and updated on a regular schedule since 1984. Since 1988 it has been maintained collaboratively by the PIR-International, an association of data collection centers engaged in international cooperation for the development of this research resource during a period of explosive acquisition of new data. As of June 1997, essentially all sequence entries have been classified into families, allowing the efficient application of methods to propagate and standardize annotation among related sequences. The databases are available through the Internet by the World-Wide Web and FTP, or on CD-ROM and magnetic media.

  14. Database in Artificial Intelligence.

    ERIC Educational Resources Information Center

    Wilkinson, Julia

    1986-01-01

    Describes a specialist bibliographic database of literature in the field of artificial intelligence created by the Turing Institute (Glasgow, Scotland) using the BRS/Search information retrieval software. The subscription method for end-users--i.e., annual fee entitles user to unlimited access to database, document provision, and printed awareness…

  15. BioImaging Database

    SciTech Connect

    David Nix, Lisa Simirenko

    2006-10-25

    The Biolmaging Database (BID) is a relational database developed to store the data and meta-data for the 3D gene expression in early Drosophila embryo development on a cellular level. The schema was written to be used with the MySQL DBMS but with minor modifications can be used on any SQL compliant relational DBMS.

  16. Biological Macromolecule Crystallization Database

    National Institute of Standards and Technology Data Gateway

    SRD 21 Biological Macromolecule Crystallization Database (Web, free access)   The Biological Macromolecule Crystallization Database and NASA Archive for Protein Crystal Growth Data (BMCD) contains the conditions reported for the crystallization of proteins and nucleic acids used in X-ray structure determinations and archives the results of microgravity macromolecule crystallization studies.

  17. Online Database Searching Workbook.

    ERIC Educational Resources Information Center

    Littlejohn, Alice C.; Parker, Joan M.

    Designed primarily for use by first-time searchers, this workbook provides an overview of online searching. Following a brief introduction which defines online searching, databases, and database producers, five steps in carrying out a successful search are described: (1) identifying the main concepts of the search statement; (2) selecting a…

  18. Ionic Liquids Database- (ILThermo)

    National Institute of Standards and Technology Data Gateway

    SRD 147 Ionic Liquids Database- (ILThermo) (Web, free access)   IUPAC Ionic Liquids Database, ILThermo, is a free web research tool that allows users worldwide to access an up-to-date data collection from the publications on experimental investigations of thermodynamic, and transport properties of ionic liquids as well as binary and ternary mixtures containing ionic liquids.

  19. HIV Structural Database

    National Institute of Standards and Technology Data Gateway

    SRD 102 HIV Structural Database (Web, free access)   The HIV Protease Structural Database is an archive of experimentally determined 3-D structures of Human Immunodeficiency Virus 1 (HIV-1), Human Immunodeficiency Virus 2 (HIV-2) and Simian Immunodeficiency Virus (SIV) Proteases and their complexes with inhibitors or products of substrate cleavage.

  20. Morchella MLST database

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Welcome to the Morchella MLST database. This dedicated database was set up at the CBS-KNAW Biodiversity Center by Vincent Robert in February 2012, using BioloMICS software (Robert et al., 2011), to facilitate DNA sequence-based identifications of Morchella species via the Internet. The current datab...

  1. Atomic Spectra Database (ASD)

    National Institute of Standards and Technology Data Gateway

    SRD 78 NIST Atomic Spectra Database (ASD) (Web, free access)   This database provides access and search capability for NIST critically evaluated data on atomic energy levels, wavelengths, and transition probabilities that are reasonably up-to-date. The NIST Atomic Spectroscopy Data Center has carried out these critical compilations.

  2. First Look: TRADEMARKSCAN Database.

    ERIC Educational Resources Information Center

    Fernald, Anne Conway; Davidson, Alan B.

    1984-01-01

    Describes database produced by Thomson and Thomson and available on Dialog which contains over 700,000 records representing all active federal trademark registrations and applications for registrations filed in United States Patent and Trademark Office. A typical record, special features, database applications, learning to use TRADEMARKSCAN, and…

  3. Structural Ceramics Database

    National Institute of Standards and Technology Data Gateway

    SRD 30 NIST Structural Ceramics Database (Web, free access)   The NIST Structural Ceramics Database (WebSCD) provides evaluated materials property data for a wide range of advanced ceramics known variously as structural ceramics, engineering ceramics, and fine ceramics.

  4. Build Your Own Database.

    ERIC Educational Resources Information Center

    Jacso, Peter; Lancaster, F. W.

    This book is intended to help librarians and others to produce databases of better value and quality, especially if they have had little previous experience in database construction. Drawing upon almost 40 years of experience in the field of information retrieval, this book emphasizes basic principles and approaches rather than in-depth and…

  5. Knowledge Discovery in Databases.

    ERIC Educational Resources Information Center

    Norton, M. Jay

    1999-01-01

    Knowledge discovery in databases (KDD) revolves around the investigation and creation of knowledge, processes, algorithms, and mechanisms for retrieving knowledge from data collections. The article is an introductory overview of KDD. The rationale and environment of its development and applications are discussed. Issues related to database design…

  6. Database Searching by Managers.

    ERIC Educational Resources Information Center

    Arnold, Stephen E.

    Managers and executives need the easy and quick access to business and management information that online databases can provide, but many have difficulty articulating their search needs to an intermediary. One possible solution would be to encourage managers and their immediate support staff members to search textual databases directly as they now…

  7. Assignment to database industy

    NASA Astrophysics Data System (ADS)

    Abe, Kohichiroh

    Various kinds of databases are considered to be essential part in future large sized systems. Information provision only by databases is also considered to be growing as the market becomes mature. This paper discusses how such circumstances have been built and will be developed from now on.

  8. MTBreg: The Database of Conditionally Regulated Proteins in Mycobacterium Tuberculosis

    DOE Data Explorer

    Kaufman, Markus; Pal, Debnath; Eisenberg, David

    Proteins up- and down- regulated in Mycobacterium tuberculosis grown under conditions mimicking infection are included in this database. It also includes information on proteins that are regulated by selected transcription factors or other regulatory proteins. The literature data provided here is complimentary to the databases provided by Michael Strong that include recent TB computational functional linkages and the Prolinks Database by Peter Bowers. The experimental condition, the experimental dataset and a literature reference will be displayed, including links to the computationally linked proteins in the Prolinks Database and the entry in the Mycobacterium tuberculosis Structural Genomics Database.[Copied from information at http://www.doe-mbi.ucla.edu/Services/MTBreg/

  9. Pfam: the protein families database

    PubMed Central

    Finn, Robert D.; Bateman, Alex; Clements, Jody; Coggill, Penelope; Eberhardt, Ruth Y.; Eddy, Sean R.; Heger, Andreas; Hetherington, Kirstie; Holm, Liisa; Mistry, Jaina; Sonnhammer, Erik L. L.; Tate, John; Punta, Marco

    2014-01-01

    Pfam, available via servers in the UK (http://pfam.sanger.ac.uk/) and the USA (http://pfam.janelia.org/), is a widely used database of protein families, containing 14 831 manually curated entries in the current release, version 27.0. Since the last update article 2 years ago, we have generated 1182 new families and maintained sequence coverage of the UniProt Knowledgebase (UniProtKB) at nearly 80%, despite a 50% increase in the size of the underlying sequence database. Since our 2012 article describing Pfam, we have also undertaken a comprehensive review of the features that are provided by Pfam over and above the basic family data. For each feature, we determined the relevance, computational burden, usage statistics and the functionality of the feature in a website context. As a consequence of this review, we have removed some features, enhanced others and developed new ones to meet the changing demands of computational biology. Here, we describe the changes to Pfam content. Notably, we now provide family alignments based on four different representative proteome sequence data sets and a new interactive DNA search interface. We also discuss the mapping between Pfam and known 3D structures. PMID:24288371

  10. Nuclear data made easily accessible through the Notre Dame Nuclear Database

    NASA Astrophysics Data System (ADS)

    Khouw, Timothy; Lee, Kevin; Fasano, Patrick; Mumpower, Matthew; Aprahamian, Ani

    2014-09-01

    In 1994, the NNDC revolutionized nuclear research by providing a colorful, clickable, searchable database over the internet. Over the last twenty years, web technology has evolved dramatically. Our project, the Notre Dame Nuclear Database, aims to provide a more comprehensive and broadly searchable interactive body of data. The database can be searched by an array of filters which includes metadata such as the facility where a measurement is made, the author(s), or date of publication for the datum of interest. The user interface takes full advantage of HTML, a web markup language, CSS (cascading style sheets to define the aesthetics of the website), and JavaScript, a language that can process complex data. A command-line interface is supported that interacts with the database directly on a user's local machine which provides single command access to data. This is possible through the use of a standardized API (application programming interface) that relies upon well-defined filtering variables to produce customized search results. We offer an innovative chart of nuclides utilizing scalable vector graphics (SVG) to deliver users an unsurpassed level of interactivity supported on all computers and mobile devices. We will present a functional demo of our database at the conference.

  11. An updated active structure database of Taiwan for seismic hazard assessments

    NASA Astrophysics Data System (ADS)

    Shyu, J. B. H.; Chuang, Y. R.; Chen, Y. L.; Lee, Y.; Cheng, T. C. T.

    2014-12-01

    In order to build a complete seismogenic source model to assess future seismic hazards in Taiwan, we have constructed an updated active structure database for the island. We reviewed existing active structure databases, and obtained new information for structures that have not been thoroughly analyzed before. For example, the Central Geological Survey of Taiwan has published a comprehensive database of active faults in Taiwan, including all of the historically ruptured faults. Many other active structures, such as blind faults or folds that can be identified from geomorphic or structural analysis, have also been mapped and reported in several previous investigations. We have combined information from these existing databases to build an updated and digitized three-dimensional active structure map for Taiwan. Furthermore, for detailed information of individual structure such as long-term slip rates and potential recurrence intervals, we have collected the data from existing publications, as well as calculated from results of our own field surveys and investigations. We hope this updated database would become a significant constraint for the calculations of seismic hazard assessments in Taiwan, and would provide important information for engineers and hazard mitigation agencies.

  12. The InSiGHT database: utilizing 100 years of insights into Lynch syndrome.

    PubMed

    Plazzer, J P; Sijmons, R H; Woods, M O; Peltomäki, P; Thompson, B; Den Dunnen, J T; Macrae, F

    2013-06-01

    This article provides a historical overview of the online database ( www.insight-group.org/mutations ) maintained by the International Society for Gastrointestinal Hereditary Tumours. The focus is on the mismatch repair genes which are mutated in Lynch Syndrome. APC, MUTYH and other genes are also an important part of the database, but are not covered here. Over time, as the understanding of the genetics of Lynch Syndrome increased, databases were created to centralise and share the variants which were being detected in ever greater numbers. These databases were eventually merged into the InSiGHT database, a comprehensive repository of gene variant and disease phenotype information, serving as a starting point for important endeavours including variant interpretation, research, diagnostics and enhanced global collection. Pivotal to its success has been the collaborative spirit in which it has been developed, its association with the Human Variome Project, the appointment of a full time curator and its governance stemming from the well established organizational structure of InSiGHT.

  13. Protein-protein interaction databases: keeping up with growing interactomes

    PubMed Central

    2009-01-01

    Over the past few years, the number of known protein-protein interactions has increased substantially. To make this information more readily available, a number of publicly available databases have set out to collect and store protein-protein interaction data. Protein-protein interactions have been retrieved from six major databases, integrated and the results compared. The six databases (the Biological General Repository for Interaction Datasets [BioGRID], the Molecular INTeraction database [MINT], the Biomolecular Interaction Network Database [BIND], the Database of Interacting Proteins [DIP], the IntAct molecular interaction database [IntAct] and the Human Protein Reference Database [HPRD]) differ in scope and content; integration of all datasets is non-trivial owing to differences in data annotation. With respect to human protein-protein interaction data, HPRD seems to be the most comprehensive. To obtain a complete dataset, however, interactions from all six databases have to be combined. To overcome this limitation, meta-databases such as the Agile Protein Interaction Database (APID) offer access to integrated protein-protein interaction datasets, although these also currently have certain restrictions. PMID:19403463

  14. MannDB: A microbial annotation database for protein characterization

    SciTech Connect

    Zhou, C; Lam, M; Smith, J; Zemla, A; Dyer, M; Kuczmarski, T; Vitalis, E; Slezak, T

    2006-05-19

    MannDB was created to meet a need for rapid, comprehensive automated protein sequence analyses to support selection of proteins suitable as targets for driving the development of reagents for pathogen or protein toxin detection. Because a large number of open-source tools were needed, it was necessary to produce a software system to scale the computations for whole-proteome analysis. Thus, we built a fully automated system for executing software tools and for storage, integration, and display of automated protein sequence analysis and annotation data. MannDB is a relational database that organizes data resulting from fully automated, high-throughput protein-sequence analyses using open-source tools. Types of analyses provided include predictions of cleavage, chemical properties, classification, features, functional assignment, post-translational modifications, motifs, antigenicity, and secondary structure. Proteomes (lists of hypothetical and known proteins) are downloaded and parsed from Genbank and then inserted into MannDB, and annotations from SwissProt are downloaded when identifiers are found in the Genbank entry or when identical sequences are identified. Currently 36 open-source tools are run against MannDB protein sequences either on local systems or by means of batch submission to external servers. In addition, BLAST against protein entries in MvirDB, our database of microbial virulence factors, is performed. A web client browser enables viewing of computational results and downloaded annotations, and a query tool enables structured and free-text search capabilities. When available, links to external databases, including MvirDB, are provided. MannDB contains whole-proteome analyses for at least one representative organism from each category of biological threat organism listed by APHIS, CDC, HHS, NIAID, USDA, USFDA, and WHO. MannDB comprises a large number of genomes and comprehensive protein sequence analyses representing organisms listed as high

  15. HDACiDB: a database for histone deacetylase inhibitors

    PubMed Central

    Murugan, Kasi; Sangeetha, Shanmugasamy; Ranjitha, Shanmugasamy; Vimala, Antony; Al-Sohaibani, Saleh; Rameshkumar, Gopal

    2015-01-01

    An histone deacetylase (HDAC) inhibitor database (HDACiDB) was constructed to enable rapid access to data relevant to the development of epigenetic modulators (HDAC inhibitors [HDACi]), helping bring precision cancer medicine a step closer. Thousands of HDACi targeting HDACs are in various stages of development and are being tested in clinical trials as monotherapy and in combination with other cancer agents. Despite the abundance of HDACi, information resources are limited. Tools for in silico experiments on specific HDACi prediction, for designing and analyzing the generated data, as well as custom-made specific tools and interactive databases, are needed. We have developed an HDACiDB that is a composite collection of HDACi and currently comprises 1,445 chemical compounds, including 419 natural and 1,026 synthetic ones having the potential to inhibit histone deacetylation. Most importantly, it will allow application of Lipinski’s rule of five drug-likeness and other physicochemical property-based screening of the inhibitors. It also provides easy access to information on their source of origin, molecular properties, drug likeness, as well as bioavailability with relevant references cited. Being the first comprehensive database on HDACi that contains all known natural and synthetic HDACi, the HDACiDB may help to improve our knowledge concerning the mechanisms of actions of available HDACi and enable us to selectively target individual HDAC isoforms and establish a new paradigm for intelligent epigenetic cancer drug design. The database is freely available on the http://hdacidb.bioinfo.au-kbc.org.in/hdacidb/website. PMID:25945037

  16. The Mouse Genome Database: integration of and access to knowledge about the laboratory mouse.

    PubMed

    Blake, Judith A; Bult, Carol J; Eppig, Janan T; Kadin, James A; Richardson, Joel E

    2014-01-01

    The Mouse Genome Database (MGD) (http://www.informatics.jax.org) is the community model organism database resource for the laboratory mouse, a premier animal model for the study of genetic and genomic systems relevant to human biology and disease. MGD maintains a comprehensive catalog of genes, functional RNAs and other genome features as well as heritable phenotypes and quantitative trait loci. The genome feature catalog is generated by the integration of computational and manual genome annotations generated by NCBI, Ensembl and Vega/HAVANA. MGD curates and maintains the comprehensive listing of functional annotations for mouse genes using the Gene Ontology, and MGD curates and integrates comprehensive phenotype annotations including associations of mouse models with human diseases. Recent improvements include integration of the latest mouse genome build (GRCm38), improved access to comparative and functional annotations for mouse genes with expanded representation of comparative vertebrate genomes and new loads of phenotype data from high-throughput phenotyping projects. All MGD resources are freely available to the research community.

  17. The New NRL Crystallographic Database

    NASA Astrophysics Data System (ADS)

    Mehl, Michael; Curtarolo, Stefano; Hicks, David; Toher, Cormac; Levy, Ohad; Hart, Gus

    For many years the Naval Research Laboratory maintained an online graphical database of crystal structures for a wide variety of materials. This database has now been redesigned, updated and integrated with the AFLOW framework for high throughput computational materials discovery (http://materials.duke.edu/aflow.html). For each structure we provide an image showing the atomic positions; the primitive vectors of the lattice and the basis vectors of every atom in the unit cell; the space group and Wyckoff positions; Pearson symbols; common names; and Strukturbericht designations, where available. References for each structure are provided, as well as a Crystallographic Information File (CIF). The database currently includes almost 300 entries and will be continuously updated and expanded. It enables easy search of the various structures based on their underlying symmetries, either by Bravais lattice, Pearson symbol, Strukturbericht designation or commonly used prototypes. The talk will describe the features of the database, and highlight its utility for high throughput computational materials design. Work at NRL is funded by a Contract with the Duke University Department of Mechanical Engineering.

  18. SCOR: a Structural Classification of RNA database.

    PubMed

    Klosterman, Peter S; Tamura, Makio; Holbrook, Stephen R; Brenner, Steven E

    2002-01-01

    The Structural Classification of RNA (SCOR) database provides a survey of the three-dimensional motifs contained in 259 NMR and X-ray RNA structures. In one classification, the structures are grouped according to function. The RNA motifs, including internal and external loops, are also organized in a hierarchical classification. The 259 database entries contain 223 internal and 203 external loops; 52 entries consist of fully complementary duplexes. A classification of the well-characterized tertiary interactions found in the larger RNA structures is also included along with examples. The SCOR database is accessible at http://scor.lbl.gov.

  19. Electron-Impact Ionization Cross Section Database

    National Institute of Standards and Technology Data Gateway

    SRD 107 Electron-Impact Ionization Cross Section Database (Web, free access)   This is a database primarily of total ionization cross sections of molecules by electron impact. The database also includes cross sections for a small number of atoms and energy distributions of ejected electrons for H, He, and H2. The cross sections were calculated using the Binary-Encounter-Bethe (BEB) model, which combines the Mott cross section with the high-incident energy behavior of the Bethe cross section. Selected experimental data are included.

  20. US Department of Energy Chernobyl Database

    SciTech Connect

    Kennedy, R.A.; Mahaffey, J.A.; Smith, S.K.; Carr, F. Jr.

    1992-05-01

    The Chernobyl Database project is developing and maintaining an information to provide researchers with data and resource materials relating to the Chernobyl nuclear accident of April 1986. The system is the official United States repository for Chernobyl data. The system includes a collection of Chernobyl-related documents, a database of bibliographic references, and a collection of radiological measurements records. In addition, projects have been developed to make the resources more accessible and easy to use. These products include a personal-computer-based bibliographic search system (ChernoLit{trademark}), two printed bibliographies, and a personal- computer-based radiological measurements database system (ChernoDat).

  1. DRAMP: a comprehensive data repository of antimicrobial peptides

    PubMed Central

    Fan, Linlin; Sun, Jian; Zhou, Meifeng; Zhou, Jie; Lao, Xingzhen; Zheng, Heng; Xu, Hanmei

    2016-01-01

    The growing problem of antibiotic-resistant microorganisms results in an urgent need for substitutes to conventional antibiotics with novel modes of action and effective activities. Antimicrobial peptides (AMPs), produced by a wide variety of living organisms acting as a defense mechanism against invading pathogenic microbes, are considered to be such promising alternatives. AMPs display a broad spectrum of antimicrobial activity and a low propensity for developing resistance. Therefore, a thorough understanding of AMPs is essential to exploit them as antimicrobial drugs. Considering this, we developed a comprehensive user-friendly data repository of antimicrobial peptides (DRAMP), which holds 17349 antimicrobial sequences, including 4571 general AMPs, 12704 patented sequences and 74 peptides in drug development. Entries in the database have detailed annotations, especially detailed antimicrobial activity data (shown as target organism with MIC value) and structure information. Annotations also include accession numbers crosslinking to Pubmed, Swiss-prot and Protein Data Bank (PDB). The website of the database comes with easy-to-operate browsing as well as searching with sorting and filtering functionalities. Several useful sequence analysis tools are provided, including similarity search, sequence alignment and conserved domain search (CD-Search). DRAMP should be a useful resource for the development of novel antimicrobial peptide drugs. PMID:27075512

  2. DRAMP: a comprehensive data repository of antimicrobial peptides.

    PubMed

    Fan, Linlin; Sun, Jian; Zhou, Meifeng; Zhou, Jie; Lao, Xingzhen; Zheng, Heng; Xu, Hanmei

    2016-01-01

    The growing problem of antibiotic-resistant microorganisms results in an urgent need for substitutes to conventional antibiotics with novel modes of action and effective activities. Antimicrobial peptides (AMPs), produced by a wide variety of living organisms acting as a defense mechanism against invading pathogenic microbes, are considered to be such promising alternatives. AMPs display a broad spectrum of antimicrobial activity and a low propensity for developing resistance. Therefore, a thorough understanding of AMPs is essential to exploit them as antimicrobial drugs. Considering this, we developed a comprehensive user-friendly data repository of antimicrobial peptides (DRAMP), which holds 17349 antimicrobial sequences, including 4571 general AMPs, 12704 patented sequences and 74 peptides in drug development. Entries in the database have detailed annotations, especially detailed antimicrobial activity data (shown as target organism with MIC value) and structure information. Annotations also include accession numbers crosslinking to Pubmed, Swiss-prot and Protein Data Bank (PDB). The website of the database comes with easy-to-operate browsing as well as searching with sorting and filtering functionalities. Several useful sequence analysis tools are provided, including similarity search, sequence alignment and conserved domain search (CD-Search). DRAMP should be a useful resource for the development of novel antimicrobial peptide drugs. PMID:27075512

  3. Repbase Update, a database of eukaryotic repetitive elements.

    PubMed

    Jurka, J; Kapitonov, V V; Pavlicek, A; Klonowski, P; Kohany, O; Walichiewicz, J

    2005-01-01

    Repbase Update is a comprehensive database of repetitive elements from diverse eukaryotic organisms. Currently, it contains over 3600 annotated sequences representing different families and subfamilies of repeats, many of which are unreported anywhere else. Each sequence is accompanied by a short description and references to the original contributors. Repbase Update includes Repbase Reports, an electronic journal publishing newly discovered transposable elements, and the Transposon Pub, a web-based browser of selected chromosomal maps of transposable elements. Sequences from Repbase Update are used to screen and annotate repetitive elements using programs such as Censor and RepeatMasker. Repbase Update is available on the worldwide web at http://www.girinst.org/Repbase_Update.html.

  4. Hazard Analysis Database Report

    SciTech Connect

    GRAMS, W.H.

    2000-12-28

    The Hazard Analysis Database was developed in conjunction with the hazard analysis activities conducted in accordance with DOE-STD-3009-94, Preparation Guide for U S . Department of Energy Nonreactor Nuclear Facility Safety Analysis Reports, for HNF-SD-WM-SAR-067, Tank Farms Final Safety Analysis Report (FSAR). The FSAR is part of the approved Authorization Basis (AB) for the River Protection Project (RPP). This document describes, identifies, and defines the contents and structure of the Tank Farms FSAR Hazard Analysis Database and documents the configuration control changes made to the database. The Hazard Analysis Database contains the collection of information generated during the initial hazard evaluations and the subsequent hazard and accident analysis activities. The Hazard Analysis Database supports the preparation of Chapters 3 ,4 , and 5 of the Tank Farms FSAR and the Unreviewed Safety Question (USQ) process and consists of two major, interrelated data sets: (1) Hazard Analysis Database: Data from the results of the hazard evaluations, and (2) Hazard Topography Database: Data from the system familiarization and hazard identification.

  5. Hazard Analysis Database Report

    SciTech Connect

    GAULT, G.W.

    1999-10-13

    The Hazard Analysis Database was developed in conjunction with the hazard analysis activities conducted in accordance with DOE-STD-3009-94, Preparation Guide for US Department of Energy Nonreactor Nuclear Facility Safety Analysis Reports, for the Tank Waste Remediation System (TWRS) Final Safety Analysis Report (FSAR). The FSAR is part of the approved TWRS Authorization Basis (AB). This document describes, identifies, and defines the contents and structure of the TWRS FSAR Hazard Analysis Database and documents the configuration control changes made to the database. The TWRS Hazard Analysis Database contains the collection of information generated during the initial hazard evaluations and the subsequent hazard and accident analysis activities. The database supports the preparation of Chapters 3,4, and 5 of the TWRS FSAR and the USQ process and consists of two major, interrelated data sets: (1) Hazard Evaluation Database--Data from the results of the hazard evaluations; and (2) Hazard Topography Database--Data from the system familiarization and hazard identification.

  6. The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of Pathway/Genome Databases.

    PubMed

    Caspi, Ron; Altman, Tomer; Billington, Richard; Dreher, Kate; Foerster, Hartmut; Fulcher, Carol A; Holland, Timothy A; Keseler, Ingrid M; Kothari, Anamika; Kubo, Aya; Krummenacker, Markus; Latendresse, Mario; Mueller, Lukas A; Ong, Quang; Paley, Suzanne; Subhraveti, Pallavi; Weaver, Daniel S; Weerasinghe, Deepika; Zhang, Peifen; Karp, Peter D

    2014-01-01

    The MetaCyc database (MetaCyc.org) is a comprehensive and freely accessible database describing metabolic pathways and enzymes from all domains of life. MetaCyc pathways are experimentally determined, mostly small-molecule metabolic pathways and are curated from the primary scientific literature. MetaCyc contains >2100 pathways derived from >37,000 publications, and is the largest curated collection of metabolic pathways currently available. BioCyc (BioCyc.org) is a collection of >3000 organism-specific Pathway/Genome Databases (PGDBs), each containing the full genome and predicted metabolic network of one organism, including metabolites, enzymes, reactions, metabolic pathways, predicted operons, transport systems and pathway-hole fillers. Additions to BioCyc over the past 2 years include YeastCyc, a PGDB for Saccharomyces cerevisiae, and 891 new genomes from the Human Microbiome Project. The BioCyc Web site offers a variety of tools for querying and analysis of PGDBs, including Omics Viewers and tools for comparative analysis. New developments include atom mappings in reactions, a new representation of glycan degradation pathways, improved compound structure display, better coverage of enzyme kinetic data, enhancements of the Web Groups functionality, improvements to the Omics viewers, a new representation of the Enzyme Commission system and, for the desktop version of the software, the ability to save display states.

  7. Database for propagation models

    NASA Technical Reports Server (NTRS)

    Kantak, Anil V.

    1991-01-01

    A propagation researcher or a systems engineer who intends to use the results of a propagation experiment is generally faced with various database tasks such as the selection of the computer software, the hardware, and the writing of the programs to pass the data through the models of interest. This task is repeated every time a new experiment is conducted or the same experiment is carried out at a different location generating different data. Thus the users of this data have to spend a considerable portion of their time learning how to implement the computer hardware and the software towards the desired end. This situation may be facilitated considerably if an easily accessible propagation database is created that has all the accepted (standardized) propagation phenomena models approved by the propagation research community. Also, the handling of data will become easier for the user. Such a database construction can only stimulate the growth of the propagation research it if is available to all the researchers, so that the results of the experiment conducted by one researcher can be examined independently by another, without different hardware and software being used. The database may be made flexible so that the researchers need not be confined only to the contents of the database. Another way in which the database may help the researchers is by the fact that they will not have to document the software and hardware tools used in their research since the propagation research community will know the database already. The following sections show a possible database construction, as well as properties of the database for the propagation research.

  8. International Comparisions Database

    National Institute of Standards and Technology Data Gateway

    International Comparisions Database (Web, free access)   The International Comparisons Database (ICDB) serves the U.S. and the Inter-American System of Metrology (SIM) with information based on Appendices B (International Comparisons), C (Calibration and Measurement Capabilities) and D (List of Participating Countries) of the Comit� International des Poids et Mesures (CIPM) Mutual Recognition Arrangement (MRA). The official source of the data is The BIPM key comparison database. The ICDB provides access to results of comparisons of measurements and standards organized by the consultative committees of the CIPM and the Regional Metrology Organizations.

  9. JICST Factual Database

    NASA Astrophysics Data System (ADS)

    Suzuki, Kazuaki; Shimura, Kazuki; Monma, Yoshio; Sakamoto, Masao; Morishita, Hiroshi; Kanazawa, Kenji

    The Japan Information Center of Science and Technology (JICST) has started the on-line service of JICST/NRIM Materials Strength Database for Engineering Steels and Alloys (JICST ME) in this March (1990). This database has been developed under the joint research between JICST and the National Research Institute for Metals (NRIM). It provides material strength data (creep, fatigue, etc.) of engineering steels and alloys. It is able to search and display on-line, and to analyze the searched data statistically and plot the result on graphic display. The database system and the data in JICST ME are described.

  10. Hybrid Terrain Database

    NASA Technical Reports Server (NTRS)

    Arthur, Trey

    2006-01-01

    A prototype hybrid terrain database is being developed in conjunction with other databases and with hardware and software that constitute subsystems of aerospace cockpit display systems (known in the art as synthetic vision systems) that generate images to increase pilots' situation awareness and eliminate poor visibility as a cause of aviation accidents. The basic idea is to provide a clear view of the world around an aircraft by displaying computer-generated imagery derived from an onboard database of terrain, obstacle, and airport information.

  11. Nigeria Anopheles Vector Database: An Overview of 100 Years' Research

    PubMed Central

    Okorie, Patricia Nkem; McKenzie, F. Ellis; Ademowo, Olusegun George; Bockarie, Moses; Kelly-Hope, Louise

    2011-01-01

    Anopheles mosquitoes are important vectors of malaria and lymphatic filariasis (LF), which are major public health diseases in Nigeria. Malaria is caused by infection with a protozoan parasite of the genus Plasmodium and LF by the parasitic worm Wuchereria bancrofti. Updating our knowledge of the Anopheles species is vital in planning and implementing evidence based vector control programs. To present a comprehensive report on the spatial distribution and composition of these vectors, all published data available were collated into a database. Details recorded for each source were the locality, latitude/longitude, time/period of study, species, abundance, sampling/collection methods, morphological and molecular species identification methods, insecticide resistance status, including evidence of the kdr allele, and P. falciparum sporozoite rate and W. bancrofti microfilaria prevalence. This collation resulted in a total of 110 publications, encompassing 484,747 Anopheles mosquitoes in 632 spatially unique descriptions at 142 georeferenced locations being identified across Nigeria from 1900 to 2010. Overall, the highest number of vector species reported included An. gambiae complex (65.2%), An. funestus complex (17.3%), An. gambiae s.s. (6.5%). An. arabiensis (5.0%) and An. funestus s.s. (2.5%), with the molecular forms An. gambiae M and S identified at 120 locations. A variety of sampling/collection and species identification methods were used with an increase in molecular techniques in recent decades. Insecticide resistance to pyrethroids and organochlorines was found in the main Anopheles species across 45 locations. Presence of P. falciparum and W. bancrofti varied between species with the highest sporozoite rates found in An. gambiae s.s, An. funestus s.s. and An. moucheti, and the highest microfilaria prevalence in An. gambiae s.l., An. arabiensis, and An. gambiae s.s. This comprehensive geo-referenced database provides an essential baseline on Anopheles

  12. HypoxiaDB: a database of hypoxia-regulated proteins.

    PubMed

    Khurana, Pankaj; Sugadev, Ragumani; Jain, Jaspreet; Singh, Shashi Bala

    2013-01-01

    There has been intense interest in the cellular response to hypoxia, and a large number of differentially expressed proteins have been identified through various high-throughput experiments. These valuable data are scattered, and there have been no systematic attempts to document the various proteins regulated by hypoxia. Compilation, curation and annotation of these data are important in deciphering their role in hypoxia and hypoxia-related disorders. Therefore, we have compiled HypoxiaDB, a database of hypoxia-regulated proteins. It is a comprehensive, manually-curated, non-redundant catalog of proteins whose expressions are shown experimentally to be altered at different levels and durations of hypoxia. The database currently contains 72 000 manually curated entries taken on 3500 proteins extracted from 73 peer-reviewed publications selected from PubMed. HypoxiaDB is distinctive from other generalized databases: (i) it compiles tissue-specific protein expression changes under different levels and duration of hypoxia. Also, it provides manually curated literature references to support the inclusion of the protein in the database and establish its association with hypoxia. (ii) For each protein, HypoxiaDB integrates data on gene ontology, KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway, protein-protein interactions, protein family (Pfam), OMIM (Online Mendelian Inheritance in Man), PDB (Protein Data Bank) structures and homology to other sequenced genomes. (iii) It also provides pre-compiled information on hypoxia-proteins, which otherwise requires tedious computational analysis. This includes information like chromosomal location, identifiers like Entrez, HGNC, Unigene, Uniprot, Ensembl, Vega, GI numbers and Genbank accession numbers associated with the protein. These are further cross-linked to respective public databases augmenting HypoxiaDB to the external repositories. (iv) In addition, HypoxiaDB provides an online sequence-similarity search tool for

  13. Database and knowledge base integration in decision support systems.

    PubMed Central

    Johansson, B.; Shahsavar, N.; Ahlfeldt, H.; Wigertz, O.

    1996-01-01

    Since decision support systems (DSS) in medicine often are linked to clinical databases it is important to find methods that facilitate the work for DSS developers to implement database queries in the knowledge base (KB). This paper presents a method for linking clinical databases to a KB with Arden Syntax modules. The method is based on a query meta database including templates for SQL queries. During knowledge module authoring the medical expert only refers to a code in the query meta database. Our method uses standard tools so it can be implemented on different platforms and linked to different clinical databases. PMID:8947666

  14. The PEP-2 project-wide database

    NASA Astrophysics Data System (ADS)

    Chan, A.; Calish, S.; Crane, G.; MacGregor, I.; Meyer, S.; Wong, J.; Weinstein, A.

    1995-05-01

    The PEP-2 Project Database is a tool for monitoring the technical and documentation aspects of this accelerator construction. It holds the PEP-2 design specifications, fabrication and installation data in one integrated system. Key pieces of the database include the machine parameter list, magnet and vacuum fabrication data. CAD drawings, publications and documentation, survey and alignment data and property control. The database can be extended to contain information required for the operations phase of the accelerator and detector. Features such as viewing CAD drawing graphics from the database will be implemented in the future. This central Oracle database on a UNIX server is built using ORACLE Case tools. Users at the three collaborating laboratories (SLAC, LBL, LLNL) can access the data remotely, using various desktop computer platforms and graphical interfaces.

  15. Profiling of Escherichia coli Chromosome database.

    PubMed

    Yamazaki, Yukiko; Niki, Hironori; Kato, Jun-ichi

    2008-01-01

    The Profiling of Escherichia coli Chromosome (PEC) database (http://www.shigen.nig.ac.jp/ecoli/pec/) is designed to allow E. coli researchers to efficiently access information from functional genomics studies. The database contains two principal types of data: gene essentiality and a large collection of E. coli genetic research resources. The essentiality data are based on data compilation from published single-gene essentiality studies and on cell growth studies of large-deletion mutants. Using the circular and linear viewers for both whole genomes and the minimal genome, users can not only gain an overview of the genome structure but also retrieve information on contigs, gene products, mutants, deletions, and so forth. In particular, genome-wide exhaustive mutants are an essential resource for studying E. coli gene functions. Although the genomic database was constructed independently from the genetic resources database, users may seamlessly access both types of data. In addition to these data, the PEC database also provides a summary of homologous genes of other bacterial genomes and of protein structure information, with a comprehensive interface. The PEC is thus a convenient and useful platform for contemporary E. coli researchers. PMID:18392982

  16. ARTI Refrigerant Database

    SciTech Connect

    Calm, J.M.

    1994-05-27

    The Refrigerant Database consolidates and facilitates access to information to assist industry in developing equipment using alternative refrigerants. The underlying purpose is to accelerate phase out of chemical compounds of environmental concern.

  17. Hawaii bibliographic database

    USGS Publications Warehouse

    Wright, T.L.; Takahashi, T.J.

    1998-01-01

    The Hawaii bibliographic database has been created to contain all of the literature, from 1779 to the present, pertinent to the volcanological history of the Hawaiian-Emperor volcanic chain. References are entered in a PC- and Macintosh-compatible EndNote Plus bibliographic database with keywords and abstracts or (if no abstract) with annotations as to content. Keywords emphasize location, discipline, process, identification of new chemical data or age determinations, and type of publication. The database is updated approximately three times a year and is available to upload from an ftp site. The bibliography contained 8460 references at the time this paper was submitted for publication. Use of the database greatly enhances the power and completeness of library searches for anyone interested in Hawaiian volcanism.

  18. TREATABILITY DATABASE DESCRIPTION

    EPA Science Inventory

    The Drinking Water Treatability Database (TDB) presents referenced information on the control of contaminants in drinking water. It allows drinking water utilities, first responders to spills or emergencies, treatment process designers, research organizations, academics, regulato...

  19. THE CTEPP DATABASE

    EPA Science Inventory

    The CTEPP (Children's Total Exposure to Persistent Pesticides and Other Persistent Organic Pollutants) database contains a wealth of data on children's aggregate exposures to pollutants in their everyday surroundings. Chemical analysis data for the environmental media and ques...

  20. Requirements Management Database

    2009-08-13

    This application is a simplified and customized version of the RBA and CTS databases to capture federal, site, and facility requirements, link to actions that must be performed to maintain compliance with their contractual and other requirements.