density functional database: Topics by Science.gov

Sample records for density functional database

Quest for a universal density functional: the accuracy of density functionals across a broad spectrum of databases in chemistry and physics.

PubMed

Peverati, Roberto; Truhlar, Donald G

2014-03-13

Kohn-Sham density functional theory is in principle an exact formulation of quantum mechanical electronic structure theory, but in practice we have to rely on approximate exchange-correlation (xc) functionals. The objective of our work has been to design an xc functional with broad accuracy across as wide an expanse of chemistry and physics as possible, leading--as a long-range goal--to a functional with good accuracy for all problems, i.e. a universal functional. To guide our path towards that goal and to measure our progress, we have developed-building on earlier work of our group-a set of databases of reference data for a variety of energetic and structural properties in chemistry and physics. These databases include energies of molecular processes, such as atomization, complexation, proton addition and ionization; they also include molecular geometries and solid-state lattice constants, chemical reaction barrier heights, and cohesive energies and band gaps of solids. For this paper, we gather many of these databases into four comprehensive databases, two with 384 energetic data for chemistry and solid-state physics and another two with 68 structural data for chemistry and solid-state physics, and we test two wave function methods and 77 density functionals (12 Minnesota meta functionals and 65 others) in a consistent way across this same broad set of data. We especially highlight the Minnesota density functionals, but the results have broader implications in that one may see the successes and failures of many kinds of density functionals when they are all applied to the same data. Therefore, the results provide a status report on the quest for a universal functional.
Geographical Distribution of Biomass Carbon in Tropical Southeast Asian Forests: A Database (NPD-068)

DOE Data Explorer

Brown, Sandra [University of Illinois, Urbana, Illinois (USA); Iverson, Louis R. [University of Illinois, Urbana, Illinois (USA); Prasad, Anantha [University of Illinois, Urbana, Illinois (USA); Beaty, Tammy W. [CDIAC, Oak Ridge National Laboratory, Oak Ridge, TN (USA); Olsen, Lisa M. [CDIAC, Oak Ridge National Laboratory, Oak Ridge, TN (USA); Cushman, Robert M. [CDIAC, Oak Ridge National Laboratory, Oak Ridge, TN (USA); Brenkert, Antoinette L. [CDIAC, Oak Ridge National Laboratory, Oak Ridge, TN (USA)

2001-03-01

A database was generated of estimates of geographically referenced carbon densities of forest vegetation in tropical Southeast Asia for 1980. A geographic information system (GIS) was used to incorporate spatial databases of climatic, edaphic, and geomorphological indices and vegetation to estimate potential (i.e., in the absence of human intervention and natural disturbance) carbon densities of forests. The resulting map was then modified to estimate actual 1980 carbon density as a function of population density and climatic zone. The database covers the following 13 countries: Bangladesh, Brunei, Cambodia (Campuchea), India, Indonesia, Laos, Malaysia, Myanmar (Burma), Nepal, the Philippines, Sri Lanka, Thailand, and Vietnam.
Nonseparable exchange–correlation functional for molecules, including homogeneous catalysis involving transition metals

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yu, Haoyu S.; Zhang, Wenjing; Verma, Pragya

2015-01-01

The goal of this work is to develop a gradient approximation to the exchange–correlation functional of Kohn–Sham density functional theory for treating molecular problems with a special emphasis on the prediction of quantities important for homogeneous catalysis and other molecular energetics. Our training and validation of exchange–correlation functionals is organized in terms of databases and subdatabases. The key properties required for homogeneous catalysis are main group bond energies (database MGBE137), transition metal bond energies (database TMBE32), reaction barrier heights (database BH76), and molecular structures (database MS10). We also consider 26 other databases, most of which are subdatabases of a newlymore » extended broad database called Database 2015, which is presented in the present article and in its ESI. Based on the mathematical form of a nonseparable gradient approximation (NGA), as first employed in the N12 functional, we design a new functional by using Database 2015 and by adding smoothness constraints to the optimization of the functional. The resulting functional is called the gradient approximation for molecules, or GAM. The GAM functional gives better results for MGBE137, TMBE32, and BH76 than any available generalized gradient approximation (GGA) or than N12. The GAM functional also gives reasonable results for MS10 with an MUE of 0.018 Å. The GAM functional provides good results both within the training sets and outside the training sets. The convergence tests and the smooth curves of exchange–correlation enhancement factor as a function of the reduced density gradient show that the GAM functional is a smooth functional that should not lead to extra expense or instability in optimizations. NGAs, like GGAs, have the advantage over meta-GGAs and hybrid GGAs of respectively smaller grid-size requirements for integrations and lower costs for extended systems. These computational advantages combined with the relatively high accuracy for all the key properties needed for molecular catalysis make the GAM functional very promising for future applications.« less
Multiconfiguration Pair-Density Functional Theory Is as Accurate as CASPT2 for Electronic Excitation.

PubMed

Hoyer, Chad E; Ghosh, Soumen; Truhlar, Donald G; Gagliardi, Laura

2016-02-04

A correct description of electronically excited states is critical to the interpretation of visible-ultraviolet spectra, photochemical reactions, and excited-state charge-transfer processes in chemical systems. We have recently proposed a theory called multiconfiguration pair-density functional theory (MC-PDFT), which is based on a combination of multiconfiguration wave function theory and a new kind of density functional called an on-top density functional. Here, we show that MC-PDFT with a first-generation on-top density functional performs as well as CASPT2 for an organic chemistry database including valence, Rydberg, and charge-transfer excitations. The results are very encouraging for practical applications.
An Improved Model for Operational Specification of the Electron Density Structure up to Geosynchronous Heights

DTIC Science & Technology

2010-07-01

http://www.iono.noa.gr/ElectronDensity/EDProfile.php The web service has been developed with the following open source tools: a) PHP , for the... MySQL for the database, which was based on the enhancement of the DIAS database. Below we present some screen shots to demonstrate the functionality
A theoretical-electron-density databank using a model of real and virtual spherical atoms.

PubMed

Nassour, Ayoub; Domagala, Slawomir; Guillot, Benoit; Leduc, Theo; Lecomte, Claude; Jelsch, Christian

2017-08-01

A database describing the electron density of common chemical groups using combinations of real and virtual spherical atoms is proposed, as an alternative to the multipolar atom modelling of the molecular charge density. Theoretical structure factors were computed from periodic density functional theory calculations on 38 crystal structures of small molecules and the charge density was subsequently refined using a density model based on real spherical atoms and additional dummy charges on the covalent bonds and on electron lone-pair sites. The electron-density parameters of real and dummy atoms present in a similar chemical environment were averaged on all the molecules studied to build a database of transferable spherical atoms. Compared with the now-popular databases of transferable multipolar parameters, the spherical charge modelling needs fewer parameters to describe the molecular electron density and can be more easily incorporated in molecular modelling software for the computation of electrostatic properties. The construction method of the database is described. In order to analyse to what extent this modelling method can be used to derive meaningful molecular properties, it has been applied to the urea molecule and to biotin/streptavidin, a protein/ligand complex.
How Well Can Modern Density Functionals Predict Internuclear Distances at Transition States?

PubMed

Xu, Xuefei; Alecu, I M; Truhlar, Donald G

2011-06-14

We introduce a new database called TSG48 containing 48 transition state geometrical data (in particular, internuclear distances in transition state structures) for 16 main group reactions. The 16 reactions are the 12 reactions in the previously published DBH24 database (which includes hydrogen transfer reactions, heavy-atom transfer reactions, nucleophilic substitution reactions, and association reactions plus one unimolecular isomerization) plus four H-transfer reactions in which a hydrogen atom is abstracted by the methyl or hydroperoxyl radical from the two different positions in methanol. The data in TSG48 include data for four reactions that have previously been treated at a very high level in the literature. These data are used to test and validate methods that are affordable for the entire test suite, and the most accurate of these methods is found to be the multilevel BMC-CCSD method. The data that constitute the TSG48 database are therefore taken to consist of these very high level calculations for the four reactions where they are available and BMC-CCSD calculations for the other 12 reactions. The TSG48 database is used to assess the performance of the eight Minnesota density functionals from the M05-M08 families and 26 other high-performance and popular density functionals for locating transition state geometries. For comparison, the MP2 and QCISD wave function methods have also been tested for transition state geometries. The MC3BB and MC3MPW doubly hybrid functionals and the M08-HX and M06-2X hybrid meta-GGAs are found to have the best performance of all of the density functionals tested. M08-HX is the most highly recommended functional due to the excellent performance for all five subsets of TSG48, as well as having a lower cost when compared to doubly hybrid functionals. The mean absolute errors in transition state internuclear distances associated with breaking and forming bonds as calculated by the B2PLYP, MP2, and B3LYP methods are respectively about 2, 3, and 5 times larger than those calculated by MC3BB and M08-HX.
Validation of electronic structure methods for isomerization reactions of large organic molecules.

PubMed

Luo, Sijie; Zhao, Yan; Truhlar, Donald G

2011-08-14

In this work the ISOL24 database of isomerization energies of large organic molecules presented by Huenerbein et al. [Phys. Chem. Chem. Phys., 2010, 12, 6940] is updated, resulting in the new benchmark database called ISOL24/11, and this database is used to test 50 electronic model chemistries. To accomplish the update, the very expensive and highly accurate CCSD(T)-F12a/aug-cc-pVDZ method is first exploited to investigate a six-reaction subset of the 24 reactions, and by comparison of various methods with the benchmark, MCQCISD-MPW is confirmed to be of high accuracy. The final ISOL24/11 database is composed of six reaction energies calculated by CCSD(T)-F12a/aug-cc-pVDZ and 18 calculated by MCQCISD-MPW. We then tested 40 single-component density functionals (both local and hybrid), eight doubly hybrid functionals, and two other methods against ISOL24/11. It is found that the SCS-MP3/CBS method, which is used as benchmark for the original ISOL24, has an MUE of 1.68 kcal mol(-1), which is close to or larger than some of the best tested DFT methods. Using the new benchmark, we find ωB97X-D and MC3MPWB to be the best single-component and doubly hybrid functionals respectively, with PBE0-D3 and MC3MPW performing almost as well. The best single-component density functionals without molecular mechanics dispersion-like terms are M08-SO, M08-HX, M05-2X, and M06-2X. The best single-component density functionals without Hartree-Fock exchange are M06-L-D3 when MM terms are included and M06-L when they are not.
How Accurate Are the Minnesota Density Functionals for Noncovalent Interactions, Isomerization Energies, Thermochemistry, and Barrier Heights Involving Molecules Composed of Main-Group Elements?

DOE PAGES

Mardirossian, Narbe; Head-Gordon, Martin

2016-08-18

The 14 Minnesota density functionals published between the years 2005 and early 2016 are benchmarked on a comprehensive database of 4986 data points (84 data sets) involving molecules composed of main-group elements. The database includes noncovalent interactions, isomerization energies, thermochemistry, and barrier heights, as well as equilibrium bond lengths and equilibrium binding energies of noncovalent dimers. Additionally, the sensitivity of the Minnesota density functionals to the choice of basis set and integration grid is explored for both noncovalent interactions and thermochemistry. By and large, the main strength of the hybrid Minnesota density functionals is that the best ones provide verymore » good performance for thermochemistry (e.g., M06-2X), barrier heights (e.g., M08-HX, M08-SO, MN15), and systems heavily characterized by self-interaction error (e.g., M06-2X, M08-HX, M08-SO, MN15), while the main weakness is that none of them are state-of-the-art for the full spectrum of noncovalent interactions and isomerization energies (although M06-2X is recommended from the 10 hybrid Minnesota functionals). Similarly, the main strength of the local Minnesota density functionals is that the best ones provide very good performance for thermochemistry (e.g., MN15-L), barrier heights (e.g., MN12-L), and systems heavily characterized by self-interaction error (e.g., MN12-L and MN15-L), while the main weakness is that none of them are state-of-the-art for the full spectrum of noncovalent interactions and isomerization energies (although M06-L is clearly the best from the four local Minnesota functionals). Finally, as an overall guide, M06-2X and MN15 are perhaps the most broadly useful hybrid Minnesota functionals, while M06-L and MN15-L are perhaps the most broadly useful local Minnesota functionals, although each has different strengths and weaknesses.« less
Use of the rVV10 Nonlocal Correlation Functional in the B97M-V Density Functional: Defining B97M-rV and Related Functionals [On the Use of the rVV10 Nonlocal Correlation Functional in the B97M-V Density Functional: Defining B97M-rV and Related Functionals

DOE Office of Scientific and Technical Information (OSTI.GOV)

Mardirossian, Narbe; Ruiz Pestana, Luis; Womack, James C.

The VV10 and rVV10 nonlocal correlation functionals are consistently implemented and assessed, with the goal of determining if the rVV10 nonlocal correlation functional can replace the VV10 nonlocal correlation functional in the recently developed B97M-V density functional, to give the B97M-rV density functional. Along the way, four density functionals are simultaneously tested: VV10, rVV10, B97M-V, and B97M-rV. An initial assessment is carried out across the S22 data set, and the short-range damping variable, b, is varied for all four density functionals in order to determine the sensitivity of the functionals to the empirical parameter. The results of this test indicatemore » that a value of b = 6 (fortuitously the same as that in B97M-V) is suitable for B97M-rV. The functionals are then compared across an extensive database of interaction energies, and it is demonstrated that B97M-rV either matches or outperforms B97M-V for all of the tests considered. Finally, the optimization of b across the S22 data set is extended to two range-separated hybrid density functionals, ωB97X-V and ωB97M-V, and a value of b = 6.2 is recommended for both ωB97X-rV and ωB97M-rV.« less
Use of the rVV10 Nonlocal Correlation Functional in the B97M-V Density Functional: Defining B97M-rV and Related Functionals [On the Use of the rVV10 Nonlocal Correlation Functional in the B97M-V Density Functional: Defining B97M-rV and Related Functionals

DOE PAGES

Mardirossian, Narbe; Ruiz Pestana, Luis; Womack, James C.; ...

2016-12-06

The VV10 and rVV10 nonlocal correlation functionals are consistently implemented and assessed, with the goal of determining if the rVV10 nonlocal correlation functional can replace the VV10 nonlocal correlation functional in the recently developed B97M-V density functional, to give the B97M-rV density functional. Along the way, four density functionals are simultaneously tested: VV10, rVV10, B97M-V, and B97M-rV. An initial assessment is carried out across the S22 data set, and the short-range damping variable, b, is varied for all four density functionals in order to determine the sensitivity of the functionals to the empirical parameter. The results of this test indicatemore » that a value of b = 6 (fortuitously the same as that in B97M-V) is suitable for B97M-rV. The functionals are then compared across an extensive database of interaction energies, and it is demonstrated that B97M-rV either matches or outperforms B97M-V for all of the tests considered. Finally, the optimization of b across the S22 data set is extended to two range-separated hybrid density functionals, ωB97X-V and ωB97M-V, and a value of b = 6.2 is recommended for both ωB97X-rV and ωB97M-rV.« less
VitisExpDB: a database resource for grape functional genomics.

PubMed

Doddapaneni, Harshavardhan; Lin, Hong; Walker, M Andrew; Yao, Jiqiang; Civerolo, Edwin L

2008-02-28

The family Vitaceae consists of many different grape species that grow in a range of climatic conditions. In the past few years, several studies have generated functional genomic information on different Vitis species and cultivars, including the European grape vine, Vitis vinifera. Our goal is to develop a comprehensive web data source for Vitaceae. VitisExpDB is an online MySQL-PHP driven relational database that houses annotated EST and gene expression data for V. vinifera and non-vinifera grape species and varieties. Currently, the database stores approximately 320,000 EST sequences derived from 8 species/hybrids, their annotation (BLAST top match) details and Gene Ontology based structured vocabulary. Putative homologs for each EST in other species and varieties along with information on their percent nucleotide identities, phylogenetic relationship and common primers can be retrieved. The database also includes information on probe sequence and annotation features of the high density 60-mer gene expression chip consisting of approximately 20,000 non-redundant set of ESTs. Finally, the database includes 14 processed global microarray expression profile sets. Data from 12 of these expression profile sets have been mapped onto metabolic pathways. A user-friendly web interface with multiple search indices and extensively hyperlinked result features that permit efficient data retrieval has been developed. Several online bioinformatics tools that interact with the database along with other sequence analysis tools have been added. In addition, users can submit their ESTs to the database. The developed database provides genomic resource to grape community for functional analysis of genes in the collection and for the grape genome annotation and gene function identification. The VitisExpDB database is available through our website http://cropdisease.ars.usda.gov/vitis_at/main-page.htm.
VitisExpDB: A database resource for grape functional genomics

PubMed Central

Doddapaneni, Harshavardhan; Lin, Hong; Walker, M Andrew; Yao, Jiqiang; Civerolo, Edwin L

2008-01-01

Background The family Vitaceae consists of many different grape species that grow in a range of climatic conditions. In the past few years, several studies have generated functional genomic information on different Vitis species and cultivars, including the European grape vine, Vitis vinifera. Our goal is to develop a comprehensive web data source for Vitaceae. Description VitisExpDB is an online MySQL-PHP driven relational database that houses annotated EST and gene expression data for V. vinifera and non-vinifera grape species and varieties. Currently, the database stores ~320,000 EST sequences derived from 8 species/hybrids, their annotation (BLAST top match) details and Gene Ontology based structured vocabulary. Putative homologs for each EST in other species and varieties along with information on their percent nucleotide identities, phylogenetic relationship and common primers can be retrieved. The database also includes information on probe sequence and annotation features of the high density 60-mer gene expression chip consisting of ~20,000 non-redundant set of ESTs. Finally, the database includes 14 processed global microarray expression profile sets. Data from 12 of these expression profile sets have been mapped onto metabolic pathways. A user-friendly web interface with multiple search indices and extensively hyperlinked result features that permit efficient data retrieval has been developed. Several online bioinformatics tools that interact with the database along with other sequence analysis tools have been added. In addition, users can submit their ESTs to the database. Conclusion The developed database provides genomic resource to grape community for functional analysis of genes in the collection and for the grape genome annotation and gene function identification. The VitisExpDB database is available through our website . PMID:18307813
OGRO: The Overview of functionally characterized Genes in Rice online database.

PubMed

Yamamoto, Eiji; Yonemaru, Jun-Ichi; Yamamoto, Toshio; Yano, Masahiro

2012-12-01

The high-quality sequence information and rich bioinformatics tools available for rice have contributed to remarkable advances in functional genomics. To facilitate the application of gene function information to the study of natural variation in rice, we comprehensively searched for articles related to rice functional genomics and extracted information on functionally characterized genes. As of 31 March 2012, 702 functionally characterized genes were annotated. This number represents about 1.6% of the predicted loci in the Rice Annotation Project Database. The compiled gene information is organized to facilitate direct comparisons with quantitative trait locus (QTL) information in the Q-TARO database. Comparison of genomic locations between functionally characterized genes and the QTLs revealed that QTL clusters were often co-localized with high-density gene regions, and that the genes associated with the QTLs in these clusters were different genes, suggesting that these QTL clusters are likely to be explained by tightly linked but distinct genes. Information on the functionally characterized genes compiled during this study is now available in the O verview of Functionally Characterized G enes in R ice O nline database (OGRO) on the Q-TARO website ( http://qtaro.abr.affrc.go.jp/ogro ). The database has two interfaces: a table containing gene information, and a genome viewer that allows users to compare the locations of QTLs and functionally characterized genes. OGRO on Q-TARO will facilitate a candidate-gene approach to identifying the genes responsible for QTLs. Because the QTL descriptions in Q-TARO contain information on agronomic traits, such comparisons will also facilitate the annotation of functionally characterized genes in terms of their effects on traits important for rice breeding. The increasing amount of information on rice gene function being generated from mutant panels and other types of studies will make the OGRO database even more valuable in the future.
High-throughput density-functional perturbation theory phonons for inorganic materials

NASA Astrophysics Data System (ADS)

Petretto, Guido; Dwaraknath, Shyam; P. C. Miranda, Henrique; Winston, Donald; Giantomassi, Matteo; van Setten, Michiel J.; Gonze, Xavier; Persson, Kristin A.; Hautier, Geoffroy; Rignanese, Gian-Marco

2018-05-01

The knowledge of the vibrational properties of a material is of key importance to understand physical phenomena such as thermal conductivity, superconductivity, and ferroelectricity among others. However, detailed experimental phonon spectra are available only for a limited number of materials, which hinders the large-scale analysis of vibrational properties and their derived quantities. In this work, we perform ab initio calculations of the full phonon dispersion and vibrational density of states for 1521 semiconductor compounds in the harmonic approximation based on density functional perturbation theory. The data is collected along with derived dielectric and thermodynamic properties. We present the procedure used to obtain the results, the details of the provided database and a validation based on the comparison with experimental data.
High-throughput ab-initio dilute solute diffusion database.

PubMed

Wu, Henry; Mayeshiba, Tam; Morgan, Dane

2016-07-19

We demonstrate automated generation of diffusion databases from high-throughput density functional theory (DFT) calculations. A total of more than 230 dilute solute diffusion systems in Mg, Al, Cu, Ni, Pd, and Pt host lattices have been determined using multi-frequency diffusion models. We apply a correction method for solute diffusion in alloys using experimental and simulated values of host self-diffusivity. We find good agreement with experimental solute diffusion data, obtaining a weighted activation barrier RMS error of 0.176 eV when excluding magnetic solutes in non-magnetic alloys. The compiled database is the largest collection of consistently calculated ab-initio solute diffusion data in the world.
The charger transfer electronic coupling in diabatic perspective: A multi-state density functional theory study

NASA Astrophysics Data System (ADS)

Guo, Xinwei; Qu, Zexing; Gao, Jiali

2018-01-01

The multi-state density functional theory (MSDFT) provides a convenient way to estimate electronic coupling of charge transfer processes based on a diabatic representation. Its performance has been benchmarked against the HAB11 database with a mean unsigned error (MUE) of 17 meV between MSDFT and ab initio methods. The small difference may be attributed to different representations, diabatic from MSDFT and adiabatic from ab initio calculations. In this discussion, we conclude that MSDFT provides a general and efficient way to estimate the electronic coupling for charge-transfer rate calculations based on the Marcus-Hush model.
Recent progress in density functional theory

NASA Astrophysics Data System (ADS)

Truhlar, Donald

2014-03-01

Ongoing work involves several areas of density functional theory: new methods for computing electronic excitation energies, including a new way to remove spin contamination in the spin-flip Tamm-Dancoff approximation and a configuration-interaction-corrected Tamm-Dancoff Approximation for treating conical intersections; new ways to treat open-shell states, including a reinterpreted broken-symmetry method and multi-configuration Kohn-Sham theory; a new exchange-correlation functional; new tests of density functional theory against databases for electronic transition energies and molecules and solids containing metal atoms; and applications. A selection of results will be presented. I am grateful to the following collaborators for contributions to the ongoing work: Boris Averkiev, Rebecca Carlson, Laura Fernandez, Laura Gagliardi, Chad Hoyer, Francesc Illas, Miho Isegawa, Shaohong Li, Giovanni Li Manni, Sijie Luo, Dongxia Ma, Remi Maurice, Rubén Means-Pañeda, Roberto Peverati, Nora Planas, Prasenjit Seal, Pragya Verma, Bo Wang, Xuefei Xu, Ke R. Yang, Haoyu Yu, Wenjing Zhang, and Jingjing Zheng. Supported in part by the AFOSR and U.S. DOE.
Column Store for GWAC: A High-cadence, High-density, Large-scale Astronomical Light Curve Pipeline and Distributed Shared-nothing Database

NASA Astrophysics Data System (ADS)

Wan, Meng; Wu, Chao; Wang, Jing; Qiu, Yulei; Xin, Liping; Mullender, Sjoerd; Mühleisen, Hannes; Scheers, Bart; Zhang, Ying; Nes, Niels; Kersten, Martin; Huang, Yongpan; Deng, Jinsong; Wei, Jianyan

2016-11-01

The ground-based wide-angle camera array (GWAC), a part of the SVOM space mission, will search for various types of optical transients by continuously imaging a field of view (FOV) of 5000 degrees2 every 15 s. Each exposure consists of 36 × 4k × 4k pixels, typically resulting in 36 × ˜175,600 extracted sources. For a modern time-domain astronomy project like GWAC, which produces massive amounts of data with a high cadence, it is challenging to search for short timescale transients in both real-time and archived data, and to build long-term light curves for variable sources. Here, we develop a high-cadence, high-density light curve pipeline (HCHDLP) to process the GWAC data in real-time, and design a distributed shared-nothing database to manage the massive amount of archived data which will be used to generate a source catalog with more than 100 billion records during 10 years of operation. First, we develop HCHDLP based on the column-store DBMS of MonetDB, taking advantage of MonetDB’s high performance when applied to massive data processing. To realize the real-time functionality of HCHDLP, we optimize the pipeline in its source association function, including both time and space complexity from outside the database (SQL semantic) and inside (RANGE-JOIN implementation), as well as in its strategy of building complex light curves. The optimized source association function is accelerated by three orders of magnitude. Second, we build a distributed database using a two-level time partitioning strategy via the MERGE TABLE and REMOTE TABLE technology of MonetDB. Intensive tests validate that our database architecture is able to achieve both linear scalability in response time and concurrent access by multiple users. In summary, our studies provide guidance for a solution to GWAC in real-time data processing and management of massive data.
Fingerprint Ridge Density as a Potential Forensic Anthropological Tool for Sex Identification.

PubMed

Dhall, Jasmine Kaur; Kapoor, Anup Kumar

2016-03-01

In cases of partial or poor print recovery and lack of database/suspect print, fingerprint evidence is generally neglected. In light of such constraints, this study was designed to examine whether ridge density can aid in narrowing down the investigation for sex identification. The study was conducted on the right-hand index digit of 245 males and 246 females belonging to the Punjabis of Delhi region. Five ridge density count areas, namely upper radial, radial, ulnar, upper ulnar, and proximal, were selected and designated. Probability of sex origin was calculated, and stepwise discriminant function analysis was performed to determine the discriminating ability of the selected areas. Females were observed with a significantly higher ridge density than males in all the five areas. Discriminant function analysis and logistic regression exhibited 96.8% and 97.4% accuracy, respectively, in sex identification. Hence, fingerprint ridge density is a potential tool for sex identification, even from partial prints. © 2015 American Academy of Forensic Sciences.

Electronic couplings for molecular charge transfer: Benchmarking CDFT, FODFT, and FODFTB against high-level ab initio calculations

NASA Astrophysics Data System (ADS)

Kubas, Adam; Hoffmann, Felix; Heck, Alexander; Oberhofer, Harald; Elstner, Marcus; Blumberger, Jochen

2014-03-01

We introduce a database (HAB11) of electronic coupling matrix elements (Hab) for electron transfer in 11 π-conjugated organic homo-dimer cations. High-level ab inito calculations at the multireference configuration interaction MRCI+Q level of theory, n-electron valence state perturbation theory NEVPT2, and (spin-component scaled) approximate coupled cluster model (SCS)-CC2 are reported for this database to assess the performance of three DFT methods of decreasing computational cost, including constrained density functional theory (CDFT), fragment-orbital DFT (FODFT), and self-consistent charge density functional tight-binding (FODFTB). We find that the CDFT approach in combination with a modified PBE functional containing 50% Hartree-Fock exchange gives best results for absolute Hab values (mean relative unsigned error = 5.3%) and exponential distance decay constants β (4.3%). CDFT in combination with pure PBE overestimates couplings by 38.7% due to a too diffuse excess charge distribution, whereas the economic FODFT and highly cost-effective FODFTB methods underestimate couplings by 37.6% and 42.4%, respectively, due to neglect of interaction between donor and acceptor. The errors are systematic, however, and can be significantly reduced by applying a uniform scaling factor for each method. Applications to dimers outside the database, specifically rotated thiophene dimers and larger acenes up to pentacene, suggests that the same scaling procedure significantly improves the FODFT and FODFTB results for larger π-conjugated systems relevant to organic semiconductors and DNA.
Opportune Landing Site CBR and Low-Density Laboratory Database

DTIC Science & Technology

2008-05-01

Program Opportune Landing Site CBR and Low- Density Laboratory Database Larry S. Danyluk, Sally A. Shoop, Rosa T. Affleck, and Wendy L. Wieder...Opportune Landing Site Program ERDC/CRREL TR-08-9 May 2008 Opportune Landing Site CBR and Low- Density Laboratory Database Larry S. Danyluk, Sally A...reproduce in-situ density , moisture, and CBR values and therefore do not accurately repre- sent the complete range of these values measured in the field
Survival of the most transferable at the top of Jacob's ladder: Defining and testing the ωB97M(2) double hybrid density functional

NASA Astrophysics Data System (ADS)

Mardirossian, Narbe; Head-Gordon, Martin

2018-06-01

A meta-generalized gradient approximation, range-separated double hybrid (DH) density functional with VV10 non-local correlation is presented. The final 14-parameter functional form is determined by screening trillions of candidate fits through a combination of best subset selection, forward stepwise selection, and random sample consensus (RANSAC) outlier detection. The MGCDB84 database of 4986 data points is employed in this work, containing a training set of 870 data points, a validation set of 2964 data points, and a test set of 1152 data points. Following an xDH approach, orbitals from the ωB97M-V density functional are used to compute the second-order perturbation theory correction. The resulting functional, ωB97M(2), is benchmarked against a variety of leading double hybrid density functionals, including B2PLYP-D3(BJ), B2GPPLYP-D3(BJ), ωB97X-2(TQZ), XYG3, PTPSS-D3(0), XYGJ-OS, DSD-PBEP86-D3(BJ), and DSD-PBEPBE-D3(BJ). Encouragingly, the overall performance of ωB97M(2) on nearly 5000 data points clearly surpasses that of all of the tested density functionals. As a Rung 5 density functional, ωB97M(2) completes our family of combinatorially optimized functionals, complementing B97M-V on Rung 3, and ωB97X-V and ωB97M-V on Rung 4. The results suggest that ωB97M(2) has the potential to serve as a powerful predictive tool for accurate and efficient electronic structure calculations of main-group chemistry.
High-throughput ab-initio dilute solute diffusion database

PubMed Central

Wu, Henry; Mayeshiba, Tam; Morgan, Dane

2016-01-01

We demonstrate automated generation of diffusion databases from high-throughput density functional theory (DFT) calculations. A total of more than 230 dilute solute diffusion systems in Mg, Al, Cu, Ni, Pd, and Pt host lattices have been determined using multi-frequency diffusion models. We apply a correction method for solute diffusion in alloys using experimental and simulated values of host self-diffusivity. We find good agreement with experimental solute diffusion data, obtaining a weighted activation barrier RMS error of 0.176 eV when excluding magnetic solutes in non-magnetic alloys. The compiled database is the largest collection of consistently calculated ab-initio solute diffusion data in the world. PMID:27434308
Negative Example Selection for Protein Function Prediction: The NoGO Database

PubMed Central

Youngs, Noah; Penfold-Brown, Duncan; Bonneau, Richard; Shasha, Dennis

2014-01-01

Negative examples – genes that are known not to carry out a given protein function – are rarely recorded in genome and proteome annotation databases, such as the Gene Ontology database. Negative examples are required, however, for several of the most powerful machine learning methods for integrative protein function prediction. Most protein function prediction efforts have relied on a variety of heuristics for the choice of negative examples. Determining the accuracy of methods for negative example prediction is itself a non-trivial task, given that the Open World Assumption as applied to gene annotations rules out many traditional validation metrics. We present a rigorous comparison of these heuristics, utilizing a temporal holdout, and a novel evaluation strategy for negative examples. We add to this comparison several algorithms adapted from Positive-Unlabeled learning scenarios in text-classification, which are the current state of the art methods for generating negative examples in low-density annotation contexts. Lastly, we present two novel algorithms of our own construction, one based on empirical conditional probability, and the other using topic modeling applied to genes and annotations. We demonstrate that our algorithms achieve significantly fewer incorrect negative example predictions than the current state of the art, using multiple benchmarks covering multiple organisms. Our methods may be applied to generate negative examples for any type of method that deals with protein function, and to this end we provide a database of negative examples in several well-studied organisms, for general use (The NoGO database, available at: bonneaulab.bio.nyu.edu/nogo.html). PMID:24922051
Computational Thermochemistry: Scale Factor Databases and Scale Factors for Vibrational Frequencies Obtained from Electronic Model Chemistries.

PubMed

Alecu, I M; Zheng, Jingjing; Zhao, Yan; Truhlar, Donald G

2010-09-14

Optimized scale factors for calculating vibrational harmonic and fundamental frequencies and zero-point energies have been determined for 145 electronic model chemistries, including 119 based on approximate functionals depending on occupied orbitals, 19 based on single-level wave function theory, three based on the neglect-of-diatomic-differential-overlap, two based on doubly hybrid density functional theory, and two based on multicoefficient correlation methods. Forty of the scale factors are obtained from large databases, which are also used to derive two universal scale factor ratios that can be used to interconvert between scale factors optimized for various properties, enabling the derivation of three key scale factors at the effort of optimizing only one of them. A reduced scale factor optimization model is formulated in order to further reduce the cost of optimizing scale factors, and the reduced model is illustrated by using it to obtain 105 additional scale factors. Using root-mean-square errors from the values in the large databases, we find that scaling reduces errors in zero-point energies by a factor of 2.3 and errors in fundamental vibrational frequencies by a factor of 3.0, but it reduces errors in harmonic vibrational frequencies by only a factor of 1.3. It is shown that, upon scaling, the balanced multicoefficient correlation method based on coupled cluster theory with single and double excitations (BMC-CCSD) can lead to very accurate predictions of vibrational frequencies. With a polarized, minimally augmented basis set, the density functionals with zero-point energy scale factors closest to unity are MPWLYP1M (1.009), τHCTHhyb (0.989), BB95 (1.012), BLYP (1.013), BP86 (1.014), B3LYP (0.986), MPW3LYP (0.986), and VSXC (0.986).
WHOLE-BODY VIBRATION EXERCISE IMPROVES FUNCTIONAL PARAMETERS IN PATIENTS WITH OSTEOGENESIS IMPERFECTA: A SYSTEMATIC REVIEW WITH A SUITABLE APPROACH.

PubMed

Sá-Caputo, Danubia C; Dionello, Carla da F; Frederico, Éric Heleno F F; Paineiras-Domingos, Laisa L; Sousa-Gonçalves, Cintia Renata; Morel, Danielle S; Moreira-Marconi, Eloá; Unger, Marianne; Bernardo-Filho, Mario

2017-01-01

Patients with osteogenesis imperfecta (OI) have abnormal bone modelling and resorption. The bone tissue adaptation and responsivity to dynamic and mechanical loading may be of therapeutic use under controlled circumstances. Improvements due to the wholebody vibration (WBV) exercises have been reported in strength, motion, gait, balance, posture and bone density in several osteopenic individuals, as in post-menopausal women or children with disabling conditions, as patients with OI. The aim of this investigation was to systematically analyse the current available literature to determine the effect of WBV exercises on functional parameters of OI patients. Three reviewers independently accessed bibliographical databases. Searches were performed in the PubMed, Scopus, Science Direct and PEDro databases using keywords related to possible interventions (including WBV) used in the management of patients with osteogenesis imperfecta . Three eligible studies were identified by searches in the analysed databases. It was concluded that WBV exercises could be an important option in the management of OI patients improving the mobility and functional parameters. However, further studies are necessary for establishing suitable protocols for these patients.
A Database of COBE-normalized Cold Dark Matter Simulations

NASA Astrophysics Data System (ADS)

Martel, Hugo; Matzner, Richard

2000-02-01

We have simulated the formation and evolution of large-scale structure in the universe, for 68 different COBE-normalized cosmological models. For each cosmological model, we have performed between one and three simulations, for a total of 160 simulations. This constitutes the largest database of cosmological simulations ever assembled, and the largest cosmological parameter space ever covered by such simulations. We are making this database available to the astronomical community. We provide instructions for accessing the database and for converting the data from computational units to physical units. The database includes tilted cold dark matter (TCDM) models, tilted open cold dark matter (TOCDM) models, and tilted Λ cold dark matter (TΛCDM) models. (For several simulations, the primordial exponent n of the power spectrum is near unity, hence these simulations can be considered as ``untilted.'') The simulations cover a four-dimensional cosmological parameter phase space, the parameters being the present density parameter Ω0, cosmological constant λ0, and Hubble constant H0, and the rms density fluctuation σ8 at scale 8 h-1 Mpc. All simulations were performed using a P3M algorithm with 643 particles on a 1283 mesh, in a cubic volume of comoving size 128 Mpc. Each simulation starts at a redshift of 24 and is carried up to the present. More simulations will be added to the database in the future. We have performed a limited amount of data reduction and analysis of the final states of the simulations. We computed the rms density fluctuation, the two-point correlation function, the velocity moments, and the properties of clusters. Our results are the following:1. The numerical value σnum8 of the rms density fluctuation differs from the value σcont8 obtained by integrating the power spectrum at early times and extrapolating linearly up the present. This results from the combined effects of discreteness in the numerical representation of the power spectrum, the presence of a Gaussian factor in the initial conditions, and late-time nonlinear evolution. The first of these three effects is negligible. The second and third are comparable, and can both modify the value of σ8 by up to 10%. Nonlinear effects, however, are important only for models with σ8>0.6, and can result in either an increase or a decrease in σ8.2. The observed galaxy two-point correlation function is well reproduced (assuming an unbiased relation between galaxies and mass) by models with σ8~0.8, nearly independently of the values of the other parameters, Ω0, λ0, and H0. For models with σ8>0.8, the correlation function is too large and its slope is too steep. For models with σ8<0.8, the correlation function is too small and its slope is too shallow.3. At small separations, r<1 Mpc, the velocity moments indicate that small clusters have reached virial equilibrium, while still accreting matter from the field. The velocity moments depend essentially upon Ω0 and σ8, and not λ0 and H0. The pairwise particle velocity dispersions are much larger than the observed pairwise galaxy velocity dispersion, for nearly all models. Velocity bias between galaxies and dark matter is needed to reconcile the simulations with observations.4. The cluster multiplicity function is decreasing for models with σ8~0.3. It has a horizontal plateau for models with σ8 in the range 0.4-0.9. For models with σ8>0.9, it has a U shape, which is probably a numerical artifact caused by the finite number of particles used in the simulations. For all models, clusters have densities in the range 100-1000 times the mean background density, the spin parameters λ are in the range 0.008-0.2, with the median near 0.05, and about 2/3 of the clusters are prolate. Rotationally supported disks do not form in these simulations.
Research Update: The materials genome initiative: Data sharing and the impact of collaborative ab initio databases

DOE PAGES

Jain, Anubhav; Persson, Kristin A.; Ceder, Gerbrand

2016-03-24

Materials innovations enable new technological capabilities and drive major societal advancements but have historically required long and costly development cycles. The Materials Genome Initiative (MGI) aims to greatly reduce this time and cost. Here, we focus on data reuse in the MGI and, in particular, discuss the impact of three different computational databases based on density functional theory methods to the research community. Finally, we discuss and provide recommendations on technical aspects of data reuse, outline remaining fundamental challenges, and present an outlook on the future of MGI's vision of data sharing.
Materials Databases Infrastructure Constructed by First Principles Calculations: A Review

DOE PAGES

Lin, Lianshan

2015-10-13

The First Principles calculations, especially the calculation based on High-Throughput Density Functional Theory, have been widely accepted as the major tools in atom scale materials design. The emerging super computers, along with the powerful First Principles calculations, have accumulated hundreds of thousands of crystal and compound records. The exponential growing of computational materials information urges the development of the materials databases, which not only provide unlimited storage for the daily increasing data, but still keep the efficiency in data storage, management, query, presentation and manipulation. This review covers the most cutting edge materials databases in materials design, and their hotmore » applications such as in fuel cells. By comparing the advantages and drawbacks of these high-throughput First Principles materials databases, the optimized computational framework can be identified to fit the needs of fuel cell applications. The further development of high-throughput DFT materials database, which in essence accelerates the materials innovation, is discussed in the summary as well.« less
Computational Thermochemistry of Jet Fuels and Rocket Propellants

NASA Technical Reports Server (NTRS)

Crawford, T. Daniel

2002-01-01

The design of new high-energy density molecules as candidates for jet and rocket fuels is an important goal of modern chemical thermodynamics. The NASA Glenn Research Center is home to a database of thermodynamic data for over 2000 compounds related to this goal, in the form of least-squares fits of heat capacities, enthalpies, and entropies as functions of temperature over the range of 300 - 6000 K. The chemical equilibrium with applications (CEA) program written and maintained by researchers at NASA Glenn over the last fifty years, makes use of this database for modeling the performance of potential rocket propellants. During its long history, the NASA Glenn database has been developed based on experimental results and data published in the scientific literature such as the standard JANAF tables. The recent development of efficient computational techniques based on quantum chemical methods provides an alternative source of information for expansion of such databases. For example, it is now possible to model dissociation or combustion reactions of small molecules to high accuracy using techniques such as coupled cluster theory or density functional theory. Unfortunately, the current applicability of reliable computational models is limited to relatively small molecules containing only around a dozen (non-hydrogen) atoms. We propose to extend the applicability of coupled cluster theory- often referred to as the 'gold standard' of quantum chemical methods- to molecules containing 30-50 non-hydrogen atoms. The centerpiece of this work is the concept of local correlation, in which the description of the electron interactions- known as electron correlation effects- are reduced to only their most important localized components. Such an advance has the potential to greatly expand the current reach of computational thermochemistry and thus to have a significant impact on the theoretical study of jet and rocket propellants.
Relating polarizability to volume, ionization energy, electronegativity, hardness, moments of momentum, and other molecular properties

DOE Office of Scientific and Technical Information (OSTI.GOV)

Blair, Shamus A.; Thakkar, Ajit J., E-mail: ajit@unb.ca

2014-08-21

Semiquantitative relationships between the mean static dipole polarizability and other molecular properties such as the volume, ionization energy, electronegativity, hardness, and moments of momentum are explored. The relationships are tested using density functional theory computations on the 1641 neutral, ground-state, organic molecules in the TABS database. The best polarizability approximations have median errors under 5%.
Relating polarizability to volume, ionization energy, electronegativity, hardness, moments of momentum, and other molecular properties.

PubMed

Blair, Shamus A; Thakkar, Ajit J

2014-08-21

Semiquantitative relationships between the mean static dipole polarizability and other molecular properties such as the volume, ionization energy, electronegativity, hardness, and moments of momentum are explored. The relationships are tested using density functional theory computations on the 1641 neutral, ground-state, organic molecules in the TABS database. The best polarizability approximations have median errors under 5%.
WHOLE-BODY VIBRATION EXERCISE IMPROVES FUNCTIONAL PARAMETERS IN PATIENTS WITH OSTEOGENESIS IMPERFECTA: A SYSTEMATIC REVIEW WITH A SUITABLE APPROACH

PubMed Central

Sá-Caputo, Danubia C; Dionello, Carla da F; Frederico, Éric Heleno F. F; Paineiras-Domingos, Laisa L; Sousa-Gonçalves, Cintia Renata; Morel, Danielle S; Moreira-Marconi, Eloá; Unger, Marianne; Bernardo-Filho, Mario

2017-01-01

Background: Patients with osteogenesis imperfecta (OI) have abnormal bone modelling and resorption. The bone tissue adaptation and responsivity to dynamic and mechanical loading may be of therapeutic use under controlled circumstances. Improvements due to the wholebody vibration (WBV) exercises have been reported in strength, motion, gait, balance, posture and bone density in several osteopenic individuals, as in post-menopausal women or children with disabling conditions, as patients with OI. The aim of this investigation was to systematically analyse the current available literature to determine the effect of WBV exercises on functional parameters of OI patients. Materials and methods: Three reviewers independently accessed bibliographical databases. Searches were performed in the PubMed, Scopus, Science Direct and PEDro databases using keywords related to possible interventions (including WBV) used in the management of patients with osteogenesis imperfecta. Results: Three eligible studies were identified by searches in the analysed databases. Conclusion: It was concluded that WBV exercises could be an important option in the management of OI patients improving the mobility and functional parameters. However, further studies are necessary for establishing suitable protocols for these patients. PMID:28480432
Very large database of lipids: rationale and design.

PubMed

Martin, Seth S; Blaha, Michael J; Toth, Peter P; Joshi, Parag H; McEvoy, John W; Ahmed, Haitham M; Elshazly, Mohamed B; Swiger, Kristopher J; Michos, Erin D; Kwiterovich, Peter O; Kulkarni, Krishnaji R; Chimera, Joseph; Cannon, Christopher P; Blumenthal, Roger S; Jones, Steven R

2013-11-01

Blood lipids have major cardiovascular and public health implications. Lipid-lowering drugs are prescribed based in part on categorization of patients into normal or abnormal lipid metabolism, yet relatively little emphasis has been placed on: (1) the accuracy of current lipid measures used in clinical practice, (2) the reliability of current categorizations of dyslipidemia states, and (3) the relationship of advanced lipid characterization to other cardiovascular disease biomarkers. To these ends, we developed the Very Large Database of Lipids (NCT01698489), an ongoing database protocol that harnesses deidentified data from the daily operations of a commercial lipid laboratory. The database includes individuals who were referred for clinical purposes for a Vertical Auto Profile (Atherotech Inc., Birmingham, AL), which directly measures cholesterol concentrations of low-density lipoprotein, very low-density lipoprotein, intermediate-density lipoprotein, high-density lipoprotein, their subclasses, and lipoprotein(a). Individual Very Large Database of Lipids studies, ranging from studies of measurement accuracy, to dyslipidemia categorization, to biomarker associations, to characterization of rare lipid disorders, are investigator-initiated and utilize peer-reviewed statistical analysis plans to address a priori hypotheses/aims. In the first database harvest (Very Large Database of Lipids 1.0) from 2009 to 2011, there were 1 340 614 adult and 10 294 pediatric patients; the adult sample had a median age of 59 years (interquartile range, 49-70 years) with even representation by sex. Lipid distributions closely matched those from the population-representative National Health and Nutrition Examination Survey. The second harvest of the database (Very Large Database of Lipids 2.0) is underway. Overall, the Very Large Database of Lipids database provides an opportunity for collaboration and new knowledge generation through careful examination of granular lipid data on a large scale. © 2013 Wiley Periodicals, Inc.
Wood anatomical correlates with theoretical conductivity and wood density across China: evolutionary evidence of the functional differentiation of axial and radial parenchyma

PubMed Central

Zheng, Jingming; Martínez-Cabrera, Hugo I.

2013-01-01

Background and Aims In recent years considerable effort has focused on linking wood anatomy and key ecological traits. Studies analysing large databases have described how these ecological traits vary as a function of wood anatomical traits related to conduction and support, but have not considered how these functions interact with cells involved in storage of water and carbohydrates (i.e. parenchyma cells). Methods We analyzed, in a phylogenetic context, the functional relationship between cell types performing each of the three xylem functions (conduction, support and storage) and wood density and theoretical conductivity using a sample of approx. 800 tree species from China. Key Results Axial parenchyma and rays had distinct evolutionary correlation patterns. An evolutionary link was found between high conduction capacity and larger amounts of axial parenchyma that is probably related to water storage capacity and embolism repair, while larger amounts of ray tissue have evolved with increased mechanical support and reduced hydraulic capacity. In a phylogenetic principal component analysis this association of axial parenchyma with increased conduction capacity and rays with wood density represented orthogonal axes of variation. In multivariate space, however, the proportion of rays might be positively associated with conductance and negatively with wood density, indicating flexibility in these axes in species with wide rays. Conclusions The findings suggest that parenchyma types may differ in function. The functional axes represented by different cell types were conserved across lineages, suggesting a significant role in the ecological strategies of the angiosperms. PMID:23904446
Overview and evaluation of different nuclear level density models for the 123I radionuclide production.

PubMed

Nikjou, A; Sadeghi, M

2018-06-01

The 123 I radionuclide (T 1/2 = 13.22 h, β+ = 100%) is one of the most potent gamma emitters for nuclear medicine. In this study, the cyclotron production of this radionuclide via different nuclear reactions namely, the 121 Sb(α,2n), 122 Te(d,n), 123 Te(p,n), 124 Te(p,2n), 124 Xe(p,2n), 127 I(p,5n) and 127 I(d,6n) were investigated. The effect of the various phenomenological nuclear level density models such as Fermi gas model (FGM), Back-shifted Fermi gas model (BSFGM), Generalized superfluid model (GSM) and Enhanced generalized superfluid model (EGSM) moreover, the three microscopic level density models were evaluated for predicting of cross sections and production yield predictions. The SRIM code was used to obtain the target thickness. The 123 I excitation function of reactions were calculated by using of the TALYS-1.8, EMPIRE-3.2 nuclear codes and with data which taken from TENDL-2015 database, and finally the theoretical calculations were compared with reported experimental measurements in which taken from EXFOR database. Copyright © 2018 Elsevier Ltd. All rights reserved.
An actuarial approach to retrofit savings in buildings

DOE Office of Scientific and Technical Information (OSTI.GOV)

Subbarao, Krishnappa; Etingov, Pavel V.; Reddy, T. A.

An actuarial method has been developed for determining energy savings from retrofits from energy use data for a number of buildings. This method should be contrasted with the traditional method of using pre- and post-retrofit data on the same building. This method supports the U.S. Department of Energy Building Performance Database of real building performance data and related tools that enable engineering and financial practitioners to evaluate retrofits. The actuarial approach derives, from the database, probability density functions (PDFs) for energy savings from retrofits by creating peer groups for the user’s pre post buildings. From the energy use distribution ofmore » the two groups, the savings PDF is derived. This provides the basis for engineering analysis as well as financial risk analysis leading to investment decisions. Several technical issues are addressed: The savings PDF is obtained from the pre- and post-PDF through a convolution. Smoothing using kernel density estimation is applied to make the PDF more realistic. The low data density problem can be mitigated through a neighborhood methodology. Correlations between pre and post buildings are addressed to improve the savings PDF. Sample size effects are addressed through the Kolmogorov--Smirnov tests and quantile-quantile plots.« less
Interactive Database of Pulsar Flux Density Measurements

NASA Astrophysics Data System (ADS)

Koralewska, O.; Krzeszowski, K.; Kijak, J.; Lewandowski, W.

2012-12-01

The number of astronomical observations is steadily growing, giving rise to the need of cataloguing the obtained results. There are a lot of databases, created to store different types of data and serve a variety of purposes, e. g. databases providing basic data for astronomical objects (SIMBAD Astronomical Database), databases devoted to one type of astronomical object (ATNF Pulsar Database) or to a set of values of the specific parameter (Lorimer 1995 - database of flux density measurements for 280 pulsars on the frequencies up to 1606 MHz), etc. We found that creating an online database of pulsar flux measurements, provided with facilities for plotting diagrams and histograms, calculating mean values for a chosen set of data, filtering parameter values and adding new measurements by the registered users, could be useful in further studies on pulsar spectra.
Bias-Free Chemically Diverse Test Sets from Machine Learning.

PubMed

Swann, Ellen T; Fernandez, Michael; Coote, Michelle L; Barnard, Amanda S

2017-08-14

Current benchmarking methods in quantum chemistry rely on databases that are built using a chemist's intuition. It is not fully understood how diverse or representative these databases truly are. Multivariate statistical techniques like archetypal analysis and K-means clustering have previously been used to summarize large sets of nanoparticles however molecules are more diverse and not as easily characterized by descriptors. In this work, we compare three sets of descriptors based on the one-, two-, and three-dimensional structure of a molecule. Using data from the NIST Computational Chemistry Comparison and Benchmark Database and machine learning techniques, we demonstrate the functional relationship between these structural descriptors and the electronic energy of molecules. Archetypes and prototypes found with topological or Coulomb matrix descriptors can be used to identify smaller, statistically significant test sets that better capture the diversity of chemical space. We apply this same method to find a diverse subset of organic molecules to demonstrate how the methods can easily be reapplied to individual research projects. Finally, we use our bias-free test sets to assess the performance of density functional theory and quantum Monte Carlo methods.

The GermOnline cross-species systems browser provides comprehensive information on genes and gene products relevant for sexual reproduction.

PubMed

Gattiker, Alexandre; Niederhauser-Wiederkehr, Christa; Moore, James; Hermida, Leandro; Primig, Michael

2007-01-01

We report a novel release of the GermOnline knowledgebase covering genes relevant for the cell cycle, gametogenesis and fertility. GermOnline was extended into a cross-species systems browser including information on DNA sequence annotation, gene expression and the function of gene products. The database covers eight model organisms and Homo sapiens, for which complete genome annotation data are available. The database is now built around a sophisticated genome browser (Ensembl), our own microarray information management and annotation system (MIMAS) used to extensively describe experimental data obtained with high-density oligonucleotide microarrays (GeneChips) and a comprehensive system for online editing of database entries (MediaWiki). The RNA data include results from classical microarrays as well as tiling arrays that yield information on RNA expression levels, transcript start sites and lengths as well as exon composition. Members of the research community are solicited to help GermOnline curators keep database entries on genes and gene products complete and accurate. The database is accessible at http://www.germonline.org/.
An Automated Ab Initio Framework for Identifying New Ferroelectrics

NASA Astrophysics Data System (ADS)

Smidt, Tess; Reyes-Lillo, Sebastian E.; Jain, Anubhav; Neaton, Jeffrey B.

Ferroelectric materials have a wide-range of technological applications including non-volatile RAM and optoelectronics. In this work, we present an automated first-principles search for ferroelectrics. We integrate density functional theory, crystal structure databases, symmetry tools, workflow software, and a custom analysis toolkit to build a library of known and proposed ferroelectrics. We screen thousands of candidates using symmetry relations between nonpolar and polar structure pairs. We use two search strategies 1) polar-nonpolar pairs with the same composition and 2) polar-nonpolar structure type pairs. Results are automatically parsed, stored in a database, and accessible via a web interface showing distortion animations and plots of polarization and total energy as a function of distortion. We benchmark our results against experimental data, present new ferroelectric candidates found through our search, and discuss future work on expanding this search methodology to other material classes such as anti-ferroelectrics and multiferroics.
Reference models for thermospheric NO

NASA Technical Reports Server (NTRS)

Barth, C. A.

1989-01-01

Nitric oxide has been measured with an ultraviolet spectrometer on the polar-orbiting satellite Solar Mesosphere Explorer (SME) for the period January 1982 to August 1986. The nitric oxide database contains densities at all latitudes sorted into 5 degree bins and at altitudes between 100 and 140 km sorted into 3.3 km-bins. The largest densities occur at latitudes in the auroral zones where the density varies as a function of geomagnetic activity. Variations of a factor of 10 occur between times of intense activity and quiet times. At low latitudes, the nitric oxide density at 110 km varies from a mean value of 3 times 10(exp 7) molecules per cubic cm in January 1982 to a mean value of 4 times 10(exp 6) molecules per cubic cm during solar minimum conditions in 1986. In addition, the low-latitude nitric oxide density varies plus or minus 50 percent with a period of 27 days during times of high solar activity.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Baer, M.R.; Hobbs, M.L.; McGee, B.C.

Exponential-13,6 (EXP-13,6) potential pammeters for 750 gases composed of 48 elements were determined and assembled in a database, referred to as the JCZS database, for use with the Jacobs Cowperthwaite Zwisler equation of state (JCZ3-EOS)~l) The EXP- 13,6 force constants were obtained by using literature values of Lennard-Jones (LJ) potential functions, by using corresponding states (CS) theory, by matching pure liquid shock Hugoniot data, and by using molecular volume to determine the approach radii with the well depth estimated from high-pressure isen- tropes. The JCZS database was used to accurately predict detonation velocity, pressure, and temperature for 50 dif- 3more » Accurate predictions were also ferent explosives with initial densities ranging from 0.25 glcm3 to 1.97 g/cm . obtained for pure liquid shock Hugoniots, static properties of nitrogen, and gas detonations at high initial pressures.« less
Combined use of computational chemistry and chemoinformatics methods for chemical discovery

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sugimoto, Manabu, E-mail: sugimoto@kumamoto-u.ac.jp; Institute for Molecular Science, 38 Nishigo-Naka, Myodaiji, Okazaki 444-8585; CREST, Japan Science and Technology Agency, 4-1-8 Honcho, Kawaguchi, Saitama 332-0012

2015-12-31

Data analysis on numerical data by the computational chemistry calculations is carried out to obtain knowledge information of molecules. A molecular database is developed to systematically store chemical, electronic-structure, and knowledge-based information. The database is used to find molecules related to a keyword of “cancer”. Then the electronic-structure calculations are performed to quantitatively evaluate quantum chemical similarity of the molecules. Among the 377 compounds registered in the database, 24 molecules are found to be “cancer”-related. This set of molecules includes both carcinogens and anticancer drugs. The quantum chemical similarity analysis, which is carried out by using numerical results of themore » density-functional theory calculations, shows that, when some energy spectra are referred to, carcinogens are reasonably distinguished from the anticancer drugs. Therefore these spectral properties are considered of as important measures for classification.« less
A database to enable discovery and design of piezoelectric materials

PubMed Central

de Jong, Maarten; Chen, Wei; Geerlings, Henry; Asta, Mark; Persson, Kristin Aslaug

2015-01-01

Piezoelectric materials are used in numerous applications requiring a coupling between electrical fields and mechanical strain. Despite the technological importance of this class of materials, for only a small fraction of all inorganic compounds which display compatible crystallographic symmetry, has piezoelectricity been characterized experimentally or computationally. In this work we employ first-principles calculations based on density functional perturbation theory to compute the piezoelectric tensors for nearly a thousand compounds, thereby increasing the available data for this property by more than an order of magnitude. The results are compared to select experimental data to establish the accuracy of the calculated properties. The details of the calculations are also presented, along with a description of the format of the database developed to make these computational results publicly available. In addition, the ways in which the database can be accessed and applied in materials development efforts are described. PMID:26451252
Expert Knowledge-Based Automatic Sleep Stage Determination by Multi-Valued Decision Making Method

NASA Astrophysics Data System (ADS)

Wang, Bei; Sugi, Takenao; Kawana, Fusae; Wang, Xingyu; Nakamura, Masatoshi

In this study, an expert knowledge-based automatic sleep stage determination system working on a multi-valued decision making method is developed. Visual inspection by a qualified clinician is adopted to obtain the expert knowledge database. The expert knowledge database consists of probability density functions of parameters for various sleep stages. Sleep stages are determined automatically according to the conditional probability. Totally, four subjects were participated. The automatic sleep stage determination results showed close agreements with the visual inspection on sleep stages of awake, REM (rapid eye movement), light sleep and deep sleep. The constructed expert knowledge database reflects the distributions of characteristic parameters which can be adaptive to variable sleep data in hospitals. The developed automatic determination technique based on expert knowledge of visual inspection can be an assistant tool enabling further inspection of sleep disorder cases for clinical practice.
Estimation of vegetation cover at subpixel resolution using LANDSAT data

NASA Technical Reports Server (NTRS)

Jasinski, Michael F.; Eagleson, Peter S.

1986-01-01

The present report summarizes the various approaches relevant to estimating canopy cover at subpixel resolution. The approaches are based on physical models of radiative transfer in non-homogeneous canopies and on empirical methods. The effects of vegetation shadows and topography are examined. Simple versions of the model are tested, using the Taos, New Mexico Study Area database. Emphasis has been placed on using relatively simple models requiring only one or two bands. Although most methods require some degree of ground truth, a two-band method is investigated whereby the percent cover can be estimated without ground truth by examining the limits of the data space. Future work is proposed which will incorporate additional surface parameters into the canopy cover algorithm, such as topography, leaf area, or shadows. The method involves deriving a probability density function for the percent canopy cover based on the joint probability density function of the observed radiances.
Compatibility between livestock databases used for quantitative biosecurity response in New Zealand.

PubMed

Jewell, C P; van Andel, M; Vink, W D; McFadden, A M J

2016-05-01

To characterise New Zealand's livestock biosecurity databases, and investigate their compatibility and capacity to provide a single integrated data source for quantitative outbreak analysis. Contemporary snapshots of the data in three national livestock biosecurity databases, AgriBase, FarmsOnLine (FOL) and the National Animal Identification and Tracing Scheme (NAIT), were obtained on 16 September, 1 September and 30 April 2014, respectively, and loaded into a relational database. A frequency table of animal numbers per farm was calculated for the AgriBase and FOL datasets. A two dimensional kernel density estimate was calculated for farms reporting the presence of cattle, pigs, deer, and small ruminants in each database and the ratio of farm densities for AgriBase versus FOL calculated. The extent to which records in the three databases could be matched and linked was quantified, and the level of agreement amongst them for the presence of different species on properties assessed using Cohen's kappa statistic. AgriBase contained fewer records than FOL, but recorded animal numbers present on each farm, whereas FOL contained more records, but captured only presence/absence of animals. The ratio of farm densities in AgriBase relative to FOL for pigs and deer was reasonably homogeneous across New Zealand, with AgriBase having a farm density approximately 80% of FOL. For cattle and small ruminants, there was considerable heterogeneity, with AgriBase showing a density of cattle farms in the Central Otago region that was 20% of FOL, and a density of small ruminant farms in the central West Coast area that was twice that of FOL. Only 37% of records in FOL could be linked to AgriBase, but the level of agreement for the presence of different species between these databases was substantial (kappa>0.6). Both NAIT and FOL shared common farm identifiers which could be used to georeference animal movements, and there was a fair to substantial agreement (kappa 0.32-0.69) between these databases for the presence of cattle and deer on properties. The three databases broadly agreed with each other, but important differences existed in both species composition and spatial coverage which raises concern over their accuracy. Importantly, they cannot be reliably linked together to provide a single picture of New Zealand's livestock industry, limiting the ability to use advanced quantitative techniques to provide effective decision support during disease outbreaks. We recommend that a single integrated database be developed, with alignment of resources and legislation for its upkeep.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Brown, S

A database was generated of estimates of geographically referenced carbon densities of forest vegetation in tropical Southeast Asia for 1980. A geographic information system (GIS) was used to incorporate spatial databases of climatic, edaphic, and geomorphological indices and vegetation to estimate potential (i.e., in the absence of human intervention and natural disturbance) carbon densities of forests. The resulting map was then modified to estimate actual 1980 carbon density as a function of population density and climatic zone. The database covers the following 13 countries: Bangladesh, Brunei, Cambodia (Campuchea), India, Indonesia, Laos, Malaysia, Myanmar (Burma), Nepal, the Philippines, Sri Lanka, Thailand,more » and Vietnam. The data sets within this database are provided in three file formats: ARC/INFOTM exported integer grids, ASCII (American Standard Code for Information Interchange) files formatted for raster-based GIS software packages, and generic ASCII files with x, y coordinates for use with non-GIS software packages. This database includes ten ARC/INFO exported integer grid files (five with the pixel size 3.75 km x 3.75 km and five with the pixel size 0.25 degree longitude x 0.25 degree latitude) and 27 ASCII files. The first ASCII file contains the documentation associated with this database. Twenty-four of the ASCII files were generated by means of the ARC/INFO GRIDASCII command and can be used by most raster-based GIS software packages. The 24 files can be subdivided into two groups of 12 files each. These files contain real data values representing actual carbon and potential carbon density in Mg C/ha (1 megagram = 10{sup 6} grams) and integer-coded values for country name, Weck's Climatic Index, ecofloristic zone, elevation, forest or non-forest designation, population density, mean annual precipitation, slope, soil texture, and vegetation classification. One set of 12 files contains these data at a spatial resolution of 3.75 km, whereas the other set of 12 files has a spatial resolution of 0.25 degree. The remaining two ASCII data files combine all of the data from the 24 ASCII data files into 2 single generic data files. The first file has a spatial resolution of 3.75 km, and the second has a resolution of 0.25 degree. Both files also provide a grid-cell identification number and the longitude and latitude of the center-point of each grid cell. The 3.75-km data in this numeric data package yield an actual total carbon estimate of 42.1 Pg (1 petagram = 10{sup 15} grams) and a potential carbon estimate of 73.6 Pg; whereas the 0.25-degree data produced an actual total carbon estimate of 41.8 Pg and a total potential carbon estimate of 73.9 Pg. Fortran and SAS{trademark} access codes are provided to read the ASCII data files, and ARC/INFO and ARCVIEW command syntax are provided to import the ARC/INFO exported integer grid files. The data files and this documentation are available without charge on a variety of media and via the Internet from the Carbon Dioxide Information Analysis Center (CDIAC).« less
DOE Office of Scientific and Technical Information (OSTI.GOV)

Brown, S.

A database was generated of estimates of geographically referenced carbon densities of forest vegetation in tropical Southeast Asia for 1980. A geographic information system (GIS) was used to incorporate spatial databases of climatic, edaphic, and geomorphological indices and vegetation to estimate potential (i.e., in the absence of human intervention and natural disturbance) carbon densities of forests. The resulting map was then modified to estimate actual 1980 carbon density as a function of population density and climatic zone. The database covers the following 13 countries: Bangladesh, Brunei, Cambodia (Campuchea), India, Indonesia, Laos, Malaysia, Myanmar (Burma), Nepal, the Philippines, Sri Lanka, Thailand,more » and Vietnam. The data sets within this database are provided in three file formats: ARC/INFO{trademark} exported integer grids, ASCII (American Standard Code for Information Interchange) files formatted for raster-based GIS software packages, and generic ASCII files with x, y coordinates for use with non-GIS software packages. This database includes ten ARC/INFO exported integer grid files (five with the pixel size 3.75 km x 3.75 km and five with the pixel size 0.25 degree longitude x 0.25 degree latitude) and 27 ASCII files. The first ASCII file contains the documentation associated with this database. Twenty-four of the ASCII files were generated by means of the ARC/INFO GRIDASCII command and can be used by most raster-based GIS software packages. The 24 files can be subdivided into two groups of 12 files each. These files contain real data values representing actual carbon and potential carbon density in Mg C/ha (1 megagram = 10{sup 6} grams) and integer- coded values for country name, Weck's Climatic Index, ecofloristic zone, elevation, forest or non-forest designation, population density, mean annual precipitation, slope, soil texture, and vegetation classification. One set of 12 files contains these data at a spatial resolution of 3.75 km, whereas the other set of 12 files has a spatial resolution of 0.25 degree. The remaining two ASCII data files combine all of the data from the 24 ASCII data files into 2 single generic data files. The first file has a spatial resolution of 3.75 km, and the second has a resolution of 0.25 degree. Both files also provide a grid-cell identification number and the longitude and latitude of the centerpoint of each grid cell. The 3.75-km data in this numeric data package yield an actual total carbon estimate of 42.1 Pg (1 petagram = 10{sup 15} grams) and a potential carbon estimate of 73.6 Pg; whereas the 0.25-degree data produced an actual total carbon estimate of 41.8 Pg and a total potential carbon estimate of 73.9 Pg. Fortran and SASTM access codes are provided to read the ASCII data files, and ARC/INFO and ARCVIEW command syntax are provided to import the ARC/INFO exported integer grid files. The data files and this documentation are available without charge on a variety of media and via the Internet from the Carbon Dioxide Information Analysis Center (CDIAC).« less
Variation in trait trade-offs allows differentiation among predefined plant functional types: implications for predictive ecology.

PubMed

Verheijen, Lieneke M; Aerts, Rien; Bönisch, Gerhard; Kattge, Jens; Van Bodegom, Peter M

2016-01-01

Plant functional types (PFTs) aggregate the variety of plant species into a small number of functionally different classes. We examined to what extent plant traits, which reflect species' functional adaptations, can capture functional differences between predefined PFTs and which traits optimally describe these differences. We applied Gaussian kernel density estimation to determine probability density functions for individual PFTs in an n-dimensional trait space and compared predicted PFTs with observed PFTs. All possible combinations of 1-6 traits from a database with 18 different traits (total of 18 287 species) were tested. A variety of trait sets had approximately similar performance, and 4-5 traits were sufficient to classify up to 85% of the species into PFTs correctly, whereas this was 80% for a bioclimatically defined tree PFT classification. Well-performing trait sets included combinations of correlated traits that are considered functionally redundant within a single plant strategy. This analysis quantitatively demonstrates how structural differences between PFTs are reflected in functional differences described by particular traits. Differentiation between PFTs is possible despite large overlap in plant strategies and traits, showing that PFTs are differently positioned in multidimensional trait space. This study therefore provides the foundation for important applications for predictive ecology. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
Comparisons of thermospheric density data sets and models

NASA Astrophysics Data System (ADS)

Doornbos, Eelco; van Helleputte, Tom; Emmert, John; Drob, Douglas; Bowman, Bruce R.; Pilinski, Marcin

During the past decade, continuous long-term data sets of thermospheric density have become available to researchers. These data sets have been derived from accelerometer measurements made by the CHAMP and GRACE satellites and from Space Surveillance Network (SSN) tracking data and related Two-Line Element (TLE) sets. These data have already resulted in a large number of publications on physical interpretation and improvement of empirical density modelling. This study compares four different density data sets and two empirical density models, for the period 2002-2009. These data sources are the CHAMP (1) and GRACE (2) accelerometer measurements, the long-term database of densities derived from TLE data (3), the High Accuracy Satellite Drag Model (4) run by Air Force Space Command, calibrated using SSN data, and the NRLMSISE-00 (5) and Jacchia-Bowman 2008 (6) empirical models. In describing these data sets and models, specific attention is given to differences in the geo-metrical and aerodynamic satellite modelling, applied in the conversion from drag to density measurements, which are main sources of density biases. The differences in temporal and spa-tial resolution of the density data sources are also described and taken into account. With these aspects in mind, statistics of density comparisons have been computed, both as a function of solar and geomagnetic activity levels, and as a function of latitude and local solar time. These statistics give a detailed view of the relative accuracy of the different data sets and of the biases between them. The differences are analysed with the aim at providing rough error bars on the data and models and pinpointing issues which could receive attention in future iterations of data processing algorithms and in future model development.
Cholesterol oxidase: ultrahigh-resolution crystal structure and multipolar atom model-based analysis.

PubMed

Zarychta, Bartosz; Lyubimov, Artem; Ahmed, Maqsood; Munshi, Parthapratim; Guillot, Benoît; Vrielink, Alice; Jelsch, Christian

2015-04-01

Examination of protein structure at the subatomic level is required to improve the understanding of enzymatic function. For this purpose, X-ray diffraction data have been collected at 100 K from cholesterol oxidase crystals using synchrotron radiation to an optical resolution of 0.94 Å. After refinement using the spherical atom model, nonmodelled bonding peaks were detected in the Fourier residual electron density on some of the individual bonds. Well defined bond density was observed in the peptide plane after averaging maps on the residues with the lowest thermal motion. The multipolar electron density of the protein-cofactor complex was modelled by transfer of the ELMAM2 charge-density database, and the topology of the intermolecular interactions between the protein and the flavin adenine dinucleotide (FAD) cofactor was subsequently investigated. Taking advantage of the high resolution of the structure, the stereochemistry of main-chain bond lengths and of C=O···H-N hydrogen bonds was analyzed with respect to the different secondary-structure elements.
Altered auditory and vestibular functioning in individuals with low bone mineral density: a systematic review.

PubMed

Singh, Niraj Kumar; Jha, Raghav Hira; Gargeshwari, Aditi; Kumar, Prawin

2018-01-01

Alteration in the process of bone remodelling is associated with falls and fractures due to increased bone fragility and altered calcium functioning. The auditory system consists of skeletal structures and is, therefore, prone to getting affected by altered bone remodelling. In addition, the vestibule consists of huge volumes of calcium (CaCO3) in the form of otoconia crystals and alteration in functioning calcium levels could, therefore, result in vestibular symptoms. Thus, the present study aimed at compiling information from various studies on assessment of auditory or vestibular systems in individuals with reduced bone mineral density (BMD). A total of 1977 articles were searched using various databases and 19 full-length articles which reported auditory and vestibular outcomes in persons with low BMD were reviewed. An intricate relationship between altered BMD and audio-vestibular function was evident from the studies; nonetheless, how one aspect of hearing or balance affects the other is not clear. Significant effect of reduced bone mineral density could probably be due to the metabolic changes at the level of cochlea, secondary to alterations in BMD. One could also conclude that sympathetic remodelling is associated with vestibular problems in individual; however, whether vestibular problems lead to altered BMD cannot be ascertained with confidence. The studies reviewed in the article provide an evidence of possible involvement of hearing and vestibular system abnormalities in individuals with reduced bone mineral density. Hence, the assessment protocol for these individuals must include hearing and balance evaluation as mandatory for planning appropriate management.
A technique for routinely updating the ITU-R database using radio occultation electron density profiles

NASA Astrophysics Data System (ADS)

Brunini, Claudio; Azpilicueta, Francisco; Nava, Bruno

2013-09-01

Well credited and widely used ionospheric models, such as the International Reference Ionosphere or NeQuick, describe the variation of the electron density with height by means of a piecewise profile tied to the F2-peak parameters: the electron density,, and the height, . Accurate values of these parameters are crucial for retrieving reliable electron density estimations from those models. When direct measurements of these parameters are not available, the models compute the parameters using the so-called ITU-R database, which was established in the early 1960s. This paper presents a technique aimed at routinely updating the ITU-R database using radio occultation electron density profiles derived from GPS measurements gathered from low Earth orbit satellites. Before being used, these radio occultation profiles are validated by fitting to them an electron density model. A re-weighted Least Squares algorithm is used for down-weighting unreliable measurements (occasionally, entire profiles) and to retrieve and values—together with their error estimates—from the profiles. These values are used to monthly update the database, which consists of two sets of ITU-R-like coefficients that could easily be implemented in the IRI or NeQuick models. The technique was tested with radio occultation electron density profiles that are delivered to the community by the COSMIC/FORMOSAT-3 mission team. Tests were performed for solstices and equinoxes seasons in high and low-solar activity conditions. The global mean error of the resulting maps—estimated by the Least Squares technique—is between and elec/m for the F2-peak electron density (which is equivalent to 7 % of the value of the estimated parameter) and from 2.0 to 5.6 km for the height (2 %).
Evaluation of a vortex-based subgrid stress model using DNS databases

NASA Technical Reports Server (NTRS)

Misra, Ashish; Lund, Thomas S.

1996-01-01

The performance of a SubGrid Stress (SGS) model for Large-Eddy Simulation (LES) developed by Misra k Pullin (1996) is studied for forced and decaying isotropic turbulence on a 32(exp 3) grid. The physical viability of the model assumptions are tested using DNS databases. The results from LES of forced turbulence at Taylor Reynolds number R(sub (lambda)) approximately equals 90 are compared with filtered DNS fields. Probability density functions (pdfs) of the subgrid energy transfer, total dissipation, and the stretch of the subgrid vorticity by the resolved velocity-gradient tensor show reasonable agreement with the DNS data. The model is also tested in LES of decaying isotropic turbulence where it correctly predicts the decay rate and energy spectra measured by Comte-Bellot & Corrsin (1971).
Improvement of endothelial function by pitavastatin: a meta-analysis.

PubMed

Katsiki, Niki; Reiner, Željko; Tedeschi Reiner, Eugenia; Al-Rasadi, Khalid; Pirro, Matteo; Mikhailidis, Dimitri P; Sahebkar, Amirhossein

2018-02-01

Dyslipidemia is commonly associated with endothelial dysfunction and increased cardiovascular risk. Pitavastatin has been shown to reduce total and low-density lipoprotein cholesterol, to increase high-density lipoprotein (HDL)-cholesterol and improve HDL function. Furthermore, several trials explored its effects on flow-mediated dilation (FMD), as an index of endothelial function. The authors evaluated the effect of pitavastatin therapy on FMD. The authors performed a systematic review and meta-analysis of all clinical trials exploring the impact of pitavastatin on FMD. The search included PubMed-Medline, Scopus, ISI Web of Knowledge and Google Scholar databases. Quantitative data synthesis was performed using a random-effects model, with weighted mean difference (WMD) and 95% confidence interval (CI) as summary statistics. Six eligible studies comprising 7 treatment arms were selected for this meta-analysis. Overall, WMD was significant for the effect of pitavastatin on FMD (2.45%, 95% CI: 1.31, 3.60, p < 0.001) and the effect size was robust in the leave-one-out sensitivity analysis. This meta-analysis of all available clinical trials revealed a significant increase of FMD induced by pitavastatin.
Gene Expression Profiling Reveals Functional Specialization along the Intestinal Tract of a Carnivorous Teleostean Fish (Dicentrarchus labrax)

PubMed Central

Calduch-Giner, Josep A.; Sitjà-Bobadilla, Ariadna; Pérez-Sánchez, Jaume

2016-01-01

High-quality sequencing reads from the intestine of European sea bass were assembled, annotated by similarity against protein reference databases and combined with nucleotide sequences from public and private databases. After redundancy filtering, 24,906 non-redundant annotated sequences encoding 15,367 different gene descriptions were obtained. These annotated sequences were used to design a custom, high-density oligo-microarray (8 × 15 K) for the transcriptomic profiling of anterior (AI), middle (MI), and posterior (PI) intestinal segments. Similar molecular signatures were found for AI and MI segments, which were combined in a single group (AI-MI) whereas the PI outstood separately, with more than 1900 differentially expressed genes with a fold-change cutoff of 2. Functional analysis revealed that molecular and cellular functions related to feed digestion and nutrient absorption and transport were over-represented in AI-MI segments. By contrast, the initiation and establishment of immune defense mechanisms became especially relevant in PI, although the microarray expression profiling validated by qPCR indicated that these functional changes are gradual from anterior to posterior intestinal segments. This functional divergence occurred in association with spatial transcriptional changes in nutrient transporters and the mucosal chemosensing system via G protein-coupled receptors. These findings contribute to identify key indicators of gut functions and to compare different fish feeding strategies and immune defense mechanisms acquired along the evolution of teleosts. PMID:27610085
Gene Expression Profiling Reveals Functional Specialization along the Intestinal Tract of a Carnivorous Teleostean Fish (Dicentrarchus labrax).

PubMed

Calduch-Giner, Josep A; Sitjà-Bobadilla, Ariadna; Pérez-Sánchez, Jaume

2016-01-01

High-quality sequencing reads from the intestine of European sea bass were assembled, annotated by similarity against protein reference databases and combined with nucleotide sequences from public and private databases. After redundancy filtering, 24,906 non-redundant annotated sequences encoding 15,367 different gene descriptions were obtained. These annotated sequences were used to design a custom, high-density oligo-microarray (8 × 15 K) for the transcriptomic profiling of anterior (AI), middle (MI), and posterior (PI) intestinal segments. Similar molecular signatures were found for AI and MI segments, which were combined in a single group (AI-MI) whereas the PI outstood separately, with more than 1900 differentially expressed genes with a fold-change cutoff of 2. Functional analysis revealed that molecular and cellular functions related to feed digestion and nutrient absorption and transport were over-represented in AI-MI segments. By contrast, the initiation and establishment of immune defense mechanisms became especially relevant in PI, although the microarray expression profiling validated by qPCR indicated that these functional changes are gradual from anterior to posterior intestinal segments. This functional divergence occurred in association with spatial transcriptional changes in nutrient transporters and the mucosal chemosensing system via G protein-coupled receptors. These findings contribute to identify key indicators of gut functions and to compare different fish feeding strategies and immune defense mechanisms acquired along the evolution of teleosts.

Assessment of imputation methods using varying ecological information to fill the gaps in a tree functional trait database

NASA Astrophysics Data System (ADS)

Poyatos, Rafael; Sus, Oliver; Vilà-Cabrera, Albert; Vayreda, Jordi; Badiella, Llorenç; Mencuccini, Maurizio; Martínez-Vilalta, Jordi

2016-04-01

Plant functional traits are increasingly being used in ecosystem ecology thanks to the growing availability of large ecological databases. However, these databases usually contain a large fraction of missing data because measuring plant functional traits systematically is labour-intensive and because most databases are compilations of datasets with different sampling designs. As a result, within a given database, there is an inevitable variability in the number of traits available for each data entry and/or the species coverage in a given geographical area. The presence of missing data may severely bias trait-based analyses, such as the quantification of trait covariation or trait-environment relationships and may hamper efforts towards trait-based modelling of ecosystem biogeochemical cycles. Several data imputation (i.e. gap-filling) methods have been recently tested on compiled functional trait databases, but the performance of imputation methods applied to a functional trait database with a regular spatial sampling has not been thoroughly studied. Here, we assess the effects of data imputation on five tree functional traits (leaf biomass to sapwood area ratio, foliar nitrogen, maximum height, specific leaf area and wood density) in the Ecological and Forest Inventory of Catalonia, an extensive spatial database (covering 31900 km2). We tested the performance of species mean imputation, single imputation by the k-nearest neighbors algorithm (kNN) and a multiple imputation method, Multivariate Imputation with Chained Equations (MICE) at different levels of missing data (10%, 30%, 50%, and 80%). We also assessed the changes in imputation performance when additional predictors (species identity, climate, forest structure, spatial structure) were added in kNN and MICE imputations. We evaluated the imputed datasets using a battery of indexes describing departure from the complete dataset in trait distribution, in the mean prediction error, in the correlation matrix and in selected bivariate trait relationships. MICE yielded imputations which better preserved the variability and covariance structure of the data and provided an estimate of between-imputation uncertainty. We found that adding species identity as a predictor in MICE and kNN improved imputation for all traits, but adding climate did not lead to any appreciable improvement. However, forest structure and spatial structure did reduce imputation errors in maximum height and in leaf biomass to sapwood area ratios, respectively. Although species mean imputations showed the lowest error for 3 out the 5 studied traits, dataset-averaged errors were lowest for MICE imputations with all additional predictors, when missing data levels were 50% or lower. Species mean imputations always resulted in larger errors in the correlation matrix and appreciably altered the studied bivariate trait relationships. In conclusion, MICE imputations using species identity, climate, forest structure and spatial structure as predictors emerged as the most suitable method of the ones tested here, but it was also evident that imputation performance deteriorates at high levels of missing data (80%).
Structure and Stability of Molecular Crystals with Many-Body Dispersion-Inclusive Density Functional Tight Binding.

PubMed

Mortazavi, Majid; Brandenburg, Jan Gerit; Maurer, Reinhard J; Tkatchenko, Alexandre

2018-01-18

Accurate prediction of structure and stability of molecular crystals is crucial in materials science and requires reliable modeling of long-range dispersion interactions. Semiempirical electronic structure methods are computationally more efficient than their ab initio counterparts, allowing structure sampling with significant speedups. We combine the Tkatchenko-Scheffler van der Waals method (TS) and the many-body dispersion method (MBD) with third-order density functional tight-binding (DFTB3) via a charge population-based method. We find an overall good performance for the X23 benchmark database of molecular crystals, despite an underestimation of crystal volume that can be traced to the DFTB parametrization. We achieve accurate lattice energy predictions with DFT+MBD energetics on top of vdW-inclusive DFTB3 structures, resulting in a speedup of up to 3000 times compared with a full DFT treatment. This suggests that vdW-inclusive DFTB3 can serve as a viable structural prescreening tool in crystal structure prediction.
Deficient serum 25-hydroxyvitamin D is associated with an atherogenic lipid profile: The Very Large Database of Lipids (VLDL-3) study.

PubMed

Lupton, Joshua R; Faridi, Kamil F; Martin, Seth S; Sharma, Sristi; Kulkarni, Krishnaji; Jones, Steven R; Michos, Erin D

2016-01-01

Cross-sectional studies have found an association between deficiencies in serum vitamin D, as measured by 25-hydroxyvitamin D (25[OH]D), and an atherogenic lipid profile. These studies have focused on a limited panel of lipid values including low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol (HDL-C), and triglycerides (TG). Our study examines the relationship between serum 25(OH)D and an extended lipid panel (Vertical Auto Profile) while controlling for age, gender, glycemic status, and kidney function. We used the Very Large Database of Lipids, which includes US adults clinically referred for analysis of their lipid profile from 2009 to 2011. Our study focused on 20,360 subjects who had data for lipids, 25(OH)D, age, gender, hemoglobin A1c, insulin, creatinine, and blood urea nitrogen. Subjects were split into groups based on serum 25(OH)D: deficient (<20 ng/mL), intermediate (≥ 20-30 ng/mL), and optimal (≥ 30 ng/mL). The deficient group was compared to the optimal group using multivariable linear regression. In multivariable-adjusted linear regression, deficient serum 25(OH)D was associated with significantly lower serum HDL-C (-5.1%) and higher total cholesterol (+9.4%), non-HDL-C (+15.4%), directly measured LDL-C (+13.5%), intermediate-density lipoprotein cholesterol (+23.7%), very low-density lipoprotein cholesterol (+19.0%), remnant lipoprotein cholesterol (+18.4%), and TG (+26.4%) when compared with the optimal group. Deficient serum 25(OH)D is associated with significantly lower HDL-C and higher directly measured LDL-C, intermediate-density lipoprotein cholesterol, very low-density lipoproteins cholesterol, remnant lipoprotein cholesterol, and TG. Future trials examining vitamin D supplementation and cardiovascular disease risk should consider using changes in an extended lipid panel as an additional outcome measurement. Copyright © 2016 National Lipid Association. Published by Elsevier Inc. All rights reserved.
Benchmarking density functional tight binding models for barrier heights and reaction energetics of organic molecules.

PubMed

Gruden, Maja; Andjeklović, Ljubica; Jissy, Akkarapattiakal Kuriappan; Stepanović, Stepan; Zlatar, Matija; Cui, Qiang; Elstner, Marcus

2017-09-30

Density Functional Tight Binding (DFTB) models are two to three orders of magnitude faster than ab initio and Density Functional Theory (DFT) methods and therefore are particularly attractive in applications to large molecules and condensed phase systems. To establish the applicability of DFTB models to general chemical reactions, we conduct benchmark calculations for barrier heights and reaction energetics of organic molecules using existing databases and several new ones compiled in this study. Structures for the transition states and stable species have been fully optimized at the DFTB level, making it possible to characterize the reliability of DFTB models in a more thorough fashion compared to conducting single point energy calculations as done in previous benchmark studies. The encouraging results for the diverse sets of reactions studied here suggest that DFTB models, especially the most recent third-order version (DFTB3/3OB augmented with dispersion correction), in most cases provide satisfactory description of organic chemical reactions with accuracy almost comparable to popular DFT methods with large basis sets, although larger errors are also seen for certain cases. Therefore, DFTB models can be effective for mechanistic analysis (e.g., transition state search) of large (bio)molecules, especially when coupled with single point energy calculations at higher levels of theory. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Topology-Scaling Identification of Layered Solids and Stable Exfoliated 2D Materials.

PubMed

Ashton, Michael; Paul, Joshua; Sinnott, Susan B; Hennig, Richard G

2017-03-10

The Materials Project crystal structure database has been searched for materials possessing layered motifs in their crystal structures using a topology-scaling algorithm. The algorithm identifies and measures the sizes of bonded atomic clusters in a structure's unit cell, and determines their scaling with cell size. The search yielded 826 stable layered materials that are considered as candidates for the formation of two-dimensional monolayers via exfoliation. Density-functional theory was used to calculate the exfoliation energy of each material and 680 monolayers emerge with exfoliation energies below those of already-existent two-dimensional materials. The crystal structures of these two-dimensional materials provide templates for future theoretical searches of stable two-dimensional materials. The optimized structures and other calculated data for all 826 monolayers are provided at our database (https://materialsweb.org).
Characterizing fishing effort and spatial extent of coastal fisheries.

PubMed

Stewart, Kelly R; Lewison, Rebecca L; Dunn, Daniel C; Bjorkland, Rhema H; Kelez, Shaleyla; Halpin, Patrick N; Crowder, Larry B

2010-12-29

Biodiverse coastal zones are often areas of intense fishing pressure due to the high relative density of fishing capacity in these nearshore regions. Although overcapacity is one of the central challenges to fisheries sustainability in coastal zones, accurate estimates of fishing pressure in coastal zones are limited, hampering the assessment of the direct and collateral impacts (e.g., habitat degradation, bycatch) of fishing. We compiled a comprehensive database of fishing effort metrics and the corresponding spatial limits of fisheries and used a spatial analysis program (FEET) to map fishing effort density (measured as boat-meters per km²) in the coastal zones of six ocean regions. We also considered the utility of a number of socioeconomic variables as indicators of fishing pressure at the national level; fishing density increased as a function of population size and decreased as a function of coastline length. Our mapping exercise points to intra and interregional 'hotspots' of coastal fishing pressure. The significant and intuitive relationships we found between fishing density and population size and coastline length may help with coarse regional characterizations of fishing pressure. However, spatially-delimited fishing effort data are needed to accurately map fishing hotspots, i.e., areas of intense fishing activity. We suggest that estimates of fishing effort, not just target catch or yield, serve as a necessary measure of fishing activity, which is a key link to evaluating sustainability and environmental impacts of coastal fisheries.
Advances in Satellite Microwave Precipitation Retrieval Algorithms Over Land

NASA Astrophysics Data System (ADS)

Wang, N. Y.; You, Y.; Ferraro, R. R.

2015-12-01

Precipitation plays a key role in the earth's climate system, particularly in the aspect of its water and energy balance. Satellite microwave (MW) observations of precipitation provide a viable mean to achieve global measurement of precipitation with sufficient sampling density and accuracy. However, accurate precipitation information over land from satellite MW is a challenging problem. The Goddard Profiling Algorithm (GPROF) algorithm for the Global Precipitation Measurement (GPM) is built around the Bayesian formulation (Evans et al., 1995; Kummerow et al., 1996). GPROF uses the likelihood function and the prior probability distribution function to calculate the expected value of precipitation rate, given the observed brightness temperatures. It is particularly convenient to draw samples from a prior PDF from a predefined database of observations or models. GPROF algorithm does not search all database entries but only the subset thought to correspond to the actual observation. The GPM GPROF V1 database focuses on stratification by surface emissivity class, land surface temperature and total precipitable water. However, there is much uncertainty as to what is the optimal information needed to subset the database for different conditions. To this end, we conduct a database stratification study of using National Mosaic and Multi-Sensor Quantitative Precipitation Estimation, Special Sensor Microwave Imager/Sounder (SSMIS) and Advanced Technology Microwave Sounder (ATMS) and reanalysis data from Modern-Era Retrospective Analysis for Research and Applications (MERRA). Our database study (You et al., 2015) shows that environmental factors such as surface elevation, relative humidity, and storm vertical structure and height, and ice thickness can help in stratifying a single large database to smaller and more homogeneous subsets, in which the surface condition and precipitation vertical profiles are similar. It is found that the probability of detection (POD) increases about 8% and 12% by using stratified databases for rainfall and snowfall detection, respectively. In addition, by considering the relative humidity at lower troposphere and the vertical velocity at 700 hPa in the precipitation detection process, the POD for snowfall detection is further increased by 20.4% from 56.0% to 76.4%.
FTIR spectroscopy supported by statistical techniques for the structural characterization of plastic debris in the marine environment: Application to monitoring studies.

PubMed

Mecozzi, Mauro; Pietroletti, Marco; Monakhova, Yulia B

2016-05-15

We inserted 190 FTIR spectra of plastic samples in a digital database and submitted it to Independent Component Analysis (ICA) to extract the "pure" plastic polymers present. These identified plastics were polypropylene (PP), high density polyethylene (HDPE), low density polyethylene (LDPE), high density polyethylene terephthalate (HDPET), low density polyethylene terephthalate (LDPET), polystyrene (PS), Nylon (NL), polyethylene oxide (OPE), and Teflon (TEF) and they were used to establish the similarity with unknown plastics using the correlation coefficient (r), and the crosscorrelation function (CC). For samples with r<0.8 we determined the Mahalanobis Distance (MD) as additional tool of identification. For instance, for the four plastic fragments found in the Carretta carretta, one plastic sample was assigned to OPE due to its r=0.87; for all the other three plastic samples, due to the r values ranging between 0.83 and0.70, the support of MD suggested LDPET and OPE as co-polymer constituents. Copyright © 2016 Elsevier Ltd. All rights reserved.
B97-3c: A revised low-cost variant of the B97-D density functional method

NASA Astrophysics Data System (ADS)

Brandenburg, Jan Gerit; Bannwarth, Christoph; Hansen, Andreas; Grimme, Stefan

2018-02-01

A revised version of the well-established B97-D density functional approximation with general applicability for chemical properties of large systems is proposed. Like B97-D, it is based on Becke's power-series ansatz from 1997 and is explicitly parametrized by including the standard D3 semi-classical dispersion correction. The orbitals are expanded in a modified valence triple-zeta Gaussian basis set, which is available for all elements up to Rn. Remaining basis set errors are mostly absorbed in the modified B97 parametrization, while an established atom-pairwise short-range potential is applied to correct for the systematically too long bonds of main group elements which are typical for most semi-local density functionals. The new composite scheme (termed B97-3c) completes the hierarchy of "low-cost" electronic structure methods, which are all mainly free of basis set superposition error and account for most interactions in a physically sound and asymptotically correct manner. B97-3c yields excellent molecular and condensed phase geometries, similar to most hybrid functionals evaluated in a larger basis set expansion. Results on the comprehensive GMTKN55 energy database demonstrate its good performance for main group thermochemistry, kinetics, and non-covalent interactions, when compared to functionals of the same class. This also transfers to metal-organic reactions, which is a major area of applicability for semi-local functionals. B97-3c can be routinely applied to hundreds of atoms on a single processor and we suggest it as a robust computational tool, in particular, for more strongly correlated systems where our previously published "3c" schemes might be problematic.
Prediction of Liquefaction Potential of Dredge Fill Sand by DCP and Dynamic Probing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Alam, Md. Jahangir; Azad, Abul Kalam; Rahman, Ziaur

2008-07-08

From many research it is proved that liquefaction potential of sand is function of mainly relative density and confining pressure. During routine site investigations, high-quality sampling and laboratory testing of sands are not feasible because of inevitable sample disturbance effects and budgetary constraints. On the other hand quality control of sand fill can be done by determining in situ density of sand in layer by layer which is expensive and time consuming. In this paper TRL DCP (Transportation Research Laboratory Dynamic Cone Penetration) and DPL (Dynamic Probing Light) are calibrated to predict the relative density of sand deposit. For thismore » purpose sand of known relative density is prepared in a calibration chamber which is a mild steel cylinder with diameter 0.5 m and height 1.0 m. Relative density of sand is varied by controlling height of fall and diameter of hole of sand discharge bowl. After filling, every time DPL and DCP tests are performed and for every blow the penetration of cone is recorded. N10 is then calculated from penetration records. Thus a database is compiled where N10 and relative densities are known. A correlation is made between N{sub 10} and relative density for two types of sand. A good correlation of N{sub 10} and relative density is found.« less
Aging and functional brain networks

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tomasi D.; Tomasi, D.; Volkow, N.D.

2011-07-11

Aging is associated with changes in human brain anatomy and function and cognitive decline. Recent studies suggest the aging decline of major functional connectivity hubs in the 'default-mode' network (DMN). Aging effects on other networks, however, are largely unknown. We hypothesized that aging would be associated with a decline of short- and long-range functional connectivity density (FCD) hubs in the DMN. To test this hypothesis, we evaluated resting-state data sets corresponding to 913 healthy subjects from a public magnetic resonance imaging database using functional connectivity density mapping (FCDM), a voxelwise and data-driven approach, together with parallel computing. Aging was associatedmore » with pronounced long-range FCD decreases in DMN and dorsal attention network (DAN) and with increases in somatosensory and subcortical networks. Aging effects in these networks were stronger for long-range than for short-range FCD and were also detected at the level of the main functional hubs. Females had higher short- and long-range FCD in DMN and lower FCD in the somatosensory network than males, but the gender by age interaction effects were not significant for any of the networks or hubs. These findings suggest that long-range connections may be more vulnerable to aging effects than short-range connections and that, in addition to the DMN, the DAN is also sensitive to aging effects, which could underlie the deterioration of attention processes that occurs with aging.« less
Quadratic integrand double-hybrid made spin-component-scaled

DOE Office of Scientific and Technical Information (OSTI.GOV)

Brémond, Éric, E-mail: eric.bremond@iit.it; Savarese, Marika; Sancho-García, Juan C.

2016-03-28

We propose two analytical expressions aiming to rationalize the spin-component-scaled (SCS) and spin-opposite-scaled (SOS) schemes for double-hybrid exchange-correlation density-functionals. Their performances are extensively tested within the framework of the nonempirical quadratic integrand double-hybrid (QIDH) model on energetic properties included into the very large GMTKN30 benchmark database, and on structural properties of semirigid medium-sized organic compounds. The SOS variant is revealed as a less computationally demanding alternative to reach the accuracy of the original QIDH model without losing any theoretical background.
Comparison of different models for predicting soil bulk density. Case study - Slovakian agricultural soils

NASA Astrophysics Data System (ADS)

Makovníková, Jarmila; Širáň, Miloš; Houšková, Beata; Pálka, Boris; Jones, Arwyn

2017-10-01

Soil bulk density is one of the main direct indicators of soil health, and is an important aspect of models for determining agroecosystem services potential. By way of applying multi-regression methods, we have created a distributed prediction of soil bulk density used subsequently for topsoil carbon stock estimation. The soil data used for this study were from the Slovakian partial monitoring system-soil database. In our work, two models of soil bulk density in an equilibrium state, with different combinations of input parameters (soil particle size distribution and soil organic carbon content in %), have been created, and subsequently validated using a data set from 15 principal sampling sites of Slovakian partial monitoring system-soil, that were different from those used to generate the bulk density equations. We have made a comparison of measured bulk density data and data calculated by the pedotransfer equations against soil bulk density calculated according to equations recommended by Joint Research Centre Sustainable Resources for Europe. The differences between measured soil bulk density and the model values vary from -0.144 to 0.135 g cm-3 in the verification data set. Furthermore, all models based on pedotransfer functions give moderately lower values. The soil bulk density model was then applied to generate a first approximation of soil bulk density map for Slovakia using texture information from 17 523 sampling sites, and was subsequently utilised for topsoil organic carbon estimation.
SBMDb: first whole genome putative microsatellite DNA marker database of sugarbeet for bioenergy and industrial applications

PubMed Central

Iquebal, Mir Asif; Jaiswal, Sarika; Angadi, U.B.; Sablok, Gaurav; Arora, Vasu; Kumar, Sunil; Rai, Anil; Kumar, Dinesh

2015-01-01

DNA marker plays important role as valuable tools to increase crop productivity by finding plausible answers to genetic variations and linking the Quantitative Trait Loci (QTL) of beneficial trait. Prior approaches in development of Short Tandem Repeats (STR) markers were time consuming and inefficient. Recent methods invoking the development of STR markers using whole genomic or transcriptomics data has gained wide importance with immense potential in developing breeding and cultivator improvement approaches. Availability of whole genome sequences and in silico approaches has revolutionized bulk marker discovery. We report world’s first sugarbeet whole genome marker discovery having 145 K markers along with 5 K functional domain markers unified in common platform using MySQL, Apache and PHP in SBMDb. Embedded markers and corresponding location information can be selected for desired chromosome, location/interval and primers can be generated using Primer3 core, integrated at backend. Our analyses revealed abundance of ‘mono’ repeat (76.82%) over ‘di’ repeats (13.68%). Highest density (671.05 markers/Mb) was found in chromosome 1 and lowest density (341.27 markers/Mb) in chromosome 6. Current investigation of sugarbeet genome marker density has direct implications in increasing mapping marker density. This will enable present linkage map having marker distance of ∼2 cM, i.e. from 200 to 2.6 Kb, thus facilitating QTL/gene mapping. We also report e-PCR-based detection of 2027 polymorphic markers in panel of five genotypes. These markers can be used for DUS test of variety identification and MAS/GAS in variety improvement program. The present database presents wide source of potential markers for developing and implementing new approaches for molecular breeding required to accelerate industrious use of this crop, especially for sugar, health care products, medicines and color dye. Identified markers will also help in improvement of bioenergy trait of bioethanol and biogas production along with reaping advantage of crop efficiency in terms of low water and carbon footprint especially in era of climate change. Database URL: http://webapp.cabgrid.res.in/sbmdb/ PMID:26647370
SBMDb: first whole genome putative microsatellite DNA marker database of sugarbeet for bioenergy and industrial applications.

PubMed

Iquebal, Mir Asif; Jaiswal, Sarika; Angadi, U B; Sablok, Gaurav; Arora, Vasu; Kumar, Sunil; Rai, Anil; Kumar, Dinesh

2015-01-01

DNA marker plays important role as valuable tools to increase crop productivity by finding plausible answers to genetic variations and linking the Quantitative Trait Loci (QTL) of beneficial trait. Prior approaches in development of Short Tandem Repeats (STR) markers were time consuming and inefficient. Recent methods invoking the development of STR markers using whole genomic or transcriptomics data has gained wide importance with immense potential in developing breeding and cultivator improvement approaches. Availability of whole genome sequences and in silico approaches has revolutionized bulk marker discovery. We report world's first sugarbeet whole genome marker discovery having 145 K markers along with 5 K functional domain markers unified in common platform using MySQL, Apache and PHP in SBMDb. Embedded markers and corresponding location information can be selected for desired chromosome, location/interval and primers can be generated using Primer3 core, integrated at backend. Our analyses revealed abundance of 'mono' repeat (76.82%) over 'di' repeats (13.68%). Highest density (671.05 markers/Mb) was found in chromosome 1 and lowest density (341.27 markers/Mb) in chromosome 6. Current investigation of sugarbeet genome marker density has direct implications in increasing mapping marker density. This will enable present linkage map having marker distance of ∼2 cM, i.e. from 200 to 2.6 Kb, thus facilitating QTL/gene mapping. We also report e-PCR-based detection of 2027 polymorphic markers in panel of five genotypes. These markers can be used for DUS test of variety identification and MAS/GAS in variety improvement program. The present database presents wide source of potential markers for developing and implementing new approaches for molecular breeding required to accelerate industrious use of this crop, especially for sugar, health care products, medicines and color dye. Identified markers will also help in improvement of bioenergy trait of bioethanol and biogas production along with reaping advantage of crop efficiency in terms of low water and carbon footprint especially in era of climate change. Database URL: http://webapp.cabgrid.res.in/sbmdb/. © The Author(s) 2015. Published by Oxford University Press.
An Array Library for Microsoft SQL Server with Astrophysical Applications

NASA Astrophysics Data System (ADS)

Dobos, L.; Szalay, A. S.; Blakeley, J.; Falck, B.; Budavári, T.; Csabai, I.

2012-09-01

Today's scientific simulations produce output on the 10-100 TB scale. This unprecedented amount of data requires data handling techniques that are beyond what is used for ordinary files. Relational database systems have been successfully used to store and process scientific data, but the new requirements constantly generate new challenges. Moving terabytes of data among servers on a timely basis is a tough problem, even with the newest high-throughput networks. Thus, moving the computations as close to the data as possible and minimizing the client-server overhead are absolutely necessary. At least data subsetting and preprocessing have to be done inside the server process. Out of the box commercial database systems perform very well in scientific applications from the prospective of data storage optimization, data retrieval, and memory management but lack basic functionality like handling scientific data structures or enabling advanced math inside the database server. The most important gap in Microsoft SQL Server is the lack of a native array data type. Fortunately, the technology exists to extend the database server with custom-written code that enables us to address these problems. We present the prototype of a custom-built extension to Microsoft SQL Server that adds array handling functionality to the database system. With our Array Library, fix-sized arrays of all basic numeric data types can be created and manipulated efficiently. Also, the library is designed to be able to be seamlessly integrated with the most common math libraries, such as BLAS, LAPACK, FFTW, etc. With the help of these libraries, complex operations, such as matrix inversions or Fourier transformations, can be done on-the-fly, from SQL code, inside the database server process. We are currently testing the prototype with two different scientific data sets: The Indra cosmological simulation will use it to store particle and density data from N-body simulations, and the Milky Way Laboratory project will use it to store galaxy simulation data.
Using the structure-function linkage database to characterize functional domains in enzymes.

PubMed

Brown, Shoshana; Babbitt, Patricia

2014-12-12

The Structure-Function Linkage Database (SFLD; http://sfld.rbvi.ucsf.edu/) is a Web-accessible database designed to link enzyme sequence, structure, and functional information. This unit describes the protocols by which a user may query the database to predict the function of uncharacterized enzymes and to correct misannotated functional assignments. The information in this unit is especially useful in helping a user discriminate functional capabilities of a sequence that is only distantly related to characterized sequences in publicly available databases. Copyright © 2014 John Wiley & Sons, Inc.
MGIS: managing banana (Musa spp.) genetic resources information and high-throughput genotyping data

PubMed Central

Guignon, V.; Sempere, G.; Sardos, J.; Hueber, Y.; Duvergey, H.; Andrieu, A.; Chase, R.; Jenny, C.; Hazekamp, T.; Irish, B.; Jelali, K.; Adeka, J.; Ayala-Silva, T.; Chao, C.P.; Daniells, J.; Dowiya, B.; Effa effa, B.; Gueco, L.; Herradura, L.; Ibobondji, L.; Kempenaers, E.; Kilangi, J.; Muhangi, S.; Ngo Xuan, P.; Paofa, J.; Pavis, C.; Thiemele, D.; Tossou, C.; Sandoval, J.; Sutanto, A.; Vangu Paka, G.; Yi, G.; Van den houwe, I.; Roux, N.

2017-01-01

Abstract Unraveling the genetic diversity held in genebanks on a large scale is underway, due to advances in Next-generation sequence (NGS) based technologies that produce high-density genetic markers for a large number of samples at low cost. Genebank users should be in a position to identify and select germplasm from the global genepool based on a combination of passport, genotypic and phenotypic data. To facilitate this, a new generation of information systems is being designed to efficiently handle data and link it with other external resources such as genome or breeding databases. The Musa Germplasm Information System (MGIS), the database for global ex situ-held banana genetic resources, has been developed to address those needs in a user-friendly way. In developing MGIS, we selected a generic database schema (Chado), the robust content management system Drupal for the user interface, and Tripal, a set of Drupal modules which links the Chado schema to Drupal. MGIS allows germplasm collection examination, accession browsing, advanced search functions, and germplasm orders. Additionally, we developed unique graphical interfaces to compare accessions and to explore them based on their taxonomic information. Accession-based data has been enriched with publications, genotyping studies and associated genotyping datasets reporting on germplasm use. Finally, an interoperability layer has been implemented to facilitate the link with complementary databases like the Banana Genome Hub and the MusaBase breeding database. Database URL: https://www.crop-diversity.org/mgis/ PMID:29220435
Predicting the thermodynamic stability of double-perovskite halides from density functional theory

DOE PAGES

Han, Dan; Zhang, Tao; Huang, Menglin; ...

2018-05-24

Recently, a series of double-perovskite halide compounds such as Cs 2AgBiCl 6 and Cs 2AgBiBr 6 have attracted intensive interest as promising alternatives to the solar absorber material CH 3NH 3PbI 3 because they are Pb-free and may exhibit enhanced stability. The thermodynamic stability of a number of double-perovskite halides has been predicted based on density functional theory (DFT) calculations of compound formation energies. In this paper, we found that the stability prediction can be dependent on the approximations used for the exchange-correlation functionals, e.g., the DFT calculations using the widely used Perdew, Burke, Ernzerhof (PBE) functional predict that Csmore » 2AgBiBr 6 is thermodynamically unstable against phase-separation into the competing phases such as AgBr, Cs 2AgBr 3, Cs 3Bi 2Br 9, etc., obviously inconsistent with the good stability observed experimentally. The incorrect prediction by the PBE calculation results from its failure to predict the correct ground-state structures of AgBr, AgCl, and CsCl. By contrast, the DFT calculations based on local density approximation, optB86b-vdW, and optB88-vdW functionals predict the ground-state structures of these binary halides correctly. Furthermore, the optB88-vdW functional is found to give the most accurate description of the lattice constants of the double-perovskite halides and their competing phases. Given these two aspects, we suggest that the optB88-vdW functional should be used for predicting thermodynamic stability in the future high-throughput computational material design or the construction of the Materials Genome database for new double-perovskite halides. As a result, using different exchange-correlation functionals has little influence on the dispersion of the conduction and the valence bands near the electronic bandgap; however, the calculated bandgap can be affected indirectly by the optimized lattice constant, which varies for different functionals.« less
Predicting the thermodynamic stability of double-perovskite halides from density functional theory

DOE Office of Scientific and Technical Information (OSTI.GOV)

Han, Dan; Zhang, Tao; Huang, Menglin

Recently, a series of double-perovskite halide compounds such as Cs 2AgBiCl 6 and Cs 2AgBiBr 6 have attracted intensive interest as promising alternatives to the solar absorber material CH 3NH 3PbI 3 because they are Pb-free and may exhibit enhanced stability. The thermodynamic stability of a number of double-perovskite halides has been predicted based on density functional theory (DFT) calculations of compound formation energies. In this paper, we found that the stability prediction can be dependent on the approximations used for the exchange-correlation functionals, e.g., the DFT calculations using the widely used Perdew, Burke, Ernzerhof (PBE) functional predict that Csmore » 2AgBiBr 6 is thermodynamically unstable against phase-separation into the competing phases such as AgBr, Cs 2AgBr 3, Cs 3Bi 2Br 9, etc., obviously inconsistent with the good stability observed experimentally. The incorrect prediction by the PBE calculation results from its failure to predict the correct ground-state structures of AgBr, AgCl, and CsCl. By contrast, the DFT calculations based on local density approximation, optB86b-vdW, and optB88-vdW functionals predict the ground-state structures of these binary halides correctly. Furthermore, the optB88-vdW functional is found to give the most accurate description of the lattice constants of the double-perovskite halides and their competing phases. Given these two aspects, we suggest that the optB88-vdW functional should be used for predicting thermodynamic stability in the future high-throughput computational material design or the construction of the Materials Genome database for new double-perovskite halides. As a result, using different exchange-correlation functionals has little influence on the dispersion of the conduction and the valence bands near the electronic bandgap; however, the calculated bandgap can be affected indirectly by the optimized lattice constant, which varies for different functionals.« less

Adsorption structures and energetics of molecules on metal surfaces: Bridging experiment and theory

NASA Astrophysics Data System (ADS)

Maurer, Reinhard J.; Ruiz, Victor G.; Camarillo-Cisneros, Javier; Liu, Wei; Ferri, Nicola; Reuter, Karsten; Tkatchenko, Alexandre

2016-05-01

Adsorption geometry and stability of organic molecules on surfaces are key parameters that determine the observable properties and functions of hybrid inorganic/organic systems (HIOSs). Despite many recent advances in precise experimental characterization and improvements in first-principles electronic structure methods, reliable databases of structures and energetics for large adsorbed molecules are largely amiss. In this review, we present such a database for a range of molecules adsorbed on metal single-crystal surfaces. The systems we analyze include noble-gas atoms, conjugated aromatic molecules, carbon nanostructures, and heteroaromatic compounds adsorbed on five different metal surfaces. The overall objective is to establish a diverse benchmark dataset that enables an assessment of current and future electronic structure methods, and motivates further experimental studies that provide ever more reliable data. Specifically, the benchmark structures and energetics from experiment are here compared with the recently developed van der Waals (vdW) inclusive density-functional theory (DFT) method, DFT + vdWsurf. In comparison to 23 adsorption heights and 17 adsorption energies from experiment we find a mean average deviation of 0.06 Å and 0.16 eV, respectively. This confirms the DFT + vdWsurf method as an accurate and efficient approach to treat HIOSs. A detailed discussion identifies remaining challenges to be addressed in future development of electronic structure methods, for which the here presented benchmark database may serve as an important reference.
Tomato functional genomics database (TFGD): a comprehensive collection and analysis package for tomato functional genomics

USDA-ARS?s Scientific Manuscript database

Tomato Functional Genomics Database (TFGD; http://ted.bti.cornell.edu) provides a comprehensive systems biology resource to store, mine, analyze, visualize and integrate large-scale tomato functional genomics datasets. The database is expanded from the previously described Tomato Expression Database...
Application of artificial neural networks for conformity analysis of fuel performed with an optical fiber sensor

NASA Astrophysics Data System (ADS)

Possetti, Gustavo Rafael Collere; Coradin, Francelli Klemba; Côcco, Lílian Cristina; Yamamoto, Carlos Itsuo; de Arruda, Lucia Valéria Ramos; Falate, Rosane; Muller, Marcia; Fabris, José Luís

2008-04-01

The liquid fuel quality control is an important issue that brings benefits for the State, for the consumers and for the environment. The conformity analysis, in special for gasoline, demands a rigorous sampling technique among gas stations and other economic agencies, followed by a series of standard physicochemical tests. Such procedures are commonly expensive and time demanding and, moreover, a specialist is often required to carry out the tasks. Such drawbacks make the development of alternative analysis tools an important research field. The fuel refractive index is an additional parameter to help the fuel conformity analysis, besides the prospective optical fiber sensors, which operate like transducers with singular properties. When this parameter is correlated with the sample density, it becomes possible to determine conformity zones that cannot be analytically defined. This work presents an application of artificial neural networks based on Radial Basis Function to determine these zones. A set of 45 gasoline samples, collected in several gas stations and previously analyzed according to the rules of Agência Nacional do Petróleo, Gás Natural e Biocombustíveis, a Brazilian regulatory agency, constituted the database to build two neural networks. The input variables of first network are the samples refractive indices, measured with an Abbe refractometer, and the density of the samples measured with a digital densimeter. For the second network the input variables included, besides the samples densities, the wavelength response of a long-period grating to the samples refractive indices. The used grating was written in an optical fiber using the point-to-point technique by submitting the fiber to consecutive electrical arcs from a splice machine. The output variables of both Radial Basis Function Networks are represented by the conformity status of each sample, according to report of tests carried out following the American Society for Testing and Materials and/or Brazilian Association of Technical Rules standards. A subset of 35 samples, randomly chosen from the database, was used to design and calibrate (train) both networks. The two networks topologies (numbers of Radial Basis Function neurons of the hidden layer and function radius) were built in order to minimize the root mean square error. The subset composed by the other 10 samples was used to validate the final networks architectures. The obtained results have demonstrated that both networks reach a good predictive capability.
Neighborhood alcohol outlet density and genetic influences on alcohol use: evidence for gene-environment interaction.

PubMed

Slutske, Wendy S; Deutsch, Arielle R; Piasecki, Thomas M

2018-05-07

Genetic influences on alcohol involvement are likely to vary as a function of the 'alcohol environment,' given that exposure to alcohol is a necessary precondition for genetic risk to be expressed. However, few gene-environment interaction studies of alcohol involvement have focused on characteristics of the community-level alcohol environment. The goal of this study was to examine whether living in a community with more alcohol outlets would facilitate the expression of the genetic propensity to drink in a genetically-informed national survey of United States young adults. The participants were 2434 18-26-year-old twin, full-, and half-sibling pairs from Wave III of the National Longitudinal Study of Adolescent to Adult Health. Participants completed in-home interviews in which alcohol use was assessed. Alcohol outlet densities were extracted from state-level liquor license databases aggregated at the census tract level to derive the density of outlets. There was evidence that the estimates of genetic and environmental influences on alcohol use varied as a function of the density of alcohol outlets in the community. For example, the heritability of the frequency of alcohol use for those residing in a neighborhood with ten or more outlets was 74% (95% confidence limits = 55-94%), compared with 16% (95% confidence limits = 0-34%) for those in a neighborhood with zero outlets. This moderating effect of alcohol outlet density was not explained by the state of residence, population density, or neighborhood sociodemographic characteristics. The results suggest that living in a neighborhood with many alcohol outlets may be especially high-risk for those individuals who are genetically predisposed to frequently drink.
Computational screening of high-performance optoelectronic materials using OptB88vdW and TB-mBJ formalisms.

PubMed

Choudhary, Kamal; Zhang, Qin; Reid, Andrew C E; Chowdhury, Sugata; Van Nguyen, Nhan; Trautt, Zachary; Newrock, Marcus W; Congo, Faical Yannick; Tavazza, Francesca

2018-05-08

We perform high-throughput density functional theory (DFT) calculations for optoelectronic properties (electronic bandgap and frequency dependent dielectric function) using the OptB88vdW functional (OPT) and the Tran-Blaha modified Becke Johnson potential (MBJ). This data is distributed publicly through JARVIS-DFT database. We used this data to evaluate the differences between these two formalisms and quantify their accuracy, comparing to experimental data whenever applicable. At present, we have 17,805 OPT and 7,358 MBJ bandgaps and dielectric functions. MBJ is found to predict better bandgaps and dielectric functions than OPT, so it can be used to improve the well-known bandgap problem of DFT in a relatively inexpensive way. The peak positions in dielectric functions obtained with OPT and MBJ are in comparable agreement with experiments. The data is available on our websites http://www.ctcms.nist.gov/~knc6/JVASP.html and https://jarvis.nist.gov.
Matrix- and tensor-based recommender systems for the discovery of currently unknown inorganic compounds

NASA Astrophysics Data System (ADS)

Seko, Atsuto; Hayashi, Hiroyuki; Kashima, Hisashi; Tanaka, Isao

2018-01-01

Chemically relevant compositions (CRCs) and atomic arrangements of inorganic compounds have been collected as inorganic crystal structure databases. Machine learning is a unique approach to search for currently unknown CRCs from vast candidates. Herein we propose matrix- and tensor-based recommender system approaches to predict currently unknown CRCs from database entries of CRCs. Firstly, the performance of the recommender system approaches to discover currently unknown CRCs is examined. A Tucker decomposition recommender system shows the best discovery rate of CRCs as the majority of the top 100 recommended ternary and quaternary compositions correspond to CRCs. Secondly, systematic density functional theory (DFT) calculations are performed to investigate the phase stability of the recommended compositions. The phase stability of the 27 compositions reveals that 23 currently unknown compounds are newly found to be stable. These results indicate that the recommender system has great potential to accelerate the discovery of new compounds.
In silico design and screening of hypothetical MOF-74 analogs and their experimental synthesis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Witman, Matthew; Ling, Sanliang; Anderson, Samantha

Here, we present the in silico design of metal-organic frameworks (MOFs) exhibiting 1-dimensional rod topologies. We then introduce an algorithm for construction of this family of MOF topologies, and illustrate its application for enumerating MOF-74-type analogs. Furthermore, we perform a broad search for new linkers that satisfy the topological requirements of MOF-74 and consider the largest database of known chemical space for organic compounds, the PubChem database. Our in silico crystal assembly, when combined with dispersion-corrected density functional theory (DFT) calculations, is demonstrated to generate a hypothetical library of open-metal site containing MOF-74 analogs in the 1-D rod topology frommore » which we can simulate the adsorption behavior of CO 2 . We conclude that these hypothetical structures have synthesizable potential through computational identification and experimental validation of a novel MOF-74 analog, Mg 2 (olsalazine).« less
In silico design and screening of hypothetical MOF-74 analogs and their experimental synthesis

DOE PAGES

Witman, Matthew; Ling, Sanliang; Anderson, Samantha; ...

2016-06-21

Here, we present the in silico design of metal-organic frameworks (MOFs) exhibiting 1-dimensional rod topologies. We then introduce an algorithm for construction of this family of MOF topologies, and illustrate its application for enumerating MOF-74-type analogs. Furthermore, we perform a broad search for new linkers that satisfy the topological requirements of MOF-74 and consider the largest database of known chemical space for organic compounds, the PubChem database. Our in silico crystal assembly, when combined with dispersion-corrected density functional theory (DFT) calculations, is demonstrated to generate a hypothetical library of open-metal site containing MOF-74 analogs in the 1-D rod topology frommore » which we can simulate the adsorption behavior of CO 2 . We conclude that these hypothetical structures have synthesizable potential through computational identification and experimental validation of a novel MOF-74 analog, Mg 2 (olsalazine).« less
Mouse Phenome Database

PubMed Central

Grubb, Stephen C.; Bult, Carol J.; Bogue, Molly A.

2014-01-01

The Mouse Phenome Database (MPD; phenome.jax.org) was launched in 2001 as the data coordination center for the international Mouse Phenome Project. MPD integrates quantitative phenotype, gene expression and genotype data into a common annotated framework to facilitate query and analysis. MPD contains >3500 phenotype measurements or traits relevant to human health, including cancer, aging, cardiovascular disorders, obesity, infectious disease susceptibility, blood disorders, neurosensory disorders, drug addiction and toxicity. Since our 2012 NAR report, we have added >70 new data sets, including data from Collaborative Cross lines and Diversity Outbred mice. During this time we have completely revamped our homepage, improved search and navigational aspects of the MPD application, developed several web-enabled data analysis and visualization tools, annotated phenotype data to public ontologies, developed an ontology browser and released new single nucleotide polymorphism query functionality with much higher density coverage than before. Here, we summarize recent data acquisitions and describe our latest improvements. PMID:24243846
Automatic Sleep Stage Determination by Multi-Valued Decision Making Based on Conditional Probability with Optimal Parameters

NASA Astrophysics Data System (ADS)

Wang, Bei; Sugi, Takenao; Wang, Xingyu; Nakamura, Masatoshi

Data for human sleep study may be affected by internal and external influences. The recorded sleep data contains complex and stochastic factors, which increase the difficulties for the computerized sleep stage determination techniques to be applied for clinical practice. The aim of this study is to develop an automatic sleep stage determination system which is optimized for variable sleep data. The main methodology includes two modules: expert knowledge database construction and automatic sleep stage determination. Visual inspection by a qualified clinician is utilized to obtain the probability density function of parameters during the learning process of expert knowledge database construction. Parameter selection is introduced in order to make the algorithm flexible. Automatic sleep stage determination is manipulated based on conditional probability. The result showed close agreement comparing with the visual inspection by clinician. The developed system can meet the customized requirements in hospitals and institutions.
Simulated Aging of Spacecraft External Materials on Orbit

NASA Astrophysics Data System (ADS)

Khatipov, S.

Moscow State Engineering Physics Institute (MIFI), in cooperation with Air Force Research Laboratory's Satellite Assessment Center (SatAC), the European Office of Aerospace Research and Development (EOARD), and the International Science and Technology Center (ISTC), has developed a database describing the changes in optical properties of materials used on the external surfaces of spacecraft due to space environmental factors. The database includes data acquired from tests completed under contract with the ISTC and EOARD, as well as from previous Russian materials studies conducted within the last 30 years. The space environmental factors studied are for those found in Low Earth Orbits (LEO) and Geosynchronous orbits (GEO), including electron irradiation at 50, 100, and 200 keV, proton irradiation at 50, 150, 300, and 500 keV, and ultraviolet irradiation equivalent to 1 sun-year. The material characteristics investigated were solar absorption (aS), spectral reflectance (rl), solar reflectance (rS), emissivity (e), spectral transmission coefficient (Tl), solar transmittance (TS), optical density (D), relative optical density (D/x), Bi-directional Reflectance Distribution Function (BRDF), and change of appearance and color in the visible wavelengths. The materials tested in the project were thermal control coatings (paints), multilayer insulation (films), and solar cells. The ability to predict changes in optical properties of spacecraft materials is important to increase the fidelity of space observation tools, better understand observation of space objects, and increase the longevity of spacecraft. The end goal of our project is to build semi-empirical mathematical models to predict the long-term effects of space aging as a function of time and orbit.
3D pharmacophore-based virtual screening, docking and density functional theory approach towards the discovery of novel human epidermal growth factor receptor-2 (HER2) inhibitors.

PubMed

Gogoi, Dhrubajyoti; Baruah, Vishwa Jyoti; Chaliha, Amrita Kashyap; Kakoti, Bibhuti Bhushan; Sarma, Diganta; Buragohain, Alak Kumar

2016-12-21

Human epidermal growth factor receptor 2 (HER2) is one of the four members of the epidermal growth factor receptor (EGFR) family and is expressed to facilitate cellular proliferation across various tissue types. Therapies targeting HER2, which is a transmembrane glycoprotein with tyrosine kinase activity, offer promising prospects especially in breast and gastric/gastroesophageal cancer patients. Persistence of both primary and acquired resistance to various routine drugs/antibodies is a disappointing outcome in the treatment of many HER2 positive cancer patients and is a challenge that requires formulation of new and improved strategies to overcome the same. Identification of novel HER2 inhibitors with improved therapeutics index was performed with a highly correlating (r=0.975) ligand-based pharmacophore model (Hypo1) in this study. Hypo1 was generated from a training set of 22 compounds with HER2 inhibitory activity and this well-validated hypothesis was subsequently used as a 3D query to screen compounds in a total of four databases of which two were natural product databases. Further, these compounds were analyzed for compliance with Veber's drug-likeness rule and optimum ADMET parameters. The selected compounds were then subjected to molecular docking and Density Functional Theory (DFT) analysis to discern their molecular interactions at the active site of HER2. The findings thus presented would be an important starting point towards the development of novel HER2 inhibitors using well-validated computational techniques. Copyright © 2016 Elsevier Ltd. All rights reserved.
From atomic structure to excess entropy: a neutron diffraction and density functional theory study of CaO-Al2O3-SiO2 melts

NASA Astrophysics Data System (ADS)

Liu, Maoyuan; Jacob, Aurélie; Schmetterer, Clemens; Masset, Patrick J.; Hennet, Louis; Fischer, Henry E.; Kozaily, Jad; Jahn, Sandro; Gray-Weale, Angus

2016-04-01

Calcium aluminosilicate \\text{CaO}-\\text{A}{{\\text{l}}2}{{\\text{O}}3}-\\text{Si}{{\\text{O}}2} (CAS) melts with compositions {{≤ft(\\text{CaO}-\\text{Si}{{\\text{O}}2}\\right)}x}{{≤ft(\\text{A}{{\\text{l}}2}{{\\text{O}}3}\\right)}1-x} for x < 0.5 and {{≤ft(\\text{A}{{\\text{l}}2}{{\\text{O}}3}\\right)}x}{{≤ft(\\text{Si}{{\\text{O}}2}\\right)}1-x} for x≥slant 0.5 are studied using neutron diffraction with aerodynamic levitation and density functional theory molecular dynamics modelling. Simulated structure factors are found to be in good agreement with experimental structure factors. Local atomic structures from simulations reveal the role of calcium cations as a network modifier, and aluminium cations as a non-tetrahedral network former. Distributions of tetrahedral order show that an increasing concentration of the network former Al increases entropy, while an increasing concentration of the network modifier Ca decreases entropy. This trend is opposite to the conventional understanding that increasing amounts of network former should increase order in the network liquid, and so decrease entropy. The two-body correlation entropy S 2 is found to not correlate with the excess entropy values obtained from thermochemical databases, while entropies including higher-order correlations such as tetrahedral order, O-M-O or M-O-M bond angles and Q N environments show a clear linear correlation between computed entropy and database excess entropy. The possible relationship between atomic structures and excess entropy is discussed.
Carbon deposition model for oxygen-hydrocarbon combustion

NASA Technical Reports Server (NTRS)

Bossard, John A.

1988-01-01

The objectives are to use existing hardware to verify and extend the database generated on the original test programs. The data to be obtained are the carbon deposition characteristics when methane is used at injection densities comparable to full scale values. The database will be extended to include liquid natural gas (LNG) testing at low injection densities for gas generator/preburner conditions. The testing will be performed at mixture ratios between 0.25 and 0.60, and at chamber pressures between 750 and 1500 psi.
[Comparing results of methicillin-resistant Staphylococcus aureus (MRSA) surveillance using the French DRG-based information system (PMSI)].

PubMed

Nuemi, G; Astruc, K; Aho, S; Quantin, C

2013-10-01

The surveillance of methicillin-resistant Staphylococcus aureus (MRSA) is a national priority. The rate of MRSA infections is one of six indicators tracked by the Department of Health. Since 2002, the French institute for public health surveillance (InVS) has monitored MRSA infections to estimate incidence density. Today, the use of the French administrative database (PMSI) could facilitate this surveillance. The aim of this study was to compare MRSA incidence density computed at a national level using PMSI databases with the results from the InVS taken as the reference. PMSI databases for the years 2006 to 2009 were used. The reference results were those published by the InVS from 2006 to 2009. MRSA density defined as the number of MRSA infections recorded per year over 1000 hospital stays was computed. It was then compared with the MRSA incidence density measured by InVS. The time course of MRSA incidence in the PMSI records was modeled using a Poisson regression. The incidence density measured by the InVS was higher than the MRSA density computed using the PMSI, but this difference appeared to decrease over time. The PMSI density/InVS MRSA incidence density ratio was 0.8% in 2006 and about 9.2% in 2009. We observed inverted trends with a growing trend in MRSA density identified by the PMSI. Furthermore, the year of study was significantly associated with incidence density (P=0.01). Using PMSI data as an additional source of information in the hospital MRSA surveillance process makes it possible to detect and analyze patient repeats at the regional and national levels with linkage facilities. Estimation of incidence density for hospitals not participating to this surveillance system will be the next step. Copyright © 2013 Elsevier Masson SAS. All rights reserved.
Functional integration of automated system databases by means of artificial intelligence

NASA Astrophysics Data System (ADS)

Dubovoi, Volodymyr M.; Nikitenko, Olena D.; Kalimoldayev, Maksat; Kotyra, Andrzej; Gromaszek, Konrad; Iskakova, Aigul

2017-08-01

The paper presents approaches for functional integration of automated system databases by means of artificial intelligence. The peculiarities of turning to account the database in the systems with the usage of a fuzzy implementation of functions were analyzed. Requirements for the normalization of such databases were defined. The question of data equivalence in conditions of uncertainty and collisions in the presence of the databases functional integration is considered and the model to reveal their possible occurrence is devised. The paper also presents evaluation method of standardization of integrated database normalization.
Natural History of Breast Density and Breast Cancer Risk

DTIC Science & Technology

2001-07-01

mammography database. We have estimated breast density on the oldest mammogram from both cases and controls, using our semi-automated software and...using the oldest mammogram) with breast cancer risk. Next winter, we will continue these analyses investigating the change in density over time and
New Embedded Denotes Fuzzy C-Mean Application for Breast Cancer Density Segmentation in Digital Mammograms

NASA Astrophysics Data System (ADS)

Othman, Khairulnizam; Ahmad, Afandi

2016-11-01

In this research we explore the application of normalize denoted new techniques in advance fast c-mean in to the problem of finding the segment of different breast tissue regions in mammograms. The goal of the segmentation algorithm is to see if new denotes fuzzy c- mean algorithm could separate different densities for the different breast patterns. The new density segmentation is applied with multi-selection of seeds label to provide the hard constraint, whereas the seeds labels are selected based on user defined. New denotes fuzzy c- mean have been explored on images of various imaging modalities but not on huge format digital mammograms just yet. Therefore, this project is mainly focused on using normalize denoted new techniques employed in fuzzy c-mean to perform segmentation to increase visibility of different breast densities in mammography images. Segmentation of the mammogram into different mammographic densities is useful for risk assessment and quantitative evaluation of density changes. Our proposed methodology for the segmentation of mammograms on the basis of their region into different densities based categories has been tested on MIAS database and Trueta Database.
Accessing protein conformational ensembles using room-temperature X-ray crystallography

PubMed Central

Fraser, James S.; van den Bedem, Henry; Samelson, Avi J.; Lang, P. Therese; Holton, James M.; Echols, Nathaniel; Alber, Tom

2011-01-01

Modern protein crystal structures are based nearly exclusively on X-ray data collected at cryogenic temperatures (generally 100 K). The cooling process is thought to introduce little bias in the functional interpretation of structural results, because cryogenic temperatures minimally perturb the overall protein backbone fold. In contrast, here we show that flash cooling biases previously hidden structural ensembles in protein crystals. By analyzing available data for 30 different proteins using new computational tools for electron-density sampling, model refinement, and molecular packing analysis, we found that crystal cryocooling remodels the conformational distributions of more than 35% of side chains and eliminates packing defects necessary for functional motions. In the signaling switch protein, H-Ras, an allosteric network consistent with fluctuations detected in solution by NMR was uncovered in the room-temperature, but not the cryogenic, electron-density maps. These results expose a bias in structural databases toward smaller, overpacked, and unrealistically unique models. Monitoring room-temperature conformational ensembles by X-ray crystallography can reveal motions crucial for catalysis, ligand binding, and allosteric regulation. PMID:21918110
Achieving DFT accuracy with a machine-learning interatomic potential: Thermomechanics and defects in bcc ferromagnetic iron

NASA Astrophysics Data System (ADS)

Dragoni, Daniele; Daff, Thomas D.; Csányi, Gábor; Marzari, Nicola

2018-01-01

We show that the Gaussian Approximation Potential (GAP) machine-learning framework can describe complex magnetic potential energy surfaces, taking ferromagnetic iron as a paradigmatic challenging case. The training database includes total energies, forces, and stresses obtained from density-functional theory in the generalized-gradient approximation, and comprises approximately 150,000 local atomic environments, ranging from pristine and defected bulk configurations to surfaces and generalized stacking faults with different crystallographic orientations. We find the structural, vibrational, and thermodynamic properties of the GAP model to be in excellent agreement with those obtained directly from first-principles electronic-structure calculations. There is good transferability to quantities, such as Peierls energy barriers, which are determined to a large extent by atomic configurations that were not part of the training set. We observe the benefit and the need of using highly converged electronic-structure calculations to sample a target potential energy surface. The end result is a systematically improvable potential that can achieve the same accuracy of density-functional theory calculations, but at a fraction of the computational cost.

Characterizing Fishing Effort and Spatial Extent of Coastal Fisheries

PubMed Central

Stewart, Kelly R.; Lewison, Rebecca L.; Dunn, Daniel C.; Bjorkland, Rhema H.; Kelez, Shaleyla; Halpin, Patrick N.; Crowder, Larry B.

2010-01-01

Biodiverse coastal zones are often areas of intense fishing pressure due to the high relative density of fishing capacity in these nearshore regions. Although overcapacity is one of the central challenges to fisheries sustainability in coastal zones, accurate estimates of fishing pressure in coastal zones are limited, hampering the assessment of the direct and collateral impacts (e.g., habitat degradation, bycatch) of fishing. We compiled a comprehensive database of fishing effort metrics and the corresponding spatial limits of fisheries and used a spatial analysis program (FEET) to map fishing effort density (measured as boat-meters per km2) in the coastal zones of six ocean regions. We also considered the utility of a number of socioeconomic variables as indicators of fishing pressure at the national level; fishing density increased as a function of population size and decreased as a function of coastline length. Our mapping exercise points to intra and interregional ‘hotspots’ of coastal fishing pressure. The significant and intuitive relationships we found between fishing density and population size and coastline length may help with coarse regional characterizations of fishing pressure. However, spatially-delimited fishing effort data are needed to accurately map fishing hotspots, i.e., areas of intense fishing activity. We suggest that estimates of fishing effort, not just target catch or yield, serve as a necessary measure of fishing activity, which is a key link to evaluating sustainability and environmental impacts of coastal fisheries. PMID:21206903
Radio-science performance analysis software

NASA Astrophysics Data System (ADS)

Morabito, D. D.; Asmar, S. W.

1995-02-01

The Radio Science Systems Group (RSSG) provides various support functions for several flight project radio-science teams. Among these support functions are uplink and sequence planning, real-time operations monitoring and support, data validation, archiving and distribution functions, and data processing and analysis. This article describes the support functions that encompass radio-science data performance analysis. The primary tool used by the RSSG to fulfill this support function is the STBLTY program set. STBLTY is used to reconstruct observable frequencies and calculate model frequencies, frequency residuals, frequency stability in terms of Allan deviation, reconstructed phase, frequency and phase power spectral density, and frequency drift rates. In the case of one-way data, using an ultrastable oscillator (USO) as a frequency reference, the program set computes the spacecraft transmitted frequency and maintains a database containing the in-flight history of the USO measurements. The program set also produces graphical displays. Some examples and discussions on operating the program set on Galileo and Ulysses data will be presented.
Radio-Science Performance Analysis Software

NASA Astrophysics Data System (ADS)

Morabito, D. D.; Asmar, S. W.

1994-10-01

The Radio Science Systems Group (RSSG) provides various support functions for several flight project radio-science teams. Among these support functions are uplink and sequence planning, real-time operations monitoring and support, data validation, archiving and distribution functions, and data processing and analysis. This article describes the support functions that encompass radio science data performance analysis. The primary tool used by the RSSG to fulfill this support function is the STBLTY program set. STBLTY is used to reconstruct observable frequencies and calculate model frequencies, frequency residuals, frequency stability in terms of Allan deviation, reconstructed phase, frequency and phase power spectral density, and frequency drift rates. In the case of one-way data, using an ultrastable oscillator (USO) as a frequency reference, the program set computes the spacecraft transmitted frequency and maintains a database containing the in-flight history of the USO measurements. The program set also produces graphical displays. Some examples and discussion on operating the program set on Galileo and Ulysses data will be presented.
Radio-science performance analysis software

NASA Technical Reports Server (NTRS)

Morabito, D. D.; Asmar, S. W.

1995-01-01

The Radio Science Systems Group (RSSG) provides various support functions for several flight project radio-science teams. Among these support functions are uplink and sequence planning, real-time operations monitoring and support, data validation, archiving and distribution functions, and data processing and analysis. This article describes the support functions that encompass radio-science data performance analysis. The primary tool used by the RSSG to fulfill this support function is the STBLTY program set. STBLTY is used to reconstruct observable frequencies and calculate model frequencies, frequency residuals, frequency stability in terms of Allan deviation, reconstructed phase, frequency and phase power spectral density, and frequency drift rates. In the case of one-way data, using an ultrastable oscillator (USO) as a frequency reference, the program set computes the spacecraft transmitted frequency and maintains a database containing the in-flight history of the USO measurements. The program set also produces graphical displays. Some examples and discussions on operating the program set on Galileo and Ulysses data will be presented.
Cost Modeling for Space Optical Telescope Assemblies

NASA Technical Reports Server (NTRS)

Stahl, H. Philip; Henrichs, Todd; Luedtke, Alexander; West, Miranda

2011-01-01

Parametric cost models are used to plan missions, compare concepts and justify technology investments. This paper reviews an on-going effort to develop cost modes for space telescopes. This paper summarizes the methodology used to develop cost models and documents how changes to the database have changed previously published preliminary cost models. While the cost models are evolving, the previously published findings remain valid: it costs less per square meter of collecting aperture to build a large telescope than a small telescope; technology development as a function of time reduces cost; and lower areal density telescopes cost more than more massive telescopes.
MN15-L: A New Local Exchange-Correlation Functional for Kohn-Sham Density Functional Theory with Broad Accuracy for Atoms, Molecules, and Solids.

PubMed

Yu, Haoyu S; He, Xiao; Truhlar, Donald G

2016-03-08

Kohn-Sham density functional theory is widely used for applications of electronic structure theory in chemistry, materials science, and condensed-matter physics, but the accuracy depends on the quality of the exchange-correlation functional. Here, we present a new local exchange-correlation functional called MN15-L that predicts accurate results for a broad range of molecular and solid-state properties including main-group bond energies, transition metal bond energies, reaction barrier heights, noncovalent interactions, atomic excitation energies, ionization potentials, electron affinities, total atomic energies, hydrocarbon thermochemistry, and lattice constants of solids. The MN15-L functional has the same mathematical form as a previous meta-nonseparable gradient approximation exchange-correlation functional, MN12-L, but it is improved because we optimized it against a larger database, designated 2015A, and included smoothness restraints; the optimization has a much better representation of transition metals. The mean unsigned error on 422 chemical energies is 2.32 kcal/mol, which is the best among all tested functionals, with or without nonlocal exchange. The MN15-L functional also provides good results for test sets that are outside the training set. A key issue is that the functional is local (no nonlocal exchange or nonlocal correlation), which makes it relatively economical for treating large and complex systems and solids. Another key advantage is that medium-range correlation energy is built in so that one does not need to add damped dispersion by molecular mechanics in order to predict accurate noncovalent binding energies. We believe that the MN15-L functional should be useful for a wide variety of applications in chemistry, physics, materials science, and molecular biology.
Applications of Database Machines in Library Systems.

ERIC Educational Resources Information Center

Salmon, Stephen R.

1984-01-01

Characteristics and advantages of database machines are summarized and their applications to library functions are described. The ability to attach multiple hosts to the same database and flexibility in choosing operating and database management systems for different functions without loss of access to common database are noted. (EJS)
Creating a normative database of age-specific 3D geometrical data, bone density, and bone thickness of the developing skull: a pilot study.

PubMed

Delye, Hans; Clijmans, Tim; Mommaerts, Maurice Yves; Sloten, Jos Vnder; Goffin, Jan

2015-12-01

Finite element models (FEMs) of the head are used to study the biomechanics of traumatic brain injury and depend heavily on the use of accurate material properties and head geometry. Any FEM aimed at investigating traumatic head injury in children should therefore use age-specific dimensions of the head, as well as age-specific material properties of the different tissues. In this study, the authors built a database of age-corrected skull geometry, skull thickness, and bone density of the developing skull to aid in the development of an age-specific FEM of a child's head. Such a database, containing age-corrected normative skull geometry data, can also be used for preoperative surgical planning and postoperative long-term follow-up of craniosynostosis surgery results. Computed tomography data were processed for 187 patients (age range 0-20 years old). A 3D surface model was calculated from segmented skull surfaces. Skull models, reference points, and sutures were processed into a MATLAB-supported database. This process included automatic calculation of 2D measurements as well as 3D measurements: length of the coronal suture, length of the lambdoid suture, and the 3D anterior-posterior length, defined as the sum of the metopic and sagittal suture. Skull thickness and skull bone density calculations were included. Cephalic length, cephalic width, intercoronal distance, lateral orbital distance, intertemporal distance, and 3D measurements were obtained, confirming the well-established general growth pattern of the skull. Skull thickness increases rapidly in the first year of life, slowing down during the second year of life, while skull density increases with a fast but steady pace during the first 3 years of life. Both skull thickness and density continue to increase up to adulthood. This is the first report of normative data on 2D and 3D measurements, skull bone thickness, and skull bone density for children aged 0-20 years. This database can help build an age-specific FEM of a child's head. It can also help to tailor preoperative virtual planning in craniosynostosis surgery toward patient-specific normative target values and to perform objective long-term follow-up in craniosynostosis surgery.
The choice of normative pediatric reference database changes spine bone mineral density Z-scores but not the relationship between bone mineral density and prevalent vertebral fractures.

PubMed

Ma, Jinhui; Siminoski, Kerry; Alos, Nathalie; Halton, Jacqueline; Ho, Josephine; Lentle, Brian; Matzinger, MaryAnn; Shenouda, Nazih; Atkinson, Stephanie; Barr, Ronald; Cabral, David A; Couch, Robert; Cummings, Elizabeth A; Fernandez, Conrad V; Grant, Ronald M; Rodd, Celia; Sbrocchi, Anne Marie; Scharke, Maya; Rauch, Frank; Ward, Leanne M

2015-03-01

Our objectives were to assess the magnitude of the disparity in lumbar spine bone mineral density (LSBMD) Z-scores generated by different reference databases and to evaluate whether the relationship between LSBMD Z-scores and vertebral fractures (VF) varies by choice of database. Children with leukemia underwent LSBMD by cross-calibrated dual-energy x-ray absorptiometry, with Z-scores generated according to Hologic and Lunar databases. VF were assessed by the Genant method on spine radiographs. Logistic regression was used to assess the association between fractures and LSBMD Z-scores. Net reclassification improvement and area under the receiver operating characteristic curve were calculated to assess the predictive accuracy of LSBMD Z-scores for VF. For the 186 children from 0 to 18 years of age, 6 different age ranges were studied. The Z-scores generated for the 0 to 18 group were highly correlated (r ≥ 0.90), but the proportion of children with LSBMD Z-scores ≤-2.0 among those with VF varied substantially (from 38-66%). Odds ratios (OR) for the association between LSBMD Z-score and VF were similar regardless of database (OR = 1.92, 95% confidence interval 1.44, 2.56 to OR = 2.70, 95% confidence interval 1.70, 4.28). Area under the receiver operating characteristic curve and net reclassification improvement ranged from 0.71 to 0.75 and -0.15 to 0.07, respectively. Although the use of a LSBMD Z-score threshold as part of the definition of osteoporosis in a child with VF does not appear valid, the study of relationships between BMD and VF is valid regardless of the BMD database that is used.
Molecular Quantum Similarity, Chemical Reactivity and Database Screening of 3D Pharmacophores of the Protein Kinases A, B and G from Mycobacterium tuberculosis.

PubMed

Morales-Bayuelo, Alejandro

2017-06-21

Mycobacterium tuberculosis remains one of the world's most devastating pathogens. For this reason, we developed a study involving 3D pharmacophore searching, selectivity analysis and database screening for a series of anti-tuberculosis compounds, associated with the protein kinases A, B, and G. This theoretical study is expected to shed some light onto some molecular aspects that could contribute to the knowledge of the molecular mechanics behind interactions of these compounds, with anti-tuberculosis activity. Using the Molecular Quantum Similarity field and reactivity descriptors supported in the Density Functional Theory, it was possible to measure the quantification of the steric and electrostatic effects through the Overlap and Coulomb quantitative convergence (alpha and beta) scales. In addition, an analysis of reactivity indices using global and local descriptors was developed, identifying the binding sites and selectivity on these anti-tuberculosis compounds in the active sites. Finally, the reported pharmacophores to PKn A, B and G, were used to carry out database screening, using a database with anti-tuberculosis drugs from the Kelly Chibale research group (http://www.kellychibaleresearch.uct.ac.za/), to find the compounds with affinity for the specific protein targets associated with PKn A, B and G. In this regard, this hybrid methodology (Molecular Mechanic/Quantum Chemistry) shows new insights into drug design that may be useful in the tuberculosis treatment today.
K-SPAN: A lexical database of Korean surface phonetic forms and phonological neighborhood density statistics.

PubMed

Holliday, Jeffrey J; Turnbull, Rory; Eychenne, Julien

2017-10-01

This article presents K-SPAN (Korean Surface Phonetics and Neighborhoods), a database of surface phonetic forms and several measures of phonological neighborhood density for 63,836 Korean words. Currently publicly available Korean corpora are limited by the fact that they only provide orthographic representations in Hangeul, which is problematic since phonetic forms in Korean cannot be reliably predicted from orthographic forms. We describe the method used to derive the surface phonetic forms from a publicly available orthographic corpus of Korean, and report on several statistics calculated using this database; namely, segment unigram frequencies, which are compared to previously reported results, along with segment-based and syllable-based neighborhood density statistics for three types of representation: an "orthographic" form, which is a quasi-phonological representation, a "conservative" form, which maintains all known contrasts, and a "modern" form, which represents the pronunciation of contemporary Seoul Korean. These representations are rendered in an ASCII-encoded scheme, which allows users to query the corpus without having to read Korean orthography, and permits the calculation of a wide range of phonological measures.
NRLMSISE-00 Empirical Model of the Atmosphere: Statistical Comparisons and Scientific Issues

NASA Technical Reports Server (NTRS)

Aikin, A. C.; Picone, J. M.; Hedin, A. E.; Drob, D. P.

2001-01-01

The new NRLMSISE-00 model and the associated NRLMSIS database now include the following data: (1) total mass density from satellite accelerometers and from orbit determination, including the Jacchia and Barlier data; (2) temperature from incoherent scatter radar, and; (3) molecular oxygen number density, [O2], from solar ultraviolet occultation aboard the Solar Maximum Mission (SMM). A new component, 'anomalous oxygen,' allows for appreciable O(+) and hot atomic oxygen contributions to the total mass density at high altitudes and applies primarily to drag estimation above 500 km. Extensive tables compare our entire database to the NRLMSISE-00, MSISE-90, and Jacchia-70 models for different altitude bands and levels of geomagnetic activity. We also investigate scientific issues related to the new data sets in the NRLMSIS database. Especially noteworthy is the solar activity dependence of the Jacchia data, with which we investigate a large O(+) contribution to the total mass density under the combination of summer, low solar activity, high latitudes, and high altitudes. Under these conditions, except at very low solar activity, the Jacchia data and the Jacchia-70 model indeed show a significantly higher total mass density than does MSISE-90. However, under the corresponding winter conditions, the MSIS-class models represent a noticeable improvement relative to Jacchia-70 over a wide range of F(sub 10.7). Considering the two regimes together, NRLMSISE-00 achieves an improvement over both MSISE-90 and Jacchia-70 by incorporating advantages of each.
Mode-of-action evaluation for the effect of trans fatty acids on low-density lipoprotein cholesterol.

PubMed

Reichard, John F; Haber, Lynne T

2016-12-01

The purpose of this work is to systematically consider the data relating to the mode of action (MOA) for the effects of industrially produced trans fatty acid (iTFA) on plasma low-density lipoprotein (LDL) levels. The hypothesized MOA is composed of two key events: increased LDL production and decreased LDL clearance. A substantial database supports this MOA, although the key events are likely to be interdependent, rather than sequential. Both key events are functions of nonlinear biological processes including rate-limited clearance, receptor-mediated transcription, and both positive and negative feedback regulation. Each key event was evaluated based on weight-of-evidence analysis and for human relevance. We conclude that the data are inadequate for a detailed dose-response analysis in the context of the evolved Bradford Hill considerations; however, the weight of evidence is strong and the overall shape of the dose-response curves for markers of the key events and the key determinants of those relationships is well understood in many cases and is nonlinear. Feedback controls are responsible for maintaining homeostasis of cholesterol and triglyceride levels and underlie both of the key events, resulting in a less-than-linear or thresholded relationship between TFA and LDL-C. The inconsistencies and gaps in the database are discussed. Copyright Â© 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
A computational platform to maintain and migrate manual functional annotations for BioCyc databases.

PubMed

Walsh, Jesse R; Sen, Taner Z; Dickerson, Julie A

2014-10-12

BioCyc databases are an important resource for information on biological pathways and genomic data. Such databases represent the accumulation of biological data, some of which has been manually curated from literature. An essential feature of these databases is the continuing data integration as new knowledge is discovered. As functional annotations are improved, scalable methods are needed for curators to manage annotations without detailed knowledge of the specific design of the BioCyc database. We have developed CycTools, a software tool which allows curators to maintain functional annotations in a model organism database. This tool builds on existing software to improve and simplify annotation data imports of user provided data into BioCyc databases. Additionally, CycTools automatically resolves synonyms and alternate identifiers contained within the database into the appropriate internal identifiers. Automating steps in the manual data entry process can improve curation efforts for major biological databases. The functionality of CycTools is demonstrated by transferring GO term annotations from MaizeCyc to matching proteins in CornCyc, both maize metabolic pathway databases available at MaizeGDB, and by creating strain specific databases for metabolic engineering.
An overview of the multi-database manipulation language MDSL

DOE Office of Scientific and Technical Information (OSTI.GOV)

Litwin, W.; Abdellatif, A.

With the increase in availability of databases, data needed by a user are frequently in separate autonomous databases. The logical properties of such data differ from the classical ones with a single database. In particular, they call for new functions for data manipulation. MDSL is a new data manipulation language providing such functions. Most of the MDSL functions are not available in other languages.
The thermodynamic scale of inorganic crystalline metastability

PubMed Central

Sun, Wenhao; Dacek, Stephen T.; Ong, Shyue Ping; Hautier, Geoffroy; Jain, Anubhav; Richards, William D.; Gamst, Anthony C.; Persson, Kristin A.; Ceder, Gerbrand

2016-01-01

The space of metastable materials offers promising new design opportunities for next-generation technological materials, such as complex oxides, semiconductors, pharmaceuticals, steels, and beyond. Although metastable phases are ubiquitous in both nature and technology, only a heuristic understanding of their underlying thermodynamics exists. We report a large-scale data-mining study of the Materials Project, a high-throughput database of density functional theory–calculated energetics of Inorganic Crystal Structure Database structures, to explicitly quantify the thermodynamic scale of metastability for 29,902 observed inorganic crystalline phases. We reveal the influence of chemistry and composition on the accessible thermodynamic range of crystalline metastability for polymorphic and phase-separating compounds, yielding new physical insights that can guide the design of novel metastable materials. We further assert that not all low-energy metastable compounds can necessarily be synthesized, and propose a principle of ‘remnant metastability’—that observable metastable crystalline phases are generally remnants of thermodynamic conditions where they were once the lowest free-energy phase. PMID:28138514
Implementation of Quantum Private Queries Using Nuclear Magnetic Resonance

NASA Astrophysics Data System (ADS)

Wang, Chuan; Hao, Liang; Zhao, Lian-Jie

2011-08-01

We present a modified protocol for the realization of a quantum private query process on a classical database. Using one-qubit query and CNOT operation, the query process can be realized in a two-mode database. In the query process, the data privacy is preserved as the sender would not reveal any information about the database besides her query information, and the database provider cannot retain any information about the query. We implement the quantum private query protocol in a nuclear magnetic resonance system. The density matrix of the memory registers are constructed.
Suppression of radiation-induced point defects by rhenium and osmium interstitials in tungsten

PubMed Central

Suzudo, Tomoaki; Hasegawa, Akira

2016-01-01

Modeling the evolution of radiation-induced defects is important for finding radiation-resistant materials, which would be greatly appreciated in nuclear applications. We apply the density functional theory combined with comprehensive analyses of massive experimental database to indicate a mechanism to mitigate the effect of radiation on W crystals by adding particular solute elements that change the migration property of interstitials. The resultant mechanism is applicable to any body-centered-cubic (BCC) metals whose self-interstitial atoms become a stable crowdion and is expected to provide a general guideline for computational design of radiation-resistant alloys in the field of nuclear applications. PMID:27824134
SNPchiMp: a database to disentangle the SNPchip jungle in bovine livestock.

PubMed

Nicolazzi, Ezequiel Luis; Picciolini, Matteo; Strozzi, Francesco; Schnabel, Robert David; Lawley, Cindy; Pirani, Ali; Brew, Fiona; Stella, Alessandra

2014-02-11

Currently, six commercial whole-genome SNP chips are available for cattle genotyping, produced by two different genotyping platforms. Technical issues need to be addressed to combine data that originates from the different platforms, or different versions of the same array generated by the manufacturer. For example: i) genome coordinates for SNPs may refer to different genome assemblies; ii) reference genome sequences are updated over time changing the positions, or even removing sequences which contain SNPs; iii) not all commercial SNP ID's are searchable within public databases; iv) SNPs can be coded using different formats and referencing different strands (e.g. A/B or A/C/T/G alleles, referencing forward/reverse, top/bottom or plus/minus strand); v) Due to new information being discovered, higher density chips do not necessarily include all the SNPs present in the lower density chips; and, vi) SNP IDs may not be consistent across chips and platforms. Most researchers and breed associations manage SNP data in real-time and thus require tools to standardise data in a user-friendly manner. Here we present SNPchiMp, a MySQL database linked to an open access web-based interface. Features of this interface include, but are not limited to, the following functions: 1) referencing the SNP mapping information to the latest genome assembly, 2) extraction of information contained in dbSNP for SNPs present in all commercially available bovine chips, and 3) identification of SNPs in common between two or more bovine chips (e.g. for SNP imputation from lower to higher density). In addition, SNPchiMp can retrieve this information on subsets of SNPs, accessing such data either via physical position on a supported assembly, or by a list of SNP IDs, rs or ss identifiers. This tool combines many different sources of information, that otherwise are time consuming to obtain and difficult to integrate. The SNPchiMp not only provides the information in a user-friendly format, but also enables researchers to perform a large number of operations with a few clicks of the mouse. This significantly reduces the time needed to execute the large number of operations required to manage SNP data.
[A Terahertz Spectral Database Based on Browser/Server Technique].

PubMed

Zhang, Zhuo-yong; Song, Yue

2015-09-01

With the solution of key scientific and technical problems and development of instrumentation, the application of terahertz technology in various fields has been paid more and more attention. Owing to the unique characteristic advantages, terahertz technology has been showing a broad future in the fields of fast, non-damaging detections, as well as many other fields. Terahertz technology combined with other complementary methods can be used to cope with many difficult practical problems which could not be solved before. One of the critical points for further development of practical terahertz detection methods depends on a good and reliable terahertz spectral database. We developed a BS (browser/server) -based terahertz spectral database recently. We designed the main structure and main functions to fulfill practical requirements. The terahertz spectral database now includes more than 240 items, and the spectral information was collected based on three sources: (1) collection and citation from some other abroad terahertz spectral databases; (2) collected from published literatures; and (3) spectral data measured in our laboratory. The present paper introduced the basic structure and fundament functions of the terahertz spectral database developed in our laboratory. One of the key functions of this THz database is calculation of optical parameters. Some optical parameters including absorption coefficient, refractive index, etc. can be calculated based on the input THz time domain spectra. The other main functions and searching methods of the browser/server-based terahertz spectral database have been discussed. The database search system can provide users convenient functions including user registration, inquiry, displaying spectral figures and molecular structures, spectral matching, etc. The THz database system provides an on-line searching function for registered users. Registered users can compare the input THz spectrum with the spectra of database, according to the obtained correlation coefficient one can perform the searching task very fast and conveniently. Our terahertz spectral database can be accessed at http://www.teralibrary.com. The proposed terahertz spectral database is based on spectral information so far, and will be improved in the future. We hope this terahertz spectral database can provide users powerful, convenient, and high efficient functions, and could promote the broader applications of terahertz technology.

search GenBank: interactive orchestration and ad-hoc choreography of Web services in the exploration of the biomedical resources of the National Center For Biotechnology Information

PubMed Central

2013-01-01

Background Due to the growing number of biomedical entries in data repositories of the National Center for Biotechnology Information (NCBI), it is difficult to collect, manage and process all of these entries in one place by third-party software developers without significant investment in hardware and software infrastructure, its maintenance and administration. Web services allow development of software applications that integrate in one place the functionality and processing logic of distributed software components, without integrating the components themselves and without integrating the resources to which they have access. This is achieved by appropriate orchestration or choreography of available Web services and their shared functions. After the successful application of Web services in the business sector, this technology can now be used to build composite software tools that are oriented towards biomedical data processing. Results We have developed a new tool for efficient and dynamic data exploration in GenBank and other NCBI databases. A dedicated search GenBank system makes use of NCBI Web services and a package of Entrez Programming Utilities (eUtils) in order to provide extended searching capabilities in NCBI data repositories. In search GenBank users can use one of the three exploration paths: simple data searching based on the specified user’s query, advanced data searching based on the specified user’s query, and advanced data exploration with the use of macros. search GenBank orchestrates calls of particular tools available through the NCBI Web service providing requested functionality, while users interactively browse selected records in search GenBank and traverse between NCBI databases using available links. On the other hand, by building macros in the advanced data exploration mode, users create choreographies of eUtils calls, which can lead to the automatic discovery of related data in the specified databases. Conclusions search GenBank extends standard capabilities of the NCBI Entrez search engine in querying biomedical databases. The possibility of creating and saving macros in the search GenBank is a unique feature and has a great potential. The potential will further grow in the future with the increasing density of networks of relationships between data stored in particular databases. search GenBank is available for public use at http://sgb.biotools.pl/. PMID:23452691
search GenBank: interactive orchestration and ad-hoc choreography of Web services in the exploration of the biomedical resources of the National Center For Biotechnology Information.

PubMed

Mrozek, Dariusz; Małysiak-Mrozek, Bożena; Siążnik, Artur

2013-03-01

Due to the growing number of biomedical entries in data repositories of the National Center for Biotechnology Information (NCBI), it is difficult to collect, manage and process all of these entries in one place by third-party software developers without significant investment in hardware and software infrastructure, its maintenance and administration. Web services allow development of software applications that integrate in one place the functionality and processing logic of distributed software components, without integrating the components themselves and without integrating the resources to which they have access. This is achieved by appropriate orchestration or choreography of available Web services and their shared functions. After the successful application of Web services in the business sector, this technology can now be used to build composite software tools that are oriented towards biomedical data processing. We have developed a new tool for efficient and dynamic data exploration in GenBank and other NCBI databases. A dedicated search GenBank system makes use of NCBI Web services and a package of Entrez Programming Utilities (eUtils) in order to provide extended searching capabilities in NCBI data repositories. In search GenBank users can use one of the three exploration paths: simple data searching based on the specified user's query, advanced data searching based on the specified user's query, and advanced data exploration with the use of macros. search GenBank orchestrates calls of particular tools available through the NCBI Web service providing requested functionality, while users interactively browse selected records in search GenBank and traverse between NCBI databases using available links. On the other hand, by building macros in the advanced data exploration mode, users create choreographies of eUtils calls, which can lead to the automatic discovery of related data in the specified databases. search GenBank extends standard capabilities of the NCBI Entrez search engine in querying biomedical databases. The possibility of creating and saving macros in the search GenBank is a unique feature and has a great potential. The potential will further grow in the future with the increasing density of networks of relationships between data stored in particular databases. search GenBank is available for public use at http://sgb.biotools.pl/.
47 CFR 0.241 - Authority delegated.

Code of Federal Regulations, 2014 CFR

2014-10-01

... individual database managers; and to perform other functions as needed for the administration of the TV bands... database functions for unlicensed devices operating in the television broadcast bands (TV bands) as set... methods that will be used to designate TV bands database managers, to designate these database managers...
A database of new zeolite-like materials.

PubMed

Pophale, Ramdas; Cheeseman, Phillip A; Deem, Michael W

2011-07-21

We here describe a database of computationally predicted zeolite-like materials. These crystals were discovered by a Monte Carlo search for zeolite-like materials. Positions of Si atoms as well as unit cell, space group, density, and number of crystallographically unique atoms were explored in the construction of this database. The database contains over 2.6 M unique structures. Roughly 15% of these are within +30 kJ mol(-1) Si of α-quartz, the band in which most of the known zeolites lie. These structures have topological, geometrical, and diffraction characteristics that are similar to those of known zeolites. The database is the result of refinement by two interatomic potentials that both satisfy the Pauli exclusion principle. The database has been deposited in the publicly available PCOD database and in www.hypotheticalzeolites.net/database/deem/. This journal is © the Owner Societies 2011
Anopheles atroparvus density modeling using MODIS NDVI in a former malarious area in Portugal.

PubMed

Lourenço, Pedro M; Sousa, Carla A; Seixas, Júlia; Lopes, Pedro; Novo, Maria T; Almeida, A Paulo G

2011-12-01

Malaria is dependent on environmental factors and considered as potentially re-emerging in temperate regions. Remote sensing data have been used successfully for monitoring environmental conditions that influence the patterns of such arthropod vector-borne diseases. Anopheles atroparvus density data were collected from 2002 to 2005, on a bimonthly basis, at three sites in a former malarial area in Southern Portugal. The development of the Remote Vector Model (RVM) was based upon two main variables: temperature and the Normalized Differential Vegetation Index (NDVI) from the Moderate Resolution Imaging Spectroradiometer (MODIS) Terra satellite. Temperature influences the mosquito life cycle and affects its intra-annual prevalence, and MODIS NDVI was used as a proxy for suitable habitat conditions. Mosquito data were used for calibration and validation of the model. For areas with high mosquito density, the model validation demonstrated a Pearson correlation of 0.68 (p<0.05) and a modelling efficiency/Nash-Sutcliffe of 0.44 representing the model's ability to predict intra- and inter-annual vector density trends. RVM estimates the density of the former malarial vector An. atroparvus as a function of temperature and of MODIS NDVI. RVM is a satellite data-based assimilation algorithm that uses temperature fields to predict the intra- and inter-annual densities of this mosquito species using MODIS NDVI. RVM is a relevant tool for vector density estimation, contributing to the risk assessment of transmission of mosquito-borne diseases and can be part of the early warning system and contingency plans providing support to the decision making process of relevant authorities. © 2011 The Society for Vector Ecology.
Construction and validation of a population-based bone densitometry database.

PubMed

Leslie, William D; Caetano, Patricia A; Macwilliam, Leonard R; Finlayson, Gregory S

2005-01-01

Utilization of dual-energy X-ray absorptiometry (DXA) for the initial diagnostic assessment of osteoporosis and in monitoring treatment has risen dramatically in recent years. Population-based studies of the impact of DXA and osteoporosis remain challenging because of incomplete and fragmented test data that exist in most regions. Our aim was to create and assess completeness of a database of all clinical DXA services and test results for the province of Manitoba, Canada and to present descriptive data resulting from testing. A regionally based bone density program for the province of Manitoba, Canada was established in 1997. Subsequent DXA services were prospectively captured in a program database. This database was retrospectively populated with earlier DXA results dating back to 1990 (the year that the first DXA scanner was installed) by integrating multiple data sources. A random chart audit was performed to assess completeness and accuracy of this dataset. For comparison, testing rates determined from the DXA database were compared with physician administrative claims data. There was a high level of completeness of this database (>99%) and accurate personal identifier information sufficient for linkage with other health care administrative data (>99%). This contrasted with physician billing data that were found to be markedly incomplete. Descriptive data provide a profile of individuals receiving DXA and their test results. In conclusion, the Manitoba bone density database has great potential as a resource for clinical and health policy research because it is population based with a high level of completeness and accuracy.
Improved Saturated Hydraulic Conductivity Pedotransfer Functions Using Machine Learning Methods

NASA Astrophysics Data System (ADS)

Araya, S. N.; Ghezzehei, T. A.

2017-12-01

Saturated hydraulic conductivity (Ks) is one of the fundamental hydraulic properties of soils. Its measurement, however, is cumbersome and instead pedotransfer functions (PTFs) are often used to estimate it. Despite a lot of progress over the years, generic PTFs that estimate hydraulic conductivity generally don't have a good performance. We develop significantly improved PTFs by applying state of the art machine learning techniques coupled with high-performance computing on a large database of over 20,000 soils—USKSAT and the Florida Soil Characterization databases. We compared the performance of four machine learning algorithms (k-nearest neighbors, gradient boosted model, support vector machine, and relevance vector machine) and evaluated the relative importance of several soil properties in explaining Ks. An attempt is also made to better account for soil structural properties; we evaluated the importance of variables derived from transformations of soil water retention characteristics and other soil properties. The gradient boosted models gave the best performance with root mean square errors less than 0.7 and mean errors in the order of 0.01 on a log scale of Ks [cm/h]. The effective particle size, D10, was found to be the single most important predictor. Other important predictors included percent clay, bulk density, organic carbon percent, coefficient of uniformity and values derived from water retention characteristics. Model performances were consistently better for Ks values greater than 10 cm/h. This study maximizes the extraction of information from a large database to develop generic machine learning based PTFs to estimate Ks. The study also evaluates the importance of various soil properties and their transformations in explaining Ks.
The 2006 Cape Canaveral Air Force Station Range Reference Atmosphere Model Validation Study and Sensitivity Analysis to the National Aeronautics and Space Administration's Space Shuttle

NASA Technical Reports Server (NTRS)

Decker, Ryan; Burns, Lee; Merry, Carl; Harrington, Brian

2008-01-01

NASA's Space Shuttle utilizes atmospheric thermodynamic properties to evaluate structural dynamics and vehicle flight performance impacts by the atmosphere during ascent. Statistical characteristics of atmospheric thermodynamic properties at Kennedy Space Center (KSC) used in Space. Shuttle Vehicle assessments are contained in the Cape Canaveral Air Force Station (CCAFS) Range Reference Atmosphere (RRA) Database. Database contains tabulations for monthly and annual means (mu), standard deviations (sigma) and skewness of wind and thermodynamic variables. Wind, Thermodynamic, Humidity and Hydrostatic parameters 1 km resolution interval from 0-30 km 2 km resolution interval 30-70 km Multiple revisions of the CCAFS RRA database have been developed since initial RRA published in 1963. 1971, 1983, 2006 Space Shuttle program utilized 1983 version for use in deriving "hot" and "cold" atmospheres, atmospheric density dispersions for use in vehicle certification analyses and selection of atmospheric thermodynamic profiles for use in vehicle ascent design and certification analyses. During STS-114 launch preparations in July 2005 atmospheric density observations between 50-80 kft exceeded density limits used for aerodynamic ascent heating constraints in vehicle certification analyses. Mission specific analyses were conducted and concluded that the density bias resulted in small changes to heating rates and integrated heat loading on the vehicle. In 2001, the Air Force Combat Climatology Center began developing an updated RRA for CCAFS.
Impact of stone density on outcomes in percutaneous nephrolithotomy (PCNL): an analysis of the clinical research office of the endourological society (CROES) pcnl global study database.

PubMed

Anastasiadis, Anastasios; Onal, Bulent; Modi, Pranjal; Turna, Burak; Duvdevani, Mordechai; Timoney, Anthony; Wolf, J Stuart; De La Rosette, Jean

2013-12-01

This study aimed to explore the relationship between stone density and outcomes of percutaneous nephrolithotomy (PCNL) using the Clinical Research Office of the Endourological Society (CROES) PCNL Global Study database. Patients undergoing PCNL treatment were assigned to a low stone density [LSD, ≤ 1000 Hounsfield units (HU)] or high stone density (HSD, > 1000 HU) group based on the radiological density of the primary renal stone. Preoperative characteristics and outcomes were compared in the two groups. Retreatment for residual stones was more frequent in the LSD group. The overall stone-free rate achieved was higher in the HSD group (79.3% vs 74.8%, p = 0.113). By univariate regression analysis, the probability of achieving a stone-free outcome peaked at approximately 1250 HU. Below or above this density resulted in lower treatment success, particularly at very low HU values. With increasing radiological stone density, operating time decreased to a minimum at approximately 1000 HU, then increased with further increase in stone density. Multivariate non-linear regression analysis showed a similar relationship between the probability of a stone-free outcome and stone density. Higher treatment success rates were found with low stone burden, pelvic stone location and use of pneumatic lithotripsy. Very low and high stone densities are associated with lower rates of treatment success and longer operating time in PCNL. Preoperative assessment of stone density may help in the selection of treatment modality for patients with renal stones.
Fingerprint-Based Structure Retrieval Using Electron Density

PubMed Central

Yin, Shuangye; Dokholyan, Nikolay V.

2010-01-01

We present a computational approach that can quickly search a large protein structural database to identify structures that fit a given electron density, such as determined by cryo-electron microscopy. We use geometric invariants (fingerprints) constructed using 3D Zernike moments to describe the electron density, and reduce the problem of fitting of the structure to the electron density to simple fingerprint comparison. Using this approach, we are able to screen the entire Protein Data Bank and identify structures that fit two experimental electron densities determined by cryo-electron microscopy. PMID:21287628
Fingerprint-based structure retrieval using electron density.

PubMed

Yin, Shuangye; Dokholyan, Nikolay V

2011-03-01

We present a computational approach that can quickly search a large protein structural database to identify structures that fit a given electron density, such as determined by cryo-electron microscopy. We use geometric invariants (fingerprints) constructed using 3D Zernike moments to describe the electron density, and reduce the problem of fitting of the structure to the electron density to simple fingerprint comparison. Using this approach, we are able to screen the entire Protein Data Bank and identify structures that fit two experimental electron densities determined by cryo-electron microscopy. Copyright © 2010 Wiley-Liss, Inc.
Development of a DNA microarray to detect antimicrobial resistance genes identified in the national center for biotechnology information database

USDA-ARS?s Scientific Manuscript database

High density genotyping techniques are needed for investigating antimicrobial resistance especially in the case of multi-drug resistant (MDR) isolates. To achieve this all antimicrobial resistance genes in the NCBI Genbank database were identified by key word searches of sequence annotations and the...
COMBREX-DB: an experiment centered database of protein function: knowledge, predictions and knowledge gaps.

PubMed

Chang, Yi-Chien; Hu, Zhenjun; Rachlin, John; Anton, Brian P; Kasif, Simon; Roberts, Richard J; Steffen, Martin

2016-01-04

The COMBREX database (COMBREX-DB; combrex.bu.edu) is an online repository of information related to (i) experimentally determined protein function, (ii) predicted protein function, (iii) relationships among proteins of unknown function and various types of experimental data, including molecular function, protein structure, and associated phenotypes. The database was created as part of the novel COMBREX (COMputational BRidges to EXperiments) effort aimed at accelerating the rate of gene function validation. It currently holds information on ∼ 3.3 million known and predicted proteins from over 1000 completely sequenced bacterial and archaeal genomes. The database also contains a prototype recommendation system for helping users identify those proteins whose experimental determination of function would be most informative for predicting function for other proteins within protein families. The emphasis on documenting experimental evidence for function predictions, and the prioritization of uncharacterized proteins for experimental testing distinguish COMBREX from other publicly available microbial genomics resources. This article describes updates to COMBREX-DB since an initial description in the 2011 NAR Database Issue. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Towards an understanding of wheat chloroplasts: a methodical investigation of thylakoid proteome.

PubMed

Kamal, Abu Hena Mostafa; Cho, Kun; Komatsu, Setsuko; Uozumi, Nobuyuki; Choi, Jong-Soon; Woo, Sun Hee

2012-05-01

We utilized Percoll density gradient centrifugation to isolate and fractionate chloroplasts of Korean winter wheat cultivar cv. Kumgang (Triticum aestivum L.). The resulting protein fractions were separated by one dimensional polyacrylamide gel electrophoresis (1D-PAGE) coupled with LTQ-FTICR mass spectrometry. This enabled us to detect and identify 767 unique proteins. Our findings represent the most comprehensive exploration of a proteome to date. Based on annotation information from the UniProtKB/Swiss-Prot database and our analyses via WoLF PSORT and PSORT, these proteins are localized in the chloroplast (607 proteins), chloroplast stroma (145), thylakoid membrane (342), lumens (163), and integral membranes (166). In all, 67% were confirmed as chloroplast thylakoid proteins. Although nearly complete protein coverage (89% proteins) has been accomplished for the key chloroplast pathways in wheat, such as for photosynthesis, many other proteins are involved in regulating carbon metabolism. The identified proteins were assigned to 103 functional categories according to a classification system developed by the iProClass database and provided through Protein Information Resources. Those functions include electron transport, energy, cellular organization and biogenesis, transport, stress responses, and other metabolic processes. Whereas most of these proteins are associated with known complexes and metabolic pathways, about 13% of the proteins have unknown functions. The chloroplast proteome contains many proteins that are localized to the thylakoids but as yet have no known function. We propose that some of these familiar proteins participate in the photosynthetic pathway. Thus, our new and comprehensive protein profile may provide clues for better understanding that photosynthetic process in wheat.
Fast in-database cross-matching of high-cadence, high-density source lists with an up-to-date sky model

NASA Astrophysics Data System (ADS)

Scheers, B.; Bloemen, S.; Mühleisen, H.; Schellart, P.; van Elteren, A.; Kersten, M.; Groot, P. J.

2018-04-01

Coming high-cadence wide-field optical telescopes will image hundreds of thousands of sources per minute. Besides inspecting the near real-time data streams for transient and variability events, the accumulated data archive is a wealthy laboratory for making complementary scientific discoveries. The goal of this work is to optimise column-oriented database techniques to enable the construction of a full-source and light-curve database for large-scale surveys, that is accessible by the astronomical community. We adopted LOFAR's Transients Pipeline as the baseline and modified it to enable the processing of optical images that have much higher source densities. The pipeline adds new source lists to the archive database, while cross-matching them with the known cataloguedsources in order to build a full light-curve archive. We investigated several techniques of indexing and partitioning the largest tables, allowing for faster positional source look-ups in the cross matching algorithms. We monitored all query run times in long-term pipeline runs where we processed a subset of IPHAS data that have image source density peaks over 170,000 per field of view (500,000 deg-2). Our analysis demonstrates that horizontal table partitions of declination widths of one-degree control the query run times. Usage of an index strategy where the partitions are densely sorted according to source declination yields another improvement. Most queries run in sublinear time and a few (< 20%) run in linear time, because of dependencies on input source-list and result-set size. We observed that for this logical database partitioning schema the limiting cadence the pipeline achieved with processing IPHAS data is 25 s.
PROFESS: a PROtein Function, Evolution, Structure and Sequence database

PubMed Central

Triplet, Thomas; Shortridge, Matthew D.; Griep, Mark A.; Stark, Jaime L.; Powers, Robert; Revesz, Peter

2010-01-01

The proliferation of biological databases and the easy access enabled by the Internet is having a beneficial impact on biological sciences and transforming the way research is conducted. There are ∼1100 molecular biology databases dispersed throughout the Internet. To assist in the functional, structural and evolutionary analysis of the abundant number of novel proteins continually identified from whole-genome sequencing, we introduce the PROFESS (PROtein Function, Evolution, Structure and Sequence) database. Our database is designed to be versatile and expandable and will not confine analysis to a pre-existing set of data relationships. A fundamental component of this approach is the development of an intuitive query system that incorporates a variety of similarity functions capable of generating data relationships not conceived during the creation of the database. The utility of PROFESS is demonstrated by the analysis of the structural drift of homologous proteins and the identification of potential pancreatic cancer therapeutic targets based on the observation of protein–protein interaction networks. Database URL: http://cse.unl.edu/∼profess/ PMID:20624718
A fully traits-based approach to modeling global vegetation distribution.

PubMed

van Bodegom, Peter M; Douma, Jacob C; Verheijen, Lieneke M

2014-09-23

Dynamic Global Vegetation Models (DGVMs) are indispensable for our understanding of climate change impacts. The application of traits in DGVMs is increasingly refined. However, a comprehensive analysis of the direct impacts of trait variation on global vegetation distribution does not yet exist. Here, we present such analysis as proof of principle. We run regressions of trait observations for leaf mass per area, stem-specific density, and seed mass from a global database against multiple environmental drivers, making use of findings of global trait convergence. This analysis explained up to 52% of the global variation of traits. Global trait maps, generated by coupling the regression equations to gridded soil and climate maps, showed up to orders of magnitude variation in trait values. Subsequently, nine vegetation types were characterized by the trait combinations that they possess using Gaussian mixture density functions. The trait maps were input to these functions to determine global occurrence probabilities for each vegetation type. We prepared vegetation maps, assuming that the most probable (and thus, most suited) vegetation type at each location will be realized. This fully traits-based vegetation map predicted 42% of the observed vegetation distribution correctly. Our results indicate that a major proportion of the predictive ability of DGVMs with respect to vegetation distribution can be attained by three traits alone if traits like stem-specific density and seed mass are included. We envision that our traits-based approach, our observation-driven trait maps, and our vegetation maps may inspire a new generation of powerful traits-based DGVMs.
Background Noise Characteristics in the Western Part of Romania

NASA Astrophysics Data System (ADS)

Grecu, B.; Neagoe, C.; Tataru, D.; Stuart, G.

2012-04-01

The seismological database of the western part of Romania increased significantly during the last years, when 33 broadband seismic stations provided by SEIS-UK (10 CMG 40 T's - 30 s, 9 CMG 3T's - 120 s, 14 CMG 6T's - 30 s) were deployed in the western part of the country in July 2009 to operate autonomously for two years. These stations were installed within a joint project (South Carpathian Project - SCP) between University of Leeds, UK and National Institute for Earth Physics (NIEP), Romania that aimed at determining the lithospheric structure and geodynamical evolution of the South Carpathian Orogen. The characteristics of the background seismic noise recorded at the SCP broadband seismic network have been studied in order to identify the variations in background seismic noise as a function of time of day, season, and particular conditions at the stations. Power spectral densities (PSDs) and their corresponding probability density functions (PDFs) are used to characterize the background seismic noise. At high frequencies (> 1 Hz), seismic noise seems to have cultural origin, since notable variations between daytime and nighttime noise levels are observed at most of the stations. The seasonal variations are seen in the microseisms band. The noise levels increase during the winter and autumn months and decrease in summer and spring seasons, while the double-frequency peak shifts from lower periods in summer to longer periods in winter. The analysis of the probability density functions for stations located in different geologic conditions points out that the noise level is higher for stations sited on softer formations than those sited on hard rocks. Finally, the polarization analysis indicates that the main sources of secondary microseisms are found in the Mediterranean Sea and Atlantic Ocean.
Development of a database system for near-future climate change projections under the Japanese National Project SI-CAT

NASA Astrophysics Data System (ADS)

Nakagawa, Y.; Kawahara, S.; Araki, F.; Matsuoka, D.; Ishikawa, Y.; Fujita, M.; Sugimoto, S.; Okada, Y.; Kawazoe, S.; Watanabe, S.; Ishii, M.; Mizuta, R.; Murata, A.; Kawase, H.

2017-12-01

Analyses of large ensemble data are quite useful in order to produce probabilistic effect projection of climate change. Ensemble data of "+2K future climate simulations" are currently produced by Japanese national project "Social Implementation Program on Climate Change Adaptation Technology (SI-CAT)" as a part of a database for Policy Decision making for Future climate change (d4PDF; Mizuta et al. 2016) produced by Program for Risk Information on Climate Change. Those data consist of global warming simulations and regional downscaling simulations. Considering that those data volumes are too large (a few petabyte) to download to a local computer of users, a user-friendly system is required to search and download data which satisfy requests of the users. We develop "a database system for near-future climate change projections" for providing functions to find necessary data for the users under SI-CAT. The database system for near-future climate change projections mainly consists of a relational database, a data download function and user interface. The relational database using PostgreSQL is a key function among them. Temporally and spatially compressed data are registered on the relational database. As a first step, we develop the relational database for precipitation, temperature and track data of typhoon according to requests by SI-CAT members. The data download function using Open-source Project for a Network Data Access Protocol (OPeNDAP) provides a function to download temporally and spatially extracted data based on search results obtained by the relational database. We also develop the web-based user interface for using the relational database and the data download function. A prototype of the database system for near-future climate change projections are currently in operational test on our local server. The database system for near-future climate change projections will be released on Data Integration and Analysis System Program (DIAS) in fiscal year 2017. Techniques of the database system for near-future climate change projections might be quite useful for simulation and observational data in other research fields. We report current status of development and some case studies of the database system for near-future climate change projections.
Using GIS databases for simulated nightlight imagery

NASA Astrophysics Data System (ADS)

Zollweg, Joshua D.; Gartley, Michael; Roskovensky, John; Mercier, Jeffery

2012-06-01

Proposed is a new technique for simulating nighttime scenes with realistically-modelled urban radiance. While nightlight imagery is commonly used to measure urban sprawl,1 it is uncommon to use urbanization as metric to develop synthetic nighttime scenes. In the developed methodology, the open-source Open Street Map (OSM) Geographic Information System (GIS) database is used. The database is comprised of many nodes, which are used to dene the position of dierent types of streets, buildings, and other features. These nodes are the driver used to model urban nightlights, given several assumptions. The rst assumption is that the spatial distribution of nodes is closely related to the spatial distribution of nightlights. Work by Roychowdhury et al has demonstrated the relationship between urban lights and development. 2 So, the real assumption being made is that the density of nodes corresponds to development, which is reasonable. Secondly, the local density of nodes must relate directly to the upwelled radiance within the given locality. Testing these assumptions using Albuquerque and Indianapolis as example cities revealed that dierent types of nodes produce more realistic results than others. Residential street nodes oered the best performance for any single node type, among the types tested in this investigation. Other node types, however, still provide useful supplementary data. Using streets and buildings dened in the OSM database allowed automated generation of simulated nighttime scenes of Albuquerque and Indianapolis in the Digital Imaging and Remote Sensing Image Generation (DIRSIG) model. The simulation was compared to real data from the recently deployed National Polar-orbiting Operational Environmental Satellite System(NPOESS) Visible Infrared Imager Radiometer Suite (VIIRS) platform. As a result of the comparison, correction functions were used to correct for discrepancies between simulated and observed radiance. Future work will include investigating more advanced approaches for mapping the spatial extent of nightlights, based on the distribution of dierent node types in local neighbourhoods. This will allow the spectral prole of each region to be dynamically adjusted, in addition to simply modifying the magnitude of a single source type.

Modeling shape and topology of low-resolution density maps of biological macromolecules.

PubMed Central

De-Alarcón, Pedro A; Pascual-Montano, Alberto; Gupta, Amarnath; Carazo, Jose M

2002-01-01

In the present work we develop an efficient way of representing the geometry and topology of volumetric datasets of biological structures from medium to low resolution, aiming at storing and querying them in a database framework. We make use of a new vector quantization algorithm to select the points within the macromolecule that best approximate the probability density function of the original volume data. Connectivity among points is obtained with the use of the alpha shapes theory. This novel data representation has a number of interesting characteristics, such as 1) it allows us to automatically segment and quantify a number of important structural features from low-resolution maps, such as cavities and channels, opening the possibility of querying large collections of maps on the basis of these quantitative structural features; 2) it provides a compact representation in terms of size; 3) it contains a subset of three-dimensional points that optimally quantify the densities of medium resolution data; and 4) a general model of the geometry and topology of the macromolecule (as opposite to a spatially unrelated bunch of voxels) is easily obtained by the use of the alpha shapes theory. PMID:12124252
Reference Correlation for the Viscosity of Carbon Dioxide

NASA Astrophysics Data System (ADS)

Laesecke, Arno; Muzny, Chris D.

2017-03-01

A comprehensive database of experimental and computed data for the viscosity of carbon dioxide (CO2) was compiled and a new reference correlation was developed. Literature results based on an ab initio potential energy surface were the foundation of the correlation of the viscosity in the limit of zero density in the temperature range from 100 to 2000 K. Guided symbolic regression was employed to obtain a new functional form that extrapolates correctly to 0 and to 10 000 K. Coordinated measurements at low density made it possible to implement the temperature dependence of the Rainwater-Friend theory in the linear-in-density viscosity term. The residual viscosity could be formulated with a scaling term ργ/T, the significance of which was confirmed by symbolic regression. The final viscosity correlation covers temperatures from 100 to 2000 K for gaseous CO2 and from 220 to 700 K with pressures along the melting line up to 8000 MPa for compressed and supercritical liquid states. The data representation is more accurate than with the previous correlations, and the covered pressure and temperature range is significantly extended. The critical enhancement of the viscosity of CO2 is included in the new correlation.
The density, compressibility and seismic velocity of hydrous melts at crustal and upper mantle conditions

NASA Astrophysics Data System (ADS)

Ueki, K.; Iwamori, H.

2015-12-01

Various processes of subduction zone magmatism, such as upward migration of partial melts and fractional crystallization depend on the density of the hydrous silicate melt. The density and the compressibility of the hydrous melt are key factors for the thermodynamic calculation of phase relation of the hydrous melt, and the geophysical inversion to predict physicochemical conditions of the melting region based on the seismic velocity. This study presents a new model for the calculations of the density of the hydrous silicate melts as a function of T, P, H2O content and melt composition. The Birch-Murnaghan equation is used for the equation of state. We compile the experimentally determined densities of various hydrous melts, and optimize the partial molar volume, compressibility, thermal expansibility and its pressure derivative, and K' of the H2O component in the silicate melt. P-T ranges of the calibration database are 0.48-4.29 GPa and 1033-2073 K. As such, this model covers the P-T ranges of the entire melting region of the subduction zone. Parameter set provided by Lange and Carmichael [1990] is used for the partial molar volume and KT value of the anhydrous silicate melt. K' of anhydrous melt is newly parameterized as a function of SiO2 content. The new model accurately reproduces the experimentally determined density variations of various hydrous melts from basalt to rhyolite. Our result shows that the hydrous melt is more compressive and less dense than the anhydrous melt; with the 5 wt% of H2O in melt, density and KT decrease by ~10% and ~30% from those of the anhydrous melt, respectively. For the application of the model, we calculated the P-wave velocity of the hydrous melt. With the 5 wt% of H2O, P-wave velocity of the silicate melt decreases by >10%. Based on the melt P-wave velocity, we demonstrate the effect of the melt H2O content on the seismic velocity of the partially molten zone of the subduction zone.
Hardrock Elastic Physical Properties: Birch's Seismic Parameter Revisited

NASA Astrophysics Data System (ADS)

Wu, M.; Milkereit, B.

2014-12-01

Identifying rock composition and properties is imperative in a variety of fields including geotechnical engineering, mining, and petroleum exploration, in order to accurately make any petrophysical calculations. Density is, in particular, an important parameter that allows us to differentiate between lithologies and estimate or calculate other petrophysical properties. It is well established that compressional and shear wave velocities of common crystalline rocks increase with increasing densities (i.e. the Birch and Nafe-Drake relationships). Conventional empirical relations do not take into account S-wave velocity. Physical properties of Fe-oxides and massive sulfides, however, differ significantly from the empirical velocity-density relationships. Currently, acquiring in-situ density data is challenging and problematic, and therefore, developing an approximation for density based on seismic wave velocity and elastic moduli would be beneficial. With the goal of finding other possible or better relationships between density and the elastic moduli, a database of density, P-wave velocity, S-wave velocity, bulk modulus, shear modulus, Young's modulus, and Poisson's ratio was compiled based on a multitude of lab samples. The database is comprised of isotropic, non-porous metamorphic rock. Multi-parameter cross plots of the various elastic parameters have been analyzed in order to find a suitable parameter combination that reduces high density outliers. As expected, the P-wave velocity to S-wave velocity ratios show no correlation with density. However, Birch's seismic parameter, along with the bulk modulus, shows promise in providing a link between observed compressional and shear wave velocities and rock densities, including massive sulfides and Fe-oxides.
neXtA5: accelerating annotation of articles via automated approaches in neXtProt.

PubMed

Mottin, Luc; Gobeill, Julien; Pasche, Emilie; Michel, Pierre-André; Cusin, Isabelle; Gaudet, Pascale; Ruch, Patrick

2016-01-01

The rapid increase in the number of published articles poses a challenge for curated databases to remain up-to-date. To help the scientific community and database curators deal with this issue, we have developed an application, neXtA5, which prioritizes the literature for specific curation requirements. Our system, neXtA5, is a curation service composed of three main elements. The first component is a named-entity recognition module, which annotates MEDLINE over some predefined axes. This report focuses on three axes: Diseases, the Molecular Function and Biological Process sub-ontologies of the Gene Ontology (GO). The automatic annotations are then stored in a local database, BioMed, for each annotation axis. Additional entities such as species and chemical compounds are also identified. The second component is an existing search engine, which retrieves the most relevant MEDLINE records for any given query. The third component uses the content of BioMed to generate an axis-specific ranking, which takes into account the density of named-entities as stored in the Biomed database. The two ranked lists are ultimately merged using a linear combination, which has been specifically tuned to support the annotation of each axis. The fine-tuning of the coefficients is formally reported for each axis-driven search. Compared with PubMed, which is the system used by most curators, the improvement is the following: +231% for Diseases, +236% for Molecular Functions and +3153% for Biological Process when measuring the precision of the top-returned PMID (P0 or mean reciprocal rank). The current search methods significantly improve the search effectiveness of curators for three important curation axes. Further experiments are being performed to extend the curation types, in particular protein-protein interactions, which require specific relationship extraction capabilities. In parallel, user-friendly interfaces powered with a set of JSON web services are currently being implemented into the neXtProt annotation pipeline.Available on: http://babar.unige.ch:8082/neXtA5Database URL: http://babar.unige.ch:8082/neXtA5/fetcher.jsp. © The Author(s) 2016. Published by Oxford University Press.
neXtA5: accelerating annotation of articles via automated approaches in neXtProt

PubMed Central

Mottin, Luc; Gobeill, Julien; Pasche, Emilie; Michel, Pierre-André; Cusin, Isabelle; Gaudet, Pascale; Ruch, Patrick

2016-01-01

The rapid increase in the number of published articles poses a challenge for curated databases to remain up-to-date. To help the scientific community and database curators deal with this issue, we have developed an application, neXtA5, which prioritizes the literature for specific curation requirements. Our system, neXtA5, is a curation service composed of three main elements. The first component is a named-entity recognition module, which annotates MEDLINE over some predefined axes. This report focuses on three axes: Diseases, the Molecular Function and Biological Process sub-ontologies of the Gene Ontology (GO). The automatic annotations are then stored in a local database, BioMed, for each annotation axis. Additional entities such as species and chemical compounds are also identified. The second component is an existing search engine, which retrieves the most relevant MEDLINE records for any given query. The third component uses the content of BioMed to generate an axis-specific ranking, which takes into account the density of named-entities as stored in the Biomed database. The two ranked lists are ultimately merged using a linear combination, which has been specifically tuned to support the annotation of each axis. The fine-tuning of the coefficients is formally reported for each axis-driven search. Compared with PubMed, which is the system used by most curators, the improvement is the following: +231% for Diseases, +236% for Molecular Functions and +3153% for Biological Process when measuring the precision of the top-returned PMID (P0 or mean reciprocal rank). The current search methods significantly improve the search effectiveness of curators for three important curation axes. Further experiments are being performed to extend the curation types, in particular protein–protein interactions, which require specific relationship extraction capabilities. In parallel, user-friendly interfaces powered with a set of JSON web services are currently being implemented into the neXtProt annotation pipeline. Available on: http://babar.unige.ch:8082/neXtA5 Database URL: http://babar.unige.ch:8082/neXtA5/fetcher.jsp PMID:27374119
Primates in 21st century ecosystems: does primate conservation promote ecosystem conservation?

PubMed

Norconk, Marilyn A; Boinski, Sue; Forget, Pierre-Michel

2011-01-01

Contributors to this issue of the American Journal of Primatology were among the participants in an invited symposium at the 2008 Association for Tropical Biology and Conservation meeting in Paramaribo, Suriname. They were asked to assess how essential primates are to tropical ecosystems and, given their research interests, discuss how primate research contributes to the broader understanding about how ecosystems function. This introduction to the issue is divided into three parts: a review of the roles that nonhuman primates play in tropical ecosystems; the implementation of large-scale landscape methods used to identify primate densities; and concerns about the increasingly porous boundaries between humans, nonhuman primates, and pathogens. Although 20th century primate research created a rich database on individual species, including both theoretical and descriptive approaches, the dual effects of high human population densities and widespread habitat destruction should warn us that creative, interdisciplinary and human-related research is needed to solve 21st century problems. © 2010 Wiley-Liss, Inc.
Nonlinear associations between plasma cholesterol levels and neuropsychological function.

PubMed

Wendell, Carrington R; Zonderman, Alan B; Katzel, Leslie I; Rosenberger, William F; Plamadeala, Victoria V; Hosey, Megan M; Waldstein, Shari R

2016-11-01

Although both high and low levels of total and low-density lipoprotein (LDL) cholesterol have been associated with poor neuropsychological function, little research has examined nonlinear effects. We examined quadratic relations of cholesterol to performance on a comprehensive neuropsychological battery. Participants were 190 older adults (53% men, ages 54-83) free of major medical, neurologic, and psychiatric disease. Measures of fasting plasma total and high-density lipoprotein (HDL) cholesterol were assayed, and LDL cholesterol was calculated. Participants completed neuropsychological measures of attention, executive function, memory, visuospatial judgment, and manual speed and dexterity. Multiple regression analyses examined cholesterol levels as quadratic predictors of each measure of cognitive performance, with age (dichotomized as <70 vs. 70+) as an effect modifier. A significant quadratic effect of Total Cholesterol² × Age was identified for Logical Memory II ( b = -.0013, p = .039), such that the 70+ group performed best at high and low levels of total cholesterol than at midrange total cholesterol (U-shaped) and the <70 group performed worse at high and low levels of total cholesterol than at midrange total cholesterol (inverted U shape). Similarly, significant U- and J-shaped effects of LDL Cholesterol² × Age were identified for Visual Reproduction II ( b = -.0020, p = .026) and log of the Trail Making Test, Part B (b = .0001, p = .044). Quadratic associations between HDL cholesterol and cognitive performance were nonsignificant. Results indicate differential associations between cholesterol and neuropsychological function across different ages and domains of function. High and low total and LDL cholesterol may confer both risk and benefit for suboptimal cognitive function at different ages. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Average snowcover density values in Eastern Alps mountain

NASA Astrophysics Data System (ADS)

Valt, M.; Moro, D.

2009-04-01

The Italian Avalanche Warning Services monitor the snow cover characteristics through networks evenly distributed all over the alpine chain. Measurements of snow stratigraphy and density are very frequently performed with sampling rates of 1 -2 times per week. Snow cover density values are used to compute the dimensions of the building roofs as well as to design avalanche barriers. Based on the measured snow densities the Electricity Board can predict the amount of water resources deriving from snow melt in high relieves drainage basins. In this work it was possible to compute characteristic density values of the snow cover in the Eastern Alps using the information contained in the database from the ARPA (Agenzia Regionale Protezione Ambiente)-Centro Valanghe di Arabba, and Ufficio Valanghe- Udine. Among the other things, this database includes 15 years of stratigraphic measurements. More than 6,000 snow stratigraphic logs were analysed, in order to derive typical values as for geographical area, altitude, exposure, snow cover thickness and season. Computed values were compared to those established by the current Italian laws. Eventually, experts identified and evaluated the correlations between the seasonal variations of the average snow density and the variations related to the snowfall rate in the period 1994-2008 in the Eastern Alps mountain range
A WISE Measurement of the 2:4 mum Galaxy Luminosity Function and its Implications for the Extragalactic Background Light at 3:4 mum

NASA Astrophysics Data System (ADS)

Lake, Sean Earl

2017-05-01

The measurement of the the Extragalactic Background Light (EBL) has seen some controversy in recent works, with direct and indirect measures conflicting. Specifi- cally, upper limits based on analyzing the plausible opacity obscuring TeV spectra of blazars suggests that the density of radiation with wavelengths near 3.4 mum is onethirdtoonehalfasintenseasdirectmeasuresofthesame(forexample: Aharonian et al., 2006; Levenson et al., 2007; Matsumoto et al., 2005). The dominant contributor of the EBL at 3.4mum is expected to be ordinary starlight from relatively local, z < 1, galaxies, so an estimate of the amount of light emitted by galaxies based on the galaxy Luminosity Function (LF) should provide a useful lower limit to the EBL. While analyses of this sort have been done by others (Dominguez et al., 2011; Helgason et al., 2012), the full sky coverage of the AllWISE database has made it possible for us to improve the measurement of both the LF at 2.4 mum and the EBL using the large public spectroscopic redshift surveys. In order to do so, we had to develop a mathematical model for the measurement of a generalization of the LF, which is the density of galaxies per unit comoving volume per unit luminosity, to the Spectro-Luminosity Functional (SLF), which replaces the density per unit single luminosity, dL, with the density per luminosi- ii ties at all frequencies, DL nu. Our best combined analysis of the data yields present day Shechter Function LF parameters of: L⋆ = 6.4+/-[0.1 stat, 0.3sys]x1010 L2.4mum [solar mass](M⋆ = -21.67+/-[0.02 stat, 0.05sys] AB mag), φ⋆ = 5.8+/-[0.3stat, 0.3sys]x10 -3 Mpc-3, and alpha = -1.050 +/- [0.004stat, 0.03sys]; this implies a present day density of galaxies of 0.08 Mpc-3 brighter that 106 L2.4mum [solar mass] (10-3 Mpc-3 brighter than L⋆) and a luminosity density equivalent to 3.8 x 108 L2.4mum [solar mass] Mpc-3. The net EBL at 3.4mum that our synthesis model produces from galaxies closer than z = 5 is Inu = 9.0 +/- 0.5 kJy sr-1 (nuInu = 8.0 +/- 0.4 nW m-2 sr -1), largely in agreement with similar LF based estimates of the EBL.
Electron and Positron Stopping Powers of Materials

National Institute of Standards and Technology Data Gateway

SRD 7 NIST Electron and Positron Stopping Powers of Materials (PC database for purchase) The EPSTAR database provides rapid calculations of stopping powers (collisional, radiative, and total), CSDA ranges, radiation yields and density effect corrections for incident electrons or positrons with kinetic energies from 1 keV to 10 GeV, and for any chemically defined target material.
Functional and Database Architecture Design.

DTIC Science & Technology

1983-09-26

I AD-At3.N 275 FUNCTIONAL AND D ATABASE ARCHITECTURE DESIGN (U) ALPHA / OMEGA GROUP INC HARVARD MA 26 SEP 83 NODS 4-83-C 0525 UNCLASSIFIED FG52 N EE...0525 REPORT AOO1 FUNCTIONAL AND DATABASE ARCHITECTURE DESIGN Submitted to: Office of Naval Research Department of the Navy 800 N. Quincy Street...ZNTIS GRA& I DTIC TAB Unannounced 0 Justification REPORT ON Distribution/ Availability Codes Avail and/or FUNCTIONAL AND DATABASE ARCHITECTURE DESIGN Dist
Statistics for laminar flamelet modeling

NASA Technical Reports Server (NTRS)

Cant, R. S.; Rutland, C. J.; Trouve, A.

1990-01-01

Statistical information required to support modeling of turbulent premixed combustion by laminar flamelet methods is extracted from a database of the results of Direct Numerical Simulation of turbulent flames. The simulations were carried out previously by Rutland (1989) using a pseudo-spectral code on a three dimensional mesh of 128 points in each direction. One-step Arrhenius chemistry was employed together with small heat release. A framework for the interpretation of the data is provided by the Bray-Moss-Libby model for the mean turbulent reaction rate. Probability density functions are obtained over surfaces of the constant reaction progress variable for the tangential strain rate and the principal curvature. New insights are gained which will greatly aid the development of modeling approaches.
Developing High-resolution Soil Database for Regional Crop Modeling in East Africa

NASA Astrophysics Data System (ADS)

Han, E.; Ines, A. V. M.

2014-12-01

The most readily available soil data for regional crop modeling in Africa is the World Inventory of Soil Emission potentials (WISE) dataset, which has 1125 soil profiles for the world, but does not extensively cover countries Ethiopia, Kenya, Uganda and Tanzania in East Africa. Another dataset available is the HC27 (Harvest Choice by IFPRI) in a gridded format (10km) but composed of generic soil profiles based on only three criteria (texture, rooting depth, and organic carbon content). In this paper, we present a development and application of a high-resolution (1km), gridded soil database for regional crop modeling in East Africa. Basic soil information is extracted from Africa Soil Information Service (AfSIS), which provides essential soil properties (bulk density, soil organic carbon, soil PH and percentages of sand, silt and clay) for 6 different standardized soil layers (5, 15, 30, 60, 100 and 200 cm) in 1km resolution. Soil hydraulic properties (e.g., field capacity and wilting point) are derived from the AfSIS soil dataset using well-proven pedo-transfer functions and are customized for DSSAT-CSM soil data requirements. The crop model is used to evaluate crop yield forecasts using the new high resolution soil database and compared with WISE and HC27. In this paper we will present also the results of DSSAT loosely coupled with a hydrologic model (VIC) to assimilate root-zone soil moisture. Creating a grid-based soil database, which provides a consistent soil input for two different models (DSSAT and VIC) is a critical part of this work. The created soil database is expected to contribute to future applications of DSSAT crop simulation in East Africa where food security is highly vulnerable.
Arc_Mat: a Matlab-based spatial data analysis toolbox

NASA Astrophysics Data System (ADS)

Liu, Xingjian; Lesage, James

2010-03-01

This article presents an overview of Arc_Mat, a Matlab-based spatial data analysis software package whose source code has been placed in the public domain. An earlier version of the Arc_Mat toolbox was developed to extract map polygon and database information from ESRI shapefiles and provide high quality mapping in the Matlab software environment. We discuss revisions to the toolbox that: utilize enhanced computing and graphing capabilities of more recent versions of Matlab, restructure the toolbox with object-oriented programming features, and provide more comprehensive functions for spatial data analysis. The Arc_Mat toolbox functionality includes basic choropleth mapping; exploratory spatial data analysis that provides exploratory views of spatial data through various graphs, for example, histogram, Moran scatterplot, three-dimensional scatterplot, density distribution plot, and parallel coordinate plots; and more formal spatial data modeling that draws on the extensive Spatial Econometrics Toolbox functions. A brief review of the design aspects of the revised Arc_Mat is described, and we provide some illustrative examples that highlight representative uses of the toolbox. Finally, we discuss programming with and customizing the Arc_Mat toolbox functionalities.
CLEARPOND: Cross-Linguistic Easy-Access Resource for Phonological and Orthographic Neighborhood Densities

PubMed Central

Marian, Viorica; Bartolotti, James; Chabal, Sarah; Shook, Anthony

2012-01-01

Past research has demonstrated cross-linguistic, cross-modal, and task-dependent differences in neighborhood density effects, indicating a need to control for neighborhood variables when developing and interpreting research on language processing. The goals of the present paper are two-fold: (1) to introduce CLEARPOND (Cross-Linguistic Easy-Access Resource for Phonological and Orthographic Neighborhood Densities), a centralized database of phonological and orthographic neighborhood information, both within and between languages, for five commonly-studied languages: Dutch, English, French, German, and Spanish; and (2) to show how CLEARPOND can be used to compare general properties of phonological and orthographic neighborhoods across languages. CLEARPOND allows researchers to input a word or list of words and obtain phonological and orthographic neighbors, neighborhood densities, mean neighborhood frequencies, word lengths by number of phonemes and graphemes, and spoken-word frequencies. Neighbors can be defined by substitution, deletion, and/or addition, and the database can be queried separately along each metric or summed across all three. Neighborhood values can be obtained both within and across languages, and outputs can optionally be restricted to neighbors of higher frequency. To enable researchers to more quickly and easily develop stimuli, CLEARPOND can also be searched by features, generating lists of words that meet precise criteria, such as a specific range of neighborhood sizes, lexical frequencies, and/or word lengths. CLEARPOND is freely-available to researchers and the public as a searchable, online database and for download at http://clearpond.northwestern.edu. PMID:22916227
Systematic analysis of snake neurotoxins' functional classification using a data warehousing approach.

PubMed

Siew, Joyce Phui Yee; Khan, Asif M; Tan, Paul T J; Koh, Judice L Y; Seah, Seng Hong; Koo, Chuay Yeng; Chai, Siaw Ching; Armugam, Arunmozhiarasi; Brusic, Vladimir; Jeyaseelan, Kandiah

2004-12-12

Sequence annotations, functional and structural data on snake venom neurotoxins (svNTXs) are scattered across multiple databases and literature sources. Sequence annotations and structural data are available in the public molecular databases, while functional data are almost exclusively available in the published articles. There is a need for a specialized svNTXs database that contains NTX entries, which are organized, well annotated and classified in a systematic manner. We have systematically analyzed svNTXs and classified them using structure-function groups based on their structural, functional and phylogenetic properties. Using conserved motifs in each phylogenetic group, we built an intelligent module for the prediction of structural and functional properties of unknown NTXs. We also developed an annotation tool to aid the functional prediction of newly identified NTXs as an additional resource for the venom research community. We created a searchable online database of NTX proteins sequences (http://research.i2r.a-star.edu.sg/Templar/DB/snake_neurotoxin). This database can also be found under Swiss-Prot Toxin Annotation Project website (http://www.expasy.org/sprot/).
In-vehicle signing concepts: An analytical precursor to an in-vehicle information system

DOE Office of Scientific and Technical Information (OSTI.GOV)

Spelt, P.F.; Tufano, D.R.; Knee, H.E.

The purpose of the project described in this report is to develop alternative In-Vehicle Signing (IVS) system concepts based on allocation of the functions associated with driving a road vehicle. In the driving milieu, tasks can be assigned to one of three agents, the driver, the vehicle or the infrastructure. Assignment of tasks is based on a philosophy of function allocation which can emphasize any of several philosophical approaches. In this project, function allocations were made according to the current practice in vehicle design and signage as well as a human-centered strategy. Several IVS system concepts are presented based onmore » differing functional allocation outcomes. A design space for IVS systems is described, and a technical analysis of a map-based and sever beacon-based IVS systems are presented. Because of problems associated with both map-based and beacon-based concepts, a hybrid IVS concept was proposed. The hybrid system uses on-board map-based databases to serve those areas in which signage can be anticipated to be relatively static, such as large metropolitan areas where few if any new roads will be built. For areas where sign density is low, and/or where population growth causes changes in traffic flow, beacon-based concepts function best. For this situation, changes need only occur in the central database from which sign information is transmitted. This report presents system concepts which enable progress from the IVS system concept-independent functional requirements to a more specific set of system concepts which facilitate analysis and selection of hardware and software to perform the functions of IVS. As such, this phase of the project represents a major step toward the design and development of a prototype WS system. Once such a system is developed, a program of testing, evaluation, an revision will be undertaken. Ultimately, such a system can become part of the road vehicle of the future.« less
Predictions of new AB O3 perovskite compounds by combining machine learning and density functional theory

NASA Astrophysics Data System (ADS)

Balachandran, Prasanna V.; Emery, Antoine A.; Gubernatis, James E.; Lookman, Turab; Wolverton, Chris; Zunger, Alex

2018-04-01

We apply machine learning (ML) methods to a database of 390 experimentally reported A B O3 compounds to construct two statistical models that predict possible new perovskite materials and possible new cubic perovskites. The first ML model classified the 390 compounds into 254 perovskites and 136 that are not perovskites with a 90% average cross-validation (CV) accuracy; the second ML model further classified the perovskites into 22 known cubic perovskites and 232 known noncubic perovskites with a 94% average CV accuracy. We find that the most effective chemical descriptors affecting our classification include largely geometric constructs such as the A and B Shannon ionic radii, the tolerance and octahedral factors, the A -O and B -O bond length, and the A and B Villars' Mendeleev numbers. We then construct an additional list of 625 A B O3 compounds assembled from charge conserving combinations of A and B atoms absent from our list of known compounds. Then, using the two ML models constructed on the known compounds, we predict that 235 of the 625 exist in a perovskite structure with a confidence greater than 50% and among them that 20 exist in the cubic structure (albeit, the latter with only ˜50 % confidence). We find that the new perovskites are most likely to occur when the A and B atoms are a lanthanide or actinide, when the A atom is an alkali, alkali earth, or late transition metal atom, or when the B atom is a p -block atom. We also compare the ML findings with the density functional theory calculations and convex hull analyses in the Open Quantum Materials Database (OQMD), which predicts the T =0 K ground-state stability of all the A B O3 compounds. We find that OQMD predicts 186 of 254 of the perovskites in the experimental database to be thermodynamically stable within 100 meV/atom of the convex hull and predicts 87 of the 235 ML-predicted perovskite compounds to be thermodynamically stable within 100 meV/atom of the convex hull, including 6 of these to be in cubic structures. We suggest these 87 as the most promising candidates for future experimental synthesis of novel perovskites.
Electronic coupling through natural amino acids.

PubMed

Berstis, Laura; Beckham, Gregg T; Crowley, Michael F

2015-12-14

Myriad scientific domains concern themselves with biological electron transfer (ET) events that span across vast scales of rate and efficiency through a remarkably fine-tuned integration of amino acid (AA) sequences, electronic structure, dynamics, and environment interactions. Within this intricate scheme, many questions persist as to how proteins modulate electron-tunneling properties. To help elucidate these principles, we develop a model set of peptides representing the common α-helix and β-strand motifs including all natural AAs within implicit protein-environment solvation. Using an effective Hamiltonian strategy with density functional theory, we characterize the electronic coupling through these peptides, furthermore considering side-chain dynamics. For both motifs, predictions consistently show that backbone-mediated electronic coupling is distinctly sensitive to AA type (aliphatic, polar, aromatic, negatively charged and positively charged), and to side-chain orientation. The unique properties of these residues may be employed to design activated, deactivated, or switch-like superexchange pathways. Electronic structure calculations and Green's function analyses indicate that localized shifts in the electron density along the peptide play a role in modulating these pathways, and further substantiate the experimentally observed behavior of proline residues as superbridges. The distinct sensitivities of tunneling pathways to sequence and conformation revealed in this electronic coupling database help improve our fundamental understanding of the broad diversity of ET reactivity and provide guiding principles for peptide design.

Theoretical study of the ammonia nitridation rate on an Fe (100) surface: a combined density functional theory and kinetic Monte Carlo study.

PubMed

Yeo, Sang Chul; Lo, Yu Chieh; Li, Ju; Lee, Hyuck Mo

2014-10-07

Ammonia (NH3) nitridation on an Fe surface was studied by combining density functional theory (DFT) and kinetic Monte Carlo (kMC) calculations. A DFT calculation was performed to obtain the energy barriers (Eb) of the relevant elementary processes. The full mechanism of the exact reaction path was divided into five steps (adsorption, dissociation, surface migration, penetration, and diffusion) on an Fe (100) surface pre-covered with nitrogen. The energy barrier (Eb) depended on the N surface coverage. The DFT results were subsequently employed as a database for the kMC simulations. We then evaluated the NH3 nitridation rate on the N pre-covered Fe surface. To determine the conditions necessary for a rapid NH3 nitridation rate, the eight reaction events were considered in the kMC simulations: adsorption, desorption, dissociation, reverse dissociation, surface migration, penetration, reverse penetration, and diffusion. This study provides a real-time-scale simulation of NH3 nitridation influenced by nitrogen surface coverage that allowed us to theoretically determine a nitrogen coverage (0.56 ML) suitable for rapid NH3 nitridation. In this way, we were able to reveal the coverage dependence of the nitridation reaction using the combined DFT and kMC simulations.
Ground-state densities from the Rayleigh-Ritz variation principle and from density-functional theory.

PubMed

Kvaal, Simen; Helgaker, Trygve

2015-11-14

The relationship between the densities of ground-state wave functions (i.e., the minimizers of the Rayleigh-Ritz variation principle) and the ground-state densities in density-functional theory (i.e., the minimizers of the Hohenberg-Kohn variation principle) is studied within the framework of convex conjugation, in a generic setting covering molecular systems, solid-state systems, and more. Having introduced admissible density functionals as functionals that produce the exact ground-state energy for a given external potential by minimizing over densities in the Hohenberg-Kohn variation principle, necessary and sufficient conditions on such functionals are established to ensure that the Rayleigh-Ritz ground-state densities and the Hohenberg-Kohn ground-state densities are identical. We apply the results to molecular systems in the Born-Oppenheimer approximation. For any given potential v ∈ L(3/2)(ℝ(3)) + L(∞)(ℝ(3)), we establish a one-to-one correspondence between the mixed ground-state densities of the Rayleigh-Ritz variation principle and the mixed ground-state densities of the Hohenberg-Kohn variation principle when the Lieb density-matrix constrained-search universal density functional is taken as the admissible functional. A similar one-to-one correspondence is established between the pure ground-state densities of the Rayleigh-Ritz variation principle and the pure ground-state densities obtained using the Hohenberg-Kohn variation principle with the Levy-Lieb pure-state constrained-search functional. In other words, all physical ground-state densities (pure or mixed) are recovered with these functionals and no false densities (i.e., minimizing densities that are not physical) exist. The importance of topology (i.e., choice of Banach space of densities and potentials) is emphasized and illustrated. The relevance of these results for current-density-functional theory is examined.
47 CFR 15.713 - TV bands database.

Code of Federal Regulations, 2010 CFR

2010-10-01

... 47 Telecommunication 1 2010-10-01 2010-10-01 false TV bands database. 15.713 Section 15.713... TV bands database. (a) Purpose. The TV bands database serves the following functions: (1) To... databases. (b) Information in the TV bands database. (1) Facilities already recorded in Commission databases...
Development of spatial density maps based on geoprocessing web services: application to tuberculosis incidence in Barcelona, Spain.

PubMed

Dominkovics, Pau; Granell, Carlos; Pérez-Navarro, Antoni; Casals, Martí; Orcau, Angels; Caylà, Joan A

2011-11-29

Health professionals and authorities strive to cope with heterogeneous data, services, and statistical models to support decision making on public health. Sophisticated analysis and distributed processing capabilities over geocoded epidemiological data are seen as driving factors to speed up control and decision making in these health risk situations. In this context, recent Web technologies and standards-based web services deployed on geospatial information infrastructures have rapidly become an efficient way to access, share, process, and visualize geocoded health-related information. Data used on this study is based on Tuberculosis (TB) cases registered in Barcelona city during 2009. Residential addresses are geocoded and loaded into a spatial database that acts as a backend database. The web-based application architecture and geoprocessing web services are designed according to the Representational State Transfer (REST) principles. These web processing services produce spatial density maps against the backend database. The results are focused on the use of the proposed web-based application to the analysis of TB cases in Barcelona. The application produces spatial density maps to ease the monitoring and decision making process by health professionals. We also include a discussion of how spatial density maps may be useful for health practitioners in such contexts. In this paper, we developed web-based client application and a set of geoprocessing web services to support specific health-spatial requirements. Spatial density maps of TB incidence were generated to help health professionals in analysis and decision-making tasks. The combined use of geographic information tools, map viewers, and geoprocessing services leads to interesting possibilities in handling health data in a spatial manner. In particular, the use of spatial density maps has been effective to identify the most affected areas and its spatial impact. This study is an attempt to demonstrate how web processing services together with web-based mapping capabilities suit the needs of health practitioners in epidemiological analysis scenarios.
Development of spatial density maps based on geoprocessing web services: application to tuberculosis incidence in Barcelona, Spain

PubMed Central

2011-01-01

Background Health professionals and authorities strive to cope with heterogeneous data, services, and statistical models to support decision making on public health. Sophisticated analysis and distributed processing capabilities over geocoded epidemiological data are seen as driving factors to speed up control and decision making in these health risk situations. In this context, recent Web technologies and standards-based web services deployed on geospatial information infrastructures have rapidly become an efficient way to access, share, process, and visualize geocoded health-related information. Methods Data used on this study is based on Tuberculosis (TB) cases registered in Barcelona city during 2009. Residential addresses are geocoded and loaded into a spatial database that acts as a backend database. The web-based application architecture and geoprocessing web services are designed according to the Representational State Transfer (REST) principles. These web processing services produce spatial density maps against the backend database. Results The results are focused on the use of the proposed web-based application to the analysis of TB cases in Barcelona. The application produces spatial density maps to ease the monitoring and decision making process by health professionals. We also include a discussion of how spatial density maps may be useful for health practitioners in such contexts. Conclusions In this paper, we developed web-based client application and a set of geoprocessing web services to support specific health-spatial requirements. Spatial density maps of TB incidence were generated to help health professionals in analysis and decision-making tasks. The combined use of geographic information tools, map viewers, and geoprocessing services leads to interesting possibilities in handling health data in a spatial manner. In particular, the use of spatial density maps has been effective to identify the most affected areas and its spatial impact. This study is an attempt to demonstrate how web processing services together with web-based mapping capabilities suit the needs of health practitioners in epidemiological analysis scenarios. PMID:22126392
Building a database for statistical characterization of ELMs on DIII-D

NASA Astrophysics Data System (ADS)

Fritch, B. J.; Marinoni, A.; Bortolon, A.

2017-10-01

Edge localized modes (ELMs) are bursty instabilities which occur in the edge region of H-mode plasmas and have the potential to damage in-vessel components of future fusion machines by exposing the divertor region to large energy and particle fluxes during each ELM event. While most ELM studies focus on average quantities (e.g. energy loss per ELM), this work investigates the statistical distributions of ELM characteristics, as a function of plasma parameters. A semi-automatic algorithm is being used to create a database documenting trigger times of the tens of thousands of ELMs for DIII-D discharges in scenarios relevant to ITER, thus allowing statistically significant analysis. Probability distributions of inter-ELM periods and energy losses will be determined and related to relevant plasma parameters such as density, stored energy, and current in order to constrain models and improve estimates of the expected inter-ELM periods and sizes, both of which must be controlled in future reactors. Work supported in part by US DoE under the Science Undergraduate Laboratory Internships (SULI) program, DE-FC02-04ER54698 and DE-FG02- 94ER54235.
Data-based diffraction kernels for surface waves from convolution and correlation processes through active seismic interferometry

NASA Astrophysics Data System (ADS)

Chmiel, Malgorzata; Roux, Philippe; Herrmann, Philippe; Rondeleux, Baptiste; Wathelet, Marc

2018-05-01

We investigated the construction of diffraction kernels for surface waves using two-point convolution and/or correlation from land active seismic data recorded in the context of exploration geophysics. The high density of controlled sources and receivers, combined with the application of the reciprocity principle, allows us to retrieve two-dimensional phase-oscillation diffraction kernels (DKs) of surface waves between any two source or receiver points in the medium at each frequency (up to 15 Hz, at least). These DKs are purely data-based as no model calculations and no synthetic data are needed. They naturally emerge from the interference patterns of the recorded wavefields projected on the dense array of sources and/or receivers. The DKs are used to obtain multi-mode dispersion relations of Rayleigh waves, from which near-surface shear velocity can be extracted. Using convolution versus correlation with a grid of active sources is an important step in understanding the physics of the retrieval of surface wave Green's functions. This provides the foundation for future studies based on noise sources or active sources with a sparse spatial distribution.
An overview of multidisciplinary research resources at the Osaka University Center for Twin Research.

PubMed

Hayakawa, Kazuo; Iwatani, Yoshinori

2013-02-01

Osaka University Center for Twin Research is currently organizing a government-funded, multidisciplinary research project using a large registry of aged twins living in Japan. The purpose of the project is to collect various information as well as biological resources from registered twins, and to establish a biobank and databases for preserving and managing these data and resources. The Center is collecting data from twin pairs, both of whom have agreed to participate in a one-day comprehensive medical examination. The following data are being collected: physical data (e.g., height, body mass, blood pressure, theoretical visceral fat, pulse wave velocity, and bone density), data regarding epidemiology (e.g., medical history, lifestyle, quality of life, mood status, cognitive function, and nutrition), electrocardiogram, ultrasonography (carotid artery and thyroid), dentistry, plastic surgery, positron emission tomography, magnetoencephalogram, and magnetic resonance imaging of brain. These data are then aggregated and systematically stored in specific databases. In addition, peripheral blood is obtained from the participants, and then genomic DNA is purified and sera are stored. A wide variety of studies are ongoing, and more are in the planning stage.
Evaluation and comparison of classical interatomic potentials through a user-friendly interactive web-interface

NASA Astrophysics Data System (ADS)

Choudhary, Kamal; Congo, Faical Yannick P.; Liang, Tao; Becker, Chandler; Hennig, Richard G.; Tavazza, Francesca

2017-01-01

Classical empirical potentials/force-fields (FF) provide atomistic insights into material phenomena through molecular dynamics and Monte Carlo simulations. Despite their wide applicability, a systematic evaluation of materials properties using such potentials and, especially, an easy-to-use user-interface for their comparison is still lacking. To address this deficiency, we computed energetics and elastic properties of variety of materials such as metals and ceramics using a wide range of empirical potentials and compared them to density functional theory (DFT) as well as to experimental data, where available. The database currently consists of 3248 entries including energetics and elastic property calculations, and it is still increasing. We also include computational tools for convex-hull plots for DFT and FF calculations. The data covers 1471 materials and 116 force-fields. In addition, both the complete database and the software coding used in the process have been released for public use online (presently at http://www.ctcms.nist.gov/˜knc6/periodic.html) in a user-friendly way designed to enable further material design and discovery.
Designing Semiconductor Heterostructures Using Digitally Accessible Electronic-Structure Data

NASA Astrophysics Data System (ADS)

Shapera, Ethan; Schleife, Andre

Semiconductor sandwich structures, so-called heterojunctions, are at the heart of modern applications with tremendous societal impact: Light-emitting diodes shape the future of lighting and solar cells are promising for renewable energy. However, their computer-based design is hampered by the high cost of electronic structure techniques used to select materials based on alignment of valence and conduction bands and to evaluate excited state properties. We describe, validate, and demonstrate an open source Python framework which rapidly screens existing online databases and user-provided data to find combinations of suitable, previously fabricated materials for optoelectronic applications. The branch point energy aligns valence and conduction bands of different materials, requiring only the bulk density functional theory band structure. We train machine learning algorithms to predict the dielectric constant, electron mobility, and hole mobility with material descriptors available in online databases. Using CdSe and InP as emitting layers for LEDs and CH3NH3PbI3 and nanoparticle PbS as absorbers for solar cells, we demonstrate our broadly applicable, automated method.
Evaluation and comparison of classical interatomic potentials through a user-friendly interactive web-interface

PubMed Central

Choudhary, Kamal; Congo, Faical Yannick P.; Liang, Tao; Becker, Chandler; Hennig, Richard G.; Tavazza, Francesca

2017-01-01

Classical empirical potentials/force-fields (FF) provide atomistic insights into material phenomena through molecular dynamics and Monte Carlo simulations. Despite their wide applicability, a systematic evaluation of materials properties using such potentials and, especially, an easy-to-use user-interface for their comparison is still lacking. To address this deficiency, we computed energetics and elastic properties of variety of materials such as metals and ceramics using a wide range of empirical potentials and compared them to density functional theory (DFT) as well as to experimental data, where available. The database currently consists of 3248 entries including energetics and elastic property calculations, and it is still increasing. We also include computational tools for convex-hull plots for DFT and FF calculations. The data covers 1471 materials and 116 force-fields. In addition, both the complete database and the software coding used in the process have been released for public use online (presently at http://www.ctcms.nist.gov/∼knc6/periodic.html) in a user-friendly way designed to enable further material design and discovery. PMID:28140407
PvTFDB: a Phaseolus vulgaris transcription factors database for expediting functional genomics in legumes

PubMed Central

Bhawna; Bonthala, V.S.; Gajula, MNV Prasad

2016-01-01

The common bean [Phaseolus vulgaris (L.)] is one of the essential proteinaceous vegetables grown in developing countries. However, its production is challenged by low yields caused by numerous biotic and abiotic stress conditions. Regulatory transcription factors (TFs) symbolize a key component of the genome and are the most significant targets for producing stress tolerant crop and hence functional genomic studies of these TFs are important. Therefore, here we have constructed a web-accessible TFs database for P. vulgaris, called PvTFDB, which contains 2370 putative TF gene models in 49 TF families. This database provides a comprehensive information for each of the identified TF that includes sequence data, functional annotation, SSRs with their primer sets, protein physical properties, chromosomal location, phylogeny, tissue-specific gene expression data, orthologues, cis-regulatory elements and gene ontology (GO) assignment. Altogether, this information would be used in expediting the functional genomic studies of a specific TF(s) of interest. The objectives of this database are to understand functional genomics study of common bean TFs and recognize the regulatory mechanisms underlying various stress responses to ease breeding strategy for variety production through a couple of search interfaces including gene ID, functional annotation and browsing interfaces including by family and by chromosome. This database will also serve as a promising central repository for researchers as well as breeders who are working towards crop improvement of legume crops. In addition, this database provide the user unrestricted public access and the user can download entire data present in the database freely. Database URL: http://www.multiomics.in/PvTFDB/ PMID:27465131
Web-based access to near real-time and archived high-density time-series data: cyber infrastructure challenges & developments in the open-source Waveform Server

NASA Astrophysics Data System (ADS)

Reyes, J. C.; Vernon, F. L.; Newman, R. L.; Steidl, J. H.

2010-12-01

The Waveform Server is an interactive web-based interface to multi-station, multi-sensor and multi-channel high-density time-series data stored in Center for Seismic Studies (CSS) 3.0 schema relational databases (Newman et al., 2009). In the last twelve months, based on expanded specifications and current user feedback, both the server-side infrastructure and client-side interface have been extensively rewritten. The Python Twisted server-side code-base has been fundamentally modified to now present waveform data stored in cluster-based databases using a multi-threaded architecture, in addition to supporting the pre-existing single database model. This allows interactive web-based access to high-density (broadband @ 40Hz to strong motion @ 200Hz) waveform data that can span multiple years; the common lifetime of broadband seismic networks. The client-side interface expands on it's use of simple JSON-based AJAX queries to now incorporate a variety of User Interface (UI) improvements including standardized calendars for defining time ranges, applying on-the-fly data calibration to display SI-unit data, and increased rendering speed. This presentation will outline the various cyber infrastructure challenges we have faced while developing this application, the use-cases currently in existence, and the limitations of web-based application development.
Application of kernel functions for accurate similarity search in large chemical databases.

PubMed

Wang, Xiaohong; Huan, Jun; Smalter, Aaron; Lushington, Gerald H

2010-04-29

Similarity search in chemical structure databases is an important problem with many applications in chemical genomics, drug design, and efficient chemical probe screening among others. It is widely believed that structure based methods provide an efficient way to do the query. Recently various graph kernel functions have been designed to capture the intrinsic similarity of graphs. Though successful in constructing accurate predictive and classification models, graph kernel functions can not be applied to large chemical compound database due to the high computational complexity and the difficulties in indexing similarity search for large databases. To bridge graph kernel function and similarity search in chemical databases, we applied a novel kernel-based similarity measurement, developed in our team, to measure similarity of graph represented chemicals. In our method, we utilize a hash table to support new graph kernel function definition, efficient storage and fast search. We have applied our method, named G-hash, to large chemical databases. Our results show that the G-hash method achieves state-of-the-art performance for k-nearest neighbor (k-NN) classification. Moreover, the similarity measurement and the index structure is scalable to large chemical databases with smaller indexing size, and faster query processing time as compared to state-of-the-art indexing methods such as Daylight fingerprints, C-tree and GraphGrep. Efficient similarity query processing method for large chemical databases is challenging since we need to balance running time efficiency and similarity search accuracy. Our previous similarity search method, G-hash, provides a new way to perform similarity search in chemical databases. Experimental study validates the utility of G-hash in chemical databases.
The Protein Information Resource: an integrated public resource of functional annotation of proteins

PubMed Central

Wu, Cathy H.; Huang, Hongzhan; Arminski, Leslie; Castro-Alvear, Jorge; Chen, Yongxing; Hu, Zhang-Zhi; Ledley, Robert S.; Lewis, Kali C.; Mewes, Hans-Werner; Orcutt, Bruce C.; Suzek, Baris E.; Tsugita, Akira; Vinayaka, C. R.; Yeh, Lai-Su L.; Zhang, Jian; Barker, Winona C.

2002-01-01

The Protein Information Resource (PIR) serves as an integrated public resource of functional annotation of protein data to support genomic/proteomic research and scientific discovery. The PIR, in collaboration with the Munich Information Center for Protein Sequences (MIPS) and the Japan International Protein Information Database (JIPID), produces the PIR-International Protein Sequence Database (PSD), the major annotated protein sequence database in the public domain, containing about 250 000 proteins. To improve protein annotation and the coverage of experimentally validated data, a bibliography submission system is developed for scientists to submit, categorize and retrieve literature information. Comprehensive protein information is available from iProClass, which includes family classification at the superfamily, domain and motif levels, structural and functional features of proteins, as well as cross-references to over 40 biological databases. To provide timely and comprehensive protein data with source attribution, we have introduced a non-redundant reference protein database, PIR-NREF. The database consists of about 800 000 proteins collected from PIR-PSD, SWISS-PROT, TrEMBL, GenPept, RefSeq and PDB, with composite protein names and literature data. To promote database interoperability, we provide XML data distribution and open database schema, and adopt common ontologies. The PIR web site (http://pir.georgetown.edu/) features data mining and sequence analysis tools for information retrieval and functional identification of proteins based on both sequence and annotation information. The PIR databases and other files are also available by FTP (ftp://nbrfa.georgetown.edu/pir_databases). PMID:11752247
SNPdbe: constructing an nsSNP functional impacts database.

PubMed

Schaefer, Christian; Meier, Alice; Rost, Burkhard; Bromberg, Yana

2012-02-15

Many existing databases annotate experimentally characterized single nucleotide polymorphisms (SNPs). Each non-synonymous SNP (nsSNP) changes one amino acid in the gene product (single amino acid substitution;SAAS). This change can either affect protein function or be neutral in that respect. Most polymorphisms lack experimental annotation of their functional impact. Here, we introduce SNPdbe-SNP database of effects, with predictions of computationally annotated functional impacts of SNPs. Database entries represent nsSNPs in dbSNP and 1000 Genomes collection, as well as variants from UniProt and PMD. SAASs come from >2600 organisms; 'human' being the most prevalent. The impact of each SAAS on protein function is predicted using the SNAP and SIFT algorithms and augmented with experimentally derived function/structure information and disease associations from PMD, OMIM and UniProt. SNPdbe is consistently updated and easily augmented with new sources of information. The database is available as an MySQL dump and via a web front end that allows searches with any combination of organism names, sequences and mutation IDs. http://www.rostlab.org/services/snpdbe.
ITS-90 Thermocouple Database

National Institute of Standards and Technology Data Gateway

SRD 60 NIST ITS-90 Thermocouple Database (Web, free access) Web version of Standard Reference Database 60 and NIST Monograph 175. The database gives temperature -- electromotive force (emf) reference functions and tables for the letter-designated thermocouple types B, E, J, K, N, R, S and T. These reference functions have been adopted as standards by the American Society for Testing and Materials (ASTM) and the International Electrotechnical Commission (IEC).
Spin-Multiplet Components and Energy Splittings by Multistate Density Functional Theory.

PubMed

Grofe, Adam; Chen, Xin; Liu, Wenjian; Gao, Jiali

2017-10-05

Kohn-Sham density functional theory has been tremendously successful in chemistry and physics. Yet, it is unable to describe the energy degeneracy of spin-multiplet components with any approximate functional. This work features two contributions. (1) We present a multistate density functional theory (MSDFT) to represent spin-multiplet components and to determine multiplet energies. MSDFT is a hybrid approach, taking advantage of both wave function theory and density functional theory. Thus, the wave functions, electron densities and energy density-functionals for ground and excited states and for different components are treated on the same footing. The method is illustrated on valence excitations of atoms and molecules. (2) Importantly, a key result is that for cases in which the high-spin components can be determined separately by Kohn-Sham density functional theory, the transition density functional in MSDFT (which describes electronic coupling) can be defined rigorously. The numerical results may be explored to design and optimize transition density functionals for configuration coupling in multiconfigurational DFT.
High energy-density liquid rocket fuel performance

NASA Technical Reports Server (NTRS)

Rapp, Douglas C.

1990-01-01

A fuel performance database of liquid hydrocarbons and aluminum-hydrocarbon fuels was compiled using engine parametrics from the Space Transportation Engine Program as a baseline. Propellant performance parameters are introduced. General hydrocarbon fuel performance trends are discussed with respect to hydrogen-to-carbon ratio and heat of formation. Aluminum-hydrocarbon fuel performance is discussed with respect to aluminum metal loading. Hydrocarbon and aluminum-hydrocarbon fuel performance is presented with respect to fuel density, specific impulse, and propellant density specific impulse.
Hydrogen-bond landscapes, geometry and energetics of squaric acid and its mono- and dianions: a Cambridge Structural Database, IsoStar and computational study.

PubMed

Allen, Frank H; Cruz-Cabeza, Aurora J; Wood, Peter A; Bardwell, David A

2013-10-01

As part of a programme of work to extend central-group coverage in the Cambridge Crystallographic Data Centre's (CCDC) IsoStar knowledge base of intermolecular interactions, we have studied the hydrogen-bonding abilities of squaric acid (H2SQ) and its mono- and dianions (HSQ(-) and SQ(2-)) using the Cambridge Structural Database (CSD) along with dispersion-corrected density functional theory (DFT-D) calculations for a range of hydrogen-bonded dimers. The -OH and -C=O groups of H2SQ, HSQ(-) and SQ(2-) are potent donors and acceptors, as indicated by their hydrogen-bond geometries in available crystal structures in the CSD, and by the attractive energies calculated for their dimers with acetone and methanol, which were used as model acceptors and donors. The two anions have sufficient examples in the CSD for their addition as new central groups in IsoStar. It is also shown that charge- and resonance-assisted hydrogen bonds involving H2SQ and HSQ(-) are similar in strength to those made by carboxylate COO(-) acceptors, while hydrogen bonds made by the dianion SQ(2-) are somewhat stronger. The study reinforces the value of squaric acid and its anions as cocrystal formers and their actual and potential importance as isosteric replacements for carboxylic acid and carboxylate functions.

Leaf and stem economics spectra drive diversity of functional plant traits in a dynamic global vegetation model.

PubMed

Sakschewski, Boris; von Bloh, Werner; Boit, Alice; Rammig, Anja; Kattge, Jens; Poorter, Lourens; Peñuelas, Josep; Thonicke, Kirsten

2015-01-22

Functional diversity is critical for ecosystem dynamics, stability and productivity. However, dynamic global vegetation models (DGVMs) which are increasingly used to simulate ecosystem functions under global change, condense functional diversity to plant functional types (PFTs) with constant parameters. Here, we develop an individual- and trait-based version of the DGVM LPJmL (Lund-Potsdam-Jena managed Land) called LPJmL- flexible individual traits (LPJmL-FIT) with flexible individual traits) which we apply to generate plant trait maps for the Amazon basin. LPJmL-FIT incorporates empirical ranges of five traits of tropical trees extracted from the TRY global plant trait database, namely specific leaf area (SLA), leaf longevity (LL), leaf nitrogen content (N area ), the maximum carboxylation rate of Rubisco per leaf area (vcmaxarea), and wood density (WD). To scale the individual growth performance of trees, the leaf traits are linked by trade-offs based on the leaf economics spectrum, whereas wood density is linked to tree mortality. No preselection of growth strategies is taking place, because individuals with unique trait combinations are uniformly distributed at tree establishment. We validate the modeled trait distributions by empirical trait data and the modeled biomass by a remote sensing product along a climatic gradient. Including trait variability and trade-offs successfully predicts natural trait distributions and achieves a more realistic representation of functional diversity at the local to regional scale. As sites of high climatic variability, the fringes of the Amazon promote trait divergence and the coexistence of multiple tree growth strategies, while lower plant trait diversity is found in the species-rich center of the region with relatively low climatic variability. LPJmL-FIT enables to test hypotheses on the effects of functional biodiversity on ecosystem functioning and to apply the DGVM to current challenges in ecosystem management from local to global scales, that is, deforestation and climate change effects. © 2015 John Wiley & Sons Ltd.
SinEx DB: a database for single exon coding sequences in mammalian genomes.

PubMed

Jorquera, Roddy; Ortiz, Rodrigo; Ossandon, F; Cárdenas, Juan Pablo; Sepúlveda, Rene; González, Carolina; Holmes, David S

2016-01-01

Eukaryotic genes are typically interrupted by intragenic, noncoding sequences termed introns. However, some genes lack introns in their coding sequence (CDS) and are generally known as 'single exon genes' (SEGs). In this work, a SEG is defined as a nuclear, protein-coding gene that lacks introns in its CDS. Whereas, many public databases of Eukaryotic multi-exon genes are available, there are only two specialized databases for SEGs. The present work addresses the need for a more extensive and diverse database by creating SinEx DB, a publicly available, searchable database of predicted SEGs from 10 completely sequenced mammalian genomes including human. SinEx DB houses the DNA and protein sequence information of these SEGs and includes their functional predictions (KOG) and the relative distribution of these functions within species. The information is stored in a relational database built with My SQL Server 5.1.33 and the complete dataset of SEG sequences and their functional predictions are available for downloading. SinEx DB can be interrogated by: (i) a browsable phylogenetic schema, (ii) carrying out BLAST searches to the in-house SinEx DB of SEGs and (iii) via an advanced search mode in which the database can be searched by key words and any combination of searches by species and predicted functions. SinEx DB provides a rich source of information for advancing our understanding of the evolution and function of SEGs.Database URL: www.sinex.cl. © The Author(s) 2016. Published by Oxford University Press.
Error estimates for (semi-)empirical dispersion terms and large biomacromolecules.

PubMed

Korth, Martin

2013-10-14

The first-principles modeling of biomaterials has made tremendous advances over the last few years with the ongoing growth of computing power and impressive developments in the application of density functional theory (DFT) codes to large systems. One important step forward was the development of dispersion corrections for DFT methods, which account for the otherwise neglected dispersive van der Waals (vdW) interactions. Approaches at different levels of theory exist, with the most often used (semi-)empirical ones based on pair-wise interatomic C6R(-6) terms. Similar terms are now also used in connection with semiempirical QM (SQM) methods and density functional tight binding methods (SCC-DFTB). Their basic structure equals the attractive term in Lennard-Jones potentials, common to most force field approaches, but they usually use some type of cutoff function to make the mixing of the (long-range) dispersion term with the already existing (short-range) dispersion and exchange-repulsion effects from the electronic structure theory methods possible. All these dispersion approximations were found to perform accurately for smaller systems, but error estimates for larger systems are very rare and completely missing for really large biomolecules. We derive such estimates for the dispersion terms of DFT, SQM and MM methods using error statistics for smaller systems and dispersion contribution estimates for the PDBbind database of protein-ligand interactions. We find that dispersion terms will usually not be a limiting factor for reaching chemical accuracy, though some force fields and large ligand sizes are problematic.
Inequalities in Specialist Hand Surgeon Distribution across the United States.

PubMed

Rios-Diaz, Arturo J; Metcalfe, David; Singh, Mansher; Zogg, Cheryl K; Olufajo, Olubode A; Ramos, Margarita S; Caterson, Edward J; Talbot, Simon G

2016-05-01

Unequal access to hospital specialists for emergency care is an issue in the United States. The authors sought to describe the geographic distribution of specialist hand surgeons and associated factors in the United States. Geographic distributions of surgeons holding a Subspecialty Certificate in Surgery of the Hand and hand surgery fellowship positions were identified from the American Board of Medical Specialties Database and the literature (2013), respectively. State-level population and per capita income were ascertained using U.S. Census data. Variations in hand trauma admissions were determined using Healthcare Cost and Utilization Project national/state inpatient databases. Risk-adjusted generalized linear models were used to assess independent association between hand surgeon density and hand trauma admission density, fellowship position density, and per capita income. Among 2019 specialist hand surgeons identified, 72.1 percent were orthopedic surgeons, 18.3 percent were plastic surgeons, and 9.6 percent were general surgeons. There were 157 hand surgery fellowship positions nationwide. There were 149,295 annual hand trauma admissions. The national density of specialist hand surgeons and density of trauma admission were 0.6 and 47.6, respectively. The density of specialist hand surgeons varied significantly between states. State-level variations in density of surgeons were independent and significantly associated with median per capita income (p < 0.001) and with density of fellowships (p = 0.014). Specialist hand surgeons are distributed unevenly across the United States. State-level analyses suggest that states with lower per capita incomes may be particularly underserved, which may contribute to regional disparities in access to emergency hand trauma care.
Thigh muscle segmentation of chemical shift encoding-based water-fat magnetic resonance images: The reference database MyoSegmenTUM.

PubMed

Schlaeger, Sarah; Freitag, Friedemann; Klupp, Elisabeth; Dieckmeyer, Michael; Weidlich, Dominik; Inhuber, Stephanie; Deschauer, Marcus; Schoser, Benedikt; Bublitz, Sarah; Montagnese, Federica; Zimmer, Claus; Rummeny, Ernst J; Karampinos, Dimitrios C; Kirschke, Jan S; Baum, Thomas

2018-01-01

Magnetic resonance imaging (MRI) can non-invasively assess muscle anatomy, exercise effects and pathologies with different underlying causes such as neuromuscular diseases (NMD). Quantitative MRI including fat fraction mapping using chemical shift encoding-based water-fat MRI has emerged for reliable determination of muscle volume and fat composition. The data analysis of water-fat images requires segmentation of the different muscles which has been mainly performed manually in the past and is a very time consuming process, currently limiting the clinical applicability. An automatization of the segmentation process would lead to a more time-efficient analysis. In the present work, the manually segmented thigh magnetic resonance imaging database MyoSegmenTUM is presented. It hosts water-fat MR images of both thighs of 15 healthy subjects and 4 patients with NMD with a voxel size of 3.2x2x4 mm3 with the corresponding segmentation masks for four functional muscle groups: quadriceps femoris, sartorius, gracilis, hamstrings. The database is freely accessible online at https://osf.io/svwa7/?view_only=c2c980c17b3a40fca35d088a3cdd83e2. The database is mainly meant as ground truth which can be used as training and test dataset for automatic muscle segmentation algorithms. The segmentation allows extraction of muscle cross sectional area (CSA) and volume. Proton density fat fraction (PDFF) of the defined muscle groups from the corresponding images and quadriceps muscle strength measurements/neurological muscle strength rating can be used for benchmarking purposes.
gEVE: a genome-based endogenous viral element database provides comprehensive viral protein-coding sequences in mammalian genomes.

PubMed

Nakagawa, So; Takahashi, Mahoko Ueda

2016-01-01

In mammals, approximately 10% of genome sequences correspond to endogenous viral elements (EVEs), which are derived from ancient viral infections of germ cells. Although most EVEs have been inactivated, some open reading frames (ORFs) of EVEs obtained functions in the hosts. However, EVE ORFs usually remain unannotated in the genomes, and no databases are available for EVE ORFs. To investigate the function and evolution of EVEs in mammalian genomes, we developed EVE ORF databases for 20 genomes of 19 mammalian species. A total of 736,771 non-overlapping EVE ORFs were identified and archived in a database named gEVE (http://geve.med.u-tokai.ac.jp). The gEVE database provides nucleotide and amino acid sequences, genomic loci and functional annotations of EVE ORFs for all 20 genomes. In analyzing RNA-seq data with the gEVE database, we successfully identified the expressed EVE genes, suggesting that the gEVE database facilitates studies of the genomic analyses of various mammalian species.Database URL: http://geve.med.u-tokai.ac.jp. © The Author(s) 2016. Published by Oxford University Press.
gEVE: a genome-based endogenous viral element database provides comprehensive viral protein-coding sequences in mammalian genomes

PubMed Central

Nakagawa, So; Takahashi, Mahoko Ueda

2016-01-01

In mammals, approximately 10% of genome sequences correspond to endogenous viral elements (EVEs), which are derived from ancient viral infections of germ cells. Although most EVEs have been inactivated, some open reading frames (ORFs) of EVEs obtained functions in the hosts. However, EVE ORFs usually remain unannotated in the genomes, and no databases are available for EVE ORFs. To investigate the function and evolution of EVEs in mammalian genomes, we developed EVE ORF databases for 20 genomes of 19 mammalian species. A total of 736,771 non-overlapping EVE ORFs were identified and archived in a database named gEVE (http://geve.med.u-tokai.ac.jp). The gEVE database provides nucleotide and amino acid sequences, genomic loci and functional annotations of EVE ORFs for all 20 genomes. In analyzing RNA-seq data with the gEVE database, we successfully identified the expressed EVE genes, suggesting that the gEVE database facilitates studies of the genomic analyses of various mammalian species. Database URL: http://geve.med.u-tokai.ac.jp PMID:27242033
The thyrotropin receptor mutation database: update 2003.

PubMed

Führer, Dagmar; Lachmund, Peter; Nebel, Istvan-Tibor; Paschke, Ralf

2003-12-01

In 1999 we have created a TSHR mutation database compiling TSHR mutations with their basic characteristics and associated clinical conditions (www.uni-leipzig.de/innere/tshr). Since then, more than 2887 users from 36 countries have logged into the TSHR mutation database and have contributed several valuable suggestions for further improvement of the database. We now present an updated and extended version of the TSHR database to which several novel features have been introduced: 1. detailed functional characteristics on all 65 mutations (43 activating and 22 inactivating mutations) reported to date, 2. 40 pedigrees with detailed information on molecular aspects, clinical courses and treatment options in patients with gain-of-function and loss-of-function germline TSHR mutations, 3. a first compilation of site-directed mutagenesis studies, 4. references with Medline links, 5. a user friendly search tool for specific database searches, user-specific database output and 6. an administrator tool for the submission of novel TSHR mutations. The TSHR mutation database is installed as one of the locus specific HUGO mutation databases. It is listed under index TSHR 603372 (http://ariel.ucs.unimelb.edu.au/~cotton/glsdbq.htm) and can be accessed via www.uni-leipzig.de/innere/tshr.
MEGGASENSE - The Metagenome/Genome Annotated Sequence Natural Language Search Engine: A Platform for  the Construction of Sequence Data Warehouses.

PubMed

Gacesa, Ranko; Zucko, Jurica; Petursdottir, Solveig K; Gudmundsdottir, Elisabet Eik; Fridjonsson, Olafur H; Diminic, Janko; Long, Paul F; Cullum, John; Hranueli, Daslav; Hreggvidsson, Gudmundur O; Starcevic, Antonio

2017-06-01

The MEGGASENSE platform constructs relational databases of DNA or protein sequences. The default functional analysis uses 14 106 hidden Markov model (HMM) profiles based on sequences in the KEGG database. The Solr search engine allows sophisticated queries and a BLAST search function is also incorporated. These standard capabilities were used to generate the SCATT database from the predicted proteome of Streptomyces cattleya . The implementation of a specialised metagenome database (AMYLOMICS) for bioprospecting of carbohydrate-modifying enzymes is described. In addition to standard assembly of reads, a novel 'functional' assembly was developed, in which screening of reads with the HMM profiles occurs before the assembly. The AMYLOMICS database incorporates additional HMM profiles for carbohydrate-modifying enzymes and it is illustrated how the combination of HMM and BLAST analyses helps identify interesting genes. A variety of different proteome and metagenome databases have been generated by MEGGASENSE.
PvTFDB: a Phaseolus vulgaris transcription factors database for expediting functional genomics in legumes.

PubMed

Bhawna; Bonthala, V S; Gajula, Mnv Prasad

2016-01-01

The common bean [Phaseolus vulgaris (L.)] is one of the essential proteinaceous vegetables grown in developing countries. However, its production is challenged by low yields caused by numerous biotic and abiotic stress conditions. Regulatory transcription factors (TFs) symbolize a key component of the genome and are the most significant targets for producing stress tolerant crop and hence functional genomic studies of these TFs are important. Therefore, here we have constructed a web-accessible TFs database for P. vulgaris, called PvTFDB, which contains 2370 putative TF gene models in 49 TF families. This database provides a comprehensive information for each of the identified TF that includes sequence data, functional annotation, SSRs with their primer sets, protein physical properties, chromosomal location, phylogeny, tissue-specific gene expression data, orthologues, cis-regulatory elements and gene ontology (GO) assignment. Altogether, this information would be used in expediting the functional genomic studies of a specific TF(s) of interest. The objectives of this database are to understand functional genomics study of common bean TFs and recognize the regulatory mechanisms underlying various stress responses to ease breeding strategy for variety production through a couple of search interfaces including gene ID, functional annotation and browsing interfaces including by family and by chromosome. This database will also serve as a promising central repository for researchers as well as breeders who are working towards crop improvement of legume crops. In addition, this database provide the user unrestricted public access and the user can download entire data present in the database freely.Database URL: http://www.multiomics.in/PvTFDB/. © The Author(s) 2016. Published by Oxford University Press.
Multiconfiguration Pair-Density Functional Theory.

PubMed

Li Manni, Giovanni; Carlson, Rebecca K; Luo, Sijie; Ma, Dongxia; Olsen, Jeppe; Truhlar, Donald G; Gagliardi, Laura

2014-09-09

We present a new theoretical framework, called Multiconfiguration Pair-Density Functional Theory (MC-PDFT), which combines multiconfigurational wave functions with a generalization of density functional theory (DFT). A multiconfigurational self-consistent-field (MCSCF) wave function with correct spin and space symmetry is used to compute the total electronic density, its gradient, the on-top pair density, and the kinetic and Coulomb contributions to the total electronic energy. We then use a functional of the total density, its gradient, and the on-top pair density to calculate the remaining part of the energy, which we call the on-top-density-functional energy in contrast to the exchange-correlation energy of Kohn-Sham DFT. Because the on-top pair density is an element of the two-particle density matrix, this goes beyond the Hohenberg-Kohn theorem that refers only to the one-particle density. To illustrate the theory, we obtain first approximations to the required new type of density functionals by translating conventional density functionals of the spin densities using a simple prescription, and we perform post-SCF density functional calculations using the total density, density gradient, and on-top pair density from the MCSCF calculations. Double counting of dynamic correlation or exchange does not occur because the MCSCF energy is not used. The theory is illustrated by applications to the bond energies and potential energy curves of H2, N2, F2, CaO, Cr2, and NiCl and the electronic excitation energies of Be, C, N, N(+), O, O(+), Sc(+), Mn, Co, Mo, Ru, N2, HCHO, C4H6, c-C5H6, and pyrazine. The method presented has a computational cost and scaling similar to MCSCF, but a quantitative accuracy, even with the present first approximations to the new types of density functionals, that is comparable to much more expensive multireference perturbation theory methods.
SoyFN: a knowledge database of soybean functional networks.

PubMed

Xu, Yungang; Guo, Maozu; Liu, Xiaoyan; Wang, Chunyu; Liu, Yang

2014-01-01

Many databases for soybean genomic analysis have been built and made publicly available, but few of them contain knowledge specifically targeting the omics-level gene-gene, gene-microRNA (miRNA) and miRNA-miRNA interactions. Here, we present SoyFN, a knowledge database of soybean functional gene networks and miRNA functional networks. SoyFN provides user-friendly interfaces to retrieve, visualize, analyze and download the functional networks of soybean genes and miRNAs. In addition, it incorporates much information about KEGG pathways, gene ontology annotations and 3'-UTR sequences as well as many useful tools including SoySearch, ID mapping, Genome Browser, eFP Browser and promoter motif scan. SoyFN is a schema-free database that can be accessed as a Web service from any modern programming language using a simple Hypertext Transfer Protocol call. The Web site is implemented in Java, JavaScript, PHP, HTML and Apache, with all major browsers supported. We anticipate that this database will be useful for members of research communities both in soybean experimental science and bioinformatics. Database URL: http://nclab.hit.edu.cn/SoyFN.
5-Methylation of Cytosine in CG:CG Base-Pair Steps: A Physicochemical Mechanism for the Epigenetic Control of DNA Nanomechanics

NASA Astrophysics Data System (ADS)

Yusufaly, Tahir; Olson, Wilma; Li, Yun

2014-03-01

Van der Waals density functional theory is integrated with analysis of a non-redundant set of protein-DNA crystal structures from the Nucleic Acid Database to study the stacking energetics of CG:CG base-pair steps, specifically the role of cytosine 5-methylation. Principal component analysis of the steps reveals the dominant collective motions to correspond to a tensile ``opening'' mode and two shear ``sliding'' and ``tearing'' modes in the orthogonal plane. The stacking interactions of the methyl groups are observed to globally inhibit CG:CG step overtwisting while simultaneously softening the modes locally via potential energy modulations that create metastable states. The results have implications for the epigenetic control of DNA mechanics.
Analytical bond order potential for simulations of BeO 1D and 2D nanostructures and plasma-surface interactions

NASA Astrophysics Data System (ADS)

Byggmästar, J.; Hodille, E. A.; Ferro, Y.; Nordlund, K.

2018-04-01

An analytical interatomic bond order potential for the Be-O system is presented. The potential is fitted and compared to a large database of bulk BeO and point defect properties obtained using density functional theory. Its main applications include simulations of plasma-surface interactions involving oxygen or oxide layers on beryllium, as well as simulations of BeO nanotubes and nanosheets. We apply the potential in a study of oxygen irradiation of Be surfaces, and observe the early stages of an oxide layer forming on the Be surface. Predicted thermal and elastic properties of BeO nanotubes and nanosheets are simulated and compared with published ab initio data.
Technical efficiency of rural primary health care system for diabetes treatment in Iran: a stochastic frontier analysis.

PubMed

Qorbani, Mostafa; Farzadfar, Farshad; Majdzadeh, Reza; Mohammad, Kazem; Motevalian, Abbas

2017-01-01

Our aim was to explore the technical efficiency (TE) of the Iranian rural primary healthcare (PHC) system for diabetes treatment coverage rate using the stochastic frontier analysis (SFA) as well as to examine the strength and significance of the effect of human resources density on diabetes treatment. In the SFA model diabetes treatment coverage rate, as a output, is a function of health system inputs (Behvarz worker density, physician density, and rural health center density) and non-health system inputs (urbanization rate, median age of population, and wealth index) as a set of covariates. Data about the rate of self-reported diabetes treatment coverage was obtained from the Non-Communicable Disease Surveillance Survey, data about health system inputs were collected from the health census database and data about non-health system inputs were collected from the census data and household survey. In 2008, rate of diabetes treatment coverage was 67% (95% CI: 63%-71%) nationally, and at the provincial level it varied from 44% to 81%. The TE score at the national level was 87.84%, with considerable variation across provinces (from 59.65% to 98.28%).Among health system and non-health system inputs, only the Behvarz density (per 1000 population)was significantly associated with diabetes treatment coverage (β (95%CI): 0.50 (0.29-0.70), p < 0.001). Our findings show that although the rural PHC system can considered efficient in diabetes treatment at the national level, a wide variation exists in TE at the provincial level. Because the only variable that is predictor of TE is the Behvarz density, the PHC system may extend the diabetes treatment coverage by using this group of health care workers.
Reformulation of Density Functional Theory for N-Representable Densities and the Resolution of the v-Representability Problem

DOE PAGES

Gonis, A.; Zhang, X. G.; Stocks, G. M.; ...

2015-10-23

Density functional theory for the case of general, N-representable densities is reformulated in terms of density functional derivatives of expectation values of operators evaluated with wave functions leading to a density, making no reference to the concept of potential. The developments provide a complete solution of the v-representability problem by establishing a mathematical procedure that determines whether a density is v-representable and in the case of an affirmative answer determines the potential (within an additive constant) as a derivative with respect to the density of a constrained search functional. It also establishes the existence of an energy functional of themore » density that, for v-representable densities, assumes its minimum value at the density describing the ground state of an interacting many-particle system. The theorems of Hohenberg and Kohn emerge as special cases of the formalism.« less
Reference Correlation for the Viscosity of Carbon Dioxide1

PubMed Central

Laesecke, Arno; Muzny, Chris D.

2017-01-01

A comprehensive database of experimental and computed data for the viscosity of carbon dioxide (CO2) was compiled and a new reference correlation was developed. Literature results based on an ab initio potential energy surface were the foundation of the correlation of the viscosity in the limit of zero density in the temperature range from 100 K to 2000 K. Guided symbolic regression was employed to obtain a new functional form that extrapolates correctly to T → 0 K and to 10 000 K. Coordinated measurements at low density made it possible to implement the temperature dependence of the Rainwater-Friend theory in the linear-in-density viscosity term. The residual viscosity could be formulated with a scaling term ργ/T the significance of which was confirmed by symbolic regression. The final viscosity correlation covers temperatures from 100 K to 2000 K for gaseous CO2, and from 220 K to 700 K with pressures along the melting line up to 8000 MPa for compressed and supercritical liquid states. The data representation is more accurate than with the previous correlations, and the covered pressure and temperature range is significantly extended. The critical enhancement of the viscosity of CO2 is included in the new correlation. PMID:28736460
Weighted-density functionals for cavity formation and dispersion energies in continuum solvation models

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sundararaman, Ravishankar; Gunceler, Deniz; Arias, T. A.

2014-10-07

Continuum solvation models enable efficient first principles calculations of chemical reactions in solution, but require extensive parametrization and fitting for each solvent and class of solute systems. Here, we examine the assumptions of continuum solvation models in detail and replace empirical terms with physical models in order to construct a minimally-empirical solvation model. Specifically, we derive solvent radii from the nonlocal dielectric response of the solvent from ab initio calculations, construct a closed-form and parameter-free weighted-density approximation for the free energy of the cavity formation, and employ a pair-potential approximation for the dispersion energy. We show that the resulting modelmore » with a single solvent-independent parameter: the electron density threshold (n c), and a single solvent-dependent parameter: the dispersion scale factor (s 6), reproduces solvation energies of organic molecules in water, chloroform, and carbon tetrachloride with RMS errors of 1.1, 0.6 and 0.5 kcal/mol, respectively. We additionally show that fitting the solvent-dependent s 6 parameter to the solvation energy of a single non-polar molecule does not substantially increase these errors. Parametrization of this model for other solvents, therefore, requires minimal effort and is possible without extensive databases of experimental solvation free energies.« less
Weighted-density functionals for cavity formation and dispersion energies in continuum solvation models

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sundararaman, Ravishankar; Gunceler, Deniz; Arias, T. A.

2014-10-07

Continuum solvation models enable efficient first principles calculations of chemical reactions in solution, but require extensive parametrization and fitting for each solvent and class of solute systems. Here, we examine the assumptions of continuum solvation models in detail and replace empirical terms with physical models in order to construct a minimally-empirical solvation model. Specifically, we derive solvent radii from the nonlocal dielectric response of the solvent from ab initio calculations, construct a closed-form and parameter-free weighted-density approximation for the free energy of the cavity formation, and employ a pair-potential approximation for the dispersion energy. We show that the resulting modelmore » with a single solvent-independent parameter: the electron density threshold (n{sub c}), and a single solvent-dependent parameter: the dispersion scale factor (s{sub 6}), reproduces solvation energies of organic molecules in water, chloroform, and carbon tetrachloride with RMS errors of 1.1, 0.6 and 0.5 kcal/mol, respectively. We additionally show that fitting the solvent-dependent s{sub 6} parameter to the solvation energy of a single non-polar molecule does not substantially increase these errors. Parametrization of this model for other solvents, therefore, requires minimal effort and is possible without extensive databases of experimental solvation free energies.« less
Alkamid database: Chemistry, occurrence and functionality of plant N-alkylamides.

PubMed

Boonen, Jente; Bronselaer, Antoon; Nielandt, Joachim; Veryser, Lieselotte; De Tré, Guy; De Spiegeleer, Bart

2012-08-01

N-Alkylamides (NAAs) are a promising group of bioactive compounds, which are anticipated to act as important lead compounds for plant protection and biocidal products, functional food, cosmeceuticals and drugs in the next decennia. These molecules, currently found in more than 25 plant families and with a wide structural diversity, exert a variety of biological-pharmacological effects and are of high ethnopharmacological importance. However, information is scattered in literature, with different, often unstandardized, pharmacological methodologies being used. Therefore, a comprehensive NAA database (acronym: Alkamid) was constructed to collect the available structural and functional NAA data, linked to their occurrence in plants (family, tribe, species, genus). For loading information in the database, literature data was gathered over the period 1950-2010, by using several search engines. In order to represent the collected information about NAAs, the plants in which they occur and the functionalities for which they have been examined, a relational database is constructed and implemented on a MySQL back-end. The database is supported by describing the NAA plant-, functional- and chemical-space. The chemical space includes a NAA classification, according to their fatty acid and amine structures. The Alkamid database (publicly available on the website http://alkamid.ugent.be/) is not only a central information point, but can also function as a useful tool to prioritize the NAA choice in the evaluation of their functionality, to perform data mining leading to quantitative structure-property relationships (QSPRs), functionality comparisons, clustering, plant biochemistry and taxonomic evaluations. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.

Exchange-correlation energies of atoms from efficient density functionals: influence of the electron density

DOE PAGES

Tao, Jianmin; Ye, Lin -Hui; Duan, Yuhua

2017-11-20

The primary goal of Kohn–Sham density functional theory is to evaluate the exchange-correlation contribution to electronic properties. However, the accuracy of a density functional can be affected by the electron density. Here we apply the nonempirical Tao–Mo (TM) semilocal functional to study the influence of the electron density on the exchange and correlation energies of atoms and ions, and compare the results with the commonly used nonempirical semilocal functionals local spin-density approximation (LSDA), Perdew–Burke–Ernzerhof (PBE), Tao–Perdew–Staroverov–Scuseria (TPSS), and hybrid functional PBE0. We find that the spin-restricted Hartree–Fock density yields the exchange and correlation energies in good agreement with the Optimizedmore » Effective Potential method, particularly for spherical atoms and ions. However, the errors of these semilocal and hybrid functionals become larger for self-consistent densities. We further find that the quality of the electron density have greater effect on the exchange-correlation energies of kinetic energy density-dependent meta-GGA functionals TPSS and TM than on those of the LSDA and GGA, and therefore, should have greater influence on the performance of meta-GGA functionals. Lastly, we show that the influence of the density quality on PBE0 is slightly reduced, compared to that of PBE, due to the exact mixing.« less
Exchange-correlation energies of atoms from efficient density functionals: influence of the electron density

NASA Astrophysics Data System (ADS)

Tao, Jianmin; Ye, Lin-Hui; Duan, Yuhua

2017-12-01

The primary goal of Kohn-Sham density functional theory is to evaluate the exchange-correlation contribution to electronic properties. However, the accuracy of a density functional can be affected by the electron density. Here we apply the nonempirical Tao-Mo (TM) semilocal functional to study the influence of the electron density on the exchange and correlation energies of atoms and ions, and compare the results with the commonly used nonempirical semilocal functionals local spin-density approximation (LSDA), Perdew-Burke-Ernzerhof (PBE), Tao-Perdew-Staroverov-Scuseria (TPSS), and hybrid functional PBE0. We find that the spin-restricted Hartree-Fock density yields the exchange and correlation energies in good agreement with the Optimized Effective Potential method, particularly for spherical atoms and ions. However, the errors of these semilocal and hybrid functionals become larger for self-consistent densities. We further find that the quality of the electron density have greater effect on the exchange-correlation energies of kinetic energy density-dependent meta-GGA functionals TPSS and TM than on those of the LSDA and GGA, and therefore, should have greater influence on the performance of meta-GGA functionals. Finally, we show that the influence of the density quality on PBE0 is slightly reduced, compared to that of PBE, due to the exact mixing.
Exchange-correlation energies of atoms from efficient density functionals: influence of the electron density

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tao, Jianmin; Ye, Lin -Hui; Duan, Yuhua

The primary goal of Kohn–Sham density functional theory is to evaluate the exchange-correlation contribution to electronic properties. However, the accuracy of a density functional can be affected by the electron density. Here we apply the nonempirical Tao–Mo (TM) semilocal functional to study the influence of the electron density on the exchange and correlation energies of atoms and ions, and compare the results with the commonly used nonempirical semilocal functionals local spin-density approximation (LSDA), Perdew–Burke–Ernzerhof (PBE), Tao–Perdew–Staroverov–Scuseria (TPSS), and hybrid functional PBE0. We find that the spin-restricted Hartree–Fock density yields the exchange and correlation energies in good agreement with the Optimizedmore » Effective Potential method, particularly for spherical atoms and ions. However, the errors of these semilocal and hybrid functionals become larger for self-consistent densities. We further find that the quality of the electron density have greater effect on the exchange-correlation energies of kinetic energy density-dependent meta-GGA functionals TPSS and TM than on those of the LSDA and GGA, and therefore, should have greater influence on the performance of meta-GGA functionals. Lastly, we show that the influence of the density quality on PBE0 is slightly reduced, compared to that of PBE, due to the exact mixing.« less
The University of Minnesota Biocatalysis/Biodegradation Database: specialized metabolism for functional genomics.

PubMed Central

Ellis, L B; Hershberger, C D; Wackett, L P

1999-01-01

The University of Minnesota Biocatalysis/Biodegradation Database (UM-BBD, http://www.labmed.umn.edu/umbbd/i nde x.html) first became available on the web in 1995 to provide information on microbial biocatalytic reactions of, and biodegradation pathways for, organic chemical compounds, especially those produced by man. Its goal is to become a representative database of biodegradation, spanning the diversity of known microbial metabolic routes, organic functional groups, and environmental conditions under which biodegradation occurs. The database can be used to enhance understanding of basic biochemistry, biocatalysis leading to speciality chemical manufacture, and biodegradation of environmental pollutants. It is also a resource for functional genomics, since it contains information on enzymes and genes involved in specialized metabolism not found in intermediary metabolism databases, and thus can assist in assigning functions to genes homologous to such less common genes. With information on >400 reactions and compounds, it is poised to become a resource for prediction of microbial biodegradation pathways for compounds it does not contain, a process complementary to predicting the functions of new classes of microbial genes. PMID:9847233
Self-Interaction Error in Density Functional Theory: An Appraisal.

PubMed

Bao, Junwei Lucas; Gagliardi, Laura; Truhlar, Donald G

2018-05-03

Self-interaction error (SIE) is considered to be one of the major sources of error in most approximate exchange-correlation functionals for Kohn-Sham density-functional theory (KS-DFT), and it is large with all local exchange-correlation functionals and with some hybrid functionals. In this work, we consider systems conventionally considered to be dominated by SIE. For these systems, we demonstrate that by using multiconfiguration pair-density functional theory (MC-PDFT), the error of a translated local density-functional approximation is significantly reduced (by a factor of 3) when using an MCSCF density and on-top density, as compared to using KS-DFT with the parent functional; the error in MC-PDFT with local on-top functionals is even lower than the error in some popular KS-DFT hybrid functionals. Density-functional theory, either in MC-PDFT form with local on-top functionals or in KS-DFT form with some functionals having 50% or more nonlocal exchange, has smaller errors for SIE-prone systems than does CASSCF, which has no SIE.
76 FR 41792 - Information Collection Being Submitted for Review and Approval to the Office of Management and...

Federal Register 2010, 2011, 2012, 2013, 2014

2011-07-15

... administrator from the private sector to create and operate TV band databases. The TV band database... database administrator will be responsible for operation of their database and coordination of the overall functioning of the database with other administrators, and will provide database access to TVBDs. The...
Esophageal eosinophilia is increased in rural areas with low population density: results from a national pathology database.

PubMed

Jensen, Elizabeth T; Hoffman, Kate; Shaheen, Nicholas J; Genta, Robert M; Dellon, Evan S

2014-05-01

Eosinophilic esophagitis (EoE) is an increasingly prevalent chronic disease arising from an allergy/immune-mediated process. Generally, the risk of atopic disease differs in rural and urban environments. The relationship between population density and EoE is unknown. Our aim was to assess the relationship between EoE and population density. We conducted a cross-sectional, case-control study of patients with esophageal biopsies in a US national pathology database between January 2009 and June 2012 to assess the relationship between population density and EoE. Using geographic information systems, the population density (individuals per square mile) was determined for each patient zip code. The odds of esophageal eosinophilia and EoE were estimated for each quintile of population density and adjusted for potential confounders. Sensitivity analyses were conducted with varying case definitions and to evaluate the potential for bias from endoscopy volume and patient factors. Of 292,621 unique patients in the source population, 89,754 had normal esophageal biopsies and 14,381 had esophageal eosinophilia with ≥15 eosinophils per high-power field. The odds of having esophageal eosinophilia increased with decreasing population density (P for trend <0.001). Compared with those in the highest quintile of population density, odds of having esophageal eosinophilia were significantly higher among those in the lowest quintile of population density (adjusted odds ratio (aOR) 1.27, 95% confidence interval (CI): 1.18, 1.36). A similar dose-response trend was observed across case definitions with increased odds of EoE in the lowest population density quintile (aOR 1.59, 95% CI: 1.45-1.76). Estimates were robust to sensitivity analyses. Population density is strongly and inversely associated with esophageal eosinophilia and EoE. This association is robust to varying case definitions and adjustment factors. Environmental exposures that are more prominent in rural areas may be relevant to the pathogenesis of EoE.
Esophageal eosinophilia is increased in rural areas with low population density: Results from a national pathology database

PubMed Central

Jensen, Elizabeth T.; Hoffman, Kate; Shaheen, Nicholas J.; Genta, Robert M.; Dellon, Evan S.

2015-01-01

Objectives Eosinophilic esophagitis (EoE) is an increasingly prevalent chronic disease arising from an allergy/immune-mediated process. Generally, the risk of atopic disease differs in rural and urban environments. The relationship between population density and EoE is unknown. Our aim was to assess the relationship between EoE and population density. Methods : We conducted a cross-sectional, case-control study of patients with esophageal biopsies in a U.S. national pathology database between January 2009 and June 2012 to assess the relationship between population density and EoE. Using Geographic Information Systems (GIS), the population density (individuals/mile2) was determined for each patient zip code. The odds of esophageal eosinophilia and EoE were estimated for each quintile of population density and adjusted for potential confounders. Sensitivity analyses were conducted with varying case definitions and to evaluate the potential for bias from endoscopy volume and patient factors. Results Of 292,621 unique patients in the source population, 89,754 had normal esophageal biopsies and 14,381 had esophageal eosinophilia with ≥15 eosinophils per high-power field (eos/hpf). The odds of esophageal eosinophilia increased with decreasing population density (p for trend < 0.001). Compared to those in the highest quintile of population density, odds of esophageal eosinophilia were significantly higher amongst those in the lowest quintile of population density (aOR 1.27, 95% CI: 1.18, 1.36). A similar dose-response trend was observed across case definitions with odds of EoE increased in the lowest population density quintile (aOR 1.59, 95% CI: 1.45-1.76). Estimates were robust to sensitivity analyses. Conclusions Population density is strongly and inversely associated with esophageal eosinophilia and EoE. This association is robust to varying case definitions and adjustment factors. Environmental exposures more prominent in rural areas may be relevant to the pathogenesis of EoE. PMID:24667575
FARME DB: a functional antibiotic resistance element database

PubMed Central

Wallace, James C.; Port, Jesse A.; Smith, Marissa N.; Faustman, Elaine M.

2017-01-01

Antibiotic resistance (AR) is a major global public health threat but few resources exist that catalog AR genes outside of a clinical context. Current AR sequence databases are assembled almost exclusively from genomic sequences derived from clinical bacterial isolates and thus do not include many microbial sequences derived from environmental samples that confer resistance in functional metagenomic studies. These environmental metagenomic sequences often show little or no similarity to AR sequences from clinical isolates using standard classification criteria. In addition, existing AR databases provide no information about flanking sequences containing regulatory or mobile genetic elements. To help address this issue, we created an annotated database of DNA and protein sequences derived exclusively from environmental metagenomic sequences showing AR in laboratory experiments. Our Functional Antibiotic Resistant Metagenomic Element (FARME) database is a compilation of publically available DNA sequences and predicted protein sequences conferring AR as well as regulatory elements, mobile genetic elements and predicted proteins flanking antibiotic resistant genes. FARME is the first database to focus on functional metagenomic AR gene elements and provides a resource to better understand AR in the 99% of bacteria which cannot be cultured and the relationship between environmental AR sequences and antibiotic resistant genes derived from cultured isolates. Database URL: http://staff.washington.edu/jwallace/farme PMID:28077567
Generalized Database Management System Support for Numeric Database Environments.

ERIC Educational Resources Information Center

Dominick, Wayne D.; Weathers, Peggy G.

1982-01-01

This overview of potential for utilizing database management systems (DBMS) within numeric database environments highlights: (1) major features, functions, and characteristics of DBMS; (2) applicability to numeric database environment needs and user needs; (3) current applications of DBMS technology; and (4) research-oriented and…
[National Database of Genotypes--ethical and legal issues].

PubMed

Franková, Vera; Tesínová, Jolana; Brdicka, Radim

2011-01-01

National Database of Genotypes--ethical and legal issues The aim of the project National Database of Genotypes is to outline structure and rules for the database operation collecting information about genotypes of individual persons. The database should be used entirely for health care. Its purpose is to enable physicians to gain quick and easy access to the information about persons requiring specialized care due to their genetic constitution. In the future, another introduction of new genetic tests into the clinical practice can be expected thus the database of genotypes facilitates substantial financial savings by exclusion of duplicates of the expensive genetic testing. Ethical questions connected with the creating and functioning of such database concern mainly privacy protection, confidentiality of personal sensitive data, protection of database from misuse, consent with participation and public interests. Due to necessity of correct interpretation by qualified professional (= clinical geneticist), particular categorization of genetic data within the database is discussed. The function of proposed database has to be governed in concordance with the Czech legislation together with solving ethical problems.
Coal database for Cook Inlet and North Slope, Alaska

USGS Publications Warehouse

Stricker, Gary D.; Spear, Brianne D.; Sprowl, Jennifer M.; Dietrich, John D.; McCauley, Michael I.; Kinney, Scott A.

2011-01-01

This database is a compilation of published and nonconfidential unpublished coal data from Alaska. Although coal occurs in isolated areas throughout Alaska, this study includes data only from the Cook Inlet and North Slope areas. The data include entries from and interpretations of oil and gas well logs, coal-core geophysical logs (such as density, gamma, and resistivity), seismic shot hole lithology descriptions, measured coal sections, and isolated coal outcrops.
The NOAO Data Lab PHAT Photometry Database

NASA Astrophysics Data System (ADS)

Olsen, Knut; Williams, Ben; Fitzpatrick, Michael; PHAT Team

2018-01-01

We present a database containing both the combined photometric object catalog and the single epoch measurements from the Panchromatic Hubble Andromeda Treasury (PHAT). This database is hosted by the NOAO Data Lab (http://datalab.noao.edu), and as such exposes a number of data services to the PHAT photometry, including access through a Table Access Protocol (TAP) service, direct PostgreSQL queries, web-based and programmatic query interfaces, remote storage space for personal database tables and files, and a JupyterHub-based Notebook analysis environment, as well as image access through a Simple Image Access (SIA) service. We show how the Data Lab database and Jupyter Notebook environment allow for straightforward and efficient analyses of PHAT catalog data, including maps of object density, depth, and color, extraction of light curves of variable objects, and proper motion exploration.
Human population, urban settlement patterns and their impact on Plasmodium falciparum malaria endemicity.

PubMed

Tatem, Andrew J; Guerra, Carlos A; Kabaria, Caroline W; Noor, Abdisalan M; Hay, Simon I

2008-10-27

The efficient allocation of financial resources for malaria control and the optimal distribution of appropriate interventions require accurate information on the geographic distribution of malaria risk and of the human populations it affects. Low population densities in rural areas and high population densities in urban areas can influence malaria transmission substantially. Here, the Malaria Atlas Project (MAP) global database of Plasmodium falciparum parasite rate (PfPR) surveys, medical intelligence and contemporary population surfaces are utilized to explore these relationships and other issues involved in combining malaria risk maps with those of human population distribution in order to define populations at risk more accurately. First, an existing population surface was examined to determine if it was sufficiently detailed to be used reliably as a mask to identify areas of very low and very high population density as malaria free regions. Second, the potential of international travel and health guidelines (ITHGs) for identifying malaria free cities was examined. Third, the differences in PfPR values between surveys conducted in author-defined rural and urban areas were examined. Fourth, the ability of various global urban extent maps to reliably discriminate these author-based classifications of urban and rural in the PfPR database was investigated. Finally, the urban map that most accurately replicated the author-based classifications was analysed to examine the effects of urban classifications on PfPR values across the entire MAP database. Masks of zero population density excluded many non-zero PfPR surveys, indicating that the population surface was not detailed enough to define areas of zero transmission resulting from low population densities. In contrast, the ITHGs enabled the identification and mapping of 53 malaria free urban areas within endemic countries. Comparison of PfPR survey results showed significant differences between author-defined 'urban' and 'rural' designations in Africa, but not for the remainder of the malaria endemic world. The Global Rural Urban Mapping Project (GRUMP) urban extent mask proved most accurate for mapping these author-defined rural and urban locations, and further sub-divisions of urban extents into urban and peri-urban classes enabled the effects of high population densities on malaria transmission to be mapped and quantified. The availability of detailed, contemporary census and urban extent data for the construction of coherent and accurate global spatial population databases is often poor. These known sources of uncertainty in population surfaces and urban maps have the potential to be incorporated into future malaria burden estimates. Currently, insufficient spatial information exists globally to identify areas accurately where population density is low enough to impact upon transmission. Medical intelligence does however exist to reliably identify malaria free cities. Moreover, in Africa, urban areas that have a significant effect on malaria transmission can be mapped.
Domain fusion analysis by applying relational algebra to protein sequence and domain databases

PubMed Central

Truong, Kevin; Ikura, Mitsuhiko

2003-01-01

Background Domain fusion analysis is a useful method to predict functionally linked proteins that may be involved in direct protein-protein interactions or in the same metabolic or signaling pathway. As separate domain databases like BLOCKS, PROSITE, Pfam, SMART, PRINTS-S, ProDom, TIGRFAMs, and amalgamated domain databases like InterPro continue to grow in size and quality, a computational method to perform domain fusion analysis that leverages on these efforts will become increasingly powerful. Results This paper proposes a computational method employing relational algebra to find domain fusions in protein sequence databases. The feasibility of this method was illustrated on the SWISS-PROT+TrEMBL sequence database using domain predictions from the Pfam HMM (hidden Markov model) database. We identified 235 and 189 putative functionally linked protein partners in H. sapiens and S. cerevisiae, respectively. From scientific literature, we were able to confirm many of these functional linkages, while the remainder offer testable experimental hypothesis. Results can be viewed at . Conclusion As the analysis can be computed quickly on any relational database that supports standard SQL (structured query language), it can be dynamically updated along with the sequence and domain databases, thereby improving the quality of predictions over time. PMID:12734020
Predicting Ligand Binding Sites on Protein Surfaces by 3-Dimensional Probability Density Distributions of Interacting Atoms

PubMed Central

Jian, Jhih-Wei; Elumalai, Pavadai; Pitti, Thejkiran; Wu, Chih Yuan; Tsai, Keng-Chang; Chang, Jeng-Yih; Peng, Hung-Pin; Yang, An-Suei

2016-01-01

Predicting ligand binding sites (LBSs) on protein structures, which are obtained either from experimental or computational methods, is a useful first step in functional annotation or structure-based drug design for the protein structures. In this work, the structure-based machine learning algorithm ISMBLab-LIG was developed to predict LBSs on protein surfaces with input attributes derived from the three-dimensional probability density maps of interacting atoms, which were reconstructed on the query protein surfaces and were relatively insensitive to local conformational variations of the tentative ligand binding sites. The prediction accuracy of the ISMBLab-LIG predictors is comparable to that of the best LBS predictors benchmarked on several well-established testing datasets. More importantly, the ISMBLab-LIG algorithm has substantial tolerance to the prediction uncertainties of computationally derived protein structure models. As such, the method is particularly useful for predicting LBSs not only on experimental protein structures without known LBS templates in the database but also on computationally predicted model protein structures with structural uncertainties in the tentative ligand binding sites. PMID:27513851
MoonProt: a database for proteins that are known to moonlight

PubMed Central

Mani, Mathew; Chen, Chang; Amblee, Vaishak; Liu, Haipeng; Mathur, Tanu; Zwicke, Grant; Zabad, Shadi; Patel, Bansi; Thakkar, Jagravi; Jeffery, Constance J.

2015-01-01

Moonlighting proteins comprise a class of multifunctional proteins in which a single polypeptide chain performs multiple biochemical functions that are not due to gene fusions, multiple RNA splice variants or pleiotropic effects. The known moonlighting proteins perform a variety of diverse functions in many different cell types and species, and information about their structures and functions is scattered in many publications. We have constructed the manually curated, searchable, internet-based MoonProt Database (http://www.moonlightingproteins.org) with information about the over 200 proteins that have been experimentally verified to be moonlighting proteins. The availability of this organized information provides a more complete picture of what is currently known about moonlighting proteins. The database will also aid researchers in other fields, including determining the functions of genes identified in genome sequencing projects, interpreting data from proteomics projects and annotating protein sequence and structural databases. In addition, information about the structures and functions of moonlighting proteins can be helpful in understanding how novel protein functional sites evolved on an ancient protein scaffold, which can also help in the design of proteins with novel functions. PMID:25324305
47 CFR 15.713 - TV bands database.

Code of Federal Regulations, 2013 CFR

2013-10-01

... 47 Telecommunication 1 2013-10-01 2013-10-01 false TV bands database. 15.713 Section 15.713... TV bands database. (a) Purpose. The TV bands database serves the following functions: (1) To... channels are determined based on the interference protection requirements in § 15.712. A database must...
47 CFR 15.713 - TV bands database.

Code of Federal Regulations, 2012 CFR

2012-10-01

... 47 Telecommunication 1 2012-10-01 2012-10-01 false TV bands database. 15.713 Section 15.713... TV bands database. (a) Purpose. The TV bands database serves the following functions: (1) To... channels are determined based on the interference protection requirements in § 15.712. A database must...
A Relational Database System for Student Use.

ERIC Educational Resources Information Center

Fertuck, Len

1982-01-01

Describes an APL implementation of a relational database system suitable for use in a teaching environment in which database development and database administration are studied, and discusses the functions of the user and the database administrator. An appendix illustrating system operation and an eight-item reference list are attached. (Author/JL)

2006 Compilation of Alaska Gravity Data and Historical Reports

USGS Publications Warehouse

Saltus, Richard W.; Brown, Philip J.; Morin, Robert L.; Hill, Patricia L.

2008-01-01

Gravity anomalies provide fundamental geophysical information about Earth structure and dynamics. To increase geologic and geodynamic understanding of Alaska, the U.S. Geological Survey (USGS) has collected and processed Alaska gravity data for the past 50 years. This report introduces and describes an integrated, State-wide gravity database and provides accompanying gravity calculation tools to assist in its application. Additional information includes gravity base station descriptions and digital scans of historical USGS reports. The gravity calculation tools enable the user to reduce new gravity data in a consistent manner for combination with the existing database. This database has sufficient resolution to define the regional gravity anomalies of Alaska. Interpretation of regional gravity anomalies in parts of the State are hampered by the lack of local isostatic compensation in both southern and northern Alaska. However, when filtered appropriately, the Alaska gravity data show regional features having geologic significance. These features include gravity lows caused by low-density rocks of Cenozoic basins, flysch belts, and felsic intrusions, as well as many gravity highs associated with high-density mafic and ultramafic complexes.
Molecular Oxygen in the Thermosphere: Issues and Measurement Strategies

NASA Astrophysics Data System (ADS)

Picone, J. M.; Hedin, A. E.; Drob, D. P.; Meier, R. R.; Bishop, J.; Budzien, S. A.

2002-05-01

We review the state of empirical knowledge regarding the distribution of molecular oxygen in the lower thermosphere (100-200 km), as embodied by the new NRLMSISE-00 empirical atmospheric model, its predecessors, and the underlying databases. For altitudes above 120 km, the two major classes of data (mass spectrometer and solar ultraviolet [UV] absorption) disagree significantly regarding the magnitude of the O2 density and the dependence on solar activity. As a result, the addition of the Solar Maximum Mission (SMM) data set (based on solar UV absorption) to the NRLMSIS database has directly impacted the new model, increasing the complexity of the model's formulation and generally reducing the thermospheric O2 density relative to MSISE-90. Beyond interest in the thermosphere itself, this issue materially affects detailed models of ionospheric chemistry and dynamics as well as modeling of the upper atmospheric airglow. Because these are key elements of both experimental and operational systems which measure and forecast the near-Earth space environment, we present strategies for augmenting the database through analysis of existing data and through future measurements in order to resolve this issue.
Electronic coupling through natural amino acids

DOE Office of Scientific and Technical Information (OSTI.GOV)

Berstis, Laura; Beckham, Gregg T., E-mail: michael.crowley@nrel.gov, E-mail: gregg.beckham@nrel.gov; Crowley, Michael F., E-mail: michael.crowley@nrel.gov, E-mail: gregg.beckham@nrel.gov

2015-12-14

Myriad scientific domains concern themselves with biological electron transfer (ET) events that span across vast scales of rate and efficiency through a remarkably fine-tuned integration of amino acid (AA) sequences, electronic structure, dynamics, and environment interactions. Within this intricate scheme, many questions persist as to how proteins modulate electron-tunneling properties. To help elucidate these principles, we develop a model set of peptides representing the common α-helix and β-strand motifs including all natural AAs within implicit protein-environment solvation. Using an effective Hamiltonian strategy with density functional theory, we characterize the electronic coupling through these peptides, furthermore considering side-chain dynamics. For bothmore » motifs, predictions consistently show that backbone-mediated electronic coupling is distinctly sensitive to AA type (aliphatic, polar, aromatic, negatively charged and positively charged), and to side-chain orientation. The unique properties of these residues may be employed to design activated, deactivated, or switch-like superexchange pathways. Electronic structure calculations and Green’s function analyses indicate that localized shifts in the electron density along the peptide play a role in modulating these pathways, and further substantiate the experimentally observed behavior of proline residues as superbridges. The distinct sensitivities of tunneling pathways to sequence and conformation revealed in this electronic coupling database help improve our fundamental understanding of the broad diversity of ET reactivity and provide guiding principles for peptide design.« less
Automated 3D Ultrasound Image Segmentation to Aid Breast Cancer Image Interpretation

PubMed Central

Gu, Peng; Lee, Won-Mean; Roubidoux, Marilyn A.; Yuan, Jie; Wang, Xueding; Carson, Paul L.

2015-01-01

Segmentation of an ultrasound image into functional tissues is of great importance to clinical diagnosis of breast cancer. However, many studies are found to segment only the mass of interest and not all major tissues. Differences and inconsistencies in ultrasound interpretation call for an automated segmentation method to make results operator-independent. Furthermore, manual segmentation of entire three-dimensional (3D) ultrasound volumes is time-consuming, resource-intensive, and clinically impractical. Here, we propose an automated algorithm to segment 3D ultrasound volumes into three major tissue types: cyst/mass, fatty tissue, and fibro-glandular tissue. To test its efficacy and consistency, the proposed automated method was employed on a database of 21 cases of whole breast ultrasound. Experimental results show that our proposed method not only distinguishes fat and non-fat tissues correctly, but performs well in classifying cyst/mass. Comparison of density assessment between the automated method and manual segmentation demonstrates good consistency with an accuracy of 85.7%. Quantitative comparison of corresponding tissue volumes, which uses overlap ratio, gives an average similarity of 74.54%, consistent with values seen in MRI brain segmentations. Thus, our proposed method exhibits great potential as an automated approach to segment 3D whole breast ultrasound volumes into functionally distinct tissues that may help to correct ultrasound speed of sound aberrations and assist in density based prognosis of breast cancer. PMID:26547117
Automated 3D ultrasound image segmentation for assistant diagnosis of breast cancer

NASA Astrophysics Data System (ADS)

Wang, Yuxin; Gu, Peng; Lee, Won-Mean; Roubidoux, Marilyn A.; Du, Sidan; Yuan, Jie; Wang, Xueding; Carson, Paul L.

2016-04-01

Segmentation of an ultrasound image into functional tissues is of great importance to clinical diagnosis of breast cancer. However, many studies are found to segment only the mass of interest and not all major tissues. Differences and inconsistencies in ultrasound interpretation call for an automated segmentation method to make results operator-independent. Furthermore, manual segmentation of entire three-dimensional (3D) ultrasound volumes is time-consuming, resource-intensive, and clinically impractical. Here, we propose an automated algorithm to segment 3D ultrasound volumes into three major tissue types: cyst/mass, fatty tissue, and fibro-glandular tissue. To test its efficacy and consistency, the proposed automated method was employed on a database of 21 cases of whole breast ultrasound. Experimental results show that our proposed method not only distinguishes fat and non-fat tissues correctly, but performs well in classifying cyst/mass. Comparison of density assessment between the automated method and manual segmentation demonstrates good consistency with an accuracy of 85.7%. Quantitative comparison of corresponding tissue volumes, which uses overlap ratio, gives an average similarity of 74.54%, consistent with values seen in MRI brain segmentations. Thus, our proposed method exhibits great potential as an automated approach to segment 3D whole breast ultrasound volumes into functionally distinct tissues that may help to correct ultrasound speed of sound aberrations and assist in density based prognosis of breast cancer.
Evaluating Oilseed Biofuel Production Feasibility in California’s San Joaquin Valley Using Geophysical and Remote Sensing Techniques

PubMed Central

Corwin, Dennis L.; Yemoto, Kevin; Clary, Wes; Banuelos, Gary; Skaggs, Todd H.; Lesch, Scott M.

2017-01-01

Though more costly than petroleum-based fuels and a minor component of overall military fuel sources, biofuels are nonetheless strategically valuable to the military because of intentional reliance on multiple, reliable, secure fuel sources. Significant reduction in oilseed biofuel cost occurs when grown on marginally productive saline-sodic soils plentiful in California’s San Joaquin Valley (SJV). The objective is to evaluate the feasibility of oilseed production on marginal soils in the SJV to support a 115 ML yr−1 biofuel conversion facility. The feasibility evaluation involves: (1) development of an Ida Gold mustard oilseed yield model for marginal soils; (2) identification of marginally productive soils; (3) development of a spatial database of edaphic factors influencing oilseed yield and (4) performance of Monte Carlo simulations showing potential biofuel production on marginally productive SJV soils. The model indicates oilseed yield is related to boron, salinity, leaching fraction, and water content at field capacity. Monte Carlo simulations for the entire SJV fit a shifted gamma probability density function: Q = 68.986 + gamma (6.134,5.285), where Q is biofuel production in ML yr−1. The shifted gamma cumulative density function indicates a 0.15–0.17 probability of meeting the target biofuel-production level of 115 ML yr−1, making adequate biofuel production unlikely. PMID:29036925
Theoretical study of the ammonia nitridation rate on an Fe (100) surface: A combined density functional theory and kinetic Monte Carlo study

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yeo, Sang Chul; Lee, Hyuck Mo, E-mail: hmlee@kaist.ac.kr; Lo, Yu Chieh

2014-10-07

Ammonia (NH{sub 3}) nitridation on an Fe surface was studied by combining density functional theory (DFT) and kinetic Monte Carlo (kMC) calculations. A DFT calculation was performed to obtain the energy barriers (E{sub b}) of the relevant elementary processes. The full mechanism of the exact reaction path was divided into five steps (adsorption, dissociation, surface migration, penetration, and diffusion) on an Fe (100) surface pre-covered with nitrogen. The energy barrier (E{sub b}) depended on the N surface coverage. The DFT results were subsequently employed as a database for the kMC simulations. We then evaluated the NH{sub 3} nitridation rate onmore » the N pre-covered Fe surface. To determine the conditions necessary for a rapid NH{sub 3} nitridation rate, the eight reaction events were considered in the kMC simulations: adsorption, desorption, dissociation, reverse dissociation, surface migration, penetration, reverse penetration, and diffusion. This study provides a real-time-scale simulation of NH{sub 3} nitridation influenced by nitrogen surface coverage that allowed us to theoretically determine a nitrogen coverage (0.56 ML) suitable for rapid NH{sub 3} nitridation. In this way, we were able to reveal the coverage dependence of the nitridation reaction using the combined DFT and kMC simulations.« less
Density functional theory studies on the electronic, structural, phonon dynamical and thermo-stability properties of bicarbonates MHCO3, M = Li, Na, K

NASA Astrophysics Data System (ADS)

Duan, Yuhua; Zhang, Bo; Sorescu, Dan C.; Johnson, J. Karl; Majzoub, Eric H.; Luebke, David R.

2012-08-01

The structural, electronic, phonon dispersion and thermodynamic properties of MHCO3 (M = Li, Na, K) solids were investigated using density functional theory. The calculated bulk properties for both their ambient and the high-pressure phases are in good agreement with available experimental measurements. Solid phase LiHCO3 has not yet been observed experimentally. We have predicted several possible crystal structures for LiHCO3 using crystallographic database searching and prototype electrostatic ground state modeling. Our total energy and phonon free energy (FPH) calculations predict that LiHCO3 will be stable under suitable conditions of temperature and partial pressures of CO2 and H2O. Our calculations indicate that the {{HCO}}_{3}^{-} groups in LiHCO3 and NaHCO3 form an infinite chain structure through O⋯H⋯O hydrogen bonds. In contrast, the {{HCO}}_{3}^{-} anions form dimers, ({{HCO}}_{3}^{-})_{2}, connected through double hydrogen bonds in all phases of KHCO3. Based on density functional perturbation theory, the Born effective charge tensor of each atom type was obtained for all phases of the bicarbonates. Their phonon dispersions with the longitudinal optical-transverse optical splitting were also investigated. Based on lattice phonon dynamics study, the infrared spectra and the thermodynamic properties of these bicarbonates were obtained. Over the temperature range 0-900 K, the FPH and the entropies (S) of MHCO3 (M =Li, Na, K) systems vary as FPH(LiHCO3) > FPH(NaHCO3) > FPH(KHCO3) and S(KHCO3) > S(NaHCO3) > S(LiHCO3), respectively, in agreement with the available experimental data. Analysis of the predicted thermodynamics of the CO2 capture reactions indicates that the carbonate/bicarbonate transition reactions for Na and K could be used for CO2 capture technology, in agreement with experiments.
Density-functional theory for internal magnetic fields

NASA Astrophysics Data System (ADS)

Tellgren, Erik I.

2018-01-01

A density-functional theory is developed based on the Maxwell-Schrödinger equation with an internal magnetic field in addition to the external electromagnetic potentials. The basic variables of this theory are the electron density and the total magnetic field, which can equivalently be represented as a physical current density. Hence, the theory can be regarded as a physical current density-functional theory and an alternative to the paramagnetic current density-functional theory due to Vignale and Rasolt. The energy functional has strong enough convexity properties to allow a formulation that generalizes Lieb's convex analysis formulation of standard density-functional theory. Several variational principles as well as a Hohenberg-Kohn-like mapping between potentials and ground-state densities follow from the underlying convex structure. Moreover, the energy functional can be regarded as the result of a standard approximation technique (Moreau-Yosida regularization) applied to the conventional Schrödinger ground-state energy, which imposes limits on the maximum curvature of the energy (with respect to the magnetic field) and enables construction of a (Fréchet) differentiable universal density functional.
Electronic spectra from TDDFT and machine learning in chemical space

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ramakrishnan, Raghunathan; Hartmann, Mia; Tapavicza, Enrico

Due to its favorable computational efficiency, time-dependent (TD) density functional theory (DFT) enables the prediction of electronic spectra in a high-throughput manner across chemical space. Its predictions, however, can be quite inaccurate. We resolve this issue with machine learning models trained on deviations of reference second-order approximate coupled-cluster (CC2) singles and doubles spectra from TDDFT counterparts, or even from DFT gap. We applied this approach to low-lying singlet-singlet vertical electronic spectra of over 20 000 synthetically feasible small organic molecules with up to eight CONF atoms. The prediction errors decay monotonously as a function of training set size. For amore » training set of 10 000 molecules, CC2 excitation energies can be reproduced to within +/- 0.1 eV for the remaining molecules. Analysis of our spectral database via chromophore counting suggests that even higher accuracies can be achieved. Based on the evidence collected, we discuss open challenges associated with data-driven modeling of high-lying spectra and transition intensities.« less
Design principles for radiation-resistant solid solutions

NASA Astrophysics Data System (ADS)

Schuler, Thomas; Trinkle, Dallas R.; Bellon, Pascal; Averback, Robert

2017-05-01

We develop a multiscale approach to quantify the increase in the recombined fraction of point defects under irradiation resulting from dilute solute additions to a solid solution. This methodology provides design principles for radiation-resistant materials. Using an existing database of solute diffusivities, we identify Sb as one of the most efficient solutes for this purpose in a Cu matrix. We perform density-functional-theory calculations to obtain binding and migration energies of Sb atoms, vacancies, and self-interstitial atoms in various configurations. The computed data informs the self-consistent mean-field formalism to calculate transport coefficients, allowing us to make quantitative predictions of the recombined fraction of point defects as a function of temperature and irradiation rate using homogeneous rate equations. We identify two different mechanisms according to which solutes lead to an increase in the recombined fraction of point defects; at low temperature, solutes slow down vacancies (kinetic effect), while at high temperature, solutes stabilize vacancies in the solid solution (thermodynamic effect). Extension to other metallic matrices and solutes are discussed.
Automatic classification and detection of clinically relevant images for diabetic retinopathy

NASA Astrophysics Data System (ADS)

Xu, Xinyu; Li, Baoxin

2008-03-01

We proposed a novel approach to automatic classification of Diabetic Retinopathy (DR) images and retrieval of clinically-relevant DR images from a database. Given a query image, our approach first classifies the image into one of the three categories: microaneurysm (MA), neovascularization (NV) and normal, and then it retrieves DR images that are clinically-relevant to the query image from an archival image database. In the classification stage, the query DR images are classified by the Multi-class Multiple-Instance Learning (McMIL) approach, where images are viewed as bags, each of which contains a number of instances corresponding to non-overlapping blocks, and each block is characterized by low-level features including color, texture, histogram of edge directions, and shape. McMIL first learns a collection of instance prototypes for each class that maximizes the Diverse Density function using Expectation- Maximization algorithm. A nonlinear mapping is then defined using the instance prototypes and maps every bag to a point in a new multi-class bag feature space. Finally a multi-class Support Vector Machine is trained in the multi-class bag feature space. In the retrieval stage, we retrieve images from the archival database who bear the same label with the query image, and who are the top K nearest neighbors of the query image in terms of similarity in the multi-class bag feature space. The classification approach achieves high classification accuracy, and the retrieval of clinically-relevant images not only facilitates utilization of the vast amount of hidden diagnostic knowledge in the database, but also improves the efficiency and accuracy of DR lesion diagnosis and assessment.
Multiconfiguration Pair-Density Functional Theory Outperforms Kohn-Sham Density Functional Theory and Multireference Perturbation Theory for Ground-State and Excited-State Charge Transfer.

PubMed

Ghosh, Soumen; Sonnenberger, Andrew L; Hoyer, Chad E; Truhlar, Donald G; Gagliardi, Laura

2015-08-11

The correct description of charge transfer in ground and excited states is very important for molecular interactions, photochemistry, electrochemistry, and charge transport, but it is very challenging for Kohn-Sham (KS) density functional theory (DFT). KS-DFT exchange-correlation functionals without nonlocal exchange fail to describe both ground- and excited-state charge transfer properly. We have recently proposed a theory called multiconfiguration pair-density functional theory (MC-PDFT), which is based on a combination of multiconfiguration wave function theory with a new type of density functional called an on-top density functional. Here we have used MC-PDFT to study challenging ground- and excited-state charge-transfer processes by using on-top density functionals obtained by translating KS exchange-correlation functionals. For ground-state charge transfer, MC-PDFT performs better than either the PBE exchange-correlation functional or CASPT2 wave function theory. For excited-state charge transfer, MC-PDFT (unlike KS-DFT) shows qualitatively correct behavior at long-range with great improvement in predicted excitation energies.
Statistics of strain rates and surface density function in a flame-resolved high-fidelity simulation of a turbulent premixed bluff body burner

NASA Astrophysics Data System (ADS)

Sandeep, Anurag; Proch, Fabian; Kempf, Andreas M.; Chakraborty, Nilanjan

2018-06-01

The statistical behavior of the surface density function (SDF, the magnitude of the reaction progress variable gradient) and the strain rates, which govern the evolution of the SDF, have been analyzed using a three-dimensional flame-resolved simulation database of a turbulent lean premixed methane-air flame in a bluff-body configuration. It has been found that the turbulence intensity increases with the distance from the burner, changing the flame curvature distribution and increasing the probability of the negative curvature in the downstream direction. The curvature dependences of dilatation rate ∇ṡu → and displacement speed Sd give rise to variations of these quantities in the axial direction. These variations affect the nature of the alignment between the progress variable gradient and the local principal strain rates, which in turn affects the mean flame normal strain rate, which assumes positive values close to the burner but increasingly becomes negative as the effect of turbulence increases with the axial distance from the burner exit. The axial distance dependences of the curvature and displacement speed also induce a considerable variation in the mean value of the curvature stretch. The axial distance dependences of the dilatation rate and flame normal strain rate govern the behavior of the flame tangential strain rate, and its mean value increases in the downstream direction. The current analysis indicates that the statistical behaviors of different strain rates and displacement speed and their curvature dependences need to be included in the modeling of flame surface density and scalar dissipation rate in order to accurately capture their local behaviors.
Development of a Life History Database for Upper Mississippi River Fishes

DTIC Science & Technology

2007-05-01

prevailing ecological and river theories with existing empirical data, investigating anthropogenic controls on functional attributes of ecosystems...2001; 2005a). database closely reflect the ecological attributes Finally, the life history database will allow the of UMRS fish species. These...34 Functional Feeding Guilds attribute class provide information on reproductive capacity, timing and mode for UMRS fish species. Our first example used the
Data-driven discovery of new Dirac semimetal materials

NASA Astrophysics Data System (ADS)

Yan, Qimin; Chen, Ru; Neaton, Jeffrey

In recent years, a significant amount of materials property data from high-throughput computations based on density functional theory (DFT) and the application of database technologies have enabled the rise of data-driven materials discovery. In this work, we initiate the extension of the data-driven materials discovery framework to the realm of topological semimetal materials and to accelerate the discovery of novel Dirac semimetals. We implement current available and develop new workflows to data-mine the Materials Project database for novel Dirac semimetals with desirable band structures and symmetry protected topological properties. This data-driven effort relies on the successful development of several automatic data generation and analysis tools, including a workflow for the automatic identification of topological invariants and pattern recognition techniques to find specific features in a massive number of computed band structures. Utilizing this approach, we successfully identified more than 15 novel Dirac point and Dirac nodal line systems that have not been theoretically predicted or experimentally identified. This work is supported by the Materials Project Predictive Modeling Center through the U.S. Department of Energy, Office of Basic Energy Sciences, Materials Sciences and Engineering Division, under Contract No. DE-AC02-05CH11231.
Development Algorithm of the Technological Process of Manufacturing Gas Turbine Parts by Selective Laser Melting

NASA Astrophysics Data System (ADS)

Sotov, A. V.; Agapovichev, A. V.; Smelov, V. G.; Kyarimov, R. R.

2018-01-01

The technology of the selective laser melting (SLM) allows making products from powders of aluminum, titanium, heat-resistant alloys and stainless steels. Today the use of SLM technology develops at manufacture of the functional parts. This in turn requires development of a methodology projection of technological processes (TP) for manufacturing parts including databases of standard TP. Use of a technique will allow to exclude influence of technologist’s qualification on made products quality, and also to reduce labor input and energy consumption by development of TP due to use of the databases of standard TP integrated into a methodology. As approbation of the developed methodology the research of influence of the modes of a laser emission on a roughness of a surface of synthesized material was presented. It is established that the best values of a roughness of exemplars in the longitudinal and transversal directions make 1.98 μm and 3.59 μm respectively. These values of a roughness were received at specific density of energy 6.25 J/mm2 that corresponds to power and the speed of scanning of 200 W and 400 mm/s, respectively, and a hatch distance of 0.08 mm.
Retinal vessel segmentation using the 2-D Gabor wavelet and supervised classification.

PubMed

Soares, João V B; Leandro, Jorge J G; Cesar Júnior, Roberto M; Jelinek, Herbert F; Cree, Michael J

2006-09-01

We present a method for automated segmentation of the vasculature in retinal images. The method produces segmentations by classifying each image pixel as vessel or nonvessel, based on the pixel's feature vector. Feature vectors are composed of the pixel's intensity and two-dimensional Gabor wavelet transform responses taken at multiple scales. The Gabor wavelet is capable of tuning to specific frequencies, thus allowing noise filtering and vessel enhancement in a single step. We use a Bayesian classifier with class-conditional probability density functions (likelihoods) described as Gaussian mixtures, yielding a fast classification, while being able to model complex decision surfaces. The probability distributions are estimated based on a training set of labeled pixels obtained from manual segmentations. The method's performance is evaluated on publicly available DRIVE (Staal et al., 2004) and STARE (Hoover et al., 2000) databases of manually labeled images. On the DRIVE database, it achieves an area under the receiver operating characteristic curve of 0.9614, being slightly superior than that presented by state-of-the-art approaches. We are making our implementation available as open source MATLAB scripts for researchers interested in implementation details, evaluation, or development of methods.
Spatial distribution of citizen science casuistic observations for different taxonomic groups.

PubMed

Tiago, Patrícia; Ceia-Hasse, Ana; Marques, Tiago A; Capinha, César; Pereira, Henrique M

2017-10-16

Opportunistic citizen science databases are becoming an important way of gathering information on species distributions. These data are temporally and spatially dispersed and could have limitations regarding biases in the distribution of the observations in space and/or time. In this work, we test the influence of landscape variables in the distribution of citizen science observations for eight taxonomic groups. We use data collected through a Portuguese citizen science database (biodiversity4all.org). We use a zero-inflated negative binomial regression to model the distribution of observations as a function of a set of variables representing the landscape features plausibly influencing the spatial distribution of the records. Results suggest that the density of paths is the most important variable, having a statistically significant positive relationship with number of observations for seven of the eight taxa considered. Wetland coverage was also identified as having a significant, positive relationship, for birds, amphibians and reptiles, and mammals. Our results highlight that the distribution of species observations, in citizen science projects, is spatially biased. Higher frequency of observations is driven largely by accessibility and by the presence of water bodies. We conclude that efforts are required to increase the spatial evenness of sampling effort from volunteers.
Electron Stark Broadening Database for Atomic N, O, and C Lines

NASA Technical Reports Server (NTRS)

Liu, Yen; Yao, Winifred M.; Wray, Alan A.; Carbon, Duane F.

2012-01-01

A database for efficiently computing the electron Stark broadening line widths for atomic N, O, and C lines is constructed. The line width is expressed in terms of the electron number density and electronatom scattering cross sections based on the Baranger impact theory. The state-to-state cross sections are computed using the semiclassical approximation, in which the atom is treated quantum mechanically whereas the motion of the free electron follows a classical trajectory. These state-to-state cross sections are calculated based on newly compiled line lists. Each atomic line list consists of a careful merger of NIST, Vanderbilt, and TOPbase line datasets from wavelength 50 nm to 50 micrometers covering the VUV to IR spectral regions. There are over 10,000 lines in each atomic line list. The widths for each line are computed at 13 electron temperatures between 1,000 K 50,000 K. A linear least squares method using a four-term fractional power series is then employed to obtain an analytical fit for each line-width variation as a function of the electron temperature. The maximum L2 error of the analytic fits for all lines in our line lists is about 5%.

Analysis of flash flood parameters and human impacts in the US from 2006 to 2012

NASA Astrophysics Data System (ADS)

Špitalar, Maruša; Gourley, Jonathan J.; Lutoff, Celine; Kirstetter, Pierre-Emmanuel; Brilly, Mitja; Carr, Nicholas

2014-11-01

Several different factors external to the natural hazard of flash flooding can contribute to the type and magnitude of their resulting damages. Human exposure, vulnerability, fatality and injury rates can be minimized by identifying and then mitigating the causative factors for human impacts. A database of flash flooding was used for statistical analysis of human impacts across the U.S. 21,549 flash flood events were analyzed during a 6-year period from October 2006 to 2012. Based on the information available in the database, physical parameters were introduced and then correlated to the reported human impacts. Probability density functions of the frequency of flash flood events and the PDF of occurrences weighted by the number of injuries and fatalities were used to describe the influence of each parameter. The factors that emerged as the most influential on human impacts are short flood durations, small catchment sizes in rural areas, vehicles, and nocturnal events with low visibility. Analyzing and correlating a diverse range of parameters to human impacts give us important insights into what contributes to fatalities and injuries and further raises questions on how to manage them.
RARGE II: an integrated phenotype database of Arabidopsis mutant traits using a controlled vocabulary.

PubMed

Akiyama, Kenji; Kurotani, Atsushi; Iida, Kei; Kuromori, Takashi; Shinozaki, Kazuo; Sakurai, Tetsuya

2014-01-01

Arabidopsis thaliana is one of the most popular experimental plants. However, only 40% of its genes have at least one experimental Gene Ontology (GO) annotation assigned. Systematic observation of mutant phenotypes is an important technique for elucidating gene functions. Indeed, several large-scale phenotypic analyses have been performed and have generated phenotypic data sets from many Arabidopsis mutant lines and overexpressing lines, which are freely available online. Since each Arabidopsis mutant line database uses individual phenotype expression, the differences in the structured term sets used by each database make it difficult to compare data sets and make it impossible to search across databases. Therefore, we obtained publicly available information for a total of 66,209 Arabidopsis mutant lines, including loss-of-function (RATM and TARAPPER) and gain-of-function (AtFOX and OsFOX) lines, and integrated the phenotype data by mapping the descriptions onto Plant Ontology (PO) and Phenotypic Quality Ontology (PATO) terms. This approach made it possible to manage the four different phenotype databases as one large data set. Here, we report a publicly accessible web-based database, the RIKEN Arabidopsis Genome Encyclopedia II (RARGE II; http://rarge-v2.psc.riken.jp/), in which all of the data described in this study are included. Using the database, we demonstrated consistency (in terms of protein function) with a previous study and identified the presumed function of an unknown gene. We provide examples of AT1G21600, which is a subunit in the plastid-encoded RNA polymerase complex, and AT5G56980, which is related to the jasmonic acid signaling pathway.
Mass-storage management for distributed image/video archives

NASA Astrophysics Data System (ADS)

Franchi, Santina; Guarda, Roberto; Prampolini, Franco

1993-04-01

The realization of image/video database requires a specific design for both database structures and mass storage management. This issue has addressed the project of the digital image/video database system that has been designed at IBM SEMEA Scientific & Technical Solution Center. Proper database structures have been defined to catalog image/video coding technique with the related parameters, and the description of image/video contents. User workstations and servers are distributed along a local area network. Image/video files are not managed directly by the DBMS server. Because of their wide size, they are stored outside the database on network devices. The database contains the pointers to the image/video files and the description of the storage devices. The system can use different kinds of storage media, organized in a hierarchical structure. Three levels of functions are available to manage the storage resources. The functions of the lower level provide media management. They allow it to catalog devices and to modify device status and device network location. The medium level manages image/video files on a physical basis. It manages file migration between high capacity media and low access time media. The functions of the upper level work on image/video file on a logical basis, as they archive, move and copy image/video data selected by user defined queries. These functions are used to support the implementation of a storage management strategy. The database information about characteristics of both storage devices and coding techniques are used by the third level functions to fit delivery/visualization requirements and to reduce archiving costs.
Inter-station coda wavefield studies using a novel icequake database on Erebus volcano

NASA Astrophysics Data System (ADS)

Chaput, J. A.; Campillo, M.; Roux, P.; Aster, R. C.

2013-12-01

Recent theoretical advances pertaining to the properties of multiply scattered wavefields have yielded a plethora of numerical and controlled source studies aiming to better understand what information may be derived from these otherwise chaotic signals. Practically, multiply scattered wavefields are difficult to compare to numerically derived models due to a combination of source paucity/directionality and array density limitations, particularly in passive seismology scenarios. Furthermore, in situations where data quantities are abundant, such as for ambient noise correlations, it remains very difficult to recover pseudo-Green's function symmetry in the ballistic components of the wavefield, let alone in the coda of the correlations. In this study, we use a large network of short period and broadband instruments on Erebus volcano to show that actual Green's function recovery is indeed possible in some cases. We make use of a large database of small impulsive icequakes distributed randomly on the summit plateau and, using fundamental theoretical properties of equipartitioned wavefields and interstation icequake coda correlations, are able to directly derive notoriously difficult quantities such as the bulk elastic mean free path for the volcano, demonstrations of correlation coda symmetry and its dependence on the number of icequakes used, and a theoretically predicted coherent backscattering amplification factor associated with weak localization. We furthermore show that stable equipartition and H^2/V^2 ratios may be consistently observed for icequake coda, and we perform simple depth inversions of these frequency dependent quantities to compare with known structures.
Chemical composition of Earth's core

NASA Astrophysics Data System (ADS)

Saxena, S.

2017-12-01

Many planetary scientists accept that the condensed planetesimals in the solar nebula eventually led to accretion of the earth. The details of the process have not been worked out. From the metallurgical experience, it is assumed that Earth's core may have formed by density differentiation with iron sinking to the core and the slag forming the mantle. This would be a post-accretionary process with temperature developing with self-compression. The problem with this hypothesis was recognized some time ago in that the seismic density profile of the core does not match the density of iron and requires the addition of a light element. Many elements such as Si, O, C and s have been proposed as diluents to decrease the density of a purely iron core. How and when this will be accomplished is still under discussion. Since the planetesimals (or condensates) formed in a well stirred nebula, it may be argued that a variety of condensed solids and fluids may have accreted and compressed without differentiation and the core does not necessarily contain mainly the differentiated iron. It is a matter of accumulating the condensate composition that would result in a density of 12 to 13 g/cm3 in the inner core. Therefore, we need a thermodynamic database that extends to 6000 K over the pressure range of ambient to 360 GPa. The development of such a database is currently in progress. It is a database with multicomponent solutions (C-Fe-Ni-S-Si) and all the major elements in the solar gas. Thermodynamic calculations using a preliminary dataset reveal that the solid species condensed at a temperature of 650 K and a pressure of 0.001 bar pressure, when self-compressed to various pressures and temperatures, yield densities that are appropriate for the mantle and core. Depending on H2/O of the escaping fluid, the formation of hydrous minerals, carbides, carbonates and iron melts with significant other elements have been found. Earth's core may have formed from solar condensate materials representing a range of solids avaeraging the seismic density of 13 kg/m3. Such material does not have to be Fe-Ni alloy but could be many different solids and a multielement alloy. Appropriate PVT equations of state have been used in arriving at this conclusion.
Domain fusion analysis by applying relational algebra to protein sequence and domain databases.

PubMed

Truong, Kevin; Ikura, Mitsuhiko

2003-05-06

Domain fusion analysis is a useful method to predict functionally linked proteins that may be involved in direct protein-protein interactions or in the same metabolic or signaling pathway. As separate domain databases like BLOCKS, PROSITE, Pfam, SMART, PRINTS-S, ProDom, TIGRFAMs, and amalgamated domain databases like InterPro continue to grow in size and quality, a computational method to perform domain fusion analysis that leverages on these efforts will become increasingly powerful. This paper proposes a computational method employing relational algebra to find domain fusions in protein sequence databases. The feasibility of this method was illustrated on the SWISS-PROT+TrEMBL sequence database using domain predictions from the Pfam HMM (hidden Markov model) database. We identified 235 and 189 putative functionally linked protein partners in H. sapiens and S. cerevisiae, respectively. From scientific literature, we were able to confirm many of these functional linkages, while the remainder offer testable experimental hypothesis. Results can be viewed at http://calcium.uhnres.utoronto.ca/pi. As the analysis can be computed quickly on any relational database that supports standard SQL (structured query language), it can be dynamically updated along with the sequence and domain databases, thereby improving the quality of predictions over time.
A note on the accuracy of KS-DFT densities

NASA Astrophysics Data System (ADS)

Ranasinghe, Duminda S.; Perera, Ajith; Bartlett, Rodney J.

2017-11-01

The accuracy of the density of wave function methods and Kohn-Sham (KS) density functionals is studied using moments of the density, ⟨rn ⟩ =∫ ρ (r )rnd τ =∫0∞4 π r2ρ (r ) rnd r ,where n =-1 ,-2,0,1,2 ,and 3 provides information about the short- and long-range behavior of the density. Coupled cluster (CC) singles, doubles, and perturbative triples (CCSD(T)) is considered as the reference density. Three test sets are considered: boron through neon neutral atoms, two and four electron cations, and 3d transition metals. The total density and valence only density are distinguished by dropping appropriate core orbitals. Among density functionals tested, CAMQTP00 and ωB97x show the least deviation for boron through neon neutral atoms. They also show accurate eigenvalues for the HOMO indicating that they should have a more correct long-range behavior for the density. For transition metals, some density functional approximations outperform some wave function methods, suggesting that the KS determinant could be a better starting point for some kinds of correlated calculations. By using generalized many-body perturbation theory (MBPT), the convergence of second-, third-, and fourth-order KS-MBPT for the density is addressed as it converges to the infinite-order coupled cluster result. For the transition metal test set, the deviations in the KS density functional theory methods depend on the amount of exact exchange the functional uses. Functionals with exact exchange close to 25% show smaller deviations from the CCSD(T) density.
First-principles configurational thermodynamics of alloyed nanoparticles with adsorbates

NASA Astrophysics Data System (ADS)

Wang, Lin-Lin; Tan, Teck L.; Johnson, Duane D.

2014-03-01

Transition-metal, alloyed nanoparticles (NPs) are key components in current and emerging energy technologies because they are found to improve catalytic activity and selectivity for many energy-conversion processes. However, thermodynamic investigations of the compositional profile of alloyed nanoparticles, which determines their catalytic properties, have been limited mostly to NP core-shell preference without the presence of adsorbates. Here, by extending cluster expansion methods to treat both alloyed nanoparticles and adsorbates, we study the configurational thermodynamics of bimetallic NPs under chemically reactive conditions, using databases from density functional theory calculations. With a few examples, we show that such simulations can provide information needed for rational design of NP catalysts. DOE/BES under DE-FG02-03ER15476 (Catalysis) and DE-AC02-07CH11358 at the Ames Laboratory.
Adequacy of damped dynamics to represent the electron-phonon interaction in solids

DOE PAGES

Caro, A.; Correa, A. A.; Tamm, A.; ...

2015-10-16

Time-dependent density functional theory and Ehrenfest dynamics are used to calculate the electronic excitations produced by a moving Ni ion in a Ni crystal in the case of energetic MeV range (electronic stopping power regime), as well as thermal energy meV range (electron-phonon interaction regime). Results at high energy compare well to experimental databases of stopping power, and at low energy the electron-phonon interaction strength determined in this way is very similar to the linear response calculation and experimental measurements. This approach to electron-phonon interaction as an electronic stopping process provides the basis for a unified framework to perform classicalmore » molecular dynamics of ion-solid interaction with ab initio type nonadiabatic terms in a wide range of energies.« less
A Review on Data Stream Classification

NASA Astrophysics Data System (ADS)

Haneen, A. A.; Noraziah, A.; Wahab, Mohd Helmy Abd

2018-05-01

At this present time, the significance of data streams cannot be denied as many researchers have placed their focus on the research areas of databases, statistics, and computer science. In fact, data streams refer to some data points sequences that are found in order with the potential to be non-binding, which is generated from the process of generating information in a manner that is not stationary. As such the typical tasks of searching data have been linked to streams of data that are inclusive of clustering, classification, and repeated mining of pattern. This paper presents several data stream clustering approaches, which are based on density, besides attempting to comprehend the function of the related algorithms; both semi-supervised and active learning, along with reviews of a number of recent studies.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Vogel, Thomas; Perez, Danny

We recently introduced a novel replica-exchange scheme in which an individual replica can sample from states encountered by other replicas at any previous time by way of a global configuration database, enabling the fast propagation of relevant states through the whole ensemble of replicas. This mechanism depends on the knowledge of global thermodynamic functions which are measured during the simulation and not coupled to the heat bath temperatures driving the individual simulations. Therefore, this setup also allows for a continuous adaptation of the temperature set. In this paper, we will review the new scheme and demonstrate its capability. Furthermore, themore » method is particularly useful for the fast and reliable estimation of the microcanonical temperature T(U) or, equivalently, of the density of states g(U) over a wide range of energies.« less
Statistical properties of business firms structure and growth

NASA Astrophysics Data System (ADS)

Matia, K.; Fu, Dongfeng; Buldyrev, S. V.; Pammolli, F.; Riccaboni, M.; Stanley, H. E.

2004-08-01

We analyze a database comprising quarterly sales of 55624 pharmaceutical products commercialized by 3939 pharmaceutical firms in the period 1992 2001. We study the probability density function (PDF) of growth in firms and product sales and find that the width of the PDF of growth decays with the sales as a power law with exponent β = 0.20 ± 0.01. We also find that the average sales of products scales with the firm sales as a power law with exponent α = 0.57 ± 0.02. And that the average number products of a firm scales with the firm sales as a power law with exponent γ = 0.42 ± 0.02. We compare these findings with the predictions of models proposed till date on growth of business firms.
47 CFR 0.241 - Authority delegated.

Code of Federal Regulations, 2012 CFR

2012-10-01

... database functions for unlicensed devices operating in the television broadcast bands (TV bands) as set... methods that will be used to designate TV bands database managers, to designate these database managers; to develop procedures that these database managers will use to ensure compliance with the...
47 CFR 0.241 - Authority delegated.

Code of Federal Regulations, 2013 CFR

2013-10-01

... database functions for unlicensed devices operating in the television broadcast bands (TV bands) as set... methods that will be used to designate TV bands database managers, to designate these database managers; to develop procedures that these database managers will use to ensure compliance with the...
47 CFR 0.241 - Authority delegated.

Code of Federal Regulations, 2011 CFR

2011-10-01

... database functions for unlicensed devices operating in the television broadcast bands (TV bands) as set... methods that will be used to designate TV bands database managers, to designate these database managers; to develop procedures that these database managers will use to ensure compliance with the...
Segmentation Algorithms for Detection of Targets in IR Imagery (Algorithmes de Segmentation pour la Detection de Cibles sur Images IR),

DTIC Science & Technology

1981-01-01

This fact being established, leptokurtic and platykurtic density functions are defined in terms of deviations from the normal density function. Thus...the usual definitions (Ref. 6) are: Leptokurtic - A density function that is peaked, K > 0, [18] and Platykurtic - A density function that is flat, K...has long Deen accepted that a symmetrical platykurtic density function, with K<O, is characterized by a flatter top and more abrupt terminals than the
DB Dehydrogenase: an online integrated structural database on enzyme dehydrogenase.

PubMed

Nandy, Suman Kumar; Bhuyan, Rajabrata; Seal, Alpana

2012-01-01

Dehydrogenase enzymes are almost inevitable for metabolic processes. Shortage or malfunctioning of dehydrogenases often leads to several acute diseases like cancers, retinal diseases, diabetes mellitus, Alzheimer, hepatitis B & C etc. With advancement in modern-day research, huge amount of sequential, structural and functional data are generated everyday and widens the gap between structural attributes and its functional understanding. DB Dehydrogenase is an effort to relate the functionalities of dehydrogenase with its structures. It is a completely web-based structural database, covering almost all dehydrogenases [~150 enzyme classes, ~1200 entries from ~160 organisms] whose structures are known. It is created by extracting and integrating various online resources to provide the true and reliable data and implemented by MySQL relational database through user friendly web interfaces using CGI Perl. Flexible search options are there for data extraction and exploration. To summarize, sequence, structure, function of all dehydrogenases in one place along with the necessary option of cross-referencing; this database will be utile for researchers to carry out further work in this field. The database is available for free at http://www.bifku.in/DBD/
A global database and "state of the field" review of research into ecosystem engineering by land animals.

PubMed

Coggan, Nicole V; Hayward, Matthew W; Gibb, Heloise

2018-02-28

Ecosystem engineers have been widely studied for terrestrial systems, but global trends in research encompassing the range of taxa and functions have not previously been synthesised. We reviewed contemporary understanding of engineer fauna in terrestrial habitats and assessed the methods used to document patterns and processes, asking: (a) which species act as ecosystem engineers and with whom do they interact? (b) What are the impacts of ecosystem engineers in terrestrial habitats and how are they distributed? (c) What are the primary methods used to examine engineer effects and how have these developed over time? We considered the strengths, weaknesses and gaps in knowledge related to each of these questions and suggested a conceptual framework to delineate "significant impacts" of engineering interactions for all terrestrial animals. We collected peer-reviewed publications examining ecosystem engineer impacts and created a database of engineer species to assess experimental approaches and any additional covariates that influenced the magnitude of engineer impacts. One hundred and twenty-two species from 28 orders were identified as ecosystem engineers, performing five ecological functions. Burrowing mammals were the most researched group (27%). Half of all studies occurred in dry/arid habitats. Mensurative studies comparing sites with and without engineers (80%) were more common than manipulative studies (20%). These provided a broad framework for predicting engineer impacts upon abundance and species diversity. However, the roles of confounding factors, processes driving these patterns and the consequences of experimentally adjusting variables, such as engineer density, have been neglected. True spatial and temporal replication has also been limited, particularly for emerging studies of engineer reintroductions. Climate change and habitat modification will challenge the roles that engineers play in regulating ecosystems, and these will become important avenues for future research. We recommend future studies include simulation of engineer effects and experimental manipulation of engineer densities to determine the potential for ecological cascades through trophic and engineering pathways due to functional decline. We also recommend improving knowledge of long-term engineering effects and replication of engineer reintroductions across landscapes to better understand how large-scale ecological gradients alter the magnitude of engineering impacts. © 2018 The Authors. Journal of Animal Ecology © 2018 British Ecological Society.
IDESSA: An Integrative Decision Support System for Sustainable Rangeland Management in Southern African Savannas

NASA Astrophysics Data System (ADS)

Meyer, Hanna; Authmann, Christian; Dreber, Niels; Hess, Bastian; Kellner, Klaus; Morgenthal, Theunis; Nauss, Thomas; Seeger, Bernhard; Tsvuura, Zivanai; Wiegand, Kerstin

2017-04-01

Bush encroachment is a syndrome of land degradation that occurs in many savannas including those of southern Africa. The increase in density, cover or biomass of woody vegetation often has negative effects on a range of ecosystem functions and services, which are hardly reversible. However, despite its importance, neither the causes of bush encroachment, nor the consequences of different resource management strategies to combat or mitigate related shifts in savanna states are fully understood. The project "IDESSA" (An Integrative Decision Support System for Sustainable Rangeland Management in Southern African Savannas) aims to improve the understanding of the complex interplays between land use, climate patterns and vegetation dynamics and to implement an integrative monitoring and decision-support system for the sustainable management of different savanna types. For this purpose, IDESSA follows an innovative approach that integrates local knowledge, botanical surveys, remote-sensing and machine-learning based time-series of atmospheric and land-cover dynamics, spatially explicit simulation modeling and analytical database management. The integration of the heterogeneous data will be implemented in a user oriented database infrastructure and scientific workflow system. Accessible via web-based interfaces, this database and analysis system will allow scientists to manage and analyze monitoring data and scenario computations, as well as allow stakeholders (e. g. land users, policy makers) to retrieve current ecosystem information and seasonal outlooks. We present the concept of the project and show preliminary results of the realization steps towards the integrative savanna management and decision-support system.
The insertional history of an active family of L1 retrotransposons in humans.

PubMed

Boissinot, Stéphane; Entezam, Ali; Young, Lynn; Munson, Peter J; Furano, Anthony V

2004-07-01

As humans contain a currently active L1 (LINE-1) non-LTR retrotransposon family (Ta-1), the human genome database likely provides only a partial picture of Ta-1-generated diversity. We used a non-biased method to clone Ta-1 retrotransposon-containing loci from representatives of four ethnic populations. We obtained 277 distinct Ta-1 loci and identified an additional 67 loci in the human genome database. This collection represents approximately 90% of the Ta-1 population in the individuals examined and is thus more representative of the insertional history of Ta-1 than the human genome database, which lacked approximately 40% of our cloned Ta-1 elements. As both polymorphic and fixed Ta-1 elements are as abundant in the GC-poor genomic regions as in ancestral L1 elements, the enrichment of L1 elements in GC-poor areas is likely due to insertional bias rather than selection. Although the chromosomal distribution of Ta-1 inserts is generally a function of chromosomal length and gene density, chromosome 4 significantly deviates from this pattern and has been much more hospitable to Ta-1 insertions than any other chromosome. Also, the intra-chromosomal distribution of Ta-1 elements is not uniform. Ta-1 elements tend to cluster, and the maximal gaps between Ta-1 inserts are larger than would be expected from a model of uniform random insertion. Copyright 2004 Cold Spring Harbor Laboratory Press ISSN

A Ruby API to query the Ensembl database for genomic features.

PubMed

Strozzi, Francesco; Aerts, Jan

2011-04-01

The Ensembl database makes genomic features available via its Genome Browser. It is also possible to access the underlying data through a Perl API for advanced querying. We have developed a full-featured Ruby API to the Ensembl databases, providing the same functionality as the Perl interface with additional features. A single Ruby API is used to access different releases of the Ensembl databases and is also able to query multi-species databases. Most functionality of the API is provided using the ActiveRecord pattern. The library depends on introspection to make it release independent. The API is available through the Rubygem system and can be installed with the command gem install ruby-ensembl-api.
Sprawl in European urban areas

NASA Astrophysics Data System (ADS)

Prastacos, Poulicos; Lagarias, Apostolos

2016-08-01

In this paper the 2006 edition of the Urban Atlas database is used to tabulate areas of low development density, usually referred to as "sprawl", for many European cities. The Urban Atlas database contains information on the land use distribution in the 305 largest European cities. Twenty different land use types are recognized, with six of them representing urban fabric. Urban fabric classes are residential areas differentiated by the density of development, which is measured by the sealing degree parameter that ranges from 0% to 100% (non-developed, fully developed). Analysis is performed on the distribution of the middle to low density areas defined as those with sealing degree less than 50%. Seven different country groups in which urban areas have similar sprawl characteristics are identified and some key characteristics of sprawl are discussed. Population of an urban area is another parameter considered in the analysis. Two spatial metrics, average patch size and mean distance to the nearest neighboring patch of the same class, are used to describe proximity/separation characteristics of sprawl in the urban areas of the seven groups.
Filling the gap in functional trait databases: use of ecological hypotheses to replace missing data.

PubMed

Taugourdeau, Simon; Villerd, Jean; Plantureux, Sylvain; Huguenin-Elie, Olivier; Amiaud, Bernard

2014-04-01

Functional trait databases are powerful tools in ecology, though most of them contain large amounts of missing values. The goal of this study was to test the effect of imputation methods on the evaluation of trait values at species level and on the subsequent calculation of functional diversity indices at community level using functional trait databases. Two simple imputation methods (average and median), two methods based on ecological hypotheses, and one multiple imputation method were tested using a large plant trait database, together with the influence of the percentage of missing data and differences between functional traits. At community level, the complete-case approach and three functional diversity indices calculated from grassland plant communities were included. At the species level, one of the methods based on ecological hypothesis was for all traits more accurate than imputation with average or median values, but the multiple imputation method was superior for most of the traits. The method based on functional proximity between species was the best method for traits with an unbalanced distribution, while the method based on the existence of relationships between traits was the best for traits with a balanced distribution. The ranking of the grassland communities for their functional diversity indices was not robust with the complete-case approach, even for low percentages of missing data. With the imputation methods based on ecological hypotheses, functional diversity indices could be computed with a maximum of 30% of missing data, without affecting the ranking between grassland communities. The multiple imputation method performed well, but not better than single imputation based on ecological hypothesis and adapted to the distribution of the trait values for the functional identity and range of the communities. Ecological studies using functional trait databases have to deal with missing data using imputation methods corresponding to their specific needs and making the most out of the information available in the databases. Within this framework, this study indicates the possibilities and limits of single imputation methods based on ecological hypothesis and concludes that they could be useful when studying the ranking of communities for their functional diversity indices.
Filling the gap in functional trait databases: use of ecological hypotheses to replace missing data

PubMed Central

Taugourdeau, Simon; Villerd, Jean; Plantureux, Sylvain; Huguenin-Elie, Olivier; Amiaud, Bernard

2014-01-01

Functional trait databases are powerful tools in ecology, though most of them contain large amounts of missing values. The goal of this study was to test the effect of imputation methods on the evaluation of trait values at species level and on the subsequent calculation of functional diversity indices at community level using functional trait databases. Two simple imputation methods (average and median), two methods based on ecological hypotheses, and one multiple imputation method were tested using a large plant trait database, together with the influence of the percentage of missing data and differences between functional traits. At community level, the complete-case approach and three functional diversity indices calculated from grassland plant communities were included. At the species level, one of the methods based on ecological hypothesis was for all traits more accurate than imputation with average or median values, but the multiple imputation method was superior for most of the traits. The method based on functional proximity between species was the best method for traits with an unbalanced distribution, while the method based on the existence of relationships between traits was the best for traits with a balanced distribution. The ranking of the grassland communities for their functional diversity indices was not robust with the complete-case approach, even for low percentages of missing data. With the imputation methods based on ecological hypotheses, functional diversity indices could be computed with a maximum of 30% of missing data, without affecting the ranking between grassland communities. The multiple imputation method performed well, but not better than single imputation based on ecological hypothesis and adapted to the distribution of the trait values for the functional identity and range of the communities. Ecological studies using functional trait databases have to deal with missing data using imputation methods corresponding to their specific needs and making the most out of the information available in the databases. Within this framework, this study indicates the possibilities and limits of single imputation methods based on ecological hypothesis and concludes that they could be useful when studying the ranking of communities for their functional diversity indices. PMID:24772273
Dynamic Structure Factor: An Introduction

NASA Astrophysics Data System (ADS)

Sturm, K.

1993-02-01

The doubly differential cross-section for weak inelastic scattering of waves or particles by manybody systems is derived in Born approximation and expressed in terms of the dynamic structure factor according to van Hove. The application of this very general scheme to scattering of neutrons, x-rays and high-energy electrons is discussed briefly. The dynamic structure factor, which is the space and time Fourier transform of the density-density correlation function, is a property of the many-body system independent of the external probe and carries information on the excitation spectrum of the system. The relation of the electronic structure factor to the density-density response function defined in linear-response theory is shown using the fluctuation-dissipation theorem. This is important for calculations, since the response function can be calculated approximately from the independent-particle response function in self-consistent field approximations, such as the random-phase approximation or the local-density approximation of the density functional theory. Since the density-density response function also determines the dielectric function, the dynamic structure can be expressed by the dielectric function.
Scalable population estimates using spatial-stream-network (SSN) models, fish density surveys, and national geospatial database frameworks for streams

Treesearch

Daniel J. Isaak; Jay M. Ver Hoef; Erin E. Peterson; Dona L. Horan; David E. Nagel

2017-01-01

Population size estimates for stream fishes are important for conservation and management, but sampling costs limit the extent of most estimates to small portions of river networks that encompass 100sâ10 000s of linear kilometres. However, the advent of large fish density data sets, spatial-stream-network (SSN) models that benefit from nonindependence among samples,...
Comparing ab initio density-functional and wave function theories: the impact of correlation on the electronic density and the role of the correlation potential.

PubMed

Grabowski, Ireneusz; Teale, Andrew M; Śmiga, Szymon; Bartlett, Rodney J

2011-09-21

The framework of ab initio density-functional theory (DFT) has been introduced as a way to provide a seamless connection between the Kohn-Sham (KS) formulation of DFT and wave-function based ab initio approaches [R. J. Bartlett, I. Grabowski, S. Hirata, and S. Ivanov, J. Chem. Phys. 122, 034104 (2005)]. Recently, an analysis of the impact of dynamical correlation effects on the density of the neon atom was presented [K. Jankowski, K. Nowakowski, I. Grabowski, and J. Wasilewski, J. Chem. Phys. 130, 164102 (2009)], contrasting the behaviour for a variety of standard density functionals with that of ab initio approaches based on second-order Møller-Plesset (MP2) and coupled cluster theories at the singles-doubles (CCSD) and singles-doubles perturbative triples [CCSD(T)] levels. In the present work, we consider ab initio density functionals based on second-order many-body perturbation theory and coupled cluster perturbation theory in a similar manner, for a range of small atomic and molecular systems. For comparison, we also consider results obtained from MP2, CCSD, and CCSD(T) calculations. In addition to this density based analysis, we determine the KS correlation potentials corresponding to these densities and compare them with those obtained for a range of ab initio density functionals via the optimized effective potential method. The correlation energies, densities, and potentials calculated using ab initio DFT display a similar systematic behaviour to those derived from electronic densities calculated using ab initio wave function theories. In contrast, typical explicit density functionals for the correlation energy, such as VWN5 and LYP, do not show behaviour consistent with this picture of dynamical correlation, although they may provide some degree of correction for already erroneous explicitly density-dependent exchange-only functionals. The results presented here using orbital dependent ab initio density functionals show that they provide a treatment of exchange and correlation contributions within the KS framework that is more consistent with traditional ab initio wave function based methods.
Local and linear chemical reactivity response functions at finite temperature in density functional theory

DOE Office of Scientific and Technical Information (OSTI.GOV)

Franco-Pérez, Marco, E-mail: francopj@mcmaster.ca, E-mail: ayers@mcmaster.ca, E-mail: jlgm@xanum.uam.mx, E-mail: avela@cinvestav.mx; Departamento de Química, Universidad Autónoma Metropolitana-Iztapalapa, Av. San Rafael Atlixco 186, México, D.F. 09340; Ayers, Paul W., E-mail: francopj@mcmaster.ca, E-mail: ayers@mcmaster.ca, E-mail: jlgm@xanum.uam.mx, E-mail: avela@cinvestav.mx

2015-12-28

We explore the local and nonlocal response functions of the grand canonical potential density functional at nonzero temperature. In analogy to the zero-temperature treatment, local (e.g., the average electron density and the local softness) and nonlocal (e.g., the softness kernel) intrinsic response functions are defined as partial derivatives of the grand canonical potential with respect to its thermodynamic variables (i.e., the chemical potential of the electron reservoir and the external potential generated by the atomic nuclei). To define the local and nonlocal response functions of the electron density (e.g., the Fukui function, the linear density response function, and the dualmore » descriptor), we differentiate with respect to the average electron number and the external potential. The well-known mathematical relationships between the intrinsic response functions and the electron-density responses are generalized to nonzero temperature, and we prove that in the zero-temperature limit, our results recover well-known identities from the density functional theory of chemical reactivity. Specific working equations and numerical results are provided for the 3-state ensemble model.« less
Local and linear chemical reactivity response functions at finite temperature in density functional theory.

PubMed

Franco-Pérez, Marco; Ayers, Paul W; Gázquez, José L; Vela, Alberto

2015-12-28

We explore the local and nonlocal response functions of the grand canonical potential density functional at nonzero temperature. In analogy to the zero-temperature treatment, local (e.g., the average electron density and the local softness) and nonlocal (e.g., the softness kernel) intrinsic response functions are defined as partial derivatives of the grand canonical potential with respect to its thermodynamic variables (i.e., the chemical potential of the electron reservoir and the external potential generated by the atomic nuclei). To define the local and nonlocal response functions of the electron density (e.g., the Fukui function, the linear density response function, and the dual descriptor), we differentiate with respect to the average electron number and the external potential. The well-known mathematical relationships between the intrinsic response functions and the electron-density responses are generalized to nonzero temperature, and we prove that in the zero-temperature limit, our results recover well-known identities from the density functional theory of chemical reactivity. Specific working equations and numerical results are provided for the 3-state ensemble model.
Improving the Automatic Inversion of Digital Alouette/ISIS Ionogram Reflection Traces into Topside Electron Density Profiles

NASA Technical Reports Server (NTRS)

Benson, Robert F.; Truhlik, Vladimir; Huang, Xueqin; Wang, Yongli; Bilitza, Dieter

2012-01-01

The topside sounders of the International Satellites for Ionospheric Studies (ISIS) program were designed as analog systems. The resulting ionograms were displayed on 35 mm film for analysis by visual inspection. Each of these satellites, launched between 1962 and 1971, produced data for 10 to 20 years. A number of the original telemetry tapes from this large data set have been converted directly into digital records. Software, known as the Topside Ionogram Scalar With True-Height (TOPIST) algorithm, has been produced and used for the automatic inversion of the ionogram reflection traces on more than 100,000 ISIS-2 digital topside ionograms into topside vertical electron density profiles Ne(h). Here we present some topside ionospheric solar cycle variations deduced from the TOPIST database to illustrate the scientific benefit of improving and expanding the topside ionospheric Ne(h) database. The profile improvements will be based on improvements in the TOPIST software motivated by direct comparisons between TOPIST profiles and profiles produced by manual scaling in the early days of the ISIS program. The database expansion will be based on new software designed to overcome limitations in the original digital topside ionogram database caused by difficulties encountered during the analog-to-digital conversion process in the detection of the ionogram frame sync pulse and/or the frequency markers. This improved and expanded TOPIST topside Ne(h) database will greatly enhance investigations into both short- and long-term ionospheric changes, e.g., the observed topside ionospheric responses to magnetic storms, induced by interplanetary magnetic clouds, and solar cycle variations, respectively.
Concordance of Commercial Data Sources for Neighborhood-Effects Studies

PubMed Central

Schootman, Mario

2010-01-01

Growing evidence supports a relationship between neighborhood-level characteristics and important health outcomes. One source of neighborhood data includes commercial databases integrated with geographic information systems to measure availability of certain types of businesses or destinations that may have either favorable or adverse effects on health outcomes; however, the quality of these data sources is generally unknown. This study assessed the concordance of two commercial databases for ascertaining the presence, locations, and characteristics of businesses. Businesses in the St. Louis, Missouri area were selected based on their four-digit Standard Industrial Classification (SIC) codes and classified into 14 business categories. Business listings in the two commercial databases were matched by standardized business name within specified distances. Concordance and coverage measures were calculated using capture–recapture methods for all businesses and by business type, with further stratification by census-tract-level population density, percent below poverty, and racial composition. For matched listings, distance between listings and agreement in four-digit SIC code, sales volume, and employee size were calculated. Overall, the percent agreement was 32% between the databases. Concordance and coverage estimates were lowest for health-care facilities and leisure/entertainment businesses; highest for popular walking destinations, eating places, and alcohol/tobacco establishments; and varied somewhat by population density. The mean distance (SD) between matched listings was 108.2 (179.0) m with varying levels of agreement in four-digit SIC (percent agreement = 84.6%), employee size (weighted kappa = 0.63), and sales volume (weighted kappa = 0.04). Researchers should cautiously interpret findings when using these commercial databases to yield measures of the neighborhood environment. PMID:20480397
Multiconfiguration pair-density functional theory: barrier heights and main group and transition metal energetics.

PubMed

Carlson, Rebecca K; Li Manni, Giovanni; Sonnenberger, Andrew L; Truhlar, Donald G; Gagliardi, Laura

2015-01-13

Kohn-Sham density functional theory, resting on the representation of the electronic density and kinetic energy by a single Slater determinant, has revolutionized chemistry, but for open-shell systems, the Kohn-Sham Slater determinant has the wrong symmetry properties as compared to an accurate wave function. We have recently proposed a theory, called multiconfiguration pair-density functional theory (MC-PDFT), in which the electronic kinetic energy and classical Coulomb energy are calculated from a multiconfiguration wave function with the correct symmetry properties, and the rest of the energy is calculated from a density functional, called the on-top density functional, that depends on the density and the on-top pair density calculated from this wave function. We also proposed a simple way to approximate the on-top density functional by translation of Kohn-Sham exchange-correlation functionals. The method is much less expensive than other post-SCF methods for calculating the dynamical correlation energy starting with a multiconfiguration self-consistent-field wave function as the reference wave function, and initial tests of the theory were quite encouraging. Here, we provide a broader test of the theory by applying it to bond energies of main-group molecules and transition metal complexes, barrier heights and reaction energies for diverse chemical reactions, proton affinities, and the water dimerization energy. Averaged over 56 data points, the mean unsigned error is 3.2 kcal/mol for MC-PDFT, as compared to 6.9 kcal/mol for Kohn-Sham theory with a comparable density functional. MC-PDFT is more accurate on average than complete active space second-order perturbation theory (CASPT2) for main-group small-molecule bond energies, alkyl bond dissociation energies, transition-metal-ligand bond energies, proton affinities, and the water dimerization energy.
The PyCASSO database: spatially resolved stellar population properties for CALIFA galaxies

NASA Astrophysics Data System (ADS)

de Amorim, A. L.; García-Benito, R.; Cid Fernandes, R.; Cortijo-Ferrero, C.; González Delgado, R. M.; Lacerda, E. A. D.; López Fernández, R.; Pérez, E.; Vale Asari, N.

2017-11-01

The Calar Alto Legacy Integral Field Area (CALIFA) survey, a pioneer in integral field spectroscopy legacy projects, has fostered many studies exploring the information encoded on the spatially resolved data on gaseous and stellar features in the optical range of galaxies. We describe a value-added catalogue of stellar population properties for CALIFA galaxies analysed with the spectral synthesis code starlight and processed with the pycasso platform. Our public database (http://pycasso.ufsc.br/, mirror at http://pycasso.iaa.es/) comprises 445 galaxies from the CALIFA Data Release 3 with COMBO data. The catalogue provides maps for the stellar mass surface density, mean stellar ages and metallicities, stellar dust attenuation, star formation rates, and kinematics. Example applications both for individual galaxies and for statistical studies are presented to illustrate the power of this data set. We revisit and update a few of our own results on mass density radial profiles and on the local mass-metallicity relation. We also show how to employ the catalogue for new investigations, and show a pseudo Schmidt-Kennicutt relation entirely made with information extracted from the stellar continuum. Combinations to other databases are also illustrated. Among other results, we find a very good agreement between star formation rate surface densities derived from the stellar continuum and the H α emission. This public catalogue joins the scientific community's effort towards transparency and reproducibility, and will be useful for researchers focusing on (or complementing their studies with) stellar properties of CALIFA galaxies.
Geographical Distribution of Woody Biomass Carbon in Tropical Africa: An Updated Database for 2000 (NDP-055.2007, NDP-055b))

DOE Data Explorer

Gibbs, Holly K. [Center for Sustainability and the Global Environment (SAGE), University of Wisconsin, Madison, WI (USA); Brown, Sandra [Winrock International, Arlington, VA (USA); Olsen, L. M. [Carbon Dioxide Information Analysis Center (CDIAC), Oak Ridge National Laboratory, Oak Ridge, TN (USA); Boden, Thomas A. [Carbon Dioxide Information Analysis Center (CDIAC), Oak Ridge National Laboratory, Oak Ridge, TN (USA)

2007-09-01

Maps of biomass density are critical inputs for estimating carbon emissions from deforestation and degradation of tropical forests. Brown and Gatson (1996) pioneered methods to use GIS analysis to map forest biomass based on forest inventory data (ndp055). This database is an update of ndp055 (which represent conditions in circa 1980) and accounts for land cover changes occurring up to the year 2000.
Resources | Division of Cancer Prevention

Cancer.gov

Manual of Operations Version 3, 12/13/2012 (PDF, 162KB) Database Sources Consortium for Functional Glycomics databases Design Studies Related to the Development of Distributed, Web-based European Carbohydrate Databases (EUROCarbDB) |
New seismogenic stress fields for southern Italy from a Bayesian approach

NASA Astrophysics Data System (ADS)

Totaro, Cristina; Orecchio, Barbara; Presti, Debora; Scolaro, Silvia; Neri, Giancarlo

2017-04-01

A new database of high-quality waveform inversion focal mechanism has been compiled for southern Italy by integrating the highest quality solutions, available from literature and catalogues, and 146 newly-computed ones. All the selected focal mechanisms are (i) coming from the Italian CMT, Regional CMT and TDMT catalogues (Pondrelli et al., PEPI 2006, PEPI 2011; http://www.ingv.it), or (ii) computed by using the Cut And Paste (CAP) method (Zhao & Helmberger, BSSA 1994; Zhu & Helmberger, BSSA 1996). Specific tests have been carried out in order to evaluate the robustness of the obtained solutions (e.g., by varying both seismic network configuration and Earth structure parameters) and to estimate uncertainties on the focal mechanism parameters. Only the resulting highest-quality solutions have been enclosed in the database, that has then been used for computation of posterior density distributions of stress tensor components by a Bayesian method (Arnold & Townend, GJI 2007). This algorithm furnishes the posterior density function of the principal components of stress tensor (maximum σ1, intermediate σ2, and minimum σ3 compressive stress, respectively) and the stress-magnitude ratio (R). Before stress computation, we applied the k-means clustering algorithm to subdivide the focal mechanism catalog on the basis of earthquake locations. This approach allows identifying the sectors to be investigated without any "a priori" constraint from faulting type distribution. The large amount of data and the application of the Bayesian algorithm allowed us to provide a more accurate local-to-regional scale stress distribution that has shed new light on the kinematics and dynamics of this very complex area, where lithospheric unit configuration and geodynamic engines are still strongly debated. The new high-quality information here furnished will then represent very useful tools and constraints for future geophysical analyses and geodynamic modeling.
[Establishement for regional pelvic trauma database in Hunan Province].

PubMed

Cheng, Liang; Zhu, Yong; Long, Haitao; Yang, Junxiao; Sun, Buhua; Li, Kanghua

2017-04-28

To establish a database for pelvic trauma in Hunan Province, and to start the work of multicenter pelvic trauma registry.  Methods: To establish the database, literatures relevant to pelvic trauma were screened, the experiences from the established trauma database in China and abroad were learned, and the actual situations for pelvic trauma rescue in Hunan Province were considered. The database for pelvic trauma was established based on the PostgreSQL and the advanced programming language Java 1.6.  Results: The complex procedure for pelvic trauma rescue was described structurally. The contents for the database included general patient information, injurious condition, prehospital rescue, conditions in admission, treatment in hospital, status on discharge, diagnosis, classification, complication, trauma scoring and therapeutic effect. The database can be accessed through the internet by browser/servicer. The functions for the database include patient information management, data export, history query, progress report, video-image management and personal information management.  Conclusion: The database with whole life cycle pelvic trauma is successfully established for the first time in China. It is scientific, functional, practical, and user-friendly.
Identification of functional enolase genes of the silkworm Bombyx mori from public databases with a combination of dry and wet bench processes.

PubMed

Kikuchi, Akira; Nakazato, Takeru; Ito, Katsuhiko; Nojima, Yosui; Yokoyama, Takeshi; Iwabuchi, Kikuo; Bono, Hidemasa; Toyoda, Atsushi; Fujiyama, Asao; Sato, Ryoichi; Tabunoki, Hiroko

2017-01-13

Various insect species have been added to genomic databases over the years. Thus, researchers can easily obtain online genomic information on invertebrates and insects. However, many incorrectly annotated genes are included in these databases, which can prevent the correct interpretation of subsequent functional analyses. To address this problem, we used a combination of dry and wet bench processes to select functional genes from public databases. Enolase is an important glycolytic enzyme in all organisms. We used a combination of dry and wet bench processes to identify functional enolases in the silkworm Bombyx mori (BmEno). First, we detected five annotated enolases from public databases using a Hidden Markov Model (HMM) search, and then through cDNA cloning, Northern blotting, and RNA-seq analysis, we revealed three functional enolases in B. mori: BmEno1, BmEno2, and BmEnoC. BmEno1 contained a conserved key amino acid residue for metal binding and substrate binding in other species. However, BmEno2 and BmEnoC showed a change in this key amino acid. Phylogenetic analysis showed that BmEno2 and BmEnoC were distinct from BmEno1 and other enolases, and were distributed only in lepidopteran clusters. BmEno1 was expressed in all of the tissues used in our study. In contrast, BmEno2 was mainly expressed in the testis with some expression in the ovary and suboesophageal ganglion. BmEnoC was weakly expressed in the testis. Quantitative RT-PCR showed that the mRNA expression of BmEno2 and BmEnoC correlated with testis development; thus, BmEno2 and BmEnoC may be related to lepidopteran-specific spermiogenesis. We identified and characterized three functional enolases from public databases with a combination of dry and wet bench processes in the silkworm B. mori. In addition, we determined that BmEno2 and BmEnoC had species-specific functions. Our strategy could be helpful for the detection of minor genes and functional genes in non-model organisms from public databases.
Database constraints applied to metabolic pathway reconstruction tools.

PubMed

Vilaplana, Jordi; Solsona, Francesc; Teixido, Ivan; Usié, Anabel; Karathia, Hiren; Alves, Rui; Mateo, Jordi

2014-01-01

Our group developed two biological applications, Biblio-MetReS and Homol-MetReS, accessing the same database of organisms with annotated genes. Biblio-MetReS is a data-mining application that facilitates the reconstruction of molecular networks based on automated text-mining analysis of published scientific literature. Homol-MetReS allows functional (re)annotation of proteomes, to properly identify both the individual proteins involved in the process(es) of interest and their function. It also enables the sets of proteins involved in the process(es) in different organisms to be compared directly. The efficiency of these biological applications is directly related to the design of the shared database. We classified and analyzed the different kinds of access to the database. Based on this study, we tried to adjust and tune the configurable parameters of the database server to reach the best performance of the communication data link to/from the database system. Different database technologies were analyzed. We started the study with a public relational SQL database, MySQL. Then, the same database was implemented by a MapReduce-based database named HBase. The results indicated that the standard configuration of MySQL gives an acceptable performance for low or medium size databases. Nevertheless, tuning database parameters can greatly improve the performance and lead to very competitive runtimes.
Functions and Relations: Some Applications from Database Management for the Teaching of Classroom Mathematics.

ERIC Educational Resources Information Center

Hauge, Sharon K.

While functions and relations are important concepts in the teaching of mathematics, research suggests that many students lack an understanding and appreciation of these concepts. The present paper discusses an approach for teaching functions and relations that draws on the use of illustrations from database management. This approach has the…

MTF Database: A Repository of Students' Academic Performance Measurements for the Development of Techniques for Evaluating Team Functioning

ERIC Educational Resources Information Center

Hsiung, Chin-Min; Zheng, Xiang-Xiang

2015-01-01

The Measurements for Team Functioning (MTF) database contains a series of student academic performance measurements obtained at a national university in Taiwan. The measurements are acquired from unit tests and homework tests performed during a core mechanical engineering course, and provide an objective means of assessing the functioning of…
MPID-T2: a database for sequence-structure-function analyses of pMHC and TR/pMHC structures.

PubMed

Khan, Javed Mohammed; Cheruku, Harish Reddy; Tong, Joo Chuan; Ranganathan, Shoba

2011-04-15

Sequence-structure-function information is critical in understanding the mechanism of pMHC and TR/pMHC binding and recognition. A database for sequence-structure-function information on pMHC and TR/pMHC interactions, MHC-Peptide Interaction Database-TR version 2 (MPID-T2), is now available augmented with the latest PDB and IMGT/3Dstructure-DB data, advanced features and new parameters for the analysis of pMHC and TR/pMHC structures. http://biolinfo.org/mpid-t2. shoba.ranganathan@mq.edu.au Supplementary data are available at Bioinformatics online.
PANTHER: a browsable database of gene products organized by biological function, using curated protein family and subfamily classification.

PubMed

Thomas, Paul D; Kejariwal, Anish; Campbell, Michael J; Mi, Huaiyu; Diemer, Karen; Guo, Nan; Ladunga, Istvan; Ulitsky-Lazareva, Betty; Muruganujan, Anushya; Rabkin, Steven; Vandergriff, Jody A; Doremieux, Olivier

2003-01-01

The PANTHER database was designed for high-throughput analysis of protein sequences. One of the key features is a simplified ontology of protein function, which allows browsing of the database by biological functions. Biologist curators have associated the ontology terms with groups of protein sequences rather than individual sequences. Statistical models (Hidden Markov Models, or HMMs) are built from each of these groups. The advantage of this approach is that new sequences can be automatically classified as they become available. To ensure accurate functional classification, HMMs are constructed not only for families, but also for functionally distinct subfamilies. Multiple sequence alignments and phylogenetic trees, including curator-assigned information, are available for each family. The current version of the PANTHER database includes training sequences from all organisms in the GenBank non-redundant protein database, and the HMMs have been used to classify gene products across the entire genomes of human, and Drosophila melanogaster. The ontology terms and protein families and subfamilies, as well as Drosophila gene c;assifications, can be browsed and searched for free. Due to outstanding contractual obligations, access to human gene classifications and to protein family trees and multiple sequence alignments will temporarily require a nominal registration fee. PANTHER is publicly available on the web at http://panther.celera.com.
EVLncRNAs: a manually curated database for long non-coding RNAs validated by low-throughput experiments.

PubMed

Zhou, Bailing; Zhao, Huiying; Yu, Jiafeng; Guo, Chengang; Dou, Xianghua; Song, Feng; Hu, Guodong; Cao, Zanxia; Qu, Yuanxu; Yang, Yuedong; Zhou, Yaoqi; Wang, Jihua

2018-01-04

Long non-coding RNAs (lncRNAs) play important functional roles in various biological processes. Early databases were utilized to deposit all lncRNA candidates produced by high-throughput experimental and/or computational techniques to facilitate classification, assessment and validation. As more lncRNAs are validated by low-throughput experiments, several databases were established for experimentally validated lncRNAs. However, these databases are small in scale (with a few hundreds of lncRNAs only) and specific in their focuses (plants, diseases or interactions). Thus, it is highly desirable to have a comprehensive dataset for experimentally validated lncRNAs as a central repository for all of their structures, functions and phenotypes. Here, we established EVLncRNAs by curating lncRNAs validated by low-throughput experiments (up to 1 May 2016) and integrating specific databases (lncRNAdb, LncRANDisease, Lnc2Cancer and PLNIncRBase) with additional functional and disease-specific information not covered previously. The current version of EVLncRNAs contains 1543 lncRNAs from 77 species that is 2.9 times larger than the current largest database for experimentally validated lncRNAs. Seventy-four percent lncRNA entries are partially or completely new, comparing to all existing experimentally validated databases. The established database allows users to browse, search and download as well as to submit experimentally validated lncRNAs. The database is available at http://biophy.dzu.edu.cn/EVLncRNAs. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
EVLncRNAs: a manually curated database for long non-coding RNAs validated by low-throughput experiments

PubMed Central

Zhao, Huiying; Yu, Jiafeng; Guo, Chengang; Dou, Xianghua; Song, Feng; Hu, Guodong; Cao, Zanxia; Qu, Yuanxu

2018-01-01

Abstract Long non-coding RNAs (lncRNAs) play important functional roles in various biological processes. Early databases were utilized to deposit all lncRNA candidates produced by high-throughput experimental and/or computational techniques to facilitate classification, assessment and validation. As more lncRNAs are validated by low-throughput experiments, several databases were established for experimentally validated lncRNAs. However, these databases are small in scale (with a few hundreds of lncRNAs only) and specific in their focuses (plants, diseases or interactions). Thus, it is highly desirable to have a comprehensive dataset for experimentally validated lncRNAs as a central repository for all of their structures, functions and phenotypes. Here, we established EVLncRNAs by curating lncRNAs validated by low-throughput experiments (up to 1 May 2016) and integrating specific databases (lncRNAdb, LncRANDisease, Lnc2Cancer and PLNIncRBase) with additional functional and disease-specific information not covered previously. The current version of EVLncRNAs contains 1543 lncRNAs from 77 species that is 2.9 times larger than the current largest database for experimentally validated lncRNAs. Seventy-four percent lncRNA entries are partially or completely new, comparing to all existing experimentally validated databases. The established database allows users to browse, search and download as well as to submit experimentally validated lncRNAs. The database is available at http://biophy.dzu.edu.cn/EVLncRNAs. PMID:28985416
Levelling and merging of two discrete national-scale geochemical databases: A case study showing the surficial expression of metalliferous black shales

USGS Publications Warehouse

Smith, Steven M.; Neilson, Ryan T.; Giles, Stuart A.

2015-01-01

Government-sponsored, national-scale, soil and sediment geochemical databases are used to estimate regional and local background concentrations for environmental issues, identify possible anthropogenic contamination, estimate mineral endowment, explore for new mineral deposits, evaluate nutrient levels for agriculture, and establish concentration relationships with human or animal health. Because of these different uses, it is difficult for any single database to accommodate all the needs of each client. Smith et al. (2013, p. 168) reviewed six national-scale soil and sediment geochemical databases for the United States (U.S.) and, for each, evaluated “its appropriateness as a national-scale geochemical database and its usefulness for national-scale geochemical mapping.” Each of the evaluated databases has strengths and weaknesses that were listed in that review.Two of these U.S. national-scale geochemical databases are similar in their sample media and collection protocols but have different strengths—primarily sampling density and analytical consistency. This project was implemented to determine whether those databases could be merged to produce a combined dataset that could be used for mineral resource assessments. The utility of the merged database was tested to see whether mapped distributions could identify metalliferous black shales at a national scale.
Phylogenomics databases for facilitating functional genomics in rice.

PubMed

Jung, Ki-Hong; Cao, Peijian; Sharma, Rita; Jain, Rashmi; Ronald, Pamela C

2015-12-01

The completion of whole genome sequence of rice (Oryza sativa) has significantly accelerated functional genomics studies. Prior to the release of the sequence, only a few genes were assigned a function each year. Since sequencing was completed in 2005, the rate has exponentially increased. As of 2014, 1,021 genes have been described and added to the collection at The Overview of functionally characterized Genes in Rice online database (OGRO). Despite this progress, that number is still very low compared with the total number of genes estimated in the rice genome. One limitation to progress is the presence of functional redundancy among members of the same rice gene family, which covers 51.6 % of all non-transposable element-encoding genes. There remain a significant portion or rice genes that are not functionally redundant, as reflected in the recovery of loss-of-function mutants. To more accurately analyze functional redundancy in the rice genome, we have developed a phylogenomics databases for six large gene families in rice, including those for glycosyltransferases, glycoside hydrolases, kinases, transcription factors, transporters, and cytochrome P450 monooxygenases. In this review, we introduce key features and applications of these databases. We expect that they will serve as a very useful guide in the post-genomics era of research.
Magnetic-Field Density-Functional Theory (BDFT): Lessons from the Adiabatic Connection.

PubMed

Reimann, Sarah; Borgoo, Alex; Tellgren, Erik I; Teale, Andrew M; Helgaker, Trygve

2017-09-12

We study the effects of magnetic fields in the context of magnetic field density-functional theory (BDFT), where the energy is a functional of the electron density ρ and the magnetic field B. We show that this approach is a worthwhile alternative to current-density functional theory (CDFT) and may provide a viable route to the study of many magnetic phenomena using density-functional theory (DFT). The relationship between BDFT and CDFT is developed and clarified within the framework of the four-way correspondence of saddle functions and their convex and concave parents in convex analysis. By decomposing the energy into its Kohn-Sham components, we demonstrate that the magnetizability is mainly determined by those energy components that are related to the density. For existing density functional approximations, this implies that, for the magnetizability, improvements of the density will be more beneficial than introducing a magnetic-field dependence in the correlation functional. However, once a good charge density is achieved, we show that high accuracy is likely only obtainable by including magnetic-field dependence. We demonstrate that adiabatic-connection (AC) curves at different field strengths resemble one another closely provided each curve is calculated at the equilibrium geometry of that field strength. In contrast, if all AC curves are calculated at the equilibrium geometry of the field-free system, then the curves change strongly with increasing field strength due to the increasing importance of static correlation. This holds also for density functional approximations, for which we demonstrate that the main error encountered in the presence of a field is already present at zero field strength, indicating that density-functional approximations may be applied to systems in strong fields, without the need to treat additional static correlation.
Applications of density functional theory calculations to selected problems in hydrocarbon processing

NASA Astrophysics Data System (ADS)

Nabar, Rahul

Recent advances in theoretical techniques and computational hardware have made it possible to apply Density Functional Theory (DFT) methods to realistic problems in heterogeneous catalysis. Hydrocarbon processing is economically, and strategically, a very important industrial sector in today's world. In this thesis, we employ DFT methods to examine several important problems in hydrocarbon processing. Fischer Tropsch Synthesis (FTS) is a mature technology to convert synthesis gas derived from coal, natural-gas or biomass into liquid fuels, specifically diesel. Iron is an active FTS catalyst, but the absence of detailed reaction mechanisms make it difficult to maximize activity and optimize product distribution. We evaluate thermochemistry, kinetics and Rate Determining Steps (RDS) for Fischer Tropsch Synthesis on several models of Fe catalysts: Fe(110), Fe(211) and Pt promoted Fe(110). Our studies indicated that CO-dissociation is likely to be the RDS under most reaction conditions, but the DFT-calculated activation energy ( Ea) for direct CO dissociation was too large to explain the observed catalyst activity. Consequently we demonstrate that H-assisted CO-dissociation pathways are competitive with direct CO dissociation on both Co and Fe catalysts and could be responsible for a major fraction of the reaction flux (especially at high CO coverages). We then extend this alternative mechanistic model to closed-packed facets of nine transition metal catalysts (Fe, Co, Ni, Ru, Rh, Pd, Os, Ir and Pt). H-assisted CO dissociation offers a kinetically easier route on each of the metals studied. DFT methods are also applied to another problem from the petroleum industry: discovery of poison-resistant, bimetallic, alloy catalysts (poisons: C, S, CI, P). Our systematic screening studies identify several Near Surface Alloys (NSAs) that are expected to be highly poison-resistant yet stable and avoiding adsorbate induced reconstruction. Adsorption trends are also correlated with electronic structure. Eventually we extend this work to compile a database of Binding Energies for 17 adsorbates of catalytic interest on a set of 17 transition metals and their NSAs. Practical examples of how such a database, in conjunction with screening criteria, can be fruitfully utilized for rational catalyst design, are also provided.
47 CFR 15.713 - TV bands database.

Code of Federal Regulations, 2014 CFR

2014-10-01

... 47 Telecommunication 1 2014-10-01 2014-10-01 false TV bands database. 15.713 Section 15.713... TV bands database. Link to an amendment published at 79 FR 48536, August 15, 2014. (a) Purpose. The TV bands database serves the following functions: (1) To determine and provide to a TVBD, upon...
Tourism through Travel Club: A Database Project

ERIC Educational Resources Information Center

Pratt, Renée M. E.; Smatt, Cindi T.; Wynn, Donald E.

2017-01-01

This applied database exercise utilizes a scenario-based case study to teach the basics of Microsoft Access and database management in introduction to information systems and introduction to database course. The case includes background information on a start-up business (i.e., Carol's Travel Club), description of functional business requirements,…
NABIC marker database: A molecular markers information network of agricultural crops.

PubMed

Kim, Chang-Kug; Seol, Young-Joo; Lee, Dong-Jun; Jeong, In-Seon; Yoon, Ung-Han; Lee, Gang-Seob; Hahn, Jang-Ho; Park, Dong-Suk

2013-01-01

In 2013, National Agricultural Biotechnology Information Center (NABIC) reconstructs a molecular marker database for useful genetic resources. The web-based marker database consists of three major functional categories: map viewer, RSN marker and gene annotation. It provides 7250 marker locations, 3301 RSN marker property, 3280 molecular marker annotation information in agricultural plants. The individual molecular marker provides information such as marker name, expressed sequence tag number, gene definition and general marker information. This updated marker-based database provides useful information through a user-friendly web interface that assisted in tracing any new structures of the chromosomes and gene positional functions using specific molecular markers. The database is available for free at http://nabic.rda.go.kr/gere/rice/molecularMarkers/
On the use of the noncentral chi-square density function for the distribution of helicopter spectral estimates

NASA Technical Reports Server (NTRS)

Garber, Donald P.

1993-01-01

A probability density function for the variability of ensemble averaged spectral estimates from helicopter acoustic signals in Gaussian background noise was evaluated. Numerical methods for calculating the density function and for determining confidence limits were explored. Density functions were predicted for both synthesized and experimental data and compared with observed spectral estimate variability.
Human Mitochondrial Protein Database

National Institute of Standards and Technology Data Gateway

SRD 131 Human Mitochondrial Protein Database (Web, free access) The Human Mitochondrial Protein Database (HMPDb) provides comprehensive data on mitochondrial and human nuclear encoded proteins involved in mitochondrial biogenesis and function. This database consolidates information from SwissProt, LocusLink, Protein Data Bank (PDB), GenBank, Genome Database (GDB), Online Mendelian Inheritance in Man (OMIM), Human Mitochondrial Genome Database (mtDB), MITOMAP, Neuromuscular Disease Center and Human 2-D PAGE Databases. This database is intended as a tool not only to aid in studying the mitochondrion but in studying the associated diseases.
Patterns of lymph node sampling and the impact of lymph node density in favorable histology Wilms tumor: An analysis of the national cancer database.

PubMed

Saltzman, A F; Carrasco, A; Amini, A; Aldrink, J H; Dasgupta, R; Gow, K W; Glick, R D; Ehrlich, P F; Cost, N G

2018-04-01

There is controversy about the role of lymph node (LN) sampling or dissection in the management of favorable histology (FH) Wilms tumor (WT), specifically how it performed and how it may impact survival. The objective of this study was to analyze factors affecting LN sampling patterns and the impact of LN yield and density (number of positive LNs/LNs examined) on overall survival (OS) in patients with advanced-stage favorable histology Wilms tumor (FHWT). The National Cancer Database (NCDB) was queried for patients with FHWT during 2004-2013. Demographic, clinical and OS data were abstracted for those who underwent surgical resection. Poisson regression was performed to analyze how factors influenced LN yield. Patients with positive LNs had LN density calculated and were further analyzed. A total of 2340 patients met criteria, with a median age at diagnosis of 3 years (range 0-78 years). The median number of LNs examined was three (range 0-87). Lymph node yield was affected by age, race, insurance, tumor size, laterality, advanced stage, LN positivity, and institutional volume. A total of 390 (16.6%) patients had LN-positive disease. Median LN density for these LN-positive patients was 0.38 (range 0.02-1) (Summary Figure). Estimated 5-year OS was significantly improved for those with LN density ≤0.38 vs. >0.38 (94% vs. 84.6%, P = 0.012). In this population, on multivariate analysis, age and LN density were significant predictors of OS. It is difficult to compile large numbers of cases in rare diseases like WT, and fortunately a large administrative database such as the NCDB can serve as a great resource. However, administrative data come with inherent limitations such as missing data and inability to account for a variety of factors that may influence LN yield and/or OS (specimen designation, pathologist experience, surgeon experience/volume, institutional Children's Oncology Group (COG) association, etc.). In this specific disease, the American Joint Committee on Cancer staging (captured by the NCDB) is different than the COG WT staging system that is used clinically, and the NCDB does not capture oncologic outcomes beyond OS. In a review of the NCDB, various factors associated with LN yield and observed LN density were identified to be significantly associated with OS in patients with LN-positive FHWT. This reinforces the need for adequate LN sampling at the time of WT surgery, to maximize surgical disease control. It was proposed that LN density as a metric may allow for improved risk-stratification, and possibly allow for therapeutic reduction in a sub-set of patients with low LN density. Copyright © 2017 Journal of Pediatric Urology Company. Published by Elsevier Ltd. All rights reserved.
Stretched hydrogen molecule from a constrained-search density-functional perspective

DOE Office of Scientific and Technical Information (OSTI.GOV)

Valone, Steven M; Levy, Mel

2009-01-01

Constrained-search density functional theory gives valuable insights into the fundamentals of density functional theory. It provides exact results and bounds on the ground- and excited-state density functionals. An important advantage of the theory is that it gives guidance in the construction of functionals. Here they engage constrained search theory to explore issues associated with the functional behavior of 'stretched bonds' in molecular hydrogen. A constrained search is performed with familiar valence bond wavefunctions ordinarily used to describe molecular hydrogen. The effective, one-electron hamiltonian is computed and compared to the corresponding uncorrelated, Hartree-Fock effective hamiltonian. Analysis of the functional suggests themore » need to construct different functionals for the same density and to allow a competition among these functions. As a result the correlation energy functional is composed explicitly of energy gaps from the different functionals.« less
Building a Better Model: A Comprehensive Breast Cancer Risk Model Incorporating Breast Density to Stratify Risk and Apply Resources

DTIC Science & Technology

2014-10-01

variability with well trained readers. Figure 7: comparison between the PD (percent density using Cumulus area) and the automatic PD. The...evaluation of outlier correction, comparison of several different software methods, precision measurement, and evaluation of variation by mammography...chart review for selected cases (month 4-6). Comparison of information from the Breast Cancer Database and medical records showed good consistency
RICD: a rice indica cDNA database resource for rice functional genomics.

PubMed

Lu, Tingting; Huang, Xuehui; Zhu, Chuanrang; Huang, Tao; Zhao, Qiang; Xie, Kabing; Xiong, Lizhong; Zhang, Qifa; Han, Bin

2008-11-26

The Oryza sativa L. indica subspecies is the most widely cultivated rice. During the last few years, we have collected over 20,000 putative full-length cDNAs and over 40,000 ESTs isolated from various cDNA libraries of two indica varieties Guangluai 4 and Minghui 63. A database of the rice indica cDNAs was therefore built to provide a comprehensive web data source for searching and retrieving the indica cDNA clones. Rice Indica cDNA Database (RICD) is an online MySQL-PHP driven database with a user-friendly web interface. It allows investigators to query the cDNA clones by keyword, genome position, nucleotide or protein sequence, and putative function. It also provides a series of information, including sequences, protein domain annotations, similarity search results, SNPs and InDels information, and hyperlinks to gene annotation in both The Rice Annotation Project Database (RAP-DB) and The TIGR Rice Genome Annotation Resource, expression atlas in RiceGE and variation report in Gramene of each cDNA. The online rice indica cDNA database provides cDNA resource with comprehensive information to researchers for functional analysis of indica subspecies and for comparative genomics. The RICD database is available through our website http://www.ncgr.ac.cn/ricd.
PSD Applicability: TEX-USS High Density Polyethylene Plant

EPA Pesticide Factsheets

This document may be of assistance in applying the New Source Review (NSR) air permitting regulations including the Prevention of Significant Deterioration (PSD) requirements. This document is part of the NSR Policy and Guidance Database. Some documents in the database are a scanned or retyped version of a paper photocopy of the original. Although we have taken considerable effort to quality assure the documents, some may contain typographical errors. Contact the office that issued the document if you need a copy of the original.
dbWFA: a web-based database for functional annotation of Triticum aestivum transcripts

PubMed Central

Vincent, Jonathan; Dai, Zhanwu; Ravel, Catherine; Choulet, Frédéric; Mouzeyar, Said; Bouzidi, M. Fouad; Agier, Marie; Martre, Pierre

2013-01-01

The functional annotation of genes based on sequence homology with genes from model species genomes is time-consuming because it is necessary to mine several unrelated databases. The aim of the present work was to develop a functional annotation database for common wheat Triticum aestivum (L.). The database, named dbWFA, is based on the reference NCBI UniGene set, an expressed gene catalogue built by expressed sequence tag clustering, and on full-length coding sequences retrieved from the TriFLDB database. Information from good-quality heterogeneous sources, including annotations for model plant species Arabidopsis thaliana (L.) Heynh. and Oryza sativa L., was gathered and linked to T. aestivum sequences through BLAST-based homology searches. Even though the complexity of the transcriptome cannot yet be fully appreciated, we developed a tool to easily and promptly obtain information from multiple functional annotation systems (Gene Ontology, MapMan bin codes, MIPS Functional Categories, PlantCyc pathway reactions and TAIR gene families). The use of dbWFA is illustrated here with several query examples. We were able to assign a putative function to 45% of the UniGenes and 81% of the full-length coding sequences from TriFLDB. Moreover, comparison of the annotation of the whole T. aestivum UniGene set along with curated annotations of the two model species assessed the accuracy of the annotation provided by dbWFA. To further illustrate the use of dbWFA, genes specifically expressed during the early cell division or late storage polymer accumulation phases of T. aestivum grain development were identified using a clustering analysis and then annotated using dbWFA. The annotation of these two sets of genes was consistent with previous analyses of T. aestivum grain transcriptomes and proteomes. Database URL: urgi.versailles.inra.fr/dbWFA/ PMID:23660284

MultitaskProtDB: a database of multitasking proteins.

PubMed

Hernández, Sergio; Ferragut, Gabriela; Amela, Isaac; Perez-Pons, JosepAntoni; Piñol, Jaume; Mozo-Villarias, Angel; Cedano, Juan; Querol, Enrique

2014-01-01

We have compiled MultitaskProtDB, available online at http://wallace.uab.es/multitask, to provide a repository where the many multitasking proteins found in the literature can be stored. Multitasking or moonlighting is the capability of some proteins to execute two or more biological functions. Usually, multitasking proteins are experimentally revealed by serendipity. This ability of proteins to perform multitasking functions helps us to understand one of the ways used by cells to perform many complex functions with a limited number of genes. Even so, the study of this phenomenon is complex because, among other things, there is no database of moonlighting proteins. The existence of such a tool facilitates the collection and dissemination of these important data. This work reports the database, MultitaskProtDB, which is designed as a friendly user web page containing >288 multitasking proteins with their NCBI and UniProt accession numbers, canonical and additional biological functions, monomeric/oligomeric states, PDB codes when available and bibliographic references. This database also serves to gain insight into some characteristics of multitasking proteins such as frequencies of the different pairs of functions, phylogenetic conservation and so forth.
Active Space Dependence in Multiconfiguration Pair-Density Functional Theory.

PubMed

Sharma, Prachi; Truhlar, Donald G; Gagliardi, Laura

2018-02-13

In multiconfiguration pair-density functional theory (MC-PDFT), multiconfiguration self-consistent-field calculations and on-top density functionals are combined to describe both static and dynamic correlation. Here, we investigate how the MC-PDFT total energy and its components depend on the active space choice in the case of the H 2 and N 2 molecules. The active space dependence of the on-top pair density, the total density, the ratio of on-top pair density to half the square of the electron density, and the satisfaction of the virial theorem are also explored. We find that the density and on-top pair density do not change significantly with changes in the active space. However, the on-top ratio does change significantly with respect to active space change, and this affects the on-top energy. This study provides a foundation for designing on-top density functionals and automatizing the active space choice in MC-PDFT.
Rice proteome database: a step toward functional analysis of the rice genome.

PubMed

Komatsu, Setsuko

2005-09-01

The technique of proteome analysis using two-dimensional polyacrylamide gel electrophoresis (2D-PAGE) has the power to monitor global changes that occur in the protein complement of tissues and subcellular compartments. In this study, the proteins of rice were cataloged, a rice proteome database was constructed, and a functional characterization of some of the identified proteins was undertaken. Proteins extracted from various tissues and subcellular compartments in rice were separated by 2D-PAGE and an image analyzer was used to construct a display of the proteins. The Rice Proteome Database contains 23 reference maps based on 2D-PAGE of proteins from various rice tissues and subcellular compartments. These reference maps comprise 13129 identified proteins, and the amino acid sequences of 5092 proteins are entered in the database. Major proteins involved in growth or stress responses were identified using the proteome approach. Some of these proteins, including a beta-tubulin, calreticulin, and ribulose-1,5-bisphosphate carboxylase/oxygenase activase in rice, have unexpected functions. The information obtained from the Rice Proteome Database will aid in cloning the genes for and predicting the function of unknown proteins.
Ensemble gene function prediction database reveals genes important for complex I formation in Arabidopsis thaliana.

PubMed

Hansen, Bjoern Oest; Meyer, Etienne H; Ferrari, Camilla; Vaid, Neha; Movahedi, Sara; Vandepoele, Klaas; Nikoloski, Zoran; Mutwil, Marek

2018-03-01

Recent advances in gene function prediction rely on ensemble approaches that integrate results from multiple inference methods to produce superior predictions. Yet, these developments remain largely unexplored in plants. We have explored and compared two methods to integrate 10 gene co-function networks for Arabidopsis thaliana and demonstrate how the integration of these networks produces more accurate gene function predictions for a larger fraction of genes with unknown function. These predictions were used to identify genes involved in mitochondrial complex I formation, and for five of them, we confirmed the predictions experimentally. The ensemble predictions are provided as a user-friendly online database, EnsembleNet. The methods presented here demonstrate that ensemble gene function prediction is a powerful method to boost prediction performance, whereas the EnsembleNet database provides a cutting-edge community tool to guide experimentalists. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.
Multicomponent density functional theory embedding formulation.

PubMed

Culpitt, Tanner; Brorsen, Kurt R; Pak, Michael V; Hammes-Schiffer, Sharon

2016-07-28

Multicomponent density functional theory (DFT) methods have been developed to treat two types of particles, such as electrons and nuclei, quantum mechanically at the same level. In the nuclear-electronic orbital (NEO) approach, all electrons and select nuclei, typically key protons, are treated quantum mechanically. For multicomponent DFT methods developed within the NEO framework, electron-proton correlation functionals based on explicitly correlated wavefunctions have been designed and used in conjunction with well-established electronic exchange-correlation functionals. Herein a general theory for multicomponent embedded DFT is developed to enable the accurate treatment of larger systems. In the general theory, the total electronic density is separated into two subsystem densities, denoted as regular and special, and different electron-proton correlation functionals are used for these two electronic densities. In the specific implementation, the special electron density is defined in terms of spatially localized Kohn-Sham electronic orbitals, and electron-proton correlation is included only for the special electron density. The electron-proton correlation functional depends on only the special electron density and the proton density, whereas the electronic exchange-correlation functional depends on the total electronic density. This scheme includes the essential electron-proton correlation, which is a relatively local effect, as well as the electronic exchange-correlation for the entire system. This multicomponent DFT-in-DFT embedding theory is applied to the HCN and FHF(-) molecules in conjunction with two different electron-proton correlation functionals and three different electronic exchange-correlation functionals. The results illustrate that this approach provides qualitatively accurate nuclear densities in a computationally tractable manner. The general theory is also easily extended to other types of partitioning schemes for multicomponent systems.
Multicomponent density functional theory embedding formulation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Culpitt, Tanner; Brorsen, Kurt R.; Pak, Michael V.

Multicomponent density functional theory (DFT) methods have been developed to treat two types of particles, such as electrons and nuclei, quantum mechanically at the same level. In the nuclear-electronic orbital (NEO) approach, all electrons and select nuclei, typically key protons, are treated quantum mechanically. For multicomponent DFT methods developed within the NEO framework, electron-proton correlation functionals based on explicitly correlated wavefunctions have been designed and used in conjunction with well-established electronic exchange-correlation functionals. Herein a general theory for multicomponent embedded DFT is developed to enable the accurate treatment of larger systems. In the general theory, the total electronic density ismore » separated into two subsystem densities, denoted as regular and special, and different electron-proton correlation functionals are used for these two electronic densities. In the specific implementation, the special electron density is defined in terms of spatially localized Kohn-Sham electronic orbitals, and electron-proton correlation is included only for the special electron density. The electron-proton correlation functional depends on only the special electron density and the proton density, whereas the electronic exchange-correlation functional depends on the total electronic density. This scheme includes the essential electron-proton correlation, which is a relatively local effect, as well as the electronic exchange-correlation for the entire system. This multicomponent DFT-in-DFT embedding theory is applied to the HCN and FHF{sup −} molecules in conjunction with two different electron-proton correlation functionals and three different electronic exchange-correlation functionals. The results illustrate that this approach provides qualitatively accurate nuclear densities in a computationally tractable manner. The general theory is also easily extended to other types of partitioning schemes for multicomponent systems.« less
Non-random mate choice in humans: insights from a genome scan.

PubMed

Laurent, R; Toupance, B; Chaix, R

2012-02-01

Little is known about the genetic factors influencing mate choice in humans. Still, there is evidence for non-random mate choice with respect to physical traits. In addition, some studies suggest that the Major Histocompatibility Complex may affect pair formation. Nowadays, the availability of high density genomic data sets gives the opportunity to scan the genome for signatures of non-random mate choice without prior assumptions on which genes may be involved, while taking into account socio-demographic factors. Here, we performed a genome scan to detect extreme patterns of similarity or dissimilarity among spouses throughout the genome in three populations of African, European American, and Mexican origins from the HapMap 3 database. Our analyses identified genes and biological functions that may affect pair formation in humans, including genes involved in skin appearance, morphogenesis, immunity and behaviour. We found little overlap between the three populations, suggesting that the biological functions potentially influencing mate choice are population specific, in other words are culturally driven. Moreover, whenever the same functional category of genes showed a significant signal in two populations, different genes were actually involved, which suggests the possibility of evolutionary convergences. © 2011 Blackwell Publishing Ltd.
High Temperature, high pressure equation of state density correlations and viscosity correlations

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tapriyal, D.; Enick, R.; McHugh, M.

2012-07-31

Global increase in oil demand and depleting reserves has derived a need to find new oil resources. To find these untapped reservoirs, oil companies are exploring various remote and harsh locations such as deep waters in Gulf of Mexico, remote arctic regions, unexplored deep deserts, etc. Further, the depth of new oil/gas wells being drilled has increased considerably to tap these new resources. With the increase in the well depth, the bottomhole temperature and pressure are also increasing to extreme values (i.e. up to 500 F and 35,000 psi). The density and viscosity of natural gas and crude oil atmore » reservoir conditions are critical fundamental properties required for accurate assessment of the amount of recoverable petroleum within a reservoir and the modeling of the flow of these fluids within the porous media. These properties are also used to design appropriate drilling and production equipment such as blow out preventers, risers, etc. With the present state of art, there is no accurate database for these fluid properties at extreme conditions. As we have begun to expand this experimental database it has become apparent that there are neither equations of state for density or transport models for viscosity that can be used to predict these fundamental properties of multi-component hydrocarbon mixtures over a wide range of temperature and pressure. Presently, oil companies are using correlations based on lower temperature and pressure databases that exhibit an unsatisfactory predictive capability at extreme conditions (e.g. as great as {+-} 50%). From the perspective of these oil companies that are committed to safely producing these resources, accurately predicting flow rates, and assuring the integrity of the flow, the absence of an extensive experimental database at extreme conditions and models capable of predicting these properties over an extremely wide range of temperature and pressure (including extreme conditions) makes their task even more daunting.« less
Combining Density Functional Theory and Green's Function Theory: Range-Separated, Nonlocal, Dynamic, and Orbital-Dependent Hybrid Functional.

PubMed

Kananenka, Alexei A; Zgid, Dominika

2017-11-14

We present a rigorous framework which combines single-particle Green's function theory with density functional theory based on a separation of electron-electron interactions into short- and long-range components. Short-range contribution to the total energy and exchange-correlation potential is provided by a density functional approximation, while the long-range contribution is calculated using an explicit many-body Green's function method. Such a hybrid results in a nonlocal, dynamic, and orbital-dependent exchange-correlation functional of a single-particle Green's function. In particular, we present a range-separated hybrid functional called srSVWN5-lrGF2 which combines the local-density approximation and the second-order Green's function theory. We illustrate that similarly to density functional approximations, the new functional is weakly basis-set dependent. Furthermore, it offers an improved description of the short-range dynamic correlation. The many-body contribution to the functional mitigates the many-electron self-interaction error present in many density functional approximations and provides a better description of molecular properties. Additionally, we illustrate that the new functional can be used to scale down the self-energy and, therefore, introduce an additional sparsity to the self-energy matrix that in the future can be exploited in calculations for large molecules or periodic systems.
The Nuclear Protein Database (NPD): sub-nuclear localisation and functional annotation of the nuclear proteome

PubMed Central

Dellaire, G.; Farrall, R.; Bickmore, W.A.

2003-01-01

The Nuclear Protein Database (NPD) is a curated database that contains information on more than 1300 vertebrate proteins that are thought, or are known, to localise to the cell nucleus. Each entry is annotated with information on predicted protein size and isoelectric point, as well as any repeats, motifs or domains within the protein sequence. In addition, information on the sub-nuclear localisation of each protein is provided and the biological and molecular functions are described using Gene Ontology (GO) terms. The database is searchable by keyword, protein name, sub-nuclear compartment and protein domain/motif. Links to other databases are provided (e.g. Entrez, SWISS-PROT, OMIM, PubMed, PubMed Central). Thus, NPD provides a gateway through which the nuclear proteome may be explored. The database can be accessed at http://npd.hgu.mrc.ac.uk and is updated monthly. PMID:12520015
A Knowledge Database on Thermal Control in Manufacturing Processes

NASA Astrophysics Data System (ADS)

Hirasawa, Shigeki; Satoh, Isao

A prototype version of a knowledge database on thermal control in manufacturing processes, specifically, molding, semiconductor manufacturing, and micro-scale manufacturing has been developed. The knowledge database has search functions for technical data, evaluated benchmark data, academic papers, and patents. The database also displays trends and future roadmaps for research topics. It has quick-calculation functions for basic design. This paper summarizes present research topics and future research on thermal control in manufacturing engineering to collate the information to the knowledge database. In the molding process, the initial mold and melt temperatures are very important parameters. In addition, thermal control is related to many semiconductor processes, and the main parameter is temperature variation in wafers. Accurate in-situ temperature measurment of wafers is important. And many technologies are being developed to manufacture micro-structures. Accordingly, the knowledge database will help further advance these technologies.
New Directions in Library and Information Science Education. Final Report. Volume 2.6: Database Distributor/Service Professional Competencies.

ERIC Educational Resources Information Center

Griffiths, Jose-Marie; And Others

This document contains validated activities and competencies needed by librarians working in a database distributor/service organization. The activities of professionals working in database distributor/service organizations are listed by function: Database Processing; Customer Support; System Administration; and Planning. The competencies are…
Information Management Tools for Classrooms: Exploring Database Management Systems. Technical Report No. 28.

ERIC Educational Resources Information Center

Freeman, Carla; And Others

In order to understand how the database software or online database functioned in the overall curricula, the use of database management (DBMs) systems was studied at eight elementary and middle schools through classroom observation and interviews with teachers and administrators, librarians, and students. Three overall areas were addressed:…
78 FR 28756 - Defense Federal Acquisition Regulation Supplement: System for Award Management Name Changes...

Federal Register 2010, 2011, 2012, 2013, 2014

2013-05-16

... Excluded Parties Listing System (EPLS) databases into the System for Award Management (SAM) database. DATES... combined the functional capabilities of the CCR, ORCA, and EPLS procurement systems into the SAM database... identification number and the type of organization from the System for Award Management database. 0 3. Revise the...
Exact conditions on the temperature dependence of density functionals

DOE PAGES

Burke, K.; Smith, J. C.; Grabowski, P. E.; ...

2016-05-15

Universal exact conditions guided the construction of most ground-state density functional approximations in use today. Here, we derive the relation between the entropy and Mermin free energy density functionals for thermal density functional theory. Both the entropy and sum of kinetic and electron-electron repulsion functionals are shown to be monotonically increasing with temperature, while the Mermin functional is concave downwards. Analogous relations are found for both exchange and correlation. The importance of these conditions is illustrated in two extremes: the Hubbard dimer and the uniform gas.
Spatial heterogeneity in response of male greater sage-grouse lek attendance to energy development.

PubMed

Gregory, Andrew J; Beck, Jeffrey L

2014-01-01

Landscape modification due to rapidly expanding energy development, in particular oil and gas, in the westernUSA, have prompted concerns over how such developments may impact wildlife. One species of conservation concern across much of the Intermountain West is the greater sage-grouse (Centrocercusurophasianus). Sage-grouse have been petitioned for listing under provisions of the Endangered Species Act 7 times and the state of Wyoming alone represents 64% of the extant sage-grouse population in the eastern portion of their range. Consequently, the relationship between sage-grouse populations and oil and gas development in Wyoming is an important component to managing the long-term viability of this species. We used 814 leks from the Wyoming Game and Fish Department's lek survey database and well pad data from the Wyoming Oil and Gas Conservation Commission to evaluate changes in sage-grouse lek counts as a function of oil and gas development since 1991.From 1991-2011 we found that oil and gas well-pad density increased 3.6-fold across the state and was associated with a 24% decline in the number of male sage-grouse. Using a spatial and temporally structured analysis via Geographically Weighted Regression, we found a 1-to-4 year time lag between development density and lek decline. Sage-grouse also responded to development densities at multiple spatial neighborhoods surrounding leks, including broad scales of 10 km. However, sage-grouse lek counts do not always decline as a result of oil and gas development. We found similar development densities resulting in different sage-grouse lek count responses, suggesting that development density alone is insufficient to predict the impacts that oil and gas development have on sage-grouse. Finally, our analysis suggests a maximum development density of 1 well-pad within 2 km of leks to avoid measurable impacts within 1 year, and <6 well-pads within 10 km of leks to avoid delayed impacts.
Exploring Genetic, Genomic, and Phenotypic Data at the Rat Genome Database

PubMed Central

Laulederkind, Stanley J. F.; Hayman, G. Thomas; Wang, Shur-Jen; Lowry, Timothy F.; Nigam, Rajni; Petri, Victoria; Smith, Jennifer R.; Dwinell, Melinda R.; Jacob, Howard J.; Shimoyama, Mary

2013-01-01

The laboratory rat, Rattus norvegicus, is an important model of human health and disease, and experimental findings in the rat have relevance to human physiology and disease. The Rat Genome Database (RGD, http://rgd.mcw.edu) is a model organism database that provides access to a wide variety of curated rat data including disease associations, phenotypes, pathways, molecular functions, biological processes and cellular components for genes, quantitative trait loci, and strains. We present an overview of the database followed by specific examples that can be used to gain experience in employing RGD to explore the wealth of functional data available for the rat. PMID:23255149
Database Constraints Applied to Metabolic Pathway Reconstruction Tools

PubMed Central

Vilaplana, Jordi; Solsona, Francesc; Teixido, Ivan; Usié, Anabel; Karathia, Hiren; Alves, Rui; Mateo, Jordi

2014-01-01

Our group developed two biological applications, Biblio-MetReS and Homol-MetReS, accessing the same database of organisms with annotated genes. Biblio-MetReS is a data-mining application that facilitates the reconstruction of molecular networks based on automated text-mining analysis of published scientific literature. Homol-MetReS allows functional (re)annotation of proteomes, to properly identify both the individual proteins involved in the process(es) of interest and their function. It also enables the sets of proteins involved in the process(es) in different organisms to be compared directly. The efficiency of these biological applications is directly related to the design of the shared database. We classified and analyzed the different kinds of access to the database. Based on this study, we tried to adjust and tune the configurable parameters of the database server to reach the best performance of the communication data link to/from the database system. Different database technologies were analyzed. We started the study with a public relational SQL database, MySQL. Then, the same database was implemented by a MapReduce-based database named HBase. The results indicated that the standard configuration of MySQL gives an acceptable performance for low or medium size databases. Nevertheless, tuning database parameters can greatly improve the performance and lead to very competitive runtimes. PMID:25202745
AIM: a comprehensive Arabidopsis interactome module database and related interologs in plants.

PubMed

Wang, Yi; Thilmony, Roger; Zhao, Yunjun; Chen, Guoping; Gu, Yong Q

2014-01-01

Systems biology analysis of protein modules is important for understanding the functional relationships between proteins in the interactome. Here, we present a comprehensive database named AIM for Arabidopsis (Arabidopsis thaliana) interactome modules. The database contains almost 250,000 modules that were generated using multiple analysis methods and integration of microarray expression data. All the modules in AIM are well annotated using multiple gene function knowledge databases. AIM provides a user-friendly interface for different types of searches and offers a powerful graphical viewer for displaying module networks linked to the enrichment annotation terms. Both interactive Venn diagram and power graph viewer are integrated into the database for easy comparison of modules. In addition, predicted interologs from other plant species (homologous proteins from different species that share a conserved interaction module) are available for each Arabidopsis module. AIM is a powerful systems biology platform for obtaining valuable insights into the function of proteins in Arabidopsis and other plants using the modules of the Arabidopsis interactome. Database URL:http://probes.pw.usda.gov/AIM Published by Oxford University Press 2014. This work is written by US Government employees and is in the public domain in the US.
Safety of oral tenofovir disoproxil fumarate-based pre-exposure prophylaxis for HIV prevention.

PubMed

Mugwanya, Kenneth K; Baeten, Jared M

2016-01-01

Tenofovir disoproxil fumarate (TDF)-based pre-exposure prophylaxis is a novel HIV prevention strategy for individuals at increased sexual risk for HIV infection. For any biomedical prevention intervention, the bar for tolerating adverse effects in healthy persons is high compared to therapeutic interventions. We provide a concise summary of the clinical safety of TDF-based pre-exposure prophylaxis with focus on TDF-related effects on tolerability, kidney function, bone density, HIV resistance, sexual and reproductive health. The evidence base for this review is derived from a literature search of both randomized and observational studies evaluating efficacy and safety of TDF-based PrEP, TDF alone or in combination with emtricitabine, identified from PUBMED and EMBASE electronic databases, clinicaltrials.gov and major HIV conferences. TDF-based pre-exposure prophylaxis is a potent intervention against HIV acquisition when taken which is generally safe and well tolerated. The risk of the small, non-progressive, and reversible decline in glomerular filtration rate and bone mineral density as well as the potential selection for drug resistance associated with PrEP are outweighed, at the population level and broadly for individuals, by PrEP's substantial reduction in the risk of HIV infection.

Use of a Drosophila Genome-Wide Conserved Sequence Database to Identify Functionally Related cis-Regulatory Enhancers

PubMed Central

Brody, Thomas; Yavatkar, Amarendra S; Kuzin, Alexander; Kundu, Mukta; Tyson, Leonard J; Ross, Jermaine; Lin, Tzu-Yang; Lee, Chi-Hon; Awasaki, Takeshi; Lee, Tzumin; Odenwald, Ward F

2012-01-01

Background: Phylogenetic footprinting has revealed that cis-regulatory enhancers consist of conserved DNA sequence clusters (CSCs). Currently, there is no systematic approach for enhancer discovery and analysis that takes full-advantage of the sequence information within enhancer CSCs. Results: We have generated a Drosophila genome-wide database of conserved DNA consisting of >100,000 CSCs derived from EvoPrints spanning over 90% of the genome. cis-Decoder database search and alignment algorithms enable the discovery of functionally related enhancers. The program first identifies conserved repeat elements within an input enhancer and then searches the database for CSCs that score highly against the input CSC. Scoring is based on shared repeats as well as uniquely shared matches, and includes measures of the balance of shared elements, a diagnostic that has proven to be useful in predicting cis-regulatory function. To demonstrate the utility of these tools, a temporally-restricted CNS neuroblast enhancer was used to identify other functionally related enhancers and analyze their structural organization. Conclusions: cis-Decoder reveals that co-regulating enhancers consist of combinations of overlapping shared sequence elements, providing insights into the mode of integration of multiple regulating transcription factors. The database and accompanying algorithms should prove useful in the discovery and analysis of enhancers involved in any developmental process. Developmental Dynamics 241:169–189, 2012. © 2011 Wiley Periodicals, Inc. Key findings A genome-wide catalog of Drosophila conserved DNA sequence clusters. cis-Decoder discovers functionally related enhancers. Functionally related enhancers share balanced sequence element copy numbers. Many enhancers function during multiple phases of development. PMID:22174086
Achieving high confidence protein annotations in a sea of unknowns

NASA Astrophysics Data System (ADS)

Timmins-Schiffman, E.; May, D. H.; Noble, W. S.; Nunn, B. L.; Mikan, M.; Harvey, H. R.

2016-02-01

Increased sensitivity of mass spectrometry (MS) technology allows deep and broad insight into community functional analyses. Metaproteomics holds the promise to reveal functional responses of natural microbial communities, whereas metagenomics alone can only hint at potential functions. The complex datasets resulting from ocean MS have the potential to inform diverse realms of the biological, chemical, and physical ocean sciences, yet the extent of bacterial functional diversity and redundancy has not been fully explored. To take advantage of these impressive datasets, we need a clear bioinformatics pipeline for metaproteomics peptide identification and annotation with a database that can provide confident identifications. Researchers must consider whether it is sufficient to leverage the vast quantities of available ocean sequence data or if they must invest in site-specific metagenomic sequencing. We have sequenced, to our knowledge, the first western arctic metagenomes from the Bering Strait and the Chukchi Sea. We have addressed the long standing question: Is a metagenome required to accurately complete metaproteomics and assess the biological distribution of metabolic functions controlling nutrient acquisition in the ocean? Two different protein databases were constructed from 1) a site-specific metagenome and 2) subarctic/arctic groups available in NCBI's non-redundant database. Multiple proteomic search strategies were employed, against each individual database and against both databases combined, to determine the algorithm and approach that yielded the balance of high sensitivity and confident identification. Results yielded over 8200 confidently identified proteins. Our comparison of these results allows us to quantify the utility of investing resources in a metagenome versus using the constantly expanding and immediately available public databases for metaproteomic studies.
High-throughput density functional calculations to optimize properties and interfacial chemistry of piezoelectric materials

NASA Astrophysics Data System (ADS)

Barr, Jordan A.; Lin, Fang-Yin; Ashton, Michael; Hennig, Richard G.; Sinnott, Susan B.

2018-02-01

High-throughput density functional theory calculations are conducted to search through 1572 A B O3 compounds to find a potential replacement material for lead zirconate titanate (PZT) that exhibits the same excellent piezoelectric properties as PZT and lacks both its use of the toxic element lead (Pb) and the formation of secondary alloy phases with platinum (Pt) electrodes. The first screening criterion employed a search through the Materials Project database to find A -B combinations that do not form ternary compounds with Pt. The second screening criterion aimed to eliminate potential candidates through first-principles calculations of their electronic structure, in which compounds with a band gap of 0.25 eV or higher were retained. Third, thermodynamic stability calculations were used to compare the candidates in a Pt environment to compounds already calculated to be stable within the Materials Project. Formation energies below or equal to 100 meV/atom were considered to be thermodynamically stable. The fourth screening criterion employed lattice misfit to identify those candidate perovskites that have low misfit with the Pt electrode and high misfit of potential secondary phases that can be formed when Pt alloys with the different A and B components. To aid in the final analysis, dynamic stability calculations were used to determine those perovskites that have dynamic instabilities that favor the ferroelectric distortion. Analysis of the data finds three perovskites warranting further investigation: CsNb O3 , RbNb O3 , and CsTa O3 .
Long-range corrected density functional through the density matrix expansion based semilocal exchange hole.

PubMed

Patra, Bikash; Jana, Subrata; Samal, Prasanjit

2018-03-28

The exchange hole, which is one of the principal constituents of the density functional formalism, can be used to design accurate range-separated hybrid functionals in association with appropriate correlation. In this regard, the exchange hole derived from the density matrix expansion has gained attention due to its fulfillment of some of the desired exact constraints. Thus, the new long-range corrected density functional proposed here combines the meta generalized gradient approximation level exchange functional designed from the density matrix expansion based exchange hole coupled with the ab initio Hartree-Fock exchange through the range separation of the Coulomb interaction operator using the standard error function technique. Then, in association with the Lee-Yang-Parr correlation functional, the assessment and benchmarking of the above newly constructed range-separated functional with various well-known test sets shows its reasonable performance for a broad range of molecular properties, such as thermochemistry, non-covalent interaction and barrier heights of the chemical reactions.
Beyond Kohn-Sham Approximation: Hybrid Multistate Wave Function and Density Functional Theory.

PubMed

Gao, Jiali; Grofe, Adam; Ren, Haisheng; Bao, Peng

2016-12-15

A multistate density functional theory (MSDFT) is presented in which the energies and densities for the ground and excited states are treated on the same footing using multiconfigurational approaches. The method can be applied to systems with strong correlation and to correctly describe the dimensionality of the conical intersections between strongly coupled dissociative potential energy surfaces. A dynamic-then-static framework for treating electron correlation is developed to first incorporate dynamic correlation into contracted state functions through block-localized Kohn-Sham density functional theory (KSDFT), followed by diagonalization of the effective Hamiltonian to include static correlation. MSDFT can be regarded as a hybrid of wave function and density functional theory. The method is built on and makes use of the current approximate density functional developed in KSDFT, yet it retains its computational efficiency to treat strongly correlated systems that are problematic for KSDFT but too large for accurate WFT. The results presented in this work show that MSDFT can be applied to photochemical processes involving conical intersections.
Extending FEAST-METAL for analysis of low content minor actinide bearing and zirconium rich metallic fuels for sodium fast reactors

NASA Astrophysics Data System (ADS)

Karahan, Aydın

2011-07-01

Computational models in FEAST-METAL fuel behaviour code have been upgraded to simulate minor actinide bearing zirconium rich metallic fuels for use in sodium fast reactors. Increasing the zirconium content to 20-40 wt.% causes significant changes in fuel slug microstructure affecting thermal, mechanical, chemical, and fission gas behaviour. Inclusion of zirconium rich phase reduces the fission gas swelling rate significantly in early irradiation. Above the threshold fission gas swelling, formation of micro-cracks, and open pores increase material compliancy enhance diffusivity, leading to rapid fuel gas swelling, interconnected porosity development and release of the fission gases and helium. Production and release of helium was modelled empirically as a function of americium content and fission gas production, consistent with previous Idaho National Laboratory studies. Predicted fuel constituent redistribution is much smaller compared to typical U-Pu-10Zr fuel operated at EBR-II. Material properties such as fuel thermal conductivity, modulus of elasticity, and thermal expansion coefficient have been approximated using the available database. Creep rate and fission gas diffusivity of high zirconium fuel is lowered by an order of magnitude with respect to the reference low zirconium fuel based on limited database and in order to match experimental observations. The new code is benchmarked against the AFC-1F fuel assembly post irradiation examination results. Satisfactory match was obtained for fission gas release and swelling behaviour. Finally, the study considers a comparison of fuel behaviour between high zirconium content minor actinide bearing fuel and typical U-15Pu-6Zr fuel pins with 75% smear density. The new fuel has much higher fissile content, allowing for operating at lower neutron flux level compared to fuel with lower fissile density. This feature allows the designer to reach a much higher burnup before reaching the cladding dose limit. On the other hand, in order to accommodate solid fission product swelling and to control fuel clad mechanical interaction of the stiffer fuel, the fuel smear density is reduced to 70%. In addition, plenum height is increased to accommodate for fission gases.
Extension modules for storage, visualization and querying of genomic, genetic and breeding data in Tripal databases

PubMed Central

Lee, Taein; Cheng, Chun-Huai; Ficklin, Stephen; Yu, Jing; Humann, Jodi; Main, Dorrie

2017-01-01

Abstract Tripal is an open-source database platform primarily used for development of genomic, genetic and breeding databases. We report here on the release of the Chado Loader, Chado Data Display and Chado Search modules to extend the functionality of the core Tripal modules. These new extension modules provide additional tools for (1) data loading, (2) customized visualization and (3) advanced search functions for supported data types such as organism, marker, QTL/Mendelian Trait Loci, germplasm, map, project, phenotype, genotype and their respective metadata. The Chado Loader module provides data collection templates in Excel with defined metadata and data loaders with front end forms. The Chado Data Display module contains tools to visualize each data type and the metadata which can be used as is or customized as desired. The Chado Search module provides search and download functionality for the supported data types. Also included are the tools to visualize map and species summary. The use of materialized views in the Chado Search module enables better performance as well as flexibility of data modeling in Chado, allowing existing Tripal databases with different metadata types to utilize the module. These Tripal Extension modules are implemented in the Genome Database for Rosaceae (rosaceae.org), CottonGen (cottongen.org), Citrus Genome Database (citrusgenomedb.org), Genome Database for Vaccinium (vaccinium.org) and the Cool Season Food Legume Database (coolseasonfoodlegume.org). Database URL: https://www.citrusgenomedb.org/, https://www.coolseasonfoodlegume.org/, https://www.cottongen.org/, https://www.rosaceae.org/, https://www.vaccinium.org/
Development of SRS.php, a Simple Object Access Protocol-based library for data acquisition from integrated biological databases.

PubMed

Barbosa-Silva, A; Pafilis, E; Ortega, J M; Schneider, R

2007-12-11

Data integration has become an important task for biological database providers. The current model for data exchange among different sources simplifies the manner that distinct information is accessed by users. The evolution of data representation from HTML to XML enabled programs, instead of humans, to interact with biological databases. We present here SRS.php, a PHP library that can interact with the data integration Sequence Retrieval System (SRS). The library has been written using SOAP definitions, and permits the programmatic communication through webservices with the SRS. The interactions are possible by invoking the methods described in WSDL by exchanging XML messages. The current functions available in the library have been built to access specific data stored in any of the 90 different databases (such as UNIPROT, KEGG and GO) using the same query syntax format. The inclusion of the described functions in the source of scripts written in PHP enables them as webservice clients to the SRS server. The functions permit one to query the whole content of any SRS database, to list specific records in these databases, to get specific fields from the records, and to link any record among any pair of linked databases. The case study presented exemplifies the library usage to retrieve information regarding registries of a Plant Defense Mechanisms database. The Plant Defense Mechanisms database is currently being developed, and the proposal of SRS.php library usage is to enable the data acquisition for the further warehousing tasks related to its setup and maintenance.
Global Distribution of Outbreaks of Water-Associated Infectious Diseases

PubMed Central

Yang, Kun; LeJeune, Jeffrey; Alsdorf, Doug; Lu, Bo; Shum, C. K.; Liang, Song

2012-01-01

Background Water plays an important role in the transmission of many infectious diseases, which pose a great burden on global public health. However, the global distribution of these water-associated infectious diseases and underlying factors remain largely unexplored. Methods and Findings Based on the Global Infectious Disease and Epidemiology Network (GIDEON), a global database including water-associated pathogens and diseases was developed. In this study, reported outbreak events associated with corresponding water-associated infectious diseases from 1991 to 2008 were extracted from the database. The location of each reported outbreak event was identified and geocoded into a GIS database. Also collected in the GIS database included geo-referenced socio-environmental information including population density (2000), annual accumulated temperature, surface water area, and average annual precipitation. Poisson models with Bayesian inference were developed to explore the association between these socio-environmental factors and distribution of the reported outbreak events. Based on model predictions a global relative risk map was generated. A total of 1,428 reported outbreak events were retrieved from the database. The analysis suggested that outbreaks of water-associated diseases are significantly correlated with socio-environmental factors. Population density is a significant risk factor for all categories of reported outbreaks of water-associated diseases; water-related diseases (e.g., vector-borne diseases) are associated with accumulated temperature; water-washed diseases (e.g., conjunctivitis) are inversely related to surface water area; both water-borne and water-related diseases are inversely related to average annual rainfall. Based on the model predictions, “hotspots” of risks for all categories of water-associated diseases were explored. Conclusions At the global scale, water-associated infectious diseases are significantly correlated with socio-environmental factors, impacting all regions which are affected disproportionately by different categories of water-associated infectious diseases. PMID:22348158
The UCL low-density lipoprotein receptor gene variant database: pathogenicity update

PubMed Central

Futema, Marta; Whittall, Ros; Taylor-Beadling, Alison; Williams, Maggie; den Dunnen, Johan T; Humphries, Steve E

2017-01-01

Background Familial hypercholesterolaemia (OMIM 143890) is most frequently caused by variations in the low-density lipoprotein receptor (LDLR) gene. Predicting whether novel variants are pathogenic may not be straightforward, especially for missense and synonymous variants. In 2013, the Association of Clinical Genetic Scientists published guidelines for the classification of variants, with categories 1 and 2 representing clearly not or unlikely pathogenic, respectively, 3 representing variants of unknown significance (VUS), and 4 and 5 representing likely to be or clearly pathogenic, respectively. Here, we update the University College London (UCL) LDLR variant database according to these guidelines. Methods PubMed searches and alerts were used to identify novel LDLR variants for inclusion in the database. Standard in silico tools were used to predict potential pathogenicity. Variants were designated as class 4/5 only when the predictions from the different programs were concordant and as class 3 when predictions were discordant. Results The updated database (http://www.lovd.nl/LDLR) now includes 2925 curated variants, representing 1707 independent events. All 129 nonsense variants, 337 small frame-shifting and 117/118 large rearrangements were classified as 4 or 5. Of the 795 missense variants, 115 were in classes 1 and 2, 605 in class 4 and 75 in class 3. 111/181 intronic variants, 4/34 synonymous variants and 14/37 promoter variants were assigned to classes 4 or 5. Overall, 112 (7%) of reported variants were class 3. Conclusions This study updates the LDLR variant database and identifies a number of reported VUS where additional family and in vitro studies will be required to confirm or refute their pathogenicity. PMID:27821657
Determining object orientation with a hierarchical database of binary synthetic discriminant function filters

NASA Technical Reports Server (NTRS)

Reid, Max B.; Ma, Paul W.; Downie, John D.

1990-01-01

An optical correlation-based system is demonstrated which recognizes an object and determines its angular orientation by traversing a hierarchical data base of binary filters. The data-base architecture is made possible by the development of binary synthetic discriminant function filters.
On extending Kohn-Sham density functionals to systems with fractional number of electrons.

PubMed

Li, Chen; Lu, Jianfeng; Yang, Weitao

2017-06-07

We analyze four ways of formulating the Kohn-Sham (KS) density functionals with a fractional number of electrons, through extending the constrained search space from the Kohn-Sham and the generalized Kohn-Sham (GKS) non-interacting v-representable density domain for integer systems to four different sets of densities for fractional systems. In particular, these density sets are (I) ensemble interacting N-representable densities, (II) ensemble non-interacting N-representable densities, (III) non-interacting densities by the Janak construction, and (IV) non-interacting densities whose composing orbitals satisfy the Aufbau occupation principle. By proving the equivalence of the underlying first order reduced density matrices associated with these densities, we show that sets (I), (II), and (III) are equivalent, and all reduce to the Janak construction. Moreover, for functionals with the ensemble v-representable assumption at the minimizer, (III) reduces to (IV) and thus justifies the previous use of the Aufbau protocol within the (G)KS framework in the study of the ground state of fractional electron systems, as defined in the grand canonical ensemble at zero temperature. By further analyzing the Aufbau solution for different density functional approximations (DFAs) in the (G)KS scheme, we rigorously prove that there can be one and only one fractional occupation for the Hartree Fock functional, while there can be multiple fractional occupations for general DFAs in the presence of degeneracy. This has been confirmed by numerical calculations using the local density approximation as a representative of general DFAs. This work thus clarifies important issues on density functional theory calculations for fractional electron systems.
Functional thermo-dynamics: a generalization of dynamic density functional theory to non-isothermal situations.

PubMed

Anero, Jesús G; Español, Pep; Tarazona, Pedro

2013-07-21

We present a generalization of Density Functional Theory (DFT) to non-equilibrium non-isothermal situations. By using the original approach set forth by Gibbs in his consideration of Macroscopic Thermodynamics (MT), we consider a Functional Thermo-Dynamics (FTD) description based on the density field and the energy density field. A crucial ingredient of the theory is an entropy functional, which is a concave functional. Therefore, there is a one to one connection between the density and energy fields with the conjugate thermodynamic fields. The connection between the three levels of description (MT, DFT, FTD) is clarified through a bridge theorem that relates the entropy of different levels of description and that constitutes a generalization of Mermin's theorem to arbitrary levels of description whose relevant variables are connected linearly. Although the FTD level of description does not provide any new information about averages and correlations at equilibrium, it is a crucial ingredient for the dynamics in non-equilibrium states. We obtain with the technique of projection operators the set of dynamic equations that describe the evolution of the density and energy density fields from an initial non-equilibrium state towards equilibrium. These equations generalize time dependent density functional theory to non-isothermal situations. We also present an explicit model for the entropy functional for hard spheres.
Functionally Graded Materials Database

NASA Astrophysics Data System (ADS)

Kisara, Katsuto; Konno, Tomomi; Niino, Masayuki

2008-02-01

Functionally Graded Materials Database (hereinafter referred to as FGMs Database) was open to the society via Internet in October 2002, and since then it has been managed by the Japan Aerospace Exploration Agency (JAXA). As of October 2006, the database includes 1,703 research information entries with 2,429 researchers data, 509 institution data and so on. Reading materials such as "Applicability of FGMs Technology to Space Plane" and "FGMs Application to Space Solar Power System (SSPS)" were prepared in FY 2004 and 2005, respectively. The English version of "FGMs Application to Space Solar Power System (SSPS)" is now under preparation. This present paper explains the FGMs Database, describing the research information data, the sitemap and how to use it. From the access analysis, user access results and users' interests are discussed.
Micro-Tom Tomato as an Alternative Plant Model System: Mutant Collection and Efficient Transformation.

PubMed

Shikata, Masahito; Ezura, Hiroshi

2016-01-01

Tomato is a model plant for fruit development, a unique feature that classical model plants such as Arabidopsis and rice do not have. The tomato genome was sequenced in 2012 and tomato is becoming very popular as an alternative system for plant research. Among many varieties of tomato, Micro-Tom has been recognized as a model cultivar for tomato research because it shares some key advantages with Arabidopsis including its small size, short life cycle, and capacity to grow under fluorescent lights at a high density. Mutants and transgenic plants are essential materials for functional genomics research, and therefore, the availability of mutant resources and methods for genetic transformation are key tools to facilitate tomato research. Here, we introduce the Micro-Tom mutant database "TOMATOMA" and an efficient transformation protocol for Micro-Tom.
Intermolecular symmetry-adapted perturbation theory study of large organic complexes.

PubMed

Heßelmann, Andreas; Korona, Tatiana

2014-09-07

Binding energies for the complexes of the S12L database by Grimme [Chem. Eur. J. 18, 9955 (2012)] were calculated using intermolecular symmetry-adapted perturbation theory combined with a density-functional theory description of the interacting molecules. The individual interaction energy decompositions revealed no particular change in the stabilisation pattern as compared to smaller dimer systems at equilibrium structures. This demonstrates that, to some extent, the qualitative description of the interaction of small dimer systems may be extrapolated to larger systems, a method that is widely used in force-fields in which the total interaction energy is decomposed into atom-atom contributions. A comparison of the binding energies with accurate experimental reference values from Grimme, the latter including thermodynamic corrections from semiempirical calculations, has shown a fairly good agreement to within the error range of the reference binding energies.
Accelerating the Convergence of Replica Exchange Simulations Using Gibbs Sampling and Adaptive Temperature Sets

DOE PAGES

Vogel, Thomas; Perez, Danny

2015-08-28

We recently introduced a novel replica-exchange scheme in which an individual replica can sample from states encountered by other replicas at any previous time by way of a global configuration database, enabling the fast propagation of relevant states through the whole ensemble of replicas. This mechanism depends on the knowledge of global thermodynamic functions which are measured during the simulation and not coupled to the heat bath temperatures driving the individual simulations. Therefore, this setup also allows for a continuous adaptation of the temperature set. In this paper, we will review the new scheme and demonstrate its capability. Furthermore, themore » method is particularly useful for the fast and reliable estimation of the microcanonical temperature T(U) or, equivalently, of the density of states g(U) over a wide range of energies.« less
An empirical potential for simulating vacancy clusters in tungsten.

PubMed

Mason, D R; Nguyen-Manh, D; Becquart, C S

2017-12-20

We present an empirical interatomic potential for tungsten, particularly well suited for simulations of vacancy-type defects. We compare energies and structures of vacancy clusters generated with the empirical potential with an extensive new database of values computed using density functional theory, and show that the new potential predicts low-energy defect structures and formation energies with high accuracy. A significant difference to other popular embedded-atom empirical potentials for tungsten is the correct prediction of surface energies. Interstitial properties and short-range pairwise behaviour remain similar to the Ackford-Thetford potential on which it is based, making this potential well-suited to simulations of microstructural evolution following irradiation damage cascades. Using atomistic kinetic Monte Carlo simulations, we predict vacancy cluster dissociation in the range 1100-1300 K, the temperature range generally associated with stage IV recovery.
Universal functions of nuclear proximity potential for Skyrme nucleus-nucleus interaction in a semiclassical approach

NASA Astrophysics Data System (ADS)

Gupta, Raj K.; Singh, Dalip; Kumar, Raj; Greiner, Walter

2009-07-01

The universal function of the nuclear proximity potential is obtained for the Skyrme nucleus-nucleus interaction in the semiclassical extended Thomas-Fermi (ETF) approach. This is obtained as a sum of the spin-orbit-density-independent and spin-orbit-density-dependent parts of the Hamiltonian density, since the two terms behave differently, the spin-orbit-density-independent part mainly attractive and the spin-orbit-density-dependent part mainly repulsive. The semiclassical expansions of kinetic energy density and spin-orbit density are allowed up to second order, and the two-parameter Fermi density, with its parameters fitted to experiments, is used for the nuclear density. The universal functions or the resulting nuclear proximity potential reproduce the 'exact' Skyrme nucleus-nucleus interaction potential in the semiclassical approach, within less than ~1 MeV of difference, both at the maximum attraction and in the surface region. An application of the resulting interaction potential to fusion excitation functions shows clearly that the parameterized universal functions of nuclear proximity potential substitute completely the 'exact' potential in the Skyrme energy density formalism based on the semiclassical ETF method, including also the modifications of interaction barriers at sub-barrier energies in terms of modifying the constants of the universal functions.
Double-hybrid density-functional theory with meta-generalized-gradient approximations

DOE Office of Scientific and Technical Information (OSTI.GOV)

Souvi, Sidi M. O., E-mail: sidi.souvi@irsn.fr; Sharkas, Kamal; Toulouse, Julien, E-mail: julien.toulouse@upmc.fr

2014-02-28

We extend the previously proposed one-parameter double-hybrid density-functional theory [K. Sharkas, J. Toulouse, and A. Savin, J. Chem. Phys. 134, 064113 (2011)] to meta-generalized-gradient-approximation (meta-GGA) exchange-correlation density functionals. We construct several variants of one-parameter double-hybrid approximations using the Tao-Perdew-Staroverov-Scuseria (TPSS) meta-GGA functional and test them on test sets of atomization energies and reaction barrier heights. The most accurate variant uses the uniform coordinate scaling of the density and of the kinetic energy density in the correlation functional, and improves over both standard Kohn-Sham TPSS and second-order Møller-Plesset calculations.

dbWGFP: a database and web server of human whole-genome single nucleotide variants and their functional predictions.

PubMed

Wu, Jiaxin; Wu, Mengmeng; Li, Lianshuo; Liu, Zhuo; Zeng, Wanwen; Jiang, Rui

2016-01-01

The recent advancement of the next generation sequencing technology has enabled the fast and low-cost detection of all genetic variants spreading across the entire human genome, making the application of whole-genome sequencing a tendency in the study of disease-causing genetic variants. Nevertheless, there still lacks a repository that collects predictions of functionally damaging effects of human genetic variants, though it has been well recognized that such predictions play a central role in the analysis of whole-genome sequencing data. To fill this gap, we developed a database named dbWGFP (a database and web server of human whole-genome single nucleotide variants and their functional predictions) that contains functional predictions and annotations of nearly 8.58 billion possible human whole-genome single nucleotide variants. Specifically, this database integrates 48 functional predictions calculated by 17 popular computational methods and 44 valuable annotations obtained from various data sources. Standalone software, user-friendly query services and free downloads of this database are available at http://bioinfo.au.tsinghua.edu.cn/dbwgfp. dbWGFP provides a valuable resource for the analysis of whole-genome sequencing, exome sequencing and SNP array data, thereby complementing existing data sources and computational resources in deciphering genetic bases of human inherited diseases. © The Author(s) 2016. Published by Oxford University Press.
Building a database for long-term monitoring of benthic macrofauna in the Pertuis-Charentais (2004-2014).

PubMed

Philippe, Anne S; Plumejeaud-Perreau, Christine; Jourde, Jérôme; Pineau, Philippe; Lachaussée, Nicolas; Joyeux, Emmanuel; Corre, Frédéric; Delaporte, Philippe; Bocher, Pierrick

2017-01-01

Long-term benthic monitoring is rewarding in terms of science, but labour-intensive, whether in the field, the laboratory, or behind the computer. Building and managing databases require multiple skills, including consistency over time as well as organisation via a systematic approach. Here, we introduce and share our spatially explicit benthic database, comprising 11 years of benthic data. It is the result of intensive benthic sampling that has been conducted on a regular grid (259 stations) covering the intertidal mudflats of the Pertuis-Charentais (Marennes-Oléron Bay and Aiguillon Bay). Samples were taken by foot or by boats during winter depending on tidal height, from December 2003 to February 2014. The present dataset includes abundances and biomass densities of all mollusc species of the study regions and principal polychaetes as well as their length, accessibility to shorebirds, energy content and shell mass when appropriate and available. This database has supported many studies dealing with the spatial distribution of benthic invertebrates and temporal variations in food resources for shorebird species as well as latitudinal comparisons with other databases. In this paper, we introduce our benthos monitoring, share our data, and present a "guide of good practices" for building, cleaning and using it efficiently, providing examples of results with associated R code. The dataset has been formatted into a geo-referenced relational database, using PostgreSQL open-source DBMS. We provide density information, measurements, energy content and accessibility of thirteen bivalve, nine gastropod and two polychaete taxa (a total of 66,620 individuals) for 11 consecutive winters. Figures and maps are provided to describe how the dataset was built, cleaned, and how it can be used. This dataset can again support studies concerning spatial and temporal variations in species abundance, interspecific interactions as well as evaluations of the availability of food resources for small- and medium size shorebirds and, potentially, conservation and impact assessment studies.
MIPS: a database for genomes and protein sequences

PubMed Central

Mewes, H. W.; Frishman, D.; Güldener, U.; Mannhaupt, G.; Mayer, K.; Mokrejs, M.; Morgenstern, B.; Münsterkötter, M.; Rudd, S.; Weil, B.

2002-01-01

The Munich Information Center for Protein Sequences (MIPS-GSF, Neuherberg, Germany) continues to provide genome-related information in a systematic way. MIPS supports both national and European sequencing and functional analysis projects, develops and maintains automatically generated and manually annotated genome-specific databases, develops systematic classification schemes for the functional annotation of protein sequences, and provides tools for the comprehensive analysis of protein sequences. This report updates the information on the yeast genome (CYGD), the Neurospora crassa genome (MNCDB), the databases for the comprehensive set of genomes (PEDANT genomes), the database of annotated human EST clusters (HIB), the database of complete cDNAs from the DHGP (German Human Genome Project), as well as the project specific databases for the GABI (Genome Analysis in Plants) and HNB (Helmholtz–Netzwerk Bioinformatik) networks. The Arabidospsis thaliana database (MATDB), the database of mitochondrial proteins (MITOP) and our contribution to the PIR International Protein Sequence Database have been described elsewhere [Schoof et al. (2002) Nucleic Acids Res., 30, 91–93; Scharfe et al. (2000) Nucleic Acids Res., 28, 155–158; Barker et al. (2001) Nucleic Acids Res., 29, 29–32]. All databases described, the protein analysis tools provided and the detailed descriptions of our projects can be accessed through the MIPS World Wide Web server (http://mips.gsf.de). PMID:11752246
MIPS: a database for genomes and protein sequences.

PubMed

Mewes, H W; Frishman, D; Güldener, U; Mannhaupt, G; Mayer, K; Mokrejs, M; Morgenstern, B; Münsterkötter, M; Rudd, S; Weil, B

2002-01-01

The Munich Information Center for Protein Sequences (MIPS-GSF, Neuherberg, Germany) continues to provide genome-related information in a systematic way. MIPS supports both national and European sequencing and functional analysis projects, develops and maintains automatically generated and manually annotated genome-specific databases, develops systematic classification schemes for the functional annotation of protein sequences, and provides tools for the comprehensive analysis of protein sequences. This report updates the information on the yeast genome (CYGD), the Neurospora crassa genome (MNCDB), the databases for the comprehensive set of genomes (PEDANT genomes), the database of annotated human EST clusters (HIB), the database of complete cDNAs from the DHGP (German Human Genome Project), as well as the project specific databases for the GABI (Genome Analysis in Plants) and HNB (Helmholtz-Netzwerk Bioinformatik) networks. The Arabidospsis thaliana database (MATDB), the database of mitochondrial proteins (MITOP) and our contribution to the PIR International Protein Sequence Database have been described elsewhere [Schoof et al. (2002) Nucleic Acids Res., 30, 91-93; Scharfe et al. (2000) Nucleic Acids Res., 28, 155-158; Barker et al. (2001) Nucleic Acids Res., 29, 29-32]. All databases described, the protein analysis tools provided and the detailed descriptions of our projects can be accessed through the MIPS World Wide Web server (http://mips.gsf.de).
Metal-Air Batteries: (Latest citations from the Aerospace Database)

NASA Technical Reports Server (NTRS)

1997-01-01

The bibliography contains citations concerning applications of metal-air batteries. Topics include systems that possess different practical energy densities at specific powers. Coverage includes the operation of air electrodes at different densities and performance results. The systems are used in electric vehicles as a cost-effective method to achieve reliability and efficiency. Zinc-air batteries are covered more thoroughly in a separate bibliography. (Contains 50-250 citations and includes a subject term index and title list.)
FunSimMat: a comprehensive functional similarity database

PubMed Central

Schlicker, Andreas; Albrecht, Mario

2008-01-01

Functional similarity based on Gene Ontology (GO) annotation is used in diverse applications like gene clustering, gene expression data analysis, protein interaction prediction and evaluation. However, there exists no comprehensive resource of functional similarity values although such a database would facilitate the use of functional similarity measures in different applications. Here, we describe FunSimMat (Functional Similarity Matrix, http://funsimmat.bioinf.mpi-inf.mpg.de/), a large new database that provides several different semantic similarity measures for GO terms. It offers various precomputed functional similarity values for proteins contained in UniProtKB and for protein families in Pfam and SMART. The web interface allows users to efficiently perform both semantic similarity searches with GO terms and functional similarity searches with proteins or protein families. All results can be downloaded in tab-delimited files for use with other tools. An additional XML–RPC interface gives automatic online access to FunSimMat for programs and remote services. PMID:17932054
The force distribution probability function for simple fluids by density functional theory.

PubMed

Rickayzen, G; Heyes, D M

2013-02-28

Classical density functional theory (DFT) is used to derive a formula for the probability density distribution function, P(F), and probability distribution function, W(F), for simple fluids, where F is the net force on a particle. The final formula for P(F) ∝ exp(-AF(2)), where A depends on the fluid density, the temperature, and the Fourier transform of the pair potential. The form of the DFT theory used is only applicable to bounded potential fluids. When combined with the hypernetted chain closure of the Ornstein-Zernike equation, the DFT theory for W(F) agrees with molecular dynamics computer simulations for the Gaussian and bounded soft sphere at high density. The Gaussian form for P(F) is still accurate at lower densities (but not too low density) for the two potentials, but with a smaller value for the constant, A, than that predicted by the DFT theory.
Single-particle energies and density of states in density functional theory

NASA Astrophysics Data System (ADS)

van Aggelen, H.; Chan, G. K.-L.

2015-07-01

Time-dependent density functional theory (TD-DFT) is commonly used as the foundation to obtain neutral excited states and transition weights in DFT, but does not allow direct access to density of states and single-particle energies, i.e. ionisation energies and electron affinities. Here we show that by extending TD-DFT to a superfluid formulation, which involves operators that break particle-number symmetry, we can obtain the density of states and single-particle energies from the poles of an appropriate superfluid response function. The standard Kohn- Sham eigenvalues emerge as the adiabatic limit of the superfluid response under the assumption that the exchange- correlation functional has no dependence on the superfluid density. The Kohn- Sham eigenvalues can thus be interpreted as approximations to the ionisation energies and electron affinities. Beyond this approximation, the formalism provides an incentive for creating a new class of density functionals specifically targeted at accurate single-particle eigenvalues and bandgaps.
Improved protein surface comparison and application to low-resolution protein structure data.

PubMed

Sael, Lee; Kihara, Daisuke

2010-12-14

Recent advancements of experimental techniques for determining protein tertiary structures raise significant challenges for protein bioinformatics. With the number of known structures of unknown function expanding at a rapid pace, an urgent task is to provide reliable clues to their biological function on a large scale. Conventional approaches for structure comparison are not suitable for a real-time database search due to their slow speed. Moreover, a new challenge has arisen from recent techniques such as electron microscopy (EM), which provide low-resolution structure data. Previously, we have introduced a method for protein surface shape representation using the 3D Zernike descriptors (3DZDs). The 3DZD enables fast structure database searches, taking advantage of its rotation invariance and compact representation. The search results of protein surface represented with the 3DZD has showngood agreement with the existing structure classifications, but some discrepancies were also observed. The three new surface representations of backbone atoms, originally devised all-atom-surface representation, and the combination of all-atom surface with the backbone representation are examined. All representations are encoded with the 3DZD. Also, we have investigated the applicability of the 3DZD for searching protein EM density maps of varying resolutions. The surface representations are evaluated on structure retrieval using two existing classifications, SCOP and the CE-based classification. Overall, the 3DZDs representing backbone atoms show better retrieval performance than the original all-atom surface representation. The performance further improved when the two representations are combined. Moreover, we observed that the 3DZD is also powerful in comparing low-resolution structures obtained by electron microscopy.
Trivial constraints on orbital-free kinetic energy density functionals

NASA Astrophysics Data System (ADS)

Luo, Kai; Trickey, S. B.

2018-03-01

Approximate kinetic energy density functionals (KEDFs) are central to orbital-free density functional theory. Limitations on the spatial derivative dependencies of KEDFs have been claimed from differential virial theorems. We identify a central defect in the argument: the relationships are not true for an arbitrary density but hold only for the minimizing density and corresponding chemical potential. Contrary to the claims therefore, the relationships are not constraints and provide no independent information about the spatial derivative dependencies of approximate KEDFs. A simple argument also shows that validity for arbitrary v-representable densities is not restored by appeal to the density-potential bijection.
Weighted gene co-expression network analysis of expression data of monozygotic twins identifies specific modules and hub genes related to BMI.

PubMed

Wang, Weijing; Jiang, Wenjie; Hou, Lin; Duan, Haiping; Wu, Yili; Xu, Chunsheng; Tan, Qihua; Li, Shuxia; Zhang, Dongfeng

2017-11-13

The therapeutic management of obesity is challenging, hence further elucidating the underlying mechanisms of obesity development and identifying new diagnostic biomarkers and therapeutic targets are urgent and necessary. Here, we performed differential gene expression analysis and weighted gene co-expression network analysis (WGCNA) to identify significant genes and specific modules related to BMI based on gene expression profile data of 7 discordant monozygotic twins. In the differential gene expression analysis, it appeared that 32 differentially expressed genes (DEGs) were with a trend of up-regulation in twins with higher BMI when compared to their siblings. Categories of positive regulation of nitric-oxide synthase biosynthetic process, positive regulation of NF-kappa B import into nucleus, and peroxidase activity were significantly enriched within GO database and NF-kappa B signaling pathway within KEGG database. DEGs of NAMPT, TLR9, PTGS2, HBD, and PCSK1N might be associated with obesity. In the WGCNA, among the total 20 distinct co-expression modules identified, coral1 module (68 genes) had the strongest positive correlation with BMI (r = 0.56, P = 0.04) and disease status (r = 0.56, P = 0.04). Categories of positive regulation of phospholipase activity, high-density lipoprotein particle clearance, chylomicron remnant clearance, reverse cholesterol transport, intermediate-density lipoprotein particle, chylomicron, low-density lipoprotein particle, very-low-density lipoprotein particle, voltage-gated potassium channel complex, cholesterol transporter activity, and neuropeptide hormone activity were significantly enriched within GO database for this module. And alcoholism and cell adhesion molecules pathways were significantly enriched within KEGG database. Several hub genes, such as GAL, ASB9, NPPB, TBX2, IL17C, APOE, ABCG4, and APOC2 were also identified. The module eigengene of saddlebrown module (212 genes) was also significantly correlated with BMI (r = 0.56, P = 0.04), and hub genes of KCNN1 and AQP10 were differentially expressed. We identified significant genes and specific modules potentially related to BMI based on the gene expression profile data of monozygotic twins. The findings may help further elucidate the underlying mechanisms of obesity development and provide novel insights to research potential gene biomarkers and signaling pathways for obesity treatment. Further analysis and validation of the findings reported here are important and necessary when more sample size is acquired.
Ecological Functions of Off-Channel Habitats of the Willamette River, Oregon, Database and Documentation (1997-2001)

EPA Science Inventory

The database from the Ecological Functions of Off-Channel Habitats of the Willamette River, Oregon project (OCH Project) contains data collected from 1997 through 2001 from multiple research areas of the project, and project documents such as the OCH Research Plan, Quality Assura...
Towards driving mantle convection by mineral physics

NASA Astrophysics Data System (ADS)

Piazzoni, A. S.; Bunge, H.; Steinle-Neumann, G.

2005-12-01

Models of mantle convection have become increasingly sophisticated over the past decade, accounting, for example, for 3 D spherical geometry, and changes in mantle rheology due to variations in temperature and stress. In light of such advances it is surprising that growing constraints on mantle structure derived from mineral physics have not yet been fully brought to bear on mantle convection models. In fact, despite much progress in our understanding of mantle mineralogy a partial description of the equation of state is often used to relate density changes to pressure and temperature alone, without taking into account compositional and mineralogical models of the mantle. Similarly, for phase transitions an incomplete description of thermodynamic constraints is often used, resulting in significant uncertainties in model behavior. While a number of thermodynamic models (some with limited scope) have been constructed recently, some lack the rigor in thermodynamics - for example with respect to the treatment of solid solution - that is needed to make predictions about mantle structure. Here we have constructed a new thermodynamic database for the mantle and have coupled the resulting density dynamically with mantle convection models. The database is build on a self-consistent Gibb's free energy minimization of the system MgO-FeO-SiO2-CaO-Al2O3 that is appropriate for standard (dry) chemical models of the Earth's mantle for relevant high pressure and temperature phases. We have interfaced the database with a high-resolution 2-D convection code (2DTERRA), dynamically coupling the thermodynamic model (density) with the conservation equations of mantle flow. The coupled model is run for different parameterizations of viscosity, initial temperature conditions, and varying the internal vs. external heating. We compare the resulting flow and temperature fields to cases with the Boussinesq approximation and other classical descriptions of the equation of state in mantle dynamics to assess the influence of realistic mineralogical density on mantle convection.
Digital mining claim density map for federal lands in Nevada: 1996

USGS Publications Warehouse

Hyndman, Paul C.; Campbell, Harry W.

1999-01-01

This report describes a digital map generated by the U.S. Geological Survey (USGS) to provide digital spatial mining claim density information for federal lands in Nevada as of March 1997. Mining claim data is earth science information deemed to be relevant to the assessment of historic, current, and future ecological, economic, and social systems. There is no paper map included in this Open-File report. In accordance with the Federal Land Policy and Management Act of 1976 (FLPMA), all unpatented mining claims, mill, and tunnel sites must be recorded at the appropriate Bureau of Land Management (BLM) State office. BLM maintains a cumulative computer listing of mining claims in the MCRS database with locations given by meridian, township, range, and section. A mining claim is considered closed when the claim is relinquished or a formal BLM decision declaring the mining claim null and void has been issued and the appeal period has expired. All other mining claims filed with BLM are considered to be open and actively held. The digital map (figure 1.) with the mining claim density database available in this report are suitable for geographic information system (GIS)-based regional assessments at a scale of 1:100,000 or smaller.
Digital mining claim density map for federal lands in Utah: 1996

USGS Publications Warehouse

Hyndman, Paul C.; Campbell, Harry W.

1999-01-01

This report describes a digital map generated by the U.S. Geological Survey (USGS) to provide digital spatial mining claim density information for federal lands in Utah as of March 1997. Mining claim data is earth science information deemed to be relevant to the assessment of historic, current, and future ecological, economic, and social systems. There is no paper map included in this Open-File report. In accordance with the Federal Land Policy and Management Act of 1976 (FLPMA), all unpatented mining claims, mill, and tunnel sites must be recorded at the appropriate BLM State office. BLM maintains a cumulative computer listing of mining claims in the MCRS database with locations given by meridian, township, range, and section. A mining claim is considered closed when the claim is relinquished or a formal BLM decision declaring the mining claim null and void has been issued and the appeal period has expired. All other mining claims filed with BLM are considered to be open and actively held. The digital map (figure 1.) with the mining claim density database available in this report are suitable for geographic information system (GIS)-based regional assessments at a scale of 1:100,000 or smaller.
Digital mining claim density map for federal lands in Wyoming: 1996

USGS Publications Warehouse

Hyndman, Paul C.; Campbell, Harry W.

1999-01-01

This report describes a digital map generated by the U.S. Geological Survey (USGS) to provide digital spatial mining claim density information for federal lands in Wyoming as of March 1997. Mining claim data is earth science information deemed to be relevant to the assessment of historic, current, and future ecological, economic, and social systems. There is no paper map included in this Open-File report. In accordance with the Federal Land Policy and Management Act of 1976 (FLPMA), all unpatented mining claims, mill, and tunnel sites must be recorded at the appropriate BLM State office. BLM maintains a cumulative computer listing of mining claims in the Mining Claim Recordation System (MCRS) database with locations given by meridian, township, range, and section. A mining claim is considered closed when the claim is relinquished or a formal BLM decision declaring the mining claim null and void has been issued and the appeal period has expired. All other mining claims filed with BLM are considered to be open and actively held. The digital map (figure 1.) with the mining claim density database available in this report are suitable for geographic information system (GIS)-based regional assessments at a scale of 1:100,000 or smaller.
Digital mining claim density map for federal lands in Colorado: 1996

USGS Publications Warehouse

Hyndman, Paul C.; Campbell, Harry W.

1999-01-01

This report describes a digital map generated by the U.S. Geological Survey (USGS) to provide digital spatial mining claim density information for federal lands in Colorado as of March 1997. Mining claim data is earth science information deemed to be relevant to the assessment of historic, current, and future ecological, economic, and social systems. There is no paper map included in this Open-File report. In accordance with the Federal Land Policy and Management Act of 1976 (FLPMA), all unpatented mining claims, mill, and tunnel sites must be recorded at the appropriate BLM State office. BLM maintains a cumulative computer listing of mining claims in the Mining Claim Recordation System (MCRS) database with locations given by meridian, township, range, and section. A mining claim is considered closed when the claim is relinquished or a formal BLM decision declaring the mining claim null and void has been issued and the appeal period has expired. All other mining claims filed with BLM are considered to be open and actively held. The digital map (figure 1.) with the mining claim density database available in this report are suitable for geographic information system (GIS)-based regional assessments at a scale of 1:100,000 or smaller.
Digital mining claim density map for federal lands in California: 1996

USGS Publications Warehouse

Hyndman, Paul C.; Campbell, Harry W.

1999-01-01

This report describes a digital map generated by the U.S. Geological Survey (USGS) to provide digital spatial mining claim density information for federal lands in California as of March 1997. Mining claim data is earth science information deemed to be relevant to the assessment of historic, current, and future ecological, economic, and social systems. There is no paper map included in this Open-File report. In accordance with the Federal Land Policy and Management Act of 1976 (FLPMA), all unpatented mining claims, mill, and tunnel sites must be recorded at the appropriate BLM State office. BLM maintains a cumulative computer listing of mining claims in the MCRS database with locations given by meridian, township, range, and section. A mining claim is considered closed when the claim is relinquished or a formal BLM decision declaring the mining claim null and void has been issued and the appeal period has expired. All other mining claims filed with BLM are considered to be open and actively held. The digital map (figure 1.) with the mining claim density database available in this report are suitable for geographic information system (GIS)-based regional assessments at a scale of 1:100,000 or smaller.
Digital mining claim density map for federal lands in New Mexico: 1996

USGS Publications Warehouse

Hyndman, Paul C.; Campbell, Harry W.

1999-01-01

This report describes a digital map generated by the U.S. Geological Survey (USGS) to provide digital spatial mining claim density information for federal lands in New Mexico as of March 1997. Mining claim data is earth science information deemed to be relevant to the assessment of historic, current, and future ecological, economic, and social systems. There is no paper map included in this Open-File report. In accordance with the Federal Land Policy and Management Act of 1976 (FLPMA), all unpatented mining claims, mill, and tunnel sites must be recorded at the appropriate BLM State office. BLM maintains a cumulative computer listing of mining claims in the MCRS database with locations given by meridian, township, range, and section. A mining claim is considered closed when the claim is relinquished or a formal BLM decision declaring the mining claim null and void has been issued and the appeal period has expired. All other mining claims filed with BLM are considered to be open and actively held. The digital map (figure 1.) with the mining claim density database available in this report are suitable for geographic information system (GIS)-based regional assessments at a scale of 1:100,000 or smaller.
Digital mining claim density map for federal lands in Washington: 1996

USGS Publications Warehouse

Hyndman, Paul C.; Campbell, Harry W.

1999-01-01

This report describes a digital map generated by the U.S. Geological Survey (USGS) to provide digital spatial mining claim density information for federal lands in Washington as of March 1997. Mining claim data is earth science information deemed to be relevant to the assessment of historic, current, and future ecological, economic, and social systems. There is no paper map included in this Open-File report. In accordance with the Federal Land Policy and Management Act of 1976 (FLPMA), all unpatented mining claims, mill, and tunnel sites must be recorded at the appropriate BLM State office. BLM maintains a cumulative computer listing of mining claims in the Mining Claim Recordation System (MCRS) database with locations given by meridian, township, range, and section. A mining claim is considered closed when the claim is relinquished or a formal BLM decision declaring the mining claim null and void has been issued and the appeal period has expired. All other mining claims filed with BLM are considered to be open and actively held. The digital map (figure 1.) with the mining claim density database available in this report are suitable for geographic information system (GIS)-based regional assessments at a scale of 1:100,000 or smaller.

Digital mining claim density map for federal lands in Arizona: 1996

USGS Publications Warehouse

Hyndman, Paul C.; Campbell, Harry W.

1999-01-01

This report describes a digital map generated by the U.S. Geological Survey (USGS) to provide digital spatial mining claim density information for federal lands in Arizona as of March 1997. Mining claim data is earth science information deemed to be relevant to the assessment of historic, current, and future ecological, economic, and social systems. There is no paper map included in this Open-File report. In accordance with the Federal Land Policy and Management Act of 1976 (FLPMA), all unpatented mining claims, mill, and tunnel sites must be recorded at the appropriate BLM State office. BLM maintains a cumulative computer listing of mining claims in the MCRS database with locations given by meridian, township, range, and section. A mining claim is considered closed when the claim is relinquished or a formal BLM decision declaring the mining claim null and void has been issued and the appeal period has expired. All other mining claims filed with BLM are considered to be open and actively held. The digital map (figure 1.) with the mining claim density database available in this report are suitable for geographic information system (GIS)-based regional assessments at a scale of 1:100,000 or smaller.
Patterns and determinants of wood physical and mechanical properties across major tree species in China.

PubMed

Zhu, JiangLing; Shi, Yue; Fang, LeQi; Liu, XingE; Ji, ChengJun

2015-06-01

The physical and mechanical properties of wood affect the growth and development of trees, and also act as the main criteria when determining wood usage. Our understanding on patterns and controls of wood physical and mechanical properties could provide benefits for forestry management and bases for wood application and forest tree breeding. However, current studies on wood properties mainly focus on wood density and ignore other wood physical properties. In this study, we established a comprehensive database of wood physical properties across major tree species in China. Based on this database, we explored spatial patterns and driving factors of wood properties across major tree species in China. Our results showed that (i) compared with wood density, air-dried density, tangential shrinkage coefficient and resilience provide more accuracy and higher explanation power when used as the evaluation index of wood physical properties. (ii) Among life form, climatic and edaphic variables, life form is the dominant factor shaping spatial patterns of wood physical properties, climatic factors the next, and edaphic factors have the least effects, suggesting that the effects of climatic factors on spatial variations of wood properties are indirectly induced by their effects on species distribution.
Assessing hydrologic impacts of future Land Change scenarios in the San Pedro River (U.S./Mexico)

NASA Astrophysics Data System (ADS)

Kepner, W. G.; Burns, S.; Sidman, G.; Levick, L.; Goodrich, D. C.; Guertin, P.; Yee, W.; Scianni, M.

2012-12-01

An approach was developed to characterize the hydrologic impacts of urban expansion through time for the San Pedro River, a watershed of immense international importance that straddles the U.S./Mexico border. Future urban growth is a key driving force altering local and regional hydrology and is represented by decadal changes in housing density maps from 2010 to 2100 derived from the Integrated Climate and Land-Use Scenarios (ICLUS) database. ICLUS developed future housing density maps by adapting the Intergovernmental Panel on Climate Change (IPCC) Special Report on Emissions Scenarios (SRES) social, economic, and demographic storylines to the conterminous United States. To characterize the hydrologic impacts of future growth, the housing density maps were reclassified to National Land Cover Database 2006 land cover classes and used to parameterize the Soil and Water Assessment Tool (SWAT) using the Automated Geospatial Watershed Assessment (AGWA) tool. The presentation will report 1) the methodology for adapting the ICLUS data for use in AGWA as an approach to evaluate basin-wide impacts of development on water-quantity and -quality, 2) initial results of the application of the methodology, and 3) discuss implications of the analysis.
Columba: an integrated database of proteins, structures, and annotations.

PubMed

Trissl, Silke; Rother, Kristian; Müller, Heiko; Steinke, Thomas; Koch, Ina; Preissner, Robert; Frömmel, Cornelius; Leser, Ulf

2005-03-31

Structural and functional research often requires the computation of sets of protein structures based on certain properties of the proteins, such as sequence features, fold classification, or functional annotation. Compiling such sets using current web resources is tedious because the necessary data are spread over many different databases. To facilitate this task, we have created COLUMBA, an integrated database of annotations of protein structures. COLUMBA currently integrates twelve different databases, including PDB, KEGG, Swiss-Prot, CATH, SCOP, the Gene Ontology, and ENZYME. The database can be searched using either keyword search or data source-specific web forms. Users can thus quickly select and download PDB entries that, for instance, participate in a particular pathway, are classified as containing a certain CATH architecture, are annotated as having a certain molecular function in the Gene Ontology, and whose structures have a resolution under a defined threshold. The results of queries are provided in both machine-readable extensible markup language and human-readable format. The structures themselves can be viewed interactively on the web. The COLUMBA database facilitates the creation of protein structure data sets for many structure-based studies. It allows to combine queries on a number of structure-related databases not covered by other projects at present. Thus, information on both many and few protein structures can be used efficiently. The web interface for COLUMBA is available at http://www.columba-db.de.
Ensemble density variational methods with self- and ghost-interaction-corrected functionals

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pastorczak, Ewa; Pernal, Katarzyna, E-mail: pernalk@gmail.com

2014-05-14

Ensemble density functional theory (DFT) offers a way of predicting excited-states energies of atomic and molecular systems without referring to a density response function. Despite a significant theoretical work, practical applications of the proposed approximations have been scarce and they do not allow for a fair judgement of the potential usefulness of ensemble DFT with available functionals. In the paper, we investigate two forms of ensemble density functionals formulated within ensemble DFT framework: the Gross, Oliveira, and Kohn (GOK) functional proposed by Gross et al. [Phys. Rev. A 37, 2809 (1988)] alongside the orbital-dependent eDFT form of the functional introducedmore » by Nagy [J. Phys. B 34, 2363 (2001)] (the acronym eDFT proposed in analogy to eHF – ensemble Hartree-Fock method). Local and semi-local ground-state density functionals are employed in both approaches. Approximate ensemble density functionals contain not only spurious self-interaction but also the so-called ghost-interaction which has no counterpart in the ground-state DFT. We propose how to correct the GOK functional for both kinds of interactions in approximations that go beyond the exact-exchange functional. Numerical applications lead to a conclusion that functionals free of the ghost-interaction by construction, i.e., eDFT, yield much more reliable results than approximate self- and ghost-interaction-corrected GOK functional. Additionally, local density functional corrected for self-interaction employed in the eDFT framework yields excitations energies of the accuracy comparable to that of the uncorrected semi-local eDFT functional.« less
Extension of many-body theory and approximate density functionals to fractional charges and fractional spins.

PubMed

Yang, Weitao; Mori-Sánchez, Paula; Cohen, Aron J

2013-09-14

The exact conditions for density functionals and density matrix functionals in terms of fractional charges and fractional spins are known, and their violation in commonly used functionals has been shown to be the root of many major failures in practical applications. However, approximate functionals are designed for physical systems with integer charges and spins, not in terms of the fractional variables. Here we develop a general framework for extending approximate density functionals and many-electron theory to fractional-charge and fractional-spin systems. Our development allows for the fractional extension of any approximate theory that is a functional of G(0), the one-electron Green's function of the non-interacting reference system. The extension to fractional charge and fractional spin systems is based on the ensemble average of the basic variable, G(0). We demonstrate the fractional extension for the following theories: (1) any explicit functional of the one-electron density, such as the local density approximation and generalized gradient approximations; (2) any explicit functional of the one-electron density matrix of the non-interacting reference system, such as the exact exchange functional (or Hartree-Fock theory) and hybrid functionals; (3) many-body perturbation theory; and (4) random-phase approximations. A general rule for such an extension has also been derived through scaling the orbitals and should be useful for functionals where the link to the Green's function is not obvious. The development thus enables the examination of approximate theories against known exact conditions on the fractional variables and the analysis of their failures in chemical and physical applications in terms of violations of exact conditions of the energy functionals. The present work should facilitate the calculation of chemical potentials and fundamental bandgaps with approximate functionals and many-electron theories through the energy derivatives with respect to the fractional charge. It should play an important role in developing accurate approximate density functionals and many-body theory.
Density-functional theory of spherical electric double layers and zeta potentials of colloidal particles in restricted-primitive-model electrolyte solutions.

PubMed

Yu, Yang-Xin; Wu, Jianzhong; Gao, Guang-Hua

2004-04-15

A density-functional theory is proposed to describe the density profiles of small ions around an isolated colloidal particle in the framework of the restricted primitive model where the small ions have uniform size and the solvent is represented by a dielectric continuum. The excess Helmholtz energy functional is derived from a modified fundamental measure theory for the hard-sphere repulsion and a quadratic functional Taylor expansion for the electrostatic interactions. The theoretical predictions are in good agreement with the results from Monte Carlo simulations and from previous investigations using integral-equation theory for the ionic density profiles and the zeta potentials of spherical particles at a variety of solution conditions. Like the integral-equation approaches, the density-functional theory is able to capture the oscillatory density profiles of small ions and the charge inversion (overcharging) phenomena for particles with elevated charge density. In particular, our density-functional theory predicts the formation of a second counterion layer near the surface of highly charged spherical particle. Conversely, the nonlinear Poisson-Boltzmann theory and its variations are unable to represent the oscillatory behavior of small ion distributions and charge inversion. Finally, our density-functional theory predicts charge inversion even in a 1:1 electrolyte solution as long as the salt concentration is sufficiently high. (c) 2004 American Institute of Physics.
The DNA Data Bank of Japan launches a new resource, the DDBJ Omics Archive of functional genomics experiments.

PubMed

Kodama, Yuichi; Mashima, Jun; Kaminuma, Eli; Gojobori, Takashi; Ogasawara, Osamu; Takagi, Toshihisa; Okubo, Kousaku; Nakamura, Yasukazu

2012-01-01

The DNA Data Bank of Japan (DDBJ; http://www.ddbj.nig.ac.jp) maintains and provides archival, retrieval and analytical resources for biological information. The central DDBJ resource consists of public, open-access nucleotide sequence databases including raw sequence reads, assembly information and functional annotation. Database content is exchanged with EBI and NCBI within the framework of the International Nucleotide Sequence Database Collaboration (INSDC). In 2011, DDBJ launched two new resources: the 'DDBJ Omics Archive' (DOR; http://trace.ddbj.nig.ac.jp/dor) and BioProject (http://trace.ddbj.nig.ac.jp/bioproject). DOR is an archival database of functional genomics data generated by microarray and highly parallel new generation sequencers. Data are exchanged between the ArrayExpress at EBI and DOR in the common MAGE-TAB format. BioProject provides an organizational framework to access metadata about research projects and the data from the projects that are deposited into different databases. In this article, we describe major changes and improvements introduced to the DDBJ services, and the launch of two new resources: DOR and BioProject.
Subsystem density functional theory with meta-generalized gradient approximation exchange-correlation functionals.

PubMed

Śmiga, Szymon; Fabiano, Eduardo; Laricchia, Savio; Constantin, Lucian A; Della Sala, Fabio

2015-04-21

We analyze the methodology and the performance of subsystem density functional theory (DFT) with meta-generalized gradient approximation (meta-GGA) exchange-correlation functionals for non-bonded molecular systems. Meta-GGA functionals depend on the Kohn-Sham kinetic energy density (KED), which is not known as an explicit functional of the density. Therefore, they cannot be directly applied in subsystem DFT calculations. We propose a Laplacian-level approximation to the KED which overcomes this limitation and provides a simple and accurate way to apply meta-GGA exchange-correlation functionals in subsystem DFT calculations. The so obtained density and energy errors, with respect to the corresponding supermolecular calculations, are comparable with conventional approaches, depending almost exclusively on the approximations in the non-additive kinetic embedding term. An embedding energy error decomposition explains the accuracy of our method.
ANCAC: amino acid, nucleotide, and codon analysis of COGs--a tool for sequence bias analysis in microbial orthologs.

PubMed

Meiler, Arno; Klinger, Claudia; Kaufmann, Michael

2012-09-08

The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC's NUCOCOG dataset as the largest one available for that purpose thus far. Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills.
ANCAC: amino acid, nucleotide, and codon analysis of COGs – a tool for sequence bias analysis in microbial orthologs

PubMed Central

2012-01-01

Background The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Results Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC’s NUCOCOG dataset as the largest one available for that purpose thus far. Conclusions Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills. PMID:22958836
Modern pollen data from North America and Greenland for multi-scale paleoenvironmental applications

USGS Publications Warehouse

Whitmore, J.; Gajewski, K.; Sawada, M.; Williams, J.W.; Shuman, B.; Bartlein, P.J.; Minckley, T.; Viau, A.E.; Webb, T.; Shafer, S.; Anderson, P.; Brubaker, L.

2005-01-01

The modern pollen network in North America and Greenland is presented as a database for use in quantitative calibration studies and paleoenvironmental reconstructions. The georeferenced database includes 4634 samples from all regions of the continent and 134 pollen taxa that range from ubiquitous to regionally diagnostic taxa. Climate data and vegetation characteristics were assigned to every site. Automated and manual procedures were used to verify the accuracy of geographic coordinates and identify duplicate records among datasets, incomplete pollen sums, and other potential errors. Data are currently available for almost all of North America, with variable density. Pollen taxonomic diversity, as measured by the Shannon-Weiner coefficient, varies as a function of location, as some vegetation regions are dominated by one or two major pollen producers, while other regions have a more even composition of pollen taxa. Squared-chord distances computed between samples show that most modern pollen samples find analogues within their own vegetation zone. Both temperature and precipitation inferred from best analogues are highly correlated with observed values but temperature exhibits the strongest relation. Maps of the contemporary distribution of several pollen types in relation to the range of the plant taxon illustrate the correspondence between plant and pollen ranges. ?? 2005 Elsevier Ltd. All rights reserved.
High-throughput screening for thermoelectric sulphides by using crystal structure features as descriptors

NASA Astrophysics Data System (ADS)

Zhang, Ruizhi; Du, Baoli; Chen, Kan; Reece, Mike; Materials Research Insititute Team

With the increasing computational power and reliable databases, high-throughput screening is playing a more and more important role in the search of new thermoelectric materials. Rather than the well established density functional theory (DFT) calculation based methods, we propose an alternative approach to screen for new TE materials: using crystal structural features as 'descriptors'. We show that a non-distorted transition metal sulphide polyhedral network can be a good descriptor for high power factor according to crystal filed theory. By using Cu/S containing compounds as an example, 1600+ Cu/S containing entries in the Inorganic Crystal Structure Database (ICSD) were screened, and of those 84 phases are identified as promising thermoelectric materials. The screening results are validated by both electronic structure calculations and experimental results from the literature. We also fabricated some new compounds to test our screening results. Another advantage of using crystal structure features as descriptors is that we can easily establish structural relationships between the identified phases. Based on this, two material design approaches are discussed: 1) High-pressure synthesis of metastable phase; 2) In-situ 2-phase composites with coherent interface. This work was supported by a Marie Curie International Incoming Fellowship of the European Community Human Potential Program.
High-Throughput Combinatorial Development of High-Entropy Alloys For Light-Weight Structural Applications

DOE Office of Scientific and Technical Information (OSTI.GOV)

Van Duren, Jeroen K; Koch, Carl; Luo, Alan

The primary limitation of today’s lightweight structural alloys is that specific yield strengths (SYS) higher than 200MPa x cc/g (typical value for titanium alloys) are extremely difficult to achieve. This holds true especially at a cost lower than 5dollars/kg (typical value for magnesium alloys). Recently, high-entropy alloys (HEA) have shown promising SYS, yet the large composition space of HEA makes screening compositions complex and time-consuming. Over the course of this 2-year project we started from 150 billion compositions and reduced the number of potential low-density (<5g/cc), low-cost (<5dollars/kg) high-entropy alloy (LDHEA) candidates that are single-phase, disordered, solid-solution (SPSS) to amore » few thousand compositions. This was accomplished by means of machine learning to guide design for SPSS LDHEA based on a combination of recursive partitioning, an extensive, experimental HEA database compiled from 24 literature sources, and 91 calculated parameters serving as phenomenological selection rules. Machine learning shows an accuracy of 82% in identifying which compositions of a separate, smaller, experimental HEA database are SPSS HEA. Calculation of Phase Diagrams (CALPHAD) shows an accuracy of 71-77% for the alloys supported by the CALPHAD database, where 30% of the compiled HEA database is not supported by CALPHAD. In addition to machine learning, and CALPHAD, a third tool was developed to aid design of SPSS LDHEA. Phase diagrams were calculated by constructing the Gibbs-free energy convex hull based on easily accessible enthalpy and entropy terms. Surprisingly, accuracy was 78%. Pursuing these LDHEA candidates by high-throughput experimental methods resulted in SPSS LDHEA composed of transition metals (e.g. Cr, Mn, Fe, Ni, Cu) alloyed with Al, yet the high concentration of Al, necessary to bring the mass density below 5.0g/cc, makes these materials hard and brittle, body-centered-cubic (BCC) alloys. A related, yet multi-phase BCC alloy, based on Al-Cr-Fe-Ni, shows compressive strain >10% and specific compressive yield strength of 229 MPa x cc/g, yet does not show ductility in tensile tests due to cleavage. When replacing Cr in Al-Cr-Fe-based 4- and 5-element LDHEA with Mn, hardness drops 2x. Combined with compression test results, including those on the ternaries Al-Cr-Fe and Al-Mn-Fe suggest that Al-Mn-Fe-based LDHEA are still worth pursuing. These initial results only represent one compressive stress-strain curve per composition without any property optimization. As such, reproducibility needs to be followed by optimization to show their full potential. When including Li, Mg, and Zn, single-phase Li-Mg-Al-Ti-Zn LDHEA has been found with a specific ultimate compressive strength of 289MPa x cc/g. Al-Ti-Mn-Zn showed a specific ultimate compressive strength of 73MPa x cc/g. These initial results after hot isostatic pressing (HIP) of the ball-milled powders represent the lower end of what is possible, since no secondary processing (e.g. extrusion) has been performed to optimize strength and ductility. Compositions for multi-phase (e.g. dual-phase) LDHEA were identified largely by automated searches through CALPHAD databases, while screening for large face-centered-cubic (FCC) volume fractions, followed by experimental verification. This resulted in several new alloys. Li-Mg-Al-Mn-Fe and Mg-Mn-Fe-Co ball-milled powders upon HIP show specific ultimate compressive strengths of 198MPa x cc/g and 45MPa x cc/g, respectively. Several malleable quarternary Al-Zn-based alloys have been found upon arc/induction melting, yet with limited specific compressive yield strength (<75 MPa x cc/g). These initial results are all without any optimization for strength and/or ductility. High-throughput experimentation allowed us to triple the existing experimental HEA database as published in the past 10 years in less than 2 years which happened at a rate 10x higher than previous methods. Furthermore, we showed that high-throughput thin-film combinatorial methods can be used to get insight in isothermal phase diagram slices. Although it is straightforward to map hardness as a function of composition for sputtered, thin-film, compositional gradients by nano-indentation and compare the results to micro-indentation on bulk samples, the simultaneous impact of composition, roughness, film density, and microstructure on hardness requires monitoring all these properties as a function of location on the compositional gradient, including dissecting the impact of these 4 factors on the hardness map. These additional efforts impact throughput significantly. This work shows that a lot of progress has been made over the years in predicting phase formation that aids the discovery of new alloys, yet that a lot of work needs to be done to predict phases more accurately for LDHEA, whether done by CALPHAD or by other means. More importantly, more work needs to be done to predict mechanical properties of novel alloys, like yield strength, and ductility. Furthermore, this work shows that there is a need for the generation of an empirical alloy database covering strategic points in a multi-dimensional composition space to allow for faster and more accurate predictive interpolations to identify the oasis in the dessert more quickly. Finally, this work suggests that it is worth pursuing a ductile alloy with a SYS > 300 MPa x cc/g in a mass density range of 6-7 g/cc, since the chances for a single-phase or majority-phase FCC increase significantly. Today’s lightweight steels are in this density range.« less
Aviation Safety Issues Database

NASA Technical Reports Server (NTRS)

Morello, Samuel A.; Ricks, Wendell R.

2009-01-01

The aviation safety issues database was instrumental in the refinement and substantiation of the National Aviation Safety Strategic Plan (NASSP). The issues database is a comprehensive set of issues from an extremely broad base of aviation functions, personnel, and vehicle categories, both nationally and internationally. Several aviation safety stakeholders such as the Commercial Aviation Safety Team (CAST) have already used the database. This broader interest was the genesis to making the database publically accessible and writing this report.
Development of a Dynamic Visco-elastic Vehicle-Soil Interaction Model for Rut Depth, and Power Determinations

DTIC Science & Technology

2011-09-06

Presentation Outline A) Review of Soil Model governing equations B) Development of pedo -transfer functions (terrain database to engineering properties) C...lateral earth pressure) UNCLASSIFIED B) Development of pedo -transfer functions Engineering parameters needed by soil model - compression index - rebound...inches, RCI for fine- grained soils, CI for coarse-grained soils. UNCLASSIFIED Pedo -transfer function • Need to transfer existing terrain database
MultitaskProtDB: a database of multitasking proteins

PubMed Central

Hernández, Sergio; Ferragut, Gabriela; Amela, Isaac; Perez-Pons, JosepAntoni; Piñol, Jaume; Mozo-Villarias, Angel; Cedano, Juan; Querol, Enrique

2014-01-01

We have compiled MultitaskProtDB, available online at http://wallace.uab.es/multitask, to provide a repository where the many multitasking proteins found in the literature can be stored. Multitasking or moonlighting is the capability of some proteins to execute two or more biological functions. Usually, multitasking proteins are experimentally revealed by serendipity. This ability of proteins to perform multitasking functions helps us to understand one of the ways used by cells to perform many complex functions with a limited number of genes. Even so, the study of this phenomenon is complex because, among other things, there is no database of moonlighting proteins. The existence of such a tool facilitates the collection and dissemination of these important data. This work reports the database, MultitaskProtDB, which is designed as a friendly user web page containing >288 multitasking proteins with their NCBI and UniProt accession numbers, canonical and additional biological functions, monomeric/oligomeric states, PDB codes when available and bibliographic references. This database also serves to gain insight into some characteristics of multitasking proteins such as frequencies of the different pairs of functions, phylogenetic conservation and so forth. PMID:24253302
A Gaussian Approximation Potential for Silicon

NASA Astrophysics Data System (ADS)

Bernstein, Noam; Bartók, Albert; Kermode, James; Csányi, Gábor

We present an interatomic potential for silicon using the Gaussian Approximation Potential (GAP) approach, which uses the Gaussian process regression method to approximate the reference potential energy surface as a sum of atomic energies. Each atomic energy is approximated as a function of the local environment around the atom, which is described with the smooth overlap of atomic environments (SOAP) descriptor. The potential is fit to a database of energies, forces, and stresses calculated using density functional theory (DFT) on a wide range of configurations from zero and finite temperature simulations. These include crystalline phases, liquid, amorphous, and low coordination structures, and diamond-structure point defects, dislocations, surfaces, and cracks. We compare the results of the potential to DFT calculations, as well as to previously published models including Stillinger-Weber, Tersoff, modified embedded atom method (MEAM), and ReaxFF. We show that it is very accurate as compared to the DFT reference results for a wide range of properties, including low energy bulk phases, liquid structure, as well as point, line, and plane defects in the diamond structure.
Computational screening of organic materials towards improved photovoltaic properties

NASA Astrophysics Data System (ADS)

Dai, Shuo; Olivares-Amaya, Roberto; Amador-Bedolla, Carlos; Aspuru-Guzik, Alan; Borunda, Mario

2015-03-01

The world today faces an energy crisis that is an obstruction to the development of the human civilization. One of the most promising solutions is solar energy harvested by economical solar cells. Being the third generation of solar cell materials, organic photovoltaic (OPV) materials is now under active development from both theoretical and experimental points of view. In this study, we constructed a parameter to select the desired molecules based on their optical spectra performance. We applied it to investigate a large collection of potential OPV materials, which were from the CEPDB database set up by the Harvard Clean Energy Project. Time dependent density functional theory (TD-DFT) modeling was used to calculate the absorption spectra of the molecules. Then based on the parameter, we screened out the top performing molecules for their potential OPV usage and suggested experimental efforts toward their synthesis. In addition, from those molecules, we summarized the functional groups that provided molecules certain spectrum capability. It is hoped that useful information could be mined out to provide hints to molecular design of OPV materials.
Analysis of Lunar Highland Regolith Samples From Apollo 16 Drive Core 64001/2 and Lunar Regolith Simulants - an Expanding Comparative Database

NASA Technical Reports Server (NTRS)

Schrader, Christian M.; Rickman, Doug; Stoeser, Douglas; Wentworth, Susan; McKay, Dave S.; Botha, Pieter; Butcher, Alan R.; Horsch, Hanna E.; Benedictus, Aukje; Gottlieb, Paul

2008-01-01

This slide presentation reviews the work to analyze the lunar highland regolith samples that came from the Apollo 16 core sample 64001/2 and simulants of lunar regolith, and build a comparative database. The work is part of a larger effort to compile an internally consistent database on lunar regolith (Apollo Samples) and lunar regolith simulants. This is in support of a future lunar outpost. The work is to characterize existing lunar regolith and simulants in terms of particle type, particle size distribution, particle shape distribution, bulk density, and other compositional characteristics, and to evaluate the regolith simulants by the same properties in comparison to the Apollo sample lunar regolith.

Migration of legacy mumps applications to relational database servers.

PubMed

O'Kane, K C

2001-07-01

An extended implementation of the Mumps language is described that facilitates vendor neutral migration of legacy Mumps applications to SQL-based relational database servers. Implemented as a compiler, this system translates Mumps programs to operating system independent, standard C code for subsequent compilation to fully stand-alone, binary executables. Added built-in functions and support modules extend the native hierarchical Mumps database with access to industry standard, networked, relational database management servers (RDBMS) thus freeing Mumps applications from dependence upon vendor specific, proprietary, unstandardized database models. Unlike Mumps systems that have added captive, proprietary RDMBS access, the programs generated by this development environment can be used with any RDBMS system that supports common network access protocols. Additional features include a built-in web server interface and the ability to interoperate directly with programs and functions written in other languages.
Locality of correlation in density functional theory.

PubMed

Burke, Kieron; Cancio, Antonio; Gould, Tim; Pittalis, Stefano

2016-08-07

The Hohenberg-Kohn density functional was long ago shown to reduce to the Thomas-Fermi (TF) approximation in the non-relativistic semiclassical (or large-Z) limit for all matter, i.e., the kinetic energy becomes local. Exchange also becomes local in this limit. Numerical data on the correlation energy of atoms support the conjecture that this is also true for correlation, but much less relevant to atoms. We illustrate how expansions around a large particle number are equivalent to local density approximations and their strong relevance to density functional approximations. Analyzing highly accurate atomic correlation energies, we show that EC → -AC ZlnZ + BCZ as Z → ∞, where Z is the atomic number, AC is known, and we estimate BC to be about 37 mhartree. The local density approximation yields AC exactly, but a very incorrect value for BC, showing that the local approximation is less relevant for the correlation alone. This limit is a benchmark for the non-empirical construction of density functional approximations. We conjecture that, beyond atoms, the leading correction to the local density approximation in the large-Z limit generally takes this form, but with BC a functional of the TF density for the system. The implications for the construction of approximate density functionals are discussed.
Development and in silico evaluation of large-scale metabolite identification methods using functional group detection for metabolomics

PubMed Central

Mitchell, Joshua M.; Fan, Teresa W.-M.; Lane, Andrew N.; Moseley, Hunter N. B.

2014-01-01

Large-scale identification of metabolites is key to elucidating and modeling metabolism at the systems level. Advances in metabolomics technologies, particularly ultra-high resolution mass spectrometry (MS) enable comprehensive and rapid analysis of metabolites. However, a significant barrier to meaningful data interpretation is the identification of a wide range of metabolites including unknowns and the determination of their role(s) in various metabolic networks. Chemoselective (CS) probes to tag metabolite functional groups combined with high mass accuracy provide additional structural constraints for metabolite identification and quantification. We have developed a novel algorithm, Chemically Aware Substructure Search (CASS) that efficiently detects functional groups within existing metabolite databases, allowing for combined molecular formula and functional group (from CS tagging) queries to aid in metabolite identification without a priori knowledge. Analysis of the isomeric compounds in both Human Metabolome Database (HMDB) and KEGG Ligand demonstrated a high percentage of isomeric molecular formulae (43 and 28%, respectively), indicating the necessity for techniques such as CS-tagging. Furthermore, these two databases have only moderate overlap in molecular formulae. Thus, it is prudent to use multiple databases in metabolite assignment, since each major metabolite database represents different portions of metabolism within the biosphere. In silico analysis of various CS-tagging strategies under different conditions for adduct formation demonstrate that combined FT-MS derived molecular formulae and CS-tagging can uniquely identify up to 71% of KEGG and 37% of the combined KEGG/HMDB database vs. 41 and 17%, respectively without adduct formation. This difference between database isomer disambiguation highlights the strength of CS-tagging for non-lipid metabolite identification. However, unique identification of complex lipids still needs additional information. PMID:25120557
Phosphorus allotropes: Stability of black versus red phosphorus re-examined by means of the van der Waals inclusive density functional method

NASA Astrophysics Data System (ADS)

Aykol, Muratahan; Doak, Jeff W.; Wolverton, C.

2017-06-01

We evaluate the energetic stabilities of white, red, and black allotropes of phosphorus using density functional theory (DFT) and hybrid functional methods, van der Waals (vdW) corrections (DFT+vdW and hybrid+vdW), vdW density functionals, and random phase approximation (RPA). We find that stability of black phosphorus over red-V (i.e., the violet form) is not ubiquitous among these methods, and the calculated enthalpies for the reaction phosphorus (red-V)→phosphorus (black) are scattered between -20 and 40 meV/atom. With local density and generalized gradient approximations, and hybrid functionals, mean absolute errors (MAEs) in densities of P allotropes relative to experiments are found to be around 10%-25%, whereas with vdW-inclusive methods, MAEs in densities drop below ˜5 %. While the inconsistency among the density functional methods could not shed light on the stability puzzle of black versus red phosphorus, comparison of their accuracy in predicting densities and the supplementary RPA results on relative stabilities indicate that opposite to the common belief, black and red phosphorus are almost degenerate, or the red-V (violet) form of phosphorus might even be the ground state.
Unstable density distribution associated with equatorial plasma bubble

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kherani, E. A., E-mail: esfhan.kherani@inpe.br; Meneses, F. Carlos de; Bharuthram, R.

2016-04-15

In this work, we present a simulation study of equatorial plasma bubble (EPB) in the evening time ionosphere. The fluid simulation is performed with a high grid resolution, enabling us to probe the steepened updrafting density structures inside EPB. Inside the density depletion that eventually evolves as EPB, both density and updraft are functions of space from which the density as implicit function of updraft velocity or the density distribution function is constructed. In the present study, this distribution function and the corresponding probability distribution function are found to evolve from Maxwellian to non-Maxwellian as the initial small depletion growsmore » to EPB. This non-Maxwellian distribution is of a gentle-bump type, in confirmation with the recently reported distribution within EPB from space-borne measurements that offer favorable condition for small scale kinetic instabilities.« less
The Danish Testicular Cancer database.

PubMed

Daugaard, Gedske; Kier, Maria Gry Gundgaard; Bandak, Mikkel; Mortensen, Mette Saksø; Larsson, Heidi; Søgaard, Mette; Toft, Birgitte Groenkaer; Engvad, Birte; Agerbæk, Mads; Holm, Niels Vilstrup; Lauritsen, Jakob

2016-01-01

The nationwide Danish Testicular Cancer database consists of a retrospective research database (DaTeCa database) and a prospective clinical database (Danish Multidisciplinary Cancer Group [DMCG] DaTeCa database). The aim is to improve the quality of care for patients with testicular cancer (TC) in Denmark, that is, by identifying risk factors for relapse, toxicity related to treatment, and focusing on late effects. All Danish male patients with a histologically verified germ cell cancer diagnosis in the Danish Pathology Registry are included in the DaTeCa databases. Data collection has been performed from 1984 to 2007 and from 2013 onward, respectively. The retrospective DaTeCa database contains detailed information with more than 300 variables related to histology, stage, treatment, relapses, pathology, tumor markers, kidney function, lung function, etc. A questionnaire related to late effects has been conducted, which includes questions regarding social relationships, life situation, general health status, family background, diseases, symptoms, use of medication, marital status, psychosocial issues, fertility, and sexuality. TC survivors alive on October 2014 were invited to fill in this questionnaire including 160 validated questions. Collection of questionnaires is still ongoing. A biobank including blood/sputum samples for future genetic analyses has been established. Both samples related to DaTeCa and DMCG DaTeCa database are included. The prospective DMCG DaTeCa database includes variables regarding histology, stage, prognostic group, and treatment. The DMCG DaTeCa database has existed since 2013 and is a young clinical database. It is necessary to extend the data collection in the prospective database in order to answer quality-related questions. Data from the retrospective database will be added to the prospective data. This will result in a large and very comprehensive database for future studies on TC patients.
Study of gravity waves propagation in the thermosphere of Mars based on MAVEN/NGIMS density measurements

NASA Astrophysics Data System (ADS)

Vals, M.

2017-09-01

We use MAVEN/NGIMS CO2 density measurements to analyse gravity waves in the thermosphere of Mars. In particular the seasonal/latitudinal variability of their amplitude is studied and interpreted. Key background parameters controlling the activity of gravity waves are analysed with the help of the Mars Climate Database (MCD). Gravity waves activity presents a good anti-correlation to the temperature variability retrieved from the MCD. An analysis at pressure levels is ongoing.
The Australian National Bibliographic Database and the Functional Requirements for the Bibliographic Database (FRBR)

ERIC Educational Resources Information Center

Rajapatirana, Bemal; Missingham, Roxanne

2005-01-01

The development of the conceptual "Functional Requirements for Bibliographic Records" (FRBR) Model enables records to be considered in terms of contextual relationships. Developments in software can capitalise on this to significantly improve the display of works through surfacing of these relationships. This paper reports on an investigation of…
Density Functionals of Chemical Bonding

PubMed Central

Putz, Mihai V.

2008-01-01

The behavior of electrons in general many-electronic systems throughout the density functionals of energy is reviewed. The basic physico-chemical concepts of density functional theory are employed to highlight the energy role in chemical structure while its extended influence in electronic localization function helps in chemical bonding understanding. In this context the energy functionals accompanied by electronic localization functions may provide a comprehensive description of the global-local levels electronic structures in general and of chemical bonds in special. Becke-Edgecombe and author’s Markovian electronic localization functions are discussed at atomic, molecular and solid state levels. Then, the analytical survey of the main workable kinetic, exchange, and correlation density functionals within local and gradient density approximations is undertaken. The hierarchy of various energy functionals is formulated by employing both the parabolic and statistical correlation degree of them with the electronegativity and chemical hardness indices by means of quantitative structure-property relationship (QSPR) analysis for basic atomic and molecular systems. PMID:19325846
Bypassing the Kohn-Sham equations with machine learning.

PubMed

Brockherde, Felix; Vogt, Leslie; Li, Li; Tuckerman, Mark E; Burke, Kieron; Müller, Klaus-Robert

2017-10-11

Last year, at least 30,000 scientific papers used the Kohn-Sham scheme of density functional theory to solve electronic structure problems in a wide variety of scientific fields. Machine learning holds the promise of learning the energy functional via examples, bypassing the need to solve the Kohn-Sham equations. This should yield substantial savings in computer time, allowing larger systems and/or longer time-scales to be tackled, but attempts to machine-learn this functional have been limited by the need to find its derivative. The present work overcomes this difficulty by directly learning the density-potential and energy-density maps for test systems and various molecules. We perform the first molecular dynamics simulation with a machine-learned density functional on malonaldehyde and are able to capture the intramolecular proton transfer process. Learning density models now allows the construction of accurate density functionals for realistic molecular systems.Machine learning allows electronic structure calculations to access larger system sizes and, in dynamical simulations, longer time scales. Here, the authors perform such a simulation using a machine-learned density functional that avoids direct solution of the Kohn-Sham equations.
Adding glycaemic index and glycaemic load functionality to DietPLUS, a Malaysian food composition database and diet intake calculator.

PubMed

Shyam, Sangeetha; Wai, Tony Ng Kock; Arshad, Fatimah

2012-01-01

This paper outlines the methodology to add glycaemic index (GI) and glycaemic load (GL) functionality to food DietPLUS, a Microsoft Excel-based Malaysian food composition database and diet intake calculator. Locally determined GI values and published international GI databases were used as the source of GI values. Previously published methodology for GI value assignment was modified to add GI and GL calculators to the database. Two popular local low GI foods were added to the DietPLUS database, bringing up the total number of foods in the database to 838 foods. Overall, in relation to the 539 major carbohydrate foods in the Malaysian Food Composition Database, 243 (45%) food items had local Malaysian values or were directly matched to International GI database and another 180 (33%) of the foods were linked to closely-related foods in the GI databases used. The mean ± SD dietary GI and GL of the dietary intake of 63 women with previous gestational diabetes mellitus, calculated using DietPLUS version3 were, 62 ± 6 and 142 ± 45, respectively. These values were comparable to those reported from other local studies. DietPLUS version3, a simple Microsoft Excel-based programme aids calculation of diet GI and GL for Malaysian diets based on food records.
AIM: A comprehensive Arabidopsis Interactome Module database and related interologs in plants

USDA-ARS?s Scientific Manuscript database

Systems biology analysis of protein modules is important for understanding the functional relationships between proteins in the interactome. Here, we present a comprehensive database named AIM for Arabidopsis (Arabidopsis thaliana) interactome modules. The database contains almost 250,000 modules th...
Cell survival fraction estimation based on the probability densities of domain and cell nucleus specific energies using improved microdosimetric kinetic models.

PubMed

Sato, Tatsuhiko; Furusawa, Yoshiya

2012-10-01

Estimation of the survival fractions of cells irradiated with various particles over a wide linear energy transfer (LET) range is of great importance in the treatment planning of charged-particle therapy. Two computational models were developed for estimating survival fractions based on the concept of the microdosimetric kinetic model. They were designated as the double-stochastic microdosimetric kinetic and stochastic microdosimetric kinetic models. The former model takes into account the stochastic natures of both domain and cell nucleus specific energies, whereas the latter model represents the stochastic nature of domain specific energy by its approximated mean value and variance to reduce the computational time. The probability densities of the domain and cell nucleus specific energies are the fundamental quantities for expressing survival fractions in these models. These densities are calculated using the microdosimetric and LET-estimator functions implemented in the Particle and Heavy Ion Transport code System (PHITS) in combination with the convolution or database method. Both the double-stochastic microdosimetric kinetic and stochastic microdosimetric kinetic models can reproduce the measured survival fractions for high-LET and high-dose irradiations, whereas a previously proposed microdosimetric kinetic model predicts lower values for these fractions, mainly due to intrinsic ignorance of the stochastic nature of cell nucleus specific energies in the calculation. The models we developed should contribute to a better understanding of the mechanism of cell inactivation, as well as improve the accuracy of treatment planning of charged-particle therapy.
Multiple Point Dynamic Gas Density Measurements Using Molecular Rayleigh Scattering

NASA Technical Reports Server (NTRS)

Seasholtz, Richard; Panda, Jayanta

1999-01-01

A nonintrusive technique for measuring dynamic gas density properties is described. Molecular Rayleigh scattering is used to measure the time-history of gas density simultaneously at eight spatial locations at a 50 kHz sampling rate. The data are analyzed using the Welch method of modified periodograms to reduce measurement uncertainty. Cross-correlations, power spectral density functions, cross-spectral density functions, and coherence functions may be obtained from the data. The technique is demonstrated using low speed co-flowing jets with a heated inner jet.
Dimensions of biodiversity loss: Spatial mismatch in land-use impacts on species, functional and phylogenetic diversity of European bees.

PubMed

De Palma, Adriana; Kuhlmann, Michael; Bugter, Rob; Ferrier, Simon; Hoskins, Andrew J; Potts, Simon G; Roberts, Stuart P M; Schweiger, Oliver; Purvis, Andy

2017-12-01

Agricultural intensification and urbanization are important drivers of biodiversity change in Europe. Different aspects of bee community diversity vary in their sensitivity to these pressures, as well as independently influencing ecosystem service provision (pollination). To obtain a more comprehensive understanding of human impacts on bee diversity across Europe, we assess multiple, complementary indices of diversity. One Thousand four hundred and forty six sites across Europe. We collated data on bee occurrence and abundance from the published literature and supplemented them with the PREDICTS database. Using Rao's Quadratic Entropy, we assessed how species, functional and phylogenetic diversity of 1,446 bee communities respond to land-use characteristics including land-use class, cropland intensity, human population density and distance to roads. We combined these models with statistically downscaled estimates of land use in 2005 to estimate and map-at a scale of approximately 1 km 2 -the losses in diversity relative to semi-natural/natural baseline (the predicted diversity of an uninhabited grid square, consisting only of semi-natural/natural vegetation). We show that-relative to the predicted local diversity in uninhabited semi-natural/natural habitat-half of all EU27 countries have lost over 10% of their average local species diversity and two-thirds of countries have lost over 5% of their average local functional and phylogenetic diversity. All diversity measures were generally lower in pasture and higher-intensity cropland than in semi-natural/natural vegetation, but facets of diversity showed less consistent responses to human population density. These differences have led to marked spatial mismatches in losses: losses in phylogenetic diversity were in some areas almost 20 percentage points (pp.) more severe than losses in species diversity, but in other areas losses were almost 40 pp. less severe. These results highlight the importance of exploring multiple measures of diversity when prioritizing and evaluating conservation actions, as species-diverse assemblages may be phylogenetically and functionally impoverished, potentially threatening pollination service provision.
RNAcentral: an international database of ncRNA sequences

DOE PAGES

Williams, Kelly Porter

2014-10-28

The field of non-coding RNA biology has been hampered by the lack of availability of a comprehensive, up-to-date collection of accessioned RNA sequences. Here we present the first release of RNAcentral, a database that collates and integrates information from an international consortium of established RNA sequence databases. The initial release contains over 8.1 million sequences, including representatives of all major functional classes. A web portal (http://rnacentral.org) provides free access to data, search functionality, cross-references, source code and an integrated genome browser for selected species.
Statistics of primordial density perturbations from discrete seed masses

NASA Technical Reports Server (NTRS)

Scherrer, Robert J.; Bertschinger, Edmund

1991-01-01

The statistics of density perturbations for general distributions of seed masses with arbitrary matter accretion is examined. Formal expressions for the power spectrum, the N-point correlation functions, and the density distribution function are derived. These results are applied to the case of uncorrelated seed masses, and power spectra are derived for accretion of both hot and cold dark matter plus baryons. The reduced moments (cumulants) of the density distribution are computed and used to obtain a series expansion for the density distribution function. Analytic results are obtained for the density distribution function in the case of a distribution of seed masses with a spherical top-hat accretion pattern. More generally, the formalism makes it possible to give a complete characterization of the statistical properties of any random field generated from a discrete linear superposition of kernels. In particular, the results can be applied to density fields derived by smoothing a discrete set of points with a window function.
Two-Component Noncollinear Time-Dependent Spin Density Functional Theory for Excited State Calculations.

PubMed

Egidi, Franco; Sun, Shichao; Goings, Joshua J; Scalmani, Giovanni; Frisch, Michael J; Li, Xiaosong

2017-06-13

We present a linear response formalism for the description of the electronic excitations of a noncollinear reference defined via Kohn-Sham spin density functional methods. A set of auxiliary variables, defined using the density and noncollinear magnetization density vector, allows the generalization of spin density functional kernels commonly used in collinear DFT to noncollinear cases, including local density, GGA, meta-GGA and hybrid functionals. Working equations and derivations of functional second derivatives with respect to the noncollinear density, required in the linear response noncollinear TDDFT formalism, are presented in this work. This formalism takes all components of the spin magnetization into account independent of the type of reference state (open or closed shell). As a result, the method introduced here is able to afford a nonzero local xc torque on the spin magnetization while still satisfying the zero-torque theorem globally. The formalism is applied to a few test cases using the variational exact-two-component reference including spin-orbit coupling to illustrate the capabilities of the method.
FunShift: a database of function shift analysis on protein subfamilies

PubMed Central

Abhiman, Saraswathi; Sonnhammer, Erik L. L.

2005-01-01

Members of a protein family normally have a general biochemical function in common, but frequently one or more subgroups have evolved a slightly different function, such as different substrate specificity. It is important to detect such function shifts for a more accurate functional annotation. The FunShift database described here is a compilation of function shift analysis performed between subfamilies in protein families. It consists of two main components: (i) subfamilies derived from protein domain families and (ii) pairwise subfamily comparisons analyzed for function shift. The present release, FunShift 12, was derived from Pfam 12 and consists of 151 934 subfamilies derived from 7300 families. We carried out function shift analysis by two complementary methods on families with up to 500 members. From a total of 179 210 subfamily pairs, 62 384 were predicted to be functionally shifted in 2881 families. Each subfamily pair is provided with a markup of probable functional specificity-determining sites. Tools for searching and exploring the data are provided to make this database a valuable resource for protein function annotation. Knowledge of these functionally important sites will be useful for experimental biologists performing functional mutation studies. FunShift is available at http://FunShift.cgb.ki.se. PMID:15608176
Work-function calculations for a symmetrical total-charge-density profile at the metallic surface

NASA Astrophysics Data System (ADS)

Wojciechowski, K. F.; Sobańska-Nowotnik, M.

1983-07-01

It is shown that, if the total-charge-density profile nT(x) at the surface of jellium satisfies the Budd-Vannimenus constraint and also is a symmetrical function of x, relative to the ordinate axis, then the work-function variation versus the Wigner-Seitz radius rs does not depend on the form of nT(x). Also the simple linear-density profile is used to calculate the work function by application of the variational principle for the energy, including the first and second density-gradient corrections to the kinetic energy and the first gradient correction to the exchange and correlation energy. The results for the work function are in good agreement with the polycrystalline values for low-density metals.

Combining electronic structure and many-body theory with large databases: A method for predicting the nature of 4 f states in Ce compounds

NASA Astrophysics Data System (ADS)

Herper, H. C.; Ahmed, T.; Wills, J. M.; Di Marco, I.; Björkman, T.; Iuşan, D.; Balatsky, A. V.; Eriksson, O.

2017-08-01

Recent progress in materials informatics has opened up the possibility of a new approach to accessing properties of materials in which one assays the aggregate properties of a large set of materials within the same class in addition to a detailed investigation of each compound in that class. Here we present a large scale investigation of electronic properties and correlated magnetism in Ce-based compounds accompanied by a systematic study of the electronic structure and 4 f -hybridization function of a large body of Ce compounds. We systematically study the electronic structure and 4 f -hybridization function of a large body of Ce compounds with the goal of elucidating the nature of the 4 f states and their interrelation with the measured Kondo energy in these compounds. The hybridization function has been analyzed for more than 350 data sets (being part of the IMS database) of cubic Ce compounds using electronic structure theory that relies on a full-potential approach. We demonstrate that the strength of the hybridization function, evaluated in this way, allows us to draw precise conclusions about the degree of localization of the 4 f states in these compounds. The theoretical results are entirely consistent with all experimental information, relevant to the degree of 4 f localization for all investigated materials. Furthermore, a more detailed analysis of the electronic structure and the hybridization function allows us to make precise statements about Kondo correlations in these systems. The calculated hybridization functions, together with the corresponding density of states, reproduce the expected exponential behavior of the observed Kondo temperatures and prove a consistent trend in real materials. This trend allows us to predict which systems may be correctly identified as Kondo systems. A strong anticorrelation between the size of the hybridization function and the volume of the systems has been observed. The information entropy for this set of systems is about 0.42. Our approach demonstrates the predictive power of materials informatics when a large number of materials is used to establish significant trends. This predictive power can be used to design new materials with desired properties. The applicability of this approach for other correlated electron systems is discussed.
Relativistic density functional theory with picture-change corrected electron density based on infinite-order Douglas-Kroll-Hess method

NASA Astrophysics Data System (ADS)

Oyama, Takuro; Ikabata, Yasuhiro; Seino, Junji; Nakai, Hiromi

2017-07-01

This Letter proposes a density functional treatment based on the two-component relativistic scheme at the infinite-order Douglas-Kroll-Hess (IODKH) level. The exchange-correlation energy and potential are calculated using the electron density based on the picture-change corrected density operator transformed by the IODKH method. Numerical assessments indicated that the picture-change uncorrected density functional terms generate significant errors, on the order of hartree for heavy atoms. The present scheme was found to reproduce the energetics in the four-component treatment with high accuracy.
Restoring the consistency with the contact density theorem of a classical density functional theory of ions at a planar electrical double layer.

PubMed

Gillespie, Dirk

2014-11-01

Classical density functional theory (DFT) of fluids is a fast and efficient theory to compute the structure of the electrical double layer in the primitive model of ions where ions are modeled as charged, hard spheres in a background dielectric. While the hard-core repulsive component of this ion-ion interaction can be accurately computed using well-established DFTs, the electrostatic component is less accurate. Moreover, many electrostatic functionals fail to satisfy a basic theorem, the contact density theorem, that relates the bulk pressure, surface charge, and ion densities at their distances of closest approach for ions in equilibrium at a smooth, hard, planar wall. One popular electrostatic functional that fails to satisfy the contact density theorem is a perturbation approach developed by Kierlik and Rosinberg [Phys. Rev. A 44, 5025 (1991)PLRAAN1050-294710.1103/PhysRevA.44.5025] and Rosenfeld [J. Chem. Phys. 98, 8126 (1993)JCPSA60021-960610.1063/1.464569], where the full free-energy functional is Taylor-expanded around a bulk (homogeneous) reference fluid. Here, it is shown that this functional fails to satisfy the contact density theorem because it also fails to satisfy the known low-density limit. When the functional is corrected to satisfy this limit, a corrected bulk pressure is derived and it is shown that with this pressure both the contact density theorem and the Gibbs adsorption theorem are satisfied.
Density functional with full exact exchange, balanced nonlocality of correlations, and constraint satisfaction

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tao, Jianmin; Perdew, John P; Staroverov, Viktor N

2008-01-01

We construct a nonlocal density functional approximation with full exact exchange, while preserving the constraint-satisfaction approach and justified error cancellations of simpler semilocal functionals. This is achieved by interpolating between different approximations suitable for two extreme regions of the electron density. In a 'normal' region, the exact exchange-correlation hole density around an electron is semilocal because its spatial range is reduced by correlation and because it integrates over a narrow range to -1. These regions are well described by popular semilocal approximations (many of which have been constructed nonempirically), because of proper accuracy for a slowly-varying density or because ofmore » error cancellation between exchange and correlation. 'Abnormal' regions, where non locality is unveiled, include those in which exchange can dominate correlation (one-electron, nonuniform high-density, and rapidly-varying limits), and those open subsystems of fluctuating electron number over which the exact exchange-correlation hole integrates to a value greater than -1. Regions between these extremes are described by a hybrid functional mixing exact and semi local exchange energy densities locally (i.e., with a mixing fraction that is a function of position r and a functional of the density). Because our mixing fraction tends to 1 in the high-density limit, we employ full exact exchange according to the rigorous definition of the exchange component of any exchange-correlation energy functional. Use of full exact exchange permits the satisfaction of many exact constraints, but the nonlocality of exchange also requires balanced nonlocality of correlation. We find that this nonlocality can demand at least five empirical parameters (corresponding roughly to the four kinds of abnormal regions). Our local hybrid functional is perhaps the first accurate size-consistent density functional with full exact exchange. It satisfies other known exact constraints, including exactness for all one-electron densities, and provides an excellent, fit 1.0 the 223 molecular enthalpies of formation of the G3/99 set and the 42 reaction barrier heights of the BH42/03 set, improving both (but especially the latter) over most semilocal functionals and global hybrids. Exact constraints, physical insights, and paradigm examples hopefully suppress 'overfitting'.« less
Raman Optical Activity Spectra from Density Functional Perturbation Theory and Density-Functional-Theory-Based Molecular Dynamics.

PubMed

Luber, Sandra

2017-03-14

We describe the calculation of Raman optical activity (ROA) tensors from density functional perturbation theory, which has been implemented into the CP2K software package. Using the mixed Gaussian and plane waves method, ROA spectra are evaluated in the double-harmonic approximation. Moreover, an approach for the calculation of ROA spectra by means of density functional theory-based molecular dynamics is derived and used to obtain an ROA spectrum via time correlation functions, which paves the way for the calculation of ROA spectra taking into account anharmonicities and dynamic effects at ambient conditions.
Analyzing forensic evidence based on density with magnetic levitation.

PubMed

Lockett, Matthew R; Mirica, Katherine A; Mace, Charles R; Blackledge, Robert D; Whitesides, George M

2013-01-01

This paper describes a method for determining the density of contact trace objects with magnetic levitation (MagLev). MagLev measurements accurately determine the density (± 0.0002 g/cm(3) ) of a diamagnetic object and are compatible with objects that are nonuniform in shape and size. The MagLev device (composed of two permanent magnets with like poles facing) and the method described provide a means of accurately determining the density of trace objects. This method is inexpensive, rapid, and verifiable and provides numerical values--independent of the specific apparatus or analyst--that correspond to the absolute density of the sample that may be entered into a searchable database. We discuss the feasibility of MagLev as a possible means of characterizing forensic-related evidence and demonstrate the ability of MagLev to (i) determine the density of samples of glitter and gunpowder, (ii) separate glitter particles of different densities, and (iii) determine the density of a glitter sample that was removed from a complex sample matrix. © 2012 American Academy of Forensic Sciences.
Exact density functional theory for ideal polymer fluids with nearest neighbor bonding constraints.

PubMed

Woodward, Clifford E; Forsman, Jan

2008-08-07

We present a new density functional theory of ideal polymer fluids, assuming nearest-neighbor bonding constraints. The free energy functional is expressed in terms of end site densities of chain segments and thus has a simpler mathematical structure than previously used expressions using multipoint distributions. This work is based on a formalism proposed by Tripathi and Chapman [Phys. Rev. Lett. 94, 087801 (2005)]. Those authors obtain an approximate free energy functional for ideal polymers in terms of monomer site densities. Calculations on both repulsive and attractive surfaces show that their theory is reasonably accurate in some cases, but does differ significantly from the exact result for longer polymers with attractive surfaces. We suggest that segment end site densities, rather than monomer site densities, are the preferred choice of "site functions" for expressing the free energy functional of polymer fluids. We illustrate the application of our theory to derive an expression for the free energy of an ideal fluid of infinitely long polymers.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Constantin, Lucian A.; Fabiano, Eduardo; Della Sala, Fabio

We introduce a novel non-local ingredient for the construction of exchange density functionals: the reduced Hartree parameter, which is invariant under the uniform scaling of the density and represents the exact exchange enhancement factor for one- and two-electron systems. The reduced Hartree parameter is used together with the conventional meta-generalized gradient approximation (meta-GGA) semilocal ingredients (i.e., the electron density, its gradient, and the kinetic energy density) to construct a new generation exchange functional, termed u-meta-GGA. This u-meta-GGA functional is exact for the exchange of any one- and two-electron systems, is size-consistent and non-empirical, satisfies the uniform density scaling relation, andmore » recovers the modified gradient expansion derived from the semiclassical atom theory. For atoms, ions, jellium spheres, and molecules, it shows a good accuracy, being often better than meta-GGA exchange functionals. Our construction validates the use of the reduced Hartree ingredient in exchange-correlation functional development, opening the way to an additional rung in the Jacob’s ladder classification of non-empirical density functionals.« less
Microscopically based energy density functionals for nuclei using the density matrix expansion: Implementation and pre-optimization

NASA Astrophysics Data System (ADS)

Stoitsov, M.; Kortelainen, M.; Bogner, S. K.; Duguet, T.; Furnstahl, R. J.; Gebremariam, B.; Schunck, N.

2010-11-01

In a recent series of articles, Gebremariam, Bogner, and Duguet derived a microscopically based nuclear energy density functional by applying the density matrix expansion (DME) to the Hartree-Fock energy obtained from chiral effective field theory two- and three-nucleon interactions. Owing to the structure of the chiral interactions, each coupling in the DME functional is given as the sum of a coupling constant arising from zero-range contact interactions and a coupling function of the density arising from the finite-range pion exchanges. Because the contact contributions have essentially the same structure as those entering empirical Skyrme functionals, a microscopically guided Skyrme phenomenology has been suggested in which the contact terms in the DME functional are released for optimization to finite-density observables to capture short-range correlation energy contributions from beyond Hartree-Fock. The present article is the first attempt to assess the ability of the newly suggested DME functional, which has a much richer set of density dependencies than traditional Skyrme functionals, to generate sensible and stable results for nuclear applications. The results of the first proof-of-principle calculations are given, and numerous practical issues related to the implementation of the new functional in existing Skyrme codes are discussed. Using a restricted singular value decomposition optimization procedure, it is found that the new DME functional gives numerically stable results and exhibits a small but systematic reduction of our test χ2 function compared to standard Skyrme functionals, thus justifying its suitability for future global optimizations and large-scale calculations.
A reference-modified density functional theory: An application to solvation free-energy calculations for a Lennard-Jones solution.

PubMed

Sumi, Tomonari; Maruyama, Yutaka; Mitsutake, Ayori; Koga, Kenichiro

2016-06-14

In the conventional classical density functional theory (DFT) for simple fluids, an ideal gas is usually chosen as the reference system because there is a one-to-one correspondence between the external field and the density distribution function, and the exact intrinsic free-energy functional is available for the ideal gas. In this case, the second-order density functional Taylor series expansion of the excess intrinsic free-energy functional provides the hypernetted-chain (HNC) approximation. Recently, it has been shown that the HNC approximation significantly overestimates the solvation free energy (SFE) for an infinitely dilute Lennard-Jones (LJ) solution, especially when the solute particles are several times larger than the solvent particles [T. Miyata and J. Thapa, Chem. Phys. Lett. 604, 122 (2014)]. In the present study, we propose a reference-modified density functional theory as a systematic approach to improve the SFE functional as well as the pair distribution functions. The second-order density functional Taylor series expansion for the excess part of the intrinsic free-energy functional in which a hard-sphere fluid is introduced as the reference system instead of an ideal gas is applied to the LJ pure and infinitely dilute solution systems and is proved to remarkably improve the drawbacks of the HNC approximation. Furthermore, the third-order density functional expansion approximation in which a factorization approximation is applied to the triplet direct correlation function is examined for the LJ systems. We also show that the third-order contribution can yield further refinements for both the pair distribution function and the excess chemical potential for the pure LJ liquids.
Content Independence in Multimedia Databases.

ERIC Educational Resources Information Center

de Vries, Arjen P.

2001-01-01

Investigates the role of data management in multimedia digital libraries, and its implications for the design of database management systems. Introduces the notions of content abstraction and content independence. Proposes a blueprint of a new class of database technology, which supports the basic functionality for the management of both content…
Determining Core Plasmaspheric Electron Densities with the Van Allen Probes

NASA Astrophysics Data System (ADS)

De Pascuale, S.; Hartley, D.; Kurth, W. S.; Kletzing, C.; Thaller, S. A.; Wygant, J. R.

2016-12-01

We survey three methods for obtaining electron densities inside of the core plasmasphere region (L < 4) to the perigee of the Van Allen Probes (L 1.1) from September 2012 to December 2014. Using the EMFISIS instrument on board the Van Allen Probes, electron densities are extracted from the upper hybrid resonance to an uncertainty of 10%. Some measurements are subject to larger errors given interpretational issues, especially at low densities (L > 4) resulting from geomagnetic activity. At high densities EMFISIS is restricted by an upper observable limit near 3000 cm-3. As this limit is encountered above perigee, we employ two additional methods validated against EMFISIS measurements to determine electron densities deep within the plasmasphere (L < 2). EMFISIS can extrapolate density estimates to lower L by calculating high densities, in good agreement with the upper hybrid technique when applicable, from plasma wave properties. Calibrated measurements, from the Van Allen Probes EFW potential instrument, also extend into this range. In comparison with the published EMFISIS database we provide a metric for the validity of core plasmaspheric density measurements obtained from these methods and an empirical density model for use in wave and particle simulations.
MODELING LEACHING OF VIRUSES BY THE MONTE CARLO METHOD

EPA Science Inventory

A predictive screening model was developed for fate and transport
of viruses in the unsaturated zone. A database of input parameters
allowed Monte Carlo analysis with the model. The resulting kernel
densities of predicted attenuation during percolation indicated very ...
Energies and 2'-Hydroxyl Group Orientations of RNA Backbone Conformations. Benchmark CCSD(T)/CBS Database, Electronic Analysis, and Assessment of DFT Methods and MD Simulations.

PubMed

Mládek, Arnošt; Banáš, Pavel; Jurečka, Petr; Otyepka, Michal; Zgarbová, Marie; Šponer, Jiří

2014-01-14

Sugar-phosphate backbone is an electronically complex molecular segment imparting RNA molecules high flexibility and architectonic heterogeneity necessary for their biological functions. The structural variability of RNA molecules is amplified by the presence of the 2'-hydroxyl group, capable of forming multitude of intra- and intermolecular interactions. Bioinformatics studies based on X-ray structure database revealed that RNA backbone samples at least 46 substates known as rotameric families. The present study provides a comprehensive analysis of RNA backbone conformational preferences and 2'-hydroxyl group orientations. First, we create a benchmark database of estimated CCSD(T)/CBS relative energies of all rotameric families and test performance of dispersion-corrected DFT-D3 methods and molecular mechanics in vacuum and in continuum solvent. The performance of the DFT-D3 methods is in general quite satisfactory. The B-LYP-D3 method provides the best trade-off between accuracy and computational demands. B3-LYP-D3 slightly outperforms the new PW6B95-D3 and MPW1B95-D3 and is the second most accurate density functional of the study. The best agreement with CCSD(T)/CBS is provided by DSD-B-LYP-D3 double-hybrid functional, although its large-scale applications may be limited by high computational costs. Molecular mechanics does not reproduce the fine energy differences between the RNA backbone substates. We also demonstrate that the differences in the magnitude of the hyperconjugation effect do not correlate with the energy ranking of the backbone conformations. Further, we investigated the 2'-hydroxyl group orientation preferences. For all families, we conducted a QM and MM hydroxyl group rigid scan in gas phase and solvent. We then carried out set of explicit solvent MD simulations of folded RNAs and analyze 2'-hydroxyl group orientations of different backbone families in MD. The solvent energy profiles determined primarily by the sugar pucker match well with the distribution data derived from the simulations. The QM and MM energy profiles predict the same 2'-hydroxyl group orientation preferences. Finally, we demonstrate that the high energy of unfavorable and rarely sampled 2'-hydroxyl group orientations can be attributed to clashes between occupied orbitals.
Metadata for selecting or submitting generic seismic vulnerability functions via GEM's vulnerability database

USGS Publications Warehouse

Jaiswal, Kishor

2013-01-01

This memo lays out a procedure for the GEM software to offer an available vulnerability function for any acceptable set of attributes that the user specifies for a particular building category. The memo also provides general guidelines on how to submit the vulnerability or fragility functions to the GEM vulnerability repository, stipulating which attributes modelers must provide so that their vulnerability or fragility functions can be queried appropriately by the vulnerability database. An important objective is to provide users guidance on limitations and applicability by providing the associated modeling assumptions and applicability of each vulnerability or fragility function.
Basis convergence of range-separated density-functional theory.

PubMed

Franck, Odile; Mussard, Bastien; Luppi, Eleonora; Toulouse, Julien

2015-02-21

Range-separated density-functional theory (DFT) is an alternative approach to Kohn-Sham density-functional theory. The strategy of range-separated density-functional theory consists in separating the Coulomb electron-electron interaction into long-range and short-range components and treating the long-range part by an explicit many-body wave-function method and the short-range part by a density-functional approximation. Among the advantages of using many-body methods for the long-range part of the electron-electron interaction is that they are much less sensitive to the one-electron atomic basis compared to the case of the standard Coulomb interaction. Here, we provide a detailed study of the basis convergence of range-separated density-functional theory. We study the convergence of the partial-wave expansion of the long-range wave function near the electron-electron coalescence. We show that the rate of convergence is exponential with respect to the maximal angular momentum L for the long-range wave function, whereas it is polynomial for the case of the Coulomb interaction. We also study the convergence of the long-range second-order Møller-Plesset correlation energy of four systems (He, Ne, N2, and H2O) with cardinal number X of the Dunning basis sets cc - p(C)V XZ and find that the error in the correlation energy is best fitted by an exponential in X. This leads us to propose a three-point complete-basis-set extrapolation scheme for range-separated density-functional theory based on an exponential formula.
Use of selection indices to model the functional response of predators

USGS Publications Warehouse

Joly, D.O.; Patterson, B.R.

2003-01-01

The functional response of a predator to changing prey density is an important determinant of stability of predatora??prey systems. We show how Manly's selection indices can be used to distinguish between hyperbolic and sigmoidal models of a predator functional response to primary prey density in the presence of alternative prey. Specifically, an inverse relationship between prey density and preference for that prey results in a hyperbolic functional response while a positive relationship can yield either a hyperbolic or sigmoidal functional response, depending on the form and relative magnitudes of the density-dependent preference model, attack rate, and handling time. As an example, we examine wolf (Canis lupus) functional response to moose (Alces alces) density in the presence of caribou (Rangifer tarandus). The use of selection indices to evaluate the form of the functional response has significant advantages over previous attempts to fit Holling's functional response curves to killing-rate data directly, including increased sensitivity, use of relatively easily collected data, and consideration of other explanatory factors (e.g., weather, seasons, productivity).
Locality of correlation in density functional theory

DOE Office of Scientific and Technical Information (OSTI.GOV)

Burke, Kieron; Cancio, Antonio; Gould, Tim

The Hohenberg-Kohn density functional was long ago shown to reduce to the Thomas-Fermi (TF) approximation in the non-relativistic semiclassical (or large-Z) limit for all matter, i.e., the kinetic energy becomes local. Exchange also becomes local in this limit. Numerical data on the correlation energy of atoms support the conjecture that this is also true for correlation, but much less relevant to atoms. We illustrate how expansions around a large particle number are equivalent to local density approximations and their strong relevance to density functional approximations. Analyzing highly accurate atomic correlation energies, we show that E{sub C} → −A{sub C} ZlnZ +more » B{sub C}Z as Z → ∞, where Z is the atomic number, A{sub C} is known, and we estimate B{sub C} to be about 37 mhartree. The local density approximation yields A{sub C} exactly, but a very incorrect value for B{sub C}, showing that the local approximation is less relevant for the correlation alone. This limit is a benchmark for the non-empirical construction of density functional approximations. We conjecture that, beyond atoms, the leading correction to the local density approximation in the large-Z limit generally takes this form, but with B{sub C} a functional of the TF density for the system. The implications for the construction of approximate density functionals are discussed.« less
Minimotif Miner 3.0: database expansion and significantly improved reduction of false-positive predictions from consensus sequences.

PubMed

Mi, Tian; Merlin, Jerlin Camilus; Deverasetty, Sandeep; Gryk, Michael R; Bill, Travis J; Brooks, Andrew W; Lee, Logan Y; Rathnayake, Viraj; Ross, Christian A; Sargeant, David P; Strong, Christy L; Watts, Paula; Rajasekaran, Sanguthevar; Schiller, Martin R

2012-01-01

Minimotif Miner (MnM available at http://minimotifminer.org or http://mnm.engr.uconn.edu) is an online database for identifying new minimotifs in protein queries. Minimotifs are short contiguous peptide sequences that have a known function in at least one protein. Here we report the third release of the MnM database which has now grown 60-fold to approximately 300,000 minimotifs. Since short minimotifs are by their nature not very complex we also summarize a new set of false-positive filters and linear regression scoring that vastly enhance minimotif prediction accuracy on a test data set. This online database can be used to predict new functions in proteins and causes of disease.
Integration of gel-based and gel-free proteomic data for functional analysis of proteins through Soybean Proteome Database.

PubMed

Komatsu, Setsuko; Wang, Xin; Yin, Xiaojian; Nanjo, Yohei; Ohyanagi, Hajime; Sakata, Katsumi

2017-06-23

The Soybean Proteome Database (SPD) stores data on soybean proteins obtained with gel-based and gel-free proteomic techniques. The database was constructed to provide information on proteins for functional analyses. The majority of the data is focused on soybean (Glycine max 'Enrei'). The growth and yield of soybean are strongly affected by environmental stresses such as flooding. The database was originally constructed using data on soybean proteins separated by two-dimensional polyacrylamide gel electrophoresis, which is a gel-based proteomic technique. Since 2015, the database has been expanded to incorporate data obtained by label-free mass spectrometry-based quantitative proteomics, which is a gel-free proteomic technique. Here, the portions of the database consisting of gel-free proteomic data are described. The gel-free proteomic database contains 39,212 proteins identified in 63 sample sets, such as temporal and organ-specific samples of soybean plants grown under flooding stress or non-stressed conditions. In addition, data on organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored. Furthermore, the database integrates multiple omics data such as genomics, transcriptomics, metabolomics, and proteomics. The SPD database is accessible at http://proteome.dc.affrc.go.jp/Soybean/. The Soybean Proteome Database stores data obtained from both gel-based and gel-free proteomic techniques. The gel-free proteomic database comprises 39,212 proteins identified in 63 sample sets, such as different organs of soybean plants grown under flooding stress or non-stressed conditions in a time-dependent manner. In addition, organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored in the gel-free proteomics database. A total of 44,704 proteins, including 5490 proteins identified using a gel-based proteomic technique, are stored in the SPD. It accounts for approximately 80% of all predicted proteins from genome sequences, though there are over lapped proteins. Based on the demonstrated application of data stored in the database for functional analyses, it is suggested that these data will be useful for analyses of biological mechanisms in soybean. Furthermore, coupled with recent advances in information and communication technology, the usefulness of this database would increase in the analyses of biological mechanisms. Copyright © 2017 Elsevier B.V. All rights reserved.

Multiconfiguration Pair-Density Functional Theory Is Free From Delocalization Error.

PubMed

Bao, Junwei Lucas; Wang, Ying; He, Xiao; Gagliardi, Laura; Truhlar, Donald G

2017-11-16

Delocalization error has been singled out by Yang and co-workers as the dominant error in Kohn-Sham density functional theory (KS-DFT) with conventional approximate functionals. In this Letter, by computing the vertical first ionization energy for well separated He clusters, we show that multiconfiguration pair-density functional theory (MC-PDFT) is free from delocalization error. To put MC-PDFT in perspective, we also compare it with some Kohn-Sham density functionals, including both traditional and modern functionals. Whereas large delocalization errors are almost universal in KS-DFT (the only exception being the very recent corrected functionals of Yang and co-workers), delocalization error is removed by MC-PDFT, which bodes well for its future as a step forward from KS-DFT.
Morphomics of the Talus.

PubMed

Gorman, David; Handy, Ebram; Wang, Sikui; Irwin, Annette L; Wang, Stewart

2016-11-01

Previous studies of frontal crash databases reported that ankle fractures are among the most common lower extremity fractures. While not generally life threatening, these injuries can be debilitating. Laboratory research into the mechanisms of ankle fractures has linked dorsiflexion with an increased risk of tibia and fibula malleolus fractures. However, talus fractures were not produced in the laboratory tests and appear to be caused by more complex loading of the joint. In this study, an analysis of the National Automotive Sampling System - Crashworthiness Data System (NASS-CDS) for the years 2004-2013 was conducted to investigate foot-ankle injury rates in front seat occupants involved in frontal impact crashes. A logistic regression model was developed indicating occupant weight, impact delta velocity and gender to be significant predictors of talus fracture (p<0.05). Separately, a specific set of Computed Tomography (CT) scans from the International Center for Automotive Medicine (ICAM) scan database was used to characterize the talar dome. This control population consisted of 207 adults aged 18 to 84, with no foot or ankle trauma, and scans that had suitable coverage of the talus. Size of the talus was determined using medial-to-lateral width and anterior-to-posterior depth measurements. Geometry was assessed by evaluating the radius of the articulating talus and strength was assessed using a combination of cross sectional area and density. Demographics were studied to investigate correlation with talus measurements from the CT scan database. A multi-variable linear regression model of the morphomics showed gender to be statistically significant (p<0.05) for talus depth, width, cross-sectional area, radius and strength. Body Mass Index (BMI) was significant for depth and radius. Weight was significant for depth, width, density and strength. Stature was significant for depth, cross-sectional area, radius and strength. Age was significant for radius and density.
Analyses of Factors Affecting Endothelial Cell Density in an Eye Bank Corneal Donor Database.

PubMed

Kwon, Ji Won; Cho, Kyong Jin; Kim, Hong Kyu; Lee, Jimmy K; Gore, Patrick K; McCartney, Mitchell D; Chuck, Roy S

2016-09-01

To analyze the factors affecting central corneal endothelial cell density (ECD) in an eye bank corneal donor database. The Lion's Eye Institute corneal donor database consisting of 18,665 donors (34,234 corneas) aged 20 years or older was analyzed. In particular, differences in the ECD based on age, sex, race, prior ocular surgery, a history of systemic diseases, and smoking were investigated. Furthermore, risk factors for donor cell count inadequacy (defined here as ECD less than 2000/mm) were identified. ECD decreased with age. Regarding race, the average ECD of African American donors was higher than those of white or Hispanic donors. A history of diabetes mellitus (DM) and ocular surgery were associated with a lower ECD. Donor medical history of hypertension, glaucoma, depression, dementia, Parkinson disease, hyper- or hypothyroidism, or smoking did not seem to affect the ECD. The risk factors for donor cell count inadequacy, based on binary logistic regression analyses were advanced age [65-74 years yielded an odds ratio of 17.8; confidence interval (CI), 10.6-29.8; P < 0.001; and 75-99 years yielded an odds ratio of 24.6 (CI, 14.5-41.61; P < 0.001) when compared with 20-34 years], cataract surgery (odds ratio, 4.3; CI, 4.0-4.8; P < 0.001), and DM (odds ratio, 1.2; CI, 1.1-1.3; P = 0.001). Age, race, ocular surgery (cataract and refractive), and DM seem to significantly affect donor corneal ECD. Of these variables, age, a history of cataract surgery, and DM were found to be the greatest risk factors for inadequate donor cell density (less than 2000/mm).
Type 2 Diabetes Research Yield, 1951-2012: Bibliometrics Analysis and Density-Equalizing Mapping

PubMed Central

Geaney, Fiona; Scutaru, Cristian; Kelly, Clare; Glynn, Ronan W.; Perry, Ivan J.

2015-01-01

The objective of this paper is to provide a detailed evaluation of type 2 diabetes mellitus research output from 1951-2012, using large-scale data analysis, bibliometric indicators and density-equalizing mapping. Data were retrieved from the Science Citation Index Expanded database, one of the seven curated databases within Web of Science. Using Boolean operators "OR", "AND" and "NOT", a search strategy was developed to estimate the total number of published items. Only studies with an English abstract were eligible. Type 1 diabetes and gestational diabetes items were excluded. Specific software developed for the database analysed the data. Information including titles, authors’ affiliations and publication years were extracted from all files and exported to excel. Density-equalizing mapping was conducted as described by Groenberg-Kloft et al, 2008. A total of 24,783 items were published and cited 476,002 times. The greatest number of outputs were published in 2010 (n=2,139). The United States contributed 28.8% to the overall output, followed by the United Kingdom (8.2%) and Japan (7.7%). Bilateral cooperation was most common between the United States and United Kingdom (n=237). Harvard University produced 2% of all publications, followed by the University of California (1.1%). The leading journals were Diabetes, Diabetologia and Diabetes Care and they contributed 9.3%, 7.3% and 4.0% of the research yield, respectively. In conclusion, the volume of research is rising in parallel with the increasing global burden of disease due to type 2 diabetes mellitus. Bibliometrics analysis provides useful information to scientists and funding agencies involved in the development and implementation of research strategies to address global health issues. PMID:26208117
Generalization of the Kohn-Sham system that can represent arbitrary one-electron density matrices

DOE PAGES

Hubertus J. J. van Dam

2016-04-27

Density functional theory is currently the most widely applied method in electronic structure theory. The Kohn-Sham method, based on a fictitious system of noninteracting particles, is the workhorse of the theory. The particular form of the Kohn-Sham wave function admits only idempotent one-electron density matrices whereas wave functions of correlated electrons in post-Hartree-Fock methods invariably have fractional occupation numbers. Here we show that by generalizing the orbital concept and introducing a suitable dot product as well as a probability density, a noninteracting system can be chosen that can represent the one-electron density matrix of any system, even one with fractionalmore » occupation numbers. This fictitious system ensures that the exact electron density is accessible within density functional theory. It can also serve as the basis for reduced density matrix functional theory. Moreover, to aid the analysis of the results the orbitals may be assigned energies from a mean-field Hamiltonian. This produces energy levels that are akin to Hartree-Fock orbital energies such that conventional analyses based on Koopmans' theorem are available. Lastly, this system is convenient in formalisms that depend on creation and annihilation operators as they are trivially applied to single-determinant wave functions.« less
Observational database for studies of nearby universe

NASA Astrophysics Data System (ADS)

Kaisina, E. I.; Makarov, D. I.; Karachentsev, I. D.; Kaisin, S. S.

2012-01-01

We present the description of a database of galaxies of the Local Volume (LVG), located within 10 Mpc around the Milky Way. It contains more than 800 objects. Based on an analysis of functional capabilities, we used the PostgreSQL DBMS as a management system for our LVG database. Applying semantic modelling methods, we developed a physical ER-model of the database. We describe the developed architecture of the database table structure, and the implemented web-access, available at http://www.sao.ru/lv/lvgdb.
A database of lotic invertebrate traits for North America

USGS Publications Warehouse

Vieira, Nicole K.M.; Poff, N. LeRoy; Carlisle, Daren M.; Moulton, Stephen R.; Koski, Marci L.; Kondratieff, Boris C.

2006-01-01

The assessment and study of stream communities may be enhanced if functional characteristics such as life-history, habitat preference, and reproductive strategy were more widely available for specific taxa. Species traits can be used to develop these functional indicators because many traits directly link functional roles of organisms with controlling environmental factors (for example, flow, substratum, temperature). In addition, some functional traits may not be constrained by taxonomy and are thus applicable at multiple spatial scales. Unfortunately, a comprehensive summary of traits for North American invertebrate taxa does not exist. Consequently, the U.S. Geological Survey's National Water-Quality Assessment Program in cooperation with Colorado State University compiled a database of traits for North American invertebrates. A total of 14,127 records for over 2,200 species, 1,165 genera, and 249 families have been entered into the database from 967 publications, texts and reports. Quality-assurance procedures indicated error rates of less than 3 percent in the data entry process. Species trait information was most complete for insect taxa. Traits describing resource acquisition and habitat preferences were most frequently reported, whereas those describing physiological tolerances and reproductive biology were the least frequently reported in the literature. The database is not exhaustive of the literature for North American invertebrates and is biased towards aquatic insects, but it represents a first attempt to compile traits in a web-accessible database. This report describes the database and discusses important decisions necessary for identifying ecologically relevant, environmentally sensitive, non-redundant, and statistically tractable traits for use in bioassessment programs.
A database system to support image algorithm evaluation

NASA Technical Reports Server (NTRS)

Lien, Y. E.

1977-01-01

The design is given of an interactive image database system IMDB, which allows the user to create, retrieve, store, display, and manipulate images through the facility of a high-level, interactive image query (IQ) language. The query language IQ permits the user to define false color functions, pixel value transformations, overlay functions, zoom functions, and windows. The user manipulates the images through generic functions. The user can direct images to display devices for visual and qualitative analysis. Image histograms and pixel value distributions can also be computed to obtain a quantitative analysis of images.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Jung, Haeryong; Lee, Eunyong; Jeong, YiYeong

Korea Radioactive-waste Management Corporation (KRMC) established in 2009 has started a new project to collect information on long-term stability of deep geological environments on the Korean Peninsula. The information has been built up in the integrated natural barrier database system available on web (www.deepgeodisposal.kr). The database system also includes socially and economically important information, such as land use, mining area, natural conservation area, population density, and industrial complex, because some of this information is used as exclusionary criteria during the site selection process for a deep geological repository for safe and secure containment and isolation of spent nuclear fuel andmore » other long-lived radioactive waste in Korea. Although the official site selection process has not been started yet in Korea, current integrated natural barrier database system and socio-economic database is believed that the database system will be effectively utilized to narrow down the number of sites where future investigation is most promising in the site selection process for a deep geological repository and to enhance public acceptance by providing readily-available relevant scientific information on deep geological environments in Korea. (authors)« less
Measures of dependence for multivariate Lévy distributions

NASA Astrophysics Data System (ADS)

Boland, J.; Hurd, T. R.; Pivato, M.; Seco, L.

2001-02-01

Recent statistical analysis of a number of financial databases is summarized. Increasing agreement is found that logarithmic equity returns show a certain type of asymptotic behavior of the largest events, namely that the probability density functions have power law tails with an exponent α≈3.0. This behavior does not vary much over different stock exchanges or over time, despite large variations in trading environments. The present paper proposes a class of multivariate distributions which generalizes the observed qualities of univariate time series. A new consequence of the proposed class is the "spectral measure" which completely characterizes the multivariate dependences of the extreme tails of the distribution. This measure on the unit sphere in M-dimensions, in principle completely general, can be determined empirically by looking at extreme events. If it can be observed and determined, it will prove to be of importance for scenario generation in portfolio risk management.
High-throughput screening of inorganic compounds for the discovery of novel dielectric and optical materials

DOE PAGES

Petousis, Ioannis; Mrdjenovich, David; Ballouz, Eric; ...

2017-01-31

Dielectrics are an important class of materials that are ubiquitous in modern electronic applications. Even though their properties are important for the performance of devices, the number of compounds with known dielectric constant is on the order of a few hundred. Here, we use Density Functional Perturbation Theory as a way to screen for the dielectric constant and refractive index of materials in a fast and computationally efficient way. Our results constitute the largest dielectric tensors database to date, containing 1,056 compounds. Details regarding the computational methodology and technical validation are presented along with the format of our publicly availablemore » data. In addition, we integrate our dataset with the Materials Project allowing users easy access to material properties. Finally, we explain how our dataset and calculation methodology can be used in the search for novel dielectric compounds.« less
High-throughput screening of inorganic compounds for the discovery of novel dielectric and optical materials

PubMed Central

Petousis, Ioannis; Mrdjenovich, David; Ballouz, Eric; Liu, Miao; Winston, Donald; Chen, Wei; Graf, Tanja; Schladt, Thomas D.; Persson, Kristin A.; Prinz, Fritz B.

2017-01-01

Dielectrics are an important class of materials that are ubiquitous in modern electronic applications. Even though their properties are important for the performance of devices, the number of compounds with known dielectric constant is on the order of a few hundred. Here, we use Density Functional Perturbation Theory as a way to screen for the dielectric constant and refractive index of materials in a fast and computationally efficient way. Our results constitute the largest dielectric tensors database to date, containing 1,056 compounds. Details regarding the computational methodology and technical validation are presented along with the format of our publicly available data. In addition, we integrate our dataset with the Materials Project allowing users easy access to material properties. Finally, we explain how our dataset and calculation methodology can be used in the search for novel dielectric compounds. PMID:28140408
Charge optimized many-body potential for aluminum.

PubMed

Choudhary, Kamal; Liang, Tao; Chernatynskiy, Aleksandr; Lu, Zizhe; Goyal, Anuj; Phillpot, Simon R; Sinnott, Susan B

2015-01-14

An interatomic potential for Al is developed within the third generation of the charge optimized many-body (COMB3) formalism. The database used for the parameterization of the potential consists of experimental data and the results of first-principles and quantum chemical calculations. The potential exhibits reasonable agreement with cohesive energy, lattice parameters, elastic constants, bulk and shear modulus, surface energies, stacking fault energies, point defect formation energies, and the phase order of metallic Al from experiments and density functional theory. In addition, the predicted phonon dispersion is in good agreement with the experimental data and first-principles calculations. Importantly for the prediction of the mechanical behavior, the unstable stacking fault energetics along the [Formula: see text] direction on the (1 1 1) plane are similar to those obtained from first-principles calculations. The polycrsytal when strained shows responses that are physical and the overall behavior is consistent with experimental observations.
Compositional descriptor-based recommender system for the materials discovery

NASA Astrophysics Data System (ADS)

Seko, Atsuto; Hayashi, Hiroyuki; Tanaka, Isao

2018-06-01

Structures and properties of many inorganic compounds have been collected historically. However, it only covers a very small portion of possible inorganic crystals, which implies the presence of numerous currently unknown compounds. A powerful machine-learning strategy is mandatory to discover new inorganic compounds from all chemical combinations. Herein we propose a descriptor-based recommender-system approach to estimate the relevance of chemical compositions where crystals can be formed [i.e., chemically relevant compositions (CRCs)]. In addition to data-driven compositional similarity used in the literature, the use of compositional descriptors as a prior knowledge is helpful for the discovery of new compounds. We validate our recommender systems in two ways. First, one database is used to construct a model, while another is used for the validation. Second, we estimate the phase stability for compounds at expected CRCs using density functional theory calculations.
High-throughput screening of inorganic compounds for the discovery of novel dielectric and optical materials.

PubMed

Petousis, Ioannis; Mrdjenovich, David; Ballouz, Eric; Liu, Miao; Winston, Donald; Chen, Wei; Graf, Tanja; Schladt, Thomas D; Persson, Kristin A; Prinz, Fritz B

2017-01-31

Dielectrics are an important class of materials that are ubiquitous in modern electronic applications. Even though their properties are important for the performance of devices, the number of compounds with known dielectric constant is on the order of a few hundred. Here, we use Density Functional Perturbation Theory as a way to screen for the dielectric constant and refractive index of materials in a fast and computationally efficient way. Our results constitute the largest dielectric tensors database to date, containing 1,056 compounds. Details regarding the computational methodology and technical validation are presented along with the format of our publicly available data. In addition, we integrate our dataset with the Materials Project allowing users easy access to material properties. Finally, we explain how our dataset and calculation methodology can be used in the search for novel dielectric compounds.
Metastable structure of Li13Si4

NASA Astrophysics Data System (ADS)

Gruber, Thomas; Bahmann, Silvia; Kortus, Jens

2016-04-01

The Li13Si4 phase is one out of several crystalline lithium silicide phases, which is a potential electrode material for lithium ion batteries and contains a high theoretical specific capacity. By means of ab initio methods like density functional theory (DFT) many properties such as heat capacity or heat of formation can be calculated. These properties are based on the calculation of phonon frequencies, which contain information about the thermodynamical stability. The current unit cell of "Li13Si4" given in the ICSD database is unstable with respect to DFT calculations. We propose a modified unit cell that is stable in the calculations. The evolutionary algorithm EVO found a structure very similar to the ICSD one with both of them containing metastable lithium positions. Molecular dynamic simulations show a phase transition between both structures where these metastable lithium atoms move. This phase transition is achieved by a very fast one-dimensional lithium diffusion and stabilizes this phase.
Prediction of a New Phase of Cu x S near Stoichiometric Composition

DOE PAGES

Khatri, Prashant; Huda, Muhammad N.

2015-01-01

Cumore » 2 S is known to be a promising solar absorber material due to its suitable band gap and the abundance of its constituent elements. 2 S is known to have complex phase structures depending on the concentration of vacancies. Its instability of phases is due to favorable formation of vacancies and the mobility of atoms within the crystal. Understanding its phase structures is of crucial important for its application as solar absorber material. In this paper, we have predicted a new crystal phase of copper sulfide ( x S) around chemical composition of x = 1.98 by utilizing crystal database search and density functional theory. We have shown that this new crystal phase of x S is more favorable than low chalcocite structure even at stoichiometric composition of x = 2 . However, vacancy formation probability was found to be higher in this new phase than the low chalcocite structure.« less
A power-law coupled three-form dark energy model

NASA Astrophysics Data System (ADS)

Yao, Yan-Hong; Yan, Yang-Jie; Meng, Xin-He

2018-02-01

We consider a field theory model of coupled dark energy which treats dark energy as a three-form field and dark matter as a spinor field. By assuming the effective mass of dark matter as a power-law function of the three-form field and neglecting the potential term of dark energy, we obtain three solutions of the autonomous system of evolution equations, including a de Sitter attractor, a tracking solution and an approximate solution. To understand the strength of the coupling, we confront the model with the latest Type Ia Supernova, Baryon Acoustic Oscillations and Cosmic Microwave Background radiation observations, with the conclusion that the combination of these three databases marginalized over the present dark matter density parameter Ω _{m0} and the present three-form field κ X0 gives stringent constraints on the coupling constant, - 0.017< λ <0.047 (2σ confidence level), by which we present the model's applicable parameter range.
Effects of Recombinant Human Growth Hormone for Osteoporosis: Systematic Review and Meta-Analysis.

PubMed

Atkinson, Hayden F; Moyer, Rebecca F; Yacoub, Daniel; Coughlin, Dexter; Birmingham, Trevor B

2017-03-01

Our objective was to evaluate the efficacy of recombinant human growth hormone (GH) on bone mineral density (BMD) in persons age 50 and older, with normal pituitary function, with or at risk for developing osteoporosis. We systematically reviewed randomized clinical trials (RCTs), searching six databases, and conducted meta-analyses to examine GH effects on BMD of the lumbar spine and femoral neck. Data for fracture incidence, bone metabolism biomarkers, and adverse events were also extracted and analysed. Thirteen RCTs met the eligibility criteria. Pooled effect sizes suggested no significant GH effect on BMD. Pooled effect sizes were largest, but nonsignificant, when compared to placebo. GH had a significant effect on several bone metabolism biomarkers. A significantly higher rate of adverse events was observed in the GH groups. Meta-analysis of RCTs suggests that GH treatment for persons with or at risk for developing osteoporosis results in very small, nonsignificant increases in BMD.
Subject Retrieval from Full-Text Databases in the Humanities

ERIC Educational Resources Information Center

East, John W.

2007-01-01

This paper examines the problems involved in subject retrieval from full-text databases of secondary materials in the humanities. Ten such databases were studied and their search functionality evaluated, focusing on factors such as Boolean operators, document surrogates, limiting by subject area, proximity operators, phrase searching, wildcards,…

Microscopically based energy density functionals for nuclei using the density matrix expansion. II. Full optimization and validation

NASA Astrophysics Data System (ADS)

Navarro Pérez, R.; Schunck, N.; Dyhdalo, A.; Furnstahl, R. J.; Bogner, S. K.

2018-05-01

Background: Energy density functional methods provide a generic framework to compute properties of atomic nuclei starting from models of nuclear potentials and the rules of quantum mechanics. Until now, the overwhelming majority of functionals have been constructed either from empirical nuclear potentials such as the Skyrme or Gogny forces, or from systematic gradient-like expansions in the spirit of the density functional theory for atoms. Purpose: We seek to obtain a usable form of the nuclear energy density functional that is rooted in the modern theory of nuclear forces. We thus consider a functional obtained from the density matrix expansion of local nuclear potentials from chiral effective field theory. We propose a parametrization of this functional carefully calibrated and validated on selected ground-state properties that is suitable for large-scale calculations of nuclear properties. Methods: Our energy functional comprises two main components. The first component is a non-local functional of the density and corresponds to the direct part (Hartree term) of the expectation value of local chiral potentials on a Slater determinant. Contributions to the mean field and the energy of this term are computed by expanding the spatial, finite-range components of the chiral potential onto Gaussian functions. The second component is a local functional of the density and is obtained by applying the density matrix expansion to the exchange part (Fock term) of the expectation value of the local chiral potential. We apply the UNEDF2 optimization protocol to determine the coupling constants of this energy functional. Results: We obtain a set of microscopically constrained functionals for local chiral potentials from leading order up to next-to-next-to-leading order with and without three-body forces and contributions from Δ excitations. These functionals are validated on the calculation of nuclear and neutron matter, nuclear mass tables, single-particle shell structure in closed-shell nuclei, and the fission barrier of 240Pu. Quantitatively, they perform noticeably better than the more phenomenological Skyrme functionals. Conclusions: The inclusion of higher-order terms in the chiral perturbation expansion seems to produce a systematic improvement in predicting nuclear binding energies while the impact on other observables is not really significant. This result is especially promising since all the fits have been performed at the single-reference level of the energy density functional approach, where important collective correlations such as center-of-mass correction, rotational correction, or zero-point vibrational energies have not been taken into account yet.
Influence of the volume and density functions within geometric models for estimating trunk inertial parameters.

PubMed

Wicke, Jason; Dumas, Genevieve A

2010-02-01

The geometric method combines a volume and a density function to estimate body segment parameters and has the best opportunity for developing the most accurate models. In the trunk, there are many different tissues that greatly differ in density (e.g., bone versus lung). Thus, the density function for the trunk must be particularly sensitive to capture this diversity, such that accurate inertial estimates are possible. Three different models were used to test this hypothesis by estimating trunk inertial parameters of 25 female and 24 male college-aged participants. The outcome of this study indicates that the inertial estimates for the upper and lower trunk are most sensitive to the volume function and not very sensitive to the density function. Although it appears that the uniform density function has a greater influence on inertial estimates in the lower trunk region than in the upper trunk region, this is likely due to the (overestimated) density value used. When geometric models are used to estimate body segment parameters, care must be taken in choosing a model that can accurately estimate segment volumes. Researchers wanting to develop accurate geometric models should focus on the volume function, especially in unique populations (e.g., pregnant or obese individuals).
Density, structure, and dynamics of water: The effect of van der Waals interactions

NASA Astrophysics Data System (ADS)

Wang, Jue; Román-Pérez, G.; Soler, Jose M.; Artacho, Emilio; Fernández-Serra, M.-V.

2011-01-01

It is known that ab initio molecular dynamics (AIMD) simulations of liquid water at ambient conditions, based on the generalized gradient approximation (GGA) to density functional theory (DFT), with commonly used functionals fail to produce structural and diffusive properties in reasonable agreement with experiment. This is true for canonical, constant temperature simulations where the density of the liquid is fixed to the experimental density. The equilibrium density, at ambient conditions, of DFT water has recently been shown by Schmidt et al. [J. Phys. Chem. B, 113, 11959 (2009)] to be underestimated by different GGA functionals for exchange and correlation, and corrected by the addition of interatomic pair potentials to describe van der Waals (vdW) interactions. In this contribution we present a DFT-AIMD study of liquid water using several GGA functionals as well as the van der Waals density functional (vdW-DF) of Dion et al. [Phys. Rev. Lett. 92, 246401 (2004)]. As expected, we find that the density of water is grossly underestimated by GGA functionals. When a vdW-DF is used, the density improves drastically and the experimental diffusivity is reproduced without the need of thermal corrections. We analyze the origin of the density differences between all the functionals. We show that the vdW-DF increases the population of non-H-bonded interstitial sites, at distances between the first and second coordination shells. However, it excessively weakens the H-bond network, collapsing the second coordination shell. This structural problem is partially associated to the choice of GGA exchange in the vdW-DF. We show that a different choice for the exchange functional is enough to achieve an overall improvement both in structure and diffusivity.
Salary Management System for Small and Medium-sized Enterprises

NASA Astrophysics Data System (ADS)

Hao, Zhang; Guangli, Xu; Yuhuan, Zhang; Yilong, Lei

Small and Medium-sized Enterprises (SMEs) in the process of wage entry, calculation, the total number are needed to be done manually in the past, the data volume is quite large, processing speed is low, and it is easy to make error, which is resulting in low efficiency. The main purpose of writing this paper is to present the basis of salary management system, establish a scientific database, the computer payroll system, using the computer instead of a lot of past manual work in order to reduce duplication of staff labor, it will improve working efficiency.This system combines the actual needs of SMEs, through in-depth study and practice of the C/S mode, PowerBuilder10.0 development tools, databases and SQL language, Completed a payroll system needs analysis, database design, application design and development work. Wages, departments, units and personnel database file are included in this system, and have data management, department management, personnel management and other functions, through the control and management of the database query, add, delete, modify, and other functions can be realized. This system is reasonable design, a more complete function, stable operation has been tested to meet the basic needs of the work.
Differentiating signals to make biological sense - A guide through databases for MS-based non-targeted metabolomics.

PubMed

Gil de la Fuente, Alberto; Grace Armitage, Emily; Otero, Abraham; Barbas, Coral; Godzien, Joanna

2017-09-01

Metabolite identification is one of the most challenging steps in metabolomics studies and reflects one of the greatest bottlenecks in the entire workflow. The success of this step determines the success of the entire research, therefore the quality at which annotations are given requires special attention. A variety of tools and resources are available to aid metabolite identification or annotation, offering different and often complementary functionalities. In preparation for this article, almost 50 databases were reviewed, from which 17 were selected for discussion, chosen for their online ESI-MS functionality. The general characteristics and functions of each database is discussed in turn, considering the advantages and limitations of each along with recommendations for optimal use of each tool, as derived from experiences encountered at the Centre for Metabolomics and Bioanalysis (CEMBIO) in Madrid. These databases were evaluated considering their utility in non-targeted metabolomics, including aspects such as identifier assignment, structural assignment and interpretation of results. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
The COG database: a tool for genome-scale analysis of protein functions and evolution

PubMed Central

Tatusov, Roman L.; Galperin, Michael Y.; Natale, Darren A.; Koonin, Eugene V.

2000-01-01

Rational classification of proteins encoded in sequenced genomes is critical for making the genome sequences maximally useful for functional and evolutionary studies. The database of Clusters of Orthologous Groups of proteins (COGs) is an attempt on a phylogenetic classification of the proteins encoded in 21 complete genomes of bacteria, archaea and eukaryotes (http://www.ncbi.nlm.nih.gov/COG ). The COGs were constructed by applying the criterion of consistency of genome-specific best hits to the results of an exhaustive comparison of all protein sequences from these genomes. The database comprises 2091 COGs that include 56–83% of the gene products from each of the complete bacterial and archaeal genomes and ~35% of those from the yeast Saccharomyces cerevisiae genome. The COG database is accompanied by the COGNITOR program that is used to fit new proteins into the COGs and can be applied to functional and phylogenetic annotation of newly sequenced genomes. PMID:10592175
[The therapeutic drug monitoring network server of tacrolimus for Chinese renal transplant patients].

PubMed

Deng, Chen-Hui; Zhang, Guan-Min; Bi, Shan-Shan; Zhou, Tian-Yan; Lu, Wei

2011-07-01

This study is to develop a therapeutic drug monitoring (TDM) network server of tacrolimus for Chinese renal transplant patients, which can facilitate doctor to manage patients' information and provide three levels of predictions. Database management system MySQL was employed to build and manage the database of patients and doctors' information, and hypertext mark-up language (HTML) and Java server pages (JSP) technology were employed to construct network server for database management. Based on the population pharmacokinetic model of tacrolimus for Chinese renal transplant patients, above program languages were used to construct the population prediction and subpopulation prediction modules. Based on Bayesian principle and maximization of the posterior probability function, an objective function was established, and minimized by an optimization algorithm to estimate patient's individual pharmacokinetic parameters. It is proved that the network server has the basic functions for database management and three levels of prediction to aid doctor to optimize the regimen of tacrolimus for Chinese renal transplant patients.
An X-Ray Analysis Database of Photoionization Cross Sections Including Variable Ionization

NASA Technical Reports Server (NTRS)

Wang, Ping; Cohen, David H.; MacFarlane, Joseph J.; Cassinelli, Joseph P.

1997-01-01

Results of research efforts in the following areas are discussed: review of the major theoretical and experimental data of subshell photoionization cross sections and ionization edges of atomic ions to assess the accuracy of the data, and to compile the most reliable of these data in our own database; detailed atomic physics calculations to complement the database for all ions of 17 cosmically abundant elements; reconciling the data from various sources and our own calculations; and fitting cross sections with functional approximations and incorporating these functions into a compact computer code.Also, efforts included adapting an ionization equilibrium code, tabulating results, and incorporating them into the overall program and testing the code (both ionization equilibrium and opacity codes) with existing observational data. The background and scientific applications of this work are discussed. Atomic physics cross section models and calculations are described. Calculation results are compared with available experimental data and other theoretical data. The functional approximations used for fitting cross sections are outlined and applications of the database are discussed.
Recent developments in LIBXC - A comprehensive library of functionals for density functional theory

NASA Astrophysics Data System (ADS)

Lehtola, Susi; Steigemann, Conrad; Oliveira, Micael J. T.; Marques, Miguel A. L.

2018-01-01

LIBXC is a library of exchange-correlation functionals for density-functional theory. We are concerned with semi-local functionals (or the semi-local part of hybrid functionals), namely local-density approximations, generalized-gradient approximations, and meta-generalized-gradient approximations. Currently we include around 400 functionals for the exchange, correlation, and the kinetic energy, spanning more than 50 years of research. Moreover, LIBXC is by now used by more than 20 codes, not only from the atomic, molecular, and solid-state physics, but also from the quantum chemistry communities.
A meta-GGA level screened range-separated hybrid functional by employing short range Hartree-Fock with a long range semilocal functional.

PubMed

Jana, Subrata; Samal, Prasanjit

2018-03-28

The range-separated hybrid density functionals are very successful in describing a wide range of molecular and solid-state properties accurately. In principle, such functionals are designed from spherically averaged or system averaged as well as reverse engineered exchange holes. In the present attempt, the screened range-separated hybrid functional scheme has been applied to the meta-GGA rung by using the density matrix expansion based semilocal exchange hole (or functional). The hybrid functional proposed here utilizes the spherically averaged density matrix expansion based exchange hole in the range separation scheme. For slowly varying density correction the range separation scheme is employed only through the local density approximation based exchange hole coupled with the corresponding fourth order gradient approximate Tao-Mo enhancement factor. The comprehensive testing and performance of the newly constructed functional indicates its applicability in describing several molecular properties. The most appealing feature of this present screened hybrid functional is that it will be practically very useful in describing solid-state properties at the meta-GGA level.
Effect of the sequence data deluge on the performance of methods for detecting protein functional residues.

PubMed

Garrido-Martín, Diego; Pazos, Florencio

2018-02-27

The exponential accumulation of new sequences in public databases is expected to improve the performance of all the approaches for predicting protein structural and functional features. Nevertheless, this was never assessed or quantified for some widely used methodologies, such as those aimed at detecting functional sites and functional subfamilies in protein multiple sequence alignments. Using raw protein sequences as only input, these approaches can detect fully conserved positions, as well as those with a family-dependent conservation pattern. Both types of residues are routinely used as predictors of functional sites and, consequently, understanding how the sequence content of the databases affects them is relevant and timely. In this work we evaluate how the growth and change with time in the content of sequence databases affect five sequence-based approaches for detecting functional sites and subfamilies. We do that by recreating historical versions of the multiple sequence alignments that would have been obtained in the past based on the database contents at different time points, covering a period of 20 years. Applying the methods to these historical alignments allows quantifying the temporal variation in their performance. Our results show that the number of families to which these methods can be applied sharply increases with time, while their ability to detect potentially functional residues remains almost constant. These results are informative for the methods' developers and final users, and may have implications in the design of new sequencing initiatives.
GeneSCF: a real-time based functional enrichment tool with support for multiple organisms.

PubMed

Subhash, Santhilal; Kanduri, Chandrasekhar

2016-09-13

High-throughput technologies such as ChIP-sequencing, RNA-sequencing, DNA sequencing and quantitative metabolomics generate a huge volume of data. Researchers often rely on functional enrichment tools to interpret the biological significance of the affected genes from these high-throughput studies. However, currently available functional enrichment tools need to be updated frequently to adapt to new entries from the functional database repositories. Hence there is a need for a simplified tool that can perform functional enrichment analysis by using updated information directly from the source databases such as KEGG, Reactome or Gene Ontology etc. In this study, we focused on designing a command-line tool called GeneSCF (Gene Set Clustering based on Functional annotations), that can predict the functionally relevant biological information for a set of genes in a real-time updated manner. It is designed to handle information from more than 4000 organisms from freely available prominent functional databases like KEGG, Reactome and Gene Ontology. We successfully employed our tool on two of published datasets to predict the biologically relevant functional information. The core features of this tool were tested on Linux machines without the need for installation of more dependencies. GeneSCF is more reliable compared to other enrichment tools because of its ability to use reference functional databases in real-time to perform enrichment analysis. It is an easy-to-integrate tool with other pipelines available for downstream analysis of high-throughput data. More importantly, GeneSCF can run multiple gene lists simultaneously on different organisms thereby saving time for the users. Since the tool is designed to be ready-to-use, there is no need for any complex compilation and installation procedures.
Towards novel organic high-Tc superconductors: Data mining using density of states similarity search

NASA Astrophysics Data System (ADS)

Geilhufe, R. Matthias; Borysov, Stanislav S.; Kalpakchi, Dmytro; Balatsky, Alexander V.

2018-02-01

Identifying novel functional materials with desired key properties is an important part of bridging the gap between fundamental research and technological advancement. In this context, high-throughput calculations combined with data-mining techniques highly accelerated this process in different areas of research during the past years. The strength of a data-driven approach for materials prediction lies in narrowing down the search space of thousands of materials to a subset of prospective candidates. Recently, the open-access organic materials database OMDB was released providing electronic structure data for thousands of previously synthesized three-dimensional organic crystals. Based on the OMDB, we report about the implementation of a novel density of states similarity search tool which is capable of retrieving materials with similar density of states to a reference material. The tool is based on the approximate nearest neighbor algorithm as implemented in the ANNOY library and can be applied via the OMDB web interface. The approach presented here is wide ranging and can be applied to various problems where the density of states is responsible for certain key properties of a material. As the first application, we report about materials exhibiting electronic structure similarities to the aromatic hydrocarbon p-terphenyl which was recently discussed as a potential organic high-temperature superconductor exhibiting a transition temperature in the order of 120 K under strong potassium doping. Although the mechanism driving the remarkable transition temperature remains under debate, we argue that the density of states, reflecting the electronic structure of a material, might serve as a crucial ingredient for the observed high Tc. To provide candidates which might exhibit comparable properties, we present 15 purely organic materials with similar features to p-terphenyl within the electronic structure, which also tend to have structural similarities with p-terphenyl such as space group symmetries, chemical composition, and molecular structure. The experimental verification of these candidates might lead to a better understanding of the underlying mechanism in case similar superconducting properties are revealed.
A LOFAR census of non-recycled pulsars: average profiles, dispersion measures, flux densities, and spectra

NASA Astrophysics Data System (ADS)

Bilous, A. V.; Kondratiev, V. I.; Kramer, M.; Keane, E. F.; Hessels, J. W. T.; Stappers, B. W.; Malofeev, V. M.; Sobey, C.; Breton, R. P.; Cooper, S.; Falcke, H.; Karastergiou, A.; Michilli, D.; Osłowski, S.; Sanidas, S.; ter Veen, S.; van Leeuwen, J.; Verbiest, J. P. W.; Weltevrede, P.; Zarka, P.; Grießmeier, J.-M.; Serylak, M.; Bell, M. E.; Broderick, J. W.; Eislöffel, J.; Markoff, S.; Rowlinson, A.

2016-06-01

We present first results from a LOFAR census of non-recycled pulsars. The census includes almost all such pulsars known (194 sources) at declinations Dec > 8° and Galactic latitudes |Gb| > 3°, regardless of their expected flux densities and scattering times. Each pulsar was observed for ≥20 min in the contiguous frequency range of 110-188 MHz. Full-Stokes data were recorded. We present the dispersion measures, flux densities, and calibrated total intensity profiles for the 158 pulsars detected in the sample. The median uncertainty in census dispersion measures (1.5 × 10-3 pc cm-3) is ten times smaller, on average, than in the ATNF pulsar catalogue. We combined census flux densities with those in the literature and fitted the resulting broadband spectra with single or broken power-law functions. For 48 census pulsars such fits are being published for the first time. Typically, thechoice between single and broken power-laws, as well as the location of the spectral break, were highly influenced by the spectral coverage of the available flux density measurements. In particular, the inclusion of measurements below 100 MHz appears essential for investigating the low-frequency turnover in the spectra for most of the census pulsars. For several pulsars, we compared the spectral indices from different works and found the typical spread of values to be within 0.5-1.5, suggesting a prevailing underestimation of spectral index errors in the literature. The census observations yielded some unexpected individual source results, as we describe in the paper. Lastly, we will provide this unique sample of wide-band, low-frequency pulse profiles via the European Pulsar Network Database. Tables B.1-B.4 are only available at the CDS via anonymous ftp to http://cdsarc.u-strasbg.fr (ftp://130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/591/A134
The large-scale correlations of multicell densities and profiles: implications for cosmic variance estimates

NASA Astrophysics Data System (ADS)

Codis, Sandrine; Bernardeau, Francis; Pichon, Christophe

2016-08-01

In order to quantify the error budget in the measured probability distribution functions of cell densities, the two-point statistics of cosmic densities in concentric spheres is investigated. Bias functions are introduced as the ratio of their two-point correlation function to the two-point correlation of the underlying dark matter distribution. They describe how cell densities are spatially correlated. They are computed here via the so-called large deviation principle in the quasi-linear regime. Their large-separation limit is presented and successfully compared to simulations for density and density slopes: this regime is shown to be rapidly reached allowing to get sub-percent precision for a wide range of densities and variances. The corresponding asymptotic limit provides an estimate of the cosmic variance of standard concentric cell statistics applied to finite surveys. More generally, no assumption on the separation is required for some specific moments of the two-point statistics, for instance when predicting the generating function of cumulants containing any powers of concentric densities in one location and one power of density at some arbitrary distance from the rest. This exact `one external leg' cumulant generating function is used in particular to probe the rate of convergence of the large-separation approximation.
A data-based conservation planning tool for Florida panthers

USGS Publications Warehouse

Murrow, Jennifer L.; Thatcher, Cindy A.; Van Manen, Frank T.; Clark, Joseph D.

2013-01-01

Habitat loss and fragmentation are the greatest threats to the endangered Florida panther (Puma concolor coryi). We developed a data-based habitat model and user-friendly interface so that land managers can objectively evaluate Florida panther habitat. We used a geographic information system (GIS) and the Mahalanobis distance statistic (D2) to develop a model based on broad-scale landscape characteristics associated with panther home ranges. Variables in our model were Euclidean distance to natural land cover, road density, distance to major roads, human density, amount of natural land cover, amount of semi-natural land cover, amount of permanent or semi-permanent flooded area–open water, and a cost–distance variable. We then developed a Florida Panther Habitat Estimator tool, which automates and replicates the GIS processes used to apply the statistical habitat model. The estimator can be used by persons with moderate GIS skills to quantify effects of land-use changes on panther habitat at local and landscape scales. Example applications of the tool are presented.
A global characterization and identification of multifunctional enzymes.

PubMed

Cheng, Xian-Ying; Huang, Wei-Juan; Hu, Shi-Chang; Zhang, Hai-Lei; Wang, Hao; Zhang, Jing-Xian; Lin, Hong-Huang; Chen, Yu-Zong; Zou, Quan; Ji, Zhi-Liang

2012-01-01

Multi-functional enzymes are enzymes that perform multiple physiological functions. Characterization and identification of multi-functional enzymes are critical for communication and cooperation between different functions and pathways within a complex cellular system or between cells. In present study, we collected literature-reported 6,799 multi-functional enzymes and systematically characterized them in structural, functional, and evolutionary aspects. It was found that four physiochemical properties, that is, charge, polarizability, hydrophobicity, and solvent accessibility, are important for characterization of multi-functional enzymes. Accordingly, a combinational model of support vector machine and random forest model was constructed, based on which 6,956 potential novel multi-functional enzymes were successfully identified from the ENZYME database. Moreover, it was observed that multi-functional enzymes are non-evenly distributed in species, and that Bacteria have relatively more multi-functional enzymes than Archaebacteria and Eukaryota. Comparative analysis indicated that the multi-functional enzymes experienced a fluctuation of gene gain and loss during the evolution from S. cerevisiae to H. sapiens. Further pathway analyses indicated that a majority of multi-functional enzymes were well preserved in catalyzing several essential cellular processes, for example, metabolisms of carbohydrates, nucleotides, and amino acids. What's more, a database of known multi-functional enzymes and a server for novel multi-functional enzyme prediction were also constructed for free access at http://bioinf.xmu.edu.cn/databases/MFEs/index.htm.
FlyRNAi.org—the database of the Drosophila RNAi screening center and transgenic RNAi project: 2017 update

PubMed Central

Hu, Yanhui; Comjean, Aram; Roesel, Charles; Vinayagam, Arunachalam; Flockhart, Ian; Zirin, Jonathan; Perkins, Lizabeth; Perrimon, Norbert; Mohr, Stephanie E.

2017-01-01

The FlyRNAi database of the Drosophila RNAi Screening Center (DRSC) and Transgenic RNAi Project (TRiP) at Harvard Medical School and associated DRSC/TRiP Functional Genomics Resources website (http://fgr.hms.harvard.edu) serve as a reagent production tracking system, screen data repository, and portal to the community. Through this portal, we make available protocols, online tools, and other resources useful to researchers at all stages of high-throughput functional genomics screening, from assay design and reagent identification to data analysis and interpretation. In this update, we describe recent changes and additions to our website, database and suite of online tools. Recent changes reflect a shift in our focus from a single technology (RNAi) and model species (Drosophila) to the application of additional technologies (e.g. CRISPR) and support of integrated, cross-species approaches to uncovering gene function using functional genomics and other approaches. PMID:27924039
A Density Functional for Liquid 3He Based on the Aziz Potential

NASA Astrophysics Data System (ADS)

Barranco, M.; Hernández, E. S.; Mayol, R.; Navarro, J.; Pi, M.; Szybisz, L.

2006-09-01

We propose a new class of density functionals for liquid 3He based on the Aziz helium-helium interaction screened at short distances by the microscopically calculated two-body distribution function g(r). Our aim is to reduce to a minumum the unavoidable phenomenological ingredients inherent to any density functional approach. Results for the homogeneous liquid and droplets are presented and discussed.
LSMS

DOE Office of Scientific and Technical Information (OSTI.GOV)

Eisenbach, Markus; Li, Ying Wai; Liu, Xianglin

2017-12-01

LSMS is a first principles, Density Functional theory based, electronic structure code targeted mainly at materials applications. LSMS calculates the local spin density approximation to the diagonal part of the electron Green's function. The electron/spin density and energy are easily determined once the Green's function is known. Linear scaling with system size is achieved in the LSMS by using several unique properties of the real space multiple scattering approach to the Green's function.

A vitamin D pathway gene-gene interaction affects low-density lipoprotein cholesterol levels.

PubMed

Grave, Nathália; Tovo-Rodrigues, Luciana; da Silveira, Janaína; Rovaris, Diego Luiz; Dal Bosco, Simone Morelo; Contini, Verônica; Genro, Júlia Pasqualini

2016-12-01

Much evidence suggests an association between vitamin D deficiency and chronic diseases such as obesity and dyslipidemia. Although genetic factors play an important role in the etiology of these diseases, only a few studies have investigated the relationship between vitamin D-related genes and anthropometric and lipid profiles. The aim of this study was to investigate the association of three vitamin D-related genes with anthropometric and lipid parameters in 542 adult individuals. We analyzed the rs2228570 polymorphism in the vitamin D receptor gene (VDR), rs2134095 in the retinoid X receptor gamma gene (RXRG) and rs7041 in the vitamin D-binding protein gene (GC). Polymorphisms were genotyped by TaqMan allelic discrimination. Gene-gene interactions were evaluated by the general linear model. The functionality of the polymorphisms was investigated using the following predictors and databases: SIFT (Sorting Intolerant from Tolerant), PolyPhen-2 (Polymorphism Phenotyping v2) and Human Splicing Finder 3. We identified a significant effect of the interaction between RXRG (rs2134095) and GC (rs7041) on low-density lipoprotein cholesterol (LDL-c) levels (P=.005). Furthermore, our in silico analysis suggested a functional role for both variants in the regulation of the gene products. Our results suggest that the vitamin D-related genes RXRG and GC affect LDL-c levels. These findings are in agreement with other studies that consistently associate vitamin D and lipid profile. Together, our results corroborate the idea that analyzing gene-gene interaction would be helpful to clarify the genetic component of lipid profile. Copyright © 2016 Elsevier Inc. All rights reserved.
Zn Coordination Chemistry: Development of Benchmark Suites for Geometries, Dipole Moments, and Bond Dissociation Energies and Their Use To Test and Validate Density Functionals and Molecular Orbital Theory.

PubMed

Amin, Elizabeth A; Truhlar, Donald G

2008-01-01

We present nonrelativistic and relativistic benchmark databases (obtained by coupled cluster calculations) of 10 Zn-ligand bond distances, 8 dipole moments, and 12 bond dissociation energies in Zn coordination compounds with O, S, NH3, H2O, OH, SCH3, and H ligands. These are used to test the predictions of 39 density functionals, Hartree-Fock theory, and seven more approximate molecular orbital theories. In the nonrelativisitic case, the M05-2X, B97-2, and mPW1PW functionals emerge as the most accurate ones for this test data, with unitless balanced mean unsigned errors (BMUEs) of 0.33, 0.38, and 0.43, respectively. The best local functionals (i.e., functionals with no Hartree-Fock exchange) are M06-L and τ-HCTH with BMUEs of 0.54 and 0.60, respectively. The popular B3LYP functional has a BMUE of 0.51, only slightly better than the value of 0.54 for the best local functional, which is less expensive. Hartree-Fock theory itself has a BMUE of 1.22. The M05-2X functional has a mean unsigned error of 0.008 Å for bond lengths, 0.19 D for dipole moments, and 4.30 kcal/mol for bond energies. The X3LYP functional has a smaller mean unsigned error (0.007 Å) for bond lengths but has mean unsigned errors of 0.43 D for dipole moments and 5.6 kcal/mol for bond energies. The M06-2X functional has a smaller mean unsigned error (3.3 kcal/mol) for bond energies but has mean unsigned errors of 0.017 Å for bond lengths and 0.37 D for dipole moments. The best of the semiempirical molecular orbital theories are PM3 and PM6, with BMUEs of 1.96 and 2.02, respectively. The ten most accurate functionals from the nonrelativistic benchmark analysis are then tested in relativistic calculations against new benchmarks obtained with coupled-cluster calculations and a relativistic effective core potential, resulting in M05-2X (BMUE = 0.895), PW6B95 (BMUE = 0.90), and B97-2 (BMUE = 0.93) as the top three functionals. We find significant relativistic effects (∼0.01 Å in bond lengths, ∼0.2 D in dipole moments, and ∼4 kcal/mol in Zn-ligand bond energies) that cannot be neglected for accurate modeling, but the same density functionals that do well in all-electron nonrelativistic calculations do well with relativistic effective core potentials. Although most tests are carried out with augmented polarized triple-ζ basis sets, we also carried out some tests with an augmented polarized double-ζ basis set, and we found, on average, that with the smaller basis set DFT has no loss in accuracy for dipole moments and only ∼10% less accurate bond lengths.
Relationships between human population density and burned area at continental and global scales.

PubMed

Bistinas, Ioannis; Oom, Duarte; Sá, Ana C L; Harrison, Sandy P; Prentice, I Colin; Pereira, José M C

2013-01-01

We explore the large spatial variation in the relationship between population density and burned area, using continental-scale Geographically Weighted Regression (GWR) based on 13 years of satellite-derived burned area maps from the global fire emissions database (GFED) and the human population density from the gridded population of the world (GPW 2005). Significant relationships are observed over 51.5% of the global land area, and the area affected varies from continent to continent: population density has a significant impact on fire over most of Asia and Africa but is important in explaining fire over < 22% of Europe and Australia. Increasing population density is associated with both increased and decreased in fire. The nature of the relationship depends on land-use: increasing population density is associated with increased burned are in rangelands but with decreased burned area in croplands. Overall, the relationship between population density and burned area is non-monotonic: burned area initially increases with population density and then decreases when population density exceeds a threshold. These thresholds vary regionally. Our study contributes to improved understanding of how human activities relate to burned area, and should contribute to a better estimate of atmospheric emissions from biomass burning.
Relationships between Human Population Density and Burned Area at Continental and Global Scales

PubMed Central

Bistinas, Ioannis; Oom, Duarte; Sá, Ana C. L.; Harrison, Sandy P.; Prentice, I. Colin; Pereira, José M. C.

2013-01-01

We explore the large spatial variation in the relationship between population density and burned area, using continental-scale Geographically Weighted Regression (GWR) based on 13 years of satellite-derived burned area maps from the global fire emissions database (GFED) and the human population density from the gridded population of the world (GPW 2005). Significant relationships are observed over 51.5% of the global land area, and the area affected varies from continent to continent: population density has a significant impact on fire over most of Asia and Africa but is important in explaining fire over < 22% of Europe and Australia. Increasing population density is associated with both increased and decreased in fire. The nature of the relationship depends on land-use: increasing population density is associated with increased burned are in rangelands but with decreased burned area in croplands. Overall, the relationship between population density and burned area is non-monotonic: burned area initially increases with population density and then decreases when population density exceeds a threshold. These thresholds vary regionally. Our study contributes to improved understanding of how human activities relate to burned area, and should contribute to a better estimate of atmospheric emissions from biomass burning. PMID:24358108
A-WINGS: an integrated genome database for Pleurocybella porrigens (Angel's wing oyster mushroom, Sugihiratake).

PubMed

Yamamoto, Naoki; Suzuki, Tomohiro; Kobayashi, Masaaki; Dohra, Hideo; Sasaki, Yohei; Hirai, Hirofumi; Yokoyama, Koji; Kawagishi, Hirokazu; Yano, Kentaro

2014-12-03

The angel's wing oyster mushroom (Pleurocybella porrigens, Sugihiratake) is a well-known delicacy. However, its potential risk in acute encephalopathy was recently revealed by a food poisoning incident. To disclose the genes underlying the accident and provide mechanistic insight, we seek to develop an information infrastructure containing omics data. In our previous work, we sequenced the genome and transcriptome using next-generation sequencing techniques. The next step in achieving our goal is to develop a web database to facilitate the efficient mining of large-scale omics data and identification of genes specifically expressed in the mushroom. This paper introduces a web database A-WINGS (http://bioinf.mind.meiji.ac.jp/a-wings/) that provides integrated genomic and transcriptomic information for the angel's wing oyster mushroom. The database contains structure and functional annotations of transcripts and gene expressions. Functional annotations contain information on homologous sequences from NCBI nr and UniProt, Gene Ontology, and KEGG Orthology. Digital gene expression profiles were derived from RNA sequencing (RNA-seq) analysis in the fruiting bodies and mycelia. The omics information stored in the database is freely accessible through interactive and graphical interfaces by search functions that include 'GO TREE VIEW' browsing, keyword searches, and BLAST searches. The A-WINGS database will accelerate omics studies on specific aspects of the angel's wing oyster mushroom and the family Tricholomataceae.
SInCRe—structural interactome computational resource for Mycobacterium tuberculosis

PubMed Central

Metri, Rahul; Hariharaputran, Sridhar; Ramakrishnan, Gayatri; Anand, Praveen; Raghavender, Upadhyayula S.; Ochoa-Montaño, Bernardo; Higueruelo, Alicia P.; Sowdhamini, Ramanathan; Chandra, Nagasuma R.; Blundell, Tom L.; Srinivasan, Narayanaswamy

2015-01-01

We have developed an integrated database for Mycobacterium tuberculosis H37Rv (Mtb) that collates information on protein sequences, domain assignments, functional annotation and 3D structural information along with protein–protein and protein–small molecule interactions. SInCRe (Structural Interactome Computational Resource) is developed out of CamBan (Cambridge and Bangalore) collaboration. The motivation for development of this database is to provide an integrated platform to allow easily access and interpretation of data and results obtained by all the groups in CamBan in the field of Mtb informatics. In-house algorithms and databases developed independently by various academic groups in CamBan are used to generate Mtb-specific datasets and are integrated in this database to provide a structural dimension to studies on tuberculosis. The SInCRe database readily provides information on identification of functional domains, genome-scale modelling of structures of Mtb proteins and characterization of the small-molecule binding sites within Mtb. The resource also provides structure-based function annotation, information on small-molecule binders including FDA (Food and Drug Administration)-approved drugs, protein–protein interactions (PPIs) and natural compounds that bind to pathogen proteins potentially and result in weakening or elimination of host–pathogen protein–protein interactions. Together they provide prerequisites for identification of off-target binding. Database URL: http://proline.biochem.iisc.ernet.in/sincre PMID:26130660
MECP2 variation in Rett syndrome-An overview of current coverage of genetic and phenotype data within existing databases.

PubMed

Townend, Gillian S; Ehrhart, Friederike; van Kranen, Henk J; Wilkinson, Mark; Jacobsen, Annika; Roos, Marco; Willighagen, Egon L; van Enckevort, David; Evelo, Chris T; Curfs, Leopold M G

2018-04-27

Rett syndrome (RTT) is a monogenic rare disorder that causes severe neurological problems. In most cases, it results from a loss-of-function mutation in the gene encoding methyl-CPG-binding protein 2 (MECP2). Currently, about 900 unique MECP2 variations (benign and pathogenic) have been identified and it is suspected that the different mutations contribute to different levels of disease severity. For researchers and clinicians, it is important that genotype-phenotype information is available to identify disease-causing mutations for diagnosis, to aid in clinical management of the disorder, and to provide counseling for parents. In this study, 13 genotype-phenotype databases were surveyed for their general functionality and availability of RTT-specific MECP2 variation data. For each database, we investigated findability and interoperability alongside practical user functionality, and type and amount of genetic and phenotype data. The main conclusions are that, as well as being challenging to find these databases and specific MECP2 variants held within, interoperability is as yet poorly developed and requires effort to search across databases. Nevertheless, we found several thousand online database entries for MECP2 variations and their associated phenotypes, diagnosis, or predicted variant effects, which is a good starting point for researchers and clinicians who want to provide, annotate, and use the data. © 2018 The Authors. Human Mutation published by Wiley Periodicals, Inc.
Using the genome aggregation database, computational pathogenicity prediction tools, and patch clamp heterologous expression studies to demote previously published long QT syndrome type 1 mutations from pathogenic to benign.

PubMed

Clemens, Daniel J; Lentino, Anne R; Kapplinger, Jamie D; Ye, Dan; Zhou, Wei; Tester, David J; Ackerman, Michael J

2018-04-01

Mutations in the KCNQ1-encoded Kv7.1 potassium channel cause long QT syndrome (LQTS) type 1 (LQT1). It has been suggested that ∼10%-20% of rare LQTS case-derived variants in the literature may have been published erroneously as LQT1-causative mutations and may be "false positives." The purpose of this study was to determine which previously published KCNQ1 case variants are likely false positives. A list of all published, case-derived KCNQ1 missense variants (MVs) was compiled. The occurrence of each MV within the Genome Aggregation Database (gnomAD) was assessed. Eight in silico tools were used to predict each variant's pathogenicity. Case-derived variants that were either (1) too frequently found in gnomAD or (2) absent in gnomAD but predicted to be pathogenic by ≤2 tools were considered potential false positives. Three of these variants were characterized functionally using whole-cell patch clamp technique. Overall, there were 244 KCNQ1 case-derived MVs. Of these, 29 (12%) were seen in ≥10 individuals in gnomAD and are demotable. However, 157 of 244 MVs (64%) were absent in gnomAD. Of these, 7 (4%) were predicted to be pathogenic by ≤2 tools, 3 of which we characterized functionally. There was no significant difference in current density between heterozygous KCNQ1-F127L, -P477L, or -L619M variant-containing channels compared to KCNQ1-WT. This study offers preliminary evidence for the demotion of 32 (13%) previously published LQT1 MVs. Of these, 29 were demoted because of their frequent sighting in gnomAD. Additionally, in silico analysis and in vitro functional studies have facilitated the demotion of 3 ultra-rare MVs (F127L, P477L, L619M). Copyright © 2017 Heart Rhythm Society. Published by Elsevier Inc. All rights reserved.
PoMaMo--a comprehensive database for potato genome data.

PubMed

Meyer, Svenja; Nagel, Axel; Gebhardt, Christiane

2005-01-01

A database for potato genome data (PoMaMo, Potato Maps and More) was established. The database contains molecular maps of all twelve potato chromosomes with about 1000 mapped elements, sequence data, putative gene functions, results from BLAST analysis, SNP and InDel information from different diploid and tetraploid potato genotypes, publication references, links to other public databases like GenBank (http://www.ncbi.nlm.nih.gov/) or SGN (Solanaceae Genomics Network, http://www.sgn.cornell.edu/), etc. Flexible search and data visualization interfaces enable easy access to the data via internet (https://gabi.rzpd.de/PoMaMo.html). The Java servlet tool YAMB (Yet Another Map Browser) was designed to interactively display chromosomal maps. Maps can be zoomed in and out, and detailed information about mapped elements can be obtained by clicking on an element of interest. The GreenCards interface allows a text-based data search by marker-, sequence- or genotype name, by sequence accession number, gene function, BLAST Hit or publication reference. The PoMaMo database is a comprehensive database for different potato genome data, and to date the only database containing SNP and InDel data from diploid and tetraploid potato genotypes.
PoMaMo—a comprehensive database for potato genome data

PubMed Central

Meyer, Svenja; Nagel, Axel; Gebhardt, Christiane

2005-01-01

A database for potato genome data (PoMaMo, Potato Maps and More) was established. The database contains molecular maps of all twelve potato chromosomes with about 1000 mapped elements, sequence data, putative gene functions, results from BLAST analysis, SNP and InDel information from different diploid and tetraploid potato genotypes, publication references, links to other public databases like GenBank (http://www.ncbi.nlm.nih.gov/) or SGN (Solanaceae Genomics Network, http://www.sgn.cornell.edu/), etc. Flexible search and data visualization interfaces enable easy access to the data via internet (https://gabi.rzpd.de/PoMaMo.html). The Java servlet tool YAMB (Yet Another Map Browser) was designed to interactively display chromosomal maps. Maps can be zoomed in and out, and detailed information about mapped elements can be obtained by clicking on an element of interest. The GreenCards interface allows a text-based data search by marker-, sequence- or genotype name, by sequence accession number, gene function, BLAST Hit or publication reference. The PoMaMo database is a comprehensive database for different potato genome data, and to date the only database containing SNP and InDel data from diploid and tetraploid potato genotypes. PMID:15608284
Development of an electronic database for Acute Pain Service outcomes

PubMed Central

Love, Brandy L; Jensen, Louise A; Schopflocher, Donald; Tsui, Ban CH

2012-01-01

BACKGROUND: Quality assurance is increasingly important in the current health care climate. An electronic database can be used for tracking patient information and as a research tool to provide quality assurance for patient care. OBJECTIVE: An electronic database was developed for the Acute Pain Service, University of Alberta Hospital (Edmonton, Alberta) to record patient characteristics, identify at-risk populations, compare treatment efficacies and guide practice decisions. METHOD: Steps in the database development involved identifying the goals for use, relevant variables to include, and a plan for data collection, entry and analysis. Protocols were also created for data cleaning quality control. The database was evaluated with a pilot test using existing data to assess data collection burden, accuracy and functionality of the database. RESULTS: A literature review resulted in an evidence-based list of demographic, clinical and pain management outcome variables to include. Time to assess patients and collect the data was 20 min to 30 min per patient. Limitations were primarily software related, although initial data collection completion was only 65% and accuracy of data entry was 96%. CONCLUSIONS: The electronic database was found to be relevant and functional for the identified goals of data storage and research. PMID:22518364
Force Density Function Relationships in 2-D Granular Media

NASA Technical Reports Server (NTRS)

Youngquist, Robert C.; Metzger, Philip T.; Kilts, Kelly N.

2004-01-01

An integral transform relationship is developed to convert between two important probability density functions (distributions) used in the study of contact forces in granular physics. Developing this transform has now made it possible to compare and relate various theoretical approaches with one another and with the experimental data despite the fact that one may predict the Cartesian probability density and another the force magnitude probability density. Also, the transforms identify which functional forms are relevant to describe the probability density observed in nature, and so the modified Bessel function of the second kind has been identified as the relevant form for the Cartesian probability density corresponding to exponential forms in the force magnitude distribution. Furthermore, it is shown that this transform pair supplies a sufficient mathematical framework to describe the evolution of the force magnitude distribution under shearing. Apart from the choice of several coefficients, whose evolution of values must be explained in the physics, this framework successfully reproduces the features of the distribution that are taken to be an indicator of jamming and unjamming in a granular packing. Key words. Granular Physics, Probability Density Functions, Fourier Transforms
Bluegill growth as modified by plant density: an exploration of underlying mechanisms

USGS Publications Warehouse

Savino, Jacqueline F.; Marschall, Elizabeth A.; Stein, Roy A.

1992-01-01

Bluegill (Lepomis macrochira) growth varies inconsistently with plant density. In laboratory and field experiments, we explored mechanisms underlying bluegill growth as a function of plant and invertebrate density. In the laboratory, bluegills captured more chironomids (Chironomus riparius) than damselflies (Enallagma spp. and Ischnura spp.), but energy intake per time spent searching did not differ between damselfly and chironomid treatments. From laboratory data, we described prey encounter rates as functions of plant and invertebrate density. In Clark Lake, Ohio, we created 0.05-ha mesocosms of inshore vegetation to generate macrophyte densities of 125, 270, and 385 stems/m2 of Potamogeton and Ceratophyllum and added 46-mm bluegill (1/m2). In these mesocosms, invertebrate density increased as a function of macrophyte density. Combining this function with encounter rate functions derived from laboratory data, we predicted that bluegill growth should peak at a high macrophyte density, greater than 1000 stems/m2, even though growth should change only slightly beyond 100 stems/m2. Consistent with our predictions, bluegills did not grow differentially, nor did their use of different prey taxa differ, across macrophyte densities in the field. Bluegills preferred chironomid pupae, which were relatively few in numbers but vulnerable to predation, whereas more cryptic, chironomid larvae, which were associated with vegetation but were relatively abundant, were eaten as encountered. Bluegill avoided physid snails. Contrary to previous work, vegetation did not influence growth or diet of bluegill beyond relatively low densities owing to the interaction between capture probabilities and macroinvertebrate densities.
Assessing Density Functionals Using Many Body Theory for Hybrid Perovskites

NASA Astrophysics Data System (ADS)

Bokdam, Menno; Lahnsteiner, Jonathan; Ramberger, Benjamin; Schäfer, Tobias; Kresse, Georg

2017-10-01

Which density functional is the "best" for structure simulations of a particular material? A concise, first principles, approach to answer this question is presented. The random phase approximation (RPA)—an accurate many body theory—is used to evaluate various density functionals. To demonstrate and verify the method, we apply it to the hybrid perovskite MAPbI3 , a promising new solar cell material. The evaluation is done by first creating finite temperature ensembles for small supercells using RPA molecular dynamics, and then evaluating the variance between the RPA and various approximate density functionals for these ensembles. We find that, contrary to recent suggestions, van der Waals functionals do not improve the description of the material, whereas hybrid functionals and the strongly constrained appropriately normed (SCAN) density functional yield very good agreement with the RPA. Finally, our study shows that in the room temperature tetragonal phase of MAPbI3 , the molecules are preferentially parallel to the shorter lattice vectors but reorientation on ps time scales is still possible.
Analytical gradients for subsystem density functional theory within the slater-function-based amsterdam density functional program.

PubMed

Schlüns, Danny; Franchini, Mirko; Götz, Andreas W; Neugebauer, Johannes; Jacob, Christoph R; Visscher, Lucas

2017-02-05

We present a new implementation of analytical gradients for subsystem density-functional theory (sDFT) and frozen-density embedding (FDE) into the Amsterdam Density Functional program (ADF). The underlying theory and necessary expressions for the implementation are derived and discussed in detail for various FDE and sDFT setups. The parallel implementation is numerically verified and geometry optimizations with different functional combinations (LDA/TF and PW91/PW91K) are conducted and compared to reference data. Our results confirm that sDFT-LDA/TF yields good equilibrium distances for the systems studied here (mean absolute deviation: 0.09 Å) compared to reference wave-function theory results. However, sDFT-PW91/PW91k quite consistently yields smaller equilibrium distances (mean absolute deviation: 0.23 Å). The flexibility of our new implementation is demonstrated for an HCN-trimer test system, for which several different setups are applied. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Towards computational improvement of DNA database indexing and short DNA query searching.

PubMed

Stojanov, Done; Koceski, Sašo; Mileva, Aleksandra; Koceska, Nataša; Bande, Cveta Martinovska

2014-09-03

In order to facilitate and speed up the search of massive DNA databases, the database is indexed at the beginning, employing a mapping function. By searching through the indexed data structure, exact query hits can be identified. If the database is searched against an annotated DNA query, such as a known promoter consensus sequence, then the starting locations and the number of potential genes can be determined. This is particularly relevant if unannotated DNA sequences have to be functionally annotated. However, indexing a massive DNA database and searching an indexed data structure with millions of entries is a time-demanding process. In this paper, we propose a fast DNA database indexing and searching approach, identifying all query hits in the database, without having to examine all entries in the indexed data structure, limiting the maximum length of a query that can be searched against the database. By applying the proposed indexing equation, the whole human genome could be indexed in 10 hours on a personal computer, under the assumption that there is enough RAM to store the indexed data structure. Analysing the methodology proposed by Reneker, we observed that hits at starting positions [Formula: see text] are not reported, if the database is searched against a query shorter than [Formula: see text] nucleotides, such that [Formula: see text] is the length of the DNA database words being mapped and [Formula: see text] is the length of the query. A solution of this drawback is also presented.
Building MapObjects attribute field in cadastral database based on the method of Jackson system development

NASA Astrophysics Data System (ADS)

Chen, Zhu-an; Zhang, Li-ting; Liu, Lu

2009-10-01

ESRI's GIS components MapObjects are applied in many cadastral information system because of its miniaturization and flexibility. Some cadastral information was saved in cadastral database directly by MapObjects's Shape file format in this cadastral information system. However, MapObjects didn't provide the function of building attribute field for map layer's attribute data file in cadastral database and user cann't save the result of analysis. This present paper designed and realized the function of building attribute field in MapObjects based on the method of Jackson's system development.
Design of Student Information Management Database Application System for Office and Departmental Target Responsibility System

NASA Astrophysics Data System (ADS)

Zhou, Hui

It is the inevitable outcome of higher education reform to carry out office and departmental target responsibility system, in which statistical processing of student's information is an important part of student's performance review. On the basis of the analysis of the student's evaluation, the student information management database application system is designed by using relational database management system software in this paper. In order to implement the function of student information management, the functional requirement, overall structure, data sheets and fields, data sheet Association and software codes are designed in details.
Utilizing semantic networks to database and retrieve generalized stochastic colored Petri nets

NASA Technical Reports Server (NTRS)

Farah, Jeffrey J.; Kelley, Robert B.

1992-01-01

Previous work has introduced the Planning Coordinator (PCOORD), a coordinator functioning within the hierarchy of the Intelligent Machine Mode. Within the structure of the Planning Coordinator resides the Primitive Structure Database (PSDB) functioning to provide the primitive structures utilized by the Planning Coordinator in the establishing of error recovery or on-line path plans. This report further explores the Primitive Structure Database and establishes the potential of utilizing semantic networks as a means of efficiently storing and retrieving the Generalized Stochastic Colored Petri Nets from which the error recovery plans are derived.
Multireference Density Functional Theory with Generalized Auxiliary Systems for Ground and Excited States.

PubMed

Chen, Zehua; Zhang, Du; Jin, Ye; Yang, Yang; Su, Neil Qiang; Yang, Weitao

2017-09-21

To describe static correlation, we develop a new approach to density functional theory (DFT), which uses a generalized auxiliary system that is of a different symmetry, such as particle number or spin, from that of the physical system. The total energy of the physical system consists of two parts: the energy of the auxiliary system, which is determined with a chosen density functional approximation (DFA), and the excitation energy from an approximate linear response theory that restores the symmetry to that of the physical system, thus rigorously leading to a multideterminant description of the physical system. The electron density of the physical system is different from that of the auxiliary system and is uniquely determined from the functional derivative of the total energy with respect to the external potential. Our energy functional is thus an implicit functional of the physical system density, but an explicit functional of the auxiliary system density. We show that the total energy minimum and stationary states, describing the ground and excited states of the physical system, can be obtained by a self-consistent optimization with respect to the explicit variable, the generalized Kohn-Sham noninteracting density matrix. We have developed the generalized optimized effective potential method for the self-consistent optimization. Among options of the auxiliary system and the associated linear response theory, reformulated versions of the particle-particle random phase approximation (pp-RPA) and the spin-flip time-dependent density functional theory (SF-TDDFT) are selected for illustration of principle. Numerical results show that our multireference DFT successfully describes static correlation in bond dissociation and double bond rotation.

Estimation of distances to stars with stellar parameters from LAMOST

DOE PAGES

Carlin, Jeffrey L.; Liu, Chao; Newberg, Heidi Jo; ...

2015-06-05

Here, we present a method to estimate distances to stars with spectroscopically derived stellar parameters. The technique is a Bayesian approach with likelihood estimated via comparison of measured parameters to a grid of stellar isochrones, and returns a posterior probability density function for each star's absolute magnitude. We tailor this technique specifically to data from the Large Sky Area Multi-object Fiber Spectroscopic Telescope (LAMOST) survey. Because LAMOST obtains roughly 3000 stellar spectra simultaneously within each ~5-degree diameter "plate" that is observed, we can use the stellar parameters of the observed stars to account for the stellar luminosity function and targetmore » selection effects. This removes biasing assumptions about the underlying populations, both due to predictions of the luminosity function from stellar evolution modeling, and from Galactic models of stellar populations along each line of sight. Using calibration data of stars with known distances and stellar parameters, we show that our method recovers distances for most stars within ~20%, but with some systematic overestimation of distances to halo giants. We apply our code to the LAMOST database, and show that the current precision of LAMOST stellar parameters permits measurements of distances with ~40% error bars. This precision should improve as the LAMOST data pipelines continue to be refined.« less
Estimation of distances to stars with stellar parameters from LAMOST

DOE Office of Scientific and Technical Information (OSTI.GOV)

Carlin, Jeffrey L.; Liu, Chao; Newberg, Heidi Jo

Here, we present a method to estimate distances to stars with spectroscopically derived stellar parameters. The technique is a Bayesian approach with likelihood estimated via comparison of measured parameters to a grid of stellar isochrones, and returns a posterior probability density function for each star's absolute magnitude. We tailor this technique specifically to data from the Large Sky Area Multi-object Fiber Spectroscopic Telescope (LAMOST) survey. Because LAMOST obtains roughly 3000 stellar spectra simultaneously within each ~5-degree diameter "plate" that is observed, we can use the stellar parameters of the observed stars to account for the stellar luminosity function and targetmore » selection effects. This removes biasing assumptions about the underlying populations, both due to predictions of the luminosity function from stellar evolution modeling, and from Galactic models of stellar populations along each line of sight. Using calibration data of stars with known distances and stellar parameters, we show that our method recovers distances for most stars within ~20%, but with some systematic overestimation of distances to halo giants. We apply our code to the LAMOST database, and show that the current precision of LAMOST stellar parameters permits measurements of distances with ~40% error bars. This precision should improve as the LAMOST data pipelines continue to be refined.« less
The Sequences of 1504 Mutants in the Model Rice Variety Kitaake Facilitate Rapid Functional Genomic Studies

PubMed Central

Pham, Nikki T.; Wei, Tong; Schackwitz, Wendy S.; Lipzen, Anna M.; Duong, Phat Q.; Jones, Kyle C.; Ruan, Deling; Bauer, Diane; Peng, Yi; Schmutz, Jeremy

2017-01-01

The availability of a whole-genome sequenced mutant population and the cataloging of mutations of each line at a single-nucleotide resolution facilitate functional genomic analysis. To this end, we generated and sequenced a fast-neutron-induced mutant population in the model rice cultivar Kitaake (Oryza sativa ssp japonica), which completes its life cycle in 9 weeks. We sequenced 1504 mutant lines at 45-fold coverage and identified 91,513 mutations affecting 32,307 genes, i.e., 58% of all rice genes. We detected an average of 61 mutations per line. Mutation types include single-base substitutions, deletions, insertions, inversions, translocations, and tandem duplications. We observed a high proportion of loss-of-function mutations. We identified an inversion affecting a single gene as the causative mutation for the short-grain phenotype in one mutant line. This result reveals the usefulness of the resource for efficient, cost-effective identification of genes conferring specific phenotypes. To facilitate public access to this genetic resource, we established an open access database called KitBase that provides access to sequence data and seed stocks. This population complements other available mutant collections and gene-editing technologies. This work demonstrates how inexpensive next-generation sequencing can be applied to generate a high-density catalog of mutations. PMID:28576844
Scaling and memory in volatility return intervals in financial markets

NASA Astrophysics Data System (ADS)

Yamasaki, Kazuko; Muchnik, Lev; Havlin, Shlomo; Bunde, Armin; Stanley, H. Eugene

2005-06-01

For both stock and currency markets, we study the return intervals τ between the daily volatilities of the price changes that are above a certain threshold q. We find that the distribution function Pq(τ) scales with the mean return interval [Formula] as [Formula]. The scaling function f(x) is similar in form for all seven stocks and for all seven currency databases analyzed, and f(x) is consistent with a power-law form, f(x) ˜ x-γ with γ ≈ 2. We also quantify how the conditional distribution Pq(τ|τ0) depends on the previous return interval τ0 and find that small (or large) return intervals are more likely to be followed by small (or large) return intervals. This “clustering” of the volatility return intervals is a previously unrecognized phenomenon that we relate to the long-term correlations known to be present in the volatility. Author contributions: S.H. and H.E.S. designed research; K.Y., L.M., S.H., and H.E.S. performed research; A.B. contributed new reagents/analytic tools; A.B. analyzed data; and S.H. wrote the paper.Abbreviations: pdf, probability density function; S&P 500, Standard and Poor's 500 Index; USD, U.S. dollar; JPY, Japanese yen; SEK, Swedish krona.
Basis convergence of range-separated density-functional theory

DOE Office of Scientific and Technical Information (OSTI.GOV)

Franck, Odile, E-mail: odile.franck@etu.upmc.fr; Mussard, Bastien, E-mail: bastien.mussard@upmc.fr; CNRS, UMR 7616, Laboratoire de Chimie Théorique, F-75005 Paris

2015-02-21

Range-separated density-functional theory (DFT) is an alternative approach to Kohn-Sham density-functional theory. The strategy of range-separated density-functional theory consists in separating the Coulomb electron-electron interaction into long-range and short-range components and treating the long-range part by an explicit many-body wave-function method and the short-range part by a density-functional approximation. Among the advantages of using many-body methods for the long-range part of the electron-electron interaction is that they are much less sensitive to the one-electron atomic basis compared to the case of the standard Coulomb interaction. Here, we provide a detailed study of the basis convergence of range-separated density-functional theory. Wemore » study the convergence of the partial-wave expansion of the long-range wave function near the electron-electron coalescence. We show that the rate of convergence is exponential with respect to the maximal angular momentum L for the long-range wave function, whereas it is polynomial for the case of the Coulomb interaction. We also study the convergence of the long-range second-order Møller-Plesset correlation energy of four systems (He, Ne, N{sub 2}, and H{sub 2}O) with cardinal number X of the Dunning basis sets cc − p(C)V XZ and find that the error in the correlation energy is best fitted by an exponential in X. This leads us to propose a three-point complete-basis-set extrapolation scheme for range-separated density-functional theory based on an exponential formula.« less
Digital mining claim density map for federal lands in Idaho: 1996

USGS Publications Warehouse

Hyndman, Paul C.; Campbell, Harry W.

1999-01-01

This report describes a digital map generated by the U.S. Geological Survey (USGS) to provide digital spatial mining claim density information for federal lands in Idaho as of March 1997. Mining claim data is earth science information deemed to be relevant to the assessment of historic, current, and future ecological, economic, and social systems. There is no paper map included in this Open-File report. In accordance with the Federal Land Policy and Management Act of 1976 (FLPMA), all unpatented mining claims, mill and tunnel sites must be recorded at the appropriate Bureau of Land Management (BLM) State office. BLM maintains a cumulative computer listing of mining claims in the Mining Claim Recordation System (MCRS) database with locations given by meridian, township, range, and section. A mining claim is considered closed when the claim is relinquished or a formal BLM decision declaring the mining claim null and void has been issued and the appeal period has expired. All other mining claims filed with BLM are considered to be open and actively held. The digital map (figure 1.) with the mining claim density database available in this report are suitable for geographic information system (GIS)-based regional assessments at a scale of 1:100,000 or smaller.
Digital mining claim density map for federal lands in Oregon: 1996

USGS Publications Warehouse

Hyndman, Paul C.; Campbell, Harry W.

1999-01-01

This report describes a digital map generated by the U.S. Geological Survey (USGS) to provide digital spatial mining claim density information for federal lands in Oregon as of March 1997. Mining claim data is earth science information deemed to be relevant to the assessment of historic, current, and future ecological, economic, and social systems. There is no paper map included in this Open-File report. In accordance with the Federal Land Policy and Management Act of 1976 (FLPMA), all unpatented mining claims, mill and tunnel sites must be recorded at the appropriate Bureau of Land Management (BLM) State office. BLM maintains a cumulative computer listing of mining claims in the Mining Claim Recordation System (MCRS) database with locations given by meridian, township, range, and section. A mining claim is considered closed when the claim is relinquished or a formal BLM decision declaring the mining claim null and void has been issued and the appeal period has expired. All other mining claims filed with BLM are considered to be open and actively held. The digital map (figure 1.) with the mining claim density database available in this report are suitable for geographic information system (GIS)-based regional assessments at a scale of 1:100,000 or smaller.
Gesture recognition by instantaneous surface EMG images.

PubMed

Geng, Weidong; Du, Yu; Jin, Wenguang; Wei, Wentao; Hu, Yu; Li, Jiajun

2016-11-15

Gesture recognition in non-intrusive muscle-computer interfaces is usually based on windowed descriptive and discriminatory surface electromyography (sEMG) features because the recorded amplitude of a myoelectric signal may rapidly fluctuate between voltages above and below zero. Here, we present that the patterns inside the instantaneous values of high-density sEMG enables gesture recognition to be performed merely with sEMG signals at a specific instant. We introduce the concept of an sEMG image spatially composed from high-density sEMG and verify our findings from a computational perspective with experiments on gesture recognition based on sEMG images with a classification scheme of a deep convolutional network. Without any windowed features, the resultant recognition accuracy of an 8-gesture within-subject test reached 89.3% on a single frame of sEMG image and reached 99.0% using simple majority voting over 40 frames with a 1,000 Hz sampling rate. Experiments on the recognition of 52 gestures of NinaPro database and 27 gestures of CSL-HDEMG database also validated that our approach outperforms state-of-the-arts methods. Our findings are a starting point for the development of more fluid and natural muscle-computer interfaces with very little observational latency. For example, active prostheses and exoskeletons based on high-density electrodes could be controlled with instantaneous responses.
Development and Validation of the Agency for Healthcare Research and Quality Measures of Potentially Preventable Emergency Department (ED) Visits: The ED Prevention Quality Indicators for General Health Conditions.

PubMed

Davies, Sheryl; Schultz, Ellen; Raven, Maria; Wang, Nancy Ewen; Stocks, Carol L; Delgado, Mucio Kit; McDonald, Kathryn M

2017-10-01

To develop and validate rates of potentially preventable emergency department (ED) visits as indicators of community health. Agency for Healthcare Research and Quality, Healthcare Cost and Utilization Project 2008-2010 State Inpatient Databases and State Emergency Department Databases. Empirical analyses and structured panel reviews. Panels of 14-17 clinicians and end users evaluated a set of ED Prevention Quality Indicators (PQIs) using a Modified Delphi process. Empirical analyses included assessing variation in ED PQI rates across counties and sensitivity of those rates to county-level poverty, uninsurance, and density of primary care physicians (PCPs). ED PQI rates varied widely across U.S. communities. Indicator rates were significantly associated with county-level poverty, median income, Medicaid insurance, and levels of uninsurance. A few indicators were significantly associated with PCP density, with higher rates in areas with greater density. A clinical and an end-user panel separately rated the indicators as having strong face validity for most uses evaluated. The ED PQIs have undergone initial validation as indicators of community health with potential for use in public reporting, population health improvement, and research. © Health Research and Educational Trust.
Comparison of density determination of liquid samples by density meters

NASA Astrophysics Data System (ADS)

Buchner, C.; Wolf, H.; Vámossy, C.; Lorefice, S.; Lenard, E.; Spohr, I.; Mares, G.; Perkin, M.; Parlic-Risovic, T.; Grue, L.-L.; Tammik, K.; van Andel, I.; Zelenka, Z.

2016-01-01

Hydrostatic density determinations of liquids as reference material are mainly performed by National Metrology Institutes to provide means for calibrating or checking liquid density measuring instruments such as oscillation-type density meters. These density meters are used by most of the metrology institutes for their calibration and scientific work. The aim of this project was to compare the results of the liquid density determination by oscillating density meters of the participating laboratories. The results were linked to CCM.D.K-2 partly via Project EURAMET.M.D.K-2 (1019) "Comparison of liquid density standards" by hydrostatic weighing piloted by BEV in 2008. In this comparison pentadecane, water and of oil with a high viscosity were measured at atmospheric pressure using oscillation type density meter. The temperature range was from 15 °C to 40 °C. The measurement results were in some cases discrepant. Further studies, comparisons are essential to explore the capability and uncertainty of the density meters Main text To reach the main text of this paper, click on Final Report. Note that this text is that which appears in Appendix B of the BIPM key comparison database kcdb.bipm.org/. The final report has been peer-reviewed and approved for publication by the CCM, according to the provisions of the CIPM Mutual Recognition Arrangement (CIPM MRA).
Functional renormalization group and Kohn-Sham scheme in density functional theory

NASA Astrophysics Data System (ADS)

Liang, Haozhao; Niu, Yifei; Hatsuda, Tetsuo

2018-04-01

Deriving accurate energy density functional is one of the central problems in condensed matter physics, nuclear physics, and quantum chemistry. We propose a novel method to deduce the energy density functional by combining the idea of the functional renormalization group and the Kohn-Sham scheme in density functional theory. The key idea is to solve the renormalization group flow for the effective action decomposed into the mean-field part and the correlation part. Also, we propose a simple practical method to quantify the uncertainty associated with the truncation of the correlation part. By taking the φ4 theory in zero dimension as a benchmark, we demonstrate that our method shows extremely fast convergence to the exact result even for the highly strong coupling regime.
Rational Density Functional Selection Using Game Theory.

PubMed

McAnanama-Brereton, Suzanne; Waller, Mark P

2018-01-22

Theoretical chemistry has a paradox of choice due to the availability of a myriad of density functionals and basis sets. Traditionally, a particular density functional is chosen on the basis of the level of user expertise (i.e., subjective experiences). Herein we circumvent the user-centric selection procedure by describing a novel approach for objectively selecting a particular functional for a given application. We achieve this by employing game theory to identify optimal functional/basis set combinations. A three-player (accuracy, complexity, and similarity) game is devised, through which Nash equilibrium solutions can be obtained. This approach has the advantage that results can be systematically improved by enlarging the underlying knowledge base, and the deterministic selection procedure mathematically justifies the density functional and basis set selections.
Rice proteome analysis: a step toward functional analysis of the rice genome.

PubMed

Komatsu, Setsuko; Tanaka, Naoki

2005-03-01

The technique of proteome analysis using 2-DE has the power to monitor global changes that occur in the protein complement of tissues and subcellular compartments. In this review, we describe construction of the rice proteome database, the cataloging of rice proteins, and the functional characterization of some of the proteins identified. Initially, proteins extracted from various tissues and organelles were separated by 2-DE and an image analyzer was used to construct a display or reference map of the proteins. The rice proteome database currently contains 23 reference maps based on 2-DE of proteins from different rice tissues and subcellular compartments. These reference maps comprise 13 129 rice proteins, and the amino acid sequences of 5092 of these proteins are entered in the database. Major proteins involved in growth or stress responses have been identified by using a proteomics approach and some of these proteins have unique functions. Furthermore, initial work has also begun on analyzing the phosphoproteome and protein-protein interactions in rice. The information obtained from the rice proteome database will aid in the molecular cloning of rice genes and in predicting the function of unknown proteins.
HUNT: launch of a full-length cDNA database from the Helix Research Institute.

PubMed

Yudate, H T; Suwa, M; Irie, R; Matsui, H; Nishikawa, T; Nakamura, Y; Yamaguchi, D; Peng, Z Z; Yamamoto, T; Nagai, K; Hayashi, K; Otsuki, T; Sugiyama, T; Ota, T; Suzuki, Y; Sugano, S; Isogai, T; Masuho, Y

2001-01-01

The Helix Research Institute (HRI) in Japan is releasing 4356 HUman Novel Transcripts and related information in the newly established HUNT database. The institute is a joint research project principally funded by the Japanese Ministry of International Trade and Industry, and the clones were sequenced in the governmental New Energy and Industrial Technology Development Organization (NEDO) Human cDNA Sequencing Project. The HUNT database contains an extensive amount of annotation from advanced analysis and represents an essential bioinformatics contribution towards understanding of the gene function. The HRI human cDNA clones were obtained from full-length enriched cDNA libraries constructed with the oligo-capping method and have resulted in novel full-length cDNA sequences. A large fraction has little similarity to any proteins of known function and to obtain clues about possible function we have developed original analysis procedures. Any putative function deduced here can be validated or refuted by complementary analysis results. The user can also extract information from specific categories like PROSITE patterns, PFAM domains, PSORT localization, transmembrane helices and clones with GENIUS structure assignments. The HUNT database can be accessed at http://www.hri.co.jp/HUNT.
Modelling the detachment dependence on strike point location in the small angle slot divertor (SAS) with SOLPS

NASA Astrophysics Data System (ADS)

Casali, Livia; Covele, Brent; Guo, Houyang

2017-10-01

The new Small Angle Slot (SAS) divertor in DIII-D is characterized by a shallow-angle target enclosed by a slot structure about the strike point (SP). SOLPS modelling results of SAS have demonstrated divertor closure's utility in widening the range of acceptable densities for adequate heat handling. An extensive database of runs has been built to study the detachment dependence on SP location in SAS. Density scans show that lower Te at lower upstream density occur when the SP is at the critical location in the slot. The cooling front spreads across the entire target at higher densities, in agreement with experimental Langmuir probe measurements. A localized increase of the atomic and molecular density takes place near the SP, which reduces the target incident power density and facilitates detachment at lower upstream density. Systematic scans of variables such as power, transport, and viscosity have been carried out to assess the detachment sensitivity. Therein, a positive role of the viscosity is found. This work supported by DOE Contract Number DE-FC02-04ER54698.
Realistic simulated MRI and SPECT databases. Application to SPECT/MRI registration evaluation.

PubMed

Aubert-Broche, Berengere; Grova, Christophe; Reilhac, Anthonin; Evans, Alan C; Collins, D Louis

2006-01-01

This paper describes the construction of simulated SPECT and MRI databases that account for realistic anatomical and functional variability. The data is used as a gold-standard to evaluate four SPECT/MRI similarity-based registration methods. Simulation realism was accounted for using accurate physical models of data generation and acquisition. MRI and SPECT simulations were generated from three subjects to take into account inter-subject anatomical variability. Functional SPECT data were computed from six functional models of brain perfusion. Previous models of normal perfusion and ictal perfusion observed in Mesial Temporal Lobe Epilepsy (MTLE) were considered to generate functional variability. We studied the impact noise and intensity non-uniformity in MRI simulations and SPECT scatter correction may have on registration accuracy. We quantified the amount of registration error caused by anatomical and functional variability. Registration involving ictal data was less accurate than registration involving normal data. MR intensity nonuniformity was the main factor decreasing registration accuracy. The proposed simulated database is promising to evaluate many functional neuroimaging methods, involving MRI and SPECT data.
A Brief Review of RNA–Protein Interaction Database Resources

PubMed Central

Yi, Ying; Zhao, Yue; Huang, Yan; Wang, Dong

2017-01-01

RNA–Protein interactions play critical roles in various biological processes. By collecting and analyzing the RNA–Protein interactions and binding sites from experiments and predictions, RNA–Protein interaction databases have become an essential resource for the exploration of the transcriptional and post-transcriptional regulatory network. Here, we briefly review several widely used RNA–Protein interaction database resources developed in recent years to provide a guide of these databases. The content and major functions in databases are presented. The brief description of database helps users to quickly choose the database containing information they interested. In short, these RNA–Protein interaction database resources are continually updated, but the current state shows the efforts to identify and analyze the large amount of RNA–Protein interactions. PMID:29657278
The maximal-density mass function for primordial black hole dark matter

NASA Astrophysics Data System (ADS)

Lehmann, Benjamin V.; Profumo, Stefano; Yant, Jackson

2018-04-01

The advent of gravitational wave astronomy has rekindled interest in primordial black holes (PBH) as a dark matter candidate. As there are many different observational probes of the PBH density across different masses, constraints on PBH models are dependent on the functional form of the PBH mass function. This complicates general statements about the mass functions allowed by current data, and, in particular, about the maximum total density of PBH. Numerical studies suggest that some forms of extended mass functions face tighter constraints than monochromatic mass functions, but they do not preclude the existence of a functional form for which constraints are relaxed. We use analytical arguments to show that the mass function which maximizes the fraction of the matter density in PBH subject to all constraints is a finite linear combination of monochromatic mass functions. We explicitly compute the maximum fraction of dark matter in PBH for different combinations of current constraints, allowing for total freedom of the mass function. Our framework elucidates the dependence of the maximum PBH density on the form of observational constraints, and we discuss the implications of current and future constraints for the viability of the PBH dark matter paradigm.
MicroUse: The Database on Microcomputer Applications in Libraries and Information Centers.

ERIC Educational Resources Information Center

Chen, Ching-chih; Wang, Xiaochu

1984-01-01

Describes MicroUse, a microcomputer-based database on microcomputer applications in libraries and information centers which was developed using relational database manager dBASE II. The description includes its system configuration, software utilized, the in-house-developed dBASE programs, multifile structure, basic functions, MicroUse records,…
The Politics of Information: Building a Relational Database To Support Decision-Making at a Public University.

ERIC Educational Resources Information Center

Friedman, Debra; Hoffman, Phillip

2001-01-01

Describes creation of a relational database at the University of Washington supporting ongoing academic planning at several levels and affecting the culture of decision making. Addresses getting started; sharing the database; questions, worries, and issues; improving access to high-demand courses; the advising function; management of instructional…

Some links on this page may take you to non-federal websites. Their policies may differ from this site.