Receptor-based 3D-QSAR in Drug Design: Methods and Applications in Kinase Studies.
Fang, Cheng; Xiao, Zhiyan
2016-01-01
Receptor-based 3D-QSAR strategy represents a superior integration of structure-based drug design (SBDD) and three-dimensional quantitative structure-activity relationship (3D-QSAR) analysis. It combines the accurate prediction of ligand poses by the SBDD approach with the good predictability and interpretability of statistical models derived from the 3D-QSAR approach. Extensive efforts have been devoted to the development of receptor-based 3D-QSAR methods and two alternative approaches have been exploited. One associates with computing the binding interactions between a receptor and a ligand to generate structure-based descriptors for QSAR analyses. The other concerns the application of various docking protocols to generate optimal ligand poses so as to provide reliable molecular alignments for the conventional 3D-QSAR operations. This review highlights new concepts and methodologies recently developed in the field of receptorbased 3D-QSAR, and in particular, covers its application in kinase studies.
Mohr, Johannes A; Jain, Brijnesh J; Obermayer, Klaus
2008-09-01
Quantitative structure activity relationship (QSAR) analysis is traditionally based on extracting a set of molecular descriptors and using them to build a predictive model. In this work, we propose a QSAR approach based directly on the similarity between the 3D structures of a set of molecules measured by a so-called molecule kernel, which is independent of the spatial prealignment of the compounds. Predictors can be build using the molecule kernel in conjunction with the potential support vector machine (P-SVM), a recently proposed machine learning method for dyadic data. The resulting models make direct use of the structural similarities between the compounds in the test set and a subset of the training set and do not require an explicit descriptor construction. We evaluated the predictive performance of the proposed method on one classification and four regression QSAR datasets and compared its results to the results reported in the literature for several state-of-the-art descriptor-based and 3D QSAR approaches. In this comparison, the proposed molecule kernel method performed better than the other QSAR methods.
Dong, Xialan; Ebalunode, Jerry O; Cho, Sung Jin; Zheng, Weifan
2010-02-22
Quantitative structure-activity relationship (QSAR) methods aim to build quantitatively predictive models for the discovery of new molecules. It has been widely used in medicinal chemistry for drug discovery. Many QSAR techniques have been developed since Hansch's seminal work, and more are still being developed. Motivated by Hopfinger's receptor-dependent QSAR (RD-QSAR) formalism and the Lukacova-Balaz scheme to treat multimode issues, we have initiated studies that focus on a structure-based multimode QSAR (SBMM QSAR) method, where the structure of the target protein is used in characterizing the ligand, and the multimode issue of ligand binding is systematically treated with a modified Lukacova-Balaz scheme. All ligand molecules are first docked to the target binding pocket to obtain a set of aligned ligand poses. A structure-based pharmacophore concept is adopted to characterize the binding pocket. Specifically, we represent the binding pocket as a geometric grid labeled by pharmacophoric features. Each pose of the ligand is also represented as a labeled grid, where each grid point is labeled according to the atom types of nearby ligand atoms. These labeled grids or three-dimensional (3D) maps (both the receptor map (R-map) and the ligand map (L-map)) are compared to each other to derive descriptors for each pose of the ligand, resulting in a multimode structure-activity relationship (SAR) table. Iterative partial least-squares (PLS) is employed to build the QSAR models. When we applied this method to analyze PDE-4 inhibitors, predictive models have been developed, obtaining models with excellent training correlation (r(2) = 0.65-0.66), as well as test correlation (R(2) = 0.64-0.65). A comparative analysis with 4 other QSAR techniques demonstrates that this new method affords better models, in terms of the prediction power for the test set.
Du, Qi-Shi; Huang, Ri-Bo; Wei, Yu-Tuo; Pang, Zong-Wen; Du, Li-Qin; Chou, Kuo-Chen
2009-01-30
In cooperation with the fragment-based design a new drug design method, the so-called "fragment-based quantitative structure-activity relationship" (FB-QSAR) is proposed. The essence of the new method is that the molecular framework in a family of drug candidates are divided into several fragments according to their substitutes being investigated. The bioactivities of molecules are correlated with the physicochemical properties of the molecular fragments through two sets of coefficients in the linear free energy equations. One coefficient set is for the physicochemical properties and the other for the weight factors of the molecular fragments. Meanwhile, an iterative double least square (IDLS) technique is developed to solve the two sets of coefficients in a training data set alternately and iteratively. The IDLS technique is a feedback procedure with machine learning ability. The standard Two-dimensional quantitative structure-activity relationship (2D-QSAR) is a special case, in the FB-QSAR, when the whole molecule is treated as one entity. The FB-QSAR approach can remarkably enhance the predictive power and provide more structural insights into rational drug design. As an example, the FB-QSAR is applied to build a predictive model of neuraminidase inhibitors for drug development against H5N1 influenza virus. (c) 2008 Wiley Periodicals, Inc.
Quantitative structure-activity relationship: promising advances in drug discovery platforms.
Wang, Tao; Wu, Mian-Bin; Lin, Jian-Ping; Yang, Li-Rong
2015-12-01
Quantitative structure-activity relationship (QSAR) modeling is one of the most popular computer-aided tools employed in medicinal chemistry for drug discovery and lead optimization. It is especially powerful in the absence of 3D structures of specific drug targets. QSAR methods have been shown to draw public attention since they were first introduced. In this review, the authors provide a brief discussion of the basic principles of QSAR, model development and model validation. They also highlight the current applications of QSAR in different fields, particularly in virtual screening, rational drug design and multi-target QSAR. Finally, in view of recent controversies, the authors detail the challenges faced by QSAR modeling and the relevant solutions. The aim of this review is to show how QSAR modeling can be applied in novel drug discovery, design and lead optimization. QSAR should intentionally be used as a powerful tool for fragment-based drug design platforms in the field of drug discovery and design. Although there have been an increasing number of experimentally determined protein structures in recent years, a great number of protein structures cannot be easily obtained (i.e., membrane transport proteins and G-protein coupled receptors). Fragment-based drug discovery, such as QSAR, could be applied further and have a significant role in dealing with these problems. Moreover, along with the development of computer software and hardware, it is believed that QSAR will be increasingly important.
Myint, Kyaw Z.; Xie, Xiang-Qun
2015-01-01
This chapter focuses on the fingerprint-based artificial neural networks QSAR (FANN-QSAR) approach to predict biological activities of structurally diverse compounds. Three types of fingerprints, namely ECFP6, FP2, and MACCS, were used as inputs to train the FANN-QSAR models. The results were benchmarked against known 2D and 3D QSAR methods, and the derived models were used to predict cannabinoid (CB) ligand binding activities as a case study. In addition, the FANN-QSAR model was used as a virtual screening tool to search a large NCI compound database for lead cannabinoid compounds. We discovered several compounds with good CB2 binding affinities ranging from 6.70 nM to 3.75 μM. The studies proved that the FANN-QSAR method is a useful approach to predict bioactivities or properties of ligands and to find novel lead compounds for drug discovery research. PMID:25502380
Valizade Hasanloei, Mohammad Amin; Sheikhpour, Razieh; Sarram, Mehdi Agha; Sheikhpour, Elnaz; Sharifi, Hamdollah
2018-02-01
Quantitative structure-activity relationship (QSAR) is an effective computational technique for drug design that relates the chemical structures of compounds to their biological activities. Feature selection is an important step in QSAR based drug design to select the most relevant descriptors. One of the most popular feature selection methods for classification problems is Fisher score which aim is to minimize the within-class distance and maximize the between-class distance. In this study, the properties of Fisher criterion were extended for QSAR models to define the new distance metrics based on the continuous activity values of compounds with known activities. Then, a semi-supervised feature selection method was proposed based on the combination of Fisher and Laplacian criteria which exploits both compounds with known and unknown activities to select the relevant descriptors. To demonstrate the efficiency of the proposed semi-supervised feature selection method in selecting the relevant descriptors, we applied the method and other feature selection methods on three QSAR data sets such as serine/threonine-protein kinase PLK3 inhibitors, ROCK inhibitors and phenol compounds. The results demonstrated that the QSAR models built on the selected descriptors by the proposed semi-supervised method have better performance than other models. This indicates the efficiency of the proposed method in selecting the relevant descriptors using the compounds with known and unknown activities. The results of this study showed that the compounds with known and unknown activities can be helpful to improve the performance of the combined Fisher and Laplacian based feature selection methods.
NASA Astrophysics Data System (ADS)
Valizade Hasanloei, Mohammad Amin; Sheikhpour, Razieh; Sarram, Mehdi Agha; Sheikhpour, Elnaz; Sharifi, Hamdollah
2018-02-01
Quantitative structure-activity relationship (QSAR) is an effective computational technique for drug design that relates the chemical structures of compounds to their biological activities. Feature selection is an important step in QSAR based drug design to select the most relevant descriptors. One of the most popular feature selection methods for classification problems is Fisher score which aim is to minimize the within-class distance and maximize the between-class distance. In this study, the properties of Fisher criterion were extended for QSAR models to define the new distance metrics based on the continuous activity values of compounds with known activities. Then, a semi-supervised feature selection method was proposed based on the combination of Fisher and Laplacian criteria which exploits both compounds with known and unknown activities to select the relevant descriptors. To demonstrate the efficiency of the proposed semi-supervised feature selection method in selecting the relevant descriptors, we applied the method and other feature selection methods on three QSAR data sets such as serine/threonine-protein kinase PLK3 inhibitors, ROCK inhibitors and phenol compounds. The results demonstrated that the QSAR models built on the selected descriptors by the proposed semi-supervised method have better performance than other models. This indicates the efficiency of the proposed method in selecting the relevant descriptors using the compounds with known and unknown activities. The results of this study showed that the compounds with known and unknown activities can be helpful to improve the performance of the combined Fisher and Laplacian based feature selection methods.
Zhou, Peng; Wang, Congcong; Tian, Feifei; Ren, Yanrong; Yang, Chao; Huang, Jian
2013-01-01
Quantitative structure-activity relationship (QSAR), a regression modeling methodology that establishes statistical correlation between structure feature and apparent behavior for a series of congeneric molecules quantitatively, has been widely used to evaluate the activity, toxicity and property of various small-molecule compounds such as drugs, toxicants and surfactants. However, it is surprising to see that such useful technique has only very limited applications to biomacromolecules, albeit the solved 3D atom-resolution structures of proteins, nucleic acids and their complexes have accumulated rapidly in past decades. Here, we present a proof-of-concept paradigm for the modeling, prediction and interpretation of the binding affinity of 144 sequence-nonredundant, structure-available and affinity-known protein complexes (Kastritis et al. Protein Sci 20:482-491, 2011) using a biomacromolecular QSAR (BioQSAR) scheme. We demonstrate that the modeling performance and predictive power of BioQSAR are comparable to or even better than that of traditional knowledge-based strategies, mechanism-type methods and empirical scoring algorithms, while BioQSAR possesses certain additional features compared to the traditional methods, such as adaptability, interpretability, deep-validation and high-efficiency. The BioQSAR scheme could be readily modified to infer the biological behavior and functions of other biomacromolecules, if their X-ray crystal structures, NMR conformation assemblies or computationally modeled structures are available.
Cheminformatics-aided pharmacovigilance: application to Stevens-Johnson Syndrome
Low, Yen S; Caster, Ola; Bergvall, Tomas; Fourches, Denis; Zang, Xiaoling; Norén, G Niklas; Rusyn, Ivan; Edwards, Ralph
2016-01-01
Objective Quantitative Structure-Activity Relationship (QSAR) models can predict adverse drug reactions (ADRs), and thus provide early warnings of potential hazards. Timely identification of potential safety concerns could protect patients and aid early diagnosis of ADRs among the exposed. Our objective was to determine whether global spontaneous reporting patterns might allow chemical substructures associated with Stevens-Johnson Syndrome (SJS) to be identified and utilized for ADR prediction by QSAR models. Materials and Methods Using a reference set of 364 drugs having positive or negative reporting correlations with SJS in the VigiBase global repository of individual case safety reports (Uppsala Monitoring Center, Uppsala, Sweden), chemical descriptors were computed from drug molecular structures. Random Forest and Support Vector Machines methods were used to develop QSAR models, which were validated by external 5-fold cross validation. Models were employed for virtual screening of DrugBank to predict SJS actives and inactives, which were corroborated using knowledge bases like VigiBase, ChemoText, and MicroMedex (Truven Health Analytics Inc, Ann Arbor, Michigan). Results We developed QSAR models that could accurately predict if drugs were associated with SJS (area under the curve of 75%–81%). Our 10 most active and inactive predictions were substantiated by SJS reports (or lack thereof) in the literature. Discussion Interpretation of QSAR models in terms of significant chemical descriptors suggested novel SJS structural alerts. Conclusions We have demonstrated that QSAR models can accurately identify SJS active and inactive drugs. Requiring chemical structures only, QSAR models provide effective computational means to flag potentially harmful drugs for subsequent targeted surveillance and pharmacoepidemiologic investigations. PMID:26499102
Alarms about structural alerts.
Alves, Vinicius; Muratov, Eugene; Capuzzi, Stephen; Politi, Regina; Low, Yen; Braga, Rodolpho; Zakharov, Alexey V; Sedykh, Alexander; Mokshyna, Elena; Farag, Sherif; Andrade, Carolina; Kuz'min, Victor; Fourches, Denis; Tropsha, Alexander
2016-08-21
Structural alerts are widely accepted in chemical toxicology and regulatory decision support as a simple and transparent means to flag potential chemical hazards or group compounds into categories for read-across. However, there has been a growing concern that alerts disproportionally flag too many chemicals as toxic, which questions their reliability as toxicity markers. Conversely, the rigorously developed and properly validated statistical QSAR models can accurately and reliably predict the toxicity of a chemical; however, their use in regulatory toxicology has been hampered by the lack of transparency and interpretability. We demonstrate that contrary to the common perception of QSAR models as "black boxes" they can be used to identify statistically significant chemical substructures (QSAR-based alerts) that influence toxicity. We show through several case studies, however, that the mere presence of structural alerts in a chemical, irrespective of the derivation method (expert-based or QSAR-based), should be perceived only as hypotheses of possible toxicological effect. We propose a new approach that synergistically integrates structural alerts and rigorously validated QSAR models for a more transparent and accurate safety assessment of new chemicals.
Integration of QSAR and in vitro toxicology.
Barratt, M D
1998-01-01
The principles of quantitative structure-activity relationships (QSAR) are based on the premise that the properties of a chemical are implicit in its molecular structure. Therefore, if a mechanistic hypothesis can be proposed linking a group of related chemicals with a particular toxic end point, the hypothesis can be used to define relevant parameters to establish a QSAR. Ways in which QSAR and in vitro toxicology can complement each other in development of alternatives to live animal experiments are described and illustrated by examples from acute toxicological end points. Integration of QSAR and in vitro methods is examined in the context of assessing mechanistic competence and improving the design of in vitro assays and the development of prediction models. The nature of biological variability is explored together with its implications for the selection of sets of chemicals for test development, optimization, and validation. Methods are described to support the use of data from in vivo tests that do not meet today's stringent requirements of acceptability. Integration of QSAR and in vitro methods into strategic approaches for the replacement, reduction, and refinement of the use of animals is described with examples. PMID:9599692
Ghafouri, Hamidreza; Ranjbar, Mohsen; Sakhteman, Amirhossein
2017-08-01
A great challenge in medicinal chemistry is to develop different methods for structural design based on the pattern of the previously synthesized compounds. In this study two different QSAR methods were established and compared for a series of piperidine acetylcholinesterase inhibitors. In one novel approach, PC-LS-SVM and PLS-LS-SVM was used for modeling 3D interaction descriptors, and in the other method the same nonlinear techniques were used to build QSAR equations based on field descriptors. Different validation methods were used to evaluate the models and the results revealed the more applicability and predictive ability of the model generated by field descriptors (Q 2 LOO-CV =1, R 2 ext =0.97). External validation criteria revealed that both methods can be used in generating reasonable QSAR models. It was concluded that due to ability of interaction descriptors in prediction of binding mode, using this approach can be implemented in future 3D-QSAR softwares. Copyright © 2017 Elsevier Ltd. All rights reserved.
Lakhlili, Wiame; Yasri, Abdelaziz; Ibrahimi, Azeddine
2016-01-01
The discovery of clinically relevant inhibitors of mammalian target of rapamycin (mTOR) for anticancer therapy has proved to be a challenging task. The quantitative structure–activity relationship (QSAR) approach is a very useful and widespread technique for ligand-based drug design, which can be used to identify novel and potent mTOR inhibitors. In this study, we performed two-dimensional QSAR tests, and molecular docking validation tests of a series of mTOR ATP-competitive inhibitors to elucidate their structural properties associated with their activity. The QSAR tests were performed using partial least square method with a correlation coefficient of r2=0.799 and a cross-validation of q2=0.714. The chemical library screening was done by associating ligand-based to structure-based approach using the three-dimensional structure of mTOR developed by homology modeling. We were able to select 22 compounds from two databases as inhibitors of the mTOR kinase active site. We believe that the method and applications highlighted in this study will help future efforts toward the design of selective ATP-competitive inhibitors. PMID:27980424
Papamokos, George; Silins, Ilona
2016-01-01
There is an increasing need for new reliable non-animal based methods to predict and test toxicity of chemicals. Quantitative structure-activity relationship (QSAR), a computer-based method linking chemical structures with biological activities, is used in predictive toxicology. In this study, we tested the approach to combine QSAR data with literature profiles of carcinogenic modes of action automatically generated by a text-mining tool. The aim was to generate data patterns to identify associations between chemical structures and biological mechanisms related to carcinogenesis. Using these two methods, individually and combined, we evaluated 96 rat carcinogens of the hematopoietic system, liver, lung, and skin. We found that skin and lung rat carcinogens were mainly mutagenic, while the group of carcinogens affecting the hematopoietic system and the liver also included a large proportion of non-mutagens. The automatic literature analysis showed that mutagenicity was a frequently reported endpoint in the literature of these carcinogens, however, less common endpoints such as immunosuppression and hormonal receptor-mediated effects were also found in connection with some of the carcinogens, results of potential importance for certain target organs. The combined approach, using QSAR and text-mining techniques, could be useful for identifying more detailed information on biological mechanisms and the relation with chemical structures. The method can be particularly useful in increasing the understanding of structure and activity relationships for non-mutagens.
Papamokos, George; Silins, Ilona
2016-01-01
There is an increasing need for new reliable non-animal based methods to predict and test toxicity of chemicals. Quantitative structure-activity relationship (QSAR), a computer-based method linking chemical structures with biological activities, is used in predictive toxicology. In this study, we tested the approach to combine QSAR data with literature profiles of carcinogenic modes of action automatically generated by a text-mining tool. The aim was to generate data patterns to identify associations between chemical structures and biological mechanisms related to carcinogenesis. Using these two methods, individually and combined, we evaluated 96 rat carcinogens of the hematopoietic system, liver, lung, and skin. We found that skin and lung rat carcinogens were mainly mutagenic, while the group of carcinogens affecting the hematopoietic system and the liver also included a large proportion of non-mutagens. The automatic literature analysis showed that mutagenicity was a frequently reported endpoint in the literature of these carcinogens, however, less common endpoints such as immunosuppression and hormonal receptor-mediated effects were also found in connection with some of the carcinogens, results of potential importance for certain target organs. The combined approach, using QSAR and text-mining techniques, could be useful for identifying more detailed information on biological mechanisms and the relation with chemical structures. The method can be particularly useful in increasing the understanding of structure and activity relationships for non-mutagens. PMID:27625608
Modeling Liver-Related Adverse Effects of Drugs Using kNN QSAR Method
Rodgers, Amie D.; Zhu, Hao; Fourches, Dennis; Rusyn, Ivan; Tropsha, Alexander
2010-01-01
Adverse effects of drugs (AEDs) continue to be a major cause of drug withdrawals both in development and post-marketing. While liver-related AEDs are a major concern for drug safety, there are few in silico models for predicting human liver toxicity for drug candidates. We have applied the Quantitative Structure Activity Relationship (QSAR) approach to model liver AEDs. In this study, we aimed to construct a QSAR model capable of binary classification (active vs. inactive) of drugs for liver AEDs based on chemical structure. To build QSAR models, we have employed an FDA spontaneous reporting database of human liver AEDs (elevations in activity of serum liver enzymes), which contains data on approximately 500 approved drugs. Approximately 200 compounds with wide clinical data coverage, structural similarity and balanced (40/60) active/inactive ratio were selected for modeling and divided into multiple training/test and external validation sets. QSAR models were developed using the k nearest neighbor method and validated using external datasets. Models with high sensitivity (>73%) and specificity (>94%) for prediction of liver AEDs in external validation sets were developed. To test applicability of the models, three chemical databases (World Drug Index, Prestwick Chemical Library, and Biowisdom Liver Intelligence Module) were screened in silico and the validity of predictions was determined, where possible, by comparing model-based classification with assertions in publicly available literature. Validated QSAR models of liver AEDs based on the data from the FDA spontaneous reporting system can be employed as sensitive and specific predictors of AEDs in pre-clinical screening of drug candidates for potential hepatotoxicity in humans. PMID:20192250
Chen, Shangying; Zhang, Peng; Liu, Xin; Qin, Chu; Tao, Lin; Zhang, Cheng; Yang, Sheng Yong; Chen, Yu Zong; Chui, Wai Keung
2016-06-01
The overall efficacy and safety profile of a new drug is partially evaluated by the therapeutic index in clinical studies and by the protective index (PI) in preclinical studies. In-silico predictive methods may facilitate the assessment of these indicators. Although QSAR and QSTR models can be used for predicting PI, their predictive capability has not been evaluated. To test this capability, we developed QSAR and QSTR models for predicting the activity and toxicity of anticonvulsants at accuracy levels above the literature-reported threshold (LT) of good QSAR models as tested by both the internal 5-fold cross validation and external validation method. These models showed significantly compromised PI predictive capability due to the cumulative errors of the QSAR and QSTR models. Therefore, in this investigation a new quantitative structure-index relationship (QSIR) model was devised and it showed improved PI predictive capability that superseded the LT of good QSAR models. The QSAR, QSTR and QSIR models were developed using support vector regression (SVR) method with the parameters optimized by using the greedy search method. The molecular descriptors relevant to the prediction of anticonvulsant activities, toxicities and PIs were analyzed by a recursive feature elimination method. The selected molecular descriptors are primarily associated with the drug-like, pharmacological and toxicological features and those used in the published anticonvulsant QSAR and QSTR models. This study suggested that QSIR is useful for estimating the therapeutic index of drug candidates. Copyright © 2016. Published by Elsevier Inc.
QSAR modeling of GPCR ligands: methodologies and examples of applications.
Tropsha, A; Wang, S X
2006-01-01
GPCR ligands represent not only one of the major classes of current drugs but the major continuing source of novel potent pharmaceutical agents. Because 3D structures of GPCRs as determined by experimental techniques are still unavailable, ligand-based drug discovery methods remain the major computational molecular modeling approaches to the analysis of growing data sets of tested GPCR ligands. This paper presents an overview of modern Quantitative Structure Activity Relationship (QSAR) modeling. We discuss the critical issue of model validation and the strategy for applying the successfully validated QSAR models to virtual screening of available chemical databases. We present several examples of applications of validated QSAR modeling approaches to GPCR ligands. We conclude with the comments on exciting developments in the QSAR modeling of GPCR ligands that focus on the study of emerging data sets of compounds with dual or even multiple activities against two or more of GPCRs.
NASA Astrophysics Data System (ADS)
Li, Peizhen; Tian, Yueli; Zhai, Honglin; Deng, Fangfang; Xie, Meihong; Zhang, Xiaoyun
2013-11-01
Non-purine derivatives have been shown to be promising novel drug candidates as xanthine oxidase inhibitors. Based on three-dimensional quantitative structure-activity relationship (3D-QSAR) methods including comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA), two 3D-QSAR models for a series of non-purine xanthine oxidase (XO) inhibitors were established, and their reliability was supported by statistical parameters. Combined 3D-QSAR modeling and the results of molecular docking between non-purine xanthine oxidase inhibitors and XO, the main factors that influenced activity of inhibitors were investigated, and the obtained results could explain known experimental facts. Furthermore, several new potential inhibitors with higher activity predicted were designed, which based on our analyses, and were supported by the simulation of molecular docking. This study provided some useful information for the development of non-purine xanthine oxidase inhibitors with novel structures.
Dixon, Steven L; Duan, Jianxin; Smith, Ethan; Von Bargen, Christopher D; Sherman, Woody; Repasky, Matthew P
2016-10-01
We introduce AutoQSAR, an automated machine-learning application to build, validate and deploy quantitative structure-activity relationship (QSAR) models. The process of descriptor generation, feature selection and the creation of a large number of QSAR models has been automated into a single workflow within AutoQSAR. The models are built using a variety of machine-learning methods, and each model is scored using a novel approach. Effectiveness of the method is demonstrated through comparison with literature QSAR models using identical datasets for six end points: protein-ligand binding affinity, solubility, blood-brain barrier permeability, carcinogenicity, mutagenicity and bioaccumulation in fish. AutoQSAR demonstrates similar or better predictive performance as compared with published results for four of the six endpoints while requiring minimal human time and expertise.
Structure and ligand-based design of P-glycoprotein inhibitors: a historical perspective.
Palmeira, Andreia; Sousa, Emilia; Vasconcelos, M Helena; Pinto, Madalena; Fernandes, Miguel X
2012-01-01
Computer-assisted drug design (CADD) is a valuable approach for the discovery of new chemical entities in the field of cancer therapy. There is a pressing need to design and develop new, selective, and safe drugs for the treatment of multidrug resistance (MDR) cancer forms, specifically active against P-glycoprotein (P-gp). Recently, a crystallographic structure for mouse P-gp was obtained. However, for decades the design of new P-gp inhibitors employed mainly ligand-based approaches (SAR, QSAR, 3D-QSAR and pharmacophore studies), and structure-based studies used P-gp homology models. However, some of those results are still the pillars used as a starting point for the design of potential P-gp inhibitors. Here, pharmacophore mapping, (Q)SAR, 3D-QSAR and homology modeling, for the discovery of P-gp inhibitors are reviewed. The importance of these methods for understanding mechanisms of drug resistance at a molecular level, and design P-gp inhibitors drug candidates are discussed. The examples mentioned in the review could provide insights into the wide range of possibilities of using CADD methodologies for the discovery of efficient P-gp inhibitors.
Chen, H F; Dong, X C; Zen, B S; Gao, K; Yuan, S G; Panaye, A; Doucet, J P; Fan, B T
2003-08-01
An efficient virtual and rational drug design method is presented. It combines virtual bioactive compound generation with 3D-QSAR model and docking. Using this method, it is possible to generate a lot of highly diverse molecules and find virtual active lead compounds. The method was validated by the study of a set of anti-tumor drugs. With the constraints of pharmacophore obtained by DISCO implemented in SYBYL 6.8, 97 virtual bioactive compounds were generated, and their anti-tumor activities were predicted by CoMFA. Eight structures with high activity were selected and screened by the 3D-QSAR model. The most active generated structure was further investigated by modifying its structure in order to increase the activity. A comparative docking study with telomeric receptor was carried out, and the results showed that the generated structures could form more stable complexes with receptor than the reference compound selected from experimental data. This investigation showed that the proposed method was a feasible way for rational drug design with high screening efficiency.
In silico study of in vitro GPCR assays by QSAR modeling ...
The U.S. EPA is screening thousands of chemicals of environmental interest in hundreds of in vitro high-throughput screening (HTS) assays (the ToxCast program). One goal is to prioritize chemicals for more detailed analyses based on activity in molecular initiating events (MIE) of adverse outcome pathways (AOPs). However, the chemical space of interest for environmental exposure is much wider than this set of chemicals. Thus, there is a need to fill data gaps with in silico methods, and quantitative structure-activity relationships (QSARs) are a proven and cost effective approach to predict biological activity. ToxCast in turn provides relatively large datasets that are ideal for training and testing QSAR models. The overall goal of the study described here was to develop QSAR models to fill the data gaps in a larger environmental database of ~32k structures. The specific aim of the current work was to build QSAR models for 18 G-Protein Coupled Receptor (GPCR) assays, part of the aminergic category. Two QSAR modeling strategies were adopted: classification models were developed to separate chemicals into active/non-active classes, and then regression models were built to predict the potency values of the bioassays for the active chemicals. Multiple software programs were used to calculate constitutional, topological and substructural molecular descriptors from two-dimensional (2D) chemical structures. Model-fitting methods included PLSDA (partial least squares d
Tuppurainen, Kari; Viisas, Marja; Laatikainen, Reino; Peräkylä, Mikael
2002-01-01
A novel electronic eigenvalue (EEVA) descriptor of molecular structure for use in the derivation of predictive QSAR/QSPR models is described. Like other spectroscopic QSAR/QSPR descriptors, EEVA is also invariant as to the alignment of the structures concerned. Its performance was tested with respect to the CBG (corticosteroid binding globulin) affinity of 31 benchmark steroids. It appeared that the electronic structure of the steroids, i.e., the "spectra" derived from molecular orbital energies, is directly related to the CBG binding affinities. The predictive ability of EEVA is compared to other QSAR approaches, and its performance is discussed in the context of the Hammett equation. The good performance of EEVA is an indication of the essential quantum mechanical nature of QSAR. The EEVA method is a supplement to conventional 3D QSAR methods, which employ fields or surface properties derived from Coulombic and van der Waals interactions.
Prediction of Environmental Impact of High-Energy Materials with Atomistic Computer Simulations
2010-11-01
from a training set of compounds. Other methods include Quantitative Struc- ture-Activity Relationship ( QSAR ) and Quantitative Structure-Property...26 28 the development of QSPR/ QSAR models, in contrast to boiling points and critical parameters derived from empirical correlations, to improve...Quadratic Configuration Interaction Singles Doubles QSAR Quantitative Structure-Activity Relationship QSPR Quantitative Structure-Property
Khanfar, Mohammad A; Banat, Fahmy; Alabed, Shada; Alqtaishat, Saja
2017-02-01
High expression of Nek2 has been detected in several types of cancer and it represents a novel target for human cancer. In the current study, structure-based pharmacophore modeling combined with multiple linear regression (MLR)-based QSAR analyses was applied to disclose the structural requirements for NEK2 inhibition. Generated pharmacophoric models were initially validated with receiver operating characteristic (ROC) curve, and optimum models were subsequently implemented in QSAR modeling with other physiochemical descriptors. QSAR-selected models were implied as 3D search filters to mine the National Cancer Institute (NCI) database for novel NEK2 inhibitors, whereas the associated QSAR model prioritized the bioactivities of captured hits for in vitro evaluation. Experimental validation identified several potent NEK2 inhibitors of novel structural scaffolds. The most potent captured hit exhibited an [Formula: see text] value of 237 nM.
Barigye, Stephen J; Freitas, Matheus P; Ausina, Priscila; Zancan, Patricia; Sola-Penna, Mauro; Castillo-Garit, Juan A
2018-02-12
We recently generalized the formerly alignment-dependent multivariate image analysis applied to quantitative structure-activity relationships (MIA-QSAR) method through the application of the discrete Fourier transform (DFT), allowing for its application to noncongruent and structurally diverse chemical compound data sets. Here we report the first practical application of this method in the screening of molecular entities of therapeutic interest, with human aromatase inhibitory activity as the case study. We developed an ensemble classification model based on the two-dimensional (2D) DFT MIA-QSAR descriptors, with which we screened the NCI Diversity Set V (1593 compounds) and obtained 34 chemical compounds with possible aromatase inhibitory activity. These compounds were docked into the aromatase active site, and the 10 most promising compounds were selected for in vitro experimental validation. Of these compounds, 7419 (nonsteroidal) and 89 201 (steroidal) demonstrated satisfactory antiproliferative and aromatase inhibitory activities. The obtained results suggest that the 2D-DFT MIA-QSAR method may be useful in ligand-based virtual screening of new molecular entities of therapeutic utility.
Comparison of in silico models for prediction of mutagenicity.
Bakhtyari, Nazanin G; Raitano, Giuseppa; Benfenati, Emilio; Martin, Todd; Young, Douglas
2013-01-01
Using a dataset with more than 6000 compounds, the performance of eight quantitative structure activity relationships (QSAR) models was evaluated: ACD/Tox Suite, Absorption, Distribution, Metabolism, Elimination, and Toxicity of chemical substances (ADMET) predictor, Derek, Toxicity Estimation Software Tool (T.E.S.T.), TOxicity Prediction by Komputer Assisted Technology (TOPKAT), Toxtree, CEASAR, and SARpy (SAR in python). In general, the results showed a high level of performance. To have a realistic estimate of the predictive ability, the results for chemicals inside and outside the training set for each model were considered. The effect of applicability domain tools (when available) on the prediction accuracy was also evaluated. The predictive tools included QSAR models, knowledge-based systems, and a combination of both methods. Models based on statistical QSAR methods gave better results.
Modelling the effect of structural QSAR parameters on skin penetration using genetic programming
NASA Astrophysics Data System (ADS)
Chung, K. K.; Do, D. Q.
2010-09-01
In order to model relationships between chemical structures and biological effects in quantitative structure-activity relationship (QSAR) data, an alternative technique of artificial intelligence computing—genetic programming (GP)—was investigated and compared to the traditional method—statistical. GP, with the primary advantage of generating mathematical equations, was employed to model QSAR data and to define the most important molecular descriptions in QSAR data. The models predicted by GP agreed with the statistical results, and the most predictive models of GP were significantly improved when compared to the statistical models using ANOVA. Recently, artificial intelligence techniques have been applied widely to analyse QSAR data. With the capability of generating mathematical equations, GP can be considered as an effective and efficient method for modelling QSAR data.
Residual-QSAR. Implications for genotoxic carcinogenesis
2011-01-01
Introduction Both main types of carcinogenesis, genotoxic and epigenetic, were examined in the context of non-congenericity and similarity, respectively, for the structure of ligand molecules, emphasizing the role of quantitative structure-activity relationship ((Q)SAR) studies in accordance with OECD (Organization for Economic and Cooperation Development) regulations. The main purpose of this report involves electrophilic theory and the need for meaningful physicochemical parameters to describe genotoxicity by a general mechanism. Residual-QSAR Method The double or looping multiple linear correlation was examined by comparing the direct and residual structural information against the observed activity. A self-consistent equation of observed-computed activity was assumed to give maximum correlation efficiency for those situations in which the direct correlations gave non-significant statistical information. Alternatively, it was also suited to describe slow and apparently non-noticeable cancer phenomenology, with special application to non-congeneric molecules involved in genotoxic carcinogenesis. Application and Discussions The QSAR principles were systematically applied to a given pool of molecules with genotoxic activity in rats to elucidate their carcinogenic mechanisms. Once defined, the endpoint associated with ligand-DNA interaction was used to select variables that retained the main Hansch physicochemical parameters of hydrophobicity, polarizability and stericity, computed by the custom PM3 semiempirical quantum method. The trial and test sets of working molecules were established by implementing the normal Gaussian principle of activities that applies when the applicability domain is not restrained to the congeneric compounds, as in the present study. The application of the residual, self-consistent QSAR method and the factor (or average) method yielded results characterized by extremely high and low correlations, respectively, with the latter resembling the direct activity to parameter QSARs. Nevertheless, such contrasted correlations were further incorporated into the advanced statistical minimum paths principle, which selects the minimum hierarchy from Euclidean distances between all considered QSAR models for all combinations and considered molecular sets (i.e., school and validation). This ultimately led to a mechanistic picture based on the identified alpha, beta and gamma paths connecting structural indicators (i.e., the causes) to the global endpoint, with all included causes. The molecular mechanism preserved the self-consistent feature of the residual QSAR, with each descriptor appearing twice in the course of one cycle of ligand-DNA interaction through inter-and intra-cellular stages. Conclusions Both basal features of the residual-QSAR principle of self-consistency and suitability for non-congeneric molecules make it appropriate for conceptually assessing the mechanistic description of genotoxic carcinogenesis. Additionally, it could be extended to enriched physicochemical structural indices by considering the molecular fragments or structural alerts (or other molecular residues), providing more detailed maps of chemical-biological interactions and pathways. PMID:21668999
Fu, Zhiqiang; Chen, Jingwen; Li, Xuehua; Wang, Ya'nan; Yu, Haiying
2016-04-01
The octanol-air partition coefficient (KOA) is needed for assessing multimedia transport and bioaccumulability of organic chemicals in the environment. As experimental determination of KOA for various chemicals is costly and laborious, development of KOA estimation methods is necessary. We investigated three methods for KOA prediction, conventional quantitative structure-activity relationship (QSAR) models based on molecular structural descriptors, group contribution models based on atom-centered fragments, and a novel model that predicts KOA via solvation free energy from air to octanol phase (ΔGO(0)), with a collection of 939 experimental KOA values for 379 compounds at different temperatures (263.15-323.15 K) as validation or training sets. The developed models were evaluated with the OECD guidelines on QSAR models validation and applicability domain (AD) description. Results showed that although the ΔGO(0) model is theoretically sound and has a broad AD, the prediction accuracy of the model is the poorest. The QSAR models perform better than the group contribution models, and have similar predictability and accuracy with the conventional method that estimates KOA from the octanol-water partition coefficient and Henry's law constant. One QSAR model, which can predict KOA at different temperatures, was recommended for application as to assess the long-range transport potential of chemicals. Copyright © 2016 Elsevier Ltd. All rights reserved.
Lee, Yunho; von Gunten, Urs
2012-12-01
Various oxidants such as chlorine, chlorine dioxide, ferrate(VI), ozone, and hydroxyl radicals can be applied for eliminating organic micropollutant by oxidative transformation during water treatment in systems such as drinking water, wastewater, and water reuse. Over the last decades, many second-order rate constants (k) have been determined for the reaction of these oxidants with model compounds and micropollutants. Good correlations (quantitative structure-activity relationships or QSARs) are often found between the k-values for an oxidation reaction of closely related compounds (i.e. having a common organic functional group) and substituent descriptor variables such as Hammett or Taft sigma constants. In this study, we developed QSARs for the oxidation of organic and some inorganic compounds and organic micropollutants transformation during oxidative water treatment. A number of 18 QSARs were developed based on overall 412 k-values for the reaction of chlorine, chlorine dioxide, ferrate, and ozone with organic compounds containing electron-rich moieties such as phenols, anilines, olefins, and amines. On average, 303 out of 412 (74%) k-values were predicted by these QSARs within a factor of 1/3-3 compared to the measured values. For HO(·) reactions, some principles and estimation methods of k-values (e.g. the Group Contribution Method) are discussed. The developed QSARs and the Group Contribution Method could be used to predict the k-values for various emerging organic micropollutants. As a demonstration, 39 out of 45 (87%) predicted k-values were found within a factor 1/3-3 compared to the measured values for the selected emerging micropollutants. Finally, it is discussed how the uncertainty in the predicted k-values using the QSARs affects the accuracy of prediction for micropollutant elimination during oxidative water treatment. Copyright © 2012 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Shevade, Abhijit V.; Ryan, Margaret A.; Homer, Margie L.; Zhou, Hanying; Manfreda, Allison M.; Lara, Liana M.; Yen, Shiao-Pin S.; Jewell, April D.; Manatt, Kenneth S.; Kisor, Adam K.
We have developed a Quantitative Structure-Activity Relationships (QSAR) based approach to correlate the response of chemical sensors in an array with molecular descriptors. A novel molecular descriptor set has been developed; this set combines descriptors of sensing film-analyte interactions, representing sensor response, with a basic analyte descriptor set commonly used in QSAR studies. The descriptors are obtained using a combination of molecular modeling tools and empirical and semi-empirical Quantitative Structure-Property Relationships (QSPR) methods. The sensors under investigation are polymer-carbon sensing films which have been exposed to analyte vapors at parts-per-million (ppm) concentrations; response is measured as change in film resistance. Statistically validated QSAR models have been developed using Genetic Function Approximations (GFA) for a sensor array for a given training data set. The applicability of the sensor response models has been tested by using it to predict the sensor activities for test analytes not considered in the training set for the model development. The validated QSAR sensor response models show good predictive ability. The QSAR approach is a promising computational tool for sensing materials evaluation and selection. It can also be used to predict response of an existing sensing film to new target analytes.
Begum, S; Achary, P Ganga Raju
2015-01-01
Quantitative structure-activity relationship (QSAR) models were built for the prediction of inhibition (pIC50, i.e. negative logarithm of the 50% effective concentration) of MAP kinase-interacting protein kinase (MNK1) by 43 potent inhibitors. The pIC50 values were modelled with five random splits, with the representations of the molecular structures by simplified molecular input line entry system (SMILES). QSAR model building was performed by the Monte Carlo optimisation using three methods: classic scheme; balance of correlations; and balance correlation with ideal slopes. The robustness of these models were checked by parameters as rm(2), r(*)m(2), [Formula: see text] and randomisation technique. The best QSAR model based on single optimal descriptors was applied to study in vitro structure-activity relationships of 6-(4-(2-(piperidin-1-yl) ethoxy) phenyl)-3-(pyridin-4-yl) pyrazolo [1,5-a] pyrimidine derivatives as a screening tool for the development of novel potent MNK1 inhibitors. The effects of alkyl group, -OH, -NO2, F, Cl, Br, I, etc. on the IC50 values towards the inhibition of MNK1 were also reported.
Towards interoperable and reproducible QSAR analyses: Exchange of datasets.
Spjuth, Ola; Willighagen, Egon L; Guha, Rajarshi; Eklund, Martin; Wikberg, Jarl Es
2010-06-30
QSAR is a widely used method to relate chemical structures to responses or properties based on experimental observations. Much effort has been made to evaluate and validate the statistical modeling in QSAR, but these analyses treat the dataset as fixed. An overlooked but highly important issue is the validation of the setup of the dataset, which comprises addition of chemical structures as well as selection of descriptors and software implementations prior to calculations. This process is hampered by the lack of standards and exchange formats in the field, making it virtually impossible to reproduce and validate analyses and drastically constrain collaborations and re-use of data. We present a step towards standardizing QSAR analyses by defining interoperable and reproducible QSAR datasets, consisting of an open XML format (QSAR-ML) which builds on an open and extensible descriptor ontology. The ontology provides an extensible way of uniquely defining descriptors for use in QSAR experiments, and the exchange format supports multiple versioned implementations of these descriptors. Hence, a dataset described by QSAR-ML makes its setup completely reproducible. We also provide a reference implementation as a set of plugins for Bioclipse which simplifies setup of QSAR datasets, and allows for exporting in QSAR-ML as well as old-fashioned CSV formats. The implementation facilitates addition of new descriptor implementations from locally installed software and remote Web services; the latter is demonstrated with REST and XMPP Web services. Standardized QSAR datasets open up new ways to store, query, and exchange data for subsequent analyses. QSAR-ML supports completely reproducible creation of datasets, solving the problems of defining which software components were used and their versions, and the descriptor ontology eliminates confusions regarding descriptors by defining them crisply. This makes is easy to join, extend, combine datasets and hence work collectively, but also allows for analyzing the effect descriptors have on the statistical model's performance. The presented Bioclipse plugins equip scientists with graphical tools that make QSAR-ML easily accessible for the community.
Towards interoperable and reproducible QSAR analyses: Exchange of datasets
2010-01-01
Background QSAR is a widely used method to relate chemical structures to responses or properties based on experimental observations. Much effort has been made to evaluate and validate the statistical modeling in QSAR, but these analyses treat the dataset as fixed. An overlooked but highly important issue is the validation of the setup of the dataset, which comprises addition of chemical structures as well as selection of descriptors and software implementations prior to calculations. This process is hampered by the lack of standards and exchange formats in the field, making it virtually impossible to reproduce and validate analyses and drastically constrain collaborations and re-use of data. Results We present a step towards standardizing QSAR analyses by defining interoperable and reproducible QSAR datasets, consisting of an open XML format (QSAR-ML) which builds on an open and extensible descriptor ontology. The ontology provides an extensible way of uniquely defining descriptors for use in QSAR experiments, and the exchange format supports multiple versioned implementations of these descriptors. Hence, a dataset described by QSAR-ML makes its setup completely reproducible. We also provide a reference implementation as a set of plugins for Bioclipse which simplifies setup of QSAR datasets, and allows for exporting in QSAR-ML as well as old-fashioned CSV formats. The implementation facilitates addition of new descriptor implementations from locally installed software and remote Web services; the latter is demonstrated with REST and XMPP Web services. Conclusions Standardized QSAR datasets open up new ways to store, query, and exchange data for subsequent analyses. QSAR-ML supports completely reproducible creation of datasets, solving the problems of defining which software components were used and their versions, and the descriptor ontology eliminates confusions regarding descriptors by defining them crisply. This makes is easy to join, extend, combine datasets and hence work collectively, but also allows for analyzing the effect descriptors have on the statistical model's performance. The presented Bioclipse plugins equip scientists with graphical tools that make QSAR-ML easily accessible for the community. PMID:20591161
Algamal, Z Y; Lee, M H
2017-01-01
A high-dimensional quantitative structure-activity relationship (QSAR) classification model typically contains a large number of irrelevant and redundant descriptors. In this paper, a new design of descriptor selection for the QSAR classification model estimation method is proposed by adding a new weight inside L1-norm. The experimental results of classifying the anti-hepatitis C virus activity of thiourea derivatives demonstrate that the proposed descriptor selection method in the QSAR classification model performs effectively and competitively compared with other existing penalized methods in terms of classification performance on both the training and the testing datasets. Moreover, it is noteworthy that the results obtained in terms of stability test and applicability domain provide a robust QSAR classification model. It is evident from the results that the developed QSAR classification model could conceivably be employed for further high-dimensional QSAR classification studies.
Does rational selection of training and test sets improve the outcome of QSAR modeling?
Martin, Todd M; Harten, Paul; Young, Douglas M; Muratov, Eugene N; Golbraikh, Alexander; Zhu, Hao; Tropsha, Alexander
2012-10-22
Prior to using a quantitative structure activity relationship (QSAR) model for external predictions, its predictive power should be established and validated. In the absence of a true external data set, the best way to validate the predictive ability of a model is to perform its statistical external validation. In statistical external validation, the overall data set is divided into training and test sets. Commonly, this splitting is performed using random division. Rational splitting methods can divide data sets into training and test sets in an intelligent fashion. The purpose of this study was to determine whether rational division methods lead to more predictive models compared to random division. A special data splitting procedure was used to facilitate the comparison between random and rational division methods. For each toxicity end point, the overall data set was divided into a modeling set (80% of the overall set) and an external evaluation set (20% of the overall set) using random division. The modeling set was then subdivided into a training set (80% of the modeling set) and a test set (20% of the modeling set) using rational division methods and by using random division. The Kennard-Stone, minimal test set dissimilarity, and sphere exclusion algorithms were used as the rational division methods. The hierarchical clustering, random forest, and k-nearest neighbor (kNN) methods were used to develop QSAR models based on the training sets. For kNN QSAR, multiple training and test sets were generated, and multiple QSAR models were built. The results of this study indicate that models based on rational division methods generate better statistical results for the test sets than models based on random division, but the predictive power of both types of models are comparable.
A Hierarchical Clustering Methodology for the Estimation of Toxicity
A Quantitative Structure Activity Relationship (QSAR) methodology based on hierarchical clustering was developed to predict toxicological endpoints. This methodology utilizes Ward's method to divide a training set into a series of structurally similar clusters. The structural sim...
A hierarchical clustering methodology for the estimation of toxicity.
Martin, Todd M; Harten, Paul; Venkatapathy, Raghuraman; Das, Shashikala; Young, Douglas M
2008-01-01
ABSTRACT A quantitative structure-activity relationship (QSAR) methodology based on hierarchical clustering was developed to predict toxicological endpoints. This methodology utilizes Ward's method to divide a training set into a series of structurally similar clusters. The structural similarity is defined in terms of 2-D physicochemical descriptors (such as connectivity and E-state indices). A genetic algorithm-based technique is used to generate statistically valid QSAR models for each cluster (using the pool of descriptors described above). The toxicity for a given query compound is estimated using the weighted average of the predictions from the closest cluster from each step in the hierarchical clustering assuming that the compound is within the domain of applicability of the cluster. The hierarchical clustering methodology was tested using a Tetrahymena pyriformis acute toxicity data set containing 644 chemicals in the training set and with two prediction sets containing 339 and 110 chemicals. The results from the hierarchical clustering methodology were compared to the results from several different QSAR methodologies.
Martínez-Santiago, O; Marrero-Ponce, Y; Vivas-Reyes, R; Rivera-Borroto, O M; Hurtado, E; Treto-Suarez, M A; Ramos, Y; Vergara-Murillo, F; Orozco-Ugarriza, M E; Martínez-López, Y
2017-05-01
Graph derivative indices (GDIs) have recently been defined over N-atoms (N = 2, 3 and 4) simultaneously, which are based on the concept of derivatives in discrete mathematics (finite difference), metaphorical to the derivative concept in classical mathematical analysis. These molecular descriptors (MDs) codify topo-chemical and topo-structural information based on the concept of the derivative of a molecular graph with respect to a given event (S) over duplex, triplex and quadruplex relations of atoms (vertices). These GDIs have been successfully applied in the description of physicochemical properties like reactivity, solubility and chemical shift, among others, and in several comparative quantitative structure activity/property relationship (QSAR/QSPR) studies. Although satisfactory results have been obtained in previous modelling studies with the aforementioned indices, it is necessary to develop new, more rigorous analysis to assess the true predictive performance of the novel structure codification. So, in the present paper, an assessment and statistical validation of the performance of these novel approaches in QSAR studies are executed, as well as a comparison with those of other QSAR procedures reported in the literature. To achieve the main aim of this research, QSARs were developed on eight chemical datasets widely used as benchmarks in the evaluation/validation of several QSAR methods and/or many different MDs (fundamentally 3D MDs). Three to seven variable QSAR models were built for each chemical dataset, according to the original dissection into training/test sets. The models were developed by using multiple linear regression (MLR) coupled with a genetic algorithm as the feature wrapper selection technique in the MobyDigs software. Each family of GDIs (for duplex, triplex and quadruplex) behaves similarly in all modelling, although there were some exceptions. However, when all families were used in combination, the results achieved were quantitatively higher than those reported by other authors in similar experiments. Comparisons with respect to external correlation coefficients (q 2 ext ) revealed that the models based on GDIs possess superior predictive ability in seven of the eight datasets analysed, outperforming methodologies based on similar or more complex techniques and confirming the good predictive power of the obtained models. For the q 2 ext values, the non-parametric comparison revealed significantly different results to those reported so far, which demonstrated that the models based on DIVATI's indices presented the best global performance and yielded significantly better predictions than the 12 0-3D QSAR procedures used in the comparison. Therefore, GDIs are suitable for structure codification of the molecules and constitute a good alternative to build QSARs for the prediction of physicochemical, biological and environmental endpoints.
Classification of baseline toxicants for QSAR predictions to replace fish acute toxicity studies.
Nendza, Monika; Müller, Martin; Wenzel, Andrea
2017-03-22
Fish acute toxicity studies are required for environmental hazard and risk assessment of chemicals by national and international legislations such as REACH, the regulations of plant protection products and biocidal products, or the GHS (globally harmonised system) for classification and labelling of chemicals. Alternative methods like QSARs (quantitative structure-activity relationships) can replace many ecotoxicity tests. However, complete substitution of in vivo animal tests by in silico methods may not be realistic. For the so-called baseline toxicants, it is possible to predict the fish acute toxicity with sufficient accuracy from log K ow and, hence, valid QSARs can replace in vivo testing. In contrast, excess toxicants and chemicals not reliably classified as baseline toxicants require further in silico, in vitro or in vivo assessments. Thus, the critical task is to discriminate between baseline and excess toxicants. For fish acute toxicity, we derived a scheme based on structural alerts and physicochemical property thresholds to classify chemicals as either baseline toxicants (=predictable by QSARs) or as potential excess toxicants (=not predictable by baseline QSARs). The step-wise approach identifies baseline toxicants (true negatives) in a precautionary way to avoid false negative predictions. Therefore, a certain fraction of false positives can be tolerated, i.e. baseline toxicants without specific effects that may be tested instead of predicted. Application of the classification scheme to a new heterogeneous dataset for diverse fish species results in 40% baseline toxicants, 24% excess toxicants and 36% compounds not classified. Thus, we can conclude that replacing about half of the fish acute toxicity tests by QSAR predictions is realistic to be achieved in the short-term. The long-term goals are classification criteria also for further groups of toxicants and to replace as many in vivo fish acute toxicity tests as possible with valid QSAR predictions.
Winkler, David A; Le, Tu C
2017-01-01
Neural networks have generated valuable Quantitative Structure-Activity/Property Relationships (QSAR/QSPR) models for a wide variety of small molecules and materials properties. They have grown in sophistication and many of their initial problems have been overcome by modern mathematical techniques. QSAR studies have almost always used so-called "shallow" neural networks in which there is a single hidden layer between the input and output layers. Recently, a new and potentially paradigm-shifting type of neural network based on Deep Learning has appeared. Deep learning methods have generated impressive improvements in image and voice recognition, and are now being applied to QSAR and QSAR modelling. This paper describes the differences in approach between deep and shallow neural networks, compares their abilities to predict the properties of test sets for 15 large drug data sets (the kaggle set), discusses the results in terms of the Universal Approximation theorem for neural networks, and describes how DNN may ameliorate or remove troublesome "activity cliffs" in QSAR data sets. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
20180312 - Structure-based QSAR Models to Predict Systemic Toxicity Points of Departure (SOT)
Human health risk assessment associated with environmental chemical exposure is limited by the tens of thousands of chemicals with little or no experimental in vivo toxicity data. Data gap filling techniques, such as quantitative structure activity relationship (QSAR) models base...
QSAR and 3D-QSAR studies applied to compounds with anticonvulsant activity.
Garro Martinez, Juan C; Vega-Hissi, Esteban G; Andrada, Matías F; Estrada, Mario R
2015-01-01
Quantitative structure-activity relationships (QSAR and 3D-QSAR) have been applied in the last decade to obtain a reliable statistical model for the prediction of the anticonvulsant activities of new chemical entities. However, despite the large amount of information on QSAR, no recent review has published and discussed this data in detail. In this review, the authors provide a detailed discussion of QSAR studies that have been applied to compounds with anticonvulsant activity published between the years 2003 and 2013. They also evaluate the mathematical approaches and the main software used to develop the QSAR and 3D-QSAR model. QSAR methodologies continue to attract the attention of researchers and provide valuable information for the development of new potentially active compounds including those with anticonvulsant activity. This has been helped in part by improvements in the size and performance of computers; the development of specific software and the development of novel molecular descriptors, which have given rise to new and more predictive QSAR models. The extensive development of descriptors, and the way by which descriptor values are derived, have allowed the evolution of the QSAR methods. This evolution could strengthen the QSAR methods as an important tool in research and development of new and more potent anticonvulsant agents.
2010-11-01
estimate the pharmacokinetics of potential drugs (Horning and Klamt 2005). QSPR/ QSARs also have potential applications in the fuel science field...group contribution methods, and (2) quantitative structure-property/activity relationships (QSPR/ QSAR ). The group contribution methods are primarily...development of QSPR/ QSARs is the identification of the ap- propriate set of descriptors that allow the desired attribute of the compound to be adequately
COREPA-M: NEW MULTI-DIMENSIONAL FUNCTIONALITY OF THE COREPA METHOD
The COmmon REactivity PAttern (COREPA) method is a recently developed pattern recognition technique accounting for conformational flexibility of chemicals in 3-D quantitative structure-activity relationships (QSARs). The method is based on the assumption that non-congeneric chemi...
Quantitative structure-activity relationships (QSARs) are being developed to predict the toxicological endpoints for untested chemicals similar in structure to chemicals that have known experimental toxicological data. Based on a very large number of predetermined descriptors, a...
NASA Astrophysics Data System (ADS)
Masand, Vijay H.; El-Sayed, Nahed N. E.; Mahajan, Devidas T.; Mercader, Andrew G.; Alafeefy, Ahmed M.; Shibi, I. G.
2017-02-01
In the present work, sixty substituted 2-Phenylimidazopyridines previously reported with potent anti-human African trypanosomiasis (HAT) activity were selected to build genetic algorithm (GA) based QSAR models to determine the structural features that have significant correlation with the activity. Multiple QSAR models were built using easily interpretable descriptors that are directly associated with the presence or the absence of a structural scaffold, or a specific atom. All the QSAR models have been thoroughly validated according to the OECD principles. All the QSAR models are statistically very robust (R2 = 0.80-0.87) with high external predictive ability (CCCex = 0.81-0.92). The QSAR analysis reveals that the HAT activity has good correlation with the presence of five membered rings in the molecule.
Li, Yuqin; You, Guirong; Jia, Baoxiu; Si, Hongzong; Yao, Xiaojun
2014-01-01
Quantitative structure-activity relationships (QSAR) were developed to predict the inhibition ratio of pyrrolidine derivatives on matrix metalloproteinase via heuristic method (HM) and gene expression programming (GEP). The descriptors of 33 pyrrolidine derivatives were calculated by the software CODESSA, which can calculate quantum chemical, topological, geometrical, constitutional, and electrostatic descriptors. HM was also used for the preselection of 5 appropriate molecular descriptors. Linear and nonlinear QSAR models were developed based on the HM and GEP separately and two prediction models lead to a good correlation coefficient (R (2)) of 0.93 and 0.94. The two QSAR models are useful in predicting the inhibition ratio of pyrrolidine derivatives on matrix metalloproteinase during the discovery of new anticancer drugs and providing theory information for studying the new drugs.
Yang, Guang-Fu; Huang, Xiaoqin
2006-01-01
Over forty years have elapsed since Hansch and Fujita published their pioneering work of quantitative structure-activity relationships (QSAR). Following the introduction of Comparative Molecular Field Analysis (CoMFA) by Cramer in 1998, other three-dimensional QSAR methods have been developed. Currently, combination of classical QSAR and other computational techniques at three-dimensional level is of greatest interest and generally used in the process of modern drug discovery and design. During the last several decades, a number of different mythologies incorporating a range of molecular descriptors and different statistical regression ways have been proposed and successfully applied in developing of new drugs, thus QSAR method has been proven to be indispensable in not only the reliable prediction of specific properties of new compounds, but also the help to elucidate the possible molecular mechanism of the receptor-ligand interactions. Here, we review the recent developments in QSAR and their applications in rational drug design, focusing on the reasonable selection of novel molecular descriptors and the construction of predictive QSAR models by the help of advanced computational techniques.
Application of 3D-QSAR in the rational design of receptor ligands and enzyme inhibitors.
Mor, Marco; Rivara, Silvia; Lodola, Alessio; Lorenzi, Simone; Bordi, Fabrizio; Plazzi, Pier Vincenzo; Spadoni, Gilberto; Bedini, Annalida; Duranti, Andrea; Tontini, Andrea; Tarzia, Giorgio
2005-11-01
Quantitative structure-activity relationships (QSARs) are frequently employed in medicinal chemistry projects, both to rationalize structure-activity relationships (SAR) for known series of compounds and to help in the design of innovative structures endowed with desired pharmacological actions. As a difference from the so-called structure-based drug design tools, they do not require the knowledge of the biological target structure, but are based on the comparison of drug structural features, thus being defined ligand-based drug design tools. In the 3D-QSAR approach, structural descriptors are calculated from molecular models of the ligands, as interaction fields within a three-dimensional (3D) lattice of points surrounding the ligand structure. These descriptors are collected in a large X matrix, which is submitted to multivariate analysis to look for correlations with biological activity. Like for other QSARs, the reliability and usefulness of the correlation models depends on the validity of the assumptions and on the quality of the data. A careful selection of compounds and pharmacological data can improve the application of 3D-QSAR analysis in drug design. Some examples of the application of CoMFA and CoMSIA approaches to the SAR study and design of receptor or enzyme ligands is described, pointing the attention to the fields of melatonin receptor ligands and FAAH inhibitors.
3D-QSAR studies on 1,2,4-triazolyl 5-azaspiro [2.4]-heptanes as D3R antagonists
NASA Astrophysics Data System (ADS)
Zhang, Xin; Zhang, Hui
2018-07-01
Dopamine D3 receptor has become an attractive target in the treatment of abused drugs. 3D-QSAR studies were performed on a novel series of D3 receptor antagonists, 1,2,4-triazolyl 5-azaspiro [2.4]-heptanes, using CoMFA and CoMSIA methods. Two predictive 3D-QSAR models have been generated for the modified design of D3R antagonists. Based on the steric, electrostatic, hydrophobic and hydrogen-bond acceptor information of contour maps, key structural factors affecting the bioactivity were explored. This work gives helpful suggestions on the design of novel D3R antagonists with increased activities.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Xiaolin; Ye, Li; Wang, Xiaoxiang
2012-12-15
Several recent reports suggested that hydroxylated polybrominated diphenyl ethers (HO-PBDEs) may disturb thyroid hormone homeostasis. To illuminate the structural features for thyroid hormone activity of HO-PBDEs and the binding mode between HO-PBDEs and thyroid hormone receptor (TR), the hormone activity of a series of HO-PBDEs to thyroid receptors β was studied based on the combination of 3D-QSAR, molecular docking, and molecular dynamics (MD) methods. The ligand- and receptor-based 3D-QSAR models were obtained using Comparative Molecular Similarity Index Analysis (CoMSIA) method. The optimum CoMSIA model with region focusing yielded satisfactory statistical results: leave-one-out cross-validation correlation coefficient (q{sup 2}) was 0.571 andmore » non-cross-validation correlation coefficient (r{sup 2}) was 0.951. Furthermore, the results of internal validation such as bootstrapping, leave-many-out cross-validation, and progressive scrambling as well as external validation indicated the rationality and good predictive ability of the best model. In addition, molecular docking elucidated the conformations of compounds and key amino acid residues at the docking pocket, MD simulation further determined the binding process and validated the rationality of docking results. -- Highlights: ► The thyroid hormone activities of HO-PBDEs were studied by 3D-QSAR. ► The binding modes between HO-PBDEs and TRβ were explored. ► 3D-QSAR, molecular docking, and molecular dynamics (MD) methods were performed.« less
While relationships between chemical structure and observed properties or activities (QSAR - quantitative structure activity relationship) can be used to predict the behavior of unknown chemicals, this method is semiempirical in nature relying on high quality experimental data to...
Gholivand, Khodayar; Ebrahimi Valmoozi, Ali Asghar; Bonsaii, Mahyar
2014-06-01
Novel (thio)phosphoramidate derivatives based on piperidincarboxamide with the general formula of (NH2-C(O)-C5H9N)-P(X=O,S)R1R2 (1-5) and (NH2-C(O)-C5H9N)2-P(O)R (6-9) were synthesized and characterized by (31)P, (13)C, (1)H NMR, IR spectroscopy. Furthermore, the crystal structure of compound (NH2-C(O)-C5H9N)2-P(O)(OC6H5) (6) was investigated. The activities of derivatives on cholinesterases (ChE) were determined using a modified Ellman's method. Also the mixed-type mechanisms of these compounds were evaluated by Lineweaver-Burk plots. Molecular docking and quantitative structure-activity relationship (QSAR) were used to understand the relationship between molecular structural features and anti-ChE activity, and to predict the binding affinity of phosphoramido-piperidinecarboxamides (PAPCAs) to ChE receptors. From molecular docking analysis, noncovalent interactions especially hydrogen bonding as well as hydrophobic was found between PAPCAs and ChE. Based on the docking results, appropriate molecular structural parameters were adopted to develop a QSAR model. DFT-QSAR models for ChE enzymes demonstrated the importance of electrophilicity parameter in describing the anti-AChE and anti-BChE activities of the synthesized compounds. The correlation matrix of QSAR models and docking analysis confirmed that electrophilicity descriptor can control the influence of the hydrophobic properties of P=(O, S) and CO functional groups of PAPCA derivatives in the inhibition of human ChE enzymes. Copyright © 2014 Elsevier Inc. All rights reserved.
The QSAR study of flavonoid-metal complexes scavenging rad OH free radical
NASA Astrophysics Data System (ADS)
Wang, Bo-chu; Qian, Jun-zhen; Fan, Ying; Tan, Jun
2014-10-01
Flavonoid-metal complexes have antioxidant activities. However, quantitative structure-activity relationships (QSAR) of flavonoid-metal complexes and their antioxidant activities has still not been tackled. On the basis of 21 structures of flavonoid-metal complexes and their antioxidant activities for scavenging rad OH free radical, we optimised their structures using Gaussian 03 software package and we subsequently calculated and chose 18 quantum chemistry descriptors such as dipole, charge and energy. Then we chose several quantum chemistry descriptors that are very important to the IC50 of flavonoid-metal complexes for scavenging rad OH free radical through method of stepwise linear regression, Meanwhile we obtained 4 new variables through the principal component analysis. Finally, we built the QSAR models based on those important quantum chemistry descriptors and the 4 new variables as the independent variables and the IC50 as the dependent variable using an Artificial Neural Network (ANN), and we validated the two models using experimental data. These results show that the two models in this paper are reliable and predictable.
Mendenhall, Jeffrey; Meiler, Jens
2016-02-01
Dropout is an Artificial Neural Network (ANN) training technique that has been shown to improve ANN performance across canonical machine learning (ML) datasets. Quantitative Structure Activity Relationship (QSAR) datasets used to relate chemical structure to biological activity in Ligand-Based Computer-Aided Drug Discovery pose unique challenges for ML techniques, such as heavily biased dataset composition, and relatively large number of descriptors relative to the number of actives. To test the hypothesis that dropout also improves QSAR ANNs, we conduct a benchmark on nine large QSAR datasets. Use of dropout improved both enrichment false positive rate and log-scaled area under the receiver-operating characteristic curve (logAUC) by 22-46 % over conventional ANN implementations. Optimal dropout rates are found to be a function of the signal-to-noise ratio of the descriptor set, and relatively independent of the dataset. Dropout ANNs with 2D and 3D autocorrelation descriptors outperform conventional ANNs as well as optimized fingerprint similarity search methods.
Mendenhall, Jeffrey; Meiler, Jens
2016-01-01
Dropout is an Artificial Neural Network (ANN) training technique that has been shown to improve ANN performance across canonical machine learning (ML) datasets. Quantitative Structure Activity Relationship (QSAR) datasets used to relate chemical structure to biological activity in Ligand-Based Computer-Aided Drug Discovery (LB-CADD) pose unique challenges for ML techniques, such as heavily biased dataset composition, and relatively large number of descriptors relative to the number of actives. To test the hypothesis that dropout also improves QSAR ANNs, we conduct a benchmark on nine large QSAR datasets. Use of dropout improved both Enrichment false positive rate (FPR) and log-scaled area under the receiver-operating characteristic curve (logAUC) by 22–46% over conventional ANN implementations. Optimal dropout rates are found to be a function of the signal-to-noise ratio of the descriptor set, and relatively independent of the dataset. Dropout ANNs with 2D and 3D autocorrelation descriptors outperform conventional ANNs as well as optimized fingerprint similarity search methods. PMID:26830599
NASA Astrophysics Data System (ADS)
Sippl, Wolfgang
2000-08-01
One of the major challenges in computational approaches to drug design is the accurate prediction of binding affinity of biomolecules. In the present study several prediction methods for a published set of estrogen receptor ligands are investigated and compared. The binding modes of 30 ligands were determined using the docking program AutoDock and were compared with available X-ray structures of estrogen receptor-ligand complexes. On the basis of the docking results an interaction energy-based model, which uses the information of the whole ligand-receptor complex, was generated. Several parameters were modified in order to analyze their influence onto the correlation between binding affinities and calculated ligand-receptor interaction energies. The highest correlation coefficient ( r 2 = 0.617, q 2 LOO = 0.570) was obtained considering protein flexibility during the interaction energy evaluation. The second prediction method uses a combination of receptor-based and 3D quantitative structure-activity relationships (3D QSAR) methods. The ligand alignment obtained from the docking simulations was taken as basis for a comparative field analysis applying the GRID/GOLPE program. Using the interaction field derived with a water probe and applying the smart region definition (SRD) variable selection, a significant and robust model was obtained ( r 2 = 0.991, q 2 LOO = 0.921). The predictive ability of the established model was further evaluated by using a test set of six additional compounds. The comparison with the generated interaction energy-based model and with a traditional CoMFA model obtained using a ligand-based alignment ( r 2 = 0.951, q 2 LOO = 0.796) indicates that the combination of receptor-based and 3D QSAR methods is able to improve the quality of the underlying model.
2D-QSAR and 3D-QSAR Analyses for EGFR Inhibitors
Zhao, Manman; Zheng, Linfeng; Qiu, Chun
2017-01-01
Epidermal growth factor receptor (EGFR) is an important target for cancer therapy. In this study, EGFR inhibitors were investigated to build a two-dimensional quantitative structure-activity relationship (2D-QSAR) model and a three-dimensional quantitative structure-activity relationship (3D-QSAR) model. In the 2D-QSAR model, the support vector machine (SVM) classifier combined with the feature selection method was applied to predict whether a compound was an EGFR inhibitor. As a result, the prediction accuracy of the 2D-QSAR model was 98.99% by using tenfold cross-validation test and 97.67% by using independent set test. Then, in the 3D-QSAR model, the model with q2 = 0.565 (cross-validated correlation coefficient) and r2 = 0.888 (non-cross-validated correlation coefficient) was built to predict the activity of EGFR inhibitors. The mean absolute error (MAE) of the training set and test set was 0.308 log units and 0.526 log units, respectively. In addition, molecular docking was also employed to investigate the interaction between EGFR inhibitors and EGFR. PMID:28630865
Dolezal, Rafael; Korabecny, Jan; Malinak, David; Honegr, Jan; Musilek, Kamil; Kuca, Kamil
2015-03-01
To predict unknown reactivation potencies of 12 mono- and bis-pyridinium aldoximes for VX-inhibited rat acetylcholinesterase (rAChE), three-dimensional quantitative structure-activity relationship (3D QSAR) analysis has been carried out. Utilizing molecular interaction fields (MIFs) calculated by molecular mechanical (MMFF94) and quantum chemical (B3LYP/6-31G*) methods, two satisfactory ligand-based CoMFA models have been developed: 1. R(2)=0.9989, Q(LOO)(2)=0.9090, Q(LTO)(2)=0.8921, Q(LMO(20%))(2)=0.8853, R(ext)(2)=0.9259, SDEP(ext)=6.8938; 2. R(2)=0.9962, Q(LOO)(2)=0.9368, Q(LTO)(2)=0.9298, Q(LMO(20%))(2)=0.9248, R(ext)(2)=0.8905, SDEP(ext)=6.6756. High statistical significance of the 3D QSAR models has been achieved through the application of several data noise reduction techniques (i.e. smart region definition SRD, fractional factor design FFD, uninformative/iterative variable elimination UVE/IVE) on the original MIFs. Besides the ligand-based CoMFA models, an alignment molecular set constructed by flexible molecular docking has been also studied. The contour maps as well as the predicted reactivation potencies resulting from 3D QSAR analyses help better understand which structural features are associated with increased reactivation potency of studied compounds. Copyright © 2014 Elsevier Inc. All rights reserved.
Bradbury, Steven P; Russom, Christine L; Ankley, Gerald T; Schultz, T Wayne; Walker, John D
2003-08-01
The use of quantitative structure-activity relationships (QSARs) in assessing potential toxic effects of organic chemicals on aquatic organisms continues to evolve as computational efficiency and toxicological understanding advance. With the ever-increasing production of new chemicals, and the need to optimize resources to assess thousands of existing chemicals in commerce, regulatory agencies have turned to QSARs as essential tools to help prioritize tiered risk assessments when empirical data are not available to evaluate toxicological effects. Progress in designing scientifically credible QSARs is intimately associated with the development of empirically derived databases of well-defined and quantified toxicity endpoints, which are based on a strategic evaluation of diverse sets of chemical structures, modes of toxic action, and species. This review provides a brief overview of four databases created for the purpose of developing QSARs for estimating toxicity of chemicals to aquatic organisms. The evolution of QSARs based initially on general chemical classification schemes, to models founded on modes of toxic action that range from nonspecific partitioning into hydrophobic cellular membranes to receptor-mediated mechanisms is summarized. Finally, an overview of expert systems that integrate chemical-specific mode of action classification and associated QSAR selection for estimating potential toxicological effects of organic chemicals is presented.
QSAR modeling of cumulative environmental end-points for the prioritization of hazardous chemicals.
Gramatica, Paola; Papa, Ester; Sangion, Alessandro
2018-01-24
The hazard of chemicals in the environment is inherently related to the molecular structure and derives simultaneously from various chemical properties/activities/reactivities. Models based on Quantitative Structure Activity Relationships (QSARs) are useful to screen, rank and prioritize chemicals that may have an adverse impact on humans and the environment. This paper reviews a selection of QSAR models (based on theoretical molecular descriptors) developed for cumulative multivariate endpoints, which were derived by mathematical combination of multiple effects and properties. The cumulative end-points provide an integrated holistic point of view to address environmentally relevant properties of chemicals.
2011-09-22
OPs) are a group of pesticides that inhibit enzymes such as acetylcholinesterase. Numerous OP structural variants exist and toxicity data can be...and human toxicity studies especially for OPs lacking experimental data. 15. SUBJECT TERMS QSAR Organophosphates...structure and mechanism of toxicity c) Linking QSAR and OP PBPK/PD 2. Methods a) Physiochemical Descriptors b) Regression Techniques 3. Results a
QSAR modeling based on structure-information for properties of interest in human health.
Hall, L H; Hall, L M
2005-01-01
The development of QSAR models based on topological structure description is presented for problems in human health. These models are based on the structure-information approach to quantitative biological modeling and prediction, in contrast to the mechanism-based approach. The structure-information approach is outlined, starting with basic structure information developed from the chemical graph (connection table). Information explicit in the connection table (element identity and skeletal connections) leads to significant (implicit) structure information that is useful for establishing sound models of a wide range of properties of interest in drug design. Valence state definition leads to relationships for valence state electronegativity and atom/group molar volume. Based on these important aspects of molecules, together with skeletal branching patterns, both the electrotopological state (E-state) and molecular connectivity (chi indices) structure descriptors are developed and described. A summary of four QSAR models indicates the wide range of applicability of these structure descriptors and the predictive quality of QSAR models based on them: aqueous solubility (5535 chemically diverse compounds, 938 in external validation), percent oral absorption (%OA, 417 therapeutic drugs, 195 drugs in external validation testing), AMES mutagenicity (2963 compounds including 290 therapeutic drugs, 400 in external validation), fish toxicity (92 substituted phenols, anilines and substituted aromatics). These models are established independent of explicit three-dimensional (3-D) structure information and are directly interpretable in terms of the implicit structure information useful to the drug design process.
Hattotuwagama, Channa K; Doytchinova, Irini A; Flower, Darren R
2007-01-01
Quantitative structure-activity relationship (QSAR) analysis is a cornerstone of modern informatics. Predictive computational models of peptide-major histocompatibility complex (MHC)-binding affinity based on QSAR technology have now become important components of modern computational immunovaccinology. Historically, such approaches have been built around semiqualitative, classification methods, but these are now giving way to quantitative regression methods. We review three methods--a 2D-QSAR additive-partial least squares (PLS) and a 3D-QSAR comparative molecular similarity index analysis (CoMSIA) method--which can identify the sequence dependence of peptide-binding specificity for various class I MHC alleles from the reported binding affinities (IC50) of peptide sets. The third method is an iterative self-consistent (ISC) PLS-based additive method, which is a recently developed extension to the additive method for the affinity prediction of class II peptides. The QSAR methods presented here have established themselves as immunoinformatic techniques complementary to existing methodology, useful in the quantitative prediction of binding affinity: current methods for the in silico identification of T-cell epitopes (which form the basis of many vaccines, diagnostics, and reagents) rely on the accurate computational prediction of peptide-MHC affinity. We have reviewed various human and mouse class I and class II allele models. Studied alleles comprise HLA-A*0101, HLA-A*0201, HLA-A*0202, HLA-A*0203, HLA-A*0206, HLA-A*0301, HLA-A*1101, HLA-A*3101, HLA-A*6801, HLA-A*6802, HLA-B*3501, H2-K(k), H2-K(b), H2-D(b) HLA-DRB1*0101, HLA-DRB1*0401, HLA-DRB1*0701, I-A(b), I-A(d), I-A(k), I-A(S), I-E(d), and I-E(k). In this chapter we show a step-by-step guide into predicting the reliability and the resulting models to represent an advance on existing methods. The peptides used in this study are available from the AntiJen database (http://www.jenner.ac.uk/AntiJen). The PLS method is available commercially in the SYBYL molecular modeling software package. The resulting models, which can be used for accurate T-cell epitope prediction, will be made are freely available online at the URL http://www.jenner.ac.uk/MHCPred.
QSAR Study for Carcinogenic Potency of Aromatic Amines Based on GEP and MLPs
Song, Fucheng; Zhang, Anling; Liang, Hui; Cui, Lianhua; Li, Wenlian; Si, Hongzong; Duan, Yunbo; Zhai, Honglin
2016-01-01
A new analysis strategy was used to classify the carcinogenicity of aromatic amines. The physical-chemical parameters are closely related to the carcinogenicity of compounds. Quantitative structure activity relationship (QSAR) is a method of predicting the carcinogenicity of aromatic amine, which can reveal the relationship between carcinogenicity and physical-chemical parameters. This study accessed gene expression programming by APS software, the multilayer perceptrons by Weka software to predict the carcinogenicity of aromatic amines, respectively. All these methods relied on molecular descriptors calculated by CODESSA software and eight molecular descriptors were selected to build function equations. As a remarkable result, the accuracy of gene expression programming in training and test sets are 0.92 and 0.82, the accuracy of multilayer perceptrons in training and test sets are 0.84 and 0.74 respectively. The precision of the gene expression programming is obviously superior to multilayer perceptrons both in training set and test set. The QSAR application in the identification of carcinogenic compounds is a high efficiency method. PMID:27854309
From QSAR to QSIIR: Searching for Enhanced Computational Toxicology Models
Zhu, Hao
2017-01-01
Quantitative Structure Activity Relationship (QSAR) is the most frequently used modeling approach to explore the dependency of biological, toxicological, or other types of activities/properties of chemicals on their molecular features. In the past two decades, QSAR modeling has been used extensively in drug discovery process. However, the predictive models resulted from QSAR studies have limited use for chemical risk assessment, especially for animal and human toxicity evaluations, due to the low predictivity of new compounds. To develop enhanced toxicity models with independently validated external prediction power, novel modeling protocols were pursued by computational toxicologists based on rapidly increasing toxicity testing data in recent years. This chapter reviews the recent effort in our laboratory to incorporate the biological testing results as descriptors in the toxicity modeling process. This effort extended the concept of QSAR to Quantitative Structure In vitro-In vivo Relationship (QSIIR). The QSIIR study examples provided in this chapter indicate that the QSIIR models that based on the hybrid (biological and chemical) descriptors are indeed superior to the conventional QSAR models that only based on chemical descriptors for several animal toxicity endpoints. We believe that the applications introduced in this review will be of interest and value to researchers working in the field of computational drug discovery and environmental chemical risk assessment. PMID:23086837
NASA Astrophysics Data System (ADS)
Manoharan, Prabu; Vijayan, R. S. K.; Ghoshal, Nanda
2010-10-01
The ability to identify fragments that interact with a biological target is a key step in FBDD. To date, the concept of fragment based drug design (FBDD) is increasingly driven by bio-physical methods. To expand the boundaries of QSAR paradigm, and to rationalize FBDD using In silico approach, we propose a fragment based QSAR methodology referred here in as FB-QSAR. The FB-QSAR methodology was validated on a dataset consisting of 52 Hydroxy ethylamine (HEA) inhibitors, disclosed by GlaxoSmithKline Pharmaceuticals as potential anti-Alzheimer agents. To address the issue of target selectivity, a major confounding factor in the development of selective BACE1 inhibitors, FB-QSSR models were developed using the reported off target activity values. A heat map constructed, based on the activity and selectivity profile of the individual R-group fragments, and was in turn used to identify superior R-group fragments. Further, simultaneous optimization of multiple properties, an issue encountered in real-world drug discovery scenario, and often overlooked in QSAR approaches, was addressed using a Multi Objective (MO-QSPR) method that balances properties, based on the defined objectives. MO-QSPR was implemented using Derringer and Suich desirability algorithm to identify the optimal level of independent variables ( X) that could confer a trade-off between selectivity and activity. The results obtained from FB-QSAR were further substantiated using MIF (Molecular Interaction Fields) studies. To exemplify the potentials of FB-QSAR and MO-QSPR in a pragmatic fashion, the insights gleaned from the MO-QSPR study was reverse engineered using Inverse-QSAR in a combinatorial fashion to enumerate some prospective novel, potent and selective BACE1 inhibitors.
Manoharan, Prabu; Vijayan, R S K; Ghoshal, Nanda
2010-10-01
The ability to identify fragments that interact with a biological target is a key step in FBDD. To date, the concept of fragment based drug design (FBDD) is increasingly driven by bio-physical methods. To expand the boundaries of QSAR paradigm, and to rationalize FBDD using In silico approach, we propose a fragment based QSAR methodology referred here in as FB-QSAR. The FB-QSAR methodology was validated on a dataset consisting of 52 Hydroxy ethylamine (HEA) inhibitors, disclosed by GlaxoSmithKline Pharmaceuticals as potential anti-Alzheimer agents. To address the issue of target selectivity, a major confounding factor in the development of selective BACE1 inhibitors, FB-QSSR models were developed using the reported off target activity values. A heat map constructed, based on the activity and selectivity profile of the individual R-group fragments, and was in turn used to identify superior R-group fragments. Further, simultaneous optimization of multiple properties, an issue encountered in real-world drug discovery scenario, and often overlooked in QSAR approaches, was addressed using a Multi Objective (MO-QSPR) method that balances properties, based on the defined objectives. MO-QSPR was implemented using Derringer and Suich desirability algorithm to identify the optimal level of independent variables (X) that could confer a trade-off between selectivity and activity. The results obtained from FB-QSAR were further substantiated using MIF (Molecular Interaction Fields) studies. To exemplify the potentials of FB-QSAR and MO-QSPR in a pragmatic fashion, the insights gleaned from the MO-QSPR study was reverse engineered using Inverse-QSAR in a combinatorial fashion to enumerate some prospective novel, potent and selective BACE1 inhibitors.
QSAR and 3D QSAR of inhibitors of the epidermal growth factor receptor
NASA Astrophysics Data System (ADS)
Pinto-Bazurco, Mariano; Tsakovska, Ivanka; Pajeva, Ilza
This article reports quantitative structure-activity relationships (QSAR) and 3D QSAR models of 134 structurally diverse inhibitors of the epidermal growth factor receptor (EGFR) tyrosine kinase. Free-Wilson analysis was used to derive the QSAR model. It identified the substituents in aniline, the polycyclic system, and the substituents at the 6- and 7-positions of the polycyclic system as the most important structural features. Comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) were used in the 3D QSAR modeling. The steric and electrostatic interactions proved the most important for the inhibitory effect. Both QSAR and 3D QSAR models led to consistent results. On the basis of the statistically significant models, new structures were proposed and their inhibitory activities were predicted.
QSAR as a random event: modeling of nanoparticles uptake in PaCa2 cancer cells.
Toropov, Andrey A; Toropova, Alla P; Puzyn, Tomasz; Benfenati, Emilio; Gini, Giuseppina; Leszczynska, Danuta; Leszczynski, Jerzy
2013-06-01
Quantitative structure-property/activity relationships (QSPRs/QSARs) are a tool to predict various endpoints for various substances. The "classic" QSPR/QSAR analysis is based on the representation of the molecular structure by the molecular graph. However, simplified molecular input-line entry system (SMILES) gradually becomes most popular representation of the molecular structure in the databases available on the Internet. Under such circumstances, the development of molecular descriptors calculated directly from SMILES becomes attractive alternative to "classic" descriptors. The CORAL software (http://www.insilico.eu/coral) is provider of SMILES-based optimal molecular descriptors which are aimed to correlate with various endpoints. We analyzed data set on nanoparticles uptake in PaCa2 pancreatic cancer cells. The data set includes 109 nanoparticles with the same core but different surface modifiers (small organic molecules). The concept of a QSAR as a random event is suggested in opposition to "classic" QSARs which are based on the only one distribution of available data into the training and the validation sets. In other words, five random splits into the "visible" training set and the "invisible" validation set were examined. The SMILES-based optimal descriptors (obtained by the Monte Carlo technique) for these splits are calculated with the CORAL software. The statistical quality of all these models is good. Copyright © 2013 Elsevier Ltd. All rights reserved.
Latest advances in molecular topology applications for drug discovery.
Zanni, Riccardo; Galvez-Llompart, Maria; García-Domenech, Ramón; Galvez, Jorge
2015-01-01
Molecular topology (MT) has emerged in recent years as a powerful approach for the in silico generation of new drugs. In the last decade, its application has become more and more popular among the leading research groups in the field of quantitative structure-activity relationships (QSAR) and drug design. This has, in turn, contributed to the rapid development of new techniques and applications of MT in QSAR studies, as well as the introduction of new topological indices. This review collates the main innovative techniques in the field of MT and provides a description of the novel topological indices recently introduced, through an exhaustive recompilation of the most significant works carried out by the leading research groups in the field of drug design and discovery. The objective is to show the importance of MT methods combined with the effectiveness of the descriptors. Recent years have witnessed a remarkable rise in QSAR methods based on MT and its application to drug design. New methodologies have been introduced in the area such as QSAR multi-target, Markov networks or perturbation methods. Moreover, novel topological indices, such as Bourgas' descriptors and other new concepts as the derivative of a graph or cliques capable to distinguish between conformers, have also been introduced. New drugs have also been discovered, including anticonvulsants, anineoplastics, antimalarials or antiallergics, just to name a few. In the authors' opinion, MT and QSAR have moved from an attractive possibility to representing a foundation stone in the process of drug discovery.
Ebalunode, Jerry O; Zheng, Weifan; Tropsha, Alexander
2011-01-01
Optimization of chemical library composition affords more efficient identification of hits from biological screening experiments. The optimization could be achieved through rational selection of reagents used in combinatorial library synthesis. However, with a rapid advent of parallel synthesis methods and availability of millions of compounds synthesized by many vendors, it may be more efficient to design targeted libraries by means of virtual screening of commercial compound collections. This chapter reviews the application of advanced cheminformatics approaches such as quantitative structure-activity relationships (QSAR) and pharmacophore modeling (both ligand and structure based) for virtual screening. Both approaches rely on empirical SAR data to build models; thus, the emphasis is placed on achieving models of the highest rigor and external predictive power. We present several examples of successful applications of both approaches for virtual screening to illustrate their utility. We suggest that the expert use of both QSAR and pharmacophore models, either independently or in combination, enables users to achieve targeted libraries enriched with experimentally confirmed hit compounds.
SAR/QSAR methods in public health practice
DOE Office of Scientific and Technical Information (OSTI.GOV)
Demchuk, Eugene, E-mail: edemchuk@cdc.gov; Ruiz, Patricia; Chou, Selene
2011-07-15
Methods of (Quantitative) Structure-Activity Relationship ((Q)SAR) modeling play an important and active role in ATSDR programs in support of the Agency mission to protect human populations from exposure to environmental contaminants. They are used for cross-chemical extrapolation to complement the traditional toxicological approach when chemical-specific information is unavailable. SAR and QSAR methods are used to investigate adverse health effects and exposure levels, bioavailability, and pharmacokinetic properties of hazardous chemical compounds. They are applied as a part of an integrated systematic approach in the development of Health Guidance Values (HGVs), such as ATSDR Minimal Risk Levels, which are used to protectmore » populations exposed to toxic chemicals at hazardous waste sites. (Q)SAR analyses are incorporated into ATSDR documents (such as the toxicological profiles and chemical-specific health consultations) to support environmental health assessments, prioritization of environmental chemical hazards, and to improve study design, when filling the priority data needs (PDNs) as mandated by Congress, in instances when experimental information is insufficient. These cases are illustrated by several examples, which explain how ATSDR applies (Q)SAR methods in public health practice.« less
Li, Yi; Tseng, Yufeng J.; Pan, Dahua; Liu, Jianzhong; Kern, Petra S.; Gerberick, G. Frank; Hopfinger, Anton J.
2008-01-01
Currently, the only validated methods to identify skin sensitization effects are in vivo models, such as the Local Lymph Node Assay (LLNA) and guinea pig studies. There is a tremendous need, in particular due to novel legislation, to develop animal alternatives, eg. Quantitative Structure-Activity Relationship (QSAR) models. Here, QSAR models for skin sensitization using LLNA data have been constructed. The descriptors used to generate these models are derived from the 4D-molecular similarity paradigm and are referred to as universal 4D-fingerprints. A training set of 132 structurally diverse compounds and a test set of 15 structurally diverse compounds were used in this study. The statistical methodologies used to build the models are logistic regression (LR), and partial least square coupled logistic regression (PLS-LR), which prove to be effective tools for studying skin sensitization measures expressed in the two categorical terms of sensitizer and non-sensitizer. QSAR models with low values of the Hosmer-Lemeshow goodness-of-fit statistic, χHL2, are significant and predictive. For the training set, the cross-validated prediction accuracy of the logistic regression models ranges from 77.3% to 78.0%, while that of PLS-logistic regression models ranges from 87.1% to 89.4%. For the test set, the prediction accuracy of logistic regression models ranges from 80.0%-86.7%, while that of PLS-logistic regression models ranges from 73.3%-80.0%. The QSAR models are made up of 4D-fingerprints related to aromatic atoms, hydrogen bond acceptors and negatively partially charged atoms. PMID:17226934
An ensemble model of QSAR tools for regulatory risk assessment.
Pradeep, Prachi; Povinelli, Richard J; White, Shannon; Merrill, Stephen J
2016-01-01
Quantitative structure activity relationships (QSARs) are theoretical models that relate a quantitative measure of chemical structure to a physical property or a biological effect. QSAR predictions can be used for chemical risk assessment for protection of human and environmental health, which makes them interesting to regulators, especially in the absence of experimental data. For compatibility with regulatory use, QSAR models should be transparent, reproducible and optimized to minimize the number of false negatives. In silico QSAR tools are gaining wide acceptance as a faster alternative to otherwise time-consuming clinical and animal testing methods. However, different QSAR tools often make conflicting predictions for a given chemical and may also vary in their predictive performance across different chemical datasets. In a regulatory context, conflicting predictions raise interpretation, validation and adequacy concerns. To address these concerns, ensemble learning techniques in the machine learning paradigm can be used to integrate predictions from multiple tools. By leveraging various underlying QSAR algorithms and training datasets, the resulting consensus prediction should yield better overall predictive ability. We present a novel ensemble QSAR model using Bayesian classification. The model allows for varying a cut-off parameter that allows for a selection in the desirable trade-off between model sensitivity and specificity. The predictive performance of the ensemble model is compared with four in silico tools (Toxtree, Lazar, OECD Toolbox, and Danish QSAR) to predict carcinogenicity for a dataset of air toxins (332 chemicals) and a subset of the gold carcinogenic potency database (480 chemicals). Leave-one-out cross validation results show that the ensemble model achieves the best trade-off between sensitivity and specificity (accuracy: 83.8 % and 80.4 %, and balanced accuracy: 80.6 % and 80.8 %) and highest inter-rater agreement [kappa ( κ ): 0.63 and 0.62] for both the datasets. The ROC curves demonstrate the utility of the cut-off feature in the predictive ability of the ensemble model. This feature provides an additional control to the regulators in grading a chemical based on the severity of the toxic endpoint under study.
An ensemble model of QSAR tools for regulatory risk assessment
Pradeep, Prachi; Povinelli, Richard J.; White, Shannon; ...
2016-09-22
Quantitative structure activity relationships (QSARs) are theoretical models that relate a quantitative measure of chemical structure to a physical property or a biological effect. QSAR predictions can be used for chemical risk assessment for protection of human and environmental health, which makes them interesting to regulators, especially in the absence of experimental data. For compatibility with regulatory use, QSAR models should be transparent, reproducible and optimized to minimize the number of false negatives. In silico QSAR tools are gaining wide acceptance as a faster alternative to otherwise time-consuming clinical and animal testing methods. However, different QSAR tools often make conflictingmore » predictions for a given chemical and may also vary in their predictive performance across different chemical datasets. In a regulatory context, conflicting predictions raise interpretation, validation and adequacy concerns. To address these concerns, ensemble learning techniques in the machine learning paradigm can be used to integrate predictions from multiple tools. By leveraging various underlying QSAR algorithms and training datasets, the resulting consensus prediction should yield better overall predictive ability. We present a novel ensemble QSAR model using Bayesian classification. The model allows for varying a cut-off parameter that allows for a selection in the desirable trade-off between model sensitivity and specificity. The predictive performance of the ensemble model is compared with four in silico tools (Toxtree, Lazar, OECD Toolbox, and Danish QSAR) to predict carcinogenicity for a dataset of air toxins (332 chemicals) and a subset of the gold carcinogenic potency database (480 chemicals). Leave-one-out cross validation results show that the ensemble model achieves the best trade-off between sensitivity and specificity (accuracy: 83.8 % and 80.4 %, and balanced accuracy: 80.6 % and 80.8 %) and highest inter-rater agreement [kappa (κ): 0.63 and 0.62] for both the datasets. The ROC curves demonstrate the utility of the cut-off feature in the predictive ability of the ensemble model. In conclusion, this feature provides an additional control to the regulators in grading a chemical based on the severity of the toxic endpoint under study.« less
Zhao, Yongsheng; Zhao, Jihong; Huang, Ying; Zhou, Qing; Zhang, Xiangping; Zhang, Suojiang
2014-08-15
A comprehensive database on toxicity of ionic liquids (ILs) is established. The database includes over 4000 pieces of data. Based on the database, the relationship between IL's structure and its toxicity has been analyzed qualitatively. Furthermore, Quantitative Structure-Activity relationships (QSAR) model is conducted to predict the toxicities (EC50 values) of various ILs toward the Leukemia rat cell line IPC-81. Four parameters selected by the heuristic method (HM) are used to perform the studies of multiple linear regression (MLR) and support vector machine (SVM). The squared correlation coefficient (R(2)) and the root mean square error (RMSE) of training sets by two QSAR models are 0.918 and 0.959, 0.258 and 0.179, respectively. The prediction R(2) and RMSE of QSAR test sets by MLR model are 0.892 and 0.329, by SVM model are 0.958 and 0.234, respectively. The nonlinear model developed by SVM algorithm is much outperformed MLR, which indicates that SVM model is more reliable in the prediction of toxicity of ILs. This study shows that increasing the relative number of O atoms of molecules leads to decrease in the toxicity of ILs. Copyright © 2014 Elsevier B.V. All rights reserved.
Fatima, Sabiha; Jatavath, Mohan Babu; Bathini, Raju; Sivan, Sree Kanth; Manga, Vijjulatha
2014-10-01
Poly(ADP-ribose) polymerase-1 (PARP-1) functions as a DNA damage sensor and signaling molecule. It plays a vital role in the repair of DNA strand breaks induced by radiation and chemotherapeutic drugs; inhibitors of this enzyme have the potential to improve cancer chemotherapy or radiotherapy. Three-dimensional quantitative structure activity relationship (3D QSAR) models were developed using comparative molecular field analysis, comparative molecular similarity indices analysis and docking studies. A set of 88 molecules were docked into the active site of six X-ray crystal structures of poly(ADP-ribose)polymerase-1 (PARP-1), by a procedure called multiple receptor conformation docking (MRCD), in order to improve the 3D QSAR models through the analysis of binding conformations. The docked poses were clustered to obtain the best receptor binding conformation. These dock poses from clustering were used for 3D QSAR analysis. Based on MRCD and QSAR information, some key features have been identified that explain the observed variance in the activity. Two receptor-based QSAR models were generated; these models showed good internal and external statistical reliability that is evident from the [Formula: see text], [Formula: see text] and [Formula: see text]. The identified key features enabled us to design new PARP-1 inhibitors.
Maganti, Lakshmi; Das, Sanjit Kumar; Mascarenhas, Nahren Manuel; Ghoshal, Nanda
2011-10-01
The re-emergence of tuberculosis infections, which are resistant to conventional drug therapy, has steadily risen in the last decade. Inhibitors of aryl acid adenylating enzyme known as MbtA, involved in siderophore biosynthesis in Mycobacterium tuberculosis, are being explored as potential antitubercular agents. The ability to identify fragments that interact with a biological target is a key step in fragment based drug design (FBDD). To expand the boundaries of quantitative structure activity relationship (QSAR) paradigm, we have proposed a Fragment Based QSAR methodology, referred here in as FB-QSAR, for deciphering the structural requirements of a series of nucleoside bisubstrate analogs for inhibition of MbtA, a key enzyme involved in siderophore biosynthetic pathway. For the development of FB-QSAR models, statistical techniques such as stepwise multiple linear regression (SMLR), genetic function approximation (GFA) and GFAspline were used. The predictive ability of the generated models was validated using different statistical metrics, and similarity-based coverage estimation was carried out to define applicability boundaries. To aid the creation of novel antituberculosis compounds, a bioisosteric database was enumerated using the combichem approach endorsed mining in a lead-like chemical space. The generated library was screened using an integrated in-silico approach and potential hits identified. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Jing, Pu; Zhao, Shujuan; Ruan, Siyu; Sui, Zhongquan; Chen, Lihong; Jiang, Linlei; Qian, Bingjun
2014-02-15
The 3-dimensional quantitative structure activity relationship (3D-QSAR) models were established from 21 anthocyanins based on their oxygen radical absorbing capacity (ORAC) and were applied to predict anthocyanins in eggplant and radish for their ORAC values. The cross-validated q(2)=0.857/0.729, non-cross-validated r(2) = 0.958/0.856, standard error of estimate = 0.153/0.134, and F = 73.267/19.247 were for the best QSAR (CoMFA/CoMSIA) models, where the correlation coefficient r(2)pred = 0.998/0.997 (>0.6) indicated a high predictive ability for each. Additionally, the contour map results suggested that structural characteristics of anthocyanins favourable for the high ORAC. Four anthocyanins from eggplant and radish have been screened based on the QSAR models. Pelargonidin-3-[(6''-p-coumaroyl)-glucosyl(2 → 1)glucoside]-5-(6''-malonyl)-glucoside, delphinidin-3-rutinoside-5-glucoside, and delphinidin-3-[(4''-p-coumaroyl)-rhamnosyl(1 → 6)glucoside]-5-glucoside potential with high ORAC based the QSAR models were isolated and also confirmed for their relative high antioxidant ability, which might attribute to the bulky and/or electron-donating substituent at the 3-position in the C ring or/and hydrogen bond donor group/electron donating group on the R1 position in the B ring. Copyright © 2013 Elsevier Ltd. All rights reserved.
Chen, Meimei; Yang, Fafu; Kang, Jie; Yang, Xuemei; Lai, Xinmei; Gao, Yuxing
2016-11-29
In this study, in silico approaches, including multiple QSAR modeling, structural similarity analysis, and molecular docking, were applied to develop QSAR classification models as a fast screening tool for identifying highly-potent ABCA1 up-regulators targeting LXRβ based on a series of new flavonoids. Initially, four modeling approaches, including linear discriminant analysis, support vector machine, radial basis function neural network, and classification and regression trees, were applied to construct different QSAR classification models. The statistics results indicated that these four kinds of QSAR models were powerful tools for screening highly potent ABCA1 up-regulators. Then, a consensus QSAR model was developed by combining the predictions from these four models. To discover new ABCA1 up-regulators at maximum accuracy, the compounds in the ZINC database that fulfilled the requirement of structural similarity of 0.7 compared to known potent ABCA1 up-regulator were subjected to the consensus QSAR model, which led to the discovery of 50 compounds. Finally, they were docked into the LXRβ binding site to understand their role in up-regulating ABCA1 expression. The excellent binding modes and docking scores of 10 hit compounds suggested they were highly-potent ABCA1 up-regulators targeting LXRβ. Overall, this study provided an effective strategy to discover highly potent ABCA1 up-regulators.
Wang, Hui; Jiang, Mingyue; Li, Shujun; Hse, Chung-Yun; Jin, Chunde; Sun, Fangli; Li, Zhuo
2017-09-01
Cinnamaldehyde amino acid Schiff base (CAAS) is a new class of safe, bioactive compounds which could be developed as potential antifungal agents for fungal infections. To design new cinnamaldehyde amino acid Schiff base compounds with high bioactivity, the quantitative structure-activity relationships (QSARs) for CAAS compounds against Aspergillus niger ( A. niger ) and Penicillium citrinum (P. citrinum) were analysed. The QSAR models ( R 2 = 0.9346 for A. niger , R 2 = 0.9590 for P. citrinum, ) were constructed and validated. The models indicated that the molecular polarity and the Max atomic orbital electronic population had a significant effect on antifungal activity. Based on the best QSAR models, two new compounds were designed and synthesized. Antifungal activity tests proved that both of them have great bioactivity against the selected fungi.
Zhang, Shuqun; Hou, Bo; Yang, Huaiyu; Zuo, Zhili
2016-05-01
Acetylcholinesterase (AChE) is an important enzyme in the pathogenesis of Alzheimer's disease (AD). Comparative quantitative structure-activity relationship (QSAR) analyses on some huprines inhibitors against AChE were carried out using comparative molecular field analysis (CoMFA), comparative molecular similarity indices analysis (CoMSIA), and hologram QSAR (HQSAR) methods. Three highly predictive QSAR models were constructed successfully based on the training set. The CoMFA, CoMSIA, and HQSAR models have values of r (2) = 0.988, q (2) = 0.757, ONC = 6; r (2) = 0.966, q (2) = 0.645, ONC = 5; and r (2) = 0.957, q (2) = 0.736, ONC = 6. The predictabilities were validated using an external test sets, and the predictive r (2) values obtained by the three models were 0.984, 0.973, and 0.783, respectively. The analysis was performed by combining the CoMFA and CoMSIA field distributions with the active sites of the AChE to further understand the vital interactions between huprines and the protease. On the basis of the QSAR study, 14 new potent molecules have been designed and six of them are predicted to be more active than the best active compound 24 described in the literature. The final QSAR models could be helpful in design and development of novel active AChE inhibitors.
Cronin, Mark T D; Walker, John D; Jaworska, Joanna S; Comber, Michael H I; Watts, Christopher D; Worth, Andrew P
2003-01-01
This article is a review of the use, by regulatory agencies and authorities, of quantitative structure-activity relationships (QSARs) to predict ecologic effects and environmental fate of chemicals. For many years, the U.S. Environmental Protection Agency has been the most prominent regulatory agency using QSARs to predict the ecologic effects and environmental fate of chemicals. However, as increasing numbers of standard QSAR methods are developed and validated to predict ecologic effects and environmental fate of chemicals, it is anticipated that more regulatory agencies and authorities will find them to be acceptable alternatives to chemical testing. PMID:12896861
Quantitative structure activity relationship studies of mushroom tyrosinase inhibitors
NASA Astrophysics Data System (ADS)
Xue, Chao-Bin; Luo, Wan-Chun; Ding, Qi; Liu, Shou-Zhu; Gao, Xing-Xiang
2008-05-01
Here, we report our results from quantitative structure-activity relationship studies on tyrosinase inhibitors. Interactions between benzoic acid derivatives and tyrosinase active sites were also studied using a molecular docking method. These studies indicated that one possible mechanism for the interaction between benzoic acid derivatives and the tyrosinase active site is the formation of a hydrogen-bond between the hydroxyl (aOH) and carbonyl oxygen atoms of Tyr98, which stabilized the position of Tyr98 and prevented Tyr98 from participating in the interaction between tyrosinase and ORF378. Tyrosinase, also known as phenoloxidase, is a key enzyme in animals, plants and insects that is responsible for catalyzing the hydroxylation of tyrosine into o-diphenols and the oxidation of o-diphenols into o-quinones. In the present study, the bioactivities of 48 derivatives of benzaldehyde, benzoic acid, and cinnamic acid compounds were used to construct three-dimensional quantitative structure-activity relationship (3D-QSAR) models using comparative molecular field (CoMFA) and comparative molecular similarity indices (CoMSIA) analyses. After superimposition using common substructure-based alignments, robust and predictive 3D-QSAR models were obtained from CoMFA ( q 2 = 0.855, r 2 = 0.978) and CoMSIA ( q 2 = 0.841, r 2 = 0.946), with 6 optimum components. Chemical descriptors, including electronic (Hammett σ), hydrophobic (π), and steric (MR) parameters, hydrogen bond acceptor (H-acc), and indicator variable ( I), were used to construct a 2D-QSAR model. The results of this QSAR indicated that π, MR, and H-acc account for 34.9, 31.6, and 26.7% of the calculated biological variance, respectively. The molecular interactions between ligand and target were studied using a flexible docking method (FlexX). The best scored candidates were docked flexibly, and the interaction between the benzoic acid derivatives and the tyrosinase active site was elucidated in detail. We believe that the QSAR models built here provide important information necessary for the design of novel tyrosinase inhibitors.
2D-QSAR study of fullerene nanostructure derivatives as potent HIV-1 protease inhibitors
NASA Astrophysics Data System (ADS)
Barzegar, Abolfazl; Jafari Mousavi, Somaye; Hamidi, Hossein; Sadeghi, Mehdi
2017-09-01
The protease of human immunodeficiency virus1 (HIV-PR) is an essential enzyme for antiviral treatments. Carbon nanostructures of fullerene derivatives, have nanoscale dimension with a diameter comparable to the diameter of the active site of HIV-PR which would in turn inhibit HIV. In this research, two dimensional quantitative structure-activity relationships (2D-QSAR) of fullerene derivatives against HIV-PR activity were employed as a powerful tool for elucidation the relationships between structure and experimental observations. QSAR study of 49 fullerene derivatives was performed by employing stepwise-MLR, GAPLS-MLR, and PCA-MLR models for variable (descriptor) selection and model construction. QSAR models were obtained with higher ability to predict the activity of the fullerene derivatives against HIV-PR by a correlation coefficient (R2training) of 0.942, 0.89, and 0.87 as well as R2test values of 0.791, 0.67and 0.674 for stepwise-MLR, GAPLS-MLR, and PCA -MLR models, respectively. Leave-one-out cross-validated correlation coefficient (R2CV) and Y-randomization methods confirmed the models robustness. The descriptors indicated that the HIV-PR inhibition depends on the van der Waals volumes, polarizability, bond order between two atoms and electronegativities of fullerenes derivatives. 2D-QSAR simulation without needing receptor's active site geometry, resulted in useful descriptors mainly denoting ;C60 backbone-functional groups; and ;C60 functional groups; properties. Both properties in fullerene refer to the ligand fitness and improvement van der Waals interactions with HIV-PR active site. Therefore, the QSAR models can be used in the search for novel HIV-PR inhibitors based on fullerene derivatives.
Roy, Kunal; Mitra, Indrani
2011-07-01
Quantitative structure-activity relationships (QSARs) have important applications in drug discovery research, environmental fate modeling, property prediction, etc. Validation has been recognized as a very important step for QSAR model development. As one of the important objectives of QSAR modeling is to predict activity/property/toxicity of new chemicals falling within the domain of applicability of the developed models and QSARs are being used for regulatory decisions, checking reliability of the models and confidence of their predictions is a very important aspect, which can be judged during the validation process. One prime application of a statistically significant QSAR model is virtual screening for molecules with improved potency based on the pharmacophoric features and the descriptors appearing in the QSAR model. Validated QSAR models may also be utilized for design of focused libraries which may be subsequently screened for the selection of hits. The present review focuses on various metrics used for validation of predictive QSAR models together with an overview of the application of QSAR models in the fields of virtual screening and focused library design for diverse series of compounds with citation of some recent examples.
Jardínez, Christiaan; Vela, Alberto; Cruz-Borbolla, Julián; Alvarez-Mendez, Rodrigo J; Alvarado-Rodríguez, José G
2016-12-01
The relationship between the chemical structure and biological activity (log IC 50 ) of 40 derivatives of 1,4-dihydropyridines (DHPs) was studied using density functional theory (DFT) and multiple linear regression analysis methods. With the aim of improving the quantitative structure-activity relationship (QSAR) model, the reduced density gradient s( r) of the optimized equilibrium geometries was used as a descriptor to include weak non-covalent interactions. The QSAR model highlights the correlation between the log IC 50 with highest molecular orbital energy (E HOMO ), molecular volume (V), partition coefficient (log P), non-covalent interactions NCI(H4-G) and the dual descriptor [Δf(r)]. The model yielded values of R 2 =79.57 and Q 2 =69.67 that were validated with the next four internal analytical validations DK=0.076, DQ=-0.006, R P =0.056, and R N =0.000, and the external validation Q 2 boot =64.26. The QSAR model found can be used to estimate biological activity with high reliability in new compounds based on a DHP series. Graphical abstract The good correlation between the log IC 50 with the NCI (H4-G) estimated by the reduced density gradient approach of the DHP derivatives.
Yadav, Mukesh; Joshi, Shobha; Nayarisseri, Anuraj; Jain, Anuja; Hussain, Aabid; Dubey, Tushar
2013-06-01
Global QSAR models predict biological response of molecular structures which are generic in particular class. A global QSAR dataset admits structural features derived from larger chemical space, intricate to model but more applicable in medicinal chemistry. The present work is global in either sense of structural diversity in QSAR dataset or large number of descriptor input. Forty phenethylamine structure derivatives were selected from a large pool (904) of similar phenethylamines available in Pubchem database. LogP values of selected candidates were collected from physical properties database (PHYSPROP) determined in identical set of conditions. Attempts to model logP value have produced significant QSAR models. MLR aided linear one-variable and two-variable QSAR models with their respective R(2) (0.866, 0.937), R(2)A (0.862, 0.932), F-stat (181.936, 199.812) and Standard Error (0.365, 0.255) are statistically fit and found predictive after internal validation and external validation. The descriptors chosen after improvisation and optimization reveal mechanistic part of work in terms of Verhaar model of Fish base-line toxicity from MLOGP, i.e. (BLTF96) and 3D-MoRSE -signal 15 /unweighted molecular descriptor calculated by summing atom weights viewed by a different angular scattering function (Mor15u) are crucial in regulation of logP values of phenethylamines.
Combined QSAR and molecule docking studies on predicting P-glycoprotein inhibitors
NASA Astrophysics Data System (ADS)
Tan, Wen; Mei, Hu; Chao, Li; Liu, Tengfei; Pan, Xianchao; Shu, Mao; Yang, Li
2013-12-01
P-glycoprotein (P-gp) is an ATP-binding cassette multidrug transporter. The over expression of P-gp leads to the development of multidrug resistance (MDR), which is a major obstacle to effective treatment of cancer. Thus, designing effective P-gp inhibitors has an extremely important role in the overcoming MDR. In this paper, both ligand-based quantitative structure-activity relationship (QSAR) and receptor-based molecular docking are used to predict P-gp inhibitors. The results show that each method achieves good prediction performance. According to the results of tenfold cross-validation, an optimal linear SVM model with only three descriptors is established on 857 training samples, of which the overall accuracy (Acc), sensitivity, specificity, and Matthews correlation coefficient are 0.840, 0.873, 0.813, and 0.683, respectively. The SVM model is further validated by 418 test samples with the overall Acc of 0.868. Based on a homology model of human P-gp established, Surflex-dock is also performed to give binding free energy-based evaluations with the overall accuracies of 0.823 for the test set. Furthermore, a consensus evaluation is also performed by using these two methods. Both QSAR and molecular docking studies indicate that molecular volume, hydrophobicity and aromaticity are three dominant factors influencing the inhibitory activities.
Cruz-Monteagudo, Maykel; Borges, Fernanda; Cordeiro, M Natália D S; Cagide Fajin, J Luis; Morell, Carlos; Ruiz, Reinaldo Molina; Cañizares-Carmenate, Yudith; Dominguez, Elena Rosa
2008-01-01
Up to now, very few applications of multiobjective optimization (MOOP) techniques to quantitative structure-activity relationship (QSAR) studies have been reported in the literature. However, none of them report the optimization of objectives related directly to the final pharmaceutical profile of a drug. In this paper, a MOOP method based on Derringer's desirability function that allows conducting global QSAR studies, simultaneously considering the potency, bioavailability, and safety of a set of drug candidates, is introduced. The results of the desirability-based MOOP (the levels of the predictor variables concurrently producing the best possible compromise between the properties determining an optimal drug candidate) are used for the implementation of a ranking method that is also based on the application of desirability functions. This method allows ranking drug candidates with unknown pharmaceutical properties from combinatorial libraries according to the degree of similarity with the previously determined optimal candidate. Application of this method will make it possible to filter the most promising drug candidates of a library (the best-ranked candidates), which should have the best pharmaceutical profile (the best compromise between potency, safety and bioavailability). In addition, a validation method of the ranking process, as well as a quantitative measure of the quality of a ranking, the ranking quality index (Psi), is proposed. The usefulness of the desirability-based methods of MOOP and ranking is demonstrated by its application to a library of 95 fluoroquinolones, reporting their gram-negative antibacterial activity and mammalian cell cytotoxicity. Finally, the combined use of the desirability-based methods of MOOP and ranking proposed here seems to be a valuable tool for rational drug discovery and development.
Using Theoretical Descriptions in Structure Activity Relations. 3. Electronic Descriptors
1988-08-01
Activity Relationships (QSAR) have been used successfully in the past to develop predictive equations for several biological and physical properties...Linear Free Energy Relationships (,FF.3) and is based on work by Hammet in which he derived electronic descriptors for the dissociation of substituted...structure of a compound and its activity in a system. Several different structural descriptors have been used in QSAR equations . These range from
Vijayaraj, Ramadoss; Devi, Mekapothula Lakshmi Vasavi; Subramanian, Venkatesan; Chattaraj, Pratim Kumar
2012-06-01
Three-dimensional quantitative structure activity relationship (3D-QSAR) study has been carried out on the Escherichia coli DHFR inhibitors 2,4-diamino-5-(substituted-benzyl)pyrimidine derivatives to understand the structural features responsible for the improved potency. To construct highly predictive 3D-QSAR models, comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) methods were used. The predicted models show statistically significant cross-validated and non-cross-validated correlation coefficient of r2 CV and r2 nCV, respectively. The final 3D-QSAR models were validated using structurally diverse test set compounds. Analysis of the contour maps generated from CoMFA and CoMSIA methods reveals that the substitution of electronegative groups at the first and second position along with electropositive group at the third position of R2 substitution significantly increases the potency of the derivatives. The results obtained from the CoMFA and CoMSIA study delineate the substituents on the trimethoprim analogues responsible for the enhanced potency and also provide valuable directions for the design of new trimethoprim analogues with improved affinity. © 2012 John Wiley & Sons A/S.
Furuhama, A; Toida, T; Nishikawa, N; Aoki, Y; Yoshioka, Y; Shiraishi, H
2010-07-01
The KAshinhou Tool for Ecotoxicity (KATE) system, including ecotoxicity quantitative structure-activity relationship (QSAR) models, was developed by the Japanese National Institute for Environmental Studies (NIES) using the database of aquatic toxicity results gathered by the Japanese Ministry of the Environment and the US EPA fathead minnow database. In this system chemicals can be entered according to their one-dimensional structures and classified by substructure. The QSAR equations for predicting the toxicity of a chemical compound assume a linear correlation between its log P value and its aquatic toxicity. KATE uses a structural domain called C-judgement, defined by the substructures of specified functional groups in the QSAR models. Internal validation by the leave-one-out method confirms that the QSAR equations, with r(2 )> 0.7, RMSE
Politi, Regina; Rusyn, Ivan; Tropsha, Alexander
2016-01-01
The thyroid hormone receptor (THR) is an important member of the nuclear receptor family that can be activated by endocrine disrupting chemicals (EDC). Quantitative Structure-Activity Relationship (QSAR) models have been developed to facilitate the prioritization of THR-mediated EDC for the experimental validation. The largest database of binding affinities available at the time of the study for ligand binding domain (LBD) of THRβ was assembled to generate both continuous and classification QSAR models with an external accuracy of R2=0.55 and CCR=0.76, respectively. In addition, for the first time a QSAR model was developed to predict binding affinities of antagonists inhibiting the interaction of coactivators with the AF-2 domain of THRβ (R2=0.70). Furthermore, molecular docking studies were performed for a set of THRβ ligands (57 agonists and 15 antagonists of LBD, 210 antagonists of the AF-2 domain, supplemented by putative decoys/non-binders) using several THRβ structures retrieved from the Protein Data Bank. We found that two agonist-bound THRβ conformations could effectively discriminate their corresponding ligands from presumed non-binders. Moreover, one of the agonist conformations could discriminate agonists from antagonists. Finally, we have conducted virtual screening of a chemical library compiled by the EPA as part of the Tox21 program to identify potential THRβ-mediated EDCs using both QSAR models and docking. We concluded that the library is unlikely to have any EDC that would bind to the THRβ. Models developed in this study can be employed either to identify environmental chemicals interacting with the THR or, conversely, to eliminate the THR-mediated mechanism of action for chemicals of concern. PMID:25058446
2012-01-01
Background The Hedgehog Signaling Pathway is one of signaling pathways that are very important to embryonic development. The participation of inhibitors in the Hedgehog Signal Pathway can control cell growth and death, and searching novel inhibitors to the functioning of the pathway are in a great demand. As the matter of fact, effective inhibitors could provide efficient therapies for a wide range of malignancies, and targeting such pathway in cells represents a promising new paradigm for cell growth and death control. Current research mainly focuses on the syntheses of the inhibitors of cyclopamine derivatives, which bind specifically to the Smo protein, and can be used for cancer therapy. While quantitatively structure-activity relationship (QSAR) studies have been performed for these compounds among different cell lines, none of them have achieved acceptable results in the prediction of activity values of new compounds. In this study, we proposed a novel collaborative QSAR model for inhibitors of the Hedgehog Signaling Pathway by integration the information from multiple cell lines. Such a model is expected to substantially improve the QSAR ability from single cell lines, and provide useful clues in developing clinically effective inhibitors and modifications of parent lead compounds for target on the Hedgehog Signaling Pathway. Results In this study, we have presented: (1) a collaborative QSAR model, which is used to integrate information among multiple cell lines to boost the QSAR results, rather than only a single cell line QSAR modeling. Our experiments have shown that the performance of our model is significantly better than single cell line QSAR methods; and (2) an efficient feature selection strategy under such collaborative environment, which can derive the commonly important features related to the entire given cell lines, while simultaneously showing their specific contributions to a specific cell-line. Based on feature selection results, we have proposed several possible chemical modifications to improve the inhibitor affinity towards multiple targets in the Hedgehog Signaling Pathway. Conclusions Our model with the feature selection strategy presented here is efficient, robust, and flexible, and can be easily extended to model large-scale multiple cell line/QSAR data. The data and scripts for collaborative QSAR modeling are available in the Additional file 1. PMID:22849868
Gao, Jun; Che, Dongsheng; Zheng, Vincent W; Zhu, Ruixin; Liu, Qi
2012-07-31
The Hedgehog Signaling Pathway is one of signaling pathways that are very important to embryonic development. The participation of inhibitors in the Hedgehog Signal Pathway can control cell growth and death, and searching novel inhibitors to the functioning of the pathway are in a great demand. As the matter of fact, effective inhibitors could provide efficient therapies for a wide range of malignancies, and targeting such pathway in cells represents a promising new paradigm for cell growth and death control. Current research mainly focuses on the syntheses of the inhibitors of cyclopamine derivatives, which bind specifically to the Smo protein, and can be used for cancer therapy. While quantitatively structure-activity relationship (QSAR) studies have been performed for these compounds among different cell lines, none of them have achieved acceptable results in the prediction of activity values of new compounds. In this study, we proposed a novel collaborative QSAR model for inhibitors of the Hedgehog Signaling Pathway by integration the information from multiple cell lines. Such a model is expected to substantially improve the QSAR ability from single cell lines, and provide useful clues in developing clinically effective inhibitors and modifications of parent lead compounds for target on the Hedgehog Signaling Pathway. In this study, we have presented: (1) a collaborative QSAR model, which is used to integrate information among multiple cell lines to boost the QSAR results, rather than only a single cell line QSAR modeling. Our experiments have shown that the performance of our model is significantly better than single cell line QSAR methods; and (2) an efficient feature selection strategy under such collaborative environment, which can derive the commonly important features related to the entire given cell lines, while simultaneously showing their specific contributions to a specific cell-line. Based on feature selection results, we have proposed several possible chemical modifications to improve the inhibitor affinity towards multiple targets in the Hedgehog Signaling Pathway. Our model with the feature selection strategy presented here is efficient, robust, and flexible, and can be easily extended to model large-scale multiple cell line/QSAR data. The data and scripts for collaborative QSAR modeling are available in the Additional file 1.
Liu, Zhichao; Kelly, Reagan; Fang, Hong; Ding, Don; Tong, Weida
2011-07-18
The primary testing strategy to identify nongenotoxic carcinogens largely relies on the 2-year rodent bioassay, which is time-consuming and labor-intensive. There is an increasing effort to develop alternative approaches to prioritize the chemicals for, supplement, or even replace the cancer bioassay. In silico approaches based on quantitative structure-activity relationships (QSAR) are rapid and inexpensive and thus have been investigated for such purposes. A slightly more expensive approach based on short-term animal studies with toxicogenomics (TGx) represents another attractive option for this application. Thus, the primary questions are how much better predictive performance using short-term TGx models can be achieved compared to that of QSAR models, and what length of exposure is sufficient for high quality prediction based on TGx. In this study, we developed predictive models for rodent liver carcinogenicity using gene expression data generated from short-term animal models at different time points and QSAR. The study was focused on the prediction of nongenotoxic carcinogenicity since the genotoxic chemicals can be inexpensively removed from further development using various in vitro assays individually or in combination. We identified 62 chemicals whose hepatocarcinogenic potential was available from the National Center for Toxicological Research liver cancer database (NCTRlcdb). The gene expression profiles of liver tissue obtained from rats treated with these chemicals at different time points (1 day, 3 days, and 5 days) are available from the Gene Expression Omnibus (GEO) database. Both TGx and QSAR models were developed on the basis of the same set of chemicals using the same modeling approach, a nearest-centroid method with a minimum redundancy and maximum relevancy-based feature selection with performance assessed using compound-based 5-fold cross-validation. We found that the TGx models outperformed QSAR in every aspect of modeling. For example, the TGx models' predictive accuracy (0.77, 0.77, and 0.82 for the 1-day, 3-day, and 5-day models, respectively) was much higher for an independent validation set than that of a QSAR model (0.55). Permutation tests confirmed the statistical significance of the model's prediction performance. The study concluded that a short-term 5-day TGx animal model holds the potential to predict nongenotoxic hepatocarcinogenicity. © 2011 American Chemical Society
QSAR models for anti-malarial activity of 4-aminoquinolines.
Masand, Vijay H; Toropov, Andrey A; Toropova, Alla P; Mahajan, Devidas T
2014-03-01
In the present study, predictive quantitative structure - activity relationship (QSAR) models for anti-malarial activity of 4-aminoquinolines have been developed. CORAL, which is freely available on internet (http://www.insilico.eu/coral), has been used as a tool of QSAR analysis to establish statistically robust QSAR model of anti-malarial activity of 4-aminoquinolines. Six random splits into the visible sub-system of the training and invisible subsystem of validation were examined. Statistical qualities for these splits vary, but in all these cases, statistical quality of prediction for anti-malarial activity was quite good. The optimal SMILES-based descriptor was used to derive the single descriptor based QSAR model for a data set of 112 aminoquinolones. All the splits had r(2)> 0.85 and r(2)> 0.78 for subtraining and validation sets, respectively. The three parametric multilinear regression (MLR) QSAR model has Q(2) = 0.83, R(2) = 0.84 and F = 190.39. The anti-malarial activity has strong correlation with presence/absence of nitrogen and oxygen at a topological distance of six.
Patel, Preeti; Singh, Avineesh; Patel, Vijay K; Jain, Deepak K; Veerasamy, Ravichandran; Rajak, Harish
2016-01-01
Histone deacetylase (HDAC) inhibitors can reactivate gene expression and inhibit the growth and survival of cancer cells. To identify the important pharmacophoric features and correlate 3Dchemical structure with biological activity using 3D-QSAR and Pharmacophore modeling studies. The pharmacophore hypotheses were developed using e-pharmacophore script and phase module. Pharmacophore hypothesis represents the 3D arrangement of molecular features necessary for activity. A series of 55 compounds with wellassigned HDAC inhibitory activity were used for 3D-QSAR model development. Best 3D-QSAR model, which is a five partial least square (PLS) factor model with good statistics and predictive ability, acquired Q2 (0.7293), R2 (0.9811), cross-validated coefficient rcv 2=0.9807 and R2 pred=0.7147 with low standard deviation (0.0952). Additionally, the selected pharmacophore model DDRRR.419 was used as a 3D query for virtual screening against the ZINC database. In the virtual screening workflow, docking studies (HTVS, SP and XP) were carried out by selecting multiple receptors (PDB ID: 1T69, 1T64, 4LXZ, 4LY1, 3MAX, 2VQQ, 3C10, 1W22). Finally, six compounds were obtained based on high scoring function (dock score -11.2278-10.2222 kcal/mol) and diverse structures. The structure activity correlation was established using virtual screening, docking, energetic based pharmacophore modelling, pharmacophore, atom based 3D QSAR models and their validation. The outcomes of these studies could be further employed for the design of novel HDAC inhibitors for anticancer activity.
A novel method to estimate the affinity of HLA-A∗0201 restricted CTL epitope
NASA Astrophysics Data System (ADS)
Xu, Yun-sheng; Lin, Yong; Zhu, Bo; Lin, Zhi-hua
2009-02-01
A set of 70 peptides with affinity for the class I MHC HLA-A∗0201 molecule was subjected to quantitative structure-affinity relationship studies based on the SCORE function with good results ( r2 = 0.6982, RMS = 0.280). Then the 'leave-one-out' cross-validation (LOO-CV) and an outer test set including 18 outer samples were used to validate the QSAR model. The results of the LOO-CV were q2 = 0.6188, RMS = 0.315, and the results of outer test set were r2 = 0.5633, RMS = 0.2292. All these show that the QSAR model has good predictability. Statistical analysis showed that the hydrophobic and hydrogen bond interaction played a significant role in peptide-MHC molecule binding. The study also provided useful information for structure modification of CTL epitope, and laid theoretical base for molecular design of therapeutic vaccine.
Departments of Defense and Agriculture Team Up to Develop New Insecticides for Mosquito Control
2010-01-01
archives of insecticide data by quantita- tive structure-activity relationship ( QSAR ) modeling to predict and synthesize new insecticides. This...blood- sucking arthropods. The key thrust of IIBBL’s approach involves QSAR -based modeling of fast-acting pyrethroid insecticides to predict and
Kim, Kwang-Yon; Shin, Seong Eun; No, Kyoung Tai
2015-01-01
Objectives For successful adoption of legislation controlling registration and assessment of chemical substances, it is important to obtain sufficient toxicological experimental evidence and other related information. It is also essential to obtain a sufficient number of predicted risk and toxicity results. Particularly, methods used in predicting toxicities of chemical substances during acquisition of required data, ultimately become an economic method for future dealings with new substances. Although the need for such methods is gradually increasing, the-required information about reliability and applicability range has not been systematically provided. Methods There are various representative environmental and human toxicity models based on quantitative structure-activity relationships (QSAR). Here, we secured the 10 representative QSAR-based prediction models and its information that can make predictions about substances that are expected to be regulated. We used models that predict and confirm usability of the information expected to be collected and submitted according to the legislation. After collecting and evaluating each predictive model and relevant data, we prepared methods quantifying the scientific validity and reliability, which are essential conditions for using predictive models. Results We calculated predicted values for the models. Furthermore, we deduced and compared adequacies of the models using the Alternative non-testing method assessed for Registration, Evaluation, Authorization, and Restriction of Chemicals Substances scoring system, and deduced the applicability domains for each model. Additionally, we calculated and compared inclusion rates of substances expected to be regulated, to confirm the applicability. Conclusions We evaluated and compared the data, adequacy, and applicability of our selected QSAR-based toxicity prediction models, and included them in a database. Based on this data, we aimed to construct a system that can be used with predicted toxicity results. Furthermore, by presenting the suitability of individual predicted results, we aimed to provide a foundation that could be used in actual assessments and regulations. PMID:26206368
Naik, P K; Singh, T; Singh, H
2009-07-01
Quantitative structure-activity relationship (QSAR) analyses were performed independently on data sets belonging to two groups of insecticides, namely the organophosphates and carbamates. Several types of descriptors including topological, spatial, thermodynamic, information content, lead likeness and E-state indices were used to derive quantitative relationships between insecticide activities and structural properties of chemicals. A systematic search approach based on missing value, zero value, simple correlation and multi-collinearity tests as well as the use of a genetic algorithm allowed the optimal selection of the descriptors used to generate the models. The QSAR models developed for both organophosphate and carbamate groups revealed good predictability with r(2) values of 0.949 and 0.838 as well as [image omitted] values of 0.890 and 0.765, respectively. In addition, a linear correlation was observed between the predicted and experimental LD(50) values for the test set data with r(2) of 0.871 and 0.788 for both the organophosphate and carbamate groups, indicating that the prediction accuracy of the QSAR models was acceptable. The models were also tested successfully from external validation criteria. QSAR models developed in this study should help further design of novel potent insecticides.
Di Tullio, Maurizio; Maccallini, Cristina; Ammazzalorso, Alessandra; Giampietro, Letizia; Amoroso, Rosa; De Filippis, Barbara; Fantacuzzi, Marialuigia; Wiczling, Paweł; Kaliszan, Roman
2012-07-01
A series of 27 analogues of clofibric acid, mostly heteroarylalkanoic derivatives, have been analyzed by a novel high-throughput reversed-phase HPLC method employing combined gradient of eluent's pH and organic modifier content. The such determined hydrophobicity (lipophilicity) parameters, log kw , and acidity constants, pKa , were subjected to multiple regression analysis to get a QSRR (Quantitative StructureRetention Relationships) and a QSPR (Quantitative Structure-Property Relationships) equation, respectively, describing these pharmacokinetics-determining physicochemical parameters in terms of the calculation chemistry derived structural descriptors. The previously determined in vitro log EC50 values - transactivation activity towards PPARα (human Peroxisome Proliferator-Activated Receptor α) - have also been described in a QSAR (Quantitative StructureActivity Relationships) equation in terms of the 3-D-MoRSE descriptors (3D-Molecule Representation of Structures based on Electron diffraction descriptors). The QSAR model derived can serve for an a priori prediction of bioactivity in vitro of any designed analogue, whereas the QSRR and the QSPR models can be used to evaluate lipophilicity and acidity, respectively, of the compounds, and hence to rational guide selection of structures of proper pharmacokinetics. Copyright © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Alert-QSAR. Implications for Electrophilic Theory of Chemical Carcinogenesis
Putz, Mihai V.; Ionaşcu, Cosmin; Putz, Ana-Maria; Ostafe, Vasile
2011-01-01
Given the modeling and predictive abilities of quantitative structure activity relationships (QSARs) for genotoxic carcinogens or mutagens that directly affect DNA, the present research investigates structural alert (SA) intermediate-predicted correlations ASA of electrophilic molecular structures with observed carcinogenic potencies in rats (observed activity, A = Log[1/TD50], i.e., ASA=f(X1SA,X2SA,…)). The present method includes calculation of the recently developed residual correlation of the structural alert models, i.e., ARASA=f(A−ASA,X1SA,X2SA,…). We propose a specific electrophilic ligand-receptor mechanism that combines electronegativity with chemical hardness-associated frontier principles, equality of ligand-reagent electronegativities and ligand maximum chemical hardness for highly diverse toxic molecules against specific receptors in rats. The observed carcinogenic activity is influenced by the induced SA-mutagenic intermediate effect, alongside Hansch indices such as hydrophobicity (LogP), polarizability (POL) and total energy (Etot), which account for molecular membrane diffusion, ionic deformation, and stericity, respectively. A possible QSAR mechanistic interpretation of mutagenicity as the first step in genotoxic carcinogenesis development is discussed using the structural alert chemoinformation and in full accordance with the Organization for Economic Co-operation and Development QSAR guidance principles. PMID:21954348
Fjodorova, Natalja; Novič, Marjana
2012-01-01
The knowledge-based Toxtree expert system (SAR approach) was integrated with the statistically based counter propagation artificial neural network (CP ANN) model (QSAR approach) to contribute to a better mechanistic understanding of a carcinogenicity model for non-congeneric chemicals using Dragon descriptors and carcinogenic potency for rats as a response. The transparency of the CP ANN algorithm was demonstrated using intrinsic mapping technique specifically Kohonen maps. Chemical structures were represented by Dragon descriptors that express the structural and electronic features of molecules such as their shape and electronic surrounding related to reactivity of molecules. It was illustrated how the descriptors are correlated with particular structural alerts (SAs) for carcinogenicity with recognized mechanistic link to carcinogenic activity. Moreover, the Kohonen mapping technique enables one to examine the separation of carcinogens and non-carcinogens (for rats) within a family of chemicals with a particular SA for carcinogenicity. The mechanistic interpretation of models is important for the evaluation of safety of chemicals. PMID:24688639
The discovery of indicator variables for QSAR using inductive logic programming
NASA Astrophysics Data System (ADS)
King, Ross D.; Srinivasan, Ashwin
1997-11-01
A central problem in forming accurate regression equations in QSAR studies isthe selection of appropriate descriptors for the compounds under study. Wedescribe a novel procedure for using inductive logic programming (ILP) todiscover new indicator variables (attributes) for QSAR problems, and show thatthese improve the accuracy of the derived regression equations. ILP techniqueshave previously been shown to work well on drug design problems where thereis a large structural component or where clear comprehensible rules arerequired. However, ILP techniques have had the disadvantage of only being ableto make qualitative predictions (e.g. active, inactive) and not to predictreal numbers (regression). We unify ILP and linear regression techniques togive a QSAR method that has the strength of ILP at describing stericstructure, with the familiarity and power of linear regression. We evaluatedthe utility of this new QSAR technique by examining the prediction ofbiological activity with and without the addition of new structural indicatorvariables formed by ILP. In three out of five datasets examined the additionof ILP variables produced statistically better results (P < 0.01) over theoriginal description. The new ILP variables did not increase the overallcomplexity of the derived QSAR equations and added insight into possiblemechanisms of action. We conclude that ILP can aid in the process of drugdesign.
NASA Astrophysics Data System (ADS)
Cao, Shandong
2012-08-01
The purpose of the present study was to develop in silico models allowing for a reliable prediction of polo-like kinase inhibitors based on a large diverse dataset of 136 compounds. As an effective method, quantitative structure activity relationship (QSAR) was applied using the comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA). The proposed QSAR models showed reasonable predictivity of thiophene analogs (Rcv2=0.533, Rpred2=0.845) and included four molecular descriptors, namely IC3, RDF075m, Mor02m and R4e+. The optimal model for imidazopyridine derivatives (Rcv2=0.776, Rpred2=0.876) was shown to perform good in prediction accuracy, using GATS2m and BEHe1 descriptors. Analysis of the contour maps helped to identify structural requirements for the inhibitors and served as a basis for the design of the next generation of the inhibitor analogues. Docking studies were also employed to position the inhibitors into the polo-like kinase active site to determine the most probable binding mode. These studies may help to understand the factors influencing the binding affinity of chemicals and to develop alternative methods for prescreening and designing of polo-like kinase inhibitors.
NASA Astrophysics Data System (ADS)
Liu, Jianzhong; Kern, Petra S.; Gerberick, G. Frank; Santos-Filho, Osvaldo A.; Esposito, Emilio X.; Hopfinger, Anton J.; Tseng, Yufeng J.
2008-06-01
In previous studies we have developed categorical QSAR models for predicting skin-sensitization potency based on 4D-fingerprint (4D-FP) descriptors and in vivo murine local lymph node assay (LLNA) measures. Only 4D-FP derived from the ground state (GMAX) structures of the molecules were used to build the QSAR models. In this study we have generated 4D-FP descriptors from the first excited state (EMAX) structures of the molecules. The GMAX, EMAX and the combined ground and excited state 4D-FP descriptors (GEMAX) were employed in building categorical QSAR models. Logistic regression (LR) and partial least square coupled logistic regression (PLS-CLR), found to be effective model building for the LLNA skin-sensitization measures in our previous studies, were used again in this study. This also permitted comparison of the prior ground state models to those involving first excited state 4D-FP descriptors. Three types of categorical QSAR models were constructed for each of the GMAX, EMAX and GEMAX datasets: a binary model (2-state), an ordinal model (3-state) and a binary-binary model (two-2-state). No significant differences exist among the LR 2-state model constructed for each of the three datasets. However, the PLS-CLR 3-state and 2-state models based on the EMAX and GEMAX datasets have higher predictivity than those constructed using only the GMAX dataset. These EMAX and GMAX categorical models are also more significant and predictive than corresponding models built in our previous QSAR studies of LLNA skin-sensitization measures.
NASA Astrophysics Data System (ADS)
Ragno, Rino; Ballante, Flavio; Pirolli, Adele; Wickersham, Richard B.; Patsilinakos, Alexandros; Hesse, Stéphanie; Perspicace, Enrico; Kirsch, Gilbert
2015-08-01
Vascular endothelial growth factor receptor-2, (VEGFR-2), is a key element in angiogenesis, the process by which new blood vessels are formed, and is thus an important pharmaceutical target. Here, 3-D quantitative structure-activity relationship (3-D QSAR) were used to build a quantitative screening and pharmacophore model of the VEGFR-2 receptors for design of inhibitors with improved activities. Most of available experimental data information has been used as training set to derive optimized and fully cross-validated eight mono-probe and a multi-probe quantitative models. Notable is the use of 262 molecules, aligned following both structure-based and ligand-based protocols, as external test set confirming the 3-D QSAR models' predictive capability and their usefulness in design new VEGFR-2 inhibitors. From a survey on literature, this is the first generation of a wide-ranging computational medicinal chemistry application on VEGFR2 inhibitors.
Pérez-Garrido, Alfonso; Morales Helguera, Aliuska; Abellán Guillén, Adela; Cordeiro, M Natália D S; Garrido Escudero, Amalio
2009-01-15
This paper reports a QSAR study for predicting the complexation of a large and heterogeneous variety of substances (233 organic compounds) with beta-cyclodextrins (beta-CDs). Several different theoretical molecular descriptors, calculated solely from the molecular structure of the compounds under investigation, and an efficient variable selection procedure, like the Genetic Algorithm, led to models with satisfactory global accuracy and predictivity. But the best-final QSAR model is based on Topological descriptors meanwhile offering a reasonable interpretation. This QSAR model was able to explain ca. 84% of the variance in the experimental activity, and displayed very good internal cross-validation statistics and predictivity on external data. It shows that the driving forces for CD complexation are mainly hydrophobic and steric (van der Waals) interactions. Thus, the results of our study provide a valuable tool for future screening and priority testing of beta-CDs guest molecules.
Hattotuwagama, Channa K; Guan, Pingping; Doytchinova, Irini A; Flower, Darren R
2004-11-21
Quantitative structure-activity relationship (QSAR) analysis is a main cornerstone of modern informatic disciplines. Predictive computational models, based on QSAR technology, of peptide-major histocompatibility complex (MHC) binding affinity have now become a vital component of modern day computational immunovaccinology. Historically, such approaches have been built around semi-qualitative, classification methods, but these are now giving way to quantitative regression methods. The additive method, an established immunoinformatics technique for the quantitative prediction of peptide-protein affinity, was used here to identify the sequence dependence of peptide binding specificity for three mouse class I MHC alleles: H2-D(b), H2-K(b) and H2-K(k). As we show, in terms of reliability the resulting models represent a significant advance on existing methods. They can be used for the accurate prediction of T-cell epitopes and are freely available online ( http://www.jenner.ac.uk/MHCPred).
Sparse QSAR modelling methods for therapeutic and regenerative medicine
NASA Astrophysics Data System (ADS)
Winkler, David A.
2018-02-01
The quantitative structure-activity relationships method was popularized by Hansch and Fujita over 50 years ago. The usefulness of the method for drug design and development has been shown in the intervening years. As it was developed initially to elucidate which molecular properties modulated the relative potency of putative agrochemicals, and at a time when computing resources were scarce, there is much scope for applying modern mathematical methods to improve the QSAR method and to extending the general concept to the discovery and optimization of bioactive molecules and materials more broadly. I describe research over the past two decades where we have rebuilt the unit operations of the QSAR method using improved mathematical techniques, and have applied this valuable platform technology to new important areas of research and industry such as nanoscience, omics technologies, advanced materials, and regenerative medicine. This paper was presented as the 2017 ACS Herman Skolnik lecture.
Use of the Monte Carlo Method for OECD Principles-Guided QSAR Modeling of SIRT1 Inhibitors.
Kumar, Ashwani; Chauhan, Shilpi
2017-01-01
SIRT1 inhibitors offer therapeutic potential for the treatment of a number of diseases including cancer and human immunodeficiency virus infection. A diverse series of 45 compounds with reported SIRT1 inhibitory activity has been employed for the development of quantitative structure-activity relationship (QSAR) models using the Monte Carlo optimization method. This method makes use of simplified molecular input line entry system notation of the molecular structure. The QSAR models were built up according to OECD principles. Three subsets of three splits were examined and validated by respective external sets. All the three described models have good statistical quality. The best model has the following statistical characteristics: R 2 = 0.8350, Q 2 test = 0.7491 for the test set and R 2 = 0.9655, Q 2 ext = 0.9261 for the validation set. In the mechanistic interpretation, structural attributes responsible for the endpoint increase and decrease are defined. Further, the design of some prospective SIRT1 inhibitors is also presented on the basis of these structural attributes. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Goyal, Sukriti; Dhanjal, Jaspreet K; Tyagi, Chetna; Goyal, Manisha; Grover, Abhinav
2014-07-01
The CRK3 cyclin-dependent kinase of Leishmania plays an important role in regulating the cell-cycle progression at the G2/M phase checkpoint transition, proliferation, and viability inside the host macrophage. In this study, a novel fragment-based QSAR model has been developed using 22 pyrazole-derived compounds exhibiting inhibitory activity against Leishmanial CRK3. Unlike other QSAR methods, this fragment-based method gives flexibility to study the relationship between molecular fragments of interest and their contribution for the variation in the biological response by evaluating cross-term fragment descriptors. Based on the fragment-based QSAR model, a combinatorial library was generated, and top two compounds were reported after predicting their activity. The QSAR model showed satisfactory statistical parameters for the data set (r(2) = 0.8752, q(2) = 0.6690, F-ratio = 30.37, and pred_r(2) = 0.8632) with four descriptors describing the nature of substituent groups and the environment of the substitution site. Evaluation of the model implied that electron-rich substitution at R1 position improves the inhibitory activity, while decline in inhibitory activity was observed in presence of nitrogen at R2 position. The analysis carried out in this study provides a substantial basis for consideration of the designed pyrazole-based leads as potent antileishmanial drugs. © 2014 John Wiley & Sons A/S.
Comparative study of topological indices of macro/supramolecular RNA complex networks.
Agüero-Chapín, Guillermín; Antunes, Agostinho; Ubeira, Florencio M; Chou, Kuo-Chen; González-Díaz, Humberto
2008-11-01
RNA function annotation is often based on alignment to a previously studied template. In contrast to the study of proteins, there are not many alignment-free methods to predict RNA functions if alignment fails. The use of topological indices (TIs) of RNA complex networks (CNs) to find quantitative structure-activity relationships (QSAR) may be an alternative to incorporate secondary structure or sequence-to-sequence similarity. Here, we introduce new QSAR-like techniques using RNA macromolecular CNs (mmCNs), where nodes are nucleotides, or RNA supramolecular CNs (smCNs), where nodes are RNA sequences. We studied a data set of 198 sequences including 18S-rRNAs (important phylogenetic molecular biomarkers). We constructed three types of RNA mmCNs: sequence-linear (SL), Cartesian-lattice (CL), and sequence-folding CNs (SF-CNs) and two smCNs: sequence-sequence disagreement CN (SSD) and sequence-sequence similarity (SSS-smCN). We reported the first comparative QSAR study with all these CIs and CNs, which includes: (i) spectral moments ( ( i )micro d ( w)) of SL-mmCNs (accuracy = 75.3%), (ii) electrostatic CIs (xi d ) of CL-mmCNs (>90%), (iii) thermodynamic parameters (Delta G, Delta H, Delta S, and T m) of SF-mmCNs (64.7%), (iv) disagreement-distribution moments ( M k ) of the SSD-smCN (79.3%), and (v) node centralities of the SSD-smCN (78.0%). Furthermore, we reported the experimental isolation of a new RNA sequence from Psidum guajava leaf tissue and its QSAR and BLAST prediction to illustrate the practical use of these methods. We also investigated the use of these CNs to explore rRNA diversity on bacteria, plants, and parasites from the Dactylogyrus genus. The HPL-mmCNs model was the best of all found. All the CNs and TIs, except SF-mmCNs, were introduced here by the first time for the QSAR study of RNA, which allowed a comparative study for RNA classification.
QSAR modeling for predicting mutagenic toxicity of diverse chemicals for regulatory purposes.
Basant, Nikita; Gupta, Shikha
2017-06-01
The safety assessment process of chemicals requires information on their mutagenic potential. The experimental determination of mutagenicity of a large number of chemicals is tedious and time and cost intensive, thus compelling for alternative methods. We have established local and global QSAR models for discriminating low and high mutagenic compounds and predicting their mutagenic activity in a quantitative manner in Salmonella typhimurium (TA) bacterial strains (TA98 and TA100). The decision treeboost (DTB)-based classification QSAR models discriminated among two categories with accuracies of >96% and the regression QSAR models precisely predicted the mutagenic activity of diverse chemicals yielding high correlations (R 2 ) between the experimental and model-predicted values in the respective training (>0.96) and test (>0.94) sets. The test set root mean squared error (RMSE) and mean absolute error (MAE) values emphasized the usefulness of the developed models for predicting new compounds. Relevant structural features of diverse chemicals that were responsible and influence the mutagenic activity were identified. The applicability domains of the developed models were defined. The developed models can be used as tools for screening new chemicals for their mutagenicity assessment for regulatory purpose.
In general, the accuracy of a predicted toxicity value increases with increase in similarity between the query chemical and the chemicals used to develop a QSAR model. A toxicity estimation methodology employing this finding has been developed. A hierarchical based clustering t...
Li, Jiazhong; Gramatica, Paola
2010-11-01
Quantitative structure-activity relationship (QSAR) methodology aims to explore the relationship between molecular structures and experimental endpoints, producing a model for the prediction of new data; the predictive performance of the model must be checked by external validation. Clearly, the qualities of chemical structure information and experimental endpoints, as well as the statistical parameters used to verify the external predictivity have a strong influence on QSAR model reliability. Here, we emphasize the importance of these three aspects by analyzing our models on estrogen receptor binders (Endocrine disruptor knowledge base (EDKB) database). Endocrine disrupting chemicals, which mimic or antagonize the endogenous hormones such as estrogens, are a hot topic in environmental and toxicological sciences. QSAR shows great values in predicting the estrogenic activity and exploring the interactions between the estrogen receptor and ligands. We have verified our previously published model for additional external validation on new EDKB chemicals. Having found some errors in the used 3D molecular conformations, we redevelop a new model using the same data set with corrected structures, the same method (ordinary least-square regression, OLS) and DRAGON descriptors. The new model, based on some different descriptors, is more predictive on external prediction sets. Three different formulas to calculate correlation coefficient for the external prediction set (Q2 EXT) were compared, and the results indicated that the new proposal of Consonni et al. had more reasonable results, consistent with the conclusions from regression line, Williams plot and root mean square error (RMSE) values. Finally, the importance of reliable endpoints values has been highlighted by comparing the classification assignments of EDKB with those of another estrogen receptor binders database (METI): we found that 16.1% assignments of the common compounds were opposite (20 among 124 common compounds). In order to verify the real assignments for these inconsistent compounds, we predicted these samples, as a blind external set, by our regression models and compared the results with the two databases. The results indicated that most of the predictions were consistent with METI. Furthermore, we built a kNN classification model using the 104 consistent compounds to predict those inconsistent ones, and most of the predictions were also in agreement with METI database.
QSAR Analysis of 2-Amino or 2-Methyl-1-Substituted Benzimidazoles Against Pseudomonas aeruginosa
Podunavac-Kuzmanović, Sanja O.; Cvetković, Dragoljub D.; Barna, Dijana J.
2009-01-01
A set of benzimidazole derivatives were tested for their inhibitory activities against the Gram-negative bacterium Pseudomonas aeruginosa and minimum inhibitory concentrations were determined for all the compounds. Quantitative structure activity relationship (QSAR) analysis was applied to fourteen of the abovementioned derivatives using a combination of various physicochemical, steric, electronic, and structural molecular descriptors. A multiple linear regression (MLR) procedure was used to model the relationships between molecular descriptors and the antibacterial activity of the benzimidazole derivatives. The stepwise regression method was used to derive the most significant models as a calibration model for predicting the inhibitory activity of this class of molecules. The best QSAR models were further validated by a leave one out technique as well as by the calculation of statistical parameters for the established theoretical models. To confirm the predictive power of the models, an external set of molecules was used. High agreement between experimental and predicted inhibitory values, obtained in the validation procedure, indicated the good quality of the derived QSAR models. PMID:19468332
The anesthetic action of some polyhalogenated ethers-Monte Carlo method based QSAR study.
Golubović, Mlađan; Lazarević, Milan; Zlatanović, Dragan; Krtinić, Dane; Stoičkov, Viktor; Mladenović, Bojan; Milić, Dragan J; Sokolović, Dušan; Veselinović, Aleksandar M
2018-04-13
Up to this date, there has been an ongoing debate about the mode of action of general anesthetics, which have postulated many biological sites as targets for their action. However, postoperative nausea and vomiting are common problems in which inhalational agents may have a role in their development. When a mode of action is unknown, QSAR modelling is essential in drug development. To investigate the aspects of their anesthetic, QSAR models based on the Monte Carlo method were developed for a set of polyhalogenated ethers. Until now, their anesthetic action has not been completely defined, although some hypotheses have been suggested. Therefore, a QSAR model should be developed on molecular fragments that contribute to anesthetic action. QSAR models were built on the basis of optimal molecular descriptors based on the SMILES notation and local graph invariants, whereas the Monte Carlo optimization method with three random splits into the training and test set was applied for model development. Different methods, including novel Index of ideality correlation, were applied for the determination of the robustness of the model and its predictive potential. The Monte Carlo optimization process was capable of being an efficient in silico tool for building up a robust model of good statistical quality. Molecular fragments which have both positive and negative influence on anesthetic action were determined. The presented study can be useful in the search for novel anesthetics. Copyright © 2018 Elsevier Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Alves, Vinicius M.; Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599; Muratov, Eugene
Skin permeability is widely considered to be mechanistically implicated in chemically-induced skin sensitization. Although many chemicals have been identified as skin sensitizers, there have been very few reports analyzing the relationships between molecular structure and skin permeability of sensitizers and non-sensitizers. The goals of this study were to: (i) compile, curate, and integrate the largest publicly available dataset of chemicals studied for their skin permeability; (ii) develop and rigorously validate QSAR models to predict skin permeability; and (iii) explore the complex relationships between skin sensitization and skin permeability. Based on the largest publicly available dataset compiled in this study, wemore » found no overall correlation between skin permeability and skin sensitization. In addition, cross-species correlation coefficient between human and rodent permeability data was found to be as low as R{sup 2} = 0.44. Human skin permeability models based on the random forest method have been developed and validated using OECD-compliant QSAR modeling workflow. Their external accuracy was high (Q{sup 2}{sub ext} = 0.73 for 63% of external compounds inside the applicability domain). The extended analysis using both experimentally-measured and QSAR-imputed data still confirmed the absence of any overall concordance between skin permeability and skin sensitization. This observation suggests that chemical modifications that affect skin permeability should not be presumed a priori to modulate the sensitization potential of chemicals. The models reported herein as well as those developed in the companion paper on skin sensitization suggest that it may be possible to rationally design compounds with the desired high skin permeability but low sensitization potential. - Highlights: • It was compiled the largest publicly-available skin permeability dataset. • Predictive QSAR models were developed for skin permeability. • No concordance between skin sensitization and skin permeability has been found. • Structural rules for optimizing sensitization and penetration were established.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Politi, Regina; Department of Environmental Sciences and Engineering, University of North Carolina, Chapel Hill, NC 27599; Rusyn, Ivan, E-mail: iir@unc.edu
2014-10-01
The thyroid hormone receptor (THR) is an important member of the nuclear receptor family that can be activated by endocrine disrupting chemicals (EDC). Quantitative Structure–Activity Relationship (QSAR) models have been developed to facilitate the prioritization of THR-mediated EDC for the experimental validation. The largest database of binding affinities available at the time of the study for ligand binding domain (LBD) of THRβ was assembled to generate both continuous and classification QSAR models with an external accuracy of R{sup 2} = 0.55 and CCR = 0.76, respectively. In addition, for the first time a QSAR model was developed to predict bindingmore » affinities of antagonists inhibiting the interaction of coactivators with the AF-2 domain of THRβ (R{sup 2} = 0.70). Furthermore, molecular docking studies were performed for a set of THRβ ligands (57 agonists and 15 antagonists of LBD, 210 antagonists of the AF-2 domain, supplemented by putative decoys/non-binders) using several THRβ structures retrieved from the Protein Data Bank. We found that two agonist-bound THRβ conformations could effectively discriminate their corresponding ligands from presumed non-binders. Moreover, one of the agonist conformations could discriminate agonists from antagonists. Finally, we have conducted virtual screening of a chemical library compiled by the EPA as part of the Tox21 program to identify potential THRβ-mediated EDCs using both QSAR models and docking. We concluded that the library is unlikely to have any EDC that would bind to the THRβ. Models developed in this study can be employed either to identify environmental chemicals interacting with the THR or, conversely, to eliminate the THR-mediated mechanism of action for chemicals of concern. - Highlights: • This is the largest curated dataset for ligand binding domain (LBD) of the THRβ. • We report the first QSAR model for antagonists of AF-2 domain of THRβ. • A combination of QSAR and docking enables prediction of both affinity and efficacy. • Models can be used to identify environmental chemicals interacting with THRβ. • Models can be used to eliminate the THRβ-mediated mechanism of action.« less
Fassihi, Afshin; Sabet, Razieh
2008-01-01
Quantitative relationships between molecular structure and p56lck protein tyrosine kinase inhibitory activity of 50 flavonoid derivatives are discovered by MLR and GA-PLS methods. Different QSAR models revealed that substituent electronic descriptors (SED) parameters have significant impact on protein tyrosine kinase inhibitory activity of the compounds. Between the two statistical methods employed, GA-PLS gave superior results. The resultant GA-PLS model had a high statistical quality (R2 = 0.74 and Q2 = 0.61) for predicting the activity of the inhibitors. The models proposed in the present work are more useful in describing QSAR of flavonoid derivatives as p56lck protein tyrosine kinase inhibitors than those provided previously. PMID:19325836
Mladenović, Milan; Patsilinakos, Alexandros; Pirolli, Adele; Sabatino, Manuela; Ragno, Rino
2017-04-24
Monoamine oxidase B (MAO B) catalyzes the oxidative deamination of aryalkylamines neurotransmitters with concomitant reduction of oxygen to hydrogen peroxide. Consequently, the enzyme's malfunction can induce oxidative damage to mitochondrial DNA and mediates development of Parkinson's disease. Thus, MAO B emerges as a promising target for developing pharmaceuticals potentially useful to treat this vicious neurodegenerative condition. Aiming to contribute to the development of drugs with the reversible mechanism of MAO B inhibition only, herein, an extended in silico-in vitro procedure for the selection of novel MAO B inhibitors is demonstrated, including the following: (1) definition of optimized and validated structure-based three-dimensional (3-D) quantitative structure-activity relationships (QSAR) models derived from available cocrystallized inhibitor-MAO B complexes; (2) elaboration of SAR features for either irreversible or reversible MAO B inhibitors to characterize and improve coumarin-based inhibitor activity (Protein Data Bank ID: 2V61 ) as the most potent reversible lead compound; (3) definition of structure-based (SB) and ligand-based (LB) alignment rule assessments by which virtually any untested potential MAO B inhibitor might be evaluated; (4) predictive ability validation of the best 3-D QSAR model through SB/LB modeling of four coumarin-based external test sets (267 compounds); (5) design and SB/LB alignment of novel coumarin-based scaffolds experimentally validated through synthesis and biological evaluation in vitro. Due to the wide range of molecular diversity within the 3-D QSAR training set and derived features, the selected N probe-derived 3-D QSAR model proves to be a valuable tool for virtual screening (VS) of novel MAO B inhibitors and a platform for design, synthesis and evaluation of novel active structures. Accordingly, six highly active and selective MAO B inhibitors (picomolar to low nanomolar range of activity) were disclosed as a result of rational SB/LB 3D QSAR design; therefore, D123 (IC 50 = 0.83 nM, K i = 0.25 nM) and D124 (IC 50 = 0.97 nM, K i = 0.29 nM) are potential lead candidates as anti-Parkinson's drugs.
Ahlberg, Ernst; Amberg, Alexander; Beilke, Lisa D; Bower, David; Cross, Kevin P; Custer, Laura; Ford, Kevin A; Van Gompel, Jacky; Harvey, James; Honma, Masamitsu; Jolly, Robert; Joossens, Elisabeth; Kemper, Raymond A; Kenyon, Michelle; Kruhlak, Naomi; Kuhnke, Lara; Leavitt, Penny; Naven, Russell; Neilan, Claire; Quigley, Donald P; Shuey, Dana; Spirkl, Hans-Peter; Stavitskaya, Lidiya; Teasdale, Andrew; White, Angela; Wichard, Joerg; Zwickl, Craig; Myatt, Glenn J
2016-06-01
Statistical-based and expert rule-based models built using public domain mutagenicity knowledge and data are routinely used for computational (Q)SAR assessments of pharmaceutical impurities in line with the approach recommended in the ICH M7 guideline. Knowledge from proprietary corporate mutagenicity databases could be used to increase the predictive performance for selected chemical classes as well as expand the applicability domain of these (Q)SAR models. This paper outlines a mechanism for sharing knowledge without the release of proprietary data. Primary aromatic amine mutagenicity was selected as a case study because this chemical class is often encountered in pharmaceutical impurity analysis and mutagenicity of aromatic amines is currently difficult to predict. As part of this analysis, a series of aromatic amine substructures were defined and the number of mutagenic and non-mutagenic examples for each chemical substructure calculated across a series of public and proprietary mutagenicity databases. This information was pooled across all sources to identify structural classes that activate or deactivate aromatic amine mutagenicity. This structure activity knowledge, in combination with newly released primary aromatic amine data, was incorporated into Leadscope's expert rule-based and statistical-based (Q)SAR models where increased predictive performance was demonstrated. Copyright © 2016 Elsevier Inc. All rights reserved.
QSAR modelling using combined simple competitive learning networks and RBF neural networks.
Sheikhpour, R; Sarram, M A; Rezaeian, M; Sheikhpour, E
2018-04-01
The aim of this study was to propose a QSAR modelling approach based on the combination of simple competitive learning (SCL) networks with radial basis function (RBF) neural networks for predicting the biological activity of chemical compounds. The proposed QSAR method consisted of two phases. In the first phase, an SCL network was applied to determine the centres of an RBF neural network. In the second phase, the RBF neural network was used to predict the biological activity of various phenols and Rho kinase (ROCK) inhibitors. The predictive ability of the proposed QSAR models was evaluated and compared with other QSAR models using external validation. The results of this study showed that the proposed QSAR modelling approach leads to better performances than other models in predicting the biological activity of chemical compounds. This indicated the efficiency of simple competitive learning networks in determining the centres of RBF neural networks.
Metabolic biotransformation half-lives in fish: QSAR modeling and consensus analysis.
Papa, Ester; van der Wal, Leon; Arnot, Jon A; Gramatica, Paola
2014-02-01
Bioaccumulation in fish is a function of competing rates of chemical uptake and elimination. For hydrophobic organic chemicals bioconcentration, bioaccumulation and biomagnification potential are high and the biotransformation rate constant is a key parameter. Few measured biotransformation rate constant data are available compared to the number of chemicals that are being evaluated for bioaccumulation hazard and for exposure and risk assessment. Three new Quantitative Structure-Activity Relationships (QSARs) for predicting whole body biotransformation half-lives (HLN) in fish were developed and validated using theoretical molecular descriptors that seek to capture structural characteristics of the whole molecule and three data set splitting schemes. The new QSARs were developed using a minimal number of theoretical descriptors (n=9) and compared to existing QSARs developed using fragment contribution methods that include up to 59 descriptors. The predictive statistics of the models are similar thus further corroborating the predictive performance of the different QSARs; Q(2)ext ranges from 0.75 to 0.77, CCCext ranges from 0.86 to 0.87, RMSE in prediction ranges from 0.56 to 0.58. The new QSARs provide additional mechanistic insights into the biotransformation capacity of organic chemicals in fish by including whole molecule descriptors and they also include information on the domain of applicability for the chemical of interest. Advantages of consensus modeling for improving overall prediction and minimizing false negative errors in chemical screening assessments, for identifying potential sources of residual error in the empirical HLN database, and for identifying structural features that are not well represented in the HLN dataset to prioritize future testing needs are illustrated. © 2013.
Ferrari, Thomas; Lombardo, Anna; Benfenati, Emilio
2018-05-14
Several methods exist to develop QSAR models automatically. Some are based on indices of the presence of atoms, other on the most similar compounds, other on molecular descriptors. Here we introduce QSARpy v1.0, a new QSAR modeling tool based on a different approach: the dissimilarity. This tool fragments the molecules of the training set to extract fragments that can be associated to a difference in the property/activity value, called modulators. If the target molecule share part of the structure with a molecule of the training set and differences can be explained with one or more modulators, the property/activity value of the molecule of the training set is adjusted using the value associated to the modulator(s). This tool is tested here on the n-octanol/water partition coefficient (Kow, usually expressed in logarithmic units as log Kow). It is a key parameter in risk assessment since it is a measure of hydrophobicity. Its wide spread use makes these estimation methods very useful to reduce testing costs. Using QSARpy v1.0, we obtained a new model to predict log Kow with accurate performance (RMSE 0.43 and R 2 0.94 for the external test set), comparing favorably with other programs. QSARpy is freely available on request. Copyright © 2018 Elsevier B.V. All rights reserved.
Shi, Weimin; Zhang, Xiaoya; Shen, Qi
2010-01-01
Quantitative structure-activity relationship (QSAR) study of chemokine receptor 5 (CCR5) binding affinity of substituted 1-(3,3-diphenylpropyl)-piperidinyl amides and ureas and toxicity of aromatic compounds have been performed. The gene expression programming (GEP) was used to select variables and produce nonlinear QSAR models simultaneously using the selected variables. In our GEP implementation, a simple and convenient method was proposed to infer the K-expression from the number of arguments of the function in a gene, without building the expression tree. The results were compared to those obtained by artificial neural network (ANN) and support vector machine (SVM). It has been demonstrated that the GEP is a useful tool for QSAR modeling. Copyright 2009 Elsevier Masson SAS. All rights reserved.
Quantum chemical parameters in QSAR: what do I use when?
Hickey, James P.; Ostrander, Gary K.
1996-01-01
This chapter provides a brief overview of the numerous quantum chemical parameters that have been/are currently being used in quantitative structure activity relationships (QSAR), along with a representative bibliography. The parameters will be grouped according to their mechanistic interpretations, and representative biological and physical chemical applications will be mentioned. Parmater computation methods and the appropriate software are highlighted, as are sources for software.
Wang, Wenyi; Kim, Marlene T.; Sedykh, Alexander
2015-01-01
Purpose Experimental Blood–Brain Barrier (BBB) permeability models for drug molecules are expensive and time-consuming. As alternative methods, several traditional Quantitative Structure-Activity Relationship (QSAR) models have been developed previously. In this study, we aimed to improve the predictivity of traditional QSAR BBB permeability models by employing relevant public bio-assay data in the modeling process. Methods We compiled a BBB permeability database consisting of 439 unique compounds from various resources. The database was split into a modeling set of 341 compounds and a validation set of 98 compounds. Consensus QSAR modeling workflow was employed on the modeling set to develop various QSAR models. A five-fold cross-validation approach was used to validate the developed models, and the resulting models were used to predict the external validation set compounds. Furthermore, we used previously published membrane transporter models to generate relevant transporter profiles for target compounds. The transporter profiles were used as additional biological descriptors to develop hybrid QSAR BBB models. Results The consensus QSAR models have R2=0.638 for fivefold cross-validation and R2=0.504 for external validation. The consensus model developed by pooling chemical and transporter descriptors showed better predictivity (R2=0.646 for five-fold cross-validation and R2=0.526 for external validation). Moreover, several external bio-assays that correlate with BBB permeability were identified using our automatic profiling tool. Conclusions The BBB permeability models developed in this study can be useful for early evaluation of new compounds (e.g., new drug candidates). The combination of chemical and biological descriptors shows a promising direction to improve the current traditional QSAR models. PMID:25862462
2011-01-01
Background Machine learning has a vast range of applications. In particular, advanced machine learning methods are routinely and increasingly used in quantitative structure activity relationship (QSAR) modeling. QSAR data sets often encompass tens of thousands of compounds and the size of proprietary, as well as public data sets, is rapidly growing. Hence, there is a demand for computationally efficient machine learning algorithms, easily available to researchers without extensive machine learning knowledge. In granting the scientific principles of transparency and reproducibility, Open Source solutions are increasingly acknowledged by regulatory authorities. Thus, an Open Source state-of-the-art high performance machine learning platform, interfacing multiple, customized machine learning algorithms for both graphical programming and scripting, to be used for large scale development of QSAR models of regulatory quality, is of great value to the QSAR community. Results This paper describes the implementation of the Open Source machine learning package AZOrange. AZOrange is specially developed to support batch generation of QSAR models in providing the full work flow of QSAR modeling, from descriptor calculation to automated model building, validation and selection. The automated work flow relies upon the customization of the machine learning algorithms and a generalized, automated model hyper-parameter selection process. Several high performance machine learning algorithms are interfaced for efficient data set specific selection of the statistical method, promoting model accuracy. Using the high performance machine learning algorithms of AZOrange does not require programming knowledge as flexible applications can be created, not only at a scripting level, but also in a graphical programming environment. Conclusions AZOrange is a step towards meeting the needs for an Open Source high performance machine learning platform, supporting the efficient development of highly accurate QSAR models fulfilling regulatory requirements. PMID:21798025
Stålring, Jonna C; Carlsson, Lars A; Almeida, Pedro; Boyer, Scott
2011-07-28
Machine learning has a vast range of applications. In particular, advanced machine learning methods are routinely and increasingly used in quantitative structure activity relationship (QSAR) modeling. QSAR data sets often encompass tens of thousands of compounds and the size of proprietary, as well as public data sets, is rapidly growing. Hence, there is a demand for computationally efficient machine learning algorithms, easily available to researchers without extensive machine learning knowledge. In granting the scientific principles of transparency and reproducibility, Open Source solutions are increasingly acknowledged by regulatory authorities. Thus, an Open Source state-of-the-art high performance machine learning platform, interfacing multiple, customized machine learning algorithms for both graphical programming and scripting, to be used for large scale development of QSAR models of regulatory quality, is of great value to the QSAR community. This paper describes the implementation of the Open Source machine learning package AZOrange. AZOrange is specially developed to support batch generation of QSAR models in providing the full work flow of QSAR modeling, from descriptor calculation to automated model building, validation and selection. The automated work flow relies upon the customization of the machine learning algorithms and a generalized, automated model hyper-parameter selection process. Several high performance machine learning algorithms are interfaced for efficient data set specific selection of the statistical method, promoting model accuracy. Using the high performance machine learning algorithms of AZOrange does not require programming knowledge as flexible applications can be created, not only at a scripting level, but also in a graphical programming environment. AZOrange is a step towards meeting the needs for an Open Source high performance machine learning platform, supporting the efficient development of highly accurate QSAR models fulfilling regulatory requirements.
Liu, Ming; He, Lin; Hu, Xiaopeng; Liu, Peiqing; Luo, Hai-Bin
2010-12-01
The nociceptin/orphanin FQ receptor (NOP) has been implicated in a wide range of biological functions, including pain, anxiety, depression and drug abuse. Especially, its agonists have a great potential to be developed into anxiolytics. However, the crystal structure of NOP is still not available. In the present work, both structure-based and ligand-based modeling methods have been used to achieve a comprehensive understanding on 67N-substituted spiropiperidine analogues as NOP agonists. The comparative molecular-field analysis method was performed to formulate a reasonable 3D-QSAR model (cross-validated coefficient q(2)=0.819 and conventional r(2)=0.950), whose robustness and predictability were further verified by leave-eight-out, Y-randomization, and external test-set validations. The excellent performance of CoMFA to the affinity differences among these compounds was attributed to the contributions of electrostatic/hydrogen-bonding and steric/hydrophobic interactions, which was supported by the Surflex-Dock and CDOCKER molecular-docking simulations based on the 3D model of NOP built by the homology modeling method. The CoMFA contour maps and the molecular docking simulations were integrated to propose a binding mode for the spiropiperidine analogues at the binding site of NOP. Copyright © 2010 Elsevier Ltd. All rights reserved.
Nargotra, Amit; Sharma, Sujata; Koul, Jawahir Lal; Sangwan, Pyare Lal; Khan, Inshad Ali; Kumar, Ashwani; Taneja, Subhash Chander; Koul, Surrinder
2009-10-01
Quantitative structure activity relationship (QSAR) analysis of piperine analogs as inhibitors of efflux pump NorA from Staphylococcus aureus has been performed in order to obtain a highly accurate model enabling prediction of inhibition of S. aureus NorA of new chemical entities from natural sources as well as synthetic ones. Algorithm based on genetic function approximation method of variable selection in Cerius2 was used to generate the model. Among several types of descriptors viz., topological, spatial, thermodynamic, information content and E-state indices that were considered in generating the QSAR model, three descriptors such as partial negative surface area of the compounds, area of the molecular shadow in the XZ plane and heat of formation of the molecules resulted in a statistically significant model with r(2)=0.962 and cross-validation parameter q(2)=0.917. The validation of the QSAR models was done by cross-validation, leave-25%-out and external test set prediction. The theoretical approach indicates that the increase in the exposed partial negative surface area increases the inhibitory activity of the compound against NorA whereas the area of the molecular shadow in the XZ plane is inversely proportional to the inhibitory activity. This model also explains the relationship of the heat of formation of the compound with the inhibitory activity. The model is not only able to predict the activity of new compounds but also explains the important regions in the molecules in quantitative manner.
Kaiser, K L E
2007-01-01
This presentation will review the evolution of the workshops from a scientific and personal perspective. From their modest beginning in 1983, the workshops have developed into larger international meetings, regularly held every two years. Their initial focus on the aquatic sphere soon expanded to include properties and effects on atmospheric and terrestrial species, including man. Concurrent with this broadening of their scientific scope, the workshops have become an important forum for the early dissemination of all aspects of qualitative and quantitative structure-activity research in ecotoxicology and human health effects. Over the last few decades, the field of quantitative structure/activity relationships (QSARs) has quickly emerged as a major scientific method in understanding the properties and effects of chemicals on the environment and human health. From substances that only affect cell membranes to those that bind strongly to a specific enzyme, QSARs provides insight into the biological effects and chemical and physical properties of substances. QSARs are useful for delineating the quantitative changes in biological effects resulting from minor but systematic variations of the structure of a compound with a specific mode of action. In addition, more holistic approaches are being devised that result in our ability to predict the effects of structurally unrelated compounds with (potentially) different modes of action. Research in QSAR environmental toxicology has led to many improvements in the manufacturing, use, and disposal of chemicals. Furthermore, it has led to national policies and international agreements, from use restrictions or outright bans of compounds, such as polychlorinated biphenyls (PCBs), mirex, and highly chlorinated pesticides (e.g. DDT, dieldrin) for the protection of avian predators, to alternatives for ozone-depleting compounds, to better waste treatment systems, to more powerful and specific acting drugs. Most of the recent advances in drug development could not have been achieved without the use of QSARs in one form or another. The pace of such developments is rapid and QSARs are the keystone to that progress. These workshops have contributed to this progress and will continue to do so in the future.
Potta, Thrimoorthy; Zhen, Zhuo; Grandhi, Taraka Sai Pavan; Christensen, Matthew D.; Ramos, James; Breneman, Curt M.; Rege, Kaushal
2014-01-01
We describe the combinatorial synthesis and cheminformatics modeling of aminoglycoside antibiotics-derived polymers for transgene delivery and expression. Fifty-six polymers were synthesized by polymerizing aminoglycosides with diglycidyl ether cross-linkers. Parallel screening resulted in identification of several lead polymers that resulted in high transgene expression levels in cells. The role of polymer physicochemical properties in determining efficacy of transgene expression was investigated using Quantitative Structure-Activity Relationship (QSAR) cheminformatics models based on Support Vector Regression (SVR) and ‘building block’ polymer structures. The QSAR model exhibited high predictive ability, and investigation of descriptors in the model, using molecular visualization and correlation plots, indicated that physicochemical attributes related to both, aminoglycosides and diglycidyl ethers facilitated transgene expression. This work synergistically combines combinatorial synthesis and parallel screening with cheminformatics-based QSAR models for discovery and physicochemical elucidation of effective antibiotics-derived polymers for transgene delivery in medicine and biotechnology. PMID:24331709
The interplay between QSAR/QSPR studies and partial order ranking and formal concept analyses.
Carlsen, Lars
2009-04-17
The often observed scarcity of physical-chemical and well as toxicological data hampers the assessment of potentially hazardous chemicals released to the environment. In such cases Quantitative Structure-Activity Relationships/Quantitative Structure-Property Relationships (QSAR/QSPR) constitute an obvious alternative for rapidly, effectively and inexpensively generatng missing experimental values. However, typically further treatment of the data appears necessary, e.g., to elucidate the possible relations between the single compounds as well as implications and associations between the various parameters used for the combined characterization of the compounds under investigation. In the present paper the application of QSAR/QSPR in combination with Partial Order Ranking (POR) methodologies will be reviewed and new aspects using Formal Concept Analysis (FCA) will be introduced. Where POR constitutes an attractive method for, e.g., prioritizing a series of chemical substances based on a simultaneous inclusion of a range of parameters, FCA gives important information on the implications associations between the parameters. The combined approach thus constitutes an attractive method to a preliminary assessment of the impact on environmental and human health by primary pollutants or possibly by a primary pollutant well as a possible suite of transformation subsequent products that may be both persistent in and bioaccumulating and toxic. The present review focus on the environmental - and human health impact by residuals of the rocket fuel 1,1-dimethylhydrazine (heptyl) and its transformation products as an illustrative example.
DemQSAR: predicting human volume of distribution and clearance of drugs
NASA Astrophysics Data System (ADS)
Demir-Kavuk, Ozgur; Bentzien, Jörg; Muegge, Ingo; Knapp, Ernst-Walter
2011-12-01
In silico methods characterizing molecular compounds with respect to pharmacologically relevant properties can accelerate the identification of new drugs and reduce their development costs. Quantitative structure-activity/-property relationship (QSAR/QSPR) correlate structure and physico-chemical properties of molecular compounds with a specific functional activity/property under study. Typically a large number of molecular features are generated for the compounds. In many cases the number of generated features exceeds the number of molecular compounds with known property values that are available for learning. Machine learning methods tend to overfit the training data in such situations, i.e. the method adjusts to very specific features of the training data, which are not characteristic for the considered property. This problem can be alleviated by diminishing the influence of unimportant, redundant or even misleading features. A better strategy is to eliminate such features completely. Ideally, a molecular property can be described by a small number of features that are chemically interpretable. The purpose of the present contribution is to provide a predictive modeling approach, which combines feature generation, feature selection, model building and control of overtraining into a single application called DemQSAR. DemQSAR is used to predict human volume of distribution (VDss) and human clearance (CL). To control overtraining, quadratic and linear regularization terms were employed. A recursive feature selection approach is used to reduce the number of descriptors. The prediction performance is as good as the best predictions reported in the recent literature. The example presented here demonstrates that DemQSAR can generate a model that uses very few features while maintaining high predictive power. A standalone DemQSAR Java application for model building of any user defined property as well as a web interface for the prediction of human VDss and CL is available on the webpage of DemPRED: http://agknapp.chemie.fu-berlin.de/dempred/.
DemQSAR: predicting human volume of distribution and clearance of drugs.
Demir-Kavuk, Ozgur; Bentzien, Jörg; Muegge, Ingo; Knapp, Ernst-Walter
2011-12-01
In silico methods characterizing molecular compounds with respect to pharmacologically relevant properties can accelerate the identification of new drugs and reduce their development costs. Quantitative structure-activity/-property relationship (QSAR/QSPR) correlate structure and physico-chemical properties of molecular compounds with a specific functional activity/property under study. Typically a large number of molecular features are generated for the compounds. In many cases the number of generated features exceeds the number of molecular compounds with known property values that are available for learning. Machine learning methods tend to overfit the training data in such situations, i.e. the method adjusts to very specific features of the training data, which are not characteristic for the considered property. This problem can be alleviated by diminishing the influence of unimportant, redundant or even misleading features. A better strategy is to eliminate such features completely. Ideally, a molecular property can be described by a small number of features that are chemically interpretable. The purpose of the present contribution is to provide a predictive modeling approach, which combines feature generation, feature selection, model building and control of overtraining into a single application called DemQSAR. DemQSAR is used to predict human volume of distribution (VD(ss)) and human clearance (CL). To control overtraining, quadratic and linear regularization terms were employed. A recursive feature selection approach is used to reduce the number of descriptors. The prediction performance is as good as the best predictions reported in the recent literature. The example presented here demonstrates that DemQSAR can generate a model that uses very few features while maintaining high predictive power. A standalone DemQSAR Java application for model building of any user defined property as well as a web interface for the prediction of human VD(ss) and CL is available on the webpage of DemPRED: http://agknapp.chemie.fu-berlin.de/dempred/ .
Molecular design of new aggrecanases-2 inhibitors.
Shan, Zhi Jie; Zhai, Hong Lin; Huang, Xiao Yan; Li, Li Na; Zhang, Xiao Yun
2013-10-01
Aggrecanases-2 is a very important potential drug target for the treatment of osteoarthritis. In this study, a series of known aggrecanases-2 inhibitors was analyzed by the technologies of three-dimensional quantitative structure-activity relationships (3D-QSAR) and molecular docking. Two 3D-QSAR models, which based on comparative molecular field analysis (CoMFA) and comparative molecular similarity analysis (CoMSIA) methods, were established. Molecular docking was employed to explore the details of the interaction between inhibitors and aggrecanases-2 protein. According to the analyses for these models, several new potential inhibitors with higher activity predicted were designed, and were supported by the simulation of molecular docking. This work propose the fast and effective approach to design and prediction for new potential inhibitors, and the study of the interaction mechanism provide a better understanding for the inhibitors binding into the target protein, which will be useful for the structure-based drug design and modifications. Copyright © 2013 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Shahin, Rand; Swellmeen, Lubna; Shaheen, Omar; Aboalhaija, Nour; Habash, Maha
2016-01-01
Targeting Proviral integration-site of murine Moloney leukemia virus 1 kinase, hereafter called Pim-1 kinase, is a promising strategy for treating different kinds of human cancer. Headed for this a total list of 328 formerly reported Pim-1 kinase inhibitors has been explored and divided based on the pharmacophoric features of the most active molecules into 10 subsets projected to represent potential active binding manners accessible to ligands within the binding pocket of Pim-1 kinase. Discovery Studio 4.1 (DS 4.1) was employed to detect potential pharmacophoric active binding manners anticipated by Pim-1 Kinase inhibitors. The pharmacophoric models were then allowed to compete within Quantitative Structure Activity Relationship (QSAR) framework with other 2D descriptors. Accordingly Genetic algorithm and multiple linear regression investigation were engaged to find the finest QSAR equation that has the best predictive power r 262 2 = 0.70, F = 119.14, r LOO 2 = 0.693, r PRESS 2 against 66 external test inhibitors = 0.71 q2 = 0.55. Three different pharmacophores appeared in the successful QSAR equation this represents three different binding modes for inhibitors within the Pim-1 kinase binding pocket. Pharmacophoric models were later used to screen compounds within the National Cancer Institute database. Several low micromolar Pim-1 Kinase inhibitors were captured. The most potent hits show IC50 values of 0.77 and 1.03 µM. Also, upon analyzing the successful QSAR Equation we found that some polycyclic aromatic electron-rich structures namely 6-Chloro-2-methoxy-acridine can be considered as putative hits for Pim-1 kinase inhibition.
Turabekova, Malakhat A.; Rasulev, Bakhtiyor F.; Levkovich, Mikhail G.; Abdullaev, Nasrulla D.; Leszczynski, Jerzy
2015-01-01
Early pharmacological studies of Aconitum and Delphinium sp. alkaloids suggested that these neurotoxins act at site 2 of voltage-gated Na+ channel and allosterically modulate its function. Understanding structural requirements for these compounds to exhibit binding activity at voltage-gated Na+ channel has been important in various fields. This paper reports quantum-chemical studies and quantitative structure-activity relationships (QSARs) based on a total of 65 natural alkaloids from two plant species, which includes both blockers and openers of sodium ion channel. A series of 18 antagonist alkaloids (9 blockers and 9 openers) have been studied using AM1 and DFT computational methods in order to reveal their structure-activity (structure-toxicity) relationship at electronic level. An examination of frontier orbitals obtained for ground and protonated forms of the compounds revealed that HOMOs and LUMOs were mainly represented by nitrogen atom and benzyl/benzoylester orbitals with –OH and –OCOCH3 contributions. The results obtained from this research have confirmed the experimental findings suggesting that neurotoxins acting at type 2 receptor site of voltage-dependent sodium channel are activators and blockers with common structural features and differ only in efficacy. The energetic tendency of HOMO-LUMO energy gap can probably distinguish activators and blockers that have been observed. Genetic Algorithm with Multiple Linear Regression Analysis (GA-MLRA) technique was also applied for the generation of two-descriptor QSAR models for the set of 65 blockers. Additionally to the computational studies, the HOMO-LUMO gap descriptor in each obtained QSAR model has confirmed the crucial role of charge transfer in receptor-ligand interactions. A number of other descriptors such as logP, IBEG, nNH2, nHDon, nCO have been selected as complementary ones to LUMO and their role in activity alteration has also been discussed. PMID:18201930
Validity and validation of expert (Q)SAR systems.
Hulzebos, E; Sijm, D; Traas, T; Posthumus, R; Maslankiewicz, L
2005-08-01
At a recent workshop in Setubal (Portugal) principles were drafted to assess the suitability of (quantitative) structure-activity relationships ((Q)SARs) for assessing the hazards and risks of chemicals. In the present study we applied some of the Setubal principles to test the validity of three (Q)SAR expert systems and validate the results. These principles include a mechanistic basis, the availability of a training set and validation. ECOSAR, BIOWIN and DEREK for Windows have a mechanistic or empirical basis. ECOSAR has a training set for each QSAR. For half of the structural fragments the number of chemicals in the training set is >4. Based on structural fragments and log Kow, ECOSAR uses linear regression to predict ecotoxicity. Validating ECOSAR for three 'valid' classes results in predictivity of > or = 64%. BIOWIN uses (non-)linear regressions to predict the probability of biodegradability based on fragments and molecular weight. It has a large training set and predicts non-ready biodegradability well. DEREK for Windows predictions are supported by a mechanistic rationale and literature references. The structural alerts in this program have been developed with a training set of positive and negative toxicity data. However, to support the prediction only a limited number of chemicals in the training set is presented to the user. DEREK for Windows predicts effects by 'if-then' reasoning. The program predicts best for mutagenicity and carcinogenicity. Each structural fragment in ECOSAR and DEREK for Windows needs to be evaluated and validated separately.
QSAR models based on quantum topological molecular similarity.
Popelier, P L A; Smith, P J
2006-07-01
A new method called quantum topological molecular similarity (QTMS) was fairly recently proposed [J. Chem. Inf. Comp. Sc., 41, 2001, 764] to construct a variety of medicinal, ecological and physical organic QSAR/QSPRs. QTMS method uses quantum chemical topology (QCT) to define electronic descriptors drawn from modern ab initio wave functions of geometry-optimised molecules. It was shown that the current abundance of computing power can be utilised to inject realistic descriptors into QSAR/QSPRs. In this article we study seven datasets of medicinal interest : the dissociation constants (pK(a)) for a set of substituted imidazolines , the pK(a) of imidazoles , the ability of a set of indole derivatives to displace [(3)H] flunitrazepam from binding to bovine cortical membranes , the influenza inhibition constants for a set of benzimidazoles , the interaction constants for a set of amides and the enzyme liver alcohol dehydrogenase , the natriuretic activity of sulphonamide carbonic anhydrase inhibitors and the toxicity of a series of benzyl alcohols. A partial least square analysis in conjunction with a genetic algorithm delivered excellent models. They are also able to highlight the active site, of the ligand or the molecule whose structure determines the activity. The advantages and limitations of QTMS are discussed.
Yang, Zhihui; Luo, Shuang; Wei, Zongsu; Ye, Tiantian; Spinney, Richard; Chen, Dong; Xiao, Ruiyang
2016-04-01
The second-order rate constants (k) of hydroxyl radical (·OH) with polychlorinated biphenyls (PCBs) in the gas phase are of scientific and regulatory importance for assessing their global distribution and fate in the atmosphere. Due to the limited number of measured k values, there is a need to model the k values for unknown PCBs congeners. In the present study, we developed a quantitative structure-activity relationship (QSAR) model with quantum chemical descriptors using a sequential approach, including correlation analysis, principal component analysis, multi-linear regression, validation, and estimation of applicability domain. The result indicates that the single descriptor, polarizability (α), plays an important role in determining the reactivity with a global standardized function of lnk = -0.054 × α ‒ 19.49 at 298 K. In order to validate the QSAR predicted k values and expand the current k value database for PCBs congeners, an independent method, density functional theory (DFT), was employed to calculate the kinetics and thermodynamics of the gas-phase ·OH oxidation of 2,4',5-trichlorobiphenyl (PCB31), 2,2',4,4'-tetrachlorobiphenyl (PCB47), 2,3,4,5,6-pentachlorobiphenyl (PCB116), 3,3',4,4',5,5'-hexachlorobiphenyl (PCB169), and 2,3,3',4,5,5',6-heptachlorobiphenyl (PCB192) at 298 K at B3LYP/6-311++G**//B3LYP/6-31 + G** level of theory. The QSAR predicted and DFT calculated k values for ·OH oxidation of these PCB congeners exhibit excellent agreement with the experimental k values, indicating the robustness and predictive power of the single-descriptor based QSAR model we developed. Copyright © 2015 Elsevier Ltd. All rights reserved.
Xu, G; Hughes-Oliver, J M; Brooks, J D; Yeatts, J L; Baynes, R E
2013-01-01
Quantitative structure-activity relationship (QSAR) models are being used increasingly in skin permeation studies. The main idea of QSAR modelling is to quantify the relationship between biological activities and chemical properties, and thus to predict the activity of chemical solutes. As a key step, the selection of a representative and structurally diverse training set is critical to the prediction power of a QSAR model. Early QSAR models selected training sets in a subjective way and solutes in the training set were relatively homogenous. More recently, statistical methods such as D-optimal design or space-filling design have been applied but such methods are not always ideal. This paper describes a comprehensive procedure to select training sets from a large candidate set of 4534 solutes. A newly proposed 'Baynes' rule', which is a modification of Lipinski's 'rule of five', was used to screen out solutes that were not qualified for the study. U-optimality was used as the selection criterion. A principal component analysis showed that the selected training set was representative of the chemical space. Gas chromatograph amenability was verified. A model built using the training set was shown to have greater predictive power than a model built using a previous dataset [1].
2012-01-01
The ORCHESTRA online questionnaire on “benefits and barriers to the use of QSAR methods” addressed the academic, consultant, regulatory and industry communities potentially interested by QSAR methods in the context of REACH. Replies from more than 60 stakeholders produced some insights on the actual application of QSAR methods, and how to improve their use. Respondents state in majority that they have used QSAR methods. All have some future plans to test or use QSAR methods in accordance with their stakeholder role. The stakeholder respondents cited a total of 28 models, methods or software that they have actually applied. The three most frequently cited suites, used moreover by all the stakeholder categories, are the OECD Toolbox, EPISuite and CAESAR; all are free tools. Results suggest that stereotyped assumptions about the barriers to application of QSAR may be incorrect. Economic costs (including potential delays) are not found to be a major barrier. And only one respondent “prefers” traditional, well-known and accepted toxicological assessment methods. Information and guidance may be the keys to reinforcing use of QSAR models. Regulators appear most interested in obtaining clear explanation of the basis of the models, to provide a solid basis for decisions. Scientists appear most interested in the exploration of the scientific capabilities of the QSAR approach. Industry shows interest in obtaining reassurance that appropriate uses of QSAR will be accepted by regulators. PMID:23244245
Kovarich, Simona; Papa, Ester; Gramatica, Paola
2011-06-15
The identification of potential endocrine disrupting (ED) chemicals is an important task for the scientific community due to their diffusion in the environment; the production and use of such compounds will be strictly regulated through the authorization process of the REACH regulation. To overcome the problem of insufficient experimental data, the quantitative structure-activity relationship (QSAR) approach is applied to predict the ED activity of new chemicals. In the present study QSAR classification models are developed, according to the OECD principles, to predict the ED potency for a class of emerging ubiquitary pollutants, viz. brominated flame retardants (BFRs). Different endpoints related to ED activity (i.e. aryl hydrocarbon receptor agonism and antagonism, estrogen receptor agonism and antagonism, androgen and progesterone receptor antagonism, T4-TTR competition, E2SULT inhibition) are modeled using the k-NN classification method. The best models are selected by maximizing the sensitivity and external predictive ability. We propose simple QSARs (based on few descriptors) characterized by internal stability, good predictive power and with a verified applicability domain. These models are simple tools that are applicable to screen BFRs in relation to their ED activity, and also to design safer alternatives, in agreement with the requirements of REACH regulation at the authorization step. Copyright © 2011 Elsevier B.V. All rights reserved.
Hodyna, Diana; Kovalishyn, Vasyl; Rogalsky, Sergiy; Blagodatnyi, Volodymyr; Petko, Kirill; Metelytsia, Larisa
2016-09-01
Predictive QSAR models for the inhibitors of B. subtilis and Ps. aeruginosa among imidazolium-based ionic liquids were developed using literary data. The regression QSAR models were created through Artificial Neural Network and k-nearest neighbor procedures. The classification QSAR models were constructed using WEKA-RF (random forest) method. The predictive ability of the models was tested by fivefold cross-validation; giving q(2) = 0.77-0.92 for regression models and accuracy 83-88% for classification models. Twenty synthesized samples of 1,3-dialkylimidazolium ionic liquids with predictive value of activity level of antimicrobial potential were evaluated. For all asymmetric 1,3-dialkylimidazolium ionic liquids, only compounds containing at least one radical with alkyl chain length of 12 carbon atoms showed high antibacterial activity. However, the activity of symmetric 1,3-dialkylimidazolium salts was found to have opposite relationship with the length of aliphatic radical being maximum for compounds based on 1,3-dioctylimidazolium cation. The obtained experimental results suggested that the application of classification QSAR models is more accurate for the prediction of activity of new imidazolium-based ILs as potential antibacterials. © 2016 John Wiley & Sons A/S.
Collecting the chemical structures and data for necessary QSAR modeling is facilitated by available public databases and open data. However, QSAR model performance is dependent on the quality of data and modeling methodology used. This study developed robust QSAR models for physi...
Sharma, Nripen S.; Jindal, Rohit; Mitra, Bhaskar; Lee, Serom; Li, Lulu; Maguire, Tim J.; Schloss, Rene; Yarmush, Martin L.
2014-01-01
Skin sensitization remains a major environmental and occupational health hazard. Animal models have been used as the gold standard method of choice for estimating chemical sensitization potential. However, a growing international drive and consensus for minimizing animal usage have prompted the development of in vitro methods to assess chemical sensitivity. In this paper, we examine existing approaches including in silico models, cell and tissue based assays for distinguishing between sensitizers and irritants. The in silico approaches that have been discussed include Quantitative Structure Activity Relationships (QSAR) and QSAR based expert models that correlate chemical molecular structure with biological activity and mechanism based read-across models that incorporate compound electrophilicity. The cell and tissue based assays rely on an assortment of mono and co-culture cell systems in conjunction with 3D skin models. Given the complexity of allergen induced immune responses, and the limited ability of existing systems to capture the entire gamut of cellular and molecular events associated with these responses, we also introduce a microfabricated platform that can capture all the key steps involved in allergic contact sensitivity. Finally, we describe the development of an integrated testing strategy comprised of two or three tier systems for evaluating sensitization potential of chemicals. PMID:24741377
Liu, Huihui; Wei, Mengbi; Yang, Xianhai; Yin, Cen; He, Xiao
2017-01-01
Partition coefficients are vital parameters for measuring accurately the chemicals concentrations by passive sampling devices. Given the wide use of low density polyethylene (LDPE) film in passive sampling, we developed a theoretical linear solvation energy relationship (TLSER) model and a quantitative structure-activity relationship (QSAR) model for the prediction of the partition coefficient of chemicals between LDPE and water (K pew ). For chemicals with the octanol-water partition coefficient (log K ow ) <8, a TLSER model with V x (McGowan volume) and qA - (the most negative charge on O, N, S, X atoms) as descriptors was developed, but the model had relatively low determination coefficient (R 2 ) and cross-validated coefficient (Q 2 ). In order to further explore the theoretical mechanisms involved in the partition process, a QSAR model with four descriptors (MLOGP (Moriguchi octanol-water partition coeff.), P_VSA_s_3 (P_VSA-like on I-state, bin 3), Hy (hydrophilic factor) and NssO (number of atoms of type ssO)) was established, and statistical analysis indicated that the model had satisfactory goodness-of-fit, robustness and predictive ability. For chemicals with log K OW >8, a TLSER model with V x and a QSAR model with MLOGP as descriptor were developed. This is the first paper to explore the models for highly hydrophobic chemicals. The applicability domain of the models, characterized by the Euclidean distance-based method and Williams plot, covered a large number of structurally diverse chemicals, which included nearly all the common hydrophobic organic compounds. Additionally, through mechanism interpretation, we explored the structural features those governing the partition behavior of chemicals between LDPE and water. Copyright © 2016 Elsevier B.V. All rights reserved.
Bhhatarai, Barun; Wilson, Daniel M.; Price, Paul S.; Marty, Sue; Parks, Amanda K.; Carney, Edward
2016-01-01
Background: Integrative testing strategies (ITSs) for potential endocrine activity can use tiered in silico and in vitro models. Each component of an ITS should be thoroughly assessed. Objectives: We used the data from three in vitro ToxCast™ binding assays to assess OASIS, a quantitative structure-activity relationship (QSAR) platform covering both estrogen receptor (ER) and androgen receptor (AR) binding. For stronger binders (described here as AC50 < 1 μM), we also examined the relationship of QSAR predictions of ER or AR binding to the results from 18 ER and 10 AR transactivation assays, 72 ER-binding reference compounds, and the in vivo uterotrophic assay. Methods: NovaScreen binding assay data for ER (human, bovine, and mouse) and AR (human, chimpanzee, and rat) were used to assess the sensitivity, specificity, concordance, and applicability domain of two OASIS QSAR models. The binding strength relative to the QSAR-predicted binding strength was examined for the ER data. The relationship of QSAR predictions of binding to transactivation- and pathway-based assays, as well as to in vivo uterotrophic responses, was examined. Results: The QSAR models had both high sensitivity (> 75%) and specificity (> 86%) for ER as well as both high sensitivity (92–100%) and specificity (70–81%) for AR. For compounds within the domains of the ER and AR QSAR models that bound with AC50 < 1 μM, the QSAR models accurately predicted the binding for the parent compounds. The parent compounds were active in all transactivation assays where metabolism was incorporated and, except for those compounds known to require metabolism to manifest activity, all assay platforms where metabolism was not incorporated. Compounds in-domain and predicted to bind by the ER QSAR model that were positive in ToxCast™ ER binding at AC50 < 1 μM were active in the uterotrophic assay. Conclusions: We used the extensive ToxCast™ HTS binding data set to show that OASIS ER and AR QSAR models had high sensitivity and specificity when compounds were in-domain of the models. Based on this research, we recommend a tiered screening approach wherein a) QSAR is used to identify compounds in-domain of the ER or AR binding models and predicted to bind; b) those compounds are screened in vitro to assess binding potency; and c) the stronger binders (AC50 < 1 μM) are screened in vivo. This scheme prioritizes compounds for integrative testing and risk assessment. Importantly, compounds that are not in-domain, that are predicted either not to bind or to bind weakly, that are not active in in vitro, that require metabolism to manifest activity, or for which in vivo AR testing is in order, need to be assessed differently. Citation: Bhhatarai B, Wilson DM, Price PS, Marty S, Parks AK, Carney E. 2016. Evaluation of OASIS QSAR models using ToxCast™ in vitro estrogen and androgen receptor binding data and application in an integrated endocrine screening approach. Environ Health Perspect 124:1453–1461; http://dx.doi.org/10.1289/EHP184 PMID:27152837
Jagiello, Karolina; Grzonkowska, Monika; Swirog, Marta; ...
2016-08-29
In this contribution, the advantages and limitations of two computational techniques that can be used for the investigation of nanoparticles activity and toxicity: classic nano-QSAR (Quantitative Structure–Activity Relationships employed for nanomaterials) and 3D nano-QSAR (three-dimensional Quantitative Structure–Activity Relationships, such us Comparative Molecular Field Analysis, CoMFA/Comparative Molecular Similarity Indices Analysis, CoMSIA analysis employed for nanomaterials) have been briefly summarized. Both approaches were compared according to the selected criteria, including: efficiency, type of experimental data, class of nanomaterials, time required for calculations and computational cost, difficulties in the interpretation. Taking into account the advantages and limitations of each method, we provide themore » recommendations for nano-QSAR modellers and QSAR model users to be able to determine a proper and efficient methodology to investigate biological activity of nanoparticles in order to describe the underlying interactions in the most reliable and useful manner.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jagiello, Karolina; Grzonkowska, Monika; Swirog, Marta
In this contribution, the advantages and limitations of two computational techniques that can be used for the investigation of nanoparticles activity and toxicity: classic nano-QSAR (Quantitative Structure–Activity Relationships employed for nanomaterials) and 3D nano-QSAR (three-dimensional Quantitative Structure–Activity Relationships, such us Comparative Molecular Field Analysis, CoMFA/Comparative Molecular Similarity Indices Analysis, CoMSIA analysis employed for nanomaterials) have been briefly summarized. Both approaches were compared according to the selected criteria, including: efficiency, type of experimental data, class of nanomaterials, time required for calculations and computational cost, difficulties in the interpretation. Taking into account the advantages and limitations of each method, we provide themore » recommendations for nano-QSAR modellers and QSAR model users to be able to determine a proper and efficient methodology to investigate biological activity of nanoparticles in order to describe the underlying interactions in the most reliable and useful manner.« less
Demystifying Multitask Deep Neural Networks for Quantitative Structure-Activity Relationships.
Xu, Yuting; Ma, Junshui; Liaw, Andy; Sheridan, Robert P; Svetnik, Vladimir
2017-10-23
Deep neural networks (DNNs) are complex computational models that have found great success in many artificial intelligence applications, such as computer vision1,2 and natural language processing.3,4 In the past four years, DNNs have also generated promising results for quantitative structure-activity relationship (QSAR) tasks.5,6 Previous work showed that DNNs can routinely make better predictions than traditional methods, such as random forests, on a diverse collection of QSAR data sets. It was also found that multitask DNN models-those trained on and predicting multiple QSAR properties simultaneously-outperform DNNs trained separately on the individual data sets in many, but not all, tasks. To date there has been no satisfactory explanation of why the QSAR of one task embedded in a multitask DNN can borrow information from other unrelated QSAR tasks. Thus, using multitask DNNs in a way that consistently provides a predictive advantage becomes a challenge. In this work, we explored why multitask DNNs make a difference in predictive performance. Our results show that during prediction a multitask DNN does borrow "signal" from molecules with similar structures in the training sets of the other tasks. However, whether this borrowing leads to better or worse predictive performance depends on whether the activities are correlated. On the basis of this, we have developed a strategy to use multitask DNNs that incorporate prior domain knowledge to select training sets with correlated activities, and we demonstrate its effectiveness on several examples.
García-Jacas, César R; Contreras-Torres, Ernesto; Marrero-Ponce, Yovani; Pupo-Meriño, Mario; Barigye, Stephen J; Cabrera-Leyva, Lisset
2016-01-01
Recently, novel 3D alignment-free molecular descriptors (also known as QuBiLS-MIDAS) based on two-linear, three-linear and four-linear algebraic forms have been introduced. These descriptors codify chemical information for relations between two, three and four atoms by using several (dis-)similarity metrics and multi-metrics. Several studies aimed at assessing the quality of these novel descriptors have been performed. However, a deeper analysis of their performance is necessary. Therefore, in the present manuscript an assessment and statistical validation of the performance of these novel descriptors in QSAR studies is performed. To this end, eight molecular datasets (angiotensin converting enzyme, acetylcholinesterase inhibitors, benzodiazepine receptor, cyclooxygenase-2 inhibitors, dihydrofolate reductase inhibitors, glycogen phosphorylase b, thermolysin inhibitors, thrombin inhibitors) widely used as benchmarks in the evaluation of several procedures are utilized. Three to nine variable QSAR models based on Multiple Linear Regression are built for each chemical dataset according to the original division into training/test sets. Comparisons with respect to leave-one-out cross-validation correlation coefficients[Formula: see text] reveal that the models based on QuBiLS-MIDAS indices possess superior predictive ability in 7 of the 8 datasets analyzed, outperforming methodologies based on similar or more complex techniques such as: Partial Least Square, Neural Networks, Support Vector Machine and others. On the other hand, superior external correlation coefficients[Formula: see text] are attained in 6 of the 8 test sets considered, confirming the good predictive power of the obtained models. For the [Formula: see text] values non-parametric statistic tests were performed, which demonstrated that the models based on QuBiLS-MIDAS indices have the best global performance and yield significantly better predictions in 11 of the 12 QSAR procedures used in the comparison. Lastly, a study concerning to the performance of the indices according to several conformer generation methods was performed. This demonstrated that the quality of predictions of the QSAR models based on QuBiLS-MIDAS indices depend on 3D structure generation method considered, although in this preliminary study the results achieved do not present significant statistical differences among them. As conclusions it can be stated that the QuBiLS-MIDAS indices are suitable for extracting structural information of the molecules and thus, constitute a promissory alternative to build models that contribute to the prediction of pharmacokinetic, pharmacodynamics and toxicological properties on novel compounds.Graphical abstractComparative graphical representation of the performance of the novel QuBiLS-MIDAS 3D-MDs with respect to other methodologies in QSAR modeling of eight chemical datasets.
QSAR studies in the discovery of novel type-II diabetic therapies.
Abuhammad, Areej; Taha, Mutasem O
2016-01-01
Type-II diabetes mellitus (T2DM) is a complex chronic disease that represents a major therapeutic challenge. Despite extensive efforts in T2DM drug development, therapies remain unsatisfactory. Currently, there are many novel and important antidiabetic drug targets under investigation by many research groups worldwide. One of the main challenges to develop effective orally active hypoglycemic agents is off-target effects. Computational tools have impacted drug discovery at many levels. One of the earliest methods is quantitative structure-activity relationship (QSAR) studies. QSAR strategies help medicinal chemists understand the relationship between hypoglycemic activity and molecular properties. Hence, QSAR may hold promise in guiding the synthesis of specifically designed novel ligands that demonstrate high potency and target selectivity. This review aims to provide an overview of the QSAR strategies used to model antidiabetic agents. In particular, this review focuses on drug targets that raised recent scientific interest and/or led to successful antidiabetic agents in the market. Special emphasis has been made on studies that led to the identification of novel antidiabetic scaffolds. Computer-aided molecular design and discovery techniques like QSAR have a great potential in designing leads against complex diseases such as T2DM. Combined with other in silico techniques, QSAR can provide more useful and rational insights to facilitate the discovery of novel compounds. However, since T2DM is a complex disease that includes several faulty biological targets, multi-target QSAR studies are recommended in the future to achieve efficient antidiabetic therapies.
Experimental and QSAR study on the surface activities of alkyl imidazoline surfactants
NASA Astrophysics Data System (ADS)
Kong, Xiangjun; Qian, Chengduo; Fan, Weiyu; Liang, Zupei
2018-03-01
15 alkyl imidazoline surfactants with different structures were synthesized and their critical micelle concentration (CMC) and surface tension under the CMC (σcmc) in aqueous solution were measured at 298 K. 54 kinds of molecular structure descriptors were selected as independent variables and the quantitative structure-activity relationship (QSAR) between surface activities of alkyl imidazoline and molecular structure were built through the genetic function approximation (GFA) method. Experimental results showed that the maximum surface excess of alkyl imidazoline molecules at the gas-liquid interface increased and the area occupied by each surfactant molecule and the free energies of micellization ΔGm decreased with increasing carbon number (NC) of the hydrophobic chain or decreasing hydrophilicity of counterions, which resulted in a CMC and σcmc decrease, while the log CMC and NC had a linear relationship and a negative correlation. The GFA-QSAR model, which was generated by a training set composed of 13 kinds of alkyl imidazoline though GFA method regression analysis, was highly correlated with predicted values and experimental values of the CMC. The correlation coefficient R was 0.9991, which means high prediction accuracy. The prediction error of 2 kinds of alkyl imidazoline CMCs in the Validation Set that quantitatively analyzed the influence of the alkyl imidazoline molecular structure on the CMC was less than 4%.
NASA Astrophysics Data System (ADS)
Jukić, Marijana; Rastija, Vesna; Opačak-Bernardi, Teuta; Stolić, Ivana; Krstulović, Luka; Bajić, Miroslav; Glavaš-Obrovac, Ljubica
2017-04-01
The aim of this study was to evaluate nine newly synthesized amidine derivatives of 3,4- ethylenedioxythiophene (3,4-EDOT) for their cytotoxic activity against a panel of human cancer cell lines and to perform a quantitative structure-activity relationship (QSAR) analysis for the antitumor activity of a total of 27 3,4-ethylenedioxythiophene derivatives. Induction of apoptosis was investigated on the selected compounds, along with delivery options for the optimization of activity. The best obtained QSAR models include the following group of descriptors: BCUT, WHIM, 2D autocorrelations, 3D-MoRSE, GETAWAY descriptors, 2D frequency fingerprint and information indices. Obtained QSAR models should be relieved in elucidation of important physicochemical and structural requirements for this biological activity. Highly potent molecules have a symmetrical arrangement of substituents along the x axis, high frequency of distance between N and O atoms at topological distance 9, as well as between C and N atoms at topological distance 10, and more C atoms located at topological distances 6 and 3. Based on the conclusion given in the QSAR analysis, a new compound with possible great activity was proposed.
2011-01-01
used in efforts to develop QSAR models. Measurement of Repellent Efficacy Screening for Repellency of Compounds with Unknown Toxicology In screening...CPT) were used to develop Quantitative Structure Activity Relationship ( QSAR ) models to predict repellency. Successful prediction of novel...acylpiperidine QSAR models employed 4 descriptors to describe the relationship between structure and repellent duration. The ANN model of the carboxamides did not
He, Gu; Qiu, Minghua; Li, Rui; Ouyang, Liang; Wu, Fengbo; Song, Xiangrong; Cheng, Li; Xiang, Mingli; Yu, Luoting
2012-06-01
Aurora-A has been known as one of the most important targets for cancer therapy, and some Aurora-A inhibitors have entered clinical trails. In this study, combination of the ligand-based and structure-based methods is used to clarify the essential quantitative structure-activity relationship of known Aurora-A inhibitors, and multicomplex-based pharmacophore-guided method has been suggested to generate a comprehensive pharmacophore of Aurora-A kinase based on a collection of crystal structures of Aurora-A-inhibitor complex. This model has been successfully used to identify the bioactive conformation and align 37 structurally diverse N-substituted 2'-(aminoaryl)benzothiazoles derivatives. The quantitative structure-activity relationship analyses have been performed on these Aurora-A inhibitors based on multicomplex-based pharmacophore-guided alignment. These results may provide important information for further design and virtual screening of novel Aurora-A inhibitors. © 2012 John Wiley & Sons A/S.
NASA Astrophysics Data System (ADS)
Adhikari, Nilanjan; Amin, Sk. Abdul; Saha, Achintya; Jha, Tarun
2018-03-01
Matrix metalloproteinase-2 (MMP-2) is a promising pharmacological target for designing potential anticancer drugs. MMP-2 plays critical functions in apoptosis by cleaving the DNA repair enzyme namely poly (ADP-ribose) polymerase (PARP). Moreover, MMP-2 expression triggers the vascular endothelial growth factor (VEGF) having a positive influence on tumor size, invasion, and angiogenesis. Therefore, it is an urgent need to develop potential MMP-2 inhibitors without any toxicity but better pharmacokinetic property. In this article, robust validated multi-quantitative structure-activity relationship (QSAR) modeling approaches were attempted on a dataset of 222 MMP-2 inhibitors to explore the important structural and pharmacophoric requirements for higher MMP-2 inhibition. Different validated regression and classification-based QSARs, pharmacophore mapping and 3D-QSAR techniques were performed. These results were challenged and subjected to further validation to explain 24 in house MMP-2 inhibitors to judge the reliability of these models further. All these models were individually validated internally as well as externally and were supported and validated by each other. These results were further justified by molecular docking analysis. Modeling techniques adopted here not only helps to explore the necessary structural and pharmacophoric requirements but also for the overall validation and refinement techniques for designing potential MMP-2 inhibitors.
Rational selection of training and test sets for the development of validated QSAR models
NASA Astrophysics Data System (ADS)
Golbraikh, Alexander; Shen, Min; Xiao, Zhiyan; Xiao, Yun-De; Lee, Kuo-Hsiung; Tropsha, Alexander
2003-02-01
Quantitative Structure-Activity Relationship (QSAR) models are used increasingly to screen chemical databases and/or virtual chemical libraries for potentially bioactive molecules. These developments emphasize the importance of rigorous model validation to ensure that the models have acceptable predictive power. Using k nearest neighbors ( kNN) variable selection QSAR method for the analysis of several datasets, we have demonstrated recently that the widely accepted leave-one-out (LOO) cross-validated R2 (q2) is an inadequate characteristic to assess the predictive ability of the models [Golbraikh, A., Tropsha, A. Beware of q2! J. Mol. Graphics Mod. 20, 269-276, (2002)]. Herein, we provide additional evidence that there exists no correlation between the values of q 2 for the training set and accuracy of prediction ( R 2) for the test set and argue that this observation is a general property of any QSAR model developed with LOO cross-validation. We suggest that external validation using rationally selected training and test sets provides a means to establish a reliable QSAR model. We propose several approaches to the division of experimental datasets into training and test sets and apply them in QSAR studies of 48 functionalized amino acid anticonvulsants and a series of 157 epipodophyllotoxin derivatives with antitumor activity. We formulate a set of general criteria for the evaluation of predictive power of QSAR models.
Philipp, Bodo; Hoff, Malte; Germa, Florence; Schink, Bernhard; Beimborn, Dieter; Mersch-Sundermann, Volker
2007-02-15
Prediction of the biodegradability of organic compounds is an ecologically desirable and economically feasible tool for estimating the environmental fate of chemicals. We combined quantitative structure-activity relationships (QSAR) with the systematic collection of biochemical knowledge to establish rules for the prediction of aerobic biodegradation of N-heterocycles. Validated biodegradation data of 194 N-heterocyclic compounds were analyzed using the MULTICASE-method which delivered two QSAR models based on 17 activating (OSAR 1) and on 16 inactivating molecular fragments (GSAR 2), which were statistically significantly linked to efficient or poor biodegradability, respectively. The percentages of correct classifications were over 99% for both models, and cross-validation resulted in 67.9% (GSAR 1) and 70.4% (OSAR 2) correct predictions. Biochemical interpretation of the activating and inactivating characteristics of the molecular fragments delivered plausible mechanistic interpretations and enabled us to establish the following biodegradation rules: (1) Target sites for amidohydrolases and for cytochrome P450 monooxygenases enhance biodegradation of nonaromatic N-heterocycles. (2) Target sites for molybdenum hydroxylases enhance biodegradation of aromatic N-heterocycles. (3) Target sites for hydratation by an urocanase-like mechanism enhance biodegradation of imidazoles. Our complementary approach represents a feasible strategy for generating concrete rules for the prediction of biodegradability of organic compounds.
Ren, Ji-Xia; Li, Cheng-Ping; Zhou, Xiu-Ling; Cao, Xue-Song; Xie, Yong
2017-08-22
Myeloid cell leukemia-1 (Mcl-1) has been a validated and attractive target for cancer therapy. Over-expression of Mcl-1 in many cancers allows cancer cells to evade apoptosis and contributes to the resistance to current chemotherapeutics. Here, we identified new Mcl-1 inhibitors using a multi-step virtual screening approach. First, based on two different ligand-receptor complexes, 20 pharmacophore models were established by simultaneously using 'Receptor-Ligand Pharmacophore Generation' method and manual build feature method, and then carefully validated by a test database. Then, pharmacophore-based virtual screening (PB-VS) could be performed by using the 20 pharmacophore models. In addition, docking study was used to predict the possible binding poses of compounds, and the docking parameters were optimized before performing docking-based virtual screening (DB-VS). Moreover, a 3D QSAR model was established by applying the 55 aligned Mcl-1 inhibitors. The 55 inhibitors sharing the same scaffold were docked into the Mcl-1 active site before alignment, then the inhibitors with possible binding conformations were aligned. For the training set, the 3D QSAR model gave a correlation coefficient r 2 of 0.996; for the test set, the correlation coefficient r 2 was 0.812. Therefore, the developed 3D QSAR model was a good model, which could be applied for carrying out 3D QSAR-based virtual screening (QSARD-VS). After the above three virtual screening methods orderly filtering, 23 potential inhibitors with novel scaffolds were identified. Furthermore, we have discussed in detail the mapping results of two potent compounds onto pharmacophore models, 3D QSAR model, and the interactions between the compounds and active site residues.
Ponzano, Stefano; Berteotti, Anna; Petracca, Rita; Vitale, Romina; Mengatto, Luisa; Bandiera, Tiziano; Cavalli, Andrea; Piomelli, Daniele; Bertozzi, Fabio; Bottegoni, Giovanni
2014-12-11
N-(2-Oxo-3-oxetanyl)carbamic acid esters have recently been reported to be noncompetitive inhibitors of the N-acylethanolamine acid amidase (NAAA) potentially useful for the treatment of pain and inflammation. In the present study, we further explored the structure-activity relationships of the carbamic acid ester side chain of 2-methyl-4-oxo-3-oxetanylcarbamic acid ester derivatives. Additional favorable features in the design of potent NAAA inhibitors have been found together with the identification of a single digit nanomolar inhibitor. In addition, we devised a 3D QSAR using the atomic property field method. The model turned out to be able to account for the structural variability and was prospectively validated by designing, synthesizing, and testing novel inhibitors. The fairly good agreement between predictions and experimental potency values points to this 3D QSAR model as the first example of quantitative structure-activity relationships in the field of NAAA inhibitors.
Jhin, Changho; Hwang, Keum Taek
2014-01-01
Radical scavenging activity of anthocyanins is well known, but only a few studies have been conducted by quantum chemical approach. The adaptive neuro-fuzzy inference system (ANFIS) is an effective technique for solving problems with uncertainty. The purpose of this study was to construct and evaluate quantitative structure-activity relationship (QSAR) models for predicting radical scavenging activities of anthocyanins with good prediction efficiency. ANFIS-applied QSAR models were developed by using quantum chemical descriptors of anthocyanins calculated by semi-empirical PM6 and PM7 methods. Electron affinity (A) and electronegativity (χ) of flavylium cation, and ionization potential (I) of quinoidal base were significantly correlated with radical scavenging activities of anthocyanins. These descriptors were used as independent variables for QSAR models. ANFIS models with two triangular-shaped input fuzzy functions for each independent variable were constructed and optimized by 100 learning epochs. The constructed models using descriptors calculated by both PM6 and PM7 had good prediction efficiency with Q-square of 0.82 and 0.86, respectively. PMID:25153627
Ivanciuc, Ovidiu
2013-06-01
Chemical and molecular graphs have fundamental applications in chemoinformatics, quantitative structureproperty relationships (QSPR), quantitative structure-activity relationships (QSAR), virtual screening of chemical libraries, and computational drug design. Chemoinformatics applications of graphs include chemical structure representation and coding, database search and retrieval, and physicochemical property prediction. QSPR, QSAR and virtual screening are based on the structure-property principle, which states that the physicochemical and biological properties of chemical compounds can be predicted from their chemical structure. Such structure-property correlations are usually developed from topological indices and fingerprints computed from the molecular graph and from molecular descriptors computed from the three-dimensional chemical structure. We present here a selection of the most important graph descriptors and topological indices, including molecular matrices, graph spectra, spectral moments, graph polynomials, and vertex topological indices. These graph descriptors are used to define several topological indices based on molecular connectivity, graph distance, reciprocal distance, distance-degree, distance-valency, spectra, polynomials, and information theory concepts. The molecular descriptors and topological indices can be developed with a more general approach, based on molecular graph operators, which define a family of graph indices related by a common formula. Graph descriptors and topological indices for molecules containing heteroatoms and multiple bonds are computed with weighting schemes based on atomic properties, such as the atomic number, covalent radius, or electronegativity. The correlation in QSPR and QSAR models can be improved by optimizing some parameters in the formula of topological indices, as demonstrated for structural descriptors based on atomic connectivity and graph distance.
Web-4D-QSAR: A web-based application to generate 4D-QSAR descriptors.
Ataide Martins, João Paulo; Rougeth de Oliveira, Marco Antônio; Oliveira de Queiroz, Mário Sérgio
2018-06-05
A web-based application is developed to generate 4D-QSAR descriptors using the LQTA-QSAR methodology, based on molecular dynamics (MD) trajectories and topology information retrieved from the GROMACS package. The LQTAGrid module calculates the intermolecular interaction energies at each grid point, considering probes and all aligned conformations resulting from MD simulations. These interaction energies are the independent variables or descriptors employed in a QSAR analysis. A friendly front end web interface, built using the Django framework and Python programming language, integrates all steps of the LQTA-QSAR methodology in a way that is transparent to the user, and in the backend, GROMACS and LQTAGrid are executed to generate 4D-QSAR descriptors to be used later in the process of QSAR model building. © 2018 Wiley Periodicals, Inc. © 2018 Wiley Periodicals, Inc.
Burden, Natalie; Maynard, Samuel K; Weltje, Lennart; Wheeler, James R
2016-10-01
The European Plant Protection Products Regulation 1107/2009 requires that registrants establish whether pesticide metabolites pose a risk to the environment. Fish acute toxicity assessments may be carried out to this end. Considering the total number of pesticide (re-) registrations, the number of metabolites can be considerable, and therefore this testing could use many vertebrates. EFSA's recent "Guidance on tiered risk assessment for plant protection products for aquatic organisms in edge-of-field surface waters" outlines opportunities to apply non-testing methods, such as Quantitative Structure Activity Relationship (QSAR) models. However, a scientific evidence base is necessary to support the use of QSARs in predicting acute fish toxicity of pesticide metabolites. Widespread application and subsequent regulatory acceptance of such an approach would reduce the numbers of animals used. The work presented here intends to provide this evidence base, by means of retrospective data analysis. Experimental fish LC50 values for 150 metabolites were extracted from the Pesticide Properties Database (http://sitem.herts.ac.uk/aeru/ppdb/en/atoz.htm). QSAR calculations were performed to predict fish acute toxicity values for these metabolites using the US EPA's ECOSAR software. The most conservative predicted LC50 values generated by ECOSAR were compared with experimental LC50 values. There was a significant correlation between predicted and experimental fish LC50 values (Spearman rs = 0.6304, p < 0.0001). For 62% of metabolites assessed, the QSAR predicted values are equal to or lower than their respective experimental values. Refined analysis, taking into account data quality and experimental variation considerations increases the proportion of sufficiently predictive estimates to 91%. For eight of the nine outliers, there are plausible explanation(s) for the disparity between measured and predicted LC50 values. Following detailed consideration of the robustness of this non-testing approach, it can be concluded there is a strong data driven rationale for the applicability of QSAR models in the metabolite assessment scheme recommended by EFSA. As such there is value in further refining this approach, to improve the method and enable its future incorporation into regulatory guidance and practice. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
MIANN models in medicinal, physical and organic chemistry.
González-Díaz, Humberto; Arrasate, Sonia; Sotomayor, Nuria; Lete, Esther; Munteanu, Cristian R; Pazos, Alejandro; Besada-Porto, Lina; Ruso, Juan M
2013-01-01
Reducing costs in terms of time, animal sacrifice, and material resources with computational methods has become a promising goal in Medicinal, Biological, Physical and Organic Chemistry. There are many computational techniques that can be used in this sense. In any case, almost all these methods focus on few fundamental aspects including: type (1) methods to quantify the molecular structure, type (2) methods to link the structure with the biological activity, and others. In particular, MARCH-INSIDE (MI), acronym for Markov Chain Invariants for Networks Simulation and Design, is a well-known method for QSAR analysis useful in step (1). In addition, the bio-inspired Artificial-Intelligence (AI) algorithms called Artificial Neural Networks (ANNs) are among the most powerful type (2) methods. We can combine MI with ANNs in order to seek QSAR models, a strategy which is called herein MIANN (MI & ANN models). One of the first applications of the MIANN strategy was in the development of new QSAR models for drug discovery. MIANN strategy has been expanded to the QSAR study of proteins, protein-drug interactions, and protein-protein interaction networks. In this paper, we review for the first time many interesting aspects of the MIANN strategy including theoretical basis, implementation in web servers, and examples of applications in Medicinal and Biological chemistry. We also report new applications of the MIANN strategy in Medicinal chemistry and the first examples in Physical and Organic Chemistry, as well. In so doing, we developed new MIANN models for several self-assembly physicochemical properties of surfactants and large reaction networks in organic synthesis. In some of the new examples we also present experimental results which were not published up to date.
Pandey, Gyanendra; Saxena, Anil K
2006-01-01
A set of 65 flexible peptidomimetic competitive inhibitors (52 in the training set and 13 in the test set) of protein tyrosine phosphatase 1B (PTP1B) has been used to compare the quality and predictive power of 3D quantitative structure-activity relationship (QSAR) comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) models for the three most commonly used conformer-based alignments, namely, cocrystallized conformer-based alignment (CCBA), docked conformer-based alignment (DCBA), and global minima energy conformer-based alignment (GMCBA). These three conformers of 5-[(2S)-2-({(2S)-2-[(tert-butoxycarbonyl)amino]-3-phenylpropanoyl}amino)3-oxo-3-pentylamino)propyl]-2-(carboxymethoxy)benzoic acid (compound number 66) were obtained from the X-ray structure of its cocrystallized complex with PTP1B (PDB ID: 1JF7), its docking studies, and its global minima by simulated annealing. Among the 3D QSAR models developed using the above three alignments, the CCBA provided the optimal predictive CoMFA model for the training set with cross-validated r2 (q2)=0.708, non-cross-validated r2=0.902, standard error of estimate (s)=0.165, and F=202.553 and the optimal CoMSIA model with q2=0.440, r2=0.799, s=0.192, and F=117.782. These models also showed the best test set prediction for the 13 compounds with predictive r2 values of 0.706 and 0.683, respectively. Though the QSAR models derived using the other two alignments also produced statistically acceptable models in the order DCBA>GMCBA in terms of the values of q2, r2, and predictive r2, they were inferior to the corresponding models derived using CCBA. Thus, the order of preference for the alignment selection for 3D QSAR model development may be CCBA>DCBA>GMCBA, and the information obtained from the CoMFA and CoMSIA contour maps may be useful in designing specific PTP1B inhibitors.
The Interplay between QSAR/QSPR Studies and Partial Order Ranking and Formal Concept Analyses
Carlsen, Lars
2009-01-01
The often observed scarcity of physical-chemical and well as toxicological data hampers the assessment of potentially hazardous chemicals released to the environment. In such cases Quantitative Structure-Activity Relationships/Quantitative Structure-Property Relationships (QSAR/QSPR) constitute an obvious alternative for rapidly, effectively and inexpensively generatng missing experimental values. However, typically further treatment of the data appears necessary, e.g., to elucidate the possible relations between the single compounds as well as implications and associations between the various parameters used for the combined characterization of the compounds under investigation. In the present paper the application of QSAR/QSPR in combination with Partial Order Ranking (POR) methodologies will be reviewed and new aspects using Formal Concept Analysis (FCA) will be introduced. Where POR constitutes an attractive method for, e.g., prioritizing a series of chemical substances based on a simultaneous inclusion of a range of parameters, FCA gives important information on the implications associations between the parameters. The combined approach thus constitutes an attractive method to a preliminary assessment of the impact on environmental and human health by primary pollutants or possibly by a primary pollutant well as a possible suite of transformation subsequent products that may be both persistent in and bioaccumulating and toxic. The present review focus on the environmental – and human health impact by residuals of the rocket fuel 1,1-dimethylhydrazine (heptyl) and its transformation products as an illustrative example. PMID:19468330
CheS-Mapper 2.0 for visual validation of (Q)SAR models
2014-01-01
Background Sound statistical validation is important to evaluate and compare the overall performance of (Q)SAR models. However, classical validation does not support the user in better understanding the properties of the model or the underlying data. Even though, a number of visualization tools for analyzing (Q)SAR information in small molecule datasets exist, integrated visualization methods that allow the investigation of model validation results are still lacking. Results We propose visual validation, as an approach for the graphical inspection of (Q)SAR model validation results. The approach applies the 3D viewer CheS-Mapper, an open-source application for the exploration of small molecules in virtual 3D space. The present work describes the new functionalities in CheS-Mapper 2.0, that facilitate the analysis of (Q)SAR information and allows the visual validation of (Q)SAR models. The tool enables the comparison of model predictions to the actual activity in feature space. The approach is generic: It is model-independent and can handle physico-chemical and structural input features as well as quantitative and qualitative endpoints. Conclusions Visual validation with CheS-Mapper enables analyzing (Q)SAR information in the data and indicates how this information is employed by the (Q)SAR model. It reveals, if the endpoint is modeled too specific or too generic and highlights common properties of misclassified compounds. Moreover, the researcher can use CheS-Mapper to inspect how the (Q)SAR model predicts activity cliffs. The CheS-Mapper software is freely available at http://ches-mapper.org. Graphical abstract Comparing actual and predicted activity values with CheS-Mapper.
Neural network-based QSAR and insecticide discovery: spinetoram
NASA Astrophysics Data System (ADS)
Sparks, Thomas C.; Crouse, Gary D.; Dripps, James E.; Anzeveno, Peter; Martynow, Jacek; DeAmicis, Carl V.; Gifford, James
2008-06-01
Improvements in the efficacy and spectrum of the spinosyns, novel fermentation derived insecticide, has long been a goal within Dow AgroSciences. As large and complex fermentation products identifying specific modifications to the spinosyns likely to result in improved activity was a difficult process, since most modifications decreased the activity. A variety of approaches were investigated to identify new synthetic directions for the spinosyn chemistry including several explorations of the quantitative structure activity relationships (QSAR) of spinosyns, which initially were unsuccessful. However, application of artificial neural networks (ANN) to the spinosyn QSAR problem identified new directions for improved activity in the chemistry, which subsequent synthesis and testing confirmed. The ANN-based analogs coupled with other information on substitution effects resulting from spinosyn structure activity relationships lead to the discovery of spinetoram (XDE-175). Launched in late 2007, spinetoram provides both improved efficacy and an expanded spectrum while maintaining the exceptional environmental and toxicological profile already established for the spinosyn chemistry.
Forecasting the Environmental Impacts of New Energetic Materials
2010-11-30
Quantitative structure- activity relationships for chemical reductions of organic contaminants. Environmental Toxicology and Chemistry 22(8): 1733-1742. QSARs ...activity relationships [ QSARs ]) and the use of these properties to predict the chemical?s fate with multimedia assessment models. SERDP has recently...has several parts, including the prediction of chemical properties (e.g., with quantitative structure-activity relationships [ QSARs ]) and the use of
Combinatorial QSAR Modeling of Rat Acute Toxicity by Oral Exposure
Quantitative Structure-Activity Relationship (QSAR) toxicity models have become popular tools for identifying potential toxic compounds and prioritizing candidates for animal toxicity tests. However, few QSAR studies have successfully modeled large, diverse mammalian toxicity end...
Prado-Prado, Francisco; García-Mera, Xerardo; Escobar, Manuel; Alonso, Nerea; Caamaño, Olga; Yañez, Matilde; González-Díaz, Humberto
2012-01-01
The number of neurodegenerative diseases has been increasing in recent years. Many of the drug candidates to be used in the treatment of neurodegenerative diseases present specific 3D structural features. An important protein in this sense is the acetylcholinesterase (AChE), which is the target of many Alzheimer's dementia drugs. Consequently, the prediction of Drug-Protein Interactions (DPIs/nDPIs) between new drug candidates and specific 3D structure and targets is of major importance. To this end, we can use Quantitative Structure-Activity Relationships (QSAR) models to carry out a rational DPIs prediction. Unfortunately, many previous QSAR models developed to predict DPIs take into consideration only 2D structural information and codify the activity against only one target. To solve this problem we can develop some 3D multi-target QSAR (3D mt-QSAR) models. In this study, using the 3D MI-DRAGON technique, we have introduced a new predictor for DPIs based on two different well-known software. We have used the MARCH-INSIDE (MI) and DRAGON software to calculate 3D structural parameters for drugs and targets respectively. Both classes of 3D parameters were used as input to train Artificial Neuronal Network (ANN) algorithms using as benchmark dataset the complex network (CN) made up of all DPIs between US FDA approved drugs and their targets. The entire dataset was downloaded from the DrugBank database. The best 3D mt-QSAR predictor found was an ANN of Multi-Layer Perceptron-type (MLP) with profile MLP 37:37-24-1:1. This MLP classifies correctly 274 out of 321 DPIs (Sensitivity = 85.35%) and 1041 out of 1190 nDPIs (Specificity = 87.48%), corresponding to training Accuracy = 87.03%. We have validated the model with external predicting series with Sensitivity = 84.16% (542/644 DPIs; Specificity = 87.51% (2039/2330 nDPIs) and Accuracy = 86.78%. The new CNs of DPIs reconstructed from US FDA can be used to explore large DPI databases in order to discover both new drugs and/or targets. We have carried out some theoretical-experimental studies to illustrate the practical use of 3D MI-DRAGON. First, we have reported the prediction and pharmacological assay of 22 different rasagiline derivatives with possible AChE inhibitory activity. In this work, we have reviewed different computational studies on Drug- Protein models. First, we have reviewed 10 studies on DP computational models. Next, we have reviewed 2D QSAR, 3D QSAR, CoMFA, CoMSIA and Docking with different compounds to find Drug-Protein QSAR models. Last, we have developped a 3D multi-target QSAR (3D mt-QSAR) models for the prediction of the activity of new compounds against different targets or the discovery of new targets.
Nirouei, Mahyar; Ghasemi, Ghasem; Abdolmaleki, Parviz; Tavakoli, Abdolreza; Shariati, Shahab
2012-06-01
The antiviral drugs that inhibit human immunodeficiency virus (HIV) entry to the target cells are already in different phases of clinical trials. They prevent viral entry and have a highly specific mechanism of action with a low toxicity profile. Few QSAR studies have been performed on this group of inhibitors. This study was performed to develop a quantitative structure-activity relationship (QSAR) model of the biological activity of indole glyoxamide derivatives as inhibitors of the interaction between HIV glycoprotein gp120 and host cell CD4 receptors. Forty different indole glyoxamide derivatives were selected as a sample set and geometrically optimized using Gaussian 98W. Different combinations of multiple linear regression (MLR), genetic algorithms (GA) and artificial neural networks (ANN) were then utilized to construct the QSAR models. These models were also utilized to select the most efficient subsets of descriptors in a cross-validation procedure for non-linear log (1/EC50) prediction. The results that were obtained using GA-ANN were compared with MLR-MLR and MLR-ANN models. A high predictive ability was observed for the MLR, MLR-ANN and GA-ANN models, with root mean sum square errors (RMSE) of 0.99, 0.91 and 0.67, respectively (N = 40). In summary, machine learning methods were highly effective in designing QSAR models when compared to statistical method.
Cronin, Mark T D; Jaworska, Joanna S; Walker, John D; Comber, Michael H I; Watts, Christopher D; Worth, Andrew P
2003-01-01
This article is a review of the use of quantitative (and qualitative) structure-activity relationships (QSARs and SARs) by regulatory agencies and authorities to predict acute toxicity, mutagenicity, carcinogenicity, and other health effects. A number of SAR and QSAR applications, by regulatory agencies and authorities, are reviewed. These include the use of simple QSAR analyses, as well as the use of multivariate QSARs, and a number of different expert system approaches. PMID:12896862
Yang, Ran; Yu, Lanlan; Zeng, Huajin; Liang, Ruiling; Chen, Xiaolan; Qu, Lingbo
2012-11-01
In this work, the interactions of twelve structurally different flavonoids with Lysozyme (Lys) were studied by fluorescence quenching method. The interaction mechanism and binding properties were investigated. It was found that the binding capacities of flavonoids to Lys were highly depend on the number and position of hydrogen, the kind and position of glycosyl. To explore the selectivity of the bindings of flavonoids with Lys, the structure descriptors of the flavonoids were calculated under QSAR software package of Cerius2, the quantitative relationship between the structures of flavonoids and their binding activities to Lys (QSAR) was performed through genetic function approximation (GFA) regression analysis. The QSAR regression equation was K(A) = 37850.460 + 1630.01Dipole +3038.330HD-171.795MR. (r = 0.858, r(CV)(2) = 0.444, F((11,3)) = 7.48), where K(A) is binding constants, Dipole, HD and MR was dipole moment, number of hydrogen-bond donor and molecular refractivity, respectively. The obtained results make us understand better how the molecular structures influencing their binding to protein which may open up new avenues for the design of the most suitable flavonoids derivatives with structure variants.
Wignall, Jessica A; Muratov, Eugene; Sedykh, Alexander; Guyton, Kathryn Z; Tropsha, Alexander; Rusyn, Ivan; Chiu, Weihsueh A
2018-05-01
Human health assessments synthesize human, animal, and mechanistic data to produce toxicity values that are key inputs to risk-based decision making. Traditional assessments are data-, time-, and resource-intensive, and they cannot be developed for most environmental chemicals owing to a lack of appropriate data. As recommended by the National Research Council, we propose a solution for predicting toxicity values for data-poor chemicals through development of quantitative structure-activity relationship (QSAR) models. We used a comprehensive database of chemicals with existing regulatory toxicity values from U.S. federal and state agencies to develop quantitative QSAR models. We compared QSAR-based model predictions to those based on high-throughput screening (HTS) assays. QSAR models for noncancer threshold-based values and cancer slope factors had cross-validation-based Q 2 of 0.25-0.45, mean model errors of 0.70-1.11 log 10 units, and applicability domains covering >80% of environmental chemicals. Toxicity values predicted from QSAR models developed in this study were more accurate and precise than those based on HTS assays or mean-based predictions. A publicly accessible web interface to make predictions for any chemical of interest is available at http://toxvalue.org. An in silico tool that can predict toxicity values with an uncertainty of an order of magnitude or less can be used to quickly and quantitatively assess risks of environmental chemicals when traditional toxicity data or human health assessments are unavailable. This tool can fill a critical gap in the risk assessment and management of data-poor chemicals. https://doi.org/10.1289/EHP2998.
Molnar, Maja; Komar, Mario; Brahmbhatt, Harshad; Babić, Jurislav; Jokić, Stela; Rastija, Vesna
2017-09-05
Deep eutectic solvents, as green and environmentally friendly media, were utilized in the synthesis of novel coumarinyl Schiff bases. Novel derivatives were synthesized from 2-((4-methyl-2-oxo-2 H -chromen-7-yl)oxy)acetohydrazide and corresponding aldehyde in choline chloride:malonic acid (1:1) based deep eutectic solvent. In these reactions, deep eutectic solvent acted as a solvent and catalyst as well. Novel Schiff bases were synthesized in high yields (65-75%) with no need for further purification, and their structures were confirmed by mass spectra, ¹H and 13 C NMR. Furthermore, their antioxidant activity was determined and compared to antioxidant activity of previously synthesized derivatives, thus investigating their structure-activity relationship utilizing quantitative structure-activity relationship QSAR studies. Calculation of molecular descriptors has been performed by DRAGON software. The best QSAR model ( R tr = 0.636; R ext = 0.709) obtained with three descriptors ( MATS3m , Mor22u , Hy ) implies that the pairs of atoms higher mass at the path length 3, three-dimensional arrangement of atoms at scattering parameter s = 21 Å - ¹, and higher number of hydrophilic groups (-OH, -NH) enhanced antioxidant activity. Electrostatic potential surface of the most active compounds showed possible regions for donation of electrons to 1,1-diphenyl-2-picryhydrazyl (DPPH) radicals.
Prediction of Solvent Physical Properties using the Hierarchical Clustering Method
Recently a QSAR (Quantitative Structure Activity Relationship) method, the hierarchical clustering method, was developed to estimate acute toxicity values for large, diverse datasets. This methodology has now been applied to the estimate solvent physical properties including sur...
Ahamad, Shahzaib; Hassan, Md Imtaiyaz; Dwivedi, Neeraja
2018-05-01
Tuberculosis (Tb) is an airborne infectious disease caused by Mycobacterium tuberculosis. Beta-carbonic anhydrase 1 ( β-CA1 ) has emerged as one of the potential targets for new antitubercular drug development. In this work, three-dimensional quantitative structure-activity relationships (3D-QSAR), molecular docking, and molecular dynamics (MD) simulation approaches were performed on a series of natural and synthetic phenol-based β-CA1 inhibitors. The developed 3D-QSAR model ( r 2 = 0.94, q 2 = 0.86, and pred_r 2 = 0.74) indicated that the steric and electrostatic factors are important parameters to modulate the bioactivity of phenolic compounds. Based on this indication, we designed 72 new phenolic inhibitors, out of which two compounds (D25 and D50) effectively stabilized β-CA1 receptor and, thus, are potential candidates for new generation antitubercular drug discovery program.
Toropova, Alla P; Toropov, Andrey A
2013-11-01
The increasing use of nanomaterials incorporated into consumer products leads to the need for developing approaches to establish "quantitative structure-activity relationships" (QSARs) for various nanomaterials. However, the molecular structure as rule is not available for nanomaterials at least in its classic meaning. An possible alternative of classic QSAR (based on the molecular structure) is the using of data on physicochemical features of TiO(2) nanoparticles. The damage to cellular membranes (units L(-1)) by means of various TiO(2) nanoparticles is examined as the endpoint. Copyright © 2013 Elsevier Ltd. All rights reserved.
Bobovská, Adela; Tvaroška, Igor; Kóňa, Juraj
2016-05-01
Human Golgi α-mannosidase II (GMII), a zinc ion co-factor dependent glycoside hydrolase (E.C.3.2.1.114), is a pharmaceutical target for the design of inhibitors with anti-cancer activity. The discovery of an effective inhibitor is complicated by the fact that all known potent inhibitors of GMII are involved in unwanted co-inhibition with lysosomal α-mannosidase (LMan, E.C.3.2.1.24), a relative to GMII. Routine empirical QSAR models for both GMII and LMan did not work with a required accuracy. Therefore, we have developed a fast computational protocol to build predictive models combining interaction energy descriptors from an empirical docking scoring function (Glide-Schrödinger), Linear Interaction Energy (LIE) method, and quantum mechanical density functional theory (QM-DFT) calculations. The QSAR models were built and validated with a library of structurally diverse GMII and LMan inhibitors and non-active compounds. A critical role of QM-DFT descriptors for the more accurate prediction abilities of the models is demonstrated. The predictive ability of the models was significantly improved when going from the empirical docking scoring function to mixed empirical-QM-DFT QSAR models (Q(2)=0.78-0.86 when cross-validation procedures were carried out; and R(2)=0.81-0.83 for a testing set). The average error for the predicted ΔGbind decreased to 0.8-1.1kcalmol(-1). Also, 76-80% of non-active compounds were successfully filtered out from GMII and LMan inhibitors. The QSAR models with the fragmented QM-DFT descriptors may find a useful application in structure-based drug design where pure empirical and force field methods reached their limits and where quantum mechanics effects are critical for ligand-receptor interactions. The optimized models will apply in lead optimization processes for GMII drug developments. Copyright © 2016 Elsevier Inc. All rights reserved.
Kovačević, Strahinja; Karadžić, Milica; Podunavac-Kuzmanović, Sanja; Jevrić, Lidija
2018-01-01
The present study is based on the quantitative structure-activity relationship (QSAR) analysis of binding affinity toward human prion protein (huPrP C ) of quinacrine, pyridine dicarbonitrile, diphenylthiazole and diphenyloxazole analogs applying different linear and non-linear chemometric regression techniques, including univariate linear regression, multiple linear regression, partial least squares regression and artificial neural networks. The QSAR analysis distinguished molecular lipophilicity as an important factor that contributes to the binding affinity. Principal component analysis was used in order to reveal similarities or dissimilarities among the studied compounds. The analysis of in silico absorption, distribution, metabolism, excretion and toxicity (ADMET) parameters was conducted. The ranking of the studied analogs on the basis of their ADMET parameters was done applying the sum of ranking differences, as a relatively new chemometric method. The main aim of the study was to reveal the most important molecular features whose changes lead to the changes in the binding affinities of the studied compounds. Another point of view on the binding affinity of the most promising analogs was established by application of molecular docking analysis. The results of the molecular docking were proven to be in agreement with the experimental outcome. Copyright © 2017 Elsevier B.V. All rights reserved.
Du, Q S; Ma, Y; Xie, N Z; Huang, R B
2014-01-01
In the design of peptide inhibitors the huge possible variety of the peptide sequences is of high concern. In collaboration with the fast accumulation of the peptide experimental data and database, a statistical method is suggested for peptide inhibitor design. In the two-level peptide prediction network (2L-QSAR) one level is the physicochemical properties of amino acids and the other level is the peptide sequence position. The activity contributions of amino acids are the functions of physicochemical properties and the sequence positions. In the prediction equation two weight coefficient sets {ak} and {bl} are assigned to the physicochemical properties and to the sequence positions, respectively. After the two coefficient sets are optimized based on the experimental data of known peptide inhibitors using the iterative double least square (IDLS) procedure, the coefficients are used to evaluate the bioactivities of new designed peptide inhibitors. The two-level prediction network can be applied to the peptide inhibitor design that may aim for different target proteins, or different positions of a protein. A notable advantage of the two-level statistical algorithm is that there is no need for host protein structural information. It may also provide useful insight into the amino acid properties and the roles of sequence positions.
On the virtues of automated quantitative structure-activity relationship: the new kid on the block.
de Oliveira, Marcelo T; Katekawa, Edson
2018-02-01
Quantitative structure-activity relationship (QSAR) has proved to be an invaluable tool in medicinal chemistry. Data availability at unprecedented levels through various databases have collaborated to a resurgence in the interest for QSAR. In this context, rapid generation of quality predictive models is highly desirable for hit identification and lead optimization. We showcase the application of an automated QSAR approach, which randomly selects multiple training/test sets and utilizes machine-learning algorithms to generate predictive models. Results demonstrate that AutoQSAR produces models of improved or similar quality to those generated by practitioners in the field but in just a fraction of the time. Despite the potential of the concept to the benefit of the community, the AutoQSAR opportunity has been largely undervalued.
Bennett, Erin R; Clausen, Jay; Linkov, Eugene; Linkov, Igor
2009-11-01
Reliable, up-front information on physical and biological properties of emerging materials is essential before making a decision and investment to formulate, synthesize, scale-up, test, and manufacture a new material for use in both military and civilian applications. Multiple quantitative structure-activity relationships (QSARs) software tools are available for predicting a material's physical/chemical properties and environmental effects. Even though information on emerging materials is often limited, QSAR software output is treated without sufficient uncertainty analysis. We hypothesize that uncertainty and variability in material properties and uncertainty in model prediction can be too large to provide meaningful results. To test this hypothesis, we predicted octanol water partitioning coefficients (logP) for multiple, similar compounds with limited physical-chemical properties using six different commercial logP calculators (KOWWIN, MarvinSketch, ACD/Labs, ALogP, CLogP, SPARC). Analysis was done for materials with largely uncertain properties that were similar, based on molecular formula, to military compounds (RDX, BTTN, TNT) and pharmaceuticals (Carbamazepine, Gemfibrizol). We have also compared QSAR modeling results for a well-studied pesticide and pesticide breakdown product (Atrazine, DDE). Our analysis shows variability due to structural variations of the emerging chemicals may be several orders of magnitude. The model uncertainty across six software packages was very high (10 orders of magnitude) for emerging materials while it was low for traditional chemicals (e.g. Atrazine). Thus the use of QSAR models for emerging materials screening requires extensive model validation and coupling QSAR output with available empirical data and other relevant information.
Afantitis, Antreas; Melagraki, Georgia; Sarimveis, Haralambos; Koutentis, Panayiotis A; Markopoulos, John; Igglessi-Markopoulou, Olga
2006-08-01
A quantitative-structure activity relationship was obtained by applying Multiple Linear Regression Analysis to a series of 80 1-[2-hydroxyethoxy-methyl]-6-(phenylthio) thymine (HEPT) derivatives with significant anti-HIV activity. For the selection of the best among 37 different descriptors, the Elimination Selection Stepwise Regression Method (ES-SWR) was utilized. The resulting QSAR model (R (2) (CV) = 0.8160; S (PRESS) = 0.5680) proved to be very accurate both in training and predictive stages.
Abuhamdah, Sawsan; Habash, Maha; Taha, Mutasem O
2013-12-01
Inhibition of the enzyme acetylcholinesterase (AChE) has been shown to alleviate neurodegenerative diseases prompting several attempts to discover and optimize new AChE inhibitors. In this direction, we explored the pharmacophoric space of 85 AChE inhibitors to identify high quality pharmacophores. Subsequently, we implemented genetic algorithm-based quantitative structure-activity relationship (QSAR) modeling to select optimal combination of pharmacophoric models and 2D physicochemical descriptors capable of explaining bioactivity variation among training compounds (r2(68)=0.94, F-statistic=125.8, r2 LOO=0.92, r2 PRESS against 17 external test inhibitors = 0.84). Two orthogonal pharmacophores emerged in the QSAR equation suggesting the existence of at least two binding modes accessible to ligands within AChE binding pocket. The successful pharmacophores were comparable with crystallographically resolved AChE binding pocket. We employed the pharmacophoric models and associated QSAR equation to screen the national cancer institute list of compounds. Twenty-four low micromolar AChE inhibitors were identified. The most potent gave IC50 value of 1.0 μM.
Fan, Feng; Cheng, Jiagao; Li, Zhong; Xu, Xiaoyong; Qian, Xuhong
2010-02-01
Molecular aggregation state of bioactive compounds plays a key role in its bio-interactive procedure. In this article, based on the structure information of dimers, the simplest model of molecular aggregation state, and combined with solvational computation, total four descriptors (DeltaV, MR2, DeltaE(1), and DeltaE(2)) were calculated for QSAR study of a novel insect-growth regulator, N-(5-phenyl-1,3,4-oxadiazol-2-yl)-N'-benzoyl urea. Two QSAR models were constructed with r(2) = 0.671, q(2) = 0.516 and r(2) = 0.816, q(2) = 0.695, respectively. It implicates that the bioactivity may strongly depend on the characters of molecular aggregation state, especially on the dimeric transport ability from oil phase to water phase. Copyright 2009 Wiley Periodicals, Inc.
Handa, Koichi; Nakagome, Izumi; Yamaotsu, Noriyuki; Gouda, Hiroaki; Hirono, Shuichi
2015-01-01
The pregnane X receptor [PXR (NR1I2)] induces the expression of xenobiotic metabolic genes and transporter genes. In this study, we aimed to establish a computational method for quantifying the enzyme-inducing potencies of different compounds via their ability to activate PXR, for the application in drug discovery and development. To achieve this purpose, we developed a three-dimensional quantitative structure-activity relationship (3D-QSAR) model using comparative molecular field analysis (CoMFA) for predicting enzyme-inducing potencies, based on computer-ligand docking to multiple PXR protein structures sampled from the trajectory of a molecular dynamics simulation. Molecular mechanics-generalized born/surface area scores representing the ligand-protein-binding free energies were calculated for each ligand. As a result, the predicted enzyme-inducing potencies for compounds generated by the CoMFA model were in good agreement with the experimental values. Finally, we concluded that this 3D-QSAR model has the potential to predict the enzyme-inducing potencies of novel compounds with high precision and therefore has valuable applications in the early stages of the drug discovery process. © 2014 Wiley Periodicals, Inc. and the American Pharmacists Association.
QSAR modeling: where have you been? Where are you going to?
Cherkasov, Artem; Muratov, Eugene N; Fourches, Denis; Varnek, Alexandre; Baskin, Igor I; Cronin, Mark; Dearden, John; Gramatica, Paola; Martin, Yvonne C; Todeschini, Roberto; Consonni, Viviana; Kuz'min, Victor E; Cramer, Richard; Benigni, Romualdo; Yang, Chihae; Rathman, James; Terfloth, Lothar; Gasteiger, Johann; Richard, Ann; Tropsha, Alexander
2014-06-26
Quantitative structure-activity relationship modeling is one of the major computational tools employed in medicinal chemistry. However, throughout its entire history it has drawn both praise and criticism concerning its reliability, limitations, successes, and failures. In this paper, we discuss (i) the development and evolution of QSAR; (ii) the current trends, unsolved problems, and pressing challenges; and (iii) several novel and emerging applications of QSAR modeling. Throughout this discussion, we provide guidelines for QSAR development, validation, and application, which are summarized in best practices for building rigorously validated and externally predictive QSAR models. We hope that this Perspective will help communications between computational and experimental chemists toward collaborative development and use of QSAR models. We also believe that the guidelines presented here will help journal editors and reviewers apply more stringent scientific standards to manuscripts reporting new QSAR studies, as well as encourage the use of high quality, validated QSARs for regulatory decision making.
QSAR Modeling: Where have you been? Where are you going to?
Cherkasov, Artem; Muratov, Eugene N.; Fourches, Denis; Varnek, Alexandre; Baskin, Igor I.; Cronin, Mark; Dearden, John; Gramatica, Paola; Martin, Yvonne C.; Todeschini, Roberto; Consonni, Viviana; Kuz'min, Victor E.; Cramer, Richard; Benigni, Romualdo; Yang, Chihae; Rathman, James; Terfloth, Lothar; Gasteiger, Johann; Richard, Ann; Tropsha, Alexander
2014-01-01
Quantitative Structure-Activity Relationship modeling is one of the major computational tools employed in medicinal chemistry. However, throughout its entire history it has drawn both praise and criticism concerning its reliability, limitations, successes, and failures. In this paper, we discuss: (i) the development and evolution of QSAR; (ii) the current trends, unsolved problems, and pressing challenges; and (iii) several novel and emerging applications of QSAR modeling. Throughout this discussion, we provide guidelines for QSAR development, validation, and application, which are summarized in best practices for building rigorously validated and externally predictive QSAR models. We hope that this Perspective will help communications between computational and experimental chemists towards collaborative development and use of QSAR models. We also believe that the guidelines presented here will help journal editors and reviewers apply more stringent scientific standards to manuscripts reporting new QSAR studies, as well as encourage the use of high quality, validated QSARs for regulatory decision making. PMID:24351051
NASA Astrophysics Data System (ADS)
Santos-Filho, Osvaldo A.; Mishra, Rama K.; Hopfinger, A. J.
2001-09-01
Free energy force field (FEFF) 3D-QSAR analysis was used to construct ligand-receptor binding models for a set of 18 structurally diverse antifolates including pyrimethamine, cycloguanil, methotrexate, aminopterin and trimethoprim, and 13 pyrrolo[2,3-d]pyrimidines. The molecular target (`receptor') used was a 3D-homology model of a specific mutant type of Plasmodium falciparum (Pf) dihydrofolate reductase (DHFR). The dependent variable of the 3D-QSAR models is the IC50 inhibition constant for the specific mutant type of PfDHFR. The independent variables of the 3D-QSAR models (the descriptors) are scaled energy terms of a modified first-generation AMBER force field combined with a hydration shell aqueous solvation model and a collection of 2D-QSAR descriptors often used in QSAR studies. Multiple temperature molecular dynamics simulation (MDS) and the genetic function approximation (GFA) were employed using partial least square (PLS) and multidimensional linear regressions as the fitting functions to develop FEFF 3D-QSAR models for the binding process. The significant FEFF energy terms in the best 3D-QSAR models include energy contributions of the direct ligand-receptor interaction. Some changes in conformational energy terms of the ligand due to binding to the enzyme are also found to be important descriptors. The FEFF 3D-QSAR models indicate some structural features perhaps relevant to the mechanism of resistance of the PfDHFR to current antimalarials. The FEFF 3D-QSAR models are also compared to receptor-independent (RI) 4D-QSAR models developed in an earlier study and subsequently refined using recently developed generalized alignment rules.
3D-QSAR modeling and molecular docking studies on a series of 2,5 disubstituted 1,3,4-oxadiazoles
NASA Astrophysics Data System (ADS)
Ghaleb, Adib; Aouidate, Adnane; Ghamali, Mounir; Sbai, Abdelouahid; Bouachrine, Mohammed; Lakhlifi, Tahar
2017-10-01
3D-QSAR (comparative molecular field analysis (CoMFA)) and comparative molecular similarity indices analysis (CoMSIA) were performed on novel 2,5 disubstituted 1,3,4-oxadiazoles analogues as anti-fungal agents. The CoMFA and CoMSIA models using 13 compounds in the training set gives Q2 values of 0.52 and 0.51 respectively, while R2 values of 0.92. The adapted alignment method with the suitable parameters resulted in reliable models. The contour maps produced by the CoMFA and CoMSIA models were employed to determine a three-dimensional quantitative structure-activity relationship. Based on this study a set of new molecules with high predicted activities were designed. Surflex-docking confirmed the stability of predicted molecules in the receptor.
Majumdar, Subhabrata; Basak, Subhash C
2018-04-26
Proper validation is an important aspect of QSAR modelling. External validation is one of the widely used validation methods in QSAR where the model is built on a subset of the data and validated on the rest of the samples. However, its effectiveness for datasets with a small number of samples but large number of predictors remains suspect. Calculating hundreds or thousands of molecular descriptors using currently available software has become the norm in QSAR research, owing to computational advances in the past few decades. Thus, for n chemical compounds and p descriptors calculated for each molecule, the typical chemometric dataset today has high value of p but small n (i.e. n < p). Motivated by the evidence of inadequacies of external validation in estimating the true predictive capability of a statistical model in recent literature, this paper performs an extensive and comparative study of this method with several other validation techniques. We compared four validation methods: leave-one-out, K-fold, external and multi-split validation, using statistical models built using the LASSO regression, which simultaneously performs variable selection and modelling. We used 300 simulated datasets and one real dataset of 95 congeneric amine mutagens for this evaluation. External validation metrics have high variation among different random splits of the data, hence are not recommended for predictive QSAR models. LOO has the overall best performance among all validation methods applied in our scenario. Results from external validation are too unstable for the datasets we analyzed. Based on our findings, we recommend using the LOO procedure for validating QSAR predictive models built on high-dimensional small-sample data. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
QSAR DataBank - an approach for the digital organization and archiving of QSAR model information
2014-01-01
Background Research efforts in the field of descriptive and predictive Quantitative Structure-Activity Relationships or Quantitative Structure–Property Relationships produce around one thousand scientific publications annually. All the materials and results are mainly communicated using printed media. The printed media in its present form have obvious limitations when they come to effectively representing mathematical models, including complex and non-linear, and large bodies of associated numerical chemical data. It is not supportive of secondary information extraction or reuse efforts while in silico studies poses additional requirements for accessibility, transparency and reproducibility of the research. This gap can and should be bridged by introducing domain-specific digital data exchange standards and tools. The current publication presents a formal specification of the quantitative structure-activity relationship data organization and archival format called the QSAR DataBank (QsarDB for shorter, or QDB for shortest). Results The article describes QsarDB data schema, which formalizes QSAR concepts (objects and relationships between them) and QsarDB data format, which formalizes their presentation for computer systems. The utility and benefits of QsarDB have been thoroughly tested by solving everyday QSAR and predictive modeling problems, with examples in the field of predictive toxicology, and can be applied for a wide variety of other endpoints. The work is accompanied with open source reference implementation and tools. Conclusions The proposed open data, open source, and open standards design is open to public and proprietary extensions on many levels. Selected use cases exemplify the benefits of the proposed QsarDB data format. General ideas for future development are discussed. PMID:24910716
Wang, Zhanhui; Kai, Zhenpeng; Beier, Ross C.; Shen, Jianzhong; Yang, Xinling
2012-01-01
A three-dimensional quantitative structure-activity relationship (3D-QSAR) model of sulfonamide analogs binding a monoclonal antibody (MAbSMR) produced against sulfamerazine was carried out by Distance Comparison (DISCOtech), comparative molecular field analysis (CoMFA), and comparative molecular similarity indices analysis (CoMSIA). The affinities of the MAbSMR, expressed as Log10IC50, for 17 sulfonamide analogs were determined by competitive fluorescence polarization immunoassay (FPIA). The results demonstrated that the proposed pharmacophore model containing two hydrogen-bond acceptors, two hydrogen-bond donors and two hydrophobic centers characterized the structural features of the sulfonamides necessary for MAbSMR binding. Removal of two outliers from the initial set of 17 sulfonamide analogs improved the predictability of the models. The 3D-QSAR models of 15 sulfonamides based on CoMFA and CoMSIA resulted in q2 cv values of 0.600 and 0.523, and r2 values of 0.995 and 0.994, respectively, which indicates that both methods have significant predictive capability. Connolly surface analysis, which mainly focused on steric force fields, was performed to complement the results from CoMFA and CoMSIA. This novel study combining FPIA with pharmacophore modeling demonstrates that multidisciplinary research is useful for investigating antigen-antibody interactions and also may provide information required for the design of new haptens. PMID:22754368
Combinatorial Pharmacophore-Based 3D-QSAR Analysis and Virtual Screening of FGFR1 Inhibitors
Zhou, Nannan; Xu, Yuan; Liu, Xian; Wang, Yulan; Peng, Jianlong; Luo, Xiaomin; Zheng, Mingyue; Chen, Kaixian; Jiang, Hualiang
2015-01-01
The fibroblast growth factor/fibroblast growth factor receptor (FGF/FGFR) signaling pathway plays crucial roles in cell proliferation, angiogenesis, migration, and survival. Aberration in FGFRs correlates with several malignancies and disorders. FGFRs have proved to be attractive targets for therapeutic intervention in cancer, and it is of high interest to find FGFR inhibitors with novel scaffolds. In this study, a combinatorial three-dimensional quantitative structure-activity relationship (3D-QSAR) model was developed based on previously reported FGFR1 inhibitors with diverse structural skeletons. This model was evaluated for its prediction performance on a diverse test set containing 232 FGFR inhibitors, and it yielded a SD value of 0.75 pIC50 units from measured inhibition affinities and a Pearson’s correlation coefficient R2 of 0.53. This result suggests that the combinatorial 3D-QSAR model could be used to search for new FGFR1 hit structures and predict their potential activity. To further evaluate the performance of the model, a decoy set validation was used to measure the efficiency of the model by calculating EF (enrichment factor). Based on the combinatorial pharmacophore model, a virtual screening against SPECS database was performed. Nineteen novel active compounds were successfully identified, which provide new chemical starting points for further structural optimization of FGFR1 inhibitors. PMID:26110383
Speck-Planche, Alejandro; Kleandrova, Valeria V; Luan, Feng; Cordeiro, M Natália D S
2012-08-01
The discovery of new and more potent anti-cancer agents constitutes one of the most active fields of research in chemotherapy. Colorectal cancer (CRC) is one of the most studied cancers because of its high prevalence and number of deaths. In the current pharmaceutical design of more efficient anti-CRC drugs, the use of methodologies based on Chemoinformatics has played a decisive role, including Quantitative-Structure-Activity Relationship (QSAR) techniques. However, until now, there is no methodology able to predict anti-CRC activity of compounds against more than one CRC cell line, which should constitute the principal goal. In an attempt to overcome this problem we develop here the first multi-target (mt) approach for the virtual screening and rational in silico discovery of anti-CRC agents against ten cell lines. Here, two mt-QSAR classification models were constructed using a large and heterogeneous database of compounds. The first model was based on linear discriminant analysis (mt-QSAR-LDA) employing fragment-based descriptors while the second model was obtained using artificial neural networks (mt-QSAR-ANN) with global 2D descriptors. Both models correctly classified more than 90% of active and inactive compounds in training and prediction sets. Some fragments were extracted from the molecules and their contributions to anti-CRC activity were calculated using mt-QSAR-LDA model. Several fragments were identified as potential substructural features responsible for the anti-CRC activity and new molecules designed from those fragments with positive contributions were suggested and correctly predicted by the two models as possible potent and versatile anti-CRC agents. Copyright © 2012 Elsevier Ltd. All rights reserved.
Zhu, Hao; Ye, Lin; Richard, Ann; Golbraikh, Alexander; Wright, Fred A; Rusyn, Ivan; Tropsha, Alexander
2009-08-01
Accurate prediction of in vivo toxicity from in vitro testing is a challenging problem. Large public-private consortia have been formed with the goal of improving chemical safety assessment by the means of high-throughput screening. A wealth of available biological data requires new computational approaches to link chemical structure, in vitro data, and potential adverse health effects. A database containing experimental cytotoxicity values for in vitro half-maximal inhibitory concentration (IC(50)) and in vivo rodent median lethal dose (LD(50)) for more than 300 chemicals was compiled by Zentralstelle zur Erfassung und Bewertung von Ersatz- und Ergaenzungsmethoden zum Tierversuch (ZEBET; National Center for Documentation and Evaluation of Alternative Methods to Animal Experiments). The application of conventional quantitative structure-activity relationship (QSAR) modeling approaches to predict mouse or rat acute LD(50) values from chemical descriptors of ZEBET compounds yielded no statistically significant models. The analysis of these data showed no significant correlation between IC(50) and LD(50). However, a linear IC(50) versus LD(50) correlation could be established for a fraction of compounds. To capitalize on this observation, we developed a novel two-step modeling approach as follows. First, all chemicals are partitioned into two groups based on the relationship between IC(50) and LD(50) values: One group comprises compounds with linear IC(50) versus LD(50) relationships, and another group comprises the remaining compounds. Second, we built conventional binary classification QSAR models to predict the group affiliation based on chemical descriptors only. Third, we developed k-nearest neighbor continuous QSAR models for each subclass to predict LD(50) values from chemical descriptors. All models were extensively validated using special protocols. The novelty of this modeling approach is that it uses the relationships between in vivo and in vitro data only to inform the initial construction of the hierarchical two-step QSAR models. Models resulting from this approach employ chemical descriptors only for external prediction of acute rodent toxicity.
Towards molecular design using 2D-molecular contour maps obtained from PLS regression coefficients
NASA Astrophysics Data System (ADS)
Borges, Cleber N.; Barigye, Stephen J.; Freitas, Matheus P.
2017-12-01
The multivariate image analysis descriptors used in quantitative structure-activity relationships are direct representations of chemical structures as they are simply numerical decodifications of pixels forming the 2D chemical images. These MDs have found great utility in the modeling of diverse properties of organic molecules. Given the multicollinearity and high dimensionality of the data matrices generated with the MIA-QSAR approach, modeling techniques that involve the projection of the data space onto orthogonal components e.g. Partial Least Squares (PLS) have been generally used. However, the chemical interpretation of the PLS-based MIA-QSAR models, in terms of the structural moieties affecting the modeled bioactivity has not been straightforward. This work describes the 2D-contour maps based on the PLS regression coefficients, as a means of assessing the relevance of single MIA predictors to the response variable, and thus allowing for the structural, electronic and physicochemical interpretation of the MIA-QSAR models. A sample study to demonstrate the utility of the 2D-contour maps to design novel drug-like molecules is performed using a dataset of some anti-HIV-1 2-amino-6-arylsulfonylbenzonitriles and derivatives, and the inferences obtained are consistent with other reports in the literature. In addition, the different schemes for encoding atomic properties in molecules are discussed and evaluated.
Al-Sha'er, Mahmoud A; Khanfar, Mohammad A; Taha, Mutasem O
2014-01-01
Urokinase plasminogen activator (uPA)-a serine protease-is thought to play a central role in tumor metastasis and angiogenesis and, therefore, inhibition of this enzyme could be beneficial in treating cancer. Toward this end, we explored the pharmacophoric space of 202 uPA inhibitors using seven diverse sets of inhibitors to identify high-quality pharmacophores. Subsequently, we employed genetic algorithm-based quantitative structure-activity relationship (QSAR) analysis as a competition arena to select the best possible combination of pharmacophoric models and physicochemical descriptors that can explain bioactivity variation within the training inhibitors (r (2) 162 = 0.74, F-statistic = 64.30, r (2) LOO = 0.71, r (2) PRESS against 40 test inhibitors = 0.79). Three orthogonal pharmacophores emerged in the QSAR equation suggesting the existence of at least three binding modes accessible to ligands within the uPA binding pocket. This conclusion was supported by receiver operating characteristic (ROC) curve analyses of the QSAR-selected pharmacophores. Moreover, the three pharmacophores were comparable with binding interactions seen in crystallographic structures of bound ligands within the uPA binding pocket. We employed the resulting pharmacophoric models and associated QSAR equation to screen the national cancer institute (NCI) list of compounds. The captured hits were tested in vitro. Overall, our modeling workflow identified new low micromolar anti-uPA hits.
Artificial intelligence approaches for rational drug design and discovery.
Duch, Włodzisław; Swaminathan, Karthikeyan; Meller, Jarosław
2007-01-01
Pattern recognition, machine learning and artificial intelligence approaches play an increasingly important role in rational drug design, screening and identification of candidate molecules and studies on quantitative structure-activity relationships (QSAR). In this review, we present an overview of basic concepts and methodology in the fields of machine learning and artificial intelligence (AI). An emphasis is put on methods that enable an intuitive interpretation of the results and facilitate gaining an insight into the structure of the problem at hand. We also discuss representative applications of AI methods to docking, screening and QSAR studies. The growing trend to integrate computational and experimental efforts in that regard and some future developments are discussed. In addition, we comment on a broader role of machine learning and artificial intelligence approaches in biomedical research.
3D QSAR models built on structure-based alignments of Abl tyrosine kinase inhibitors.
Falchi, Federico; Manetti, Fabrizio; Carraro, Fabio; Naldini, Antonella; Maga, Giovanni; Crespan, Emmanuele; Schenone, Silvia; Bruno, Olga; Brullo, Chiara; Botta, Maurizio
2009-06-01
Quality QSAR: A combination of docking calculations and a statistical approach toward Abl inhibitors resulted in a 3D QSAR model, the analysis of which led to the identification of ligand portions important for affinity. New compounds designed on the basis of the model were found to have very good affinity for the target, providing further validation of the model itself.The X-ray crystallographic coordinates of the Abl tyrosine kinase domain in its active, inactive, and Src-like inactive conformations were used as targets to simulate the binding mode of a large series of pyrazolo[3,4-d]pyrimidines (known Abl inhibitors) by means of GOLD software. Receptor-based alignments provided by molecular docking calculations were submitted to a GRID-GOLPE protocol to generate 3D QSAR models. Analysis of the results showed that the models based on the inactive and Src-like inactive conformations had very poor statistical parameters, whereas the sole model based on the active conformation of Abl was characterized by significant internal and external predictive ability. Subsequent analysis of GOLPE PLS pseudo-coefficient contour plots of this model gave us a better understanding of the relationships between structure and affinity, providing suggestions for the next optimization process. On the basis of these results, new compounds were designed according to the hydrophobic and hydrogen bond donor and acceptor contours, and were found to have improved enzymatic and cellular activity with respect to parent compounds. Additional biological assays confirmed the important role of the selected compounds as inhibitors of cell proliferation in leukemia cells.
Investigation into adamantane-based M2 inhibitors with FB-QSAR.
Wei, Hang; Wang, Cheng-Hua; Du, Qi-Shi; Meng, Jianzong; Chou, Kuo-Chen
2009-07-01
Because of their high resistance rate to the existing drugs, influenza A viruses have become a threat to human beings. It is known that the replication of influenza A viruses needs a pH-gated proton channel, the so-called M2 channel. Therefore, to develop effective drugs against influenza A, the most logic strategy is to inhibit the M2 channel. Recently, the atomic structure of the M2 channel was determined by NMR spectroscopy (Schnell, J.R. and Chou, J.J., Nature, 2008, 451, 591-595). The high-resolution NMR structure has provided a solid basis for structure-based drug design approaches. In this study, a benchmark dataset has been constructed that contains 34 newly-developed adamantane-based M2 inhibitors and covers considerable structural diversities and wide range of bioactivities. Based on these compounds, an in-depth analysis was performed with the newly developed fragment-based quantitative structure-activity relationship (FB-QSAR) algorithm. The results thus obtained provide useful insights for dealing with the drug-resistant problem and designing effective adamantane-based antiflu drugs.
Mathematical modeling of tetrahydroimidazole benzodiazepine-1-one derivatives as an anti HIV agent
NASA Astrophysics Data System (ADS)
Ojha, Lokendra Kumar
2017-07-01
The goal of the present work is the study of drug receptor interaction via QSAR (Quantitative Structure-Activity Relationship) analysis for 89 set of TIBO (Tetrahydroimidazole Benzodiazepine-1-one) derivatives. MLR (Multiple Linear Regression) method is utilized to generate predictive models of quantitative structure-activity relationships between a set of molecular descriptors and biological activity (IC50). The best QSAR model was selected having a correlation coefficient (r) of 0.9299 and Standard Error of Estimation (SEE) of 0.5022, Fisher Ratio (F) of 159.822 and Quality factor (Q) of 1.852. This model is statistically significant and strongly favours the substitution of sulphur atom, IS i.e. indicator parameter for -Z position of the TIBO derivatives. Two other parameter logP (octanol-water partition coefficient) and SAG (Surface Area Grid) also played a vital role in the generation of best QSAR model. All three descriptor shows very good stability towards data variation in leave-one-out (LOO).
NASA Astrophysics Data System (ADS)
Hsieh, Jui-Hua; Wang, Xiang S.; Teotico, Denise; Golbraikh, Alexander; Tropsha, Alexander
2008-09-01
The use of inaccurate scoring functions in docking algorithms may result in the selection of compounds with high predicted binding affinity that nevertheless are known experimentally not to bind to the target receptor. Such falsely predicted binders have been termed `binding decoys'. We posed a question as to whether true binders and decoys could be distinguished based only on their structural chemical descriptors using approaches commonly used in ligand based drug design. We have applied the k-Nearest Neighbor ( kNN) classification QSAR approach to a dataset of compounds characterized as binders or binding decoys of AmpC beta-lactamase. Models were subjected to rigorous internal and external validation as part of our standard workflow and a special QSAR modeling scheme was employed that took into account the imbalanced ratio of inhibitors to non-binders (1:4) in this dataset. 342 predictive models were obtained with correct classification rate (CCR) for both training and test sets as high as 0.90 or higher. The prediction accuracy was as high as 100% (CCR = 1.00) for the external validation set composed of 10 compounds (5 true binders and 5 decoys) selected randomly from the original dataset. For an additional external set of 50 known non-binders, we have achieved the CCR of 0.87 using very conservative model applicability domain threshold. The validated binary kNN QSAR models were further employed for mining the NCGC AmpC screening dataset (69653 compounds). The consensus prediction of 64 compounds identified as screening hits in the AmpC PubChem assay disagreed with their annotation in PubChem but was in agreement with the results of secondary assays. At the same time, 15 compounds were identified as potential binders contrary to their annotation in PubChem. Five of them were tested experimentally and showed inhibitory activities in millimolar range with the highest binding constant Ki of 135 μM. Our studies suggest that validated QSAR models could complement structure based docking and scoring approaches in identifying promising hits by virtual screening of molecular libraries.
Ai, Yong; Wang, Shao-Teng; Sun, Ping-Hua; Song, Fa-Jun
2011-01-01
Aurora kinases have emerged as attractive targets for the design of anticancer drugs. 3D-QSAR (comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA)) and Surflex-docking studies were performed on a series of pyrrole-indoline-2-ones as Aurora A inhibitors. The CoMFA and CoMSIA models using 25 inhibitors in the training set gave r2cv values of 0.726 and 0.566, and r2 values of 0.972 and 0.984, respectively. The adapted alignment method with the suitable parameters resulted in reliable models. The contour maps produced by the CoMFA and CoMSIA models were employed to rationalize the key structural requirements responsible for the activity. Surflex-docking studies revealed that the sulfo group, secondary amine group on indolin-2-one, and carbonyl of 6,7-dihydro-1H-indol-4(5H)-one groups were significant for binding to the receptor, and some essential features were also identified. Based on the 3D-QSAR and docking results, a set of new molecules with high predicted activities were designed. PMID:21673910
Ai, Yong; Wang, Shao-Teng; Sun, Ping-Hua; Song, Fa-Jun
2011-01-01
Aurora kinases have emerged as attractive targets for the design of anticancer drugs. 3D-QSAR (comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA)) and Surflex-docking studies were performed on a series of pyrrole-indoline-2-ones as Aurora A inhibitors. The CoMFA and CoMSIA models using 25 inhibitors in the training set gave r(2) (cv) values of 0.726 and 0.566, and r(2) values of 0.972 and 0.984, respectively. The adapted alignment method with the suitable parameters resulted in reliable models. The contour maps produced by the CoMFA and CoMSIA models were employed to rationalize the key structural requirements responsible for the activity. Surflex-docking studies revealed that the sulfo group, secondary amine group on indolin-2-one, and carbonyl of 6,7-dihydro-1H-indol-4(5H)-one groups were significant for binding to the receptor, and some essential features were also identified. Based on the 3D-QSAR and docking results, a set of new molecules with high predicted activities were designed.
Nendza, Monika; Kühne, Ralph; Lombardo, Anna; Strempel, Sebastian; Schüürmann, Gerrit
2018-03-01
Aquatic bioconcentration factors (BCFs) are critical in PBT (persistent, bioaccumulative, toxic) and risk assessment of chemicals. High costs and use of more than 100 fish per standard BCF study (OECD 305) call for alternative methods to replace as much in vivo testing as possible. The BCF waiving scheme is a screening tool combining QSAR classifications based on physicochemical properties related to the distribution (hydrophobicity, ionisation), persistence (biodegradability, hydrolysis), solubility and volatility (Henry's law constant) of substances in water bodies and aquatic biota to predict substances with low aquatic bioaccumulation (nonB, BCF<2000). The BCF waiving scheme was developed with a dataset of reliable BCFs for 998 compounds and externally validated with another 181 substances. It performs with 100% sensitivity (no false negatives), >50% efficacy (waiving potential), and complies with the OECD principles for valid QSARs. The chemical applicability domain of the BCF waiving scheme is given by the structures of the training set, with some compound classes explicitly excluded like organometallics, poly- and perfluorinated compounds, aromatic triphenylphosphates, surfactants. The prediction confidence of the BCF waiving scheme is based on applicability domain compliance, consensus modelling, and the structural similarity with known nonB and B/vB substances. Compounds classified as nonB by the BCF waiving scheme are candidates for waiving of BCF in vivo testing on fish due to low concern with regard to the B criterion. The BCF waiving scheme supports the 3Rs with a possible reduction of >50% of BCF in vivo testing on fish. If the target chemical is outside the applicability domain of the BCF waiving scheme or not classified as nonB, further assessments with in silico, in vitro or in vivo methods are necessary to either confirm or reject bioaccumulative behaviour. Copyright © 2017 Elsevier B.V. All rights reserved.
The collection of chemical structures and associated experimental data for QSAR modeling is facilitated by the increasing number and size of public databases. However, the performance of QSAR models highly depends on the quality of the data used and the modeling methodology. The ...
QSAR studies of benzofuran/benzothiophene biphenyl derivatives as inhibitors of PTPase-1B
Kaushik, D.; Kumar, R.; Saxena, A. K.
2010-01-01
Objectives: Insulin resistance is associated with a defect in protein tyrosine phosphorylation in the insulin signal transduction cascade. The PTPase enzyme dephosphorylates the active form of the insulin receptor and thus attenuates its tyrosine kinase activity, therefore, the need for a potent PTPase inhibitor exists, with the intention of which the QSAR was performed. Materials and Methods: Quantitative structure-activity relationship (QSAR) has been established on a series of 106 compounds considering 27 variables, for novel biphenyl analogs, using the SYSTAT (Version 7.0) software, for their protein tyrosine phosphatase (PTPase-1B) inhibitor activity, in order to understand the essential structural requirement for binding with the receptor. Results: Among several regression models, one per series was selected on the basis of a high correlation coefficient (r, 0.86), least standard deviation (s, 0.234), and a high value of significance for the maximum number of subjects (n, 101). Conclusions: The influence of the different physicochemical parameters of the substituents in various positions has been discussed by generating the best QSAR model using multiple regression analysis, and the information thus obtained from the present study can be used to design and predict more potent molecules as PTPase-1B inhibitors, prior to their synthesis. PMID:21814427
NASA Astrophysics Data System (ADS)
Rondla, Rohini; Padma Rao, Lavanya Souda; Ramatenki, Vishwanath; Vadija, Rajender; Mukkera, Thirupathi; Potlapally, Sarita Rajender; Vuruputuri, Uma
2017-04-01
The cyclin-dependent kinase 4 (CDK4) enzyme is a key regulator in cell cycle G1 phase progression. It is often overexpressed in variety of cancer cells, which makes it an attractive therapeutic target for cancer treatment. A number of chemical scaffolds have been reported as CDK4 inhibitors in the literature, and in particular azolium scaffolds as potential inhibitors. Here, a ligand based pharmacophore modeling and an atom based 3D-QSAR analyses for a series of azolium based CDK4 inhibitors are presented. A five point pharmacophore hypothesis, i.e. APRRR with one H-bond acceptor (A), one positive cationic feature (P) and three ring aromatic sites (R) is developed, which yielded an atom based 3D-QSAR model that shows an excellent correlation coefficient value- R2 = 0.93, fisher ratio- F = 207, along with good predictive ability- Q2 = 0.79, and Pearson R value = 0.89. The visual inspection of the 3D-QSAR model, with the most active and the least active ligands, demonstrates the favorable and unfavorable structural regions for the activity towards CDK4. The roles of positively charged nitrogen, the steric effect, ligand flexibility, and the substituents on the activity are in good agreement with the previously reported experimental results. The generated 3D QSAR model is further applied as query for a 3D database screening, which identifies 23 lead drug candidates with good predicted activities and diverse scaffolds. The ADME analysis reveals that, the pharmacokinetic parameters of all the identified new leads are within the acceptable range.
Automated workflows for data curation and standardization of chemical structures for QSAR modeling
Large collections of chemical structures and associated experimental data are publicly available, and can be used to build robust QSAR models for applications in different fields. One common concern is the quality of both the chemical structure information and associated experime...
Bauer, Katharina Christin; Hämmerling, Frank; Kittelmann, Jörg; Dürr, Cathrin; Görlich, Fabian; Hubbuch, Jürgen
2017-04-01
Information about protein-protein interactions provides valuable knowledge about the phase behavior of protein solutions during the biopharmaceutical production process. Up to date it is possible to capture their overall impact by an experimentally determined potential of mean force. For the description of this potential, the second virial coefficient B22, the diffusion interaction parameter kD, the storage modulus G', or the diffusion coefficient D is applied. In silico methods do not only have the potential to predict these parameters, but also to provide deeper understanding of the molecular origin of the protein-protein interactions by correlating the data to the protein's three-dimensional structure. This methodology furthermore allows a lower sample consumption and less experimental effort. Of all in silico methods, QSAR modeling, which correlates the properties of the molecule's structure with the experimental behavior, seems to be particularly suitable for this purpose. To verify this, the study reported here dealt with the determination of a QSAR model for the diffusion coefficient of proteins. This model consisted of diffusion coefficients for six different model proteins at various pH values and NaCl concentrations. The generated QSAR model showed a good correlation between experimental and predicted data with a coefficient of determination R2 = 0.9 and a good predictability for an external test set with R2 = 0.91. The information about the properties affecting protein-protein interactions present in solution was in agreement with experiment and theory. Furthermore, the model was able to give a more detailed picture of the protein properties influencing the diffusion coefficient and the acting protein-protein interactions. Biotechnol. Bioeng. 2017;114: 821-831. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Quasi-QSAR for mutagenic potential of multi-walled carbon-nanotubes.
Toropov, Andrey A; Toropova, Alla P
2015-04-01
Available on the Internet, the CORAL software (http://www.insilico.eu/coral) has been used to build up quasi-quantitative structure-activity relationships (quasi-QSAR) for prediction of mutagenic potential of multi-walled carbon-nanotubes (MWCNTs). In contrast with the previous models built up by CORAL which were based on representation of the molecular structure by simplified molecular input-line entry system (SMILES) the quasi-QSARs based on the representation of conditions (not on the molecular structure) such as concentration, presence (absence) S9 mix, the using (or without the using) of preincubation were encoded by so-called quasi-SMILES. The statistical characteristics of these models (quasi-QSARs) for three random splits into the visible training set and test set and invisible validation set are the following: (i) split 1: n=13, r(2)=0.8037, q(2)=0.7260, s=0.033, F=45 (training set); n=5, r(2)=0.9102, s=0.071 (test set); n=6, r(2)=0.7627, s=0.044 (validation set); (ii) split 2: n=13, r(2)=0.6446, q(2)=0.4733, s=0.045, F=20 (training set); n=5, r(2)=0.6785, s=0.054 (test set); n=6, r(2)=0.9593, s=0.032 (validation set); and (iii) n=14, r(2)=0.8087, q(2)=0.6975, s=0.026, F=51 (training set); n=5, r(2)=0.9453, s=0.074 (test set); n=5, r(2)=0.8951, s=0.052 (validation set). Copyright © 2014 Elsevier Ltd. All rights reserved.
López-Lira, Claudia; Alzate-Morales, Jans H; Paulino, Margot; Mella-Raipán, Jaime; Salas, Cristian O; Tapia, Ricardo A; Soto-Delgado, Jorge
2018-01-01
A combination of three-dimensional quantitative structure-activity relationship (3D-QSAR), and molecular modelling methods were used to understand the potent inhibitory NAD(P)H:quinone oxidoreductase 1 (NQO1) activity of a set of 52 heterocyclic quinones. Molecular docking results indicated that some favourable interactions of key amino acid residues at the binding site of NQO1 with these quinones would be responsible for an improvement of the NQO1 activity of these compounds. The main interactions involved are hydrogen bond of the amino group of residue Tyr128, π-stacking interactions with Phe106 and Phe178, and electrostatic interactions with flavin adenine dinucleotide (FADH) cofactor. Three models were prepared by 3D-QSAR analysis. The models derived from Model I and Model III, shown leave-one-out cross-validation correlation coefficients (q 2 LOO ) of .75 and .73 as well as conventional correlation coefficients (R 2 ) of .93 and .95, respectively. In addition, the external predictive abilities of these models were evaluated using a test set, producing the predicted correlation coefficients (r 2 pred ) of .76 and .74, respectively. The good concordance between the docking results and 3D-QSAR contour maps provides helpful information about a rational modification of new molecules based in quinone scaffold, in order to design more potent NQO1 inhibitors, which would exhibit highly potent antitumor activity. © 2017 John Wiley & Sons A/S.
Li, ZhiLiang; Wu, ShiRong; Chen, ZeCong; Ye, Nancy; Yang, ShengXi; Liao, ChunYang; Zhang, MengJun; Yang, Li; Mei, Hu; Yang, Yan; Zhao, Na; Zhou, Yuan; Zhou, Ping; Xiong, Qing; Xu, Hong; Liu, ShuShen; Ling, ZiHua; Chen, Gang; Li, GenRong
2007-10-01
Only from the primary structures of peptides, a new set of descriptors called the molecular electronegativity edge-distance vector (VMED) was proposed and applied to describing and characterizing the molecular structures of oligopeptides and polypeptides, based on the electronegativity of each atom or electronic charge index (ECI) of atomic clusters and the bonding distance between atom-pairs. Here, the molecular structures of antigenic polypeptides were well expressed in order to propose the automated technique for the computerized identification of helper T lymphocyte (Th) epitopes. Furthermore, a modified MED vector was proposed from the primary structures of polypeptides, based on the ECI and the relative bonding distance of the fundamental skeleton groups. The side-chains of each amino acid were here treated as a pseudo-atom. The developed VMED was easy to calculate and able to work. Some quantitative model was established for 28 immunogenic or antigenic polypeptides (AGPP) with 14 (1-14) A(d) and 14 other restricted activities assigned as "1"(+) and "0"(-), respectively. The latter comprised 6 A(b)(15-20), 3 A(k)(21-23), 2 E(k)(24-26), 2 H-2(k)(27 and 28) restricted sequences. Good results were obtained with 90% correct classification (only 2 wrong ones for 20 training samples) and 100% correct prediction (none wrong for 8 testing samples); while contrastively 100% correct classification (none wrong for 20 training samples) and 88% correct classification (1 wrong for 8 testing samples). Both stochastic samplings and cross validations were performed to demonstrate good performance. The described method may also be suitable for estimation and prediction of classes I and II for major histocompatibility antigen (MHC) epitope of human. It will be useful in immune identification and recognition of proteins and genes and in the design and development of subunit vaccines. Several quantitative structure activity relationship (QSAR) models were developed for various oligopeptides and polypeptides including 58 dipeptides and 31 pentapeptides with angiotensin converting enzyme (ACE) inhibition by multiple linear regression (MLR) method. In order to explain the ability to characterize molecular structure of polypeptides, a molecular modeling investigation on QSAR was performed for functional prediction of polypeptide sequences with antigenic activity and heptapeptide sequences with tachykinin activity through quantitative sequence-activity models (QSAMs) by the molecular electronegativity edge-distance vector (VMED). The results showed that VMED exhibited both excellent structural selectivity and good activity prediction. Moreover, the results showed that VMED behaved quite well for both QSAR and QSAM of poly-and oligopeptides, which exhibited both good estimation ability and prediction power, equal to or better than those reported in the previous references. Finally, a preliminary conclusion was drawn: both classical and modified MED vectors were very useful structural descriptors. Some suggestions were proposed for further studies on QSAR/QSAM of proteins in various fields.
Acetylcholinesterase (AChE), a serine hydrolase vital for regulating the neurotransmitter acetylcholine in animals, has been used as a target for drugs and pesticides. With the increasing availability of AChE crystal structures, with or without ligands bound, structure-based appr...
Comber, Mike H I; Walker, John D; Watts, Chris; Hermens, Joop
2003-08-01
The use of quantitative structure-activity relationships (QSARs) for deriving the predicted no-effect concentration of discrete organic chemicals for the purposes of conducting a regulatory risk assessment in Europe and the United States is described. In the United States, under the Toxic Substances Control Act (TSCA), the TSCA Interagency Testing Committee and the U.S. Environmental Protection Agency (U.S. EPA) use SARs to estimate the hazards of existing and new chemicals. Within the Existing Substances Regulation in Europe, QSARs may be used for data evaluation, test strategy indications, and the identification and filling of data gaps. To illustrate where and when QSARs may be useful and when their use is more problematic, an example, methyl tertiary-butyl ether (MTBE), is given and the predicted and experimental data are compared. Improvements needed for new QSARs and tools for developing and using QSARs are discussed.
Gini, Giuseppina
2016-01-01
In this chapter, we introduce the basis of computational chemistry and discuss how computational methods have been extended to some biological properties and toxicology, in particular. Since about 20 years, chemical experimentation is more and more replaced by modeling and virtual experimentation, using a large core of mathematics, chemistry, physics, and algorithms. Then we see how animal experiments, aimed at providing a standardized result about a biological property, can be mimicked by new in silico methods. Our emphasis here is on toxicology and on predicting properties through chemical structures. Two main streams of such models are available: models that consider the whole molecular structure to predict a value, namely QSAR (Quantitative Structure Activity Relationships), and models that find relevant substructures to predict a class, namely SAR. The term in silico discovery is applied to chemical design, to computational toxicology, and to drug discovery. We discuss how the experimental practice in biological science is moving more and more toward modeling and simulation. Such virtual experiments confirm hypotheses, provide data for regulation, and help in designing new chemicals.
Wang, Yuan; Wu, Mingwei; Ai, Chunzhi; Wang, Yonghua
2015-01-01
Presently, 151 widely-diverse pyridinylimidazole-based compounds that show inhibitory activities at the TNF-α release were investigated. By using the distance comparison technique (DISCOtech), comparative molecular field analysis (CoMFA), and comparative molecular similarity index analysis (CoMSIA) methods, the pharmacophore models and the three-dimensional quantitative structure-activity relationships (3D-QSAR) of the compounds were explored. The proposed pharmacophore model, including two hydrophobic sites, two aromatic centers, two H-bond donor atoms, two H-bond acceptor atoms, and two H-bond donor sites characterizes the necessary structural features of TNF-α release inhibitors. Both the resultant CoMFA and CoMSIA models exhibited satisfactory predictability (with Q2 (cross-validated correlation coefficient) = 0.557, R2ncv (non-cross-validated correlation coefficient) = 0.740, R2pre (predicted correlation coefficient) = 0.749 and Q2 = 0.598, R2ncv = 0.767, R2pre = 0.860, respectively). Good consistency was observed between the 3D-QSAR models and the pharmacophore model that the hydrophobic interaction and hydrogen bonds play crucial roles in the mechanism of actions. The corresponding contour maps generated by these models provide more diverse information about the key intermolecular interactions of inhibitors with the surrounding environment. All these models have extended the understanding of imidazole-based compounds in the structure-activity relationship, and are useful for rational design and screening of novel 2-thioimidazole-based TNF-α release inhibitors. PMID:26307982
Wang, Yuan; Wu, Mingwei; Ai, Chunzhi; Wang, Yonghua
2015-08-25
Presently, 151 widely-diverse pyridinylimidazole-based compounds that show inhibitory activities at the TNF-α release were investigated. By using the distance comparison technique (DISCOtech), comparative molecular field analysis (CoMFA), and comparative molecular similarity index analysis (CoMSIA) methods, the pharmacophore models and the three-dimensional quantitative structure-activity relationships (3D-QSAR) of the compounds were explored. The proposed pharmacophore model, including two hydrophobic sites, two aromatic centers, two H-bond donor atoms, two H-bond acceptor atoms, and two H-bond donor sites characterizes the necessary structural features of TNF-α release inhibitors. Both the resultant CoMFA and CoMSIA models exhibited satisfactory predictability (with Q(2) (cross-validated correlation coefficient) = 0.557, R(2)ncv (non-cross-validated correlation coefficient) = 0.740, R(2)pre (predicted correlation coefficient) = 0.749 and Q(2) = 0.598, R(2)ncv = 0.767, R(2)pre = 0.860, respectively). Good consistency was observed between the 3D-QSAR models and the pharmacophore model that the hydrophobic interaction and hydrogen bonds play crucial roles in the mechanism of actions. The corresponding contour maps generated by these models provide more diverse information about the key intermolecular interactions of inhibitors with the surrounding environment. All these models have extended the understanding of imidazole-based compounds in the structure-activity relationship, and are useful for rational design and screening of novel 2-thioimidazole-based TNF-α release inhibitors.
Elaborate ligand-based modeling reveal new submicromolar Rho kinase inhibitors
NASA Astrophysics Data System (ADS)
Shahin, Rand; AlQtaishat, Saja; Taha, Mutasem O.
2012-02-01
Rho Kinase (ROCKII) has been recently implicated in several cardiovascular diseases prompting several attempts to discover and optimize new ROCKII inhibitors. Towards this end we explored the pharmacophoric space of 138 ROCKII inhibitors to identify high quality pharmacophores. The pharmacophoric models were subsequently allowed to compete within quantitative structure-activity relationship (QSAR) context. Genetic algorithm and multiple linear regression analysis were employed to select an optimal combination of pharmacophoric models and 2D physicochemical descriptors capable of accessing self-consistent QSAR of optimal predictive potential ( r 77 = 0.84, F = 18.18, r LOO 2 = 0.639, r PRESS 2 against 19 external test inhibitors = 0.494). Two orthogonal pharmacophores emerged in the QSAR equation suggesting the existence of at least two binding modes accessible to ligands within ROCKII binding pocket. Receiver operating characteristic (ROC) curve analyses established the validity of QSAR-selected pharmacophores. Moreover, the successful pharmacophores models were found to be comparable with crystallographically resolved ROCKII binding pocket. We employed the pharmacophoric models and associated QSAR equation to screen the national cancer institute (NCI) list of compounds Eight submicromolar ROCKII inhibitors were identified. The most potent gave IC50 values of 0.7 and 1.0 μM.
Giesen, Daniel; van Gestel, Cornelis A M
2013-03-01
Quantitative structure-activity relationships (QSARs) are an established tool in environmental risk assessment and a valuable alternative to the exhaustive use of test animals under REACH. In this study a QSAR was developed for the toxicity of a series of six chloroanilines to the soil-dwelling collembolan Folsomia candida in standardized natural LUFA2.2 soil. Toxicity endpoints incorporated in the QSAR were the concentrations causing 10% (EC10) and 50% (EC50) reduction in reproduction of F. candida. Toxicity was based on concentrations in interstitial water estimated from nominal concentrations in the soil and published soil-water partition coefficients. Estimated effect concentrations were negatively correlated with the lipophilicity of the compounds. Interstitial water concentrations for both the EC10 and EC50 for four compounds were determined by using solid-phase microextraction (SPME). Measured and estimated concentrations were comparable only for tetra- and pentachloroaniline. With decreasing chlorination the disparity between modelled and actual concentrations increased. Optimisation of the QSAR therefore could not be accomplished, showing the necessity to move from total soil to (bio)available concentration measurements. Copyright © 2012 Elsevier Ltd. All rights reserved.
Alves, Vinicius M.; Muratov, Eugene; Fourches, Denis; Strickland, Judy; Kleinstreuer, Nicole; Andrade, Carolina H.; Tropsha, Alexander
2015-01-01
Skin permeability is widely considered to be mechanistically implicated in chemically-induced skin sensitization. Although many chemicals have been identified as skin sensitizers, there have been very few reports analyzing the relationships between molecular structure and skin permeability of sensitizers and non-sensitizers. The goals of this study were to: (i) compile, curate, and integrate the largest publicly available dataset of chemicals studied for their skin permeability; (ii) develop and rigorously validate QSAR models to predict skin permeability; and (iii) explore the complex relationships between skin sensitization and skin permeability. Based on the largest publicly available dataset compiled in this study, we found no overall correlation between skin permeability and skin sensitization. In addition, cross-species correlation coefficient between human and rodent permeability data was found to be as low as R2=0.44. Human skin permeability models based on the random forest method have been developed and validated using OECD-compliant QSAR modeling workflow. Their external accuracy was high (Q2ext = 0.73 for 63% of external compounds inside the applicability domain). The extended analysis using both experimentally-measured and QSAR-imputed data still confirmed the absence of any overall concordance between skin permeability and skin sensitization. This observation suggests that chemical modifications that affect skin permeability should not be presumed a priori to modulate the sensitization potential of chemicals. The models reported herein as well as those developed in the companion paper on skin sensitization suggest that it may be possible to rationally design compounds with the desired high skin permeability but low sensitization potential. PMID:25560673
Xiao, Ruiyang; Ye, Tiantian; Wei, Zongsu; Luo, Shuang; Yang, Zhihui; Spinney, Richard
2015-11-17
The sulfate radical anion (SO4•–) based oxidation of trace organic contaminants (TrOCs) has recently received great attention due to its high reactivity and low selectivity. In this study, a meta-analysis was conducted to better understand the role of functional groups on the reactivity between SO4•– and TrOCs. The results indicate that compounds in which electron transfer and addition channels dominate tend to exhibit a faster second-order rate constants (kSO4•–) than that of H–atom abstraction, corroborating the SO4•– reactivity and mechanisms observed in the individual studies. Then, a quantitative structure activity relationship (QSAR) model was developed using a sequential approach with constitutional, geometrical, electrostatic, and quantum chemical descriptors. Two descriptors, ELUMO and EHOMO energy gap (ELUMO–EHOMO) and the ratio of oxygen atoms to carbon atoms (#O:C), were found to mechanistically and statistically affect kSO4•– to a great extent with the standardized QSAR model: ln kSO4•– = 26.8–3.97 × #O:C – 0.746 × (ELUMO–EHOMO). In addition, the correlation analysis indicates that there is no dominant reaction channel for SO4•– reactions with various structurally diverse compounds. Our QSAR model provides a robust predictive tool for estimating emerging micropollutants removal using SO4•– during wastewater treatment processes.
QSAR of phytochemicals for the design of better drugs.
Kar, Supratik; Roy, Kunal
2012-10-01
Phytochemicals have been the single most prolific source of leads for the development of new drug entities from the dawn of the drug discovery. They cover a wide range of therapeutic indications with a great diversity of chemical structures. The research fraternity still believes in exploring the phytochemicals for new drug discovery. Application of molecular biological techniques has increased the availability of novel compounds that can be conveniently isolated from natural sources. Combinatorial chemistry approaches are being applied based on phytochemical scaffolds to create screening libraries that closely resemble drug-like compounds. In silico techniques like quantitative structure-activity relationships (QSAR), pharmacophore and virtual screening are playing crucial and rate accelerating steps for the better drug design in modern era. QSAR models of different classes of phytochemicals covering different therapeutic areas are thoroughly discussed in the review. Further, the authors have enlisted all the available phytochemical databases for the convenience of researchers working in the area. This review justifies the need to develop more QSAR models for the design of better drugs from phytochemicals. Technical drawbacks associated with phytochemical research have been lessened, and there are better opportunities to explore the biological activity of previously inaccessible sources of phytochemicals although there is still the need to reduce the time and cost involvement in such exercise. The future possibilities for the integration of ethnopharmacology with QSAR, place us at an exciting stage that will allow us to explore plant sources worldwide and design better drugs.
The effects of characteristics of substituents on toxicity of the nitroaromatics: HiT QSAR study
NASA Astrophysics Data System (ADS)
Kuz'min, Victor E.; Muratov, Eugene N.; Artemenko, Anatoly G.; Gorb, Leonid; Qasim, Mohammad; Leszczynski, Jerzy
2008-10-01
The present study applies the Hierarchical Technology for Quantitative Structure-Activity Relationships (HiT QSAR) for (i) evaluation of the influence of the characteristics of 28 nitroaromatic compounds (some of which belong to a widely known class of explosives) as to their toxicity; (ii) prediction of toxicity for new nitroaromatic derivatives; (iii) analysis of the effects of substituents in nitroaromatic compounds on their toxicity in vivo. The 50% lethal dose concentration for rats (LD50) was used to develop the QSAR models based on simplex representation of molecular structure. The preliminary 1D QSAR results show that even the information on the composition of molecules reveals the main tendencies of changes in toxicity. The statistic characteristics for partial least squares 2D QSAR models are quite satisfactory ( R 2 = 0.96-0.98; Q 2 = 0.91-0.93; R 2 test = 0.89-0.92), which allows us to carry out the prediction of activity for 41 novel compounds designed by the application of new combinations of substituents represented in the training set. The comprehensive analysis of toxicity changes as a function of substituent position and nature was carried out. Molecular fragments that promote and interfere with toxicity were defined on the basis of the obtained models. It was shown that the mutual influence of substituents in the benzene ring plays a crucial role regarding toxicity. The influence of different substituents on toxicity can be mediated via different C-H fragments of the aromatic ring.
Large collections of chemical structures and associated experimental data are publicly available, and can be used to build robust QSAR models for applications in different fields. One common concern is the quality of both the chemical structure information and associated experime...
NASA Astrophysics Data System (ADS)
Alloui, Mebarka; Belaidi, Salah; Othmani, Hasna; Jaidane, Nejm-Eddine; Hochlaf, Majdi
2018-03-01
We performed benchmark studies on the molecular geometry, electron properties and vibrational analysis of imidazole using semi-empirical, density functional theory and post Hartree-Fock methods. These studies validated the use of AM1 for the treatment of larger systems. Then, we treated the structural, physical and chemical relationships for a series of imidazole derivatives acting as angiotensin II AT1 receptor blockers using AM1. QSAR studies were done for these imidazole derivatives using a combination of various physicochemical descriptors. A multiple linear regression procedure was used to design the relationships between molecular descriptor and the activity of imidazole derivatives. Results validate the derived QSAR model.
NASA Astrophysics Data System (ADS)
Guziałowska-Tic, Joanna
2017-10-01
According to the Directive of the European Parliament and of the Council concerning the protection of animals used for scientific purposes, the number of experiments involving the use of animals needs to be reduced. The methods which can replace animal testing include computational prediction methods, for instance, the quantitative structure-activity relationships (QSAR). These methods are designed to find a cohesive relationship between differences in the values of the properties of molecules and the biological activity of a series of test compounds. This paper compares the results of the author's own results of examination on the n-octanol/water coefficient for the hydroxyester HE-1 with those generated by means of three models: Kowwin, MlogP, AlogP. The test results indicate that, in the case of molecular similarity, the highest determination coefficient was obtained for the model MlogP and the lowest root-mean square error was obtained for the Kowwin method. When comparing the mean logP value obtained using the QSAR models with the value resulting from the author's own experiments, it was observed that the best conformity was that recorded for the model AlogP, where relative error was 15.2%.
Are the Chemical Structures in your QSAR Correct?
Quantitative structure-activity relationships (QSARs) are used to predict many different endpoints, utilize hundreds and even thousands of different parameters (or descriptors), and are created using a variety of approaches. The one thing they all have in common is the assumptio...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kiwamoto, R., E-mail: reiko.kiwamoto@wur.nl; Spenkelink, A.; Rietjens, I.M.C.M.
Acyclic α,β-unsaturated aldehydes present in food raise a concern because the α,β-unsaturated aldehyde moiety is considered a structural alert for genotoxicity. However, controversy remains on whether in vivo at realistic dietary exposure DNA adduct formation is significant. The aim of the present study was to develop physiologically based kinetic/dynamic (PBK/D) models to examine dose-dependent detoxification and DNA adduct formation of a group of 18 food-borne acyclic α,β-unsaturated aldehydes without 2- or 3-alkylation, and with no more than one conjugated double bond. Parameters for the PBK/D models were obtained using quantitative structure–activity relationships (QSARs) defined with a training set of sixmore » selected aldehydes. Using the QSARs, PBK/D models for the other 12 aldehydes were defined. Results revealed that DNA adduct formation in the liver increases with decreasing bulkiness of the molecule especially due to less efficient detoxification. 2-Propenal (acrolein) was identified to induce the highest DNA adduct levels. At realistic dietary intake, the predicted DNA adduct levels for all aldehydes were two orders of magnitude lower than endogenous background levels observed in disease free human liver, suggesting that for all 18 aldehydes DNA adduct formation is negligible at the relevant levels of dietary intake. The present study provides a proof of principle for the use of QSAR-based PBK/D modelling to facilitate group evaluations and read-across in risk assessment. - Highlights: • Physiologically based in silico models were made for 18 α,β-unsaturated aldehydes. • Kinetic parameters were determined by in vitro incubations and a QSAR approach. • DNA adduct formation was negligible at levels relevant for dietary intake. • The use of QSAR-based PBK/D modelling facilitates group evaluations and read-across.« less
Mladenović, Milan; Mihailović, Mirjana; Bogojević, Desanka; Matić, Sanja; Nićiforović, Neda; Mihailović, Vladimir; Vuković, Nenad; Sukdolak, Slobodan; Solujić, Slavica
2011-01-01
The series of fifteen synthesized 4-hydroxycoumarin derivatives was subjected to antioxidant activity evaluation in vitro, through total antioxidant capacity, 1,1-diphenyl-2-picryl-hydrazyl (DPPH), hydroxyl radical, lipid peroxide scavenging and chelating activity. The highest activity was detected during the radicals scavenging, with 2b, 6b, 2c, and 4c noticed as the most active. The antioxidant activity was further quantified by the quantitative structure-activity relationships (QSAR) studies. For this purpose, the structures were optimized using Paramethric Method 6 (PM6) semi-empirical and Density Functional Theory (DFT) B3LYP methods. Bond dissociation enthalpies of coumarin 4-OH, Natural Bond Orbital (NBO) gained hybridization of the oxygen, acidity of the hydrogen atom and various molecular descriptors obtained, were correlated with biological activity, after which we designed 20 new antioxidant structures, using the most favorable structural motifs, with much improved predicted activity in vitro. PMID:21686153
Approaches to developing alternative and predictive toxicology based on PBPK/PD and QSAR modeling.
Yang, R S; Thomas, R S; Gustafson, D L; Campain, J; Benjamin, S A; Verhaar, H J; Mumtaz, M M
1998-01-01
Systematic toxicity testing, using conventional toxicology methodologies, of single chemicals and chemical mixtures is highly impractical because of the immense numbers of chemicals and chemical mixtures involved and the limited scientific resources. Therefore, the development of unconventional, efficient, and predictive toxicology methods is imperative. Using carcinogenicity as an end point, we present approaches for developing predictive tools for toxicologic evaluation of chemicals and chemical mixtures relevant to environmental contamination. Central to the approaches presented is the integration of physiologically based pharmacokinetic/pharmacodynamic (PBPK/PD) and quantitative structure--activity relationship (QSAR) modeling with focused mechanistically based experimental toxicology. In this development, molecular and cellular biomarkers critical to the carcinogenesis process are evaluated quantitatively between different chemicals and/or chemical mixtures. Examples presented include the integration of PBPK/PD and QSAR modeling with a time-course medium-term liver foci assay, molecular biology and cell proliferation studies. Fourier transform infrared spectroscopic analyses of DNA changes, and cancer modeling to assess and attempt to predict the carcinogenicity of the series of 12 chlorobenzene isomers. Also presented is an ongoing effort to develop and apply a similar approach to chemical mixtures using in vitro cell culture (Syrian hamster embryo cell transformation assay and human keratinocytes) methodologies and in vivo studies. The promise and pitfalls of these developments are elaborated. When successfully applied, these approaches may greatly reduce animal usage, personnel, resources, and time required to evaluate the carcinogenicity of chemicals and chemical mixtures. Images Figure 6 PMID:9860897
Quantitative Structure--Activity Relationship Modeling of Rat Acute Toxicity by Oral Exposure
Background: Few Quantitative Structure-Activity Relationship (QSAR) studies have successfully modeled large, diverse rodent toxicity endpoints. Objective: In this study, a combinatorial QSAR approach has been employed for the creation of robust and predictive models of acute toxi...
QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS FOR CHEMICAL REDUCTIONS OF ORGANIC CONTAMINANTS
Sufficient kinetic data on abiotic reduction reactions involving organic contaminants are now available that quantitative structure-activity relationships (QSARs) for these reactions can be developed. Over 50 QSARs have been reported, most in just the last few years, and they ar...
Acetylcholinesterase (AChE), a serine hydrolase vital for regulating the neurotransmitter acetylcholine in animals, has been used as a target for drugs and pesticides. With the increasing availability of AChE crystal structures, with or without ligands bound, structure-based appr...
Vijaya Prabhu, Sitrarasu; Singh, Sanjeev Kumar
2018-05-28
Atom-based three dimensional-quantitative structure-activity relationship (3D-QSAR) model was developed on the basis of 5-point pharmacophore hypothesis (AARRR) with two hydrogen bond acceptors (A) and three aromatic rings for the derivatives of thieno[2,3-b]pyridine, which modulates the activity to inhibit the mGluR5 receptor. Generation of a highly predictive 3D-QSAR model was performed using the alignment of predicted pharmacophore hypothesis for the training set (R 2 = 0.84, SD = 0.26, F = 45.8, N = 29) and test set (Q 2 = 0.74, RMSE = 0.235, Pearson-R = 0.94, N = 9). The best pharmacophore hypothesis AARRR was selected, and developed three dimensional-quantitative structure activity relationship (3D-QSAR) model also supported the outcome of this study by means of favorable and unfavorable electron withdrawing group and hydrophobic regions of most active compound 42d and least active compound 18b. Following, induced fit docking and binding free energy calculations reveals the reliable binding orientation of the compounds. Finally, molecular dynamics simulations for 100 ns were performed to depict the protein-ligand stability. We anticipate that the resulted outcome could be supportive to discover potent negative allosteric modulators for metabotropic glutamate receptor 5 (mGluR5).
NASA Astrophysics Data System (ADS)
Assefa, Haregewein; Kamath, Shantaram; Buolamwini, John K.
2003-08-01
The overexpression and/or mutation of the epidermal growth factor receptor (EGFR) tyrosine kinase has been observed in many human solid tumors, and is under intense investigation as a novel anticancer molecular target. Comparative 3D-QSAR analyses using different alignments were undertaken employing comparative molecular field analysis (CoMFA) and comparative molecular similarity analysis (CoMSIA) for 122 anilinoquinazoline and 50 anilinoquinoline inhibitors of EGFR kinase. The SYBYL multifit alignment rule was applied to three different conformational templates, two obtained from a MacroModel Monte Carlo conformational search, and one from the bound conformation of erlotinib in complex with EGFR in the X-ray crystal structure. In addition, a flexible ligand docking alignment obtained with the GOLD docking program, and a novel flexible receptor-guided consensus dynamics alignment obtained with the DISCOVER program in the INSIGHTII modeling package were also investigated. 3D-QSAR models with q2 values up to 0.70 and r2 values up to 0.97 were obtained. Among the 4-anilinoquinazoline set, the q2 values were similar, but the ability of the different conformational models to predict the activities of an external test set varied considerably. In this regard, the model derived using the X-ray crystallographically determined bioactive conformation of erlotinib afforded the best predictive model. Electrostatic, hydrophobic and H-bond donor descriptors contributed the most to the QSAR models of the 4-anilinoquinazolines, whereas electrostatic, hydrophobic and H-bond acceptor descriptors contributed the most to the 4-anilinoquinoline QSAR, particularly the H-bond acceptor descriptor. A novel receptor-guided consensus dynamics alignment has also been introduced for 3D-QSAR studies. This new alignment method may incorporate to some extent ligand-receptor induced fit effects into 3D-QSAR models.
Hamadache, Mabrouk; Benkortbi, Othmane; Hanini, Salah; Amrane, Abdeltif; Khaouane, Latifa; Si Moussa, Cherif
2016-02-13
Quantitative Structure Activity Relationship (QSAR) models are expected to play an important role in the risk assessment of chemicals on humans and the environment. In this study, we developed a validated QSAR model to predict acute oral toxicity of 329 pesticides to rats because a few QSAR models have been devoted to predict the Lethal Dose 50 (LD50) of pesticides on rats. This QSAR model is based on 17 molecular descriptors, and is robust, externally predictive and characterized by a good applicability domain. The best results were obtained with a 17/9/1 Artificial Neural Network model trained with the Quasi Newton back propagation (BFGS) algorithm. The prediction accuracy for the external validation set was estimated by the Q(2)ext and the root mean square error (RMS) which are equal to 0.948 and 0.201, respectively. 98.6% of external validation set is correctly predicted and the present model proved to be superior to models previously published. Accordingly, the model developed in this study provides excellent predictions and can be used to predict the acute oral toxicity of pesticides, particularly for those that have not been tested as well as new pesticides. Copyright © 2015 Elsevier B.V. All rights reserved.
Manoj Kumar, Palanivelu; Karthikeyan, Chandrabose; Hari Narayana Moorthy, Narayana Subbiah; Trivedi, Piyush
2006-11-01
In the present paper, quantitative structure activity relationship (QSAR) approach was applied to understand the affinity and selectivity of a novel series of triaryl imidazole derivatives towards glucagon receptor. Statistically significant and highly predictive QSARs were derived for glucagon receptor inhibition by triaryl imidazoles using QuaSAR descriptors of molecular operating environment (MOE) employing computer-assisted multiple regression procedure. The generated QSAR models revealed that factors related to hydrophobicity, molecular shape and geometry predominantly influences glucagon receptor binding affinity of the triaryl imidazoles indicating the relevance of shape specific steric interactions between the molecule and the receptor. Further, QSAR models formulated for selective inhibition of glucagon receptor over p38 mitogen activated protein (MAP) kinase of the compounds in the series highlights that the same structural features, which influence the glucagon receptor affinity, also contribute to their selective inhibition.
Gade, Deepak Reddy; Makkapati, Amareswararao; Yarlagadda, Rajesh Babu; Peters, Godefridus J; Sastry, B S; Rajendra Prasad, V V S
2018-06-01
Overexpression of P-glycoprotein (P-gp) leads to the emergence of multidrug resistance (MDR) in cancer treatment. Acridones have the potential to reverse MDR and sensitize cells. In the present study, we aimed to elucidate the chemosensitization potential of acridones by employing various molecular modelling techniques. Pharmacophore modeling was performed for the dataset of chemosensitizing acridones earlier proved for cytotoxic activity against MCF7 breast cancer cell line. Gaussian-based QSAR studies also performed to predict the favored and disfavored region of the acridone molecules. Molecular dynamics simulations were performed for compound 10 and human P-glycoprotein (obtained from Homology modeling). An efficient pharmacophore containing 2 hydrogen bond acceptors and 3 aromatic rings (AARRR.14) was identified. NCI 2012 chemical database was screened against AARRR.14 CPH and identified 25 best-fit molecules. Potential regions of the compound were identified through Field (Gaussian) based QSAR. Regression analysis of atom-based QSAR resulted in r 2 of 0.95 and q 2 of 0.72, whereas, regression analysis of field-based QSAR resulted in r 2 of 0.92 and q 2 of 0.87 along with r 2 cv as 0.71. The fate of the acridone molecule (compound 10) in the P-glycoprotein environment is analyzed through analyzing the conformational changes occurring during the molecular dynamics simulations. Combined data of different in silico techniques provided basis for deeper understanding of structural and mechanistic insights of interaction phenomenon of acridones with P-glycoprotein and also as strategic basis for designing more potent molecules for anti-cancer and multidrug resistance reversal activities. Copyright © 2018 Elsevier Ltd. All rights reserved.
Li, Chao; Xie, Hong-Bin; Chen, Jingwen; Yang, Xianhai; Zhang, Yifei; Qiao, Xianliang
2014-12-02
Short chain chlorinated paraffins (SCCPs) are under evaluation for inclusion in the Stockholm Convention on persistent organic pollutants. However, information on their reaction rate constants with gaseous ·OH (kOH) is unavailable, limiting the evaluation of their persistence in the atmosphere. Experimental determination of kOH is confined by the unavailability of authentic chemical standards for some SCCP congeners. In this study, we evaluated and selected density functional theory (DFT) methods to predict kOH of SCCPs, by comparing the experimental kOH values of six polychlorinated alkanes (PCAs) with those calculated by the different theoretical methods. We found that the M06-2X/6-311+G(3df,2pd)//B3LYP/6-311 +G(d,p) method is time-effective and can be used to predict kOH of PCAs. Moreover, based on the calculated kOH of nine SCCPs and available experimental kOH values of 22 PCAs with low carbon chain, a quantitative structure-activity relationship (QSAR) model was developed. The molecular structural characteristics determining the ·OH reaction rate were discussed. logkOH was found to negatively correlate with the percentage of chlorine substitutions (Cl%). The DFT calculation method and the QSAR model are important alternatives to the conventional experimental determination of kOH for SCCPs, and are prospective in predicting their persistence in the atmosphere.
NASA Astrophysics Data System (ADS)
Filho, Edilson B. A.; Moraes, Ingrid A.; Weber, Karen C.; Rocha, Gerd B.; Vasconcellos, Mário L. A. A.
2012-08-01
Morita-Baylis-Hillman Adducts (MBHA) has been recently synthesized and bio-evaluated by our research group against Leishmania amazonensis, parasite that causes cutaneous and mucocutaneous leishmaniasis. We present here a theoretical conformational study of thirty-two leismanicidal MBHA by B3LYP/6-31+g(d) calculations with Polarized Continuum Model (PCM) to simulate water influence. Intramolecular Hydrogen Bonds (IHBs) indicated to control the most conformational preferences of MBHA. Quantum Theory Atoms in Molecules (QTAIM) calculations were able to characterize these interactions at Bond Critical Point level. Compounds presenting an unusual seven member IHB between NO2 group and hydroxyl moiety, supported by experimental spectroscopic data, showed a considerable improvement of biological activity (lower IC50 values). These results are in accordance to redox NO2 mechanism of action. Based on structural observations, some molecular descriptors were calculated and submitted to Quantitative Structure-Activity Relationship (QSAR) studies through the PLS Regression Method. These studies provided a model with good validation parameters values (R2 = 0.71, Q2 = 0.61 and Qext2 = 0.92).
Ul-Haq, Zaheer; Effendi, Juweria Shahrukh; Ashraf, Sajda; Bkhaitan, Majdi M
2017-06-01
In the current study, quantitative three-dimensional structure-activity-relationship (3D-QSAR) method was performed to design a model for new chemical entities by utilizing pyrazolopyrimidines. Their inhibiting activity on receptor IL-2 Itk correlates descriptors based on topology and hydrophobicity. The best model developed by ligand-based (atom-based) approach has correlation-coefficient of r 2 : 0.987 and cross-validated squared correlation-coefficient of q 2 : 0.541 with an external prediction capability of r 2 : 0.944. Whereas the best selected model developed by structured-based (receptor-based) approach has correlation-coefficient of r 2 : 0.987, cross-validated squared correlation-coefficient of q 2 : 0.637 with an external predictive ability of r 2 : 0.941. The statistical parameters prove that structure-based gave a better model to design new chemical scaffolds. The results achieved indicated that hydrophobicity at R 1 location play a vital role in the inhibitory activity and introduction of appropriately bulky and strongly hydrophobic-groups at position 3 of the terminal phenyl-group which is highly significant to enhance the activity. Six new pyrazolopyrimidine derivatives were designed. Docking simulation study was carried out and their inhibitory activity was predicted by the best structure based model with predictive activity of ranging from 8.43 to 8.85 log unit. The interacting residues PHE435, ASP500, LYS391, GLU436, MET438, CYS442, ILE369, VAL377 of PDB 4HCT were studied with respect to type of bonding with the new compounds. This study was aimed to search out more potent inhibitors of IL-2 Itk. Copyright © 2017 Elsevier Inc. All rights reserved.
Methods to enable the design of bioactive small molecules targeting RNA
Disney, Matthew D.; Yildirim, Ilyas; Childs-Disney, Jessica L.
2014-01-01
RNA is an immensely important target for small molecule therapeutics or chemical probes of function. However, methods that identify, annotate, and optimize RNA-small molecule interactions that could enable the design of compounds that modulate RNA function are in their infancies. This review describes recent approaches that have been developed to understand and optimize RNA motif-small molecule interactions, including Structure-Activity Relationships Through Sequencing (StARTS), quantitative structure-activity relationships (QSAR), chemical similarity searching, structure-based design and docking, and molecular dynamics (MD) simulations. Case studies described include the design of small molecules targeting RNA expansions, the bacterial A-site, viral RNAs, and telomerase RNA. These approaches can be combined to afford a synergistic method to exploit the myriad of RNA targets in the transcriptome. PMID:24357181
Methods to enable the design of bioactive small molecules targeting RNA.
Disney, Matthew D; Yildirim, Ilyas; Childs-Disney, Jessica L
2014-02-21
RNA is an immensely important target for small molecule therapeutics or chemical probes of function. However, methods that identify, annotate, and optimize RNA-small molecule interactions that could enable the design of compounds that modulate RNA function are in their infancies. This review describes recent approaches that have been developed to understand and optimize RNA motif-small molecule interactions, including structure-activity relationships through sequencing (StARTS), quantitative structure-activity relationships (QSAR), chemical similarity searching, structure-based design and docking, and molecular dynamics (MD) simulations. Case studies described include the design of small molecules targeting RNA expansions, the bacterial A-site, viral RNAs, and telomerase RNA. These approaches can be combined to afford a synergistic method to exploit the myriad of RNA targets in the transcriptome.
Apablaza, Gastón; Montoya, Luisa; Morales-Verdejo, Cesar; Mellado, Marco; Cuellar, Mauricio; Lagos, Carlos F; Soto-Delgado, Jorge; Chung, Hery; Pessoa-Mahana, Carlos David; Mella, Jaime
2017-03-05
The β₃ adrenergic receptor is raising as an important drug target for the treatment of pathologies such as diabetes, obesity, depression, and cardiac diseases among others. Several attempts to obtain selective and high affinity ligands have been made. Currently, Mirabegron is the only available drug on the market that targets this receptor approved for the treatment of overactive bladder. However, the FDA (Food and Drug Administration) in USA and the MHRA (Medicines and Healthcare products Regulatory Agency) in UK have made reports of potentially life-threatening side effects associated with the administration of Mirabegron, casting doubts on the continuity of this compound. Therefore, it is of utmost importance to gather information for the rational design and synthesis of new β₃ adrenergic ligands. Herein, we present the first combined 2D-QSAR (two-dimensional Quantitative Structure-Activity Relationship) and 3D-QSAR/CoMSIA (three-dimensional Quantitative Structure-Activity Relationship/Comparative Molecular Similarity Index Analysis) study on a series of potent β₃ adrenergic agonists of indole-alkylamine structure. We found a series of changes that can be made in the steric, hydrogen-bond donor and acceptor, lipophilicity and molar refractivity properties of the compounds to generate new promising molecules. Finally, based on our analysis, a summary and a regiospecific description of the requirements for improving β₃ adrenergic activity is given.
Novel 1,4-naphthoquinone-based sulfonamides: Synthesis, QSAR, anticancer and antimalarial studies.
Pingaew, Ratchanok; Prachayasittikul, Veda; Worachartcheewan, Apilak; Nantasenamat, Chanin; Prachayasittikul, Supaluk; Ruchirawat, Somsak; Prachayasittikul, Virapong
2015-10-20
A novel series of 1,4-naphthoquinones (33-44) tethered by open and closed chain sulfonamide moieties were designed, synthesized and evaluated for their cytotoxic and antimalarial activities. All quinone-sulfonamide derivatives displayed a broad spectrum of cytotoxic activities against all of the tested cancer cell lines including HuCCA-1, HepG2, A549 and MOLT-3. Most quinones (33-36 and 38-43) exerted higher anticancer activity against HepG2 cell than that of the etoposide. The open chain analogs 36 and 42 were shown to be the most potent compounds. Notably, the restricted sulfonamide analog 38 with 6,7-dimethoxy groups exhibited the most potent antimalarial activity (IC₅₀ = 2.8 μM). Quantitative structure-activity relationships (QSAR) study was performed to reveal important chemical features governing the biological activities. Five constructed QSAR models provided acceptable predictive performance (Rcv 0.5647-0.9317 and RMSEcv 0.1231-0.2825). Four additional sets of structurally modified compounds were generated in silico (34a-34d, 36a-36k, 40a-40d and 42a-42k) in which their activities were predicted using the constructed QSAR models. A comprehensive discussion of the structure-activity relationships was made and a set of promising compounds (i.e., 33, 36, 38, 42, 36d, 36f, 42e, 42g and 42f) was suggested for further development as anticancer and antimalarial agents. Copyright © 2015 Elsevier Masson SAS. All rights reserved.
Van Bossuyt, Melissa; Van Hoeck, Els; Raitano, Giuseppa; Manganelli, Serena; Braeken, Els; Ates, Gamze; Vanhaecke, Tamara; Van Miert, Sabine; Benfenati, Emilio; Mertens, Birgit; Rogiers, Vera
2017-04-01
Over the last years, more stringent safety requirements for an increasing number of chemicals across many regulatory fields (e.g. industrial chemicals, pharmaceuticals, food, cosmetics, …) have triggered the need for an efficient screening strategy to prioritize the substances of highest concern. In this context, alternative methods such as in silico (i.e. computational) techniques gain more and more importance. In the current study, a new prioritization strategy for identifying potentially mutagenic substances was developed based on the combination of multiple (quantitative) structure-activity relationship ((Q)SAR) tools. Non-evaluated substances used in printed paper and board food contact materials (FCM) were selected for a case study. By applying our strategy, 106 out of the 1723 substances were assigned 'high priority' as they were predicted mutagenic by 4 different (Q)SAR models. Information provided within the models allowed to identify 53 substances for which Ames mutagenicity prediction already has in vitro Ames test results. For further prioritization, additional support could be obtained by applying local i.e. specific models, as demonstrated here for aromatic azo compounds, typically found in printed paper and board FCM. The strategy developed here can easily be applied to other groups of chemicals facing the same need for priority ranking. Copyright © 2017 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Li, Wenlian; Xiao, Faqi; Zhou, Mingming; Jiang, Xuejin; Liu, Jun; Si, Hongzong; Xie, Meng; Ma, Xiuting; Duan, Yunbo; Zhai, Honglin
2016-09-01
The three dimensional-quantitative structure activity relationship (3D-QSAR) study was performed on a series of 4-hydroxyamino α-pyranone carboxamide analogues using comparative molecular similarity indices analysis (COMSIA). The purpose of the present study was to develop a satisfactory model providing a reliable prediction based on 4-hydroxyamino α-pyranone carboxamide analogues as anti-HCV (hepatitis C virus) inhibitors. The statistical results and the results of validation of this optimum COMSIA model were satisfactory. Furthermore, analysis of the contour maps helped to provide guidelines for finding structural requirement. Therefore, the satisfactory results from this study may provide useful guidelines for drug development of anti-HCV inhibitors.
[Application of Kohonen Self-Organizing Feature Maps in QSAR of human ADMET and kinase data sets].
Hegymegi-Barakonyi, Bálint; Orfi, László; Kéri, György; Kövesdi, István
2013-01-01
QSAR predictions have been proven very useful in a large number of studies for drug design, such as kinase inhibitor design as targets for cancer therapy, however the overall predictability often remains unsatisfactory. To improve predictability of ADMET features and kinase inhibitory data, we present a new method using Kohonen's Self-Organizing Feature Map (SOFM) to cluster molecules based on explanatory variables (X) and separate dissimilar ones. We calculated SOFM clusters for a large number of molecules with human ADMET and kinase inhibitory data, and we showed that chemically similar molecules were in the same SOFM cluster, and within such clusters the QSAR models had significantly better predictability. We used also target variables (Y, e.g. ADMET) jointly with X variables to create a novel type of clustering. With our method, cells of loosely coupled XY data could be identified and separated into different model building sets.
Increasing availability of large collections of chemical structures and associated experimental data provides an opportunity to build robust QSAR models for applications in different fields. One common concern is the quality of both the chemical structure information and associat...
The present study explores the merit of utilizing available pharmaceutical data to construct a quantitative structure-activity relationship (QSAR) for prediction of the fraction of a chemical unbound to plasma protein (Fub) in environmentally relevant compounds. Independent model...
Posa, Mihalj; Pilipović, Ana; Lalić, Mladena; Popović, Jovan
2011-02-15
Linear dependence between temperature (t) and retention coefficient (k, reversed phase HPLC) of bile acids is obtained. Parameters (a, intercept and b, slope) of the linear function k=f(t) highly correlate with bile acids' structures. Investigated bile acids form linear congeneric groups on a principal component (calculated from k=f(t)) score plot that are in accordance with conformations of the hydroxyl and oxo groups in a bile acid steroid skeleton. Partition coefficient (K(p)) of nitrazepam in bile acids' micelles is investigated. Nitrazepam molecules incorporated in micelles show modified bioavailability (depo effect, higher permeability, etc.). Using multiple linear regression method QSAR models of nitrazepams' partition coefficient, K(p) are derived on the temperatures of 25°C and 37°C. For deriving linear regression models on both temperatures experimentally obtained lipophilicity parameters are included (PC1 from data k=f(t)) and in silico descriptors of the shape of a molecule while on the higher temperature molecular polarisation is introduced. This indicates the fact that the incorporation mechanism of nitrazepam in BA micelles changes on the higher temperatures. QSAR models are derived using partial least squares method as well. Experimental parameters k=f(t) are shown to be significant predictive variables. Both QSAR models are validated using cross validation and internal validation method. PLS models have slightly higher predictive capability than MLR models. Copyright © 2010 Elsevier B.V. All rights reserved.
Kim, J; Lee, C; Chong, Y
2009-01-01
Influenza endonucleases have appeared as an attractive target of antiviral therapy for influenza infection. With the purpose of designing a novel antiviral agent with enhanced biological activities against influenza endonuclease, a three-dimensional quantitative structure-activity relationships (3D-QSAR) model was generated based on 34 influenza endonuclease inhibitors. The comparative molecular similarity index analysis (CoMSIA) with a steric, electrostatic and hydrophobic (SEH) model showed the best correlative and predictive capability (q(2) = 0.763, r(2) = 0.969 and F = 174.785), which provided a pharmacophore composed of the electronegative moiety as well as the bulky hydrophobic group. The CoMSIA model was used as a pharmacophore query in the UNITY search of the ChemDiv compound library to give virtual active compounds. The 3D-QSAR model was then used to predict the activity of the selected compounds, which identified three compounds as the most likely inhibitor candidates.
Predicting Drug-induced Hepatotoxicity Using QSAR and Toxicogenomics Approaches
Low, Yen; Uehara, Takeki; Minowa, Yohsuke; Yamada, Hiroshi; Ohno, Yasuo; Urushidani, Tetsuro; Sedykh, Alexander; Muratov, Eugene; Fourches, Denis; Zhu, Hao; Rusyn, Ivan; Tropsha, Alexander
2014-01-01
Quantitative Structure-Activity Relationship (QSAR) modeling and toxicogenomics are used independently as predictive tools in toxicology. In this study, we evaluated the power of several statistical models for predicting drug hepatotoxicity in rats using different descriptors of drug molecules, namely their chemical descriptors and toxicogenomic profiles. The records were taken from the Toxicogenomics Project rat liver microarray database containing information on 127 drugs (http://toxico.nibio.go.jp/datalist.html). The model endpoint was hepatotoxicity in the rat following 28 days of exposure, established by liver histopathology and serum chemistry. First, we developed multiple conventional QSAR classification models using a comprehensive set of chemical descriptors and several classification methods (k nearest neighbor, support vector machines, random forests, and distance weighted discrimination). With chemical descriptors alone, external predictivity (Correct Classification Rate, CCR) from 5-fold external cross-validation was 61%. Next, the same classification methods were employed to build models using only toxicogenomic data (24h after a single exposure) treated as biological descriptors. The optimized models used only 85 selected toxicogenomic descriptors and had CCR as high as 76%. Finally, hybrid models combining both chemical descriptors and transcripts were developed; their CCRs were between 68 and 77%. Although the accuracy of hybrid models did not exceed that of the models based on toxicogenomic data alone, the use of both chemical and biological descriptors enriched the interpretation of the models. In addition to finding 85 transcripts that were predictive and highly relevant to the mechanisms of drug-induced liver injury, chemical structural alerts for hepatotoxicity were also identified. These results suggest that concurrent exploration of the chemical features and acute treatment-induced changes in transcript levels will both enrich the mechanistic understanding of sub-chronic liver injury and afford models capable of accurate prediction of hepatotoxicity from chemical structure and short-term assay results. PMID:21699217
(Q)SARs to predict environmental toxicities: current status and future needs.
Cronin, Mark T D
2017-03-22
The current state of the art of (Quantitative) Structure-Activity Relationships ((Q)SARs) to predict environmental toxicity is assessed along with recommendations to develop these models further. The acute toxicity of compounds acting by the non-polar narcotic mechanism of action can be well predicted, however other approaches, including read-across, may be required for compounds acting by specific mechanisms of action. The chronic toxicity of compounds to environmental species is more difficult to predict from (Q)SARs, with robust data sets and more mechanistic information required. In addition, the toxicity of mixtures is little addressed by (Q)SAR approaches. Developments in environmental toxicology including Adverse Outcome Pathways (AOPs) and omics responses should be utilised to develop better, more mechanistically relevant, (Q)SAR models.
Ruiz, Patricia; Begluitti, Gino; Tincher, Terry; Wheeler, John; Mumtaz, Moiz
2012-07-27
Predicting toxicity quantitatively, using Quantitative Structure Activity Relationships (QSAR), has matured over recent years to the point that the predictions can be used to help identify missing comparison values in a substance's database. In this manuscript we investigate using the lethal dose that kills fifty percent of a test population (LD₅₀) for determining relative toxicity of a number of substances. In general, the smaller the LD₅₀ value, the more toxic the chemical, and the larger the LD₅₀ value, the lower the toxicity. When systemic toxicity and other specific toxicity data are unavailable for the chemical(s) of interest, during emergency responses, LD₅₀ values may be employed to determine the relative toxicity of a series of chemicals. In the present study, a group of chemical warfare agents and their breakdown products have been evaluated using four available rat oral QSAR LD₅₀ models. The QSAR analysis shows that the breakdown products of Sulfur Mustard (HD) are predicted to be less toxic than the parent compound as well as other known breakdown products that have known toxicities. The QSAR estimated break down products LD₅₀ values ranged from 299 mg/kg to 5,764 mg/kg. This evaluation allows for the ranking and toxicity estimation of compounds for which little toxicity information existed; thus leading to better risk decision making in the field.
Cerruela García, G; García-Pedrajas, N; Luque Ruiz, I; Gómez-Nieto, M Á
2018-03-01
This paper proposes a method for molecular activity prediction in QSAR studies using ensembles of classifiers constructed by means of two supervised subspace projection methods, namely nonparametric discriminant analysis (NDA) and hybrid discriminant analysis (HDA). We studied the performance of the proposed ensembles compared to classical ensemble methods using four molecular datasets and eight different models for the representation of the molecular structure. Using several measures and statistical tests for classifier comparison, we observe that our proposal improves the classification results with respect to classical ensemble methods. Therefore, we show that ensembles constructed using supervised subspace projections offer an effective way of creating classifiers in cheminformatics.
2013-01-01
The disappointing results obtained in recent clinical trials renew the interest in experimental/computational techniques for the discovery of neuroprotective drugs. In this context, multitarget or multiplexing QSAR models (mt-QSAR/mx-QSAR) may help to predict neurotoxicity/neuroprotective effects of drugs in multiple assays, on drug targets, and in model organisms. In this work, we study a data set downloaded from CHEMBL; each data point (>8000) contains the values of one out of 37 possible measures of activity, 493 assays, 169 molecular or cellular targets, and 11 different organisms (including human) for a given compound. In this work, we introduce the first mx-QSAR model for neurotoxicity/neuroprotective effects of drugs based on the MARCH-INSIDE (MI) method. First, we used MI to calculate the stochastic spectral moments (structural descriptors) of all compounds. Next, we found a model that classified correctly 2955 out of 3548 total cases in the training and validation series with Accuracy, Sensitivity, and Specificity values > 80%. The model also showed excellent results in Computational-Chemistry simulations of High-Throughput Screening (CCHTS) experiments, with accuracy = 90.6% for 4671 positive cases. Next, we reported the synthesis, characterization, and experimental assays of new rasagiline derivatives. We carried out three different experimental tests: assay (1) in the absence of neurotoxic agents, assay (2) in the presence of glutamate, and assay (3) in the presence of H2O2. Compounds 11 with 27.4%, 8 with 11.6%, and 9 with 15.4% showed the highest neuroprotective effects in assays (1), (2), and (3), respectively. After that, we used the mx-QSAR model to carry out a CCHTS of the new compounds in >400 unique pharmacological tests not carried out experimentally. Consequently, this model may become a promising auxiliary tool for the discovery of new drugs for the treatment of neurodegenerative diseases. PMID:23855599
NASA Astrophysics Data System (ADS)
Costanzi, Stefano; Tikhonova, Irina G.; Harden, T. Kendall; Jacobson, Kenneth A.
2009-11-01
Accurate in silico models for the quantitative prediction of the activity of G protein-coupled receptor (GPCR) ligands would greatly facilitate the process of drug discovery and development. Several methodologies have been developed based on the properties of the ligands, the direct study of the receptor-ligand interactions, or a combination of both approaches. Ligand-based three-dimensional quantitative structure-activity relationships (3D-QSAR) techniques, not requiring knowledge of the receptor structure, have been historically the first to be applied to the prediction of the activity of GPCR ligands. They are generally endowed with robustness and good ranking ability; however they are highly dependent on training sets. Structure-based techniques generally do not provide the level of accuracy necessary to yield meaningful rankings when applied to GPCR homology models. However, they are essentially independent from training sets and have a sufficient level of accuracy to allow an effective discrimination between binders and nonbinders, thus qualifying as viable lead discovery tools. The combination of ligand and structure-based methodologies in the form of receptor-based 3D-QSAR and ligand and structure-based consensus models results in robust and accurate quantitative predictions. The contribution of the structure-based component to these combined approaches is expected to become more substantial and effective in the future, as more sophisticated scoring functions are developed and more detailed structural information on GPCRs is gathered.
Das, Sreeparna; Mitra, Indrani; Batuta, Shaikh; Niharul Alam, Md; Roy, Kunal; Begum, Naznin Ara
2014-11-01
A series of flavonoid analogues were synthesized and screened for the in vitro antioxidant activity through their ability to quench 1,1-diphenyl-2-picryl hydrazyl (DPPH) radical. The activity of these compounds, measured in comparison to the well-known standard antioxidants (29-32), their precursors (38-42) and other bioactive moieties (38-42) resembling partially the flavone skeleton was analyzed further to develop Quantitative Structure-Activity Relationship (QSAR) models using the Genetic Function Approximation (GFA) technique. Based on the essential structural requirements predicted by the QSAR models, some analogues were designed, synthesized and tested for activity. The predicted and experimental activities of these compounds were well correlated. Flavone analogue 20 was found to be the most potent antioxidant. Copyright © 2014 Elsevier Ltd. All rights reserved.
Ying, Jiali; Zhang, Ting; Tang, Meng
2015-01-01
Metal oxide nanomaterials are widely used in various areas; however, the divergent published toxicology data makes it difficult to determine whether there is a risk associated with exposure to metal oxide nanomaterials. The application of quantitative structure activity relationship (QSAR) modeling in metal oxide nanomaterials toxicity studies can reduce the need for time-consuming and resource-intensive nanotoxicity tests. The nanostructure and inorganic composition of metal oxide nanomaterials makes this approach different from classical QSAR study; this review lists and classifies some structural descriptors, such as size, cation charge, and band gap energy, in recent metal oxide nanomaterials quantitative nanostructure activity relationship (QNAR) studies and discusses the mechanism of metal oxide nanomaterials toxicity based on these descriptors and traditional nanotoxicity tests. PMID:28347085
NASA Astrophysics Data System (ADS)
Masand, Vijay H.; El-Sayed, Nahed N. E.; Bambole, Mukesh U.; Quazi, Syed A.
2018-04-01
Multiple discrete quantitative structure-activity relationships (QSARs) models were constructed for the anticancer activity of α, β-unsaturated carbonyl-based compounds, oxime and oxime ether analogues with a variety of substituents like sbnd Br, sbnd OH, -OMe, etc. at different positions. A big pool of descriptors was considered for QSAR model building. Genetic algorithm (GA), available in QSARINS-Chem, was executed to choose optimum number and set of descriptors to create the multi-linear regression equations for a dataset of sixty-nine compounds. The newly developed five parametric models were subjected to exhaustive internal and external validation along with Y-scrambling using QSARINS-Chem, according to the OECD principles for QSAR model validation. The models were built using easily interpretable descriptors and accepted after confirming statistically robustness with high external predictive ability. The five parametric models were found to have R2 = 0.80 to 0.86, R2ex = 0.75 to 0.84, and CCCex = 0.85 to 0.90. The models indicate that frequency of nitrogen and oxygen atoms separated by five bonds from each other and internal electronic environment of the molecule have correlation with the anticancer activity.
QSAR Study on the anti-tumor activity of levofloxacin-thiadiazole HDACi conjugates
NASA Astrophysics Data System (ADS)
Tang, Ziqiang; Feng, Hui; Chen, Yan; Yue, Wei; Feng, Changjun
2017-12-01
A molecular electronegativity distance vector(M t) based on 13atomic types is used to describe the structures of 19 conjugates(LHCc) of levofloxacin-thiadiazole HDAC inhibitor(HDACi) and related to the anti-tumor activity (M F and P C) of LHCc against MCF-7 and PC-3. The quantitative structure-activity relationships (QSAR) was established by using leaps-and-bounds regression analysis for the anti-tumor activities (M F and P C) of 19 above compounds to MCF-7and PC-3 along with the M t. The correlation coefficients (R 2) and the leave-one-out (LOO) cross validation R cv 2 for the M F and P C models were 0.792 and 0.679; 0.773 and 0.565, respectively. The QSAR models have favorable correlation, as well as robustness and good prediction capability by R 2, F, R cv 2, A IC F IT V IF tests. The results indicate that the molecular structural units: -CHg-(g=1, 2), -NH2, -NH-,-OH, O=, -O-, -S- and -X are main factors which can affect the anti-tumor activity M F and PC bioactivities of these compounds directly.
Vlot, Anna H C; de Witte, Wilhelmus E A; Danhof, Meindert; van der Graaf, Piet H; van Westen, Gerard J P; de Lange, Elizabeth C M
2017-12-04
Selectivity is an important attribute of effective and safe drugs, and prediction of in vivo target and tissue selectivity would likely improve drug development success rates. However, a lack of understanding of the underlying (pharmacological) mechanisms and availability of directly applicable predictive methods complicates the prediction of selectivity. We explore the value of combining physiologically based pharmacokinetic (PBPK) modeling with quantitative structure-activity relationship (QSAR) modeling to predict the influence of the target dissociation constant (K D ) and the target dissociation rate constant on target and tissue selectivity. The K D values of CB1 ligands in the ChEMBL database are predicted by QSAR random forest (RF) modeling for the CB1 receptor and known off-targets (TRPV1, mGlu5, 5-HT1a). Of these CB1 ligands, rimonabant, CP-55940, and Δ 8 -tetrahydrocanabinol, one of the active ingredients of cannabis, were selected for simulations of target occupancy for CB1, TRPV1, mGlu5, and 5-HT1a in three brain regions, to illustrate the principles of the combined PBPK-QSAR modeling. Our combined PBPK and target binding modeling demonstrated that the optimal values of the K D and k off for target and tissue selectivity were dependent on target concentration and tissue distribution kinetics. Interestingly, if the target concentration is high and the perfusion of the target site is low, the optimal K D value is often not the lowest K D value, suggesting that optimization towards high drug-target affinity can decrease the benefit-risk ratio. The presented integrative structure-pharmacokinetic-pharmacodynamic modeling provides an improved understanding of tissue and target selectivity.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Esposito, Emilio Xavier, E-mail: emilio@exeResearch.com; The Chem21 Group, Inc., 1780 Wilson Drive, Lake Forest, IL 60045; Hopfinger, Anton J., E-mail: hopfingr@gmail.com
2015-10-01
Carbon nanotubes have become widely used in a variety of applications including biosensors and drug carriers. Therefore, the issue of carbon nanotube toxicity is increasingly an area of focus and concern. While previous studies have focused on the gross mechanisms of action relating to nanomaterials interacting with biological entities, this study proposes detailed mechanisms of action, relating to nanotoxicity, for a series of decorated (functionalized) carbon nanotube complexes based on previously reported QSAR models. Possible mechanisms of nanotoxicity for six endpoints (bovine serum albumin, carbonic anhydrase, chymotrypsin, hemoglobin along with cell viability and nitrogen oxide production) have been extracted frommore » the corresponding optimized QSAR models. The molecular features relevant to each of the endpoint respective mechanism of action for the decorated nanotubes are also discussed. Based on the molecular information contained within the optimal QSAR models for each nanotoxicity endpoint, either the decorator attached to the nanotube is directly responsible for the expression of a particular activity, irrespective of the decorator's 3D-geometry and independent of the nanotube, or those decorators having structures that place the functional groups of the decorators as far as possible from the nanotube surface most strongly influence the biological activity. These molecular descriptors are further used to hypothesize specific interactions involved in the expression of each of the six biological endpoints. - Highlights: • Proposed toxicity mechanism of action for decorated nanotubes complexes • Discussion of the key molecular features for each endpoint's mechanism of action • Unique mechanisms of action for each of the six biological systems • Hypothesized mechanisms of action based on QSAR/QNAR predictive models.« less
Bairy, Santhosh Kumar; Suneel Kumar, B V S; Bhalla, Joseph Uday Tej; Pramod, A B; Ravikumar, Muttineni
2009-04-01
c-Src kinase play an important role in cell growth and differentiation and its inhibitors can be useful for the treatment of various diseases, including cancer, osteoporosis, and metastatic bone disease. Three dimensional quantitative structure-activity relationship (3D-QSAR) studies were carried out on quinazolin derivatives inhibiting c-Src kinase. Molecular field analysis (MFA) models with four different alignment techniques, namely, GLIDE, GOLD, LIGANDFIT and Least squares based methods were developed. glide based MFA model showed better results (Leave one out cross validation correlation coefficient r(2)(cv) = 0.923 and non-cross validation correlation coefficient r(2)= 0.958) when compared with other models. These results help us to understand the nature of descriptors required for activity of these compounds and thereby provide guidelines to design novel and potent c-Src kinase inhibitors.
Dong, Lili; Feng, Ruirui; Bi, Jiawei; Shen, Shengqiang; Lu, Huizhe; Zhang, Jianjun
2018-03-06
Human sodium-dependent glucose co-transporter 2 (hSGLT2) is a crucial therapeutic target in the treatment of type 2 diabetes. In this study, both comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) were applied to generate three-dimensional quantitative structure-activity relationship (3D-QSAR) models. In the most accurate CoMFA-based and CoMSIA-based QSAR models, the cross-validated coefficients (r 2 cv ) were 0.646 and 0.577, respectively, while the non-cross-validated coefficients (r 2 ) were 0.997 and 0.991, respectively, indicating that both models were reliable. In addition, we constructed a homology model of hSGLT2 in the absence of a crystal structure. Molecular docking was performed to explore the bonding mode of inhibitors to the active site of hSGLT2. Molecular dynamics (MD) simulations and binding free energy calculations using MM-PBSA and MM-GBSA were carried out to further elucidate the interaction mechanism. With regards to binding affinity, we found that hydrogen-bond interactions of Asn51 and Glu75, located in the active site of hSGLT2, with compound 40 were critical. Hydrophobic and electrostatic interactions were shown to enhance activity, in agreement with the results obtained from docking and 3D-QSAR analysis. Our study results shed light on the interaction mode between inhibitors and hSGLT2 and may aid in the development of C-aryl glucoside SGLT2 inhibitors.
Kiwamoto, R; Spenkelink, A; Rietjens, I M C M; Punt, A
2015-01-01
Acyclic α,β-unsaturated aldehydes present in food raise a concern because the α,β-unsaturated aldehyde moiety is considered a structural alert for genotoxicity. However, controversy remains on whether in vivo at realistic dietary exposure DNA adduct formation is significant. The aim of the present study was to develop physiologically based kinetic/dynamic (PBK/D) models to examine dose-dependent detoxification and DNA adduct formation of a group of 18 food-borne acyclic α,β-unsaturated aldehydes without 2- or 3-alkylation, and with no more than one conjugated double bond. Parameters for the PBK/D models were obtained using quantitative structure-activity relationships (QSARs) defined with a training set of six selected aldehydes. Using the QSARs, PBK/D models for the other 12 aldehydes were defined. Results revealed that DNA adduct formation in the liver increases with decreasing bulkiness of the molecule especially due to less efficient detoxification. 2-Propenal (acrolein) was identified to induce the highest DNA adduct levels. At realistic dietary intake, the predicted DNA adduct levels for all aldehydes were two orders of magnitude lower than endogenous background levels observed in disease free human liver, suggesting that for all 18 aldehydes DNA adduct formation is negligible at the relevant levels of dietary intake. The present study provides a proof of principle for the use of QSAR-based PBK/D modelling to facilitate group evaluations and read-across in risk assessment. Copyright © 2014 Elsevier Inc. All rights reserved.
Carrió, Pau; López, Oriol; Sanz, Ferran; Pastor, Manuel
2015-01-01
Computational models based in Quantitative-Structure Activity Relationship (QSAR) methodologies are widely used tools for predicting the biological properties of new compounds. In many instances, such models are used as a routine in the industry (e.g. food, cosmetic or pharmaceutical industry) for the early assessment of the biological properties of new compounds. However, most of the tools currently available for developing QSAR models are not well suited for supporting the whole QSAR model life cycle in production environments. We have developed eTOXlab; an open source modeling framework designed to be used at the core of a self-contained virtual machine that can be easily deployed in production environments, providing predictions as web services. eTOXlab consists on a collection of object-oriented Python modules with methods mapping common tasks of standard modeling workflows. This framework allows building and validating QSAR models as well as predicting the properties of new compounds using either a command line interface or a graphic user interface (GUI). Simple models can be easily generated by setting a few parameters, while more complex models can be implemented by overriding pieces of the original source code. eTOXlab benefits from the object-oriented capabilities of Python for providing high flexibility: any model implemented using eTOXlab inherits the features implemented in the parent model, like common tools and services or the automatic exposure of the models as prediction web services. The particular eTOXlab architecture as a self-contained, portable prediction engine allows building models with confidential information within corporate facilities, which can be safely exported and used for prediction without disclosing the structures of the training series. The software presented here provides full support to the specific needs of users that want to develop, use and maintain predictive models in corporate environments. The technologies used by eTOXlab (web services, VM, object-oriented programming) provide an elegant solution to common practical issues; the system can be installed easily in heterogeneous environments and integrates well with other software. Moreover, the system provides a simple and safe solution for building models with confidential structures that can be shared without disclosing sensitive information.
Quantitative Structure-Cytotoxicity Relationship of Cinnamic Acid Phenetyl Esters.
Uesawa, Yoshihiro; Sakagami, Hiroshi; Okudaira, Noriyuki; Toda, Kazuhiro; Takao, Koichi; Kagaya, Hajime; Sugita, Yoshiaki
2018-02-01
Many phenolic acid phenethyl esters possess diverse biological effects including antioxidant, cytoprotective, anti-inflammation and anti-tumor activities. However, most previous antitumor studies have not considered the cytotoxicity against normal cells. Ten cinnamic acid phenetyl esters were subjected to quantitative structure-activity relationship (QSAR) analysis, based on their cytotoxicity and tumor-specificity, in order to find their new biological activities. Cytotoxicity against four human oral squamous cell carcinoma cell lines and three oral normal mesenchymal cells was determined by the 3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyltetrazolium bromide (MTT) method. Tumor specificity (TS) was evaluated by the ratio of the mean 50% cytotoxic concentration (CC 50 ) against normal oral cells to that against human oral squamous cell carcinoma cell lines. Potency-selectivity expression (PSE) value was calculated by dividing the TS value by CC 50 against tumor cells. Apoptosis markers were detected by western blot analysis. Physicochemical, structural and quantum-chemical parameters were calculated based on the conformations optimized by force-field minimization. Western blot analysis demonstrated that [ 9 ] stimulated the cleavage of caspase-3, suggesting the induction of apoptosis. QSAR analysis demonstrated that TS values were correlated with shape, size and ionization potential. Chemical modification of the lead compound may be a potential choice for designing a new type of anticancer drugs. Copyright© 2018, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved.
Satpathy, Raghunath; Guru, R K; Behera, R; Nayak, B
2015-01-01
Boswellic acid consists of a series of pentacyclic triterpene molecules that are produced by the plant Boswellia serrata. The potential applications of Bowsellic acid for treatment of cancer have been focused here. To predict the property of the bowsellic acid derivatives as anticancer compounds by various computational approaches. In this work, all total 65 derivatives of bowsellic acids from the PubChem database were considered for the study. After energy minimization of the ligands various types of molecular descriptors were computed and corresponding two-dimensional quantitative structure activity relationship (QSAR) models were obtained by taking Andrews coefficient as the dependent variable. Different types of comparative analysis were used for QSAR study are multiple linear regression, partial least squares, support vector machines and artificial neural network. From the study geometrical descriptors shows the highest correlation coefficient, which indicates the binding factor of the compound. To evaluate the anticancer property molecular docking study of six selected ligands based on Andrews affinity were performed with nuclear factor-kappa protein kinase (Protein Data Bank ID 4G3D), which is an established therapeutic target for cancers. Along with QSAR study and docking result, it was predicted that bowsellic acid can also be treated as a potential anticancer compound. Along with QSAR study and docking result, it was predicted that bowsellic acid can also be treated as a potential anticancer compound.
The importance of data curation on QSAR Modeling ...
During the last few decades many QSAR models and tools have been developed at the US EPA, including the widely used EPISuite. During this period the arsenal of computational capabilities supporting cheminformatics has broadened dramatically with multiple software packages. These modern tools allow for more advanced techniques in terms of chemical structure representation and storage, as well as enabling automated data-mining and standardization approaches to examine and fix data quality issues.This presentation will investigate the impact of data curation on the reliability of QSAR models being developed within the EPA‘s National Center for Computational Toxicology. As part of this work we have attempted to disentangle the influence of the quality versus quantity of data based on the Syracuse PHYSPROP database partly used by EPISuite software. We will review our automated approaches to examining key datasets related to the EPISuite data to validate across chemical structure representations (e.g., mol file and SMILES) and identifiers (chemical names and registry numbers) and approaches to standardize data into QSAR-ready formats prior to modeling procedures. Our efforts to quantify and segregate data into quality categories has allowed us to evaluate the resulting models that can be developed from these data slices and to quantify to what extent efforts developing high-quality datasets have the expected pay-off in terms of predicting performance. The most accur
Nolte, Tom M; Peijnenburg, Willie J G M; Hendriks, A Jan; van de Meent, Dik
2017-07-01
After use and disposal of chemical products, many types of polymer particles end up in the aquatic environment with potential toxic effects to primary producers like green algae. In this study, we have developed Quantitative Structure-Activity Relationships (QSARs) for a set of highly structural diverse polymers which are capable to estimate green algae growth inhibition (EC50). The model (N = 43, R 2 = 0.73, RMSE = 0.28) is a regression-based decision tree using one structural descriptor for each of three polymer classes separated based on charge. The QSAR is applicable to linear homo polymers as well as copolymers and does not require information on the size of the polymer particle or underlying core material. Highly branched polymers, non-nitrogen cationic polymers and polymeric surfactants are not included in the model and thus cannot be evaluated. The model works best for cationic and non-ionic polymers for which cellular adsorption, disruption of the cell wall and photosynthesis inhibition were the mechanisms of action. For anionic polymers, specific properties of the polymer and test characteristics need to be known for detailed assessment. The data and QSAR results for anionic polymers, when combined with molecular dynamics simulations indicated that nutrient depletion is likely the dominant mode of toxicity. Nutrient depletion in turn, is determined by the non-linear interplay between polymer charge density and backbone flexibility. Copyright © 2017 Elsevier Ltd. All rights reserved.
Zhu, Hao; Ye, Lin; Richard, Ann; Golbraikh, Alexander; Wright, Fred A.; Rusyn, Ivan; Tropsha, Alexander
2009-01-01
Background Accurate prediction of in vivo toxicity from in vitro testing is a challenging problem. Large public–private consortia have been formed with the goal of improving chemical safety assessment by the means of high-throughput screening. Objective A wealth of available biological data requires new computational approaches to link chemical structure, in vitro data, and potential adverse health effects. Methods and results A database containing experimental cytotoxicity values for in vitro half-maximal inhibitory concentration (IC50) and in vivo rodent median lethal dose (LD50) for more than 300 chemicals was compiled by Zentralstelle zur Erfassung und Bewertung von Ersatz- und Ergaenzungsmethoden zum Tierversuch (ZEBET; National Center for Documentation and Evaluation of Alternative Methods to Animal Experiments). The application of conventional quantitative structure–activity relationship (QSAR) modeling approaches to predict mouse or rat acute LD50 values from chemical descriptors of ZEBET compounds yielded no statistically significant models. The analysis of these data showed no significant correlation between IC50 and LD50. However, a linear IC50 versus LD50 correlation could be established for a fraction of compounds. To capitalize on this observation, we developed a novel two-step modeling approach as follows. First, all chemicals are partitioned into two groups based on the relationship between IC50 and LD50 values: One group comprises compounds with linear IC50 versus LD50 relationships, and another group comprises the remaining compounds. Second, we built conventional binary classification QSAR models to predict the group affiliation based on chemical descriptors only. Third, we developed k-nearest neighbor continuous QSAR models for each subclass to predict LD50 values from chemical descriptors. All models were extensively validated using special protocols. Conclusions The novelty of this modeling approach is that it uses the relationships between in vivo and in vitro data only to inform the initial construction of the hierarchical two-step QSAR models. Models resulting from this approach employ chemical descriptors only for external prediction of acute rodent toxicity. PMID:19672406
A novel integrated framework and improved methodology of computer-aided drug design.
Chen, Calvin Yu-Chian
2013-01-01
Computer-aided drug design (CADD) is a critical initiating step of drug development, but a single model capable of covering all designing aspects remains to be elucidated. Hence, we developed a drug design modeling framework that integrates multiple approaches, including machine learning based quantitative structure-activity relationship (QSAR) analysis, 3D-QSAR, Bayesian network, pharmacophore modeling, and structure-based docking algorithm. Restrictions for each model were defined for improved individual and overall accuracy. An integration method was applied to join the results from each model to minimize bias and errors. In addition, the integrated model adopts both static and dynamic analysis to validate the intermolecular stabilities of the receptor-ligand conformation. The proposed protocol was applied to identifying HER2 inhibitors from traditional Chinese medicine (TCM) as an example for validating our new protocol. Eight potent leads were identified from six TCM sources. A joint validation system comprised of comparative molecular field analysis, comparative molecular similarity indices analysis, and molecular dynamics simulation further characterized the candidates into three potential binding conformations and validated the binding stability of each protein-ligand complex. The ligand pathway was also performed to predict the ligand "in" and "exit" from the binding site. In summary, we propose a novel systematic CADD methodology for the identification, analysis, and characterization of drug-like candidates.
Foltz, Martin; van Buren, Leo; Klaffke, Werner; Duchateau, Guus S M J E
2009-09-01
Selected di- and tripeptides exhibit angiotensin-I converting enzyme (ACE) inhibitory activity in vitro. However, the efficacy in vivo is most likely limited for most peptides due to low bioavailability. The purpose of this study was to identify descriptors of intestinal stability, permeability, and ACE inhibitory activity of dipeptides. A total of 228 dipeptides were synthesized; intestinal stability was obtained by in vitro digestion, intestinal permeability using Caco-2 cells and ACE inhibitory activity by an in vitro assay. Databases were constructed to study the relationship between structure and activity, permeability, and stability. Quantitative structure-activity relationship (QSAR) modeling was performed based on computed models using partial least squares regression based on 400 molecular descriptors. QSAR modeling of dipeptide stability revealed high correlation coefficients (R > 0.65) for models based on Z and X scales. However, amino acid (AA) clustering showed the best results in describing stability of dipeptides. The N-terminal AA residues Asp, Gly, and Pro as well as the C-terminal residues Pro, Ser, Thr, and Asp stabilize dipeptides toward luminal enzymatic peptide hydrolysis. QSAR modeling did not reveal significant correlation models for intestinal permeability. 2D-fingerprint models were identified describing ACE inhibitory activity of dipeptides. The intestinal stability of 12 peptides was predicted. Peptides were synthesized and stability was confirmed in simulated digestion experiments. Based on the results, specific dipeptides can be designed to meet both stability and activity criteria. However, postabsorptive ACE inhibitory activities of dipeptides in vivo are most likely limited due to the very low intestinal permeability of dipeptides.
Structure- and ligand-based structure-activity relationships for a series of inhibitors of aldolase.
Ferreira, Leonardo G; Andricopulo, Adriano D
2012-12-01
Aldolase has emerged as a promising molecular target for the treatment of human African trypanosomiasis. Over the last years, due to the increasing number of patients infected with Trypanosoma brucei, there is an urgent need for new drugs to treat this neglected disease. In the present study, two-dimensional fragment-based quantitative-structure activity relationship (QSAR) models were generated for a series of inhibitors of aldolase. Through the application of leave-one-out and leave-many-out cross-validation procedures, significant correlation coefficients were obtained (r²=0.98 and q²=0.77) as an indication of the statistical internal and external consistency of the models. The best model was employed to predict pKi values for a series of test set compounds, and the predicted values were in good agreement with the experimental results, showing the power of the model for untested compounds. Moreover, structure-based molecular modeling studies were performed to investigate the binding mode of the inhibitors in the active site of the parasitic target enzyme. The structural and QSAR results provided useful molecular information for the design of new aldolase inhibitors within this structural class.
Escher, Beate I; Baumer, Andreas; Bittermann, Kai; Henneberger, Luise; König, Maria; Kühnert, Christin; Klüver, Nils
2017-03-22
The Microtox assay, a bioluminescence inhibition assay with the marine bacterium Aliivibrio fischeri, is one of the most popular bioassays for assessing the cytotoxicity of organic chemicals, mixtures and environmental samples. Most environmental chemicals act as baseline toxicants in this short-term screening assay, which is typically run with only 30 min of exposure duration. Numerous Quantitative Structure-Activity Relationships (QSARs) exist for the Microtox assay for nonpolar and polar narcosis. However, typical water pollutants, which have highly diverse structures covering a wide range of hydrophobicity and speciation from neutral to anionic and cationic, are often outside the applicability domain of these QSARs. To include all types of environmentally relevant organic pollutants we developed a general baseline toxicity QSAR using liposome-water distribution ratios as descriptors. Previous limitations in availability of experimental liposome-water partition constants were overcome by reliable prediction models based on polyparameter linear free energy relationships for neutral chemicals and the COSMOmic model for charged chemicals. With this QSAR and targeted mixture experiments we could demonstrate that ionisable chemicals fall in the applicability domain. Most investigated water pollutants acted as baseline toxicants in this bioassay, with the few outliers identified as uncouplers or reactive toxicants. The main limitation of the Microtox assay is that chemicals with a high melting point and/or high hydrophobicity were outside of the applicability domain because of their low water solubility. We quantitatively derived a solubility cut-off but also demonstrated with mixture experiments that chemicals inactive on their own can contribute to mixture toxicity, which is highly relevant for complex environmental mixtures, where these chemicals may be present at concentrations below the solubility cut-off.
Potter, W R; Henderson, B W; Bellnier, D A; Pandey, R K; Vaughan, L A; Weishaupt, K R; Dougherty, T J
1999-11-01
An open three-compartment pharmacokinetic model was applied to the in vivo quantitative structure-activity relationship (QSAR) data of a homologous series of pyropheophorbide photosensitizers for photodynamic therapy (PDT). The physical model was a lipid compartment sandwiched between two identical aqueous compartments. The first compartment was assumed to clear irreversibly at a rate K0. The measured octanol-water partition coefficients, P(i) (where i is the number of carbons in the alkyl chain) and the clearance rate K0 determined the clearance kinetics of the drugs. Solving the coupled differential equations of the three-compartment model produced clearance kinetics for each of the sensitizers in each of the compartments. The third compartment was found to contain the target of PDT. This series of compounds is quite lipophilic. Therefore these drugs are found mainly in the second compartment. The drug level in the third compartment represents a small fraction of the tissue level and is thus not accessible to direct measurement by extraction. The second compartment of the model accurately predicted the clearance from the serum of mice of the hexyl ether of pyropheophorbide a, one member of this series of compounds. The diffusion and clearance rate constants were those found by fitting the pharmacokinetics of the third compartment to the QSAR data. This result validated the magnitude and mechanistic significance of the rate constants used to model the QSAR data. The PDT response to dose theory was applied to the kinetic behavior of the target compartment drug concentration. This produced a pharmacokinetic-based function connecting PDT response to dose as a function of time postinjection. This mechanistic dose-response function was fitted to published, single time point QSAR data for the pheophorbides. As a result, the PDT target threshold dose together with the predicted QSAR as a function of time postinjection was found.
NASA Astrophysics Data System (ADS)
Andersson, C. David; Hillgren, J. Mikael; Lindgren, Cecilia; Qian, Weixing; Akfur, Christine; Berg, Lotta; Ekström, Fredrik; Linusson, Anna
2015-03-01
Scientific disciplines such as medicinal- and environmental chemistry, pharmacology, and toxicology deal with the questions related to the effects small organic compounds exhort on biological targets and the compounds' physicochemical properties responsible for these effects. A common strategy in this endeavor is to establish structure-activity relationships (SARs). The aim of this work was to illustrate benefits of performing a statistical molecular design (SMD) and proper statistical analysis of the molecules' properties before SAR and quantitative structure-activity relationship (QSAR) analysis. Our SMD followed by synthesis yielded a set of inhibitors of the enzyme acetylcholinesterase (AChE) that had very few inherent dependencies between the substructures in the molecules. If such dependencies exist, they cause severe errors in SAR interpretation and predictions by QSAR-models, and leave a set of molecules less suitable for future decision-making. In our study, SAR- and QSAR models could show which molecular sub-structures and physicochemical features that were advantageous for the AChE inhibition. Finally, the QSAR model was used for the prediction of the inhibition of AChE by an external prediction set of molecules. The accuracy of these predictions was asserted by statistical significance tests and by comparisons to simple but relevant reference models.
Ko, Gene M; Garg, Rajni; Bailey, Barbara A; Kumar, Sunil
2016-01-01
Quantitative structure-activity relationship (QSAR) models can be used as a predictive tool for virtual screening of chemical libraries to identify novel drug candidates. The aims of this paper were to report the results of a study performed for descriptor selection, QSAR model development, and virtual screening for identifying novel HIV-1 integrase inhibitor drug candidates. First, three evolutionary algorithms were compared for descriptor selection: differential evolution-binary particle swarm optimization (DE-BPSO), binary particle swarm optimization, and genetic algorithms. Next, three QSAR models were developed from an ensemble of multiple linear regression, partial least squares, and extremely randomized trees models. A comparison of the performances of three evolutionary algorithms showed that DE-BPSO has a significant improvement over the other two algorithms. QSAR models developed in this study were used in consensus as a predictive tool for virtual screening of the NCI Open Database containing 265,242 compounds to identify potential novel HIV-1 integrase inhibitors. Six compounds were predicted to be highly active (plC50 > 6) by each of the three models. The use of a hybrid evolutionary algorithm (DE-BPSO) for descriptor selection and QSAR model development in drug design is a novel approach. Consensus modeling may provide better predictivity by taking into account a broader range of chemical properties within the data set conducive for inhibition that may be missed by an individual model. The six compounds identified provide novel drug candidate leads in the design of next generation HIV- 1 integrase inhibitors targeting drug resistant mutant viruses.
Pred-Skin: A Fast and Reliable Web Application to Assess Skin Sensitization Effect of Chemicals.
Braga, Rodolpho C; Alves, Vinicius M; Muratov, Eugene N; Strickland, Judy; Kleinstreuer, Nicole; Trospsha, Alexander; Andrade, Carolina Horta
2017-05-22
Chemically induced skin sensitization is a complex immunological disease with a profound impact on quality of life and working ability. Despite some progress in developing alternative methods for assessing the skin sensitization potential of chemical substances, there is no in vitro test that correlates well with human data. Computational QSAR models provide a rapid screening approach and contribute valuable information for the assessment of chemical toxicity. We describe the development of a freely accessible web-based and mobile application for the identification of potential skin sensitizers. The application is based on previously developed binary QSAR models of skin sensitization potential from human (109 compounds) and murine local lymph node assay (LLNA, 515 compounds) data with good external correct classification rate (0.70-0.81 and 0.72-0.84, respectively). We also included a multiclass skin sensitization potency model based on LLNA data (accuracy ranging between 0.73 and 0.76). When a user evaluates a compound in the web app, the outputs are (i) binary predictions of human and murine skin sensitization potential; (ii) multiclass prediction of murine skin sensitization; and (iii) probability maps illustrating the predicted contribution of chemical fragments. The app is the first tool available that incorporates quantitative structure-activity relationship (QSAR) models based on human data as well as multiclass models for LLNA. The Pred-Skin web app version 1.0 is freely available for the web, iOS, and Android (in development) at the LabMol web portal ( http://labmol.com.br/predskin/ ), in the Apple Store, and on Google Play, respectively. We will continuously update the app as new skin sensitization data and respective models become available.
Toxicity Estimation Software Tool (TEST)
The Toxicity Estimation Software Tool (TEST) was developed to allow users to easily estimate the toxicity of chemicals using Quantitative Structure Activity Relationships (QSARs) methodologies. QSARs are mathematical models used to predict measures of toxicity from the physical c...
Zhang, Wen; Qiu, Kai-Xiong; Yu, Fang; Xie, Xiao-Guang; Zhang, Shu-Qun; Chen, Ya-Juan; Xie, Hui-Ding
2017-10-01
B-Raf kinase has been identified as an important target in recent cancer treatment. In order to discover structurally diverse and novel B-Raf inhibitors (BRIs), a virtual screening of BRIs against ZINC database was performed by using a combination of pharmacophore modelling, molecular docking, 3D-QSAR model and binding free energy (ΔG bind ) calculation studies in this work. After the virtual screening, six promising hit compounds were obtained, which were then tested for inhibitory activities of A375 cell lines. In the result, five hit compounds show good biological activities (IC 50 <50μM). The present method of virtual screening can be applied to find structurally diverse inhibitors, and the obtained five structurally diverse compounds are expected to develop novel BRIs. Copyright © 2017. Published by Elsevier Ltd.
Structure-based Understanding of Binding Affinity and Mode ...
The flexible hydrophobic ligand binding pocket (LBP) of estrogen receptor α (ERα) allows the binding of a wide variety of endocrine disruptors. Upon ligand binding, the LBP reshapes around the contours of the ligand and stabilizes the complex by complementary hydrophobic interactions and specific hydrogen bonds with the ligand. Here we present a framework for quantitative analysis of the steric and electronic features of the human ERα-ligand complex using three dimensional (3D) protein-ligand interaction description combined with 3D-QSAR approach. An empirical hydrophobicity density field is applied to account for hydrophobic contacts of ligand within the LBP. The obtained 3D-QSAR model revealed that hydrophobic contacts primarily determine binding affinity and govern binding mode with hydrogen bonds. Several residues of the LBP appear to be quite flexible and adopt a spectrum of conformations in various ERα-ligand complexes, in particular His524. The 3D-QSAR was combined with molecular docking based on three receptor conformations to accommodate receptor flexibility. The model indicates that the dynamic character of the LBP allows accommodation and stable binding of structurally diverse ligands, and proper representation of the protein flexibility is critical for reasonable description of binding of the ligands. Our results provide a quantitative and mechanistic understanding of binding affinity and mode of ERα agonists and antagonists that may be applicab
GTM-Based QSAR Models and Their Applicability Domains.
Gaspar, H A; Baskin, I I; Marcou, G; Horvath, D; Varnek, A
2015-06-01
In this paper we demonstrate that Generative Topographic Mapping (GTM), a machine learning method traditionally used for data visualisation, can be efficiently applied to QSAR modelling using probability distribution functions (PDF) computed in the latent 2-dimensional space. Several different scenarios of the activity assessment were considered: (i) the "activity landscape" approach based on direct use of PDF, (ii) QSAR models involving GTM-generated on descriptors derived from PDF, and, (iii) the k-Nearest Neighbours approach in 2D latent space. Benchmarking calculations were performed on five different datasets: stability constants of metal cations Ca(2+) , Gd(3+) and Lu(3+) complexes with organic ligands in water, aqueous solubility and activity of thrombin inhibitors. It has been shown that the performance of GTM-based regression models is similar to that obtained with some popular machine-learning methods (random forest, k-NN, M5P regression tree and PLS) and ISIDA fragment descriptors. By comparing GTM activity landscapes built both on predicted and experimental activities, we may visually assess the model's performance and identify the areas in the chemical space corresponding to reliable predictions. The applicability domain used in this work is based on data likelihood. Its application has significantly improved the model performances for 4 out of 5 datasets. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Mager, P P; Rothe, H
1990-10-01
Multicollinearity of physicochemical descriptors leads to serious consequences in quantitative structure-activity relationship (QSAR) analysis, such as incorrect estimators and test statistics of regression coefficients of the ordinary least-squares (OLS) model applied usually to QSARs. Beside the diagnosis of the known simple collinearity, principal component regression analysis (PCRA) also allows the diagnosis of various types of multicollinearity. Only if the absolute values of PCRA estimators are order statistics that decrease monotonically, the effects of multicollinearity can be circumvented. Otherwise, obscure phenomena may be observed, such as good data recognition but low predictive model power of a QSAR model.
Rank Order Entropy: why one metric is not enough
McLellan, Margaret R.; Ryan, M. Dominic; Breneman, Curt M.
2011-01-01
The use of Quantitative Structure-Activity Relationship models to address problems in drug discovery has a mixed history, generally resulting from the mis-application of QSAR models that were either poorly constructed or used outside of their domains of applicability. This situation has motivated the development of a variety of model performance metrics (r2, PRESS r2, F-tests, etc) designed to increase user confidence in the validity of QSAR predictions. In a typical workflow scenario, QSAR models are created and validated on training sets of molecules using metrics such as Leave-One-Out or many-fold cross-validation methods that attempt to assess their internal consistency. However, few current validation methods are designed to directly address the stability of QSAR predictions in response to changes in the information content of the training set. Since the main purpose of QSAR is to quickly and accurately estimate a property of interest for an untested set of molecules, it makes sense to have a means at hand to correctly set user expectations of model performance. In fact, the numerical value of a molecular prediction is often less important to the end user than knowing the rank order of that set of molecules according to their predicted endpoint values. Consequently, a means for characterizing the stability of predicted rank order is an important component of predictive QSAR. Unfortunately, none of the many validation metrics currently available directly measure the stability of rank order prediction, making the development of an additional metric that can quantify model stability a high priority. To address this need, this work examines the stabilities of QSAR rank order models created from representative data sets, descriptor sets, and modeling methods that were then assessed using Kendall Tau as a rank order metric, upon which the Shannon Entropy was evaluated as a means of quantifying rank-order stability. Random removal of data from the training set, also known as Data Truncation Analysis (DTA), was used as a means for systematically reducing the information content of each training set while examining both rank order performance and rank order stability in the face of training set data loss. The premise for DTA ROE model evaluation is that the response of a model to incremental loss of training information will be indicative of the quality and sufficiency of its training set, learning method, and descriptor types to cover a particular domain of applicability. This process is termed a “rank order entropy” evaluation, or ROE. By analogy with information theory, an unstable rank order model displays a high level of implicit entropy, while a QSAR rank order model which remains nearly unchanged during training set reductions would show low entropy. In this work, the ROE metric was applied to 71 data sets of different sizes, and was found to reveal more information about the behavior of the models than traditional metrics alone. Stable, or consistently performing models, did not necessarily predict rank order well. Models that performed well in rank order did not necessarily perform well in traditional metrics. In the end, it was shown that ROE metrics suggested that some QSAR models that are typically used should be discarded. ROE evaluation helps to discern which combinations of data set, descriptor set, and modeling methods lead to usable models in prioritization schemes, and provides confidence in the use of a particular model within a specific domain of applicability. PMID:21875058
Bhargava, Dinesh; Karthikeyan, C; Moorthy, N S H N; Trivedi, Piyush
2009-09-01
QSAR study was carried out for a series of piperazinyl phenylalanine derivatives exhibiting VLA-4/VCAM-1 inhibitory activity to find out the structural features responsible for the biological activity. The QSAR study was carried out on V-life Molecular Design Suite software and the derived best QSAR model by partial least square (forward) regression method showed 85.67% variation in biological activity. The statistically significant model with high correlation coefficient (r2=0.85) was selected for further study and the resulted validation parameters of the model, crossed squared correlation coefficient (q2=0.76 and pred_r2=0.42) show the model has good predictive ability. The model showed that the parameters SaaNEindex, SsClcount slogP,and 4PathCount are highly correlated with VLA-4/VCAM-1 inhibitory activity of piperazinyl phenylalanine derivatives. The result of the study suggests that the chlorine atoms in the molecule and fourth order fragmentation patterns in the molecular skeleton favour VLA-4/VCAM-1 inhibition shown by the title compounds whereas lipophilicity and nitrogen bonded to aromatic bond are not conducive for VLA-4/VCAM-1 inhibitory activity.
Quantitative structure-activity relationships for organophosphates binding to acetylcholinesterase.
Ruark, Christopher D; Hack, C Eric; Robinson, Peter J; Anderson, Paul E; Gearhart, Jeffery M
2013-02-01
Organophosphates are a group of pesticides and chemical warfare nerve agents that inhibit acetylcholinesterase, the enzyme responsible for hydrolysis of the excitatory neurotransmitter acetylcholine. Numerous structural variants exist for this chemical class, and data regarding their toxicity can be difficult to obtain in a timely fashion. At the same time, their use as pesticides and military weapons is widespread, which presents a major concern and challenge in evaluating human toxicity. To address this concern, a quantitative structure-activity relationship (QSAR) was developed to predict pentavalent organophosphate oxon human acetylcholinesterase bimolecular rate constants. A database of 278 three-dimensional structures and their bimolecular rates was developed from 15 peer-reviewed publications. A database of simplified molecular input line entry notations and their respective acetylcholinesterase bimolecular rate constants are listed in Supplementary Material, Table I. The database was quite diverse, spanning 7 log units of activity. In order to describe their structure, 675 molecular descriptors were calculated using AMPAC 8.0 and CODESSA 2.7.10. Orthogonal projection to latent structures regression, bootstrap leave-random-many-out cross-validation and y-randomization were used to develop an externally validated consensus QSAR model. The domain of applicability was assessed by the William's plot. Six external compounds were outside the warning leverage indicating potential model extrapolation. A number of compounds had residuals >2 or <-2, indicating potential outliers or activity cliffs. The results show that the HOMO-LUMO energy gap contributed most significantly to the binding affinity. A mean training R (2) of 0.80, a mean test set R (2) of 0.76 and a consensus external test set R (2) of 0.66 were achieved using the QSAR. The training and external test set RMSE values were found to be 0.76 and 0.88. The results suggest that this QSAR model can be used in physiologically based pharmacokinetic/pharmacodynamic models of organophosphate toxicity to determine the rate of acetylcholinesterase inhibition.
SEDIMENT-ASSOCIATED REACTIONS OF AROMATIC AMINES: QSAR DEVELOPMENT
Despite the common occurrence of the aromatic amine functional group in environmental contaminants, few quantitative structure-activity relationships (QSARs) have been developed to predict sorption kinetics for aromatic amines in natural soils and sediments. Towards the goal of d...
OPERA models for predicting physicochemical properties and environmental fate endpoints.
Mansouri, Kamel; Grulke, Chris M; Judson, Richard S; Williams, Antony J
2018-03-08
The collection of chemical structure information and associated experimental data for quantitative structure-activity/property relationship (QSAR/QSPR) modeling is facilitated by an increasing number of public databases containing large amounts of useful data. However, the performance of QSAR models highly depends on the quality of the data and modeling methodology used. This study aims to develop robust QSAR/QSPR models for chemical properties of environmental interest that can be used for regulatory purposes. This study primarily uses data from the publicly available PHYSPROP database consisting of a set of 13 common physicochemical and environmental fate properties. These datasets have undergone extensive curation using an automated workflow to select only high-quality data, and the chemical structures were standardized prior to calculation of the molecular descriptors. The modeling procedure was developed based on the five Organization for Economic Cooperation and Development (OECD) principles for QSAR models. A weighted k-nearest neighbor approach was adopted using a minimum number of required descriptors calculated using PaDEL, an open-source software. The genetic algorithms selected only the most pertinent and mechanistically interpretable descriptors (2-15, with an average of 11 descriptors). The sizes of the modeled datasets varied from 150 chemicals for biodegradability half-life to 14,050 chemicals for logP, with an average of 3222 chemicals across all endpoints. The optimal models were built on randomly selected training sets (75%) and validated using fivefold cross-validation (CV) and test sets (25%). The CV Q 2 of the models varied from 0.72 to 0.95, with an average of 0.86 and an R 2 test value from 0.71 to 0.96, with an average of 0.82. Modeling and performance details are described in QSAR model reporting format and were validated by the European Commission's Joint Research Center to be OECD compliant. All models are freely available as an open-source, command-line application called OPEn structure-activity/property Relationship App (OPERA). OPERA models were applied to more than 750,000 chemicals to produce freely available predicted data on the U.S. Environmental Protection Agency's CompTox Chemistry Dashboard.
Grid Based Technologies for in silico Screening and Drug Design.
Potemkin, Vladimir; Grishina, Maria
2018-03-08
Various techniques for rational drug design are presented in the paper. The methods are based on a substitution of antipharmacophore atoms of the molecules of training dataset by new atoms and/or group of atoms increasing the atomic bioactivity increments obtained at a SAR study. Furthermore, a design methodology based on the genetic algorithm DesPot for discrete optimization and generation of new drug candidate structures is described. Additionally, wide spectra of SAR approaches (3D/4D QSAR interior and exterior-based methods - BiS, CiS, ConGO, CoMIn, high-quality docking method - ReDock) using MERA force field and/or AlteQ quantum chemical method for correct prognosis of bioactivity and bioactive probability is described. The design methods are implemented now at www.chemosophia.com web-site for online computational services. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
A review on principles, theory and practices of 2D-QSAR.
Roy, Kunal; Das, Rudra Narayan
2014-01-01
The central axiom of science purports the explanation of every natural phenomenon using all possible logics coming from pure as well as mixed scientific background. The quantitative structure-activity relationship (QSAR) analysis is a study correlating the behavioral manifestation of compounds with their structures employing the interdisciplinary knowledge of chemistry, mathematics, biology as well as physics. Several studies have attempted to mathematically correlate the chemistry and property (physicochemical/ biological/toxicological) of molecules using various computationally or experimentally derived quantitative parameters termed as descriptors. The dimensionality of the descriptors depends on the type of algorithm employed and defines the nature of QSAR analysis. The most interesting feature of predictive QSAR models is that the behavior of any new or even hypothesized molecule can be predicted by the use of the mathematical equations. The phrase "2D-QSAR" signifies development of QSAR models using 2D-descriptors. Such predictor variables are the most widely practised ones because of their simple and direct mathematical algorithmic nature involving no time consuming energy computations and having reproducible operability. 2D-descriptors have a deluge of contributions in extracting chemical attributes and they are also capable of representing the 3D molecular features to some extent; although in no case they should be considered as the ultimate one, since they often suffer from the problems of intercorrelation, insufficient chemical information as well as lack of interpretation. However, by following rational approaches, novel 2D-descriptors may be developed to obviate various existing problems giving potential 2D-QSAR equations, thereby solving the innumerable chemical mysteries still unexplored.
Muddukrishna, B S; Pai, Vasudev; Lobo, Richard; Pai, Aravinda
2017-11-22
In the present study, five important binary fingerprinting techniques were used to model novel flavones for the selective inhibition of Tankyrase I. From the fingerprints used: the fingerprint atom pairs resulted in a statistically significant 2D QSAR model using a kernel-based partial least square regression method. This model indicates that the presence of electron-donating groups positively contributes to activity, whereas the presence of electron withdrawing groups negatively contributes to activity. This model could be used to develop more potent as well as selective analogues for the inhibition of Tankyrase I. Schematic representation of 2D QSAR work flow.
Qidwai, Tabish; Yadav, Dharmendra K; Khan, Feroz; Dhawan, Sangeeta; Bhakuni, R S
2012-01-01
This work presents the development of quantitative structure activity relationship (QSAR) model to predict the antimalarial activity of artemisinin derivatives. The structures of the molecules are represented by chemical descriptors that encode topological, geometric, and electronic structure features. Screening through QSAR model suggested that compounds A24, A24a, A53, A54, A62 and A64 possess significant antimalarial activity. Linear model is developed by the multiple linear regression method to link structures to their reported antimalarial activity. The correlation in terms of regression coefficient (r(2)) was 0.90 and prediction accuracy of model in terms of cross validation regression coefficient (rCV(2)) was 0.82. This study indicates that chemical properties viz., atom count (all atoms), connectivity index (order 1, standard), ring count (all rings), shape index (basic kappa, order 2), and solvent accessibility surface area are well correlated with antimalarial activity. The docking study showed high binding affinity of predicted active compounds against antimalarial target Plasmepsins (Plm-II). Further studies for oral bioavailability, ADMET and toxicity risk assessment suggest that compound A24, A24a, A53, A54, A62 and A64 exhibits marked antimalarial activity comparable to standard antimalarial drugs. Later one of the predicted active compound A64 was chemically synthesized, structure elucidated by NMR and in vivo tested in multidrug resistant strain of Plasmodium yoelii nigeriensis infected mice. The experimental results obtained agreed well with the predicted values.
Szaleniec, Maciej
2012-01-01
Artificial Neural Networks (ANNs) are introduced as robust and versatile tools in quantitative structure-activity relationship (QSAR) modeling. Their application to the modeling of enzyme reactivity is discussed, along with methodological issues. Methods of input variable selection, optimization of network internal structure, data set division and model validation are discussed. The application of ANNs in the modeling of enzyme activity over the last 20 years is briefly recounted. The discussed methodology is exemplified by the case of ethylbenzene dehydrogenase (EBDH). Intelligent Problem Solver and genetic algorithms are applied for input vector selection, whereas k-means clustering is used to partition the data into training and test cases. The obtained models exhibit high correlation between the predicted and experimental values (R(2) > 0.9). Sensitivity analyses and study of the response curves are used as tools for the physicochemical interpretation of the models in terms of the EBDH reaction mechanism. Neural networks are shown to be a versatile tool for the construction of robust QSAR models that can be applied to a range of aspects important in drug design and the prediction of biological activity.
Patlewicz, Grace Y; Basketter, David A; Pease, Camilla K Smith; Wilson, Karen; Wright, Zoe M; Roberts, David W; Bernard, Guillaume; Arnau, Elena Giménez; Lepoittevin, Jean-Pierre
2004-02-01
Fragrance substances represent a very diverse group of chemicals; a proportion of them are associated with the ability to cause allergic reactions in the skin. Efforts to find substitute materials are hindered by the need to undertake animal testing for determining both skin sensitization hazard and potency. One strategy to avoid such testing is through an understanding of the relationships between chemical structure and skin sensitization, so-called structure-activity relationships. In recent work, we evaluated 2 groups of fragrance chemicals -- saturated aldehydes and alpha,beta-unsaturated aldehydes. Simple quantitative structure-activity relationship (QSAR) models relating the EC3 values [derived from the local lymph node assay (LLNA)] to physicochemical properties were developed for both sets of aldehydes. In the current study, we evaluated an additional group of carbonyl-containing compounds to test the predictive power of the developed QSARs and to extend their scope. The QSAR models were used to predict EC3 values of 10 newly selected compounds. Local lymph node assay data generated for these compounds demonstrated that the original QSARs were fairly accurate, but still required improvement. Development of these QSAR models has provided us with a better understanding of the potential mechanisms of action for aldehydes, and hence how to avoid or limit allergy. Knowledge generated from this work is being incorporated into new/improved rules for sensitization in the expert toxicity prediction system, deductive estimation of risk from existing knowledge (DEREK).
Anibamine and its Analogues as Novel Anti Prostate Cancer Agents
2010-06-01
PC- 3, and DU-145 has been conducted continuously to evaluate the efficacy of more ligands. A molecular modeling study (3D QSAR ) protocol has been... Toxicology at Virginia Commonwealth University. Both the PI’s lab and Dr. 10 Selley’s lab have fully functional binding assay facility. The assays is...pursue the docking study and 3D QSAR study. 5.3 3D QSAR (Quantitative Structure-Activity Relationships) Study As proposed in our proposal, we will
Toxicity Evaluation of Engineered Nanomaterials: Risk Evaluation Tools (Phase 3 Studies)
2012-01-01
report. The second modeling approach was on quantitative structure activity relationships ( QSARs ). A manuscript entitled “Connecting the dots: Towards...expands rapidly. We proposed two types of mechanisms of toxic action supported by the nano- QSAR model , which collectively govern the toxicity of the...interpretative nano- QSAR model describing toxicity of 18 nano-metal oxides to a HaCaT cell line as a model for dermal exposure. In result, by the comparison of
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gordon, R.K.; Breuer, E.; Padilla, F.N.
1987-05-01
QSAR between biological activities and molecular-chemical properties were investigated to aid in designing more effective and potent antimuscarinic pharmacophores. A molecular modeling program was used to calculate geometrical and topological values of a series of DPP pharmacophores. The newly synthesized pharmacophores were tested for their antagonist activities by: (1) inhibition of (N-methyl-/sup 3/H)scopolamine binding assay to the muscarinic receptors of N4TG1 neuroblastoma cells; (2) blocking of acetylcholine-induced contraction of guinea pig ileum; and (3) inhibition of carbachol-induced ..cap alpha..-amylase release from rat pancreas. The differences in the log of these biological activities were directly and significantly related to the distancesmore » between the carbonyl oxygen of the DPP and the quaternary nitrogen of the modified pharmacophores. The biological activities, while depending on each particular assay, varied between three and four logs of activity. The charge remained the same in all the pharmacophores. There were no QSAR correlations between molecular volume, molecular connectivity, or principle moments and their antagonistic activities, although multivariate QSAR was not employed. Thus, based on distance geometry, potent muscarinic pharmacophores can be predicted.« less
Comparison of Global and Mode of Action-Based Models for Aquatic Toxicity
The ability to estimate aquatic toxicity for a wide variety of chemicals is a critical need for ecological risk assessment and chemical regulation. The consensus in the literature is that mode of action (MOA) based QSAR (Quantitative Structure Activity Relationship) models yield ...
Esposito, Emilio Xavier; Hopfinger, Anton J; Shao, Chi-Yu; Su, Bo-Han; Chen, Sing-Zuo; Tseng, Yufeng Jane
2015-10-01
Carbon nanotubes have become widely used in a variety of applications including biosensors and drug carriers. Therefore, the issue of carbon nanotube toxicity is increasingly an area of focus and concern. While previous studies have focused on the gross mechanisms of action relating to nanomaterials interacting with biological entities, this study proposes detailed mechanisms of action, relating to nanotoxicity, for a series of decorated (functionalized) carbon nanotube complexes based on previously reported QSAR models. Possible mechanisms of nanotoxicity for six endpoints (bovine serum albumin, carbonic anhydrase, chymotrypsin, hemoglobin along with cell viability and nitrogen oxide production) have been extracted from the corresponding optimized QSAR models. The molecular features relevant to each of the endpoint respective mechanism of action for the decorated nanotubes are also discussed. Based on the molecular information contained within the optimal QSAR models for each nanotoxicity endpoint, either the decorator attached to the nanotube is directly responsible for the expression of a particular activity, irrespective of the decorator's 3D-geometry and independent of the nanotube, or those decorators having structures that place the functional groups of the decorators as far as possible from the nanotube surface most strongly influence the biological activity. These molecular descriptors are further used to hypothesize specific interactions involved in the expression of each of the six biological endpoints. Copyright © 2015 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Pandit, Amit; Sengupta, Sagnik; Krishnan, Mena Asha; Reddy, Ramesh B.; Sharma, Rajesh; Venkatesh, Chelvam
2018-05-01
Prostate Specific Membrane Antigen (PSMA) or Glutamate carboxypeptidase II (GCPII) has been identified as an important target in diagnosis and therapy of prostate cancer. Among several types of inhibitors, urea based inhibitors are the most common and widely employed in preclinical and clinical studies. Computational studies have been carried out to uncover active sites and interaction of PSMA inhibitors with the protein by modifying the core structure of the ligand. Analysis of the literature, however, show lack of 3-D quantitative structure activity relationship (QSAR) and molecular dynamics based molecular docking study to identify structural modifications responsible for better GCPII inhibitory activity. The present study aims to fulfil this gap by analysing well known PSMA inhibitors reported in the literature with known experimental PSMA inhibition constants. Also in order to validate the in silico study, a new GCPII inhibitor 7 was designed, synthesized and experimental PSMA enzyme inhibition was evaluated by using freshly isolated PSMA protein from human cancer cell line derived from lymph node, LNCaP. 3D-QSAR CoMFA models on 58 urea based GCPII inhibitors were generated, and the best correlation was obtained in Gast-Huck charge assigning method with q2, r2 and predictive r2 values as 0.592, 0.995 and 0.842 respectively. Moreover, steric, electrostatic, and hydrogen bond donor field contribution analysis provided best statistical values from CoMSIA model (q2, r2 and predictive r2 as 0.527, 0.981 and 0.713 respectively). Contour maps study revealed that electrostatic field contribution is the major factor for discovering better binding affinity ligands. Further molecular dynamic assisted molecular docking was also performed on GCPII receptor (PDB ID 4NGM) and most active GCPII inhibitor, DCIBzL. 4NGM co-crystallised ligand, JB7 was used to validate the docking procedure and the amino acid interactions present in JB7 are compared with DCIBzL. The results suggest that Arg210, Asn257, Gly518, Tyr552, Lys699, and Tyr700 amino acid residues may play a crucial role in GCPII inhibition. Molecular Dynamics Simulation provides information about docked pose stability of DCIBzL. By combination of CoMFA-CoMSIA field analysis and docking interaction analysis studies, conclusive SAR was generated for urea based derivatives based on which GCPII inhibitor 7 was designed and chemically synthesized in our laboratory. Evaluation of GCPII inhibitory activity of 7 by performing NAALADase assay provided IC50 value of 113 nM which is in close agreement with in silico predicted value (119 nM). Thus we have successfully validated our 3D-QSAR and molecular docking based designing of GCPII inhibitors methodology through biological experiments. This conclusive SAR would be helpful to generate novel and more potent GCPII inhibitors for drug delivery applications.
NASA Astrophysics Data System (ADS)
Wang, Fangfang; Zhou, Bo
2018-04-01
Protein tyrosine phosphatase 1B (PTP1B) is an intracellular non-receptor phosphatase that is implicated in signal transduction of insulin and leptin pathways, thus PTP1B is considered as potential target for treating type II diabetes and obesity. The present article is an attempt to formulate the three-dimensional quantitative structure-activity relationship (3D-QSAR) modeling of a series of compounds possessing PTP1B inhibitory activities using comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) techniques. The optimum template ligand-based models are statistically significant with great CoMFA (R2cv = 0.600, R2pred = 0.6760) and CoMSIA (R2cv = 0.624, R2pred = 0.8068) values. Molecular docking was employed to elucidate the inhibitory mechanisms of this series of compounds against PTP1B. In addition, the CoMFA and CoMSIA field contour maps agree well with the structural characteristics of the binding pocket of PTP1B active site. The knowledge of structure-activity relationship and ligand-receptor interactions from 3D-QSAR model and molecular docking will be useful for better understanding the mechanism of ligand-receptor interaction and facilitating development of novel compounds as potent PTP1B inhibitors.
Toropova, Alla P; Schultz, Terry W; Toropov, Andrey A
2016-03-01
Data on toxicity toward Tetrahymena pyriformis is indicator of applicability of a substance in ecologic and pharmaceutical aspects. Quantitative structure-activity relationships (QSARs) between the molecular structure of benzene derivatives and toxicity toward T. pyriformis (expressed as the negative logarithms of the population growth inhibition dose, mmol/L) are established. The available data were randomly distributed three times into the visible training and calibration sets, and invisible validation sets. The statistical characteristics for the validation set are the following: r(2)=0.8179 and s=0.338 (first distribution); r(2)=0.8682 and s=0.341 (second distribution); r(2)=0.8435 and s=0.323 (third distribution). These models are built up using only information on the molecular structure: no data on physicochemical parameters, 3D features of the molecular structure and quantum mechanics descriptors are involved in the modeling process. Copyright © 2016 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Ahmed, Nafees; Anwar, Sirajudheen; Thet Htar, Thet
2017-06-01
The Plasmodium falciparum Lactate Dehydrogenase enzyme (PfLDH) catalyzes inter-conversion of pyruvate to lactate during glycolysis producing the energy required for parasitic growth. The PfLDH has been studied as a potential molecular target for development of anti-malarial agents. In an attempt to find the potent inhibitor of PfLDH, we have used Discovery studio to perform molecular docking in the active binding pocket of PfLDH by CDOCKER, followed by three-dimensional quantitative structure-activity relationship (3D-QSAR) studies of tricyclic guanidine batzelladine compounds, which were previously synthesized in our laboratory. Docking studies showed that there is a very strong correlation between in silico and in vitro results. Based on docking results, a highly predictive 3D-QSAR model was developed with q2 of 0.516. The model has predicted r2 of 0.91 showing that predicted IC50 values are in good agreement with experimental IC50 values. The results obtained from this study revealed the developed model can be used to design new anti-malarial compounds based on tricyclic guanidine derivatives and to predict activities of new inhibitors.
Ahmed, Nafees; Anwar, Sirajudheen; Thet Htar, Thet
2017-01-01
The Plasmodium falciparum Lactate Dehydrogenase enzyme ( Pf LDH) catalyzes inter-conversion of pyruvate to lactate during glycolysis producing the energy required for parasitic growth. The Pf LDH has been studied as a potential molecular target for development of anti-malarial agents. In an attempt to find the potent inhibitor of Pf LDH, we have used Discovery studio to perform molecular docking in the active binding pocket of Pf LDH by CDOCKER, followed by three-dimensional quantitative structure-activity relationship (3D-QSAR) studies of tricyclic guanidine batzelladine compounds, which were previously synthesized in our laboratory. Docking studies showed that there is a very strong correlation between in silico and in vitro results. Based on docking results, a highly predictive 3D-QSAR model was developed with q 2 of 0.516. The model has predicted r 2 of 0.91 showing that predicted IC 50 values are in good agreement with experimental IC 50 values. The results obtained from this study revealed the developed model can be used to design new anti-malarial compounds based on tricyclic guanidine derivatives and to predict activities of new inhibitors.
Ahmed, Nafees; Anwar, Sirajudheen; Thet Htar, Thet
2017-01-01
The Plasmodium falciparum Lactate Dehydrogenase enzyme (PfLDH) catalyzes inter-conversion of pyruvate to lactate during glycolysis producing the energy required for parasitic growth. The PfLDH has been studied as a potential molecular target for development of anti-malarial agents. In an attempt to find the potent inhibitor of PfLDH, we have used Discovery studio to perform molecular docking in the active binding pocket of PfLDH by CDOCKER, followed by three-dimensional quantitative structure-activity relationship (3D-QSAR) studies of tricyclic guanidine batzelladine compounds, which were previously synthesized in our laboratory. Docking studies showed that there is a very strong correlation between in silico and in vitro results. Based on docking results, a highly predictive 3D-QSAR model was developed with q2 of 0.516. The model has predicted r2 of 0.91 showing that predicted IC50 values are in good agreement with experimental IC50 values. The results obtained from this study revealed the developed model can be used to design new anti-malarial compounds based on tricyclic guanidine derivatives and to predict activities of new inhibitors. PMID:28664157
Fish acute toxicity syndromes and their use in the QSAR approach to hazard assessment.
McKim, J M; Bradbury, S P; Niemi, G J
1987-01-01
Implementation of the Toxic Substances Control Act of 1977 creates the need to reliably establish testing priorities because laboratory resources are limited and the number of industrial chemicals requiring evaluation is overwhelming. The use of quantitative structure activity relationship (QSAR) models as rapid and predictive screening tools to select more potentially hazardous chemicals for in-depth laboratory evaluation has been proposed. Further implementation and refinement of quantitative structure-toxicity relationships in aquatic toxicology and hazard assessment requires the development of a "mode-of-action" database. With such a database, a qualitative structure-activity relationship can be formulated to assign the proper mode of action, and respective QSAR, to a given chemical structure. In this review, the development of fish acute toxicity syndromes (FATS), which are toxic-response sets based on various behavioral and physiological-biochemical measurements, and their projected use in the mode-of-action database are outlined. Using behavioral parameters monitored in the fathead minnow during acute toxicity testing, FATS associated with acetylcholinesterase (AChE) inhibitors and narcotics could be reliably predicted. However, compounds classified as oxidative phosphorylation uncouplers or stimulants could not be resolved. Refinement of this approach by using respiratory-cardiovascular responses in the rainbow trout, enabled FATS associated with AChE inhibitors, convulsants, narcotics, respiratory blockers, respiratory membrane irritants, and uncouplers to be correctly predicted. PMID:3297660
Ekuase, E.J.; van ’t Erve, T.J.; Rahaman, A.; Robertson, L.W.; Duffel, M.W.; Luthe, G.
2015-01-01
Determining the relationships between the structures of substrates and inhibitors and their interactions with drug-metabolizing enzymes is of prime importance in predicting the toxic potential of new and legacy xenobiotics. Traditionally, quantitative structure activity relationship (QSAR) studies are performed with many distinct compounds. Based on the chemical properties of the tested compounds, complex relationships can be established so that models can be developed to predict toxicity of novel compounds. In this study, the use of fluorinated analogues as supplemental QSAR compounds was investigated. Substituting fluorine induces changes in electronic and steric properties of the substrate without substantially changing the chemical backbone of the substrate. In vitro assays were performed using purified human cytosolic sulfotransferase hSULT2A1 as a model enzyme. A mono-hydroxylated polychlorinated biphenyl (4-OH PCB 14) and its four possible mono-fluoro analogues were used as test compounds. Remarkable similarities were found between this approach and previously published QSAR studies for hSULT2A1. Both studies implicate the importance of dipole moment and dihedral angle as being important to PCB structure in respect to being substrates for hSULT2A1. We conclude that mono-fluorinated analogues of a target substrate can be a useful tool to study the structure activity relationships for enzyme specificity. PMID:26165989
A MODE-OF-ACTION-BASED QSAR APPROACH TO IMPROVE UNDERSTANDING OF DEVELOPMENTAL TOXICITY
QSAR models of developmental toxicity (devtox) have met with limited regulatory acceptance due to the use of ill-defined endpoints, lack of biological interpretability, and poor model performance. More generally, the lack of biological inference of many QSAR models is often due t...
Experimental Errors in QSAR Modeling Sets: What We Can Do and What We Cannot Do.
Zhao, Linlin; Wang, Wenyi; Sedykh, Alexander; Zhu, Hao
2017-06-30
Numerous chemical data sets have become available for quantitative structure-activity relationship (QSAR) modeling studies. However, the quality of different data sources may be different based on the nature of experimental protocols. Therefore, potential experimental errors in the modeling sets may lead to the development of poor QSAR models and further affect the predictions of new compounds. In this study, we explored the relationship between the ratio of questionable data in the modeling sets, which was obtained by simulating experimental errors, and the QSAR modeling performance. To this end, we used eight data sets (four continuous endpoints and four categorical endpoints) that have been extensively curated both in-house and by our collaborators to create over 1800 various QSAR models. Each data set was duplicated to create several new modeling sets with different ratios of simulated experimental errors (i.e., randomizing the activities of part of the compounds) in the modeling process. A fivefold cross-validation process was used to evaluate the modeling performance, which deteriorates when the ratio of experimental errors increases. All of the resulting models were also used to predict external sets of new compounds, which were excluded at the beginning of the modeling process. The modeling results showed that the compounds with relatively large prediction errors in cross-validation processes are likely to be those with simulated experimental errors. However, after removing a certain number of compounds with large prediction errors in the cross-validation process, the external predictions of new compounds did not show improvement. Our conclusion is that the QSAR predictions, especially consensus predictions, can identify compounds with potential experimental errors. But removing those compounds by the cross-validation procedure is not a reasonable means to improve model predictivity due to overfitting.
Merging Applicability Domains for in Silico Assessment of Chemical Mutagenicity
2014-02-04
molecular fingerprints as descriptors for developing quantitative structure−activity relationship ( QSAR ) models and defining applicability domains with...used to define and quantify an applicability domain for either method. The importance of using applicability domains in QSAR modeling cannot be...domain from roughly 80% to 90%. These results indicated that the proposed QSAR protocol constituted a highly robust chemical mutagenicity prediction
CURRENT PRACTICES IN QSAR DEVELOPMENT AND APPLICATIONS
Current Practices in QSAR Development and Applications
Although it is commonly assumed that the structure and properties of a single chemical determines its activity in a particular biological system, it is only through study of how biological activity varies with changes...
Rastija, Vesna; Agić, Dejan; Tomiš, Sanja; Nikolič, Sonja; Hranjec, Marijana; Grace, Karminski-Zamola; Abramić, Marija
2015-01-01
A molecular modeling study is performed on series of benzimidazol-based inhibitors of human dipeptidyl peptidase III (DPP III). An eight novel compounds were synthesized in excellent yields using green chemistry approach. This study is aimed to elucidate the structural features of benzimidazole derivatives required for antagonism of human DPP III activity using Quantitative Structure-Activity Relationship (QSAR) analysis, and to understand the mechanism of one of the most potent inhibitor binding into the active site of this enzyme, by molecular dynamics (MD) simulations. The best model obtained includes S3K and RDF045m descriptors which have explained 89.4 % of inhibitory activity. Depicted moiety for strong inhibition activity matches to the structure of most potent compound. MD simulation has revealed importance of imidazolinyl and phenyl groups in the mechanism of binding into the active site of human DPP III.
Netzeva, Tatiana I; Gallegos Saliner, Ana; Worth, Andrew P
2006-05-01
The aim of the present study was to illustrate that it is possible and relatively straightforward to compare the domain of applicability of a quantitative structure-activity relationship (QSAR) model in terms of its physicochemical descriptors with a large inventory of chemicals. A training set of 105 chemicals with data for relative estrogenic gene activation, obtained in a recombinant yeast assay, was used to develop the QSAR. A binary classification model for predicting active versus inactive chemicals was developed using classification tree analysis and two descriptors with a clear physicochemical meaning (octanol-water partition coefficient, or log Kow, and the number of hydrogen bond donors, or n(Hdon)). The model demonstrated a high overall accuracy (90.5%), with a sensitivity of 95.9% and a specificity of 78.1%. The robustness of the model was evaluated using the leave-many-out cross-validation technique, whereas the predictivity was assessed using an artificial external test set composed of 12 compounds. The domain of the QSAR training set was compared with the chemical space covered by the European Inventory of Existing Commercial Chemical Substances (EINECS), as incorporated in the CDB-EC software, in the log Kow / n(Hdon) plane. The results showed that the training set and, therefore, the applicability domain of the QSAR model covers a small part of the physicochemical domain of the inventory, even though a simple method for defining the applicability domain (ranges in the descriptor space) was used. However, a large number of compounds are located within the narrow descriptor window.
Computational methods in drug discovery
Leelananda, Sumudu P
2016-01-01
The process for drug discovery and development is challenging, time consuming and expensive. Computer-aided drug discovery (CADD) tools can act as a virtual shortcut, assisting in the expedition of this long process and potentially reducing the cost of research and development. Today CADD has become an effective and indispensable tool in therapeutic development. The human genome project has made available a substantial amount of sequence data that can be used in various drug discovery projects. Additionally, increasing knowledge of biological structures, as well as increasing computer power have made it possible to use computational methods effectively in various phases of the drug discovery and development pipeline. The importance of in silico tools is greater than ever before and has advanced pharmaceutical research. Here we present an overview of computational methods used in different facets of drug discovery and highlight some of the recent successes. In this review, both structure-based and ligand-based drug discovery methods are discussed. Advances in virtual high-throughput screening, protein structure prediction methods, protein–ligand docking, pharmacophore modeling and QSAR techniques are reviewed. PMID:28144341
Computational methods in drug discovery.
Leelananda, Sumudu P; Lindert, Steffen
2016-01-01
The process for drug discovery and development is challenging, time consuming and expensive. Computer-aided drug discovery (CADD) tools can act as a virtual shortcut, assisting in the expedition of this long process and potentially reducing the cost of research and development. Today CADD has become an effective and indispensable tool in therapeutic development. The human genome project has made available a substantial amount of sequence data that can be used in various drug discovery projects. Additionally, increasing knowledge of biological structures, as well as increasing computer power have made it possible to use computational methods effectively in various phases of the drug discovery and development pipeline. The importance of in silico tools is greater than ever before and has advanced pharmaceutical research. Here we present an overview of computational methods used in different facets of drug discovery and highlight some of the recent successes. In this review, both structure-based and ligand-based drug discovery methods are discussed. Advances in virtual high-throughput screening, protein structure prediction methods, protein-ligand docking, pharmacophore modeling and QSAR techniques are reviewed.
Ragno, Rino; Artico, Marino; De Martino, Gabriella; La Regina, Giuseppe; Coluccia, Antonio; Di Pasquali, Alessandra; Silvestri, Romano
2005-01-13
Three-dimensional quantitative structure-activity relationship (3-D QSAR) studies and docking simulations were developed on indolyl aryl sulfones (IASs), a class of novel HIV-1 non-nucleoside reverse transcriptase (RT) inhibitors (Silvestri, et al. J. Med. Chem. 2003, 46, 2482-2493) highly active against wild type and some clinically relevant resistant strains (Y181C, the double mutant K103N-Y181C, and the K103R-V179D-P225H strain, highly resistant to efavirenz). Predictive 3-D QSAR models using the combination of GRID and GOLPE programs were obtained using a receptor-based alignment by means of docking IASs into the non-nucleoside binding site (NNBS) of RT. The derived 3-D QSAR models showed conventional correlation (r(2)) and cross-validated (q(2)) coefficients values ranging from 0.79 to 0.93 and from 0.59 to 0.84, respectively. All described models were validated by an external test set compiled from previously reported pyrryl aryl sulfones (Artico, et al. J. Med. Chem. 1996, 39, 522-530). The most predictive 3-D QSAR model was then used to predict the activity of novel untested IASs. The synthesis of six designed derivatives (prediction set) allowed disclosure of new IASs endowed with high anti-HIV-1 activities.
NASA Astrophysics Data System (ADS)
Asati, Vivek; Bharti, Sanjay Kumar; Budhwani, Ashok Kumar
2017-04-01
The proviral insertion site in moloney murine leukemia virus (PIM) is a family of serine/threonine kinase of Ca2+-calmodulin-dependent protein kinase (CAMK) group which is responsible for the activation and regulation of cellular transcription and translation. The three isoforms of PIM kinase (PIM-1, PIM-2 and PIM-3) share high homology and functional idleness are widely expressed and involved in a variety of biological processes including cell survival, proliferation, differentiation and apoptosis. Altered expression of PIM-1 kinase correlated with hematologic malignancies and solid tumors. In the present study, atom-based 3D-QSAR, docking and virtual screening studies have been performed on a series of thiazolidine-2,4-dione derivatives as PIM-1 kinase inhibitors. 3D-QSAR and docking approach has shortlisted the most active thiazolidine-2,4-dione derivatives such as 28, 31, 33 and 35 with the incorporation of more than one structural feature in a single molecule. External validations by various parameters and molecular docking studies at the active site of PIM-1 kinase have proved the reliability of the developed 3D-QSAR model. The generated pharmacophore (AADHR.33) from 3D-QSAR study was used for screening of drug like compounds from ZINC database, where ZINC15056464 and ZINC83292944 showed potential binding affinities at the active site amino acid residues (LYS67, GLU171, ASP128 and ASP186) of PIM-1 kinase.
3-D QSARS FOR RANKING AND PRIORITIZATION OF LARGE CHEMICAL DATASETS: AN EDC CASE STUDY
The COmmon REactivity Pattern (COREPA) approach is a three-dimensional structure activity (3-D QSAR) technique that permits identification and quantification of specific global and local steroelectronic characteristics associated with a chemical's biological activity. It goes bey...
3D-QSAR and molecular docking studies on HIV protease inhibitors
NASA Astrophysics Data System (ADS)
Tong, Jianbo; Wu, Yingji; Bai, Min; Zhan, Pei
2017-02-01
In order to well understand the chemical-biological interactions governing their activities toward HIV protease activity, QSAR models of 34 cyclic-urea derivatives with inhibitory HIV were developed. The quantitative structure activity relationship (QSAR) model was built by using comparative molecular similarity indices analysis (CoMSIA) technique. And the best CoMSIA model has rcv2, rncv2 values of 0.586 and 0.931 for cross-validated and non-cross-validated. The predictive ability of CoMSIA model was further validated by a test set of 7 compounds, giving rpred2 value of 0.973. Docking studies were used to find the actual conformations of chemicals in active site of HIV protease, as well as the binding mode pattern to the binding site in protease enzyme. The information provided by 3D-QSAR model and molecular docking may lead to a better understanding of the structural requirements of 34 cyclic-urea derivatives and help to design potential anti-HIV protease molecules.
Liu, Jing; Li, Yan; Zhang, Shuwei; Xiao, Zhengtao; Ai, Chunzhi
2011-01-01
In recent years, great interest has been paid to the development of compounds with high selectivity for central dopamine (DA) D3 receptors, an interesting therapeutic target in the treatment of different neurological disorders. In the present work, based on a dataset of 110 collected benzazepine (BAZ) DA D3 antagonists with diverse kinds of structures, a variety of in silico modeling approaches, including comparative molecular field analysis (CoMFA), comparative similarity indices analysis (CoMSIA), homology modeling, molecular docking and molecular dynamics (MD) were carried out to reveal the requisite 3D structural features for activity. Our results show that both the receptor-based (Q2 = 0.603, R2ncv = 0.829, R2pre = 0.690, SEE = 0.316, SEP = 0.406) and ligand-based 3D-QSAR models (Q2 = 0.506, R2ncv =0.838, R2pre = 0.794, SEE = 0.316, SEP = 0.296) are reliable with proper predictive capacity. In addition, a combined analysis between the CoMFA, CoMSIA contour maps and MD results with a homology DA receptor model shows that: (1) ring-A, position-2 and R3 substituent in ring-D are crucial in the design of antagonists with higher activity; (2) more bulky R1 substituents (at position-2 of ring-A) of antagonists may well fit in the binding pocket; (3) hydrophobicity represented by MlogP is important for building satisfactory QSAR models; (4) key amino acids of the binding pocket are CYS101, ILE105, LEU106, VAL151, PHE175, PHE184, PRO254 and ALA251. To our best knowledge, this work is the first report on 3D-QSAR modeling of the new fused BAZs as DA D3 antagonists. These results might provide information for a better understanding of the mechanism of antagonism and thus be helpful in designing new potent DA D3 antagonists. PMID:21541053
Liu, Jing; Li, Yan; Zhang, Shuwei; Xiao, Zhengtao; Ai, Chunzhi
2011-02-18
In recent years, great interest has been paid to the development of compounds with high selectivity for central dopamine (DA) D3 receptors, an interesting therapeutic target in the treatment of different neurological disorders. In the present work, based on a dataset of 110 collected benzazepine (BAZ) DA D3 antagonists with diverse kinds of structures, a variety of in silico modeling approaches, including comparative molecular field analysis (CoMFA), comparative similarity indices analysis (CoMSIA), homology modeling, molecular docking and molecular dynamics (MD) were carried out to reveal the requisite 3D structural features for activity. Our results show that both the receptor-based (Q(2) = 0.603, R(2) (ncv) = 0.829, R(2) (pre) = 0.690, SEE = 0.316, SEP = 0.406) and ligand-based 3D-QSAR models (Q(2) = 0.506, R(2) (ncv) =0.838, R(2) (pre) = 0.794, SEE = 0.316, SEP = 0.296) are reliable with proper predictive capacity. In addition, a combined analysis between the CoMFA, CoMSIA contour maps and MD results with a homology DA receptor model shows that: (1) ring-A, position-2 and R(3) substituent in ring-D are crucial in the design of antagonists with higher activity; (2) more bulky R(1) substituents (at position-2 of ring-A) of antagonists may well fit in the binding pocket; (3) hydrophobicity represented by MlogP is important for building satisfactory QSAR models; (4) key amino acids of the binding pocket are CYS101, ILE105, LEU106, VAL151, PHE175, PHE184, PRO254 and ALA251. To our best knowledge, this work is the first report on 3D-QSAR modeling of the new fused BAZs as DA D3 antagonists. These results might provide information for a better understanding of the mechanism of antagonism and thus be helpful in designing new potent DA D3 antagonists.
Yu, Haijing; Fang, Yu; Lu, Xia; Liu, Yongjuan; Zhang, Huabei
2014-01-01
The NS5B RNA-dependent RNA polymerase (RdRP) is a promising therapeutic target for developing novel anti-hepatitis C virus (HCV) drugs. In this work, a combined molecular modeling study was performed on a series of 193 5-hydroxy-2H-pyridazin-3-one derivatives as inhibitors of HCV NS5B Polymerase. The best 3D-QSAR models, including CoMFA and CoMSIA, are based on receptor (or docking). Furthermore, a 40-ns molecular dynamics (MD) simulation and binding free energy calculations using docked structures of NS5B with ten compounds, which have diverse structures and pIC50 values, were employed to determine the detailed binding process and to compare the binding modes of the inhibitors with different activities. On one side, the stability and rationality of molecular docking and 3D-QSAR results were validated by MD simulation. The binding free energies calculated by the MM-PBSA method gave a good correlation with the experimental biological activity. On the other side, by analyzing some differences between the molecular docking and the MD simulation results, we can find that the MD simulation could also remedy the defects of molecular docking. The analyses of the combined molecular modeling results have identified that Tyr448, Ser556, and Asp318 are the key amino acid residues in the NS5B binding pocket. The results from this study can provide some insights into the development of novel potent NS5B inhibitors. © 2013 John Wiley & Sons A/S.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Liying; Sedykh, Alexander; Tripathi, Ashutosh
2013-10-01
Identification of endocrine disrupting chemicals is one of the important goals of environmental chemical hazard screening. We report on the development of validated in silico predictors of chemicals likely to cause estrogen receptor (ER)-mediated endocrine disruption to facilitate their prioritization for future screening. A database of relative binding affinity of a large number of ERα and/or ERβ ligands was assembled (546 for ERα and 137 for ERβ). Both single-task learning (STL) and multi-task learning (MTL) continuous quantitative structure–activity relationship (QSAR) models were developed for predicting ligand binding affinity to ERα or ERβ. High predictive accuracy was achieved for ERα bindingmore » affinity (MTL R{sup 2} = 0.71, STL R{sup 2} = 0.73). For ERβ binding affinity, MTL models were significantly more predictive (R{sup 2} = 0.53, p < 0.05) than STL models. In addition, docking studies were performed on a set of ER agonists/antagonists (67 agonists and 39 antagonists for ERα, 48 agonists and 32 antagonists for ERβ, supplemented by putative decoys/non-binders) using the following ER structures (in complexes with respective ligands) retrieved from the Protein Data Bank: ERα agonist (PDB ID: 1L2I), ERα antagonist (PDB ID: 3DT3), ERβ agonist (PDB ID: 2NV7), and ERβ antagonist (PDB ID: 1L2J). We found that all four ER conformations discriminated their corresponding ligands from presumed non-binders. Finally, both QSAR models and ER structures were employed in parallel to virtually screen several large libraries of environmental chemicals to derive a ligand- and structure-based prioritized list of putative estrogenic compounds to be used for in vitro and in vivo experimental validation. - Highlights: • This is the largest curated dataset inclusive of ERα and β (the latter is unique). • New methodology that for the first time affords acceptable ERβ models. • A combination of QSAR and docking enables prediction of affinity and function. • The results have potential applications to green chemistry. • Models are publicly available for virtual screening via a web portal.« less
PBT assessment and prioritization of contaminants of emerging concern: Pharmaceuticals.
Sangion, Alessandro; Gramatica, Paola
2016-05-01
The strong and widespread use of pharmaceuticals, together with incorrect disposal procedures, has recently made these products contaminants of emerging concern (CEC). Unfortunately, little is known about pharmaceuticals' environmental behaviour and ecotoxicity, so that EMEA (European Medicines Agency) released guidelines for the pharmaceuticals' environmental risk assessment. In particular, there is a severe lack of information about persistence, bioaccumulation and toxicity (PBT) of the majority of the thousands of substances on the market. Computational tools, like QSAR (Quantitative Structure Activity Relationship) models, are the only way to screen large sets of chemicals in short time, with the aim of ranking, highlighting and prioritizing the most environmentally hazardous for focusing further experimental studies. In this work we propose a screening method to assess the potential persistence, bioaccumulation and toxicity of more than 1200 pharmaceutical ingredients, based on the application of two different QSAR models. We applied the Insubria-PBT Index, a MLR (Multiple Linear Regression) QSAR model based on four simple molecular descriptors, implemented in QSARINS software, and able to synthesize the PBT potential in a unique cumulative value and the US-EPA PBT Profiler that assesses the PBT behaviour evaluating separately P, B and T. Particular attention was given to the study of Applicability Domain in order to provide reliable predictions. An agreement of 86% was found between the two models and a priority list of 35 pharmaceuticals, highlighted as potential PBTs by consensus, was proposed for further experimental validation. Moreover, the results of this computational screening are in agreement with preliminary experimental data in the literature. This study shows how in silico models can be applied in the hazard assessment to perform preliminary screening and prioritization of chemicals, and how the identification of the structural features, mainly associated with the potential PBT behaviour of the prioritized pharmaceuticals, is particularly relevant to perform the rational a priori design of new, environmentally safer, pharmaceuticals. Copyright © 2016 Elsevier Inc. All rights reserved.
NASA Technical Reports Server (NTRS)
Shevade, A. V.; Ryan, M. A.; Homer, M. L.; Jewell, A. D.; Zhou, H.; Manatt, K.; Kisor, A. K.
2005-01-01
We report a Quantitative Structure-Activity Relationships (QSAR) study using Genetic Function Approximations (GFA) to describe the polymer-carbon composite sensor activities in the JPL Electronic Nose, when exposed to chemical vapors at parts-per-million concentration levels.
Eren, Gokcen; Macchiarulo, Antonio; Banoglu, Erden
2012-02-01
Pharmacological intervention with 5-Lipoxygenase (5-LO) is a promising strategy for treatment of inflammatory and allergic ailments, including asthma. With the aim of developing predictive models of 5-LO affinity and gaining insights into the molecular basis of ligand-target interaction, we herein describe QSAR studies of 59 diverse nonredox-competitive 5-LO inhibitors based on the use of molecular shape descriptors and docking experiments. These studies have successfully yielded a predictive model able to explain much of the variance in the activity of the training set compounds while predicting satisfactorily the 5-LO inhibitory activity of an external test set of compounds. The inspection of the selected variables in the QSAR equation unveils the importance of specific interactions which are observed from docking experiments. Collectively, these results may be used to design novel potent and selective nonredox 5-LO inhibitors. Copyright © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Structure-Based Predictions of Activity Cliffs
Husby, Jarmila; Bottegoni, Giovanni; Kufareva, Irina; Abagyan, Ruben; Cavalli, Andrea
2015-01-01
In drug discovery, it is generally accepted that neighboring molecules in a given descriptors' space display similar activities. However, even in regions that provide strong predictability, structurally similar molecules can occasionally display large differences in potency. In QSAR jargon, these discontinuities in the activity landscape are known as ‘activity cliffs’. In this study, we assessed the reliability of ligand docking and virtual ligand screening schemes in predicting activity cliffs. We performed our calculations on a diverse, independently collected database of cliff-forming co-crystals. Starting from ideal situations, which allowed us to establish our baseline, we progressively moved toward simulating more realistic scenarios. Ensemble- and template-docking achieved a significant level of accuracy, suggesting that, despite the well-known limitations of empirical scoring schemes, activity cliffs can be accurately predicted by advanced structure-based methods. PMID:25918827
Building on a solid foundation: SAR and QSAR as a fundamental strategy to reduce animal testing.
Sullivan, K M; Manuppello, J R; Willett, C E
2014-01-01
The development of more efficient, ethical, and effective means of assessing the effects of chemicals on human health and the environment was a lifetime goal of Gilman Veith. His work has provided the foundation for the use of chemical structure for informing toxicological assessment by regulatory agencies the world over. Veith's scientific work influenced the early development of the SAR models in use today at the US Environmental Protection Agency. He was the driving force behind the Organisation for Economic Co-operation and Development QSAR Toolbox. Veith was one of a few early pioneers whose vision led to the linkage of chemical structure and biological activity as a means of predicting adverse apical outcomes (known as a mode of action, or an adverse outcome pathway approach), and he understood at an early stage the power that could be harnessed when combining computational and mechanistic biological approaches as a means of avoiding animal testing. Through the International QSAR Foundation he organized like-minded experts to develop non-animal methods and frameworks for the assessment of chemical hazard and risk for the benefit of public and environmental health. Avoiding animal testing was Gil's passion, and his work helped to initiate the paradigm shift in toxicology that is now rendering this feasible.
Da, Chenxiao; Mooberry, Susan L.; Gupton, John T.; Kellogg, Glen E.
2013-01-01
αβ-tubulin colchicine site inhibitors (CSIs) from four scaffolds that we previously tested for antiproliferative activity were modeled to better understand their effect on microtubules. Docking models, constructed by exploiting the SAR of a pyrrole subset and HINT scoring, guided ensemble docking of all 59 compounds. This conformation set and two variants having progressively less structure knowledge were subjected to CoMFA, CoMFA+HINT, and CoMSIA 3D-QSAR analyses. The CoMFA+HINT model (docked alignment) showed the best statistics: leave-one-out q2 of 0.616, r2 of 0.949 and r2pred (internal test set) of 0.755. An external (tested in other laboratories) collection of 24 CSIs from eight scaffolds were evaluated with the 3D-QSAR models, which correctly ranked their activity trends in 7/8 scaffolds for CoMFA+HINT (8/8 for CoMFA). The combination of SAR, ensemble docking, hydropathic analysis and 3D-QSAR provides an atomic-scale colchicine site model more consistent with a target structure resolution much higher than the ~3.6 Å available for αβ-tubulin. PMID:23961916
Statistical molecular design of balanced compound libraries for QSAR modeling.
Linusson, A; Elofsson, M; Andersson, I E; Dahlgren, M K
2010-01-01
A fundamental step in preclinical drug development is the computation of quantitative structure-activity relationship (QSAR) models, i.e. models that link chemical features of compounds with activities towards a target macromolecule associated with the initiation or progression of a disease. QSAR models are computed by combining information on the physicochemical and structural features of a library of congeneric compounds, typically assembled from two or more building blocks, and biological data from one or more in vitro assays. Since the models provide information on features affecting the compounds' biological activity they can be used as guides for further optimization. However, in order for a QSAR model to be relevant to the targeted disease, and drug development in general, the compound library used must contain molecules with balanced variation of the features spanning the chemical space believed to be important for interaction with the biological target. In addition, the assays used must be robust and deliver high quality data that are directly related to the function of the biological target and the associated disease state. In this review, we discuss and exemplify the concept of statistical molecular design (SMD) in the selection of building blocks and final synthetic targets (i.e. compounds to synthesize) to generate information-rich, balanced libraries for biological testing and computation of QSAR models.
Jin, Wen-Yan; Ma, Ying; Li, Wei-Ya; Li, Hong-Lian; Wang, Run-Ling
2018-04-01
SHP2 is a potential target for the development of novel therapies for SHP2-dependent cancers. In our research, with the aid of the 'Receptor-Ligand Pharmacophore' technique, a 3D-QSAR method was carried out to explore structure activity relationship of SHP2 allosteric inhibitors. Structure-based drug design was employed to optimize SHP099, an efficacious, potent, and selective SHP2 allosteric inhibitor. A novel class of selective SHP2 allosteric inhibitors was discovered by using the powerful 'SBP', 'ADMET' and 'CDOCKER' techniques. By means of molecular dynamics simulations, it was observed that these novel inhibitors not only had the same function as SHP099 did in inhibiting SHP2, but also had more favorable conformation for binding to the receptor. Thus, this report may provide a new method in discovering novel and selective SHP2 allosteric inhibitors. Copyright © 2018 Elsevier Ltd. All rights reserved.
Hui Wang; Mingyue Jiang; Shujun Li; Chung-Yun Hse; Chunde Jin; Fangli Sun; Zhuo Li
2017-01-01
Cinnamaldehyde amino acid Schiff base (CAAS) is a new class of safe, bioactive compounds which could be developed as potential antifungal agents for fungal infections. To design new cinnamaldehyde amino acid Schiff base compounds with high bioactivity, the quantitative structureâactivity relationships (QSARs) for CAAS compounds against Aspergillus niger (A. niger) and...
Freitas, Mirlaine R; Matias, Stella V B G; Macedo, Renato L G; Freitas, Matheus P; Venturin, Nelson
2013-09-11
Two of major weeds affecting cereal crops worldwide are Avena fatua L. (wild oat) and Lolium rigidum Gaud. (rigid ryegrass). Thus, development of new herbicides against these weeds is required; in line with this, benzoxazinones, their degradation products, and analogues have been shown to be important allelochemicals and natural herbicides. Despite earlier structure-activity studies demonstrating that hydrophobicity (log P) of aminophenoxazines correlates to phytotoxicity, our findings for a series of benzoxazinone derivatives do not show any relationship between phytotoxicity and log P nor with other two usual molecular descriptors. On the other hand, a quantitative structure-activity relationship (QSAR) analysis based on molecular graphs representing structural shape, atomic sizes, and colors to encode other atomic properties performed very accurately for the prediction of phytotoxicities of these compounds against wild oat and rigid ryegrass. Therefore, these QSAR models can be used to estimate the phytotoxicity of new congeners of benzoxazinone herbicides toward A. fatua L. and L. rigidum Gaud.
Jensen, G.E.; Niemelä, J.R.; Wedebye, E.B.; Nikolov, N.G.
2008-01-01
A special challenge in the new European Union chemicals legislation, Registration, Evaluation and Authorisation of Chemicals, will be the toxicological evaluation of chemicals for reproductive toxicity. Use of valid quantitative structure–activity relationships (QSARs) is a possibility under the new legislation. This article focuses on a screening exercise by use of our own and commercial QSAR models for identification of possible reproductive toxicants. Three QSAR models were used for reproductive toxicity for the endpoints teratogenic risk to humans (based on animal tests, clinical data and epidemiological human studies), dominant lethal effect in rodents (in vivo) and Drosophila melanogaster sex-linked recessive lethal effect. A structure set of 57,014 European Inventory of Existing Chemical Substances (EINECS) chemicals was screened. A total of 5240 EINECS chemicals, corresponding to 9.2%, were predicted as reproductive toxicants by one or more of the models. The chemicals predicted positive for reproductive toxicity will be submitted to the Danish Environmental Protection Agency as scientific input for a future updated advisory classification list with advisory classifications for concern for humans owing to possible developmental toxic effects: Xn (Harmful) and R63 (Possible risk of harm to the unborn child). The chemicals were also screened in three models for endocrine disruption. PMID:19061080
Does Rational Selection of Training and Test Sets Improve the Outcome of QSAR Modeling?
Prior to using a quantitative structure activity relationship (QSAR) model for external predictions, its predictive power should be established and validated. In the absence of a true external dataset, the best way to validate the predictive ability of a model is to perform its s...
Salter-Blanc, Alexandra; Bylaska, Eric J.; Johnston, Hayley; ...
2015-02-11
The evaluation of new energetic nitroaromatic compounds (NACs) for use in green munitions formulations requires models that can predict their environmental fate. The susceptibility of energetic NACs to nitro reduction might be predicted from correlations between rate constants (k) for this reaction and one-electron reduction potentials (E1NAC) / 0.059 V, but the mechanistic implications of such correlations are inconsistent with evidence from other methods. To address this inconsistency, we have reevaluated existing kinetic data using a (non-linear) free-energy relationship (FER) based on the Marcus theory of outer-sphere electron transfer. For most reductants, the results are inconsistent with rate limitation bymore » an initial, outer-sphere electron transfer, suggesting that the strong correlation between k and E1NAC is justified only as an empirical model. This empirical correlation was used to calibrate a new quantitative structure-activity relationship (QSAR) using previously reported values of k for non-energetic NAC reduction by Fe(II) porphyrin and newly reported values of E1NAC determined using density functional theory at the B3LYP/6-311++G(2d,2p) level with the COSMO solvation model. The QSAR was then validated for energetic NACs using newly measured kinetic data for 2,4,6-trinitrotoluene (TNT), 2,4-dinitrotoluene (2,4-DNT), and 2,4-dinitroanisole (DNAN). The data show close agreement with the QSAR, supporting its applicability to energetic NACs.« less
Lenselink, Eelke B; Ten Dijke, Niels; Bongers, Brandon; Papadatos, George; van Vlijmen, Herman W T; Kowalczyk, Wojtek; IJzerman, Adriaan P; van Westen, Gerard J P
2017-08-14
The increase of publicly available bioactivity data in recent years has fueled and catalyzed research in chemogenomics, data mining, and modeling approaches. As a direct result, over the past few years a multitude of different methods have been reported and evaluated, such as target fishing, nearest neighbor similarity-based methods, and Quantitative Structure Activity Relationship (QSAR)-based protocols. However, such studies are typically conducted on different datasets, using different validation strategies, and different metrics. In this study, different methods were compared using one single standardized dataset obtained from ChEMBL, which is made available to the public, using standardized metrics (BEDROC and Matthews Correlation Coefficient). Specifically, the performance of Naïve Bayes, Random Forests, Support Vector Machines, Logistic Regression, and Deep Neural Networks was assessed using QSAR and proteochemometric (PCM) methods. All methods were validated using both a random split validation and a temporal validation, with the latter being a more realistic benchmark of expected prospective execution. Deep Neural Networks are the top performing classifiers, highlighting the added value of Deep Neural Networks over other more conventional methods. Moreover, the best method ('DNN_PCM') performed significantly better at almost one standard deviation higher than the mean performance. Furthermore, Multi-task and PCM implementations were shown to improve performance over single task Deep Neural Networks. Conversely, target prediction performed almost two standard deviations under the mean performance. Random Forests, Support Vector Machines, and Logistic Regression performed around mean performance. Finally, using an ensemble of DNNs, alongside additional tuning, enhanced the relative performance by another 27% (compared with unoptimized 'DNN_PCM'). Here, a standardized set to test and evaluate different machine learning algorithms in the context of multi-task learning is offered by providing the data and the protocols. Graphical Abstract .
Asadabadi, Ebrahim Barzegari; Abdolmaleki, Parviz; Barkooie, Seyyed Mohsen Hosseini; Jahandideh, Samad; Rezaei, Mohammad Ali
2009-12-01
Regarding the great potential of dual binding site inhibitors of acetylcholinesterase as the future potent drugs of Alzheimer's disease, this study was devoted to extraction of the most effective structural features of these inhibitors from among a large number of quantitative descriptors. To do this, we adopted a unique approach in quantitative structure-activity relationships. An efficient feature selection method was emphasized in such an approach, using the confirmative results of different routine and novel feature selection methods. The proposed methods generated quite consistent results ensuring the effectiveness of the selected structural features.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liu, Z.T.; Wang, L.S.; Chen, S.P.
1996-12-31
The fundamental differentiation of toxicity is between reactive and nonreactive toxicity. Reactive toxicity is associated with a specific mechanism for the reaction with an enzyme or inhibition of a metabolic pathway, and nonreactive toxicity is related directly to the quantity of toxicant acting upon the cell. The quantitative structure-activity relationships (QSARs) have been successfully used in the nonreactive toxicity, such as prediction of the toxicity of nonreactive compounds based on their solubility in the lipids of organisms. The elements of molecular structure that are most closely related to nonreactive toxicity are those that describe the partitioning of the toxicant intomore » the organism, while QSARs for the reactive toxicity are less common in the environmental toxicology literature. With the recent increase in the use of synthetic substituted benzenes as industrial chemicals, the accurate analysis of the effect of reactive toxic chemicals has become recognized with QSAR. For this purpose, we selected the fish (Carassias auratus) as the test organism, measured the acute toxicity of 50% lethal concentration (LC{sub 50}) of the chemicals and the adenosine triphosphate (ATP) content of the liver cells for the organism. These determined the relationships of the acute toxicity of some substituted benzenes with their physicochemical structural parameters. The effects on the ATP content was also compared to predict biological reactivities of the chemicals, so as to find some clues to explain the mode of mechanism of the toxicity. 17 refs., 1 tab.« less
Xie, Huiding; Chen, Lijun; Zhang, Jianqiang; Xie, Xiaoguang; Qiu, Kaixiong; Fu, Jijun
2015-01-01
B-Raf kinase is an important target in treatment of cancers. In order to design and find potent B-Raf inhibitors (BRIs), 3D pharmacophore models were created using the Genetic Algorithm with Linear Assignment of Hypermolecular Alignment of Database (GALAHAD). The best pharmacophore model obtained which was used in effective alignment of the data set contains two acceptor atoms, three donor atoms and three hydrophobes. In succession, comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) were performed on 39 imidazopyridine BRIs to build three dimensional quantitative structure-activity relationship (3D QSAR) models based on both pharmacophore and docking alignments. The CoMSIA model based on the pharmacophore alignment shows the best result (q2 = 0.621, r2pred = 0.885). This 3D QSAR approach provides significant insights that are useful for designing potent BRIs. In addition, the obtained best pharmacophore model was used for virtual screening against the NCI2000 database. The hit compounds were further filtered with molecular docking, and their biological activities were predicted using the CoMSIA model, and three potential BRIs with new skeletons were obtained. PMID:26035757
Xie, Huiding; Chen, Lijun; Zhang, Jianqiang; Xie, Xiaoguang; Qiu, Kaixiong; Fu, Jijun
2015-05-29
B-Raf kinase is an important target in treatment of cancers. In order to design and find potent B-Raf inhibitors (BRIs), 3D pharmacophore models were created using the Genetic Algorithm with Linear Assignment of Hypermolecular Alignment of Database (GALAHAD). The best pharmacophore model obtained which was used in effective alignment of the data set contains two acceptor atoms, three donor atoms and three hydrophobes. In succession, comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) were performed on 39 imidazopyridine BRIs to build three dimensional quantitative structure-activity relationship (3D QSAR) models based on both pharmacophore and docking alignments. The CoMSIA model based on the pharmacophore alignment shows the best result (q(2) = 0.621, r(2)(pred) = 0.885). This 3D QSAR approach provides significant insights that are useful for designing potent BRIs. In addition, the obtained best pharmacophore model was used for virtual screening against the NCI2000 database. The hit compounds were further filtered with molecular docking, and their biological activities were predicted using the CoMSIA model, and three potential BRIs with new skeletons were obtained.
Amin, Sk Abdul; Adhikari, Nilanjan; Jha, Tarun; Gayen, Shovanlal
2016-12-01
Huntington's disease (HD) is caused by mutation of huntingtin protein (mHtt) leading to neuronal cell death. The mHtt induced toxicity can be rescued by inhibiting the kynurenine monooxygenase (KMO) enzyme. Therefore, KMO is a promising drug target to address the neurodegenerative disorders such as Huntington's diseases. Fiftysix arylpyrimidine KMO inhibitors are structurally explored through regression and classification based multi-QSAR modeling, pharmacophore mapping and molecular docking approaches. Moreover, ten new compounds are proposed and validated through the modeling that may be effective in accelerating Huntington's disease drug discovery efforts. Copyright © 2016 Elsevier Ltd. All rights reserved.
DPubChem: a web tool for QSAR modeling and high-throughput virtual screening.
Soufan, Othman; Ba-Alawi, Wail; Magana-Mora, Arturo; Essack, Magbubah; Bajic, Vladimir B
2018-06-14
High-throughput screening (HTS) performs the experimental testing of a large number of chemical compounds aiming to identify those active in the considered assay. Alternatively, faster and cheaper methods of large-scale virtual screening are performed computationally through quantitative structure-activity relationship (QSAR) models. However, the vast amount of available HTS heterogeneous data and the imbalanced ratio of active to inactive compounds in an assay make this a challenging problem. Although different QSAR models have been proposed, they have certain limitations, e.g., high false positive rates, complicated user interface, and limited utilization options. Therefore, we developed DPubChem, a novel web tool for deriving QSAR models that implement the state-of-the-art machine-learning techniques to enhance the precision of the models and enable efficient analyses of experiments from PubChem BioAssay database. DPubChem also has a simple interface that provides various options to users. DPubChem predicted active compounds for 300 datasets with an average geometric mean and F 1 score of 76.68% and 76.53%, respectively. Furthermore, DPubChem builds interaction networks that highlight novel predicted links between chemical compounds and biological assays. Using such a network, DPubChem successfully suggested a novel drug for the Niemann-Pick type C disease. DPubChem is freely available at www.cbrc.kaust.edu.sa/dpubchem .
AlzhCPI: A knowledge base for predicting chemical-protein interactions towards Alzheimer's disease.
Fang, Jiansong; Wang, Ling; Li, Yecheng; Lian, Wenwen; Pang, Xiaocong; Wang, Hong; Yuan, Dongsheng; Wang, Qi; Liu, Ai-Lin; Du, Guan-Hua
2017-01-01
Alzheimer's disease (AD) is a complicated progressive neurodegeneration disorder. To confront AD, scientists are searching for multi-target-directed ligands (MTDLs) to delay disease progression. The in silico prediction of chemical-protein interactions (CPI) can accelerate target identification and drug discovery. Previously, we developed 100 binary classifiers to predict the CPI for 25 key targets against AD using the multi-target quantitative structure-activity relationship (mt-QSAR) method. In this investigation, we aimed to apply the mt-QSAR method to enlarge the model library to predict CPI towards AD. Another 104 binary classifiers were further constructed to predict the CPI for 26 preclinical AD targets based on the naive Bayesian (NB) and recursive partitioning (RP) algorithms. The internal 5-fold cross-validation and external test set validation were applied to evaluate the performance of the training sets and test set, respectively. The area under the receiver operating characteristic curve (ROC) for the test sets ranged from 0.629 to 1.0, with an average of 0.903. In addition, we developed a web server named AlzhCPI to integrate the comprehensive information of approximately 204 binary classifiers, which has potential applications in network pharmacology and drug repositioning. AlzhCPI is available online at http://rcidm.org/AlzhCPI/index.html. To illustrate the applicability of AlzhCPI, the developed system was employed for the systems pharmacology-based investigation of shichangpu against AD to enhance the understanding of the mechanisms of action of shichangpu from a holistic perspective.
Castillo-Garit, Juan Alberto; Abad, Concepción; Rodríguez-Borges, J Enrique; Marrero-Ponce, Yovani; Torrens, Francisco
2012-01-01
The neglected tropical diseases (NTDs) affect more than one billion people (one-sixth of the world's population) and occur primarily in undeveloped countries in sub-Saharan Africa, Asia, and Latin America. Available drugs for these diseases are decades old and present an important number of limitations, especially high toxicity and, more recently, the emergence of drug resistance. In the last decade several Quantitative Structure-Activity Relationship (QSAR) studies have been developed in order to identify new organic compounds with activity against the parasites responsible for these diseases, which are reviewed in this paper. The topics summarized in this work are: 1) QSAR studies to identify new organic compounds actives against Chaga's disease; 2) Development of QSAR studies to discover new antileishmanial drusg; 3) Computational studies to identify new drug-like compounds against human African trypanosomiasis. Each topic include the general characteristics, epidemiology and chemotherapy of the disease as well as the main QSAR approaches to discovery/identification of new actives compounds for the corresponding neglected disease. The last section is devoted to a new approach know as multi-target QSAR models developed for antiparasitic drugs specifically those actives against trypanosomatid parasites. At present, as a result of these QSAR studies several promising compounds, active against these parasites, are been indentify. However, more efforts will be required in the future to develop more selective (specific) useful drugs.
Use of statistical and neural net approaches in predicting toxicity of chemicals.
Basak, S C; Grunwald, G D; Gute, B D; Balasubramanian, K; Opitz, D
2000-01-01
Hierarchical quantitative structure-activity relationships (H-QSAR) have been developed as a new approach in constructing models for estimating physicochemical, biomedicinal, and toxicological properties of interest. This approach uses increasingly more complex molecular descriptors in a graduated approach to model building. In this study, statistical and neural network methods have been applied to the development of H-QSAR models for estimating the acute aquatic toxicity (LC50) of 69 benzene derivatives to Pimephales promelas (fathead minnow). Topostructural, topochemical, geometrical, and quantum chemical indices were used as the four levels of the hierarchical method. It is clear from both the statistical and neural network models that topostructural indices alone cannot adequately model this set of congeneric chemicals. Not surprisingly, topochemical indices greatly increase the predictive power of both statistical and neural network models. Quantum chemical indices also add significantly to the modeling of this set of acute aquatic toxicity data.
USDA-ARS?s Scientific Manuscript database
A three-dimensional quantitative structure-activity relationship (3D-QSAR) model of sulfonamide analogs binding a monoclonal antibody (MAbSMR) produced against sulfamerazine was carried out by Distance Comparison (DISCOtech), comparative molecular field analysis (CoMFA), and comparative molecular si...
The objective of this work is to use the Exposure Related Dose Estimating Model (ERDEM) and quantitative structure-activity relationship (QSAR) models to develop an assessment tool for human exposure assessment to triazole fungicides. A dermal exposure route is used for the physi...
The increasing number and size of public databases is facilitating the collection of chemical structures and associated experimental data for QSAR modeling. However, the performance of QSAR models is highly dependent not only on the modeling methodology, but also on the quality o...
Low, Yen S.; Sedykh, Alexander; Rusyn, Ivan; Tropsha, Alexander
2017-01-01
Cheminformatics approaches such as Quantitative Structure Activity Relationship (QSAR) modeling have been used traditionally for predicting chemical toxicity. In recent years, high throughput biological assays have been increasingly employed to elucidate mechanisms of chemical toxicity and predict toxic effects of chemicals in vivo. The data generated in such assays can be considered as biological descriptors of chemicals that can be combined with molecular descriptors and employed in QSAR modeling to improve the accuracy of toxicity prediction. In this review, we discuss several approaches for integrating chemical and biological data for predicting biological effects of chemicals in vivo and compare their performance across several data sets. We conclude that while no method consistently shows superior performance, the integrative approaches rank consistently among the best yet offer enriched interpretation of models over those built with either chemical or biological data alone. We discuss the outlook for such interdisciplinary methods and offer recommendations to further improve the accuracy and interpretability of computational models that predict chemical toxicity. PMID:24805064
Although ranking schemes based on exposure and toxicity have been developed to aid in the prioritization of research funds for identifying chemicals of regulatory concern, there are significant gaps in the availability of experimental toxicity data for most health endpoints. Pred...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Valerio, Luis G.; Arvidson, Kirk B.; Chanderbhan, Ronald F.
2007-07-01
Consistent with the U.S. Food and Drug Administration (FDA) Critical Path Initiative, predictive toxicology software programs employing quantitative structure-activity relationship (QSAR) models are currently under evaluation for regulatory risk assessment and scientific decision support for highly sensitive endpoints such as carcinogenicity, mutagenicity and reproductive toxicity. At the FDA's Center for Food Safety and Applied Nutrition's Office of Food Additive Safety and the Center for Drug Evaluation and Research's Informatics and Computational Safety Analysis Staff (ICSAS), the use of computational SAR tools for both qualitative and quantitative risk assessment applications are being developed and evaluated. One tool of current interest ismore » MDL-QSAR predictive discriminant analysis modeling of rodent carcinogenicity, which has been previously evaluated for pharmaceutical applications by the FDA ICSAS. The study described in this paper aims to evaluate the utility of this software to estimate the carcinogenic potential of small, organic, naturally occurring chemicals found in the human diet. In addition, a group of 19 known synthetic dietary constituents that were positive in rodent carcinogenicity studies served as a control group. In the test group of naturally occurring chemicals, 101 were found to be suitable for predictive modeling using this software's discriminant analysis modeling approach. Predictions performed on these compounds were compared to published experimental evidence of each compound's carcinogenic potential. Experimental evidence included relevant toxicological studies such as rodent cancer bioassays, rodent anti-carcinogenicity studies, genotoxic studies, and the presence of chemical structural alerts. Statistical indices of predictive performance were calculated to assess the utility of the predictive modeling method. Results revealed good predictive performance using this software's rodent carcinogenicity module of over 1200 chemicals, comprised primarily of pharmaceutical, industrial and some natural products developed under an FDA-MDL cooperative research and development agreement (CRADA). The predictive performance for this group of dietary natural products and the control group was 97% sensitivity and 80% concordance. Specificity was marginal at 53%. This study finds that the in silico QSAR analysis employing this software's rodent carcinogenicity database is capable of identifying the rodent carcinogenic potential of naturally occurring organic molecules found in the human diet with a high degree of sensitivity. It is the first study to demonstrate successful QSAR predictive modeling of naturally occurring carcinogens found in the human diet using an external validation test. Further test validation of this software and expansion of the training data set for dietary chemicals will help to support the future use of such QSAR methods for screening and prioritizing the risk of dietary chemicals when actual animal data are inadequate, equivocal, or absent.« less
Saraiva, Ádria P B; Miranda, Ricardo M; Valente, Renan P P; Araújo, Jéssica O; Souza, Rutelene N B; Costa, Clauber H S; Oliveira, Amanda R S; Almeida, Michell O; Figueiredo, Antonio F; Ferreira, João E V; Alves, Cláudio Nahum; Honorio, Kathia M
2018-04-22
In this work, a group of α-keto-based inhibitors of the cruzain enzyme with anti-chagas activity was selected for a three-dimensional quantitative structure-activity relationship study (3D-QSAR) combined with molecular dynamics (MD). Firstly, statistical models based on Partial Least Square (PLS) regression were developed employing comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) descriptors. Validation parameters (q 2 and r 2 )for the models were, respectively, 0.910 and 0.997 (CoMFA) and 0.913 and 0.992 (CoMSIA). In addition, external validation for the models using a test group revealed r 2 pred = 0.728 (CoMFA) and 0.971 (CoMSIA). The most relevant aspect in this study was the generation of molecular fields in both favorable and unfavorable regions based on the models developed. These fields are important to interpret modifications necessary to enhance the biological activities of the inhibitors. This analysis was restricted considering the inhibitors in a fixed conformation, not interacting with their target, the cruzain enzyme. Then, MD was employed taking into account important variables such as time and temperature. MD helped describe the behavior of the inhibitors and their properties showed similar results as those generated by QSAR-3D study. © 2018 John Wiley & Sons A/S.
NASA Astrophysics Data System (ADS)
Baumgartner, Matthew P.; Evans, David A.
2018-01-01
Two of the major ongoing challenges in computational drug discovery are predicting the binding pose and affinity of a compound to a protein. The Drug Design Data Resource Grand Challenge 2 was developed to address these problems and to drive development of new methods. The challenge provided the 2D structures of compounds for which the organizers help blinded data in the form of 35 X-ray crystal structures and 102 binding affinity measurements and challenged participants to predict the binding pose and affinity of the compounds. We tested a number of pose prediction methods as part of the challenge; we found that docking methods that incorporate protein flexibility (Induced Fit Docking) outperformed methods that treated the protein as rigid. We also found that using binding pose metadynamics, a molecular dynamics based method, to score docked poses provided the best predictions of our methods with an average RMSD of 2.01 Å. We tested both structure-based (e.g. docking) and ligand-based methods (e.g. QSAR) in the affinity prediction portion of the competition. We found that our structure-based methods based on docking with Smina (Spearman ρ = 0.614), performed slightly better than our ligand-based methods (ρ = 0.543), and had equivalent performance with the other top methods in the competition. Despite the overall good performance of our methods in comparison to other participants in the challenge, there exists significant room for improvement especially in cases such as these where protein flexibility plays such a large role.
Zong, Guanghui; Yan, Xiaojing; Bi, Jiawei; Jiang, Rui; Qin, Yinan; Yuan, Huizhu; Lu, Huizhe; Dong, Yanhong; Jin, Shuhui; Zhang, Jianjun
2017-01-01
1,3,4-Thiadiazole and sugar-derived molecules have proven to be promising agrochemicals with growth promoting, insecticidal and fungicidal activities. In the research field of agricultural fungicide, applying union of active group we synthesized a new set of 1,3,4-thiadiazole xylofuranose derivatives and all of the compounds were characterized by 1H NMR and HRMS. In precise toxicity measurement, some of compounds exhibited more potent fungicidal activities than the most widely used commercial fungicide Chlorothalonil, promoting further research and development. Based on our experimental data, 3D-QSAR (three-dimensional quantitative structure-activity relationship) was established and investigated using comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) techniques, helping to better understand the structural requirements of lead compounds with high fungicidal activity and environmental compatibility. PMID:28746366
Shahlaei, Mohsen; Sabet, Razieh; Ziari, Maryam Bahman; Moeinifard, Behzad; Fassihi, Afshin; Karbakhsh, Reza
2010-10-01
Quantitative relationships between molecular structure and methionine aminopeptidase-2 inhibitory activity of a series of cytotoxic anthranilic acid sulfonamide derivatives were discovered. We have demonstrated the detailed application of two efficient nonlinear methods for evaluation of quantitative structure-activity relationships of the studied compounds. Components produced by principal component analysis as input of developed nonlinear models were used. The performance of the developed models namely PC-GRNN and PC-LS-SVM were tested by several validation methods. The resulted PC-LS-SVM model had a high statistical quality (R(2)=0.91 and R(CV)(2)=0.81) for predicting the cytotoxic activity of the compounds. Comparison between predictability of PC-GRNN and PC-LS-SVM indicates that later method has higher ability to predict the activity of the studied molecules. Copyright (c) 2010 Elsevier Masson SAS. All rights reserved.
Scior, Thomas; Lozano-Aponte, Jorge; Ajmani, Subhash; Hernández-Montero, Eduardo; Chávez-Silva, Fabiola; Hernández-Núñez, Emanuel; Moo-Puc, Rosa; Fraguela-Collar, Andres; Navarrete-Vázquez, Gabriel
2015-01-01
In view of the serious health problems concerning infectious diseases in heavily populated areas, we followed the strategy of lead compound diversification to evaluate the near-by chemical space for new organic compounds. To this end, twenty derivatives of nitazoxanide (NTZ) were synthesized and tested for activity against Entamoeba histolytica parasites. To ensure drug-likeliness and activity relatedness of the new compounds, the synthetic work was assisted by a quantitative structure-activity relationships study (QSAR). Many of the inherent downsides – well-known to QSAR practitioners – we circumvented thanks to workarounds which we proposed in prior QSAR publication. To gain further mechanistic insight on a molecular level, ligand-enzyme docking simulations were carried out since NTZ is known to inhibit the protozoal pyruvate ferredoxin oxidoreductase (PFOR) enzyme as its biomolecular target. PMID:25872791
A Novel Two-Step Hierarchial Quantitative Structure-Activity ...
Background: Accurate prediction of in vivo toxicity from in vitro testing is a challenging problem. Large public–private consortia have been formed with the goal of improving chemical safety assessment by the means of high-throughput screening. Methods and results: A database containing experimental cytotoxicity values for in vitro half-maximal inhibitory concentration (IC50) and in vivo rodent median lethal dose (LD50) for more than 300 chemicals was compiled by Zentralstelle zur Erfassung und Bewertung von Ersatz- und Ergaenzungsmethoden zum Tierversuch (ZEBET ; National Center for Documentation and Evaluation of Alternative Methods to Animal Experiments) . The application of conventional quantitative structure–activity relationship (QSAR) modeling approaches to predict mouse or rat acute LD50 values from chemical descriptors of ZEBET compounds yielded no statistically significant models. The analysis of these data showed no significant correlation between IC50 and LD50. However, a linear IC50 versus LD50 correlation could be established for a fraction of compounds. To capitalize on this observation, we developed a novel two-step modeling approach as follows. First, all chemicals are partitioned into two groups based on the relationship between IC50 and LD50 values: One group comprises compounds with linear IC50 versus LD50 relationships, and another group comprises the remaining compounds. Second, we built conventional binary classification QSAR models t
A novel model to predict gas-phase hydroxyl radical oxidation kinetics of polychlorinated compounds.
Luo, Shuang; Wei, Zongsu; Spinney, Richard; Yang, Zhihui; Chai, Liyuan; Xiao, Ruiyang
2017-04-01
In this study, a novel model based on aromatic meta-substituent grouping was presented to predict the second-order rate constants (k) for OH oxidation of PCBs in gas-phase. Since the oxidation kinetics are dependent on the chlorination degree and position, we hypothesized that it may be more accurate for k value prediction if we group PCB congeners based on substitution positions (i.e., ortho (o), meta (m), and para (p)). To test this hypothesis, we examined the correlation of polarizability (α), a quantum chemical based descriptor for k values, with an empirical Hammett constant (σ + ) on each substitution position. Our result shows that α is highly linearly correlated to ∑σ o,m,p + based on aromatic meta-substituents leading to the grouping based predictive model. With the new model, the calculated k values exhibited an excellent agreement with experimental measurements, and greater predictive power than the quantum chemical based quantitative structure activity relationship (QSAR) model. Further, the relationship of α and ∑σ o,m,p + for PCDDs congeners, together with highest occupied molecular orbital (HOMO) distribution, were used to validate the aromatic meta-substituent grouping method. This newly developed model features a combination of good predictability of quantum chemical based QSAR model and simplicity of Hammett relationship, showing a great potential for fast and computational tractable prediction of k values for gas-phase OH oxidation of polychlorinated compounds. Copyright © 2017 Elsevier Ltd. All rights reserved.
Feng, Taotao; Wang, Hai; Zhang, Xiaojin; Sun, Haopeng; You, Qidong
2014-06-01
Protein lysine methyltransferase G9a, which catalyzes methylation of lysine 9 of histone H3 (H3K9) and lysine 373 (K373) of p53, is overexpressed in human cancers. This suggests that small molecular inhibitors of G9a might be attractive antitumor agents. Herein we report our efforts on the design of novel G9a inhibitor based on the 3D quantitative structure-activity relationship (3D-QSAR) analysis of a series of 2,4-diamino-7-aminoalkoxyquinazolineas G9a inhibitors. The 3D-QSAR model was generated from 47 compounds using docking based molecular alignment. The best predictions were obtained with CoMFA standard model (q2 =0.700, r2 = 0.952) and CoMSIA model combined with steric, electrostatic, hydrophobic, hydrogen bond donor and acceptor fields (q2 = 0.724, r2 =0.960). The structural requirements for substituted 2,4-diamino-7-aminoalkoxyquinazoline for G9a inhibitory activity can be obtained by analysing the COMSIA plots. Based on the information, six novel follow-up analogs were designed.
The mode of toxic action (MOA) has been recognized as a key determinant of chemical toxicity and as an alternative to chemical class-based predictive toxicity modeling. However, the development of quantitative structure activity relationship (QSAR) and other models has been limit...
tThe mode of toxic action (MOA) has been recognized as a key determinant of chemical toxicity andas an alternative to chemical class-based predictive toxicity modeling. However, the development ofquantitative structure activity relationship (QSAR) and other models has been limite...
NASA Astrophysics Data System (ADS)
Kharkar, Prashant S.; Reith, Maarten E. A.; Dutta, Aloke K.
2008-01-01
Three-dimensional quantitative structure-activity relationship (3D QSAR) using comparative molecular field analysis (CoMFA) was performed on a series of substituted tetrahydropyran (THP) derivatives possessing serotonin (SERT) and norepinephrine (NET) transporter inhibitory activities. The study aimed to rationalize the potency of these inhibitors for SERT and NET as well as the observed selectivity differences for NET over SERT. The dataset consisted of 29 molecules, of which 23 molecules were used as the training set for deriving CoMFA models for SERT and NET uptake inhibitory activities. Superimpositions were performed using atom-based fitting and 3-point pharmacophore-based alignment. Two charge calculation methods, Gasteiger-Hückel and semiempirical PM3, were tried. Both alignment methods were analyzed in terms of their predictive abilities and produced comparable results with high internal and external predictivities. The models obtained using the 3-point pharmacophore-based alignment outperformed the models with atom-based fitting in terms of relevant statistics and interpretability of the generated contour maps. Steric fields dominated electrostatic fields in terms of contribution. The selectivity analysis (NET over SERT), though yielded models with good internal predictivity, showed very poor external test set predictions. The analysis was repeated with 24 molecules after systematically excluding so-called outliers (5 out of 29) from the model derivation process. The resulting CoMFA model using the atom-based fitting exhibited good statistics and was able to explain most of the selectivity (NET over SERT)-discriminating factors. The presence of -OH substituent on the THP ring was found to be one of the most important factors governing the NET selectivity over SERT. Thus, a 4-point NET-selective pharmacophore, after introducing this newly found H-bond donor/acceptor feature in addition to the initial 3-point pharmacophore, was proposed.
Our study assesses the value of both in vitro assay and quantitative structure activity relationship (QSAR) data in predicting in vivo toxicity using numerous statistical models and approaches to process the data. Our models are built on datasets of (i) 586 chemicals for which bo...
Predictive QSAR modeling workflow, model applicability domains, and virtual screening.
Tropsha, Alexander; Golbraikh, Alexander
2007-01-01
Quantitative Structure Activity Relationship (QSAR) modeling has been traditionally applied as an evaluative approach, i.e., with the focus on developing retrospective and explanatory models of existing data. Model extrapolation was considered if only in hypothetical sense in terms of potential modifications of known biologically active chemicals that could improve compounds' activity. This critical review re-examines the strategy and the output of the modern QSAR modeling approaches. We provide examples and arguments suggesting that current methodologies may afford robust and validated models capable of accurate prediction of compound properties for molecules not included in the training sets. We discuss a data-analytical modeling workflow developed in our laboratory that incorporates modules for combinatorial QSAR model development (i.e., using all possible binary combinations of available descriptor sets and statistical data modeling techniques), rigorous model validation, and virtual screening of available chemical databases to identify novel biologically active compounds. Our approach places particular emphasis on model validation as well as the need to define model applicability domains in the chemistry space. We present examples of studies where the application of rigorously validated QSAR models to virtual screening identified computational hits that were confirmed by subsequent experimental investigations. The emerging focus of QSAR modeling on target property forecasting brings it forward as predictive, as opposed to evaluative, modeling approach.
Molecular docking and 3D-QSAR studies on inhibitors of DNA damage signaling enzyme human PARP-1.
Fatima, Sabiha; Bathini, Raju; Sivan, Sree Kanth; Manga, Vijjulatha
2012-08-01
Poly (ADP-ribose) polymerase-1 (PARP-1) operates in a DNA damage signaling network. Molecular docking and three dimensional-quantitative structure activity relationship (3D-QSAR) studies were performed on human PARP-1 inhibitors. Docked conformation obtained for each molecule was used as such for 3D-QSAR analysis. Molecules were divided into a training set and a test set randomly in four different ways, partial least square analysis was performed to obtain QSAR models using the comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA). Derived models showed good statistical reliability that is evident from their r², q²(loo) and r²(pred) values. To obtain a consensus for predictive ability from all the models, average regression coefficient r²(avg) was calculated. CoMFA and CoMSIA models showed a value of 0.930 and 0.936, respectively. Information obtained from the best 3D-QSAR model was applied for optimization of lead molecule and design of novel potential inhibitors.
Li, Shun-Lai; He, Mao-Yu; Du, Hong-Guang
2011-01-01
The active metabolite of the novel immunosuppressive agent leflunomide has been shown to inhibit the enzyme dihydroorotate dehydrogenase (DHODH). This enzyme catalyzes the fourth step in de novo pyrimidine biosynthesis. Self-organizing molecular field analysis (SOMFA), a simple three-dimensional quantitative structure-activity relationship (3D-QSAR) method is used to study the correlation between the molecular properties and the biological activities of a series of analogues of the active metabolite. The statistical results, cross-validated rCV2 (0.664) and non cross-validated r2 (0.687), show a good predictive ability. The final SOMFA model provides a better understanding of DHODH inhibitor-enzyme interactions, and may be useful for further modification and improvement of inhibitors of this important enzyme. PMID:21686163
Shahlaei, M.; Saghaie, L.
2014-01-01
A quantitative structure–activity relationship (QSAR) study is suggested for the prediction of biological activity (pIC50) of 3, 4-dihydropyrido [3,2-d] pyrimidone derivatives as p38 inhibitors. Modeling of the biological activities of compounds of interest as a function of molecular structures was established by means of principal component analysis (PCA) and least square support vector machine (LS-SVM) methods. The results showed that the pIC50 values calculated by LS-SVM are in good agreement with the experimental data, and the performance of the LS-SVM regression model is superior to the PCA-based model. The developed LS-SVM model was applied for the prediction of the biological activities of pyrimidone derivatives, which were not in the modeling procedure. The resulted model showed high prediction ability with root mean square error of prediction of 0.460 for LS-SVM. The study provided a novel and effective approach for predicting biological activities of 3, 4-dihydropyrido [3,2-d] pyrimidone derivatives as p38 inhibitors and disclosed that LS-SVM can be used as a powerful chemometrics tool for QSAR studies. PMID:26339262
Sushko, Iurii; Salmina, Elena; Potemkin, Vladimir A; Poda, Gennadiy; Tetko, Igor V
2012-08-27
The article presents a Web-based platform for collecting and storing toxicological structural alerts from literature and for virtual screening of chemical libraries to flag potentially toxic chemicals and compounds that can cause adverse side effects. An alert is uniquely identified by a SMARTS template, a toxicological endpoint, and a publication where the alert was described. Additionally, the system allows storing complementary information such as name, comments, and mechanism of action, as well as other data. Most importantly, the platform can be easily used for fast virtual screening of large chemical datasets, focused libraries, or newly designed compounds against the toxicological alerts, providing a detailed profile of the chemicals grouped by structural alerts and endpoints. Such a facility can be used for decision making regarding whether a compound should be tested experimentally, validated with available QSAR models, or eliminated from consideration altogether. The alert-based screening can also be helpful for an easier interpretation of more complex QSAR models. The system is publicly accessible and tightly integrated with the Online Chemical Modeling Environment (OCHEM, http://ochem.eu). The system is open and expandable: any registered OCHEM user can introduce new alerts, browse, edit alerts introduced by other users, and virtually screen his/her data sets against all or selected alerts. The user sets being passed through the structural alerts can be used at OCHEM for other typical tasks: exporting in a wide variety of formats, development of QSAR models, additional filtering by other criteria, etc. The database already contains almost 600 structural alerts for such endpoints as mutagenicity, carcinogenicity, skin sensitization, compounds that undergo metabolic activation, and compounds that form reactive metabolites and, thus, can cause adverse reactions. The ToxAlerts platform is accessible on the Web at http://ochem.eu/alerts, and it is constantly growing.
2012-01-01
The article presents a Web-based platform for collecting and storing toxicological structural alerts from literature and for virtual screening of chemical libraries to flag potentially toxic chemicals and compounds that can cause adverse side effects. An alert is uniquely identified by a SMARTS template, a toxicological endpoint, and a publication where the alert was described. Additionally, the system allows storing complementary information such as name, comments, and mechanism of action, as well as other data. Most importantly, the platform can be easily used for fast virtual screening of large chemical datasets, focused libraries, or newly designed compounds against the toxicological alerts, providing a detailed profile of the chemicals grouped by structural alerts and endpoints. Such a facility can be used for decision making regarding whether a compound should be tested experimentally, validated with available QSAR models, or eliminated from consideration altogether. The alert-based screening can also be helpful for an easier interpretation of more complex QSAR models. The system is publicly accessible and tightly integrated with the Online Chemical Modeling Environment (OCHEM, http://ochem.eu). The system is open and expandable: any registered OCHEM user can introduce new alerts, browse, edit alerts introduced by other users, and virtually screen his/her data sets against all or selected alerts. The user sets being passed through the structural alerts can be used at OCHEM for other typical tasks: exporting in a wide variety of formats, development of QSAR models, additional filtering by other criteria, etc. The database already contains almost 600 structural alerts for such endpoints as mutagenicity, carcinogenicity, skin sensitization, compounds that undergo metabolic activation, and compounds that form reactive metabolites and, thus, can cause adverse reactions. The ToxAlerts platform is accessible on the Web at http://ochem.eu/alerts, and it is constantly growing. PMID:22876798
The goal of chemical toxicology research is utilizing short term bioassays and/or robust computational methods to predict in vivo toxicity endpoints for chemicals. The ToxCast program established at the US Environmental Protection Agency (EPA) is addressing this goal by using ca....
Gonzalez, J; Marchand-Geneste, N; Giraudel, J L; Shimada, T
2012-01-01
To obtain chemical clues on the process of bioactivation by cytochromes P450 1A1 and 1B1, some QSAR studies were carried out based on cellular experiments of the metabolic activation of polycyclic aromatic hydrocarbons and heterocyclic aromatic compounds by those enzymes. Firstly, the 3D structures of cytochromes 1A1 and 1B1 were built using homology modelling with a cytochrome 1A2 template. Using these structures, 32 ligands including heterocyclic aromatic compounds, polycyclic aromatic hydrocarbons and corresponding diols, were docked with LigandFit and CDOCKER algorithms. Binding mode analysis highlighted the importance of hydrophobic interactions and the hydrogen bonding network between cytochrome amino acids and docked molecules. Finally, for each enzyme, multilinear regression and artificial neural network QSAR models were developed and compared. These statistical models highlighted the importance of electronic, structural and energetic descriptors in metabolic activation process, and could be used for virtual screening of ligand databases. In the case of P450 1A1, the best model was obtained with artificial neural network analysis and gave an r (2) of 0.66 and an external prediction [Formula: see text] of 0.73. Concerning P450 1B1, artificial neural network analysis gave a much more robust model, associated with an r (2) value of 0.73 and an external prediction [Formula: see text] of 0.59.
Saavedra, Laura M; Romanelli, Gustavo P; Rozo, Ciro E; Duchowicz, Pablo R
2018-01-01
The insecticidal activity of a series of 62 plant derived molecules against the chikungunya, dengue and zika vector, the Aedes aegypti (Diptera:Culicidae) mosquito, is subjected to a Quantitative Structure-Activity Relationships (QSAR) analysis. The Replacement Method (RM) variable subset selection technique based on Multivariable Linear Regression (MLR) proves to be successful for exploring 4885 molecular descriptors calculated with Dragon 6. The predictive capability of the obtained models is confirmed through an external test set of compounds, Leave-One-Out (LOO) cross-validation and Y-Randomization. The present study constitutes a first necessary computational step for designing less toxic insecticides. Copyright © 2017 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gombar, V.K.; Enslein, K.; Hart, J.B.
1991-09-01
A quantitative structure-activity relationship (QSAR) model has been developed to estimate maximum tolerated doses (MTD) from structural features of chemicals and the corresponding oral acute lethal doses (LD50) as determined in male rats. The model is based on a set of 269 diverse chemicals which have been tested under the National Cancer Institute/National Toxicology Program (NCI/NTP) protocols. The rat oral LD50 value was the strongest predictor. Additionally, 22 structural descriptors comprising nine substructural MOLSTAC(c) keys, three molecular connectivity indices, and sigma charges on 10 molecular fragments were identified as endpoint predictors. The model explains 76% of the variance and ismore » significant (F = 35.7) at p less than 0.0001 with a standard error of the estimate of 0.40 in the log (1/mol) units used in Hansch-type equations. Cross-validation showed that the difference between the average deleted residual square (0.179) and the model residual square (0.160) was not significant (t = 0.98).« less
Mansouri, K; Grulke, C M; Richard, A M; Judson, R S; Williams, A J
2016-11-01
The increasing availability of large collections of chemical structures and associated experimental data provides an opportunity to build robust QSAR models for applications in different fields. One common concern is the quality of both the chemical structure information and associated experimental data. Here we describe the development of an automated KNIME workflow to curate and correct errors in the structure and identity of chemicals using the publicly available PHYSPROP physicochemical properties and environmental fate datasets. The workflow first assembles structure-identity pairs using up to four provided chemical identifiers, including chemical name, CASRNs, SMILES, and MolBlock. Problems detected included errors and mismatches in chemical structure formats, identifiers and various structure validation issues, including hypervalency and stereochemistry descriptions. Subsequently, a machine learning procedure was applied to evaluate the impact of this curation process. The performance of QSAR models built on only the highest-quality subset of the original dataset was compared with the larger curated and corrected dataset. The latter showed statistically improved predictive performance. The final workflow was used to curate the full list of PHYSPROP datasets, and is being made publicly available for further usage and integration by the scientific community.
Alam, Sarfaraz; Khan, Feroz
2014-01-01
Due to the high mortality rate in India, the identification of novel molecules is important in the development of novel and potent anticancer drugs. Xanthones are natural constituents of plants in the families Bonnetiaceae and Clusiaceae, and comprise oxygenated heterocycles with a variety of biological activities along with an anticancer effect. To explore the anticancer compounds from xanthone derivatives, a quantitative structure activity relationship (QSAR) model was developed by the multiple linear regression method. The structure–activity relationship represented by the QSAR model yielded a high activity–descriptors relationship accuracy (84%) referred by regression coefficient (r2=0.84) and a high activity prediction accuracy (82%). Five molecular descriptors – dielectric energy, group count (hydroxyl), LogP (the logarithm of the partition coefficient between n-octanol and water), shape index basic (order 3), and the solvent-accessible surface area – were significantly correlated with anticancer activity. Using this QSAR model, a set of virtually designed xanthone derivatives was screened out. A molecular docking study was also carried out to predict the molecular interaction between proposed compounds and deoxyribonucleic acid (DNA) topoisomerase IIα. The pharmacokinetics parameters, such as absorption, distribution, metabolism, excretion, and toxicity, were also calculated, and later an appraisal of synthetic accessibility of organic compounds was carried out. The strategy used in this study may provide understanding in designing novel DNA topoisomerase IIα inhibitors, as well as for other cancer targets. PMID:24516330
Verma, Rajeshwar P; Matthews, Edwin J
2015-03-01
This is part II of an in silico investigation of chemical-induced eye injury that was conducted at FDA's CFSAN. Serious eye damage caused by chemical (eye corrosion) is assessed using the rabbit Draize test, and this endpoint is an essential part of hazard identification and labeling of industrial and consumer products to ensure occupational and consumer safety. There is an urgent need to develop an alternative to the Draize test because EU's 7th amendment to the Cosmetic Directive (EC, 2003; 76/768/EEC) and recast Regulation now bans animal testing on all cosmetic product ingredients and EU's REACH Program limits animal testing for chemicals in commerce. Although in silico methods have been reported for eye irritation (reversible damage), QSARs specific for eye corrosion (irreversible damage) have not been published. This report describes the development of 21 ANN c-QSAR models (QSAR-21) for assessing eye corrosion potential of chemicals using a large and diverse CFSAN data set of 504 chemicals, ADMET Predictor's three sensitivity analyses and ANNE classification functionalities with 20% test set selection from seven different methods. QSAR-21 models were internally and externally validated and exhibited high predictive performance: average statistics for the training, verification, and external test sets of these models were 96/96/94% sensitivity and 91/91/90% specificity. Copyright © 2014 Elsevier Inc. All rights reserved.
Barber, Chris; Cayley, Alex; Hanser, Thierry; Harding, Alex; Heghes, Crina; Vessey, Jonathan D; Werner, Stephane; Weiner, Sandy K; Wichard, Joerg; Giddings, Amanda; Glowienke, Susanne; Parenty, Alexis; Brigo, Alessandro; Spirkl, Hans-Peter; Amberg, Alexander; Kemper, Ray; Greene, Nigel
2016-04-01
The relative wealth of bacterial mutagenicity data available in the public literature means that in silico quantitative/qualitative structure activity relationship (QSAR) systems can readily be built for this endpoint. A good means of evaluating the performance of such systems is to use private unpublished data sets, which generally represent a more distinct chemical space than publicly available test sets and, as a result, provide a greater challenge to the model. However, raw performance metrics should not be the only factor considered when judging this type of software since expert interpretation of the results obtained may allow for further improvements in predictivity. Enough information should be provided by a QSAR to allow the user to make general, scientifically-based arguments in order to assess and overrule predictions when necessary. With all this in mind, we sought to validate the performance of the statistics-based in vitro bacterial mutagenicity prediction system Sarah Nexus (version 1.1) against private test data sets supplied by nine different pharmaceutical companies. The results of these evaluations were then analysed in order to identify findings presented by the model which would be useful for the user to take into consideration when interpreting the results and making their final decision about the mutagenic potential of a given compound. Copyright © 2015 Elsevier Inc. All rights reserved.
Mamy, Laure; Patureau, Dominique; Barriuso, Enrique; Bedos, Carole; Bessac, Fabienne; Louchart, Xavier; Martin-laurent, Fabrice; Miege, Cecile; Benoit, Pierre
2015-01-01
A comprehensive review of quantitative structure-activity relationships (QSAR) allowing the prediction of the fate of organic compounds in the environment from their molecular properties was done. The considered processes were water dissolution, dissociation, volatilization, retention on soils and sediments (mainly adsorption and desorption), degradation (biotic and abiotic), and absorption by plants. A total of 790 equations involving 686 structural molecular descriptors are reported to estimate 90 environmental parameters related to these processes. A significant number of equations was found for dissociation process (pKa), water dissolution or hydrophobic behavior (especially through the KOW parameter), adsorption to soils and biodegradation. A lack of QSAR was observed to estimate desorption or potential of transfer to water. Among the 686 molecular descriptors, five were found to be dominant in the 790 collected equations and the most generic ones: four quantum-chemical descriptors, the energy of the highest occupied molecular orbital (EHOMO) and the energy of the lowest unoccupied molecular orbital (ELUMO), polarizability (α) and dipole moment (μ), and one constitutional descriptor, the molecular weight. Keeping in mind that the combination of descriptors belonging to different categories (constitutional, topological, quantum-chemical) led to improve QSAR performances, these descriptors should be considered for the development of new QSAR, for further predictions of environmental parameters. This review also allows finding of the relevant QSAR equations to predict the fate of a wide diversity of compounds in the environment. PMID:25866458
Mamy, Laure; Patureau, Dominique; Barriuso, Enrique; Bedos, Carole; Bessac, Fabienne; Louchart, Xavier; Martin-Laurent, Fabrice; Miege, Cecile; Benoit, Pierre
2015-06-18
A comprehensive review of quantitative structure-activity relationships (QSAR) allowing the prediction of the fate of organic compounds in the environment from their molecular properties was done. The considered processes were water dissolution, dissociation, volatilization, retention on soils and sediments (mainly adsorption and desorption), degradation (biotic and abiotic), and absorption by plants. A total of 790 equations involving 686 structural molecular descriptors are reported to estimate 90 environmental parameters related to these processes. A significant number of equations was found for dissociation process (pK a ), water dissolution or hydrophobic behavior (especially through the K OW parameter), adsorption to soils and biodegradation. A lack of QSAR was observed to estimate desorption or potential of transfer to water. Among the 686 molecular descriptors, five were found to be dominant in the 790 collected equations and the most generic ones: four quantum-chemical descriptors, the energy of the highest occupied molecular orbital (E HOMO ) and the energy of the lowest unoccupied molecular orbital (E LUMO ), polarizability (α) and dipole moment (μ), and one constitutional descriptor, the molecular weight. Keeping in mind that the combination of descriptors belonging to different categories (constitutional, topological, quantum-chemical) led to improve QSAR performances, these descriptors should be considered for the development of new QSAR, for further predictions of environmental parameters. This review also allows finding of the relevant QSAR equations to predict the fate of a wide diversity of compounds in the environment.
Use of in Vitro HTS-Derived Concentration-Response Data as ...
Background: Quantitative high-throughput screening (qHTS) assays are increasingly being employed to inform chemical hazard identification. Hundreds of chemicals have been tested in dozens of cell lines across extensive concentration ranges by the National Toxicology Program in collaboration with the NIH Chemical Genomics Center. Objectives: To test a hypothesis that dose-response data points of the qHTS assays can serve as biological descriptors of assayed chemicals and, when combined with conventional chemical descriptors, may improve the accuracy of Quantitative Structure-Activity Relationship (QSAR) models applied to prediction of in vivo toxicity endpoints. Methods and Results: The cell viability qHTS concentration-response data for 1,408 substances assayed in 13 cell lines were obtained from PubChem; for a subset of these compounds rodent acute toxicity LD50 data were also available. The classification k Nearest Neighbor and Random Forest QSAR methods were employed for modeling LD50 data using either chemical descriptors alone (conventional models) or in combination with biological descriptors derived from the concentration-response qHTS data (hybrid models). Critical to our approach was the use of a novel noise-filtering algorithm to treat qHTS data. We show that both the external classification accuracy and coverage (i.e., fraction of compounds in the external set that fall within the applicability domain) of the hybrid QSAR models was superior to convent
Ballu, Srilata; Itteboina, Ramesh; Sivan, Sree Kanth; Manga, Vijjulatha
2018-04-01
Staphylococcus aureus is a gram positive bacterium. It is the leading cause of skin and respiratory infections, osteomyelitis, Ritter's disease, endocarditis, and bacteraemia in the developed world. We employed combined studies of 3D QSAR, molecular docking which are validated by molecular dynamics simulations and in silico ADME prediction have been performed on Isothiazoloquinolones inhibitors against methicillin resistance Staphylococcus aureus. Three-dimensional quantitative structure-activity relationship (3D-QSAR) study was applied using comparative molecular field analysis (CoMFA) with Q 2 of 0.578, R 2 of 0.988, and comparative molecular similarity indices analysis (CoMSIA) with Q 2 of 0.554, R 2 of 0.975. The predictive ability of these model was determined using a test set of molecules that gave acceptable predictive correlation (r 2 Pred) values 0.55 and 0.57 of CoMFA and CoMSIA respectively. Docking, simulations were employed to position the inhibitors into protein active site to find out the most probable binding mode and most reliable conformations. Developed models and Docking methods provide guidance to design molecules with enhanced activity. Copyright © 2017 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Dong, Huanhuan; Liu, Jing; Liu, Xiaoru; Yu, Yanying; Cao, Shuwen
2018-01-01
A collection of thirty-six aromatic heterocycle thiosemicarbazone analogues presented a broad span of anti-tyrosinase activities were designed and obtained. A robust and reliable two-dimensional quantitative structure-activity relationship model, as evidenced by the high q2 and r2 values (0.848 and 0.893, respectively), was gained based on the analogues to predict the quantitative chemical-biological relationship and the new modifier direction. Inhibitory activities of the compounds were found to greatly depend on molecular shape and orbital energy. Substituents brought out large ovality and high highest-occupied molecular orbital energy values helped to improve the activity of these analogues. The molecular docking results provided visual evidence for QSAR analysis and inhibition mechanism. Based on these, two novel tyrosinase inhibitors O04 and O05 with predicted IC50 of 0.5384 and 0.8752 nM were designed and suggested for further research.
3D-QSAR analysis of MCD inhibitors by CoMFA and CoMSIA.
Pourbasheer, Eslam; Aalizadeh, Reza; Ebadi, Amin; Ganjali, Mohammad Reza
2015-01-01
Three-dimensional quantitative structure-activity relationship was developed for the series of compounds as malonyl-CoA decarboxylase antagonists (MCD) using the CoMFA and CoMSIA methods. The statistical parameters for CoMFA (q(2)=0.558, r(2)=0.841) and CoMSIA (q(2)= 0.615, r(2) = 0.870) models were derived based on 38 compounds as training set in the basis of the selected alignment. The external predictive abilities of the built models were evaluated by using the test set of nine compounds. From obtained results, the CoMSIA method was found to have highly predictive capability in comparison with CoMFA method. Based on the given results by CoMSIA and CoMFA contour maps, some features that can enhance the activity of compounds as MCD antagonists were introduced and used to design new compounds with better inhibition activity.
Physiologically based pharmacokinetic (PBPK) models bridge the gap between in vitro assays and in vivo effects by accounting for the adsorption, distribution, metabolism, and excretion of xenobiotics, which is especially useful in the assessment of human toxicity. Quantitative st...
Sethi, Kalyan K; Verma, Saurabh M
2014-08-01
Drug design involves the design of small molecules that are complementary in shape and charge to the biomolecular target with which they interact and therefore will bind to it. Three-dimensional quantitative structure-activity relationship (3D-QSAR) studies were performed for a series of carbonic anhydrase IX inhibitors using comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) techniques with the help of SYBYL 7.1 software. The large set of 36 different aromatic/heterocyclic sulfamates carbonic anhydrase (CA, EC 4.2.1.1) inhibitors, such as hCA IX, was chosen for this study. The conventional ligand-based 3D-QSAR studies were performed based on the low energy conformations employing database alignment rule. The ligand-based model gave q(2) values 0.802 and 0.829 and r(2) values 1.000 and 0.994 for CoMFA and CoMSIA, respectively, and the predictive ability of the model was validated. The predicted r(2) values are 0.999 and 0.502 for CoMFA and CoMSIA, respectively. SEA (steric, electrostatic, hydrogen bond acceptor) of CoMSIA has the significant contribution for the model development. The docking of inhibitors into hCA IX active site using Glide XP (Schrödinger) software revealed the vital interactions and binding conformation of the inhibitors. The CoMFA and CoMSIA field contour maps are well in agreement with the structural characteristics of the binding pocket of hCA IX active site, which suggests that the information rendered by 3D-QSAR models and the docking interactions can provide guidelines for the development of improved hCA IX inhibitors as leads for various types of metastatic cancers including those of cervical, renal, breast and head and neck origin.
Molecular docking and QSAR study on steroidal compounds as aromatase inhibitors.
Dai, Yujie; Wang, Qiang; Zhang, Xiuli; Jia, Shiru; Zheng, Heng; Feng, Dacheng; Yu, Peng
2010-12-01
In order to develop more potent, selective and less toxic steroidal aromatase (AR) inhibitors, molecular docking, 2D and 3D hybrid quantitative structure-activity relationship (QSAR) study have been conducted using topological, molecular shape, spatial, structural and thermodynamic descriptors on 32 steroidal compounds. The molecular docking study shows that one or more hydrogen bonds with MET374 are one of the essential requirements for the optimum binding of ligands. The QSAR model obtained indicates that the aromatase inhibitory activity can be enhanced by increasing SIC, SC_3_C, Jurs_WNSA_1, Jurs_WPSA_1 and decreasing CDOCKER interaction energy (ECD), IAC_Total and Shadow_XZfrac. The predicted results shows that this model has a comparatively good predictive power which can be used in prediction of activity of new steroidal aromatase inhibitors. Copyright © 2010 Elsevier Masson SAS. All rights reserved.
Tansel, Berrin; Lee, Mengshan; Tansel, Derya Z
2013-08-15
First order removal rates for 15 polyaromatic hydrocarbons (PAHs) in soil, sediments and mangrove leaves were compared in relation to the parameters used in fate transport analyses (i.e., octanol-water partition coefficient, organic carbon-water partition coefficient, solubility, diffusivity in water, HOMO-LUMO gap, molecular size, molecular aspect ratio). The quantitative structure activity relationships (QSAR) and quantitative structure property relationships (QSPR) showed that the rate of disappearance of PAHs is correlated with their diffusivities in water as well as molecular volumes in different media. Strong correlations for the rate of disappearance of PAHs in sediments could not be obtained in relation to most of the parameters evaluated. The analyses showed that the QSAR and QSPR correlations developed for removal rates of PAHs in soils would not be adequate for sediments and plant tissues. Copyright © 2013 Elsevier Ltd. All rights reserved.
The chemistry side of AOP: implications for toxicity ...
An adverse outcome pathway (AOP) is a structured representation of the biological events that lead to adverse impacts following a molecular initiating event caused by chemical interaction with a macromolecule. AOPs have been proposed to facilitate toxicity extrapolation across species through understanding of species similarity in the sequence of molecular, cellular, organ and organismal level responses. However, AOPs are non-specific regarding the identity of the chemical initiators, and the range of structures for which an AOP is considered applicable has generally been poorly defined. Applicability domain has been widely understood in the field of QSAR as the response and chemical structure space in which the model makes predictions with a given reliability, and has been traditionally applied to define the similarity of query molecules within the training set. Three dimensional (3D) receptor modeling offers an approach to better define the applicability domain for selected AOPs through determination of the chemical space of the molecular initiating event. Universal 3D-QSAR models were developed for acetylcholinesterase inhibitors and estrogen receptor agonists and antagonists using a combination of fingerprint, molecular docking and structure-based pharmacophore approaches. The models were based on the critical molecular interactions within each receptor ligand binding domain, and included the key amino acid residues responsible for high binding affinity. T
USE OF INTERSPECIES CORRELATION ESTIMATIONS TO PREDICT HC5'S BASED ON QSAR
Dyer, S.D., S. Belanger, J. Chaney, D. Versteeg and F. Mayer. In press. Use of Interspecies Correlation Estimations to predict HC5's Based on QSARs (Abstract). To be presented at the SETAC Europe 14th Annual Meeting: Environmental Science Solution: A Pan-European Perspective, 18-...
NASA Astrophysics Data System (ADS)
Jójárt, Balázs; Martinek, Tamás A.; Márki, Árpád
2005-05-01
Molecular docking and 3D-QSAR studies were performed to determine the binding mode for a series of benzoxazine oxytocin antagonists taken from the literature. Structural hypotheses were generated by docking the most active molecule to the rigid receptor by means of AutoDock 3.05. The cluster analysis yielded seven possible binding conformations. These structures were refined by using constrained simulated annealing, and the further ligands were aligned in the refined receptor by molecular docking. A good correlation was found between the estimated Δ G bind and the p K i values for complex F. The Connolly-surface analysis, CoMFA and CoMSIA models q CoMFA 2 = 0.653, q CoMSA 2 = 0.630 and r pred,CoMFA 2 = 0.852 , r pred,CoMSIA 2 = 0.815) confirmed the scoring function results. The structural features of the receptor-ligand complex and the CoMFA and CoMSIA fields are in closely connected. These results suggest that receptor-ligand complex F is the most likely binding hypothesis for the studied benzoxazine analogs.
Hisaki, Tomoka; Aiba Née Kaneko, Maki; Yamaguchi, Masahiko; Sasa, Hitoshi; Kouzuki, Hirokazu
2015-04-01
Use of laboratory animals for systemic toxicity testing is subject to strong ethical and regulatory constraints, but few alternatives are yet available. One possible approach to predict systemic toxicity of chemicals in the absence of experimental data is quantitative structure-activity relationship (QSAR) analysis. Here, we present QSAR models for prediction of maximum "no observed effect level" (NOEL) for repeated-dose, developmental and reproductive toxicities. NOEL values of 421 chemicals for repeated-dose toxicity, 315 for reproductive toxicity, and 156 for developmental toxicity were collected from Japan Existing Chemical Data Base (JECDB). Descriptors to predict toxicity were selected based on molecular orbital (MO) calculations, and QSAR models employing multiple independent descriptors as the input layer of an artificial neural network (ANN) were constructed to predict NOEL values. Robustness of the models was indicated by the root-mean-square (RMS) errors after 10-fold cross-validation (0.529 for repeated-dose, 0.508 for reproductive, and 0.558 for developmental toxicity). Evaluation of the models in terms of the percentages of predicted NOELs falling within factors of 2, 5 and 10 of the in-vivo-determined NOELs suggested that the model is applicable to both general chemicals and the subset of chemicals listed in International Nomenclature of Cosmetic Ingredients (INCI). Our results indicate that ANN models using in silico parameters have useful predictive performance, and should contribute to integrated risk assessment of systemic toxicity using a weight-of-evidence approach. Availability of predicted NOELs will allow calculation of the margin of safety, as recommended by the Scientific Committee on Consumer Safety (SCCS).
Yu, S; Gao, S; Gan, Y; Zhang, Y; Ruan, X; Wang, Y; Yang, L; Shi, J
2016-04-01
Quantitative structure-property relationship modelling can be a valuable alternative method to replace or reduce experimental testing. In particular, some endpoints such as octanol-water (KOW) and organic carbon-water (KOC) partition coefficients of polychlorinated biphenyls (PCBs) are easier to predict and various models have been already developed. In this paper, two different methods, which are multiple linear regression based on the descriptors generated using Dragon software and hologram quantitative structure-activity relationships, were employed to predict suspended particulate matter (SPM) derived log KOC and generator column, shake flask and slow stirring method derived log KOW values of 209 PCBs. The predictive ability of the derived models was validated using a test set. The performances of all these models were compared with EPI Suite™ software. The results indicated that the proposed models were robust and satisfactory, and could provide feasible and promising tools for the rapid assessment of the SPM derived log KOC and generator column, shake flask and slow stirring method derived log KOW values of PCBs.
Teixeira, Ana L; Falcao, Andre O
2014-07-28
Structurally similar molecules tend to have similar properties, i.e. closer molecules in the molecular space are more likely to yield similar property values while distant molecules are more likely to yield different values. Based on this principle, we propose the use of a new method that takes into account the high dimensionality of the molecular space, predicting chemical, physical, or biological properties based on the most similar compounds with measured properties. This methodology uses ordinary kriging coupled with three different molecular similarity approaches (based on molecular descriptors, fingerprints, and atom matching) which creates an interpolation map over the molecular space that is capable of predicting properties/activities for diverse chemical data sets. The proposed method was tested in two data sets of diverse chemical compounds collected from the literature and preprocessed. One of the data sets contained dihydrofolate reductase inhibition activity data, and the second molecules for which aqueous solubility was known. The overall predictive results using kriging for both data sets comply with the results obtained in the literature using typical QSPR/QSAR approaches. However, the procedure did not involve any type of descriptor selection or even minimal information about each problem, suggesting that this approach is directly applicable to a large spectrum of problems in QSAR/QSPR. Furthermore, the predictive results improve significantly with the similarity threshold between the training and testing compounds, allowing the definition of a confidence threshold of similarity and error estimation for each case inferred. The use of kriging for interpolation over the molecular metric space is independent of the training data set size, and no reparametrizations are necessary when more compounds are added or removed from the set, and increasing the size of the database will consequentially improve the quality of the estimations. Finally it is shown that this model can be used for checking the consistency of measured data and for guiding an extension of the training set by determining the regions of the molecular space for which new experimental measurements could be used to maximize the model's predictive performance.
Vision & Strategy: Predictive Ecotoxicology in the 21st Century
2011-01-01
their relative abundance or modifications QSARs —Correlation of ecological or toxicological activity with chemical structure to understand or predict...data and collection methods. The dramatic increase in the amount of toxicological data we can collect and analyze is complemented by our improved...diverse disciplines such as biochemistry, ecology, molecular biology, toxicology , bioinformatics, and health and environmental risk assess- ment
Asymmetric bagging and feature selection for activities prediction of drug molecules.
Li, Guo-Zheng; Meng, Hao-Hua; Lu, Wen-Cong; Yang, Jack Y; Yang, Mary Qu
2008-05-28
Activities of drug molecules can be predicted by QSAR (quantitative structure activity relationship) models, which overcomes the disadvantages of high cost and long cycle by employing the traditional experimental method. With the fact that the number of drug molecules with positive activity is rather fewer than that of negatives, it is important to predict molecular activities considering such an unbalanced situation. Here, asymmetric bagging and feature selection are introduced into the problem and asymmetric bagging of support vector machines (asBagging) is proposed on predicting drug activities to treat the unbalanced problem. At the same time, the features extracted from the structures of drug molecules affect prediction accuracy of QSAR models. Therefore, a novel algorithm named PRIFEAB is proposed, which applies an embedded feature selection method to remove redundant and irrelevant features for asBagging. Numerical experimental results on a data set of molecular activities show that asBagging improve the AUC and sensitivity values of molecular activities and PRIFEAB with feature selection further helps to improve the prediction ability. Asymmetric bagging can help to improve prediction accuracy of activities of drug molecules, which can be furthermore improved by performing feature selection to select relevant features from the drug molecules data sets.
Grid-based Continual Analysis of Molecular Interior for Drug Discovery, QSAR and QSPR.
Potemkin, Andrey V; Grishina, Maria A; Potemkin, Vladimir A
2017-01-01
In 1979, R.D.Cramer and M.Milne made a first realization of 3D comparison of molecules by aligning them in space and by mapping their molecular fields to a 3D grid. Further, this approach was developed as the DYLOMMS (Dynamic Lattice- Oriented Molecular Modelling System) approach. In 1984, H.Wold and S.Wold proposed the use of partial least squares (PLS) analysis, instead of principal component analysis, to correlate the field values with biological activities. Then, in 1988, the method which was called CoMFA (Comparative Molecular Field Analysis) was introduced and the appropriate software became commercially available. Since 1988, a lot of 3D QSAR methods, algorithms and their modifications are introduced for solving of virtual drug discovery problems (e.g., CoMSIA, CoMMA, HINT, HASL, GOLPE, GRID, PARM, Raptor, BiS, CiS, ConGO,). All the methods can be divided into two groups (classes):1. Methods studying the exterior of molecules; 2) Methods studying the interior of molecules. A series of grid-based computational technologies for Continual Molecular Interior analysis (CoMIn) are invented in the current paper. The grid-based analysis is fulfilled by means of a lattice construction analogously to many other grid-based methods. The further continual elucidation of molecular structure is performed in various ways. (i) In terms of intermolecular interactions potentials. This can be represented as a superposition of Coulomb, Van der Waals interactions and hydrogen bonds. All the potentials are well known continual functions and their values can be determined in all lattice points for a molecule. (ii) In the terms of quantum functions such as electron density distribution, Laplacian and Hamiltonian of electron density distribution, potential energy distribution, the highest occupied and the lowest unoccupied molecular orbitals distribution and their superposition. To reduce time of calculations using quantum methods based on the first principles, an original quantum free-orbital approach AlteQ is proposed. All the functions can be calculated using a quantum approach at a sufficient level of theory and their values can be determined in all lattice points for a molecule. Then, the molecules of a dataset can be superimposed in the lattice for the maximal coincidence (or minimal deviations) of the potentials (i) or the quantum functions (ii). The methods and criteria of the superimposition are discussed. After that a functional relationship between biological activity or property and characteristics of potentials (i) or functions (ii) is created. The methods of the quantitative relationship construction are discussed. New approaches for rational virtual drug design based on the intermolecular potentials and quantum functions are invented. All the invented methods are realized at www.chemosophia.com web page. Therefore, a set of 3D QSAR approaches for continual molecular interior study giving a lot of opportunities for virtual drug discovery, virtual screening and ligand-based drug design are invented. The continual elucidation of molecular structure is performed in the terms of intermolecular interactions potentials and in the terms of quantum functions such as electron density distribution, Laplacian and Hamiltonian of electron density distribution, potential energy distribution, the highest occupied and the lowest unoccupied molecular orbitals distribution and their superposition. To reduce time of calculations using quantum methods based on the first principles, an original quantum free-orbital approach AlteQ is proposed. The methods of the quantitative relationship construction are discussed. New approaches for rational virtual drug design based on the intermolecular potentials and quantum functions are invented. All the invented methods are realized at www.chemosophia.com web page. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Quantitative structure-activity relationship studies of threo-methylphenidate analogs.
Misra, Milind; Shi, Qing; Ye, Xiaocong; Gruszecka-Kowalik, Ewa; Bu, Wei; Liu, Zhanzhu; Schweri, Margaret M; Deutsch, Howard M; Venanzi, Carol A
2010-10-15
Complementary two-dimensional (2D) and three-dimensional (3D) Quantitative Structure-Activity Relationship (QSAR) techniques were used to derive a preliminary model for the dopamine transporter (DAT) binding affinity of 80 racemic threo-methylphenidate (MP) analogs. A novel approach based on using the atom-level E-state indices of the 14 common scaffold atoms in a sphere exclusion protocol was used to identify a test set for 2D- and 3D-QSAR model validation. Comparative Molecular Field Analysis (CoMFA) contour maps based on the structure-activity data of the training set indicate that the 2' position of the phenyl ring cannot tolerate much steric bulk and that addition of electron-withdrawing groups to the 3' or 4' positions of the phenyl ring leads to improved DAT binding affinity. In particular, the optimal substituents were found to be those whose bulk is mainly in the plane of the phenyl ring. Substituents with significant bulk above or below the plane of the ring led to decreased binding affinity. Suggested alterations to be explored in the design of new compounds are the placement at the 3' and 4' position of the phenyl ring of electron-withdrawing groups that lie chiefly in the plane of the ring, for example, halogen substituents on the 3',4'-benzo analog, 79. A complementary 2D-QSAR approach-partial least squares analysis using a reduced set of Molconn-Z descriptors-supports the CoMFA structure-activity interpretation that phenyl ring substitution is a major determinant of DAT binding affinity. The potential usefulness of the CoMFA models was demonstrated by the prediction of the binding affinity of methyl 2-(naphthalen-1-yl)-2-(piperidin-2-yl)acetate, an analog not in the original data set, to be in good agreement with the experimental value. Copyright © 2010 Elsevier Ltd. All rights reserved.
Combined 3D-QSAR modeling and molecular docking study on azacycles CCR5 antagonists
NASA Astrophysics Data System (ADS)
Ji, Yongjun; Shu, Mao; Lin, Yong; Wang, Yuanqiang; Wang, Rui; Hu, Yong; Lin, Zhihua
2013-08-01
The beta chemokine receptor 5 (CCR5) is an attractive target for pharmaceutical industry in the HIV-1, inflammation and cancer therapeutic areas. In this study, we have developed quantitative structure activity relationship (QSAR) models for a series of 41 azacycles CCR5 antagonists using comparative molecular field analysis (CoMFA), comparative molecular similarity indices analysis (CoMSIA), and Topomer CoMFA methods. The cross-validated coefficient q2 values of 3D-QASR (CoMFA, CoMSIA, and Topomer CoMFA) methods were 0.630, 0.758, and 0.852, respectively, the non-cross-validated R2 values were 0.979, 0.978, and 0.990, respectively. Docking studies were also employed to determine the most probable binding mode. 3D contour maps and docking results suggested that bulky groups and electron-withdrawing groups on the core part would decrease antiviral activity. Furthermore, docking results indicated that H-bonds and π bonds were favorable for antiviral activities. Finally, a set of novel derivatives with predicted activities were designed.
Wu, Xiangxiang; Zeng, Huahui; Zhu, Xin; Ma, Qiujuan; Hou, Yimin; Wu, Xuefen
2013-11-20
A series of pyrrolopyridinone derivatives as specific inhibitors towards the cell division cycle 7 (Cdc7) was taken into account, and the efficacy of these compounds was analyzed by QSAR and docking approaches to gain deeper insights into the interaction mechanism and ligands selectivity for Cdc7. By regression analysis the prediction models based on Grid score and Zou-GB/SA score were found, respectively with good quality of fits (r(2)=0.748, 0.951; r(cv)(2)=0.712, 0.839). The accuracy of the models was validated by test set and the deviation of the predicted values in validation set using Zou-GB/SA score was smaller than that using Grid score, suggesting that the model based on Zou-GB/SA score provides a more effective method for predicting potencies of Cdc7 inhibitors. Copyright © 2013 Elsevier B.V. All rights reserved.
Parameters for Pyrethroid Insecticide QSAR and PBPK/PD Models for Human Risk Assessment
This pyrethroid insecticide parameter review is an extension of our interest in developing quantitative structure–activity relationship–physiologically based pharmacokinetic/pharmacodynamic (QSAR-PBPK/PD) models for assessing health risks, which interest started with the organoph...
NASA Astrophysics Data System (ADS)
Norinder, Ulf
1990-12-01
An experimental design based 3-D QSAR analysis using a combination of principal component and PLS analysis is presented and applied to human corticosteroid-binding globulin complexes. The predictive capability of the created model is good. The technique can also be used as guidance when selecting new compounds to be investigated.
QSAR analyses on avian influenza virus neuraminidase inhibitors using CoMFA, CoMSIA, and HQSAR
NASA Astrophysics Data System (ADS)
Zheng, Mingyue; Yu, Kunqian; Liu, Hong; Luo, Xiaomin; Chen, Kaixian; Zhu, Weiliang; Jiang, Hualiang
2006-09-01
The recent wide spreading of the H5N1 avian influenza virus (AIV) in Asia, Europe and Africa and its ability to cause fatal infections in human has raised serious concerns about a pending global flu pandemic. Neuraminidase (NA) inhibitors are currently the only option for treatment or prophylaxis in humans infected with this strain. However, drugs currently on the market often meet with rapidly emerging resistant mutants and only have limited application as inadequate supply of synthetic material. To dig out helpful information for designing potent inhibitors with novel structures against the NA, we used automated docking, CoMFA, CoMSIA, and HQSAR methods to investigate the quantitative structure-activity relationship for 126 NA inhibitors (NIs) with great structural diversities and wide range of bioactivities against influenza A virus. Based on the binding conformations discovered via molecular docking into the crystal structure of NA, CoMFA and CoMSIA models were successfully built with the cross-validated q 2 of 0.813 and 0.771, respectively. HQSAR was also carried out as a complementary study in that HQSAR technique does not require 3D information of these compounds and could provide a detailed molecular fragment contribution to the inhibitory activity. These models also show clearly how steric, electrostatic, hydrophobicity, and individual fragments affect the potency of NA inhibitors. In addition, CoMFA and CoMSIA field distributions are found to be in well agreement with the structural characteristics of the corresponding binding sites. Therefore, the final 3D-QSAR models and the information of the inhibitor-enzyme interaction should be useful in developing novel potent NA inhibitors.
Synthesis and biological activities of turkesterone 11α-acyl derivatives
Dinan, Laurence; Bourne, Pauline; Whiting, Pensri; Tsitsekli, Ada; Saatov, Ziyadilla; Dhadialla, Tarlochan S.; Hormann, Robert E.; Lafont, René; Coll, Josep
2003-01-01
Turkesterone is a phytoecdysteroid possessing an 11α-hydroxyl group. It is an analogue of the insect steroid hormone 20-hydroxyecdysone. Previous ecdysteroid QSAR and molecular modelling studies predicted that the cavity of the ligand binding domain of the ecdysteroid receptor would possess space in the vicinity of C-11/C-12 of the ecdysteroid. We report the regioselective synthesis of a series of turkesterone 11α-acyl derivatives in order to explore this possibility. The structures of the analogues have been unambiguously determined by spectroscopic means (NMR and low-resolution mass spectrometry). Purity was verified by HPLC. Biological activities have been determined in Drosophila melanogaster BII cell-based bioassay for ecdysteroid agonists and in an in vitro radioligand-displacement assay using bacterially-expressed D. melanogaster EcR/USP receptor proteins. The 11α-acyl derivatives do retain a significant amount of biological activity relative to the parent ecdysteroid. Further, although activity initially drops with the extension of the acyl chain length (C2 to C4), it then increases (C6 to C10), before decreasing again (C14 and C20). The implications of these findings for the interaction of ecdysteroids with the ecdysteroid receptor and potential applications in the generation of affinity-labelled and fluorescently-tagged ecdysteroids are discussed. Abbreviation: CoMFA comparative molecular field analysis DCM dichloromethane DMF dimethylformamide DMP 2,2-dimethoxypropane 4D-QSAR 4-dimensional quantitative structure-activity relationship EcR ecdysteroid receptor EcRE ecdysteroid response element HPLC high-performance liquid chromatography LBD ligand-binding domain NMR nuclear magnetic resonance ponA ponasterone A QSAR quantitative structure-activity relationship RXR retinoid X receptor SAR structure-activity relationship SPE solid-phase extraction THF tetrahydrofuran TLC thin-layer chromatography p-TsOH para-toluenesulphonic acid USP ultraspiracle UV-VIS ultraviolet-visible PMID:15841223
5D-QSAR for spirocyclic sigma1 receptor ligands by Quasar receptor surface modeling.
Oberdorf, Christoph; Schmidt, Thomas J; Wünsch, Bernhard
2010-07-01
Based on a contiguous and structurally as well as biologically diverse set of 87 sigma(1) ligands, a 5D-QSAR study was conducted in which a quasi-atomistic receptor surface modeling approach (program package Quasar) was applied. The superposition of the ligands was performed with the tool Pharmacophore Elucidation (MOE-package), which takes all conformations of the ligands into account. This procedure led to four pharmacophoric structural elements with aromatic, hydrophobic, cationic and H-bond acceptor properties. Using the aligned structures a 3D-model of the ligand binding site of the sigma(1) receptor was obtained, whose general features are in good agreement with previous assumptions on the receptor structure, but revealed some novel insights since it represents the receptor surface in more detail. Thus, e.g., our model indicates the presence of an H-bond acceptor moiety in the binding site as counterpart to the ligands' cationic ammonium center, rather than a negatively charged carboxylate group. The presented QSAR model is statistically valid and represents the biological data of all tested compounds, including a test set of 21 ligands not used in the modeling process, with very good to excellent accuracy [q(2) (training set, n=66; leave 1/3 out) = 0.84, p(2) (test set, n=21)=0.64]. Moreover, the binding affinities of 13 further spirocyclic sigma(1) ligands were predicted with reasonable accuracy (mean deviation in pK(i) approximately 0.8). Thus, in addition to novel insights into the requirements for binding of spirocyclic piperidines to the sigma(1) receptor, the presented model can be used successfully in the rational design of new sigma(1) ligands. Copyright (c) 2010 Elsevier Masson SAS. All rights reserved.
Wang, Hui; Jiang, Mingyue; Hse, Chung-Yun; Jin, Chunde; Sun, Fangli; Li, Zhuo
2017-01-01
Cinnamaldehyde amino acid Schiff base (CAAS) is a new class of safe, bioactive compounds which could be developed as potential antifungal agents for fungal infections. To design new cinnamaldehyde amino acid Schiff base compounds with high bioactivity, the quantitative structure–activity relationships (QSARs) for CAAS compounds against Aspergillus niger (A. niger) and Penicillium citrinum (P. citrinum) were analysed. The QSAR models (R2 = 0.9346 for A. niger, R2 = 0.9590 for P. citrinum,) were constructed and validated. The models indicated that the molecular polarity and the Max atomic orbital electronic population had a significant effect on antifungal activity. Based on the best QSAR models, two new compounds were designed and synthesized. Antifungal activity tests proved that both of them have great bioactivity against the selected fungi. PMID:28989758
Chaube, Udit; Chhatbar, Dhara; Bhatt, Hardik
2016-02-01
According to WHO statistics, lung cancer is one of the leading causes of death among all other types of cancer. Many genes get mutated in lung cancer but involvement of EGFR and KRAS are more common. Unavailability of drugs or resistance to the available drugs is the major problem in the treatment of lung cancer. In the present research, mTOR was selected as an alternative target for the treatment of lung cancer which involves PI3K/AKT/mTOR pathway. 28 synthetic mTOR inhibitors were selected from the literature. Ligand based approach (CoMFA and CoMSIA) and structure based approach (molecular dynamics simulations assisted molecular docking study) were applied for the identification of important features of benzoxazepine moiety, responsible for mTOR inhibition. Three different alignments were tried to obtain best QSAR model, of which, distil was found to be the best method, as it gave good statistical results. In CoMFA, Leave One Out (LOO) cross validated coefficients (q(2)), conventional coefficient (r(2)) and predicted correlation coefficient (r(2)pred) values were found to be 0.615, 0.990 and 0.930, respectively. Similarly in CoMSIA, q(2), r(2)ncv and r(2)pred values were found to be 0.748, 0.986 and 0.933, respectively. Molecular dynamics and simulations study revealed that B-chain of mTOR protein was stable at and above 500 FS with respect to temperature (at and above 298 K), Potential energy (at and above 7669.72 kJ/mol) and kinetic energy (at and above 4009.77 kJ/mol). Molecular docking study was performed on simulated protein of mTOR which helped to correlate interactions of amino acids surrounded to the ligand with contour maps generated by QSAR method. Important features of benzoxazepine were identified by contour maps and molecular docking study which would be useful to design novel molecules as mTOR inhibitors for the treatment of lung cancer. Copyright © 2015 Elsevier Ltd. All rights reserved.
Chi, Yulang; Zhang, Huanteng; Huang, Qiansheng; Lin, Yi; Ye, Guozhu; Zhu, Huimin; Dong, Sijun
2018-02-01
Environmental risks of organic chemicals have been greatly determined by their persistence, bioaccumulation, and toxicity (PBT) and physicochemical properties. Major regulations in different countries and regions identify chemicals according to their bioconcentration factor (BCF) and octanol-water partition coefficient (Kow), which frequently displays a substantial correlation with the sediment sorption coefficient (Koc). Half-life or degradability is crucial for the persistence evaluation of chemicals. Quantitative structure activity relationship (QSAR) estimation models are indispensable for predicting environmental fate and health effects in the absence of field- or laboratory-based data. In this study, 39 chemicals of high concern were chosen for half-life testing based on total organic carbon (TOC) degradation, and two widely accepted and highly used QSAR estimation models (i.e., EPI Suite and PBT Profiler) were adopted for environmental risk evaluation. The experimental results and estimated data, as well as the two model-based results were compared, based on the water solubility, Kow, Koc, BCF and half-life. Environmental risk assessment of the selected compounds was achieved by combining experimental data and estimation models. It was concluded that both EPI Suite and PBT Profiler were fairly accurate in measuring the physicochemical properties and degradation half-lives for water, soil, and sediment. However, the half-lives between the experimental and the estimated results were still not absolutely consistent. This suggests deficiencies of the prediction models in some ways, and the necessity to combine the experimental data and predicted results for the evaluation of environmental fate and risks of pollutants. Copyright © 2016. Published by Elsevier B.V.
NASA Astrophysics Data System (ADS)
Wang, Zhenya; Chang, Yiqun; Han, Yushui; Liu, Kangjia; Hou, Jinsong; Dai, Chengli; Zhai, Yuanhao; Guo, Jialiang; Sun, Pinghua; Lin, Jing; Chen, Weimin
2016-11-01
Mutation of isocitrate dehydrogenase 1 (IDH1) which is frequently found in certain cancers such as glioma, sarcoma and acute myeloid leukemia, has been proven to be a potent drug target for cancer therapy. In silico methodologies such as 3D-QSAR and molecular docking were performed to explore compounds with better mutant isocitrate dehydrogenase 1 (MIDH1) inhibitory activity using a series of 40 newly reported 1-hydroxypyridin-2-one compounds as MIDH1 inhibitors. The satisfactory CoMFA and CoMSIA models obtained after internal and external cross-validation gave q2 values of 0.691 and 0.535, r2 values of 0.984 and 0.936, respectively. 3D contour maps generated from CoMFA and CoMSIA along with the docking results provided information about the structural requirements for better MIDH1 inhibitory activity. Based on the structure-activity relationship, 17 new potent molecules with better predicted activity than the most active compound in the literature have been designed.
Predictive ecotoxicity of MoA 1 of organic chemicals using in silico approaches.
de Morais E Silva, Luana; Alves, Mateus Feitosa; Scotti, Luciana; Lopes, Wilton Silva; Scotti, Marcus Tullius
2018-05-30
Persistent organic products are compounds used for various purposes, such as personal care products, surfactants, colorants, industrial additives, food, pesticides and pharmaceuticals. These substances are constantly introduced into the environment and many of these pollutants are difficult to degrade. Toxic compounds classified as MoA 1 (Mode of Action 1) are low toxicity compounds that comprise nonreactive chemicals. In silico methods such as Quantitative Structure-Activity Relationships (QSARs) have been used to develop important models for prediction in several areas of science, as well as aquatic toxicity studies. The aim of the present study was to build a QSAR model-based set of theoretical Volsurf molecular descriptors using the fish acute toxicity values of compounds defined as MoA 1 to identify the molecular properties related to this mechanism. The selected Partial Least Squares (PLS) results based on the values of cross-validation coefficients of determination (Q cv 2 ) show the following values: Q cv 2 = 0.793, coefficient of determination (R 2 ) = 0.823, explained variance in external prediction (Q ext 2 ) = 0.87. From the selected descriptors, not only the hydrophobicity is related to the toxicity as already mentioned in previously published studies but other physicochemical properties combined contribute to the activity of these compounds. The symmetric distribution of the hydrophobic moieties in the structure of the compounds as well as the shape, as branched chains, are important features that are related to the toxicity. This information from the model can be useful in predicting so as to minimize the toxicity of organic compounds. Copyright © 2018. Published by Elsevier Inc.
Balupuri, Anand; Balasubramanian, Pavithra K; Cho, Seung J
2016-01-01
Checkpoint kinase 1 (Chk1) has emerged as a potential therapeutic target for design and development of novel anticancer drugs. Herein, we have performed three-dimensional quantitative structure-activity relationship (3D-QSAR) and molecular docking analyses on a series of diazacarbazoles to design potent Chk1 inhibitors. 3D-QSAR models were developed using comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) techniques. Docking studies were performed using AutoDock. The best CoMFA and CoMSIA models exhibited cross-validated correlation coefficient (q2) values of 0.631 and 0.585, and non-cross-validated correlation coefficient (r2) values of 0.933 and 0.900, respectively. CoMFA and CoMSIA models showed reasonable external predictabilities (r2 pred) of 0.672 and 0.513, respectively. A satisfactory performance in the various internal and external validation techniques indicated the reliability and robustness of the best model. Docking studies were performed to explore the binding mode of inhibitors inside the active site of Chk1. Molecular docking revealed that hydrogen bond interactions with Lys38, Glu85 and Cys87 are essential for Chk1 inhibitory activity. The binding interaction patterns observed during docking studies were complementary to 3D-QSAR results. Information obtained from the contour map analysis was utilized to design novel potent Chk1 inhibitors. Their activities and binding affinities were predicted using the derived model and docking studies. Designed inhibitors were proposed as potential candidates for experimental synthesis.
Itteboina, Ramesh; Ballu, Srilata; Sivan, Sree Kanth; Manga, Vijjulatha
2017-10-01
Janus kinase 1 (JAK 1) belongs to the JAK family of intracellular nonreceptor tyrosine kinase. JAK-signal transducer and activator of transcription (JAK-STAT) pathway mediate signaling by cytokines, which control survival, proliferation and differentiation of a variety of cells. Three-dimensional quantitative structure activity relationship (3 D-QSAR), molecular docking and molecular dynamics (MD) methods was carried out on a dataset of Janus kinase 1(JAK 1) inhibitors. Ligands were constructed and docked into the active site of protein using GLIDE 5.6. Best docked poses were selected after analysis for further 3 D-QSAR analysis using comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) methodology. Employing 60 molecules in the training set, 3 D-QSAR models were generate that showed good statistical reliability, which is clearly observed in terms of r 2 ncv and q 2 loo values. The predictive ability of these models was determined using a test set of 25 molecules that gave acceptable predictive correlation (r 2 Pred ) values. The key amino acid residues were identified by means of molecular docking, and the stability and rationality of the derived molecular conformations were also validated by MD simulation. The good consonance between the docking results and CoMFA/CoMSIA contour maps provides helpful clues about the reasonable modification of molecules in order to design more efficient JAK 1 inhibitors. The developed models are expected to provide some directives for further synthesis of highly effective JAK 1 inhibitors.
Kafoury, Ramzi M; Huang, Ming-Ju
2005-08-01
The sequence of events leading to ozone-induced airway inflammation is not well known. To elucidate the molecular and cellular events underlying ozone toxicity in the lung, we hypothesized that lipid ozonation products (LOPs) generated by the reaction of ozone with unsaturated fatty acids in the epithelial lining fluid and cell membranes play a key role in mediating ozone-induced airway inflammation. To test our hypothesis, we ozonized 1-palmitoyl-2-oleoyl-sn-glycero-3-phosphatidylcholine (POPC) and generated LOPs. Confluent human bronchial epithelial cells were exposed to the derivatives of ozonized POPC-9-oxononanoyl, 9-hydroxy-9-hydroperoxynonanoyl, and 8-(5-octyl-1,2,4-trioxolan-3-yl-)octanoyl-at a concentration of 10 muM, and the activity of phospholipases A2 (PLA2), C (PLC), and D (PLD) was measured (1, 0.5, and 1 h, respectively). Quantitative structure-activity relationship (QSAR) models were utilized to predict the biological activity of LOPs in airway epithelial cells. The QSAR results showed a strong correlation between experimental and computed activity (r = 0.97, 0.98, 0.99, for PLA2, PLC, and PLD, respectively). The results indicate that QSAR models can be utilized to predict the biological activity of the various ozone-derived LOP species in the lung. Copyright 2005 Wiley Periodicals, Inc.
Horobin, R W; Stockert, J C; Rashid-Doubell, F
2015-05-01
We discuss a variety of biological targets including generic biomembranes and the membranes of the endoplasmic reticulum, endosomes/lysosomes, Golgi body, mitochondria (outer and inner membranes) and the plasma membrane of usual fluidity. For each target, we discuss the access of probes to the target membrane, probe uptake into the membrane and the mechanism of selectivity of the probe uptake. A statement of the QSAR decision rule that describes the required physicochemical features of probes that enable selective staining also is provided, followed by comments on exceptions and limits. Examples of probes typically used to demonstrate each target structure are noted and decision rule tabulations are provided for probes that localize in particular targets; these tabulations show distribution of probes in the conceptual space defined by the relevant structure parameters ("parameter space"). Some general implications and limitations of the QSAR models for probe targeting are discussed including the roles of certain cell and protocol factors that play significant roles in lipid staining. A case example illustrates the predictive ability of QSAR models. Key limiting values of the head group hydrophilicity parameter associated with membrane-probe interactions are discussed in an appendix.
Lu, Qingzhang; Shen, Guoli; Yu, Ruqin
2002-11-15
The chaotic dynamical system is introduced in genetic algorithm to train ANN to formulate the CGANN algorithm. Logistic mapping as one of the most important chaotic dynamic mappings provides each new generation a high chance to hold GA's population diversity. This enhances the ability to overcome overfitting in training an ANN. The proposed CGANN has been used for QSAR studies to predict the tetrahedral modes (nu(1)(A1) and nu(2)(E)) of halides [MX(4)](epsilon). The frequencies predicted by QSAR were compared with those calculated by quantum chemistry methods including PM3, AM1, and MNDO/d. The possibility of improving the predictive ability of QSAR by including quantum chemistry parameters as feature variables has been investigated using tetrahedral tetrahalide examples. Copyright 2002 Wiley Periodicals, Inc.
Development of a general baseline toxicity QSAR model for the fish embryo acute toxicity test.
Klüver, Nils; Vogs, Carolina; Altenburger, Rolf; Escher, Beate I; Scholz, Stefan
2016-12-01
Fish embryos have become a popular model in ecotoxicology and toxicology. The fish embryo acute toxicity test (FET) with the zebrafish embryo was recently adopted by the OECD as technical guideline TG 236 and a large database of concentrations causing 50% lethality (LC 50 ) is available in the literature. Quantitative Structure-Activity Relationships (QSARs) of baseline toxicity (also called narcosis) are helpful to estimate the minimum toxicity of chemicals to be tested and to identify excess toxicity in existing data sets. Here, we analyzed an existing fish embryo toxicity database and established a QSAR for fish embryo LC 50 using chemicals that were independently classified to act according to the non-specific mode of action of baseline toxicity. The octanol-water partition coefficient K ow is commonly applied to discriminate between non-polar and polar narcotics. Replacing the K ow by the liposome-water partition coefficient K lipw yielded a common QSAR for polar and non-polar baseline toxicants. This developed baseline toxicity QSAR was applied to compare the final mode of action (MOA) assignment of 132 chemicals. Further, we included the analysis of internal lethal concentration (ILC 50 ) and chemical activity (La 50 ) as complementary approaches to evaluate the robustness of the FET baseline toxicity. The analysis of the FET dataset revealed that specifically acting and reactive chemicals converged towards the baseline toxicity QSAR with increasing hydrophobicity. The developed FET baseline toxicity QSAR can be used to identify specifically acting or reactive compounds by determination of the toxic ratio and in combination with appropriate endpoints to infer the MOA for chemicals. Copyright © 2016 Elsevier Ltd. All rights reserved.
2014-01-01
Background A number of microtubule disassembly blocking agents and inhibitors of tubulin polymerization have been elements of great interest in anti-cancer therapy, some of them even entering into the clinical trials. One such class of tubulin assembly inhibitors is of arylthioindole derivatives which results in effective microtubule disorganization responsible for cell apoptosis by interacting with the colchicine binding site of the β-unit of tubulin close to the interface with the α unit. We modelled the human tubulin β unit (chain D) protein and performed docking studies to elucidate the detailed binding mode of actions associated with their inhibition. The activity enhancing structural aspects were evaluated using a fragment-based Group QSAR (G-QSAR) model and was validated statistically to determine its robustness. A combinatorial library was generated keeping the arylthioindole moiety as the template and their activities were predicted. Results The G-QSAR model obtained was statistically significant with r2 value of 0.85, cross validated correlation coefficient q2 value of 0.71 and pred_r2 (r2 value for test set) value of 0.89. A high F test value of 65.76 suggests robustness of the model. Screening of the combinatorial library on the basis of predicted activity values yielded two compounds HPI (predicted pIC50 = 6.042) and MSI (predicted pIC50 = 6.001) whose interactions with the D chain of modelled human tubulin protein were evaluated in detail. A toxicity evaluation resulted in MSI being less toxic in comparison to HPI. Conclusions The study provides an insight into the crucial structural requirements and the necessary chemical substitutions required for the arylthioindole moiety to exhibit enhanced inhibitory activity against human tubulin. The two reported compounds HPI and MSI showed promising anti cancer activities and thus can be considered as potent leads against cancer. The toxicity evaluation of these compounds suggests that MSI is a promising therapeutic candidate. This study provided another stepping stone in the direction of evaluating tubulin inhibition and microtubule disassembly degeneration as viable targets for development of novel therapeutics against cancer. PMID:25521775
Liu, H; Ji, M; Jiang, H; Liu, L; Hua, W; Chen, K; Ji, R
2000-10-02
Class III antiarrhythmic agents selectively delay the effective refractory period (ERP) and increase the transmembrance action potential duration (APD). Based on our previous studies, a set of 17 methylsulfonamido phenylethylamine analogues were investigated by 3D-QSAR techniques of CoMFA and CoMSIA. The 3D-QSAR models proved a good predictive ability, and could describe the steric, electrostatic and hydrophobic requirements for recognition forces of the receptor site. According to the clues provided by this 3D-QSAR analysis, we designed and synthesized a series of new analogues of methanesulfonamido phenylethylamine (VIa-i). Pharmacological assay indicated that the effective concentrations of delaying the functional refractory period (FRP) 10ms of these new compounds have a good correlation with the 3D-QSAR predicted values. It is remarkable that the maximal percent change of delaying FRP in microM of compound VIc is much higher than that of dofetilide. The results showed that the 3D-QSAR models are reliable.
NASA Astrophysics Data System (ADS)
Santos-Filho, Osvaldo Andrade; Hopfinger, Anton J.
2001-01-01
A set of 18 structurally diverse antifolates including pyrimethamine, cycloguanil, methotrexate, aminopterin and trimethoprim, and 13 pyrrolo[2,3-d]pyrimidines were studied using four-dimensional quantitative structure-activity relationship (4D-QSAR) analysis. The corresponding biological activities of these compounds include IC50 inhibition constants for both the wild type, and a specific mutant type of Plasmodium falciparum dihydrofolate reductase (DHFR). Two thousand conformations of each analog were sampled to generate a conformational ensemble profile (CEP) from a molecular dynamics simulation (MDS) of 100,000 conformer trajectory states. Each sampled conformation was placed in a 1 Å cubic grid cell lattice for each of five trial alignments. The frequency of occupation of each grid cell was computed for each of six types of pharmacophore groups of atoms of each compound. These grid cell occupancy descriptors (GCODs) were then used as a descriptor pool to construct 4D-QSAR models. Models for inhibition of both the `wild' type and the mutant enzyme were generated which provide detailed spatial pharmacophore requirements for inhibition in terms of atom types and their corresponding relative locations in space. The 4D-QSAR models indicate some structural features perhaps relevant to the mechanism of resistance of the Plasmodium falciparum DHFR to current antimalarials. One feature identified is a slightly different binding alignment of the ligands to the mutant form of the enzyme as compared to the wild type.
Assessing and predicting drug-induced anticholinergic risks: an integrated computational approach.
Xu, Dong; Anderson, Heather D; Tao, Aoxiang; Hannah, Katia L; Linnebur, Sunny A; Valuck, Robert J; Culbertson, Vaughn L
2017-11-01
Anticholinergic (AC) adverse drug events (ADEs) are caused by inhibition of muscarinic receptors as a result of designated or off-target drug-receptor interactions. In practice, AC toxicity is assessed primarily based on clinician experience. The goal of this study was to evaluate a novel concept of integrating big pharmacological and healthcare data to assess clinical AC toxicity risks. AC toxicity scores (ATSs) were computed using drug-receptor inhibitions identified through pharmacological data screening. A longitudinal retrospective cohort study using medical claims data was performed to quantify AC clinical risks. ATS was compared with two previously reported toxicity measures. A quantitative structure-activity relationship (QSAR) model was established for rapid assessment and prediction of AC clinical risks. A total of 25 common medications, and 575,228 exposed and unexposed patients were analyzed. Our data indicated that ATS is more consistent with the trend of AC outcomes than other toxicity methods. Incorporating drug pharmacokinetic parameters to ATS yielded a QSAR model with excellent correlation to AC incident rate ( R 2 = 0.83) and predictive performance (cross validation Q 2 = 0.64). Good correlation and predictive performance ( R 2 = 0.68/ Q 2 = 0.29) were also obtained for an M2 receptor-specific QSAR model and tachycardia, an M2 receptor-specific ADE. Albeit using a small medication sample size, our pilot data demonstrated the potential and feasibility of a new computational AC toxicity scoring approach driven by underlying pharmacology and big data analytics. Follow-up work is under way to further develop the ATS scoring approach and clinical toxicity predictive model using a large number of medications and clinical parameters.
Can currently available non-animal methods detect pre and pro-haptens? (QSAR2016)
Predictive testing to identify and characterise substances for their skin sensitisation potential has historically been based on animal tests such as the Local Lymph Node Assay (LLNA). In recent years, regulations in the cosmetics and chemicals sectors has provided a strong impe...
Ghanem, Ouahid Ben; Shah, Syed Nasir; Lévêque, Jean-Marc; Mutalib, M I Abdul; El-Harbawi, Mohanad; Khan, Amir Sada; Alnarabiji, Mohamad Sahban; Al-Absi, Hamada R H; Ullah, Zahoor
2018-03-01
Over the past decades, Ionic liquids (ILs) have gained considerable attention from the scientific community in reason of their versatility and performance in many fields. However, they nowadays remain mainly for laboratory scale use. The main barrier hampering their use in a larger scale is their questionable ecological toxicity. This study investigated the effect of hydrophobic and hydrophilic cyclic cation-based ILs against four pathogenic bacteria that infect humans. For that, cations, either of aromatic character (imidazolium or pyridinium) or of non-aromatic nature, (pyrrolidinium or piperidinium), were selected with different alkyl chain lengths and combined with both hydrophilic and hydrophobic anionic moieties. The results clearly demonstrated that introducing of hydrophobic anion namely bis((trifluoromethyl)sulfonyl)amide, [NTF 2 ] and the elongation of the cations substitutions dramatically affect ILs toxicity behaviour. The established toxicity data [50% effective concentration (EC 50 )] along with similar endpoint collected from previous work against Aeromonas hydrophila were combined to developed quantitative structure-activity relationship (QSAR) model for toxicity prediction. The model was developed and validated in the light of Organization for Economic Co-operation and Development (OECD) guidelines strategy, producing good correlation coefficient R 2 of 0.904 and small mean square error (MSE) of 0.095. The reliability of the QSAR model was further determined using k-fold cross validation. Copyright © 2017 Elsevier Ltd. All rights reserved.
Molecular Modeling in Drug Design for the Development of Organophosphorus Antidotes/Prophylactics.
1986-06-01
multidimensional statistical QSAR analysis techniques to suggest new structures for synthesis and evaluation. C. Application of quantum chemical techniques to...compounds for synthesis and testing for antidotal potency. E. Use of computer-assisted methods to determine the steric constraints at the active site...modeling techniques to model the enzyme acetylcholinester-se. H. Suggestion of some novel compounds for synthesis and testing for reactivating
SAR/QSAR MODELS FOR TOXICITY PREDICTION: APPROACHES AND NEW DIRECTIONS
Abstract
SAR/QSAR MODELS FOR TOXICITY PREDICTION: APPROACHES AND NEW DIRECTIONS
Risk assessment typically incorporates some relevant toxicity information upon which to base a sound estimation for a chemical of concern. However, there are many circumstances in whic...
AQUATIC TOXICITY MODE OF ACTION STUDIES APPLIED TO QSAR DEVELOPMENT
A series of QSAR models for predicting fish acute lethality were developed using systematically collected data on more than 600 chemicals. These models were developed based on the assumption that chemicals producing toxicity through a common mechanism will have commonality in the...
QSAR Modeling Using Large-Scale Databases: Case Study for HIV-1 Reverse Transcriptase Inhibitors.
Tarasova, Olga A; Urusova, Aleksandra F; Filimonov, Dmitry A; Nicklaus, Marc C; Zakharov, Alexey V; Poroikov, Vladimir V
2015-07-27
Large-scale databases are important sources of training sets for various QSAR modeling approaches. Generally, these databases contain information extracted from different sources. This variety of sources can produce inconsistency in the data, defined as sometimes widely diverging activity results for the same compound against the same target. Because such inconsistency can reduce the accuracy of predictive models built from these data, we are addressing the question of how best to use data from publicly and commercially accessible databases to create accurate and predictive QSAR models. We investigate the suitability of commercially and publicly available databases to QSAR modeling of antiviral activity (HIV-1 reverse transcriptase (RT) inhibition). We present several methods for the creation of modeling (i.e., training and test) sets from two, either commercially or freely available, databases: Thomson Reuters Integrity and ChEMBL. We found that the typical predictivities of QSAR models obtained using these different modeling set compilation methods differ significantly from each other. The best results were obtained using training sets compiled for compounds tested using only one method and material (i.e., a specific type of biological assay). Compound sets aggregated by target only typically yielded poorly predictive models. We discuss the possibility of "mix-and-matching" assay data across aggregating databases such as ChEMBL and Integrity and their current severe limitations for this purpose. One of them is the general lack of complete and semantic/computer-parsable descriptions of assay methodology carried by these databases that would allow one to determine mix-and-matchability of result sets at the assay level.
Thompson, Corbin G; Sedykh, Alexander; Nicol, Melanie R; Muratov, Eugene; Fourches, Denis; Tropsha, Alexander; Kashuba, Angela D M
2014-11-01
The exposure of oral antiretroviral (ARV) drugs in the female genital tract (FGT) is variable and almost unpredictable. Identifying an efficient method to find compounds with high tissue penetration would streamline the development of regimens for both HIV preexposure prophylaxis and viral reservoir targeting. Here we describe the cheminformatics investigation of diverse drugs with known FGT penetration using cluster analysis and quantitative structure-activity relationships (QSAR) modeling. A literature search over the 1950-2012 period identified 58 compounds (including 21 ARVs and representing 13 drug classes) associated with their actual concentration data for cervical or vaginal tissue, or cervicovaginal fluid. Cluster analysis revealed significant trends in the penetrative ability for certain chemotypes. QSAR models to predict genital tract concentrations normalized to blood plasma concentrations were developed with two machine learning techniques utilizing drugs' molecular descriptors and pharmacokinetic parameters as inputs. The QSAR model with the highest predictive accuracy had R(2)test=0.47. High volume of distribution, high MRP1 substrate probability, and low MRP4 substrate probability were associated with FGT concentrations ≥1.5-fold plasma concentrations. However, due to the limited FGT data available, prediction performances of all models were low. Despite this limitation, we were able to support our findings by correctly predicting the penetration class of rilpivirine and dolutegravir. With more data to enrich the models, we believe these methods could potentially enhance the current approach of clinical testing.
Lorca, Marcos; Morales-Verdejo, Cesar; Vásquez-Velásquez, David; Andrades-Lagos, Juan; Campanini-Salinas, Javier; Soto-Delgado, Jorge; Recabarren-Gajardo, Gonzalo; Mella, Jaime
2018-05-16
The wide tissue distribution of the adrenergic β3 receptor makes it a potential target for the treatment of multiple pathologies such as diabetes, obesity, depression, overactive bladder (OAB), and cancer. Currently, there is only one drug on the market, mirabegron, approved for the treatment of OAB. In the present study, we have carried out an extensive structure-activity relationship analysis of a series of 41 aryloxypropanolamine compounds based on three-dimensional quantitative structure-activity relationship (3D-QSAR) techniques. This is the first combined comparative molecular field analysis (CoMFA) and comparative molecular similarity index analysis (CoMSIA) study in a series of selective aryloxypropanolamines displaying anti-diabetes and anti-obesity pharmacological profiles. The best CoMFA and CoMSIA models presented values of r ² ncv = 0.993 and 0.984 and values of r ² test = 0.865 and 0.918, respectively. The results obtained were subjected to extensive external validation ( q ², r ², r ² m , etc.) and a final series of compounds was designed and their biological activity was predicted (best pEC 50 = 8.561).
Synthesis and QSAR study of novel α-methylene-γ-butyrolactone derivatives as antifungal agents.
Wu, Yong-Ling; Wang, De-Long; Guo, En-Hui; Song, Shuang; Feng, Jun-Tao; Zhang, Xing
2017-03-01
Thirty-six new α-benzylidene-γ-lactone compounds based α-methylene-γ-butyrolactone substructure were prepared and characterized by spectroscopic analysis. All compounds were evaluated for antifungal activities in vitro against six plant pathogenic fungi and the half maximal inhibitory concentration (IC 50 ) against Botrytis cinerea and Colletotrichum lagenarium were investigated. Compounds 5c-3 and 5c-5 with the halogen atom exhibited excellent fungicidal activity against B. cinerea (IC 50 =22.91, 18.89μM). The structure-activity relationships (SARs) analysis indicated that the derivatives with electron-withdrawing substituents at the meta- or para-positions improves the activity. Via the heuristic method, the generated quantitative structure-activity relationship (QSAR) model (R 2 =0.961) revealed a strong correlation of antifungal activity against B. cinerea with molecular structures of these compounds. Meanwhile, the cytotoxicity of 20 representative derivatives was tested in the human tumor cells line (HepG2) and the hepatic L02 cells line, the result indicated that the synthesized compounds showed significant inhibitory activity and limited selectivity. Compound 5c-5 has the highest fungicidal activity with IC 50 =18.89μM (against B. cinerea.) but low cytotoxicity with IC 50 =35.4μM (against HepG2 cell line) and IC 50 =68.8μM (against Hepatic L02 cell line). These encouraging results can be providing an alternative, promising use of α-benzylidene-γ-lactone through the design and exploration of eco-friendly fungicides with low toxicity and high efficiency. Copyright © 2017 Elsevier Ltd. All rights reserved.
Istyastono, Enade P; Nijmeijer, Saskia; Lim, Herman D; van de Stolpe, Andrea; Roumen, Luc; Kooistra, Albert J; Vischer, Henry F; de Esch, Iwan J P; Leurs, Rob; de Graaf, Chris
2011-12-08
The histamine H(4) receptor (H(4)R) is a G protein-coupled receptor (GPCR) that plays an important role in inflammation. Similar to the homologous histamine H(3) receptor (H(3)R), two acidic residues in the H(4)R binding pocket, D(3.32) and E(5.46), act as essential hydrogen bond acceptors of positively ionizable hydrogen bond donors in H(4)R ligands. Given the symmetric distribution of these complementary pharmacophore features in H(4)R and its ligands, different alternative ligand binding mode hypotheses have been proposed. The current study focuses on the elucidation of the molecular determinants of H(4)R-ligand binding modes by combining (3D) quantitative structure-activity relationship (QSAR), protein homology modeling, molecular dynamics simulations, and site-directed mutagenesis studies. We have designed and synthesized a series of clobenpropit (N-(4-chlorobenzyl)-S-[3-(4(5)-imidazolyl)propyl]isothiourea) derivatives to investigate H(4)R-ligand interactions and ligand binding orientations. Interestingly, our studies indicate that clobenpropit (2) itself can bind to H(4)R in two distinct binding modes, while the addition of a cyclohexyl group to the clobenpropit isothiourea moiety allows VUF5228 (5) to adopt only one specific binding mode in the H(4)R binding pocket. Our ligand-steered, experimentally supported protein modeling method gives new insights into ligand recognition by H(4)R and can be used as a general approach to elucidate the structure of protein-ligand complexes.
Design and Synthesis of Mannich bases as Benzimidazole Derivatives as Analgesic Agents.
Datar, Prasanna A; Limaye, Saleel A
2015-01-01
Mannich bases were selected for 2D QSAR study to derive meaningful relationship between the structural features and analgesic activity. Using the knowledge of important features a novel series was designed to obtain improved analgesic activity. A series of novel Mannich bases 1-(N-substituted amino)methyl]-2-substituted benzimidazole derivatives were synthesized and were screened for analgesic activity. Some of these compounds showed promising analgesic activity when compared with the standard drug diclofenac sodium.
NASA Astrophysics Data System (ADS)
Özbek, Neslihan; Alyar, Saliha; Alyar, Hamit; Şahin, Ertan; Karacan, Nurcan
2013-05-01
Copper(II), nickel(II), platinum(II) and palladium(II) complexes with 2-hydroxy-1-naphthaldehyde-N-methylpropanesulfonylhydrazone (nafpsmh) derived from propanesulfonic acid-1-methylhydrazide (psmh) were synthesized, their structure were identified, and antimicrobial activity of the compounds was screened against three Gram-positive and three Gram-negative bacteria. The results of antimicrobial studies indicate that Pt(II) and Pd(II) complexes showed the most activity against all bacteria. The crystal structure of 2-hydroxy-1-naphthaldehyde-N-methylpropanesulfonylhydrazone (nafpsmh) was also investigated by X-ray analysis. A series of Ni(II) sulfonyl hydrazone complexes (1-33) was synthesized and tested in vitro against Escherichia coli and Staphylococcus aureus. Their antimicrobial activities were used in the QSAR analysis. Four-parameter QSAR models revealed that nucleophilic reaction index for Ni and O atoms, and HOMO-LUMO energy gap play key roles in the antimicrobial activity.
The Potential of Micro Electro Mechanical Systems and Nanotechnology for the U.S. Army
2001-05-01
Quantitative Structure Activity Relationship ( QSAR ) model . The QSAR model calculates the proper composition of the polymer-carbon black matrix...example, the BEI Gyrochip Model QRS11 from Systron Donner Inertial Division has a startup time of less than 1 second, a Mean Time Between Failure (MTBF... modeling from many equations per atom to a few lines of code. This approach is amenable to parallel processing. Nevertheless, their programs require
Abdullah, Nor Hayati; Thomas, Noel Francis; Sivasothy, Yasodha; Lee, Vannajan Sanghiran; Liew, Sook Yee; Noorbatcha, Ibrahim Ali; Awang, Khalijah
2016-01-01
The mammalian hyaluronidase degrades hyaluronic acid by the cleavage of the β-1,4-glycosidic bond furnishing a tetrasaccharide molecule as the main product which is a highly angiogenic and potent inducer of inflammatory cytokines. Ursolic acid 1, isolated from Prismatomeris tetrandra, was identified as having the potential to develop inhibitors of hyaluronidase. A series of ursolic acid analogues were either synthesized via structure modification of ursolic acid 1 or commercially obtained. The evaluation of the inhibitory activity of these compounds on the hyaluronidase enzyme was conducted. Several structural, topological and quantum chemical descriptors for these compounds were calculated using semi empirical quantum chemical methods. A quantitative structure activity relationship study (QSAR) was performed to correlate these descriptors with the hyaluronidase inhibitory activity. The statistical characteristics provided by the best multi linear model (BML) (R2 = 0.9717, R2cv = 0.9506) indicated satisfactory stability and predictive ability of the developed model. The in silico molecular docking study which was used to determine the binding interactions revealed that the ursolic acid analog 22 had a strong affinity towards human hyaluronidase. PMID:26907251
Deeb, Omar; Shaik, Basheerulla; Agrawal, Vijay K
2014-10-01
Quantitative Structure-Activity Relationship (QSAR) models for binding affinity constants (log Ki) of 78 flavonoid ligands towards the benzodiazepine site of GABA (A) receptor complex were calculated using the machine learning methods: artificial neural network (ANN) and support vector machine (SVM) techniques. The models obtained were compared with those obtained using multiple linear regression (MLR) analysis. The descriptor selection and model building were performed with 10-fold cross-validation using the training data set. The SVM and MLR coefficient of determination values are 0.944 and 0.879, respectively, for the training set and are higher than those of ANN models. Though the SVM model shows improvement of training set fitting, the ANN model was superior to SVM and MLR in predicting the test set. Randomization test is employed to check the suitability of the models.
Correlating methane production to microbiota in anaerobic digesters fed synthetic wastewater.
Venkiteshwaran, K; Milferstedt, K; Hamelin, J; Fujimoto, M; Johnson, M; Zitomer, D H
2017-03-01
A quantitative structure activity relationship (QSAR) between relative abundance values and digester methane production rate was developed. For this, 50 triplicate anaerobic digester sets (150 total digesters) were each seeded with different methanogenic biomass samples obtained from full-scale, engineered methanogenic systems. Although all digesters were operated identically for at least 5 solids retention times (SRTs), their quasi steady-state function varied significantly, with average daily methane production rates ranging from 0.09 ± 0.004 to 1 ± 0.05 L-CH 4 /L R -day (L R = Liter of reactor volume) (average ± standard deviation). Digester microbial community structure was analyzed using more than 4.1 million partial 16S rRNA gene sequences of Archaea and Bacteria. At the genus level, 1300 operational taxonomic units (OTUs) were observed across all digesters, whereas each digester contained 158 ± 27 OTUs. Digester function did not correlate with typical biomass descriptors such as volatile suspended solids (VSS) concentration, microbial richness, diversity or evenness indices. However, methane production rate did correlate notably with relative abundances of one Archaeal and nine Bacterial OTUs. These relative abundances were used as descriptors to develop a multiple linear regression (MLR) QSAR equation to predict methane production rates solely based on microbial community data. The model explained over 66% of the variance in the experimental data set based on 149 anaerobic digesters with a standard error of 0.12 L-CH 4 /L R -day. This study provides a framework to relate engineered process function and microbial community composition which can be further expanded to include different feed stocks and digester operating conditions in order to develop a more robust QSAR model. Copyright © 2016 Elsevier Ltd. All rights reserved.
Su, Hanrui; Yu, Chunyang; Zhou, Yongfeng; Gong, Lidong; Li, Qilin; Alvarez, Pedro J J; Long, Mingce
2018-05-02
Tetra-amido macrocyclic ligand (TAML) activator is a functional analog of peroxidase enzymes, which activates hydrogen peroxide (H 2 O 2 ) to form high valence iron-oxo complexes that selectively degrade persistent aromatic organic contaminants (ACs) in water. Here, we develop quantitative structure-activity relationship (QSAR) models based on measured pseudo first-order kinetic rate coefficients (k obs ) of 29 ACs (e.g., phenols and pharmaceuticals) oxidized by TAML/H 2 O 2 at neutral and basic pH values to gain mechanistic insight on the selectivity and pH dependence of TAML/H 2 O 2 systems. These QSAR models infer that electron donating ability (E HOMO ) is the most important AC characteristic for TAML/H 2 O 2 oxidation, pointing to a rate-limiting single-electron transfer (SET) mechanism. Oxidation rates at pH 7 also depend on AC reactive indices such as f min - and qH + , which respectively represent propensity for electrophilic attack and the most positive net atomic charge on hydrogen atoms. At pH 10, TAML/H 2 O 2 is more reactive towards ACs with a lower hydrogen to carbon atoms ratio (#H:C), suggesting the significance of hydrogen atom abstraction. In addition, lnk obs of 14 monosubstituted phenols is negatively correlated with Hammett constants (σ) and exhibits similar sensitivity to substituent effects as horseradish peroxidase. Although accurately predicting degradation rates of specific ACs in complex wastewater matrices could be difficult, these QSAR models are statistically robust and help predict both relative degradability and reaction mechanism for TAML/H 2 O 2 -based treatment processes. Copyright © 2018 Elsevier Ltd. All rights reserved.
Rácz, A; Bajusz, D; Héberger, K
2015-01-01
Recent implementations of QSAR modelling software provide the user with numerous models and a wealth of information. In this work, we provide some guidance on how one should interpret the results of QSAR modelling, compare and assess the resulting models, and select the best and most consistent ones. Two QSAR datasets are applied as case studies for the comparison of model performance parameters and model selection methods. We demonstrate the capabilities of sum of ranking differences (SRD) in model selection and ranking, and identify the best performance indicators and models. While the exchange of the original training and (external) test sets does not affect the ranking of performance parameters, it provides improved models in certain cases (despite the lower number of molecules in the training set). Performance parameters for external validation are substantially separated from the other merits in SRD analyses, highlighting their value in data fusion.
In Silico Prediction of Organ Level Toxicity: Linking Chemistry to Adverse Effects
Cronin, Mark T.D.; Enoch, Steven J.; Mellor, Claire L.; Przybylak, Katarzyna R.; Richarz, Andrea-Nicole; Madden, Judith C.
2017-01-01
In silico methods to predict toxicity include the use of (Quantitative) Structure-Activity Relationships ((Q)SARs) as well as grouping (category formation) allowing for read-across. A challenging area for in silico modelling is the prediction of chronic toxicity and the No Observed (Adverse) Effect Level (NO(A)EL) in particular. A proposed solution to the prediction of chronic toxicity is to consider organ level effects, as opposed to modelling the NO(A)EL itself. This review has focussed on the use of structural alerts to identify potential liver toxicants. In silico profilers, or groups of structural alerts, have been developed based on mechanisms of action and informed by current knowledge of Adverse Outcome Pathways. These profilers are robust and can be coded computationally to allow for prediction. However, they do not cover all mechanisms or modes of liver toxicity and recommendations for the improvement of these approaches are given. PMID:28744348
In Silico Prediction of Organ Level Toxicity: Linking Chemistry to Adverse Effects.
Cronin, Mark T D; Enoch, Steven J; Mellor, Claire L; Przybylak, Katarzyna R; Richarz, Andrea-Nicole; Madden, Judith C
2017-07-01
In silico methods to predict toxicity include the use of (Quantitative) Structure-Activity Relationships ((Q)SARs) as well as grouping (category formation) allowing for read-across. A challenging area for in silico modelling is the prediction of chronic toxicity and the No Observed (Adverse) Effect Level (NO(A)EL) in particular. A proposed solution to the prediction of chronic toxicity is to consider organ level effects, as opposed to modelling the NO(A)EL itself. This review has focussed on the use of structural alerts to identify potential liver toxicants. In silico profilers, or groups of structural alerts, have been developed based on mechanisms of action and informed by current knowledge of Adverse Outcome Pathways. These profilers are robust and can be coded computationally to allow for prediction. However, they do not cover all mechanisms or modes of liver toxicity and recommendations for the improvement of these approaches are given.
One use of alternative methods is to target animal use at only those chemicals and tests that are absolutely necessary. We discuss prioritization of testing based on high-throughput screening assays (HTS), QSAR modeling, high-throughput toxicokinetics (HTTK), and exposure modelin...
Hocart, Simon J.; Liu, Huayin; Deng, Haiyan; De, Dibyendu; Krogstad, Frances M.; Krogstad, Donald J.
2011-01-01
Chloroquine (CQ) is a safe and economical 4-aminoquinoline (AQ) antimalarial. However, its value has been severely compromised by the increasing prevalence of CQ resistance. This study examined 108 AQs, including 68 newly synthesized compounds. Of these 108 AQs, 32 (30%) were active only against CQ-susceptible Plasmodium falciparum strains and 59 (55%) were active against both CQ-susceptible and CQ-resistant P. falciparum strains (50% inhibitory concentrations [IC50s], ≤25 nM). All AQs active against both CQ-susceptible and CQ-resistant P. falciparum strains shared four structural features: (i) an AQ ring without alkyl substitution, (ii) a halogen at position 7 (Cl, Br, or I but not F), (iii) a protonatable nitrogen at position 1, and (iv) a second protonatable nitrogen at the end of the side chain distal from the point of attachment to the AQ ring via the nitrogen at position 4. For activity against CQ-resistant parasites, side chain lengths of ≤3 or ≥10 carbons were necessary but not sufficient; they were identified as essential factors by visual comparison of 2-dimensional (2-D) structures in relation to the antiparasite activities of the AQs and were confirmed by computer-based 3-D comparisons and differential contour plots of activity against P. falciparum. The advantage of the method reported here (refinement of quantitative structure-activity relationship [QSAR] descriptors by random assignment of compounds to multiple training and test sets) is that it retains QSAR descriptors according to their abilities to predict the activities of unknown test compounds rather than according to how well they fit the activities of the compounds in the training sets. PMID:21383099
Nonparametric regression applied to quantitative structure-activity relationships
Constans; Hirst
2000-03-01
Several nonparametric regressors have been applied to modeling quantitative structure-activity relationship (QSAR) data. The simplest regressor, the Nadaraya-Watson, was assessed in a genuine multivariate setting. Other regressors, the local linear and the shifted Nadaraya-Watson, were implemented within additive models--a computationally more expedient approach, better suited for low-density designs. Performances were benchmarked against the nonlinear method of smoothing splines. A linear reference point was provided by multilinear regression (MLR). Variable selection was explored using systematic combinations of different variables and combinations of principal components. For the data set examined, 47 inhibitors of dopamine beta-hydroxylase, the additive nonparametric regressors have greater predictive accuracy (as measured by the mean absolute error of the predictions or the Pearson correlation in cross-validation trails) than MLR. The use of principal components did not improve the performance of the nonparametric regressors over use of the original descriptors, since the original descriptors are not strongly correlated. It remains to be seen if the nonparametric regressors can be successfully coupled with better variable selection and dimensionality reduction in the context of high-dimensional QSARs.
Mahapatra, Manoj Kumar; Bera, Krishnendu; Singh, Durg Vijay; Kumar, Rajnish; Kumar, Manoj
2018-04-01
Protein tyrosine phosphatase 1B (PTP1B) has been identified as a negative regulator of insulin and leptin signalling pathway; hence, it can be considered as a new therapeutic target of intervention for the treatment of type 2 diabetes. Inhibition of this molecular target takes care of both diabetes and obesity, i.e. diabestiy. In order to get more information on identification and optimization of lead, pharmacophore modelling, atom-based 3D QSAR, docking and molecular dynamics studies were carried out on a set of ligands containing thiazolidine scaffold. A six-point pharmacophore model consisting of three hydrogen bond acceptor (A), one negative ionic (N) and two aromatic rings (R) with discrete geometries as pharmacophoric features were developed for a predictive 3D QSAR model. The probable binding conformation of the ligands within the active site was studied through molecular docking. The molecular interactions and the structural features responsible for PTP1B inhibition and selectivity were further supplemented by molecular dynamics simulation study for a time scale of 30 ns. The present investigation has identified some of the indispensible structural features of thiazolidine analogues which can further be explored to optimize PTP1B inhibitors.
QSAR studies of macrocyclic diterpenes with P-glycoprotein inhibitory activity.
Sousa, Inês J; Ferreira, Maria-José U; Molnár, Joseph; Fernandes, Miguel X
2013-02-14
Multidrug resistance (MDR) represents a major limitation for cancer chemotherapy. There are several mechanisms of MDR but the most important is associated with P-glycoprotein (P-gp) overexpression. The development of modulators of P-gp that are able to re-establish drug sensitivity of resistant cells has been considered a promising approach for overcoming MDR. Macrocyclic lathyrane and jatrophane-type diterpenes from Euphorbia species were found to be strong MDR reversing agents. In this study we applied quantitative structure-activity relationship (QSAR) methodology in order to identify the most relevant molecular features of macrocyclic diterpenes with P-gp inhibitory activity and to determine which structural modifications can be performed to improve their activity. Using experimental biological data at two concentrations (4 and 40 μg/ml), we developed a QSAR model for a set of 51 bioactive diterpenic compounds which includes lathyrane and jatrophane-type diterpenes and another model just for jatrophanes. The cross-validation correlation values for all diterpenes QSAR models developed for biological activities at compound concentrations of 4 and 40 μg/ml were 0.758 and 0.729, respectively. Regarding the prediction ability, we get R²(pred) values of 0.765 and 0.534 for biological activities at compound concentrations of 4 and 40 μg/ml, respectively. Applying the cross-validation test to jatrophanes QSAR models, we obtained 0.680 and 0.787 for biological activities at compound concentrations of 4 and 40 μg/ml concentrations, respectively. For the same concentrations, the obtained R²(pred) values for jatrophanes models were 0.541 and 0.534, respectively. The obtained models were statistically valid and showed high prediction ability. Copyright © 2012 Elsevier B.V. All rights reserved.
Boik, John C; Newman, Robert A
2008-01-01
Background Quantitative structure-activity relationship (QSAR) models have become popular tools to help identify promising lead compounds in anticancer drug development. Few QSAR studies have investigated multitask learning, however. Multitask learning is an approach that allows distinct but related data sets to be used in training. In this paper, a suite of three QSAR models is developed to identify compounds that are likely to (a) exhibit cytotoxic behavior against cancer cells, (b) exhibit high rat LD50 values (low systemic toxicity), and (c) exhibit low to modest human oral clearance (favorable pharmacokinetic characteristics). Models were constructed using Kernel Multitask Latent Analysis (KMLA), an approach that can effectively handle a large number of correlated data features, nonlinear relationships between features and responses, and multitask learning. Multitask learning is particularly useful when the number of available training records is small relative to the number of features, as was the case with the oral clearance data. Results Multitask learning modestly but significantly improved the classification precision for the oral clearance model. For the cytotoxicity model, which was constructed using a large number of records, multitask learning did not affect precision but did reduce computation time. The models developed here were used to predict activities for 115,000 natural compounds. Hundreds of natural compounds, particularly in the anthraquinone and flavonoids groups, were predicted to be cytotoxic, have high LD50 values, and have low to moderate oral clearance. Conclusion Multitask learning can be useful in some QSAR models. A suite of QSAR models was constructed and used to screen a large drug library for compounds likely to be cytotoxic to multiple cancer cell lines in vitro, have low systemic toxicity in rats, and have favorable pharmacokinetic properties in humans. PMID:18554402
Boik, John C; Newman, Robert A
2008-06-13
Quantitative structure-activity relationship (QSAR) models have become popular tools to help identify promising lead compounds in anticancer drug development. Few QSAR studies have investigated multitask learning, however. Multitask learning is an approach that allows distinct but related data sets to be used in training. In this paper, a suite of three QSAR models is developed to identify compounds that are likely to (a) exhibit cytotoxic behavior against cancer cells, (b) exhibit high rat LD50 values (low systemic toxicity), and (c) exhibit low to modest human oral clearance (favorable pharmacokinetic characteristics). Models were constructed using Kernel Multitask Latent Analysis (KMLA), an approach that can effectively handle a large number of correlated data features, nonlinear relationships between features and responses, and multitask learning. Multitask learning is particularly useful when the number of available training records is small relative to the number of features, as was the case with the oral clearance data. Multitask learning modestly but significantly improved the classification precision for the oral clearance model. For the cytotoxicity model, which was constructed using a large number of records, multitask learning did not affect precision but did reduce computation time. The models developed here were used to predict activities for 115,000 natural compounds. Hundreds of natural compounds, particularly in the anthraquinone and flavonoids groups, were predicted to be cytotoxic, have high LD50 values, and have low to moderate oral clearance. Multitask learning can be useful in some QSAR models. A suite of QSAR models was constructed and used to screen a large drug library for compounds likely to be cytotoxic to multiple cancer cell lines in vitro, have low systemic toxicity in rats, and have favorable pharmacokinetic properties in humans.
NASA Astrophysics Data System (ADS)
Lalit, Manisha; Gangwal, Rahul P.; Dhoke, Gaurao V.; Damre, Mangesh V.; Khandelwal, Kanchan; Sangamwar, Abhay T.
2013-10-01
A combined pharmacophore modelling, 3D-QSAR and molecular docking approach was employed to reveal structural and chemical features essential for the development of small molecules as LRH-1 agonists. The best HypoGen pharmacophore hypothesis (Hypo1) consists of one hydrogen-bond donor (HBD), two general hydrophobic (H), one hydrophobic aromatic (HYAr) and one hydrophobic aliphatic (HYA) feature. It has exhibited high correlation coefficient of 0.927, cost difference of 85.178 bit and low RMS value of 1.411. This pharmacophore hypothesis was cross-validated using test set, decoy set and Cat-Scramble methodology. Subsequently, validated pharmacophore hypothesis was used in the screening of small chemical databases. Further, 3D-QSAR models were developed based on the alignment obtained using substructure alignment. The best CoMFA and CoMSIA model has exhibited excellent rncv2 values of 0.991 and 0.987, and rcv2 values of 0.767 and 0.703, respectively. CoMFA predicted rpred2 of 0.87 and CoMSIA predicted rpred2 of 0.78 showed that the predicted values were in good agreement with the experimental values. Molecular docking analysis reveals that π-π interaction with His390 and hydrogen bond interaction with His390/Arg393 is essential for LRH-1 agonistic activity. The results from pharmacophore modelling, 3D-QSAR and molecular docking are complementary to each other and could serve as a powerful tool for the discovery of potent small molecules as LRH-1 agonists.
Vyas, V K; Gupta, N; Ghate, M; Patel, S
2014-01-01
In this study we designed novel substituted benzimidazole derivatives and predicted their absorption, distribution, metabolism, excretion and toxicity (ADMET) properties, based on a predictive 3D QSAR study on 132 substituted benzimidazoles as AngII-AT1 receptor antagonists. The two best predicted compounds were synthesized and evaluated for AngII-AT1 receptor antagonism. Three different alignment tools for comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) were used. The best 3D QSAR models were obtained using the rigid body (Distill) alignment method. CoMFA and CoMSIA models were found to be statistically significant with leave-one-out correlation coefficients (q(2)) of 0.630 and 0.623, respectively, cross-validated coefficients (r(2)cv) of 0.651 and 0.630, respectively, and conventional coefficients of determination (r(2)) of 0.848 and 0.843, respectively. 3D QSAR models were validated using a test set of 24 compounds, giving satisfactory predicted results (r(2)pred) of 0.727 and 0.689 for the CoMFA and CoMSIA models, respectively. We have identified some key features in substituted benzimidazole derivatives, such as lipophilicity and H-bonding at the 2- and 5-positions of the benzimidazole nucleus, respectively, for AT1 receptor antagonistic activity. We designed 20 novel substituted benzimidazole derivatives and predicted their activity. In silico ADMET properties were also predicted for these designed molecules. Finally, the compounds with best predicted activity were synthesized and evaluated for in vitro angiotensin II-AT1 receptor antagonism.
DSSTOX (DISTRIBUTED STRUCTURE-SEARCHABLE ...
Distributed Structure-Searchable Toxicity Database Network Major trends affecting public toxicity information resources have the potential to significantly alter the future of predictive toxicology. Chemical toxicity screening is undergoing shifts towards greater use of more fundamental information on gene/protein expression patterns and bioactivity and bioassay profiles, the latter generated with highthroughput screening technologies. Curated, systematically organized, and webaccessible toxicity and biological activity data in association with chemical structures, enabling the integration of diverse data information domains, will fuel the next frontier of advancement for QSAR (quantitative structure-activity relationship) and data mining technologies. The DSSTox project is supporting progress towards these goals on many fronts, promoting the use of formalized and structure-annotated toxicity data models, helping to interface these efforts with QSAR modelers, linking data from diverse sources, and creating a large, quality reviewed, central chemical structure information resource linked to various toxicity data sources
Gu, Wenwen; Chen, Ying; Li, Yu
2017-08-01
Based on the experimental subcooled liquid vapor pressures (P L ) of 17 polychlorinated naphthalene (PCN) congeners, one type of three-dimensional quantitative structure-activity relationship (3D-QSAR) models, comparative molecular similarity indices analysis (CoMSIA), was constructed with Sybyl software. Full factor experimental design was used to obtain the final regulation scheme for PCN, and then carry out modification of PCN-2 to significantly lower its P L . The contour maps of CoMSIA model showed that the migration ability of PCN decreases when the Cl atoms at the 2-, 3-, 4-, 5-, 6-, 7- and 8-positions of PCNs are replaced by electropositive groups. After modification of PCN-2, 12 types of new modified PCN-2 compounds were obtained with lnP L values two orders of magnitude lower than that of PCN-2. In addition, there are significant differences between the calculated total energies and energy gaps of the new modified compounds and those of PCN-2.
Martins Alho, Miriam A; Marrero-Ponce, Yovani; Barigye, Stephen J; Meneses-Marcel, Alfredo; Machado Tugores, Yanetsy; Montero-Torres, Alina; Gómez-Barrio, Alicia; Nogal, Juan J; García-Sánchez, Rory N; Vega, María Celeste; Rolón, Miriam; Martínez-Fernández, Antonio R; Escario, José A; Pérez-Giménez, Facundo; Garcia-Domenech, Ramón; Rivera, Norma; Mondragón, Ricardo; Mondragón, Mónica; Ibarra-Velarde, Froylán; Lopez-Arencibia, Atteneri; Martín-Navarro, Carmen; Lorenzo-Morales, Jacob; Cabrera-Serra, Maria Gabriela; Piñero, Jose; Tytgat, Jan; Chicharro, Roberto; Arán, Vicente J
2014-03-01
Protozoan parasites have been one of the most significant public health problems for centuries and several human infections caused by them have massive global impact. Most of the current drugs used to treat these illnesses have been used for decades and have many limitations such as the emergence of drug resistance, severe side-effects, low-to-medium drug efficacy, administration routes, cost, etc. These drugs have been largely neglected as models for drug development because they are majorly used in countries with limited resources and as a consequence with scarce marketing possibilities. Nowadays, there is a pressing need to identify and develop new drug-based antiprotozoan therapies. In an effort to overcome this problem, the main purpose of this study is to develop a QSARs-based ensemble classifier for antiprotozoan drug-like entities from a heterogeneous compounds collection. Here, we use some of the TOMOCOMD-CARDD molecular descriptors and linear discriminant analysis (LDA) to derive individual linear classification functions in order to discriminate between antiprotozoan and non-antiprotozoan compounds as a way to enable the computational screening of virtual combinatorial datasets and/or drugs already approved. Firstly, we construct a wide-spectrum benchmark database comprising of 680 organic chemicals with great structural variability (254 of them antiprotozoan agents and 426 to drugs having other clinical uses). This series of compounds was processed by a k-means cluster analysis in order to design training and predicting sets. In total, seven discriminant functions were obtained, by using the whole set of atom-based linear indices. All the LDA-based QSAR models show accuracies above 85% in the training set and values of Matthews correlation coefficients (C) vary from 0.70 to 0.86. The external validation set shows rather-good global classifications of around 80% (92.05% for best equation). Later, we developed a multi-agent QSAR classification system, in which the individual QSAR outputs are the inputs of the aforementioned fusion approach. Finally, the fusion model was used for the identification of a novel generation of lead-like antiprotozoan compounds by using ligand-based virtual screening of 'available' small molecules (with synthetic feasibility) in our 'in-house' library. A new molecular subsystem (quinoxalinones) was then theoretically selected as a promising lead series, and its derivatives subsequently synthesized, structurally characterized, and experimentally assayed by using in vitro screening that took into consideration a battery of five parasite-based assays. The chemicals 11(12) and 16 are the most active (hits) against apicomplexa (sporozoa) and mastigophora (flagellata) subphylum parasites, respectively. Both compounds depicted good activity in every protozoan in vitro panel and they did not show unspecific cytotoxicity on the host cells. The described technical framework seems to be a promising QSAR-classifier tool for the molecular discovery and development of novel classes of broad-antiprotozoan-spectrum drugs, which may meet the dual challenges posed by drug-resistant parasites and the rapid progression of protozoan illnesses. Copyright © 2014 Elsevier Ltd. All rights reserved.
Deep neural nets as a method for quantitative structure-activity relationships.
Ma, Junshui; Sheridan, Robert P; Liaw, Andy; Dahl, George E; Svetnik, Vladimir
2015-02-23
Neural networks were widely used for quantitative structure-activity relationships (QSAR) in the 1990s. Because of various practical issues (e.g., slow on large problems, difficult to train, prone to overfitting, etc.), they were superseded by more robust methods like support vector machine (SVM) and random forest (RF), which arose in the early 2000s. The last 10 years has witnessed a revival of neural networks in the machine learning community thanks to new methods for preventing overfitting, more efficient training algorithms, and advancements in computer hardware. In particular, deep neural nets (DNNs), i.e. neural nets with more than one hidden layer, have found great successes in many applications, such as computer vision and natural language processing. Here we show that DNNs can routinely make better prospective predictions than RF on a set of large diverse QSAR data sets that are taken from Merck's drug discovery effort. The number of adjustable parameters needed for DNNs is fairly large, but our results show that it is not necessary to optimize them for individual data sets, and a single set of recommended parameters can achieve better performance than RF for most of the data sets we studied. The usefulness of the parameters is demonstrated on additional data sets not used in the calibration. Although training DNNs is still computationally intensive, using graphical processing units (GPUs) can make this issue manageable.
Gurung, Arun Bahadur; Aguan, Kripamoy; Mitra, Sivaprasad; Bhattacharjee, Atanu
2017-06-01
In Alzheimer's disease (AD), the level of Acetylcholine (ACh) neurotransmitter is reduced. Since Acetylcholinesterase (AChE) cleaves ACh, inhibitors of AChE are very much sought after for AD treatment. The side effects of current inhibitors necessitate development of newer AChE inhibitors. Isoalloxazine derivatives have proved to be promising (AChE) inhibitors. However, their structure-activity relationship studies have not been reported till date. In the present work, various quantitative structure-activity relationship (QSAR) building methods such as multiple linear regression (MLR), partial least squares ,and principal component regression were employed to derive 3D-QSAR models using steric and electrostatic field descriptors. Statistically significant model was obtained using MLR coupled with stepwise selection method having r 2 = .9405, cross validated r 2 (q 2 ) = .6683, and a high predictability (pred_r 2 = .6206 and standard error, pred_r 2 se = .2491). Steric and electrostatic contribution plot revealed three electrostatic fields E_496, E_386 and E_577 and one steric field S_60 contributing towards biological activity. A ligand-based 3D-pharmacophore model was generated consisting of eight pharmacophore features. Isoalloxazine derivatives were docked against human AChE, which revealed critical residues implicated in hydrogen bonds as well as hydrophobic interactions. The binding modes of docked complexes (AChE_IA1 and AChE_IA14) were validated by molecular dynamics simulation which showed their stable trajectories in terms of root mean square deviation and molecular mechanics/Poisson-Boltzmann surface area binding free energy analysis revealed key residues contributing significantly to overall binding energy. The present study may be useful in the design of more potent Isoalloxazine derivatives as AChE inhibitors.
Tomioka, Haruaki
2014-01-01
Worldwide, tuberculosis (TB) remains the most frequent and important infectious disease causing morbidity and death. However, the development of new drugs for the treatment and prophylaxis of TB, particularly those truly active against dormant and persistent types of tubercle bacilli, has been slow, although some promising drugs, such as diarylquinoline TMC207, nitroimidazopyran PA-824, nitroimidazo-oxazole Delamanid (OPC-67683), oxazolidinone PNU-100480, ethylene diamine SQ-109, and pyrrole derivative LL3858, are currently under phase 1 to 3 clinical trials. Therefore, novel types of antituberculous drug, which act on unique drug targets in Mycobacterium tuberculosis (MTB) pathogens, particularly drug targets related to the establishment of mycobacterial dormancy in the host's macrophages, are urgently needed. In this context, it should be noted that current anti-TB drugs mostly target the metabolic reactions and proteins which are essential for the growth of MTB in extracellular milieus. It may also be promising to develop another type of drug that exerts an inhibitory action against bacterial virulence factors which cross-talk and interfere with signaling pathways of MTB-infected immunocompetent host cells, such as lymphocytes, macrophages, and NK cells, thereby changing the intracellular milieus that are favorable to intramacrophage survival and the growth of infected bacilli. This special issue contains ten review articles, dealing with recent approaches to identify and establish novel drug targets in MTB for the development of new and unique antitubercular drugs, including those related to mycobacterial dormancy and crosstalk with cellular signaling pathways. In addition, this special issue contains some review papers with special reference to the drug design based on quantitative structure-activity relationship (QSAR) analysis, especially three-dimensional (3D)-QSAR. New, critical information on the entire genome of MTB and mycobacterial virulence genes is promoting the elucidation of the molecular structures of drug targets in MTB, and are consequently markedly useful for the design of new, promising antituberculous drugs using QSAR techniques. In this issue, we review the following areas. Firstly, Dr. Li M. Fu reviews the perspective that combines machine learning and genomics for drug discovery in tuberculosis, in relation to the problem that the exhaustive search for useful drug targets over the entire MTB genome would not be as productive as expected in practice [1]. Secondly, the review article by Drs. R. S. Chauhan. S. K. Chanumolu, C. Rout, and R. Shrivastava focuses on analysis of the current state of MTB genomic resources, host-pathogen interaction studies in the context of mycobacterial persistence, and drug target discovery based on the utilization of computational tools and metabolic network analyses [2]. Thirdly, Drs. Daria Bottai, Agnese Serafini, Alessandro Cascioferro, Roland Brosch, and Riccardo Manganelli review the current knowledge on MTB T7SS/ESX secretion systems and their impact on MTB physiology and virulence, and the possible approaches to develop T7SS/ESX inhibitors [3]. Fourthly, Drs. E. Jeffrey North, Mary Jackson, and Richard E. Lee review and analyze new and emerging inhibitors of the mycolic acid biosynthetic pathway, including mycobacterial enzymes for fatty acid synthesis, mycolic acid-modifying enzymes, fatty acid-activating and -condensing enzymes, transporters, and transferases, that have been discovered in the post-genomic era of tuberculosis drug discovery [4]. Fifthly, Drs. Katarina Mikusova, Vadim Makarov, and Joao Neres review the mycobacterial enzyme DprE1, which catalyzes a unique epimerization reaction in the biosynthesis of decaprenylphosphoryl arabinose, a single donor of the arabinosyl residue for the build-up of arabinans, one of the mycobacterial cell wall components, as an important drug target especially for the development of benzothiazinones [5]. Sixthly, I review the present status of global research on novel drug targets related to the Toll-like receptor in the MTB pathogen, with special reference to mycobacterial virulence factors that cross-talk and interfere with signaling pathways of host macrophages [6]. The following four review articles deal with drug design of novel anti-TB agents employing QSAR techniques. Firstly, Drs. Nidhi and Mohammad Imran Siddiqi review 2D and 3D QSAR approaches and the recent trends of these methods integrated with virtual screening using the 3D pharmacophore and molecular docking approaches for the identification and design of novel antituberculous agents, by presenting a comprehensive overview of QSAR studies reported for newer antituberculous agents [7]. Secondly, Drs. Filomena Martins, Cristina Ventura, Susana Santos, and Miguel Viveiros review the current status of different QSAR-based strategies for the design of novel anti-TB drugs based upon the most active anti-TB agent, isoniazid, from the viewpoint of the development of promising derivatives that are active against isoniazid- resistant strains with katG mutations [8]. Thirdly, Drs. Sanchaita Rajkhowa and Ramesh C. Deka review current studies concerning 2D and 3D QSAR models that contain density-functional theory (DFT)-based descriptors as their parameters [9]. Notably, DFT-based descriptors such as atomic charges, molecular orbital energies, frontier orbital densities, and atom-atom polarizabilities are very useful in predicting the reactivity of atoms in molecules. Fourthly, Drs. Renata V. Bueno, Rodolpho C. Braga, Natanael D. Segretti, Elizabeth I. Ferreira, Gustavo H. G. Trossini, and Carolina H. Andrade review the current progress and applications of QSAR analysis for the discovery of innovative tuberculostatic agents as inhibitors of ribonucleotide reductase, DNA gyrase, ATP synthase, and thymidylate kinase enzymes, highlighting present challenges and new opportunities in TB drug design [10]. The aim of this issue is to address the future prospects for the development of new antituberculous drugs. There are a number of difficulties in computational drug-design for the development of new drug formulations with potential antimycobacterial effects, especially therapeutic and prophylactic efficacy against infection due to dormant-type MTB pathogens. In addition, it should be emphasized that the most urgent goal of TB chemotherapy is develop highly active, low-cost drugs which can be used not only in industrialized but also in developing countries, because most global TB incidence occurs in the latter. I am sincerely grateful to the individuals who contributed to this work. All authors are experts in their fields and they made earnest efforts to perform these in-depth reviews. I thank them all.
Modeling of adipose/blood partition coefficient for environmental chemicals.
Papadaki, K C; Karakitsios, S P; Sarigiannis, D A
2017-12-01
A Quantitative Structure Activity Relationship (QSAR) model was developed in order to predict the adipose/blood partition coefficient of environmental chemical compounds. The first step of QSAR modeling was the collection of inputs. Input data included the experimental values of adipose/blood partition coefficient and two sets of molecular descriptors for 67 organic chemical compounds; a) the descriptors from Linear Free Energy Relationship (LFER) and b) the PaDEL descriptors. The datasets were split to training and prediction set and were analysed using two statistical methods; Genetic Algorithm based Multiple Linear Regression (GA-MLR) and Artificial Neural Networks (ANN). The models with LFER and PaDEL descriptors, coupled with ANN, produced satisfying performance results. The fitting performance (R 2 ) of the models, using LFER and PaDEL descriptors, was 0.94 and 0.96, respectively. The Applicability Domain (AD) of the models was assessed and then the models were applied to a large number of chemical compounds with unknown values of adipose/blood partition coefficient. In conclusion, the proposed models were checked for fitting, validity and applicability. It was demonstrated that they are stable, reliable and capable to predict the values of adipose/blood partition coefficient of "data poor" chemical compounds that fall within the applicability domain. Copyright © 2017. Published by Elsevier Ltd.
Luo, Xiang; Yang, Xianhai; Qiao, Xianliang; Wang, Ya; Chen, Jingwen; Wei, Xiaoxuan; Peijnenburg, Willie J G M
2017-03-22
Reaction with hydroxyl radicals (˙OH) is an important removal pathway for organic pollutants in the aquatic environment. The aqueous reaction rate constant (k OH ) is therefore an important parameter for fate assessment of aquatic pollutants. Since experimental determination fails to meet the requirement of being able to efficiently handle numerous organic chemicals at limited cost and within a relatively short period of time, in silico methods such as quantitative structure-activity relationship (QSAR) models are needed to predict k OH . In this study, a QSAR model with a larger and wider applicability domain as compared with existing models was developed. Following the guidelines for the development and validation of QSAR models proposed by the Organization for Economic Co-operation and Development (OECD), the model shows satisfactory performance. The applicability domain of the model has been extended and contained chemicals that have rarely been covered in most previous studies. The chemicals covered in the current model contain functional groups including [double bond splayed left]C[double bond, length as m-dash]C[double bond splayed right], -C[triple bond, length as m-dash]C-, -C 6 H 5 , -OH, -CHO, -O-, [double bond splayed left]C[double bond, length as m-dash]O, -C[double bond, length as m-dash]O(O)-, -COOH, -C[triple bond, length as m-dash]N, [double bond splayed left]N-, -NH 2 , -NH-C(O)-, -NO 2 , -N[double bond, length as m-dash]C-N[double bond splayed right], [double bond splayed left]N-N[double bond splayed right], -N[double bond, length as m-dash]N-, -S-, -S-S-, -SH, -SO 3 , -SO 4 , -PO 4 , and -X (F, Cl, Br, and I).
Aiba née Kaneko, Maki; Hirota, Morihiko; Kouzuki, Hirokazu; Mori, Masaaki
2015-02-01
Genotoxicity is the most commonly used endpoint to predict the carcinogenicity of chemicals. The International Conference on Harmonization (ICH) M7 Guideline on Assessment and Control of DNA Reactive (Mutagenic) Impurities in Pharmaceuticals to Limit Potential Carcinogenic Risk offers guidance on (quantitative) structure-activity relationship ((Q)SAR) methodologies that predict the outcome of bacterial mutagenicity assay for actual and potential impurities. We examined the effectiveness of the (Q)SAR approach with the combination of DEREK NEXUS as an expert rule-based system and ADMEWorks as a statistics-based system for the prediction of not only mutagenic potential in the Ames test, but also genotoxic potential in mutagenicity and clastogenicity tests, using a data set of 342 chemicals extracted from the literature. The prediction of mutagenic potential or genotoxic potential by DEREK NEXUS or ADMEWorks showed high values of sensitivity and concordance, while prediction by the combination of DEREK NEXUS and ADMEWorks (battery system) showed the highest values of sensitivity and concordance among the three methods, but the lowest value of specificity. The number of false negatives was reduced with the battery system. We also separately predicted the mutagenic potential and genotoxic potential of 41 cosmetic ingredients listed in the International Nomenclature of Cosmetic Ingredients (INCI) among the 342 chemicals. Although specificity was low with the battery system, sensitivity and concordance were high. These results suggest that the battery system consisting of DEREK NEXUS and ADMEWorks is useful for prediction of genotoxic potential of chemicals, including cosmetic ingredients.
Holland, Erika B; Feng, Wei; Zheng, Jing; Dong, Yao; Li, Xueshu; Lehmler, Hans-Joachim; Pessah, Isaac N
2017-01-01
Nondioxin-like polychlorinated biphenyls (NDL PCBs) activate ryanodine-sensitive Ca 2+ channels (RyRs) and this activation has been associated with neurotoxicity in exposed animals. RyR-active congeners follow a distinct structure-activity relationship and a quantitative structure-activity relationship (QSAR) predicts that a large number of PCBs likely activate the receptor, which requires validation. Additionally, previous structural based conclusions have been established using receptor ligand binding assays but the impact of varying PCB structures on ion channel gating behavior is not understood. We used [ 3 H]Ryanodine ([ 3 H]Ry) binding to assess the RyR-activity of 14 previously untested PCB congeners evaluating the predictability of the QSAR. Congeners determined to display widely varying potency were then assayed with single channel voltage clamp analysis to assess direct influences on channel gating kinetics. The RyR-activity of individual PCBs assessed in in vitro assays followed the general pattern predicted by the QSAR but binding and lipid bilayer experiments demonstrated higher potency than predicted. Of the 49 congeners tested to date, tetra-ortho PCB 202 was found to be the most potent RyR-active congener increasing channel open probability at 200 pM. Shifting meta-substitutions to the para-position resulted in a > 100-fold reduction in potency as seen with PCB 197. Non-ortho PCB 11 was found to lack activity at the receptor supporting a minimum mono-ortho substitution for PCB RyR activity. These findings expand and support previous SAR assessments; where out of the 49 congeners tested to date 42 activate the receptor demonstrating that the RyR is a sensitive and common target of PCBs. © The Author 2016. Published by Oxford University Press on behalf of the Society of Toxicology. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Oszwałdowski, Sławomir; Timerbaev, Andrei R
2008-02-01
The relevance of the quantitative structure-activity relationship (QSAR) principle in MEKC and microemulsion EKC (MEEKC) of metal-ligand complexes was evaluated for a better understanding of analyte migration mechanism. A series of gallium chelates were applied as test solutes with available experimental migration data in order to reveal the molecular properties that govern the separation. The QSAR models operating with n-octanol-water partition coefficients or van der Waals volumes were found to be valid for estimation of the retention factors (log k') of neutral compounds when using only an aqueous MEEKC electrolyte. On the other hand, consistent approximations of log k' for both uncharged and charged complexes in either EKC mode (and also with hydro-organic BGEs) were achievable with two-parametric QSARs in which the dipole moment is additionally incorporated as a structural descriptor, reflecting the electrostatic solute-pseudostationary phase interaction. The theoretical analysis of significant molecular parameters in MEKC systems, in which the micellar BGE is modified with an organic solvent, confirmed that concomitant consideration of hydrophobic, electrostatic, and solvation factors is essential for explaining the migration behavior of neutral metal complexes.
Marrero-Ponce, Yovani
2004-01-01
This report describes a new set of molecular descriptors of relevance to QSAR/QSPR studies and drug design, atom linear indices fk(xi). These atomic level chemical descriptors are based on the calculation of linear maps on Rn[fk(xi): Rn--> Rn] in canonical basis. In this context, the kth power of the molecular pseudograph's atom adjacency matrix [Mk(G)] denotes the matrix of fk(xi) with respect to the canonical basis. In addition, a local-fragment (atom-type) formalism was developed. The kth atom-type linear indices are calculated by summing the kth atom linear indices of all atoms of the same atom type in the molecules. Moreover, total (whole-molecule) linear indices are also proposed. This descriptor is a linear functional (linear form) on Rn. That is, the kth total linear indices is a linear map from Rn to the scalar R[ fk(x): Rn --> R]. Thus, the kth total linear indices are calculated by summing the atom linear indices of all atoms in the molecule. The features of the kth total and local linear indices are illustrated by examples of various types of molecular structures, including chain-lengthening, branching, heteroatoms-content, and multiple bonds. Additionally, the linear independence of the local linear indices to other 0D, 1D, 2D, and 3D molecular descriptors is demonstrated by using principal component analysis for 42 very heterogeneous molecules. Much redundancy and overlapping was found among total linear indices and most of the other structural indices presently in use in the QSPR/QSAR practice. On the contrary, the information carried by atom-type linear indices was strikingly different from that codified in most of the 229 0D-3D molecular descriptors used in this study. It is concluded that the local linear indices are an independent indices containing important structural information to be used in QSPR/QSAR and drug design studies. In this sense, atom, atom-type, and total linear indices were used for the prediction of pIC50 values for the cleavage process of a set of flavone derivatives inhibitors of HIV-1 integrase. Quantitative models found are significant from a statistical point of view (R of 0.965, 0.902, and 0.927, respectively) and permit a clear interpretation of the studied properties in terms of the structural features of molecules. A LOO cross-validation procedure revealed that the regression models had a fairly good predictability (q2 of 0.679, 0.543, and 0.721, respectively). The comparison with other approaches reveals good behavior of the method proposed. The approach described in this paper appears to be an excellent alternative or guides for discovery and optimization of new lead compounds.
Li, Hongzhi; Zhong, Ziyan; Li, Lin; Gao, Rui; Cui, Jingxia; Gao, Ting; Hu, Li Hong; Lu, Yinghua; Su, Zhong-Min; Li, Hui
2015-05-30
A cascaded model is proposed to establish the quantitative structure-activity relationship (QSAR) between the overall power conversion efficiency (PCE) and quantum chemical molecular descriptors of all-organic dye sensitizers. The cascaded model is a two-level network in which the outputs of the first level (JSC, VOC, and FF) are the inputs of the second level, and the ultimate end-point is the overall PCE of dye-sensitized solar cells (DSSCs). The model combines quantum chemical methods and machine learning methods, further including quantum chemical calculations, data division, feature selection, regression, and validation steps. To improve the efficiency of the model and reduce the redundancy and noise of the molecular descriptors, six feature selection methods (multiple linear regression, genetic algorithms, mean impact value, forward selection, backward elimination, and +n-m algorithm) are used with the support vector machine. The best established cascaded model predicts the PCE values of DSSCs with a MAE of 0.57 (%), which is about 10% of the mean value PCE (5.62%). The validation parameters according to the OECD principles are R(2) (0.75), Q(2) (0.77), and Qcv2 (0.76), which demonstrate the great goodness-of-fit, predictivity, and robustness of the model. Additionally, the applicability domain of the cascaded QSAR model is defined for further application. This study demonstrates that the established cascaded model is able to effectively predict the PCE for organic dye sensitizers with very low cost and relatively high accuracy, providing a useful tool for the design of dye sensitizers with high PCE. © 2015 Wiley Periodicals, Inc.
Benigni, Romualdo; Bossa, Cecilia
2008-01-01
In the past decades, chemical carcinogenicity has been the object of mechanistic studies that have been translated into valuable experimental (e.g., the Salmonella assays system) and theoretical (e.g., compilations of structure alerts for chemical carcinogenicity) models. These findings remain the basis of the science and regulation of mutagens and carcinogens. Recent advances in the organization and treatment of large databases consisting of both biological and chemical information nowadays allows for a much easier and more refined view of data. This paper reviews recent analyses on the predictive performance of various lists of structure alerts, including a new compilation of alerts that combines previous work in an optimized form for computer implementation. The revised compilation is part of the Toxtree 1.50 software (freely available from the European Chemicals Bureau website). The use of structural alerts for the chemical biological profiling of a large database of Salmonella mutagenicity results is also reported. Together with being a repository of the science on the chemical biological interactions at the basis of chemical carcinogenicity, the SAs have a crucial role in practical applications for risk assessment, for: (a) description of sets of chemicals; (b) preliminary hazard characterization; (c) formation of categories for e.g., regulatory purposes; (d) generation of subsets of congeneric chemicals to be analyzed subsequently with QSAR methods; (e) priority setting. An important aspect of SAs as predictive toxicity tools is that they derive directly from mechanistic knowledge. The crucial role of mechanistic knowledge in the process of applying (Q)SAR considerations to risk assessment should be strongly emphasized. Mechanistic knowledge provides a ground for interaction and dialogue between model developers, toxicologists and regulators, and permits the integration of the (Q)SAR results into a wider regulatory framework, where different types of evidence and data concur or complement each other as a basis for making decisions and taking actions.
Sharma, Mukesh C; Sharma, S
2016-12-01
A series of 2-dihydro-4-quinazolin with potent highly selective inhibitors of inducible nitric oxide synthase activities was subjected to quantitative structure activity relationships (QSAR) analysis. Statistically significant equations with high correlation coefficient (r 2 = 0.8219) were developed. The k-nearest neighbor model has showed good cross-validated correlation coefficient and external validation values of 0.7866 and 0.7133, respectively. The selected electrostatic field descriptors the presence of blue ball around R1 and R4 in the quinazolinamine moiety showed electronegative groups favorable for nitric oxide synthase activity. The QSAR models may lead to the structural requirements of inducible nitric oxide compounds and help in the design of new compounds.
Dai, Yujie; Chen, Nan; Wang, Qiang; Zheng, Heng; Zhang, Xiuli; Jia, Shiru; Dong, Lilong; Feng, Dacheng
2012-01-01
The inhibitors of p53-HDM2 interaction are attractive molecules for the treatment of wild-type p53 tumors. In order to search more potent HDM2 inhibitors, docking operation with CDOCKER protocol in Discovery Studio 2.1 (DS2.1) and multidimensional hybrid quantitative structure-activity relationship (QSAR) studies through the physiochemical properties obtained from DS2.1 and E-Dragon 1.0 as descriptors, have been performed on 59 1,4-benzodiazepine- 2,5-diones which have p53-HDM2 interaction inhibitory activities. The docking results indicate that π-π interaction between the imidazole group in HIS96 and the aryl ring at 4-N of 1,4-benzodiazepine-2,5-dione may be one of the key factors for the combination of ligands with HDM2. Two QSAR models were obtained using genetic function approximation (GFA) and genetic partial least squares (G/PLS) based on the descriptors obtained from DS2.1 and E-dragon 1.0, respectively. The best model can explain 85.5% of the variance (R 2adj ) while it could predict 81.7% of the variance (R 2 cv ). With this model, the bioactivities of some new compounds were predicted. PMID:24250508
Dai, Yujie; Chen, Nan; Wang, Qiang; Zheng, Heng; Zhang, Xiuli; Jia, Shiru; Dong, Lilong; Feng, Dacheng
2012-01-01
The inhibitors of p53-HDM2 interaction are attractive molecules for the treatment of wild-type p53 tumors. In order to search more potent HDM2 inhibitors, docking operation with CDOCKER protocol in Discovery Studio 2.1 (DS2.1) and multidimensional hybrid quantitative structure-activity relationship (QSAR) studies through the physiochemical properties obtained from DS2.1 and E-Dragon 1.0 as descriptors, have been performed on 59 1,4-benzodiazepine- 2,5-diones which have p53-HDM2 interaction inhibitory activities. The docking results indicate that π-π interaction between the imidazole group in HIS96 and the aryl ring at 4-N of 1,4-benzodiazepine-2,5-dione may be one of the key factors for the combination of ligands with HDM2. Two QSAR models were obtained using genetic function approximation (GFA) and genetic partial least squares (G/PLS) based on the descriptors obtained from DS2.1 and E-dragon 1.0, respectively. The best model can explain 85.5% of the variance (R (2) adj ) while it could predict 81.7% of the variance (R (2) cv ). With this model, the bioactivities of some new compounds were predicted.
The QSAR and docking calculations of fullerene derivatives as HIV-1 protease inhibitors
NASA Astrophysics Data System (ADS)
Saleh, Noha A.
2015-02-01
The inhibition of HIV-1 protease is considered as one of the most important targets for drug design and the deactivation of HIV-1. In the present work, the fullerene surface (C60) is modified by adding oxygen atoms as well as hydroxymethylcarbonyl (HMC) groups to form 6 investigated fullerene derivative compounds. These compounds have one, two, three, four or five O atoms + HMC groups at different positions on phenyl ring. The effect of the repeating of these groups on the ability of suggested compounds to inhibit the HIV protease is studied by calculating both Quantitative Structure Activity Relationship (QSAR) properties and docking simulation. Based on the QSAR descriptors, the solubility and the hydrophilicity of studied fullerene derivatives increased with increasing the number of oxygen atoms + HMC groups in the compound. While docking calculations indicate that, the compound with two oxygen atoms + HMC groups could interact and binds with HIV-1 protease active site. This is could be attributed to the active site residues of HIV-1 protease are hydrophobic except the two aspartic acids. So that, the increase in the hydrophilicity and polarity of the compound is preventing and/or decreasing the hydrophobic interaction between the compound and HIV-1 protease active site.
Toxicity challenges in environmental chemicals: Prediction of ...
Physiologically based pharmacokinetic (PBPK) models bridge the gap between in vitro assays and in vivo effects by accounting for the adsorption, distribution, metabolism, and excretion of xenobiotics, which is especially useful in the assessment of human toxicity. Quantitative structure-activity relationships (QSAR) serve as a vital tool for the high-throughput prediction of chemical-specific PBPK parameters, such as the fraction of a chemical unbound by plasma protein (Fub). The presented work explores the merit of utilizing experimental pharmaceutical Fub data for the construction of a universal QSAR model, in order to compensate for the limited range of high-quality experimental Fub data for environmentally relevant chemicals, such as pollutants, pesticides, and consumer products. Independent QSAR models were constructed with three machine-learning algorithms, k nearest neighbors (kNN), random forest (RF), and support vector machine (SVM) regression, from a large pharmaceutical training set (~1000) and assessed with independent test sets of pharmaceuticals (~200) and environmentally relevant chemicals in the ToxCast program (~400). Small descriptor sets yielded the optimal balance of model complexity and performance, providing insight into the biochemical factors of plasma protein binding, while preventing over fitting to the training set. Overlaps in chemical space between pharmaceutical and environmental compounds were considered through applicability of do
Nolte, Tom M; Pinto-Gil, Kevin; Hendriks, A Jan; Ragas, Ad M J; Pastor, Manuel
2018-01-24
Microbial biomass and acclimation can affect the removal of organic chemicals in natural surface waters. In order to account for these effects and develop more robust models for biodegradation, we have compiled and curated removal data for un-acclimated (pristine) surface waters on which we developed quantitative structure-activity relationships (QSARs). Global analysis of the very heterogeneous dataset including neutral, anionic, cationic and zwitterionic chemicals (N = 233) using a random forest algorithm showed that useful predictions were possible (Q ext 2 = 0.4-0.5) though relatively large standard errors were associated (SDEP ∼0.7). Classification of the chemicals based on speciation state and metabolic pathway showed that biodegradation is influenced by the two, and that the dependence of biodegradation on chemical characteristics is non-linear. Class-specific QSAR analysis indicated that shape and charge distribution determine the biodegradation of neutral chemicals (R 2 ∼ 0.6), e.g. through membrane permeation or binding to P450 enzymes, whereas the average biodegradation of charged chemicals is 1 to 2 orders of magnitude lower, for which degradation depends more directly on cellular uptake (R 2 ∼ 0.6). Further analysis showed that specific chemical classes such as peptides and organic halogens are relatively less biodegradable in pristine surface waters, resulting in the need for the microbial consortia to acclimate. Additional literature data was used to verify an acclimation model (based on Monod-type kinetics) capable of extrapolating QSAR predictions to acclimating conditions such as in water treatment, downstream lakes and large rivers under μg L -1 to mg L -1 concentrations. The framework developed, despite being based on multiple assumptions, is promising and needs further validation using experimentation with more standardised and homogenised conditions as well as adequate characterization of the inoculum used.
A Java-based web service is being developed within the US EPA’s Chemistry Dashboard to provide real time estimates of toxicity values and physical properties. WebTEST can generate toxicity predictions directly from a simple URL which includes the endpoint, QSAR method, and ...
A Java-based web service is being developed within the US EPA’s Chemistry Dashboard to provide real time estimates of toxicity values and physical properties. WebTEST can generate toxicity predictions directly from a simple URL which includes the endpoint, QSAR method, and ...
2008-02-01
West Society of Toxicology in Breckenridge, CO in September 2007: “Identification of Curcumin Analogs Toxic against Prostate Cancer Cells Through...quantitative structure-activity relationship ( QSAR ) and ligand-based virtual screening (LBVS) to explore the possibility of improving their efficacy...Student in my laboratory has presented part of this data at the 25th Annual Meeting of the Mountain West Society of Toxicology in Breckenridge, CO in
Predicting the bioconcentration factor of highly hydrophobic organic chemicals.
Garg, Rajni; Smith, Carr J
2014-07-01
Bioconcentration refers to the process of uptake and buildup of chemicals in living organisms. Experimental measurement of bioconcentration factor (BCF) is time-consuming and expensive, and is not feasible for a large number of chemicals of regulatory concern. Quantitative structure-activity relationship (QSAR) models are used for estimating BCF values to help in risk assessment of a chemical. This paper presents the results of a QSAR study conducted to address an important problem encountered in the prediction of the BCF of highly hydrophobic chemicals. A new QSAR model is derived using a dataset of diverse organic chemicals previously tested in a United States Environmental Protection Agency laboratory. It is noted that the linear relationship between the BCF and hydrophobic parameter, i.e., calculated octanol-water partition coefficient (ClogP), breaks down for highly hydrophobic chemicals. The parabolic QSAR equation, log BCF=3.036 ClogP-0.197 ClogP(2)-0.808 MgVol (n=28, r(2)=0.817, q(2)=0.761, s=0.558) (experimental log BCF range=0.44-5.29, ClogP range=3.16-11.27), suggests that a non-linear relationship between BCF and the hydrophobic parameter, along with inclusion of additional molecular size, weight and/or volume parameters, should be considered while developing a QSAR model for more reliable prediction of the BCF of highly hydrophobic chemicals. Copyright © 2014 The Authors. Published by Elsevier Ltd.. All rights reserved.
Vijayan, R S K; Ghoshal, Nanda
2008-10-01
Given the heterogeneity of GABA(A) receptor, the pharmacological significance of identifying subtype selective modulators is increasingly being recognized. Thus, drugs selective for GABA(A) alpha(3) receptors are expected to display fewer side effects than the drugs presently in clinical use. Hence we carried out 3D QSAR (three-dimensional quantitative structure-activity relationship) studies on a series of novel GABA(A) alpha(3) subtype selective modulators to gain more insight into subtype affinity. To identify the 3D functional attributes required for subtype selectivity, a chemical feature-based pharmacophore, primarily based on selective ligands representing diverse structural classes was generated. The obtained pseudo receptor model of the benzodiazepine binding site revealed a binding mode akin to "Message-Address" concept. Scaffold hopping was carried out across multi-conformational May Bridge database for the identification of novel chemotypes. Further a focused data reduction approach was employed to choose a subset of enriched compounds based on "Drug likeness" and "Similarity-based" methods. These results taken together could provide impetus for rational design and optimization of more selective and high affinity leads with a potential to have decreased adverse effects.
Code of Federal Regulations, 2012 CFR
2012-07-01
... Species Act. Existing Great Lakes discharger is any building, structure, facility, or installation from... discharger is any building, structure, facility, or installation from which there is or may be a “discharge... monitoring of the contaminant. Quantitative structure activity relationship (QSAR) or structure activity...
Quantitative structure-activity relationship models that stand the test of time.
Davis, Andrew M; Wood, David J
2013-04-01
The pharmaceutical industry is in a period of intense change. While this has many drivers, attrition through the development process continues to be an important pressure. The emerging definitions of "compound quality" that are based on retrospective analyses of developmental attrition have highlighted a new direction for medicinal chemistry and the paradigm of "quality at the point of design". The time has come for retrospective analyses to catalyze prospective action. Quality at the point of design places pressure on the quality of our predictive models. Empirical QSAR models when built with care provide true predictive control, but their accuracy and precision can be improved. Here we describe AstraZeneca's experience of automation in QSAR model building and validation, and how an informatics system can provide a step-change in predictive power to project design teams, if they choose to use it.
Kleandrova, Valeria V; Luan, Feng; Speck-Planche, Alejandro; Cordeiro, M Natália D S
2015-01-01
The assessment of acute toxicity is one of the most important stages to ensure the safety of chemicals with potential applications in pharmaceutical sciences, biomedical research, or any other industrial branch. A huge and indiscriminate number of toxicity assays have been carried out on laboratory animals. In this sense, computational approaches involving models based on quantitative-structure activity/toxicity relationships (QSAR/QSTR) can help to rationalize time and financial costs. Here, we discuss the most significant advances in the last 6 years focused on the use of QSAR/QSTR models to predict acute toxicity of drugs/chemicals in laboratory animals, employing large and heterogeneous datasets. The advantages and drawbacks of the different QSAR/QSTR models are analyzed. As a contribution to the field, we introduce the first multitasking (mtk) QSTR model for simultaneous prediction of acute toxicity of compounds by considering different routes of administration, diverse breeds of laboratory animals, and the reliability of the experimental conditions. The mtk-QSTR model was based on artificial neural networks (ANN), allowing the classification of compounds as toxic or non-toxic. This model correctly classified more than 94% of the 1646 cases present in the whole dataset, and its applicability was demonstrated by performing predictions of different chemicals such as drugs, dietary supplements, and molecules which could serve as nanocarriers for drug delivery. The predictions given by the mtk-QSTR model are in very good agreement with the experimental results.
Quantitative structure-cytotoxicity relationship of piperic acid amides.
Shimada, Chiyako; Uesawa, Yoshihiro; Ishihara, Mariko; Kagaya, Hajime; Kanamoto, Taisei; Terakubo, Shigemi; Nakashima, Hideki; Takao, Koichi; Miyashiro, Takaki; Sugita, Yoshiaki; Sakagami, Hiroshi
2014-09-01
A total of 12 piperic acid amides, including piperine, were subjected to quantitative structure-activity relationship (QSAR) analysis, based on their cytotoxicity, tumor selectivity and anti-HIV activity, in order to find new biological activities. Cytotoxicity against four human oral squamous cell carcinoma (OSCC) cell lines and three human oral normal cells was determined by the 3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyltetrazolium bromide (MTT) method. Tumor selectivity was evaluated by the ratio of the mean 50% cytotoxic concentration (CC50) against normal oral cells to that against OSCC cell lines. Anti-HIV activity was evaluated by the ratio of the CC50 to 50% HIV infection-cytoprotective concentration (EC50). Physicochemical, structural, and quantum-chemical parameters were calculated based on the conformations optimized by LowModeMD method followed by density functional theory method. All compounds showed low-to-moderate tumor selectivity, but no anti-HIV activity. N-Piperoyldopamine ( 8: ) which has a catechol moiety, showed the highest tumor selectivity, possibly due to its unique molecular shape and electrostatic interaction, especially its largest partial equalization of orbital electronegativities and vsurf descriptors. The present study suggests that molecular shape and ability for electrostatic interaction are useful parameters for estimating the tumor selectivity of piperic acid amides. Copyright© 2014 International Institute of Anticancer Research (Dr. John G. Delinassios), All rights reserved.