small molecule databases: Topics by Science.gov

Sample records for small molecule databases

Inforna 2.0: A Platform for the Sequence-Based Design of Small Molecules Targeting Structured RNAs.

PubMed

Disney, Matthew D; Winkelsas, Audrey M; Velagapudi, Sai Pradeep; Southern, Mark; Fallahi, Mohammad; Childs-Disney, Jessica L

2016-06-17

The development of small molecules that target RNA is challenging yet, if successful, could advance the development of chemical probes to study RNA function or precision therapeutics to treat RNA-mediated disease. Previously, we described Inforna, an approach that can mine motifs (secondary structures) within target RNAs, which is deduced from the RNA sequence, and compare them to a database of known RNA motif-small molecule binding partners. Output generated by Inforna includes the motif found in both the database and the desired RNA target, lead small molecules for that target, and other related meta-data. Lead small molecules can then be tested for binding and affecting cellular (dys)function. Herein, we describe Inforna 2.0, which incorporates all known RNA motif-small molecule binding partners reported in the scientific literature, a chemical similarity searching feature, and an improved user interface and is freely available via an online web server. By incorporation of interactions identified by other laboratories, the database has been doubled, containing 1936 RNA motif-small molecule interactions, including 244 unique small molecules and 1331 motifs. Interestingly, chemotype analysis of the compounds that bind RNA in the database reveals features in small molecule chemotypes that are privileged for binding. Further, this updated database expanded the number of cellular RNAs to which lead compounds can be identified.
Internet Databases of the Properties, Enzymatic Reactions, and Metabolism of Small Molecules-Search Options and Applications in Food Science.

PubMed

Minkiewicz, Piotr; Darewicz, Małgorzata; Iwaniak, Anna; Bucholska, Justyna; Starowicz, Piotr; Czyrko, Emilia

2016-12-06

Internet databases of small molecules, their enzymatic reactions, and metabolism have emerged as useful tools in food science. Database searching is also introduced as part of chemistry or enzymology courses for food technology students. Such resources support the search for information about single compounds and facilitate the introduction of secondary analyses of large datasets. Information can be retrieved from databases by searching for the compound name or structure, annotating with the help of chemical codes or drawn using molecule editing software. Data mining options may be enhanced by navigating through a network of links and cross-links between databases. Exemplary databases reviewed in this article belong to two classes: tools concerning small molecules (including general and specialized databases annotating food components) and tools annotating enzymes and metabolism. Some problems associated with database application are also discussed. Data summarized in computer databases may be used for calculation of daily intake of bioactive compounds, prediction of metabolism of food components, and their biological activity as well as for prediction of interactions between food component and drugs.
A prototypic small molecule database for bronchoalveolar lavage-based metabolomics

NASA Astrophysics Data System (ADS)

Walmsley, Scott; Cruickshank-Quinn, Charmion; Quinn, Kevin; Zhang, Xing; Petrache, Irina; Bowler, Russell P.; Reisdorph, Richard; Reisdorph, Nichole

2018-04-01

The analysis of bronchoalveolar lavage fluid (BALF) using mass spectrometry-based metabolomics can provide insight into lung diseases, such as asthma. However, the important step of compound identification is hindered by the lack of a small molecule database that is specific for BALF. Here we describe prototypic, small molecule databases derived from human BALF samples (n=117). Human BALF was extracted into lipid and aqueous fractions and analyzed using liquid chromatography mass spectrometry. Following filtering to reduce contaminants and artifacts, the resulting BALF databases (BALF-DBs) contain 11,736 lipid and 658 aqueous compounds. Over 10% of these were found in 100% of samples. Testing the BALF-DBs using nested test sets produced a 99% match rate for lipids and 47% match rate for aqueous molecules. Searching an independent dataset resulted in 45% matching to the lipid BALF-DB compared to<25% when general databases are searched. The BALF-DBs are available for download from MetaboLights. Overall, the BALF-DBs can reduce false positives and improve confidence in compound identification compared to when general databases are used.
Internet Databases of the Properties, Enzymatic Reactions, and Metabolism of Small Molecules—Search Options and Applications in Food Science

PubMed Central

Minkiewicz, Piotr; Darewicz, Małgorzata; Iwaniak, Anna; Bucholska, Justyna; Starowicz, Piotr; Czyrko, Emilia

2016-01-01

Internet databases of small molecules, their enzymatic reactions, and metabolism have emerged as useful tools in food science. Database searching is also introduced as part of chemistry or enzymology courses for food technology students. Such resources support the search for information about single compounds and facilitate the introduction of secondary analyses of large datasets. Information can be retrieved from databases by searching for the compound name or structure, annotating with the help of chemical codes or drawn using molecule editing software. Data mining options may be enhanced by navigating through a network of links and cross-links between databases. Exemplary databases reviewed in this article belong to two classes: tools concerning small molecules (including general and specialized databases annotating food components) and tools annotating enzymes and metabolism. Some problems associated with database application are also discussed. Data summarized in computer databases may be used for calculation of daily intake of bioactive compounds, prediction of metabolism of food components, and their biological activity as well as for prediction of interactions between food component and drugs. PMID:27929431
SM-TF: A structural database of small molecule-transcription factor complexes.

PubMed

Xu, Xianjin; Ma, Zhiwei; Sun, Hongmin; Zou, Xiaoqin

2016-06-30

Transcription factors (TFs) are the proteins involved in the transcription process, ensuring the correct expression of specific genes. Numerous diseases arise from the dysfunction of specific TFs. In fact, over 30 TFs have been identified as therapeutic targets of about 9% of the approved drugs. In this study, we created a structural database of small molecule-transcription factor (SM-TF) complexes, available online at http://zoulab.dalton.missouri.edu/SM-TF. The 3D structures of the co-bound small molecule and the corresponding binding sites on TFs are provided in the database, serving as a valuable resource to assist structure-based drug design related to TFs. Currently, the SM-TF database contains 934 entries covering 176 TFs from a variety of species. The database is further classified into several subsets by species and organisms. The entries in the SM-TF database are linked to the UniProt database and other sequence-based TF databases. Furthermore, the druggable TFs from human and the corresponding approved drugs are linked to the DrugBank. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Ligand.Info small-molecule Meta-Database.

PubMed

von Grotthuss, Marcin; Koczyk, Grzegorz; Pas, Jakub; Wyrwicz, Lucjan S; Rychlewski, Leszek

2004-12-01

Ligand.Info is a compilation of various publicly available databases of small molecules. The total size of the Meta-Database is over 1 million entries. The compound records contain calculated three-dimensional coordinates and sometimes information about biological activity. Some molecules have information about FDA drug approving status or about anti-HIV activity. Meta-Database can be downloaded from the http://Ligand.Info web page. The database can also be screened using a Java-based tool. The tool can interactively cluster sets of molecules on the user side and automatically download similar molecules from the server. The application requires the Java Runtime Environment 1.4 or higher, which can be automatically downloaded from Sun Microsystems or Apple Computer and installed during the first use of Ligand.Info on desktop systems, which support Java (Ms Windows, Mac OS, Solaris, and Linux). The Ligand.Info Meta-Database can be used for virtual high-throughput screening of new potential drugs. Presented examples showed that using a known antiviral drug as query the system was able to find others antiviral drugs and inhibitors.
ChemBank: a small-molecule screening and cheminformatics resource database.

PubMed

Seiler, Kathleen Petri; George, Gregory A; Happ, Mary Pat; Bodycombe, Nicole E; Carrinski, Hyman A; Norton, Stephanie; Brudz, Steve; Sullivan, John P; Muhlich, Jeremy; Serrano, Martin; Ferraiolo, Paul; Tolliday, Nicola J; Schreiber, Stuart L; Clemons, Paul A

2008-01-01

ChemBank (http://chembank.broad.harvard.edu/) is a public, web-based informatics environment developed through a collaboration between the Chemical Biology Program and Platform at the Broad Institute of Harvard and MIT. This knowledge environment includes freely available data derived from small molecules and small-molecule screens and resources for studying these data. ChemBank is unique among small-molecule databases in its dedication to the storage of raw screening data, its rigorous definition of screening experiments in terms of statistical hypothesis testing, and its metadata-based organization of screening experiments into projects involving collections of related assays. ChemBank stores an increasingly varied set of measurements derived from cells and other biological assay systems treated with small molecules. Analysis tools are available and are continuously being developed that allow the relationships between small molecules, cell measurements, and cell states to be studied. Currently, ChemBank stores information on hundreds of thousands of small molecules and hundreds of biomedically relevant assays that have been performed at the Broad Institute by collaborators from the worldwide research community. The goal of ChemBank is to provide life scientists unfettered access to biomedically relevant data and tools heretofore available primarily in the private sector.
Identification of small molecules capable of regulating conformational changes of telomeric G-quadruplex

NASA Astrophysics Data System (ADS)

Chen, Shuo-Bin; Liu, Guo-Cai; Gu, Lian-Quan; Huang, Zhi-Shu; Tan, Jia-Heng

2018-02-01

Design of small molecules targeted at human telomeric G-quadruplex DNA is an extremely active research area. Interestingly, the telomeric G-quadruplex is a highly polymorphic structure. Changes in its conformation upon small molecule binding may be a powerful method to achieve a desired biological effect. However, the rational development of small molecules capable of regulating conformational change of telomeric G-quadruplex structures is still challenging. In this study, we developed a reliable ligand-based pharmacophore model based on isaindigotone derivatives with conformational change activity toward telomeric G-quadruplex DNA. Furthermore, virtual screening of database was conducted using this pharmacophore model and benzopyranopyrimidine derivatives in the database were identified as a strong inducer of the telomeric G-quadruplex DNA conformation, transforming it from hybrid-type structure to parallel structure.
Exploring Chemical Space for Drug Discovery Using the Chemical Universe Database

PubMed Central

2012-01-01

Herein we review our recent efforts in searching for bioactive ligands by enumeration and virtual screening of the unknown chemical space of small molecules. Enumeration from first principles shows that almost all small molecules (>99.9%) have never been synthesized and are still available to be prepared and tested. We discuss open access sources of molecules, the classification and representation of chemical space using molecular quantum numbers (MQN), its exhaustive enumeration in form of the chemical universe generated databases (GDB), and examples of using these databases for prospective drug discovery. MQN-searchable GDB, PubChem, and DrugBank are freely accessible at www.gdb.unibe.ch. PMID:23019491
Validation and extraction of molecular-geometry information from small-molecule databases.

PubMed

Long, Fei; Nicholls, Robert A; Emsley, Paul; Graǽulis, Saulius; Merkys, Andrius; Vaitkus, Antanas; Murshudov, Garib N

2017-02-01

A freely available small-molecule structure database, the Crystallography Open Database (COD), is used for the extraction of molecular-geometry information on small-molecule compounds. The results are used for the generation of new ligand descriptions, which are subsequently used by macromolecular model-building and structure-refinement software. To increase the reliability of the derived data, and therefore the new ligand descriptions, the entries from this database were subjected to very strict validation. The selection criteria made sure that the crystal structures used to derive atom types, bond and angle classes are of sufficiently high quality. Any suspicious entries at a crystal or molecular level were removed from further consideration. The selection criteria included (i) the resolution of the data used for refinement (entries solved at 0.84 Å resolution or higher) and (ii) the structure-solution method (structures must be from a single-crystal experiment and all atoms of generated molecules must have full occupancies), as well as basic sanity checks such as (iii) consistency between the valences and the number of connections between atoms, (iv) acceptable bond-length deviations from the expected values and (v) detection of atomic collisions. The derived atom types and bond classes were then validated using high-order moment-based statistical techniques. The results of the statistical analyses were fed back to fine-tune the atom typing. The developed procedure was repeated four times, resulting in fine-grained atom typing, bond and angle classes. The procedure will be repeated in the future as and when new entries are deposited in the COD. The whole procedure can also be applied to any source of small-molecule structures, including the Cambridge Structural Database and the ZINC database.
FreeSolv: A database of experimental and calculated hydration free energies, with input files

PubMed Central

Mobley, David L.; Guthrie, J. Peter

2014-01-01

This work provides a curated database of experimental and calculated hydration free energies for small neutral molecules in water, along with molecular structures, input files, references, and annotations. We call this the Free Solvation Database, or FreeSolv. Experimental values were taken from prior literature and will continue to be curated, with updated experimental references and data added as they become available. Calculated values are based on alchemical free energy calculations using molecular dynamics simulations. These used the GAFF small molecule force field in TIP3P water with AM1-BCC charges. Values were calculated with the GROMACS simulation package, with full details given in references cited within the database itself. This database builds in part on a previous, 504-molecule database containing similar information. However, additional curation of both experimental data and calculated values has been done here, and the total number of molecules is now up to 643. Additional information is now included in the database, such as SMILES strings, PubChem compound IDs, accurate reference DOIs, and others. One version of the database is provided in the Supporting Information of this article, but as ongoing updates are envisioned, the database is now versioned and hosted online. In addition to providing the database, this work describes its construction process. The database is available free-of-charge via http://www.escholarship.org/uc/item/6sd403pz. PMID:24928188
Heterogeneous Biomedical Database Integration Using a Hybrid Strategy: A p53 Cantcer Research Database

PubMed Central

Bichutskiy, Vadim Y.; Colman, Richard; Brachmann, Rainer K.; Lathrop, Richard H.

2006-01-01

Complex problems in life science research give rise to multidisciplinary collaboration, and hence, to the need for heterogeneous database integration. The tumor suppressor p53 is mutated in close to 50% of human cancers, and a small drug-like molecule with the ability to restore native function to cancerous p53 mutants is a long-held medical goal of cancer treatment. The Cancer Research DataBase (CRDB) was designed in support of a project to find such small molecules. As a cancer informatics project, the CRDB involved small molecule data, computational docking results, functional assays, and protein structure data. As an example of the hybrid strategy for data integration, it combined the mediation and data warehousing approaches. This paper uses the CRDB to illustrate the hybrid strategy as a viable approach to heterogeneous data integration in biomedicine, and provides a design method for those considering similar systems. More efficient data sharing implies increased productivity, and, hopefully, improved chances of success in cancer research. (Code and database schemas are freely downloadable, http://www.igb.uci.edu/research/research.html.) PMID:19458771
EDULISS: a small-molecule database with data-mining and pharmacophore searching capabilities

PubMed Central

Hsin, Kun-Yi; Morgan, Hugh P.; Shave, Steven R.; Hinton, Andrew C.; Taylor, Paul; Walkinshaw, Malcolm D.

2011-01-01

We present the relational database EDULISS (EDinburgh University Ligand Selection System), which stores structural, physicochemical and pharmacophoric properties of small molecules. The database comprises a collection of over 4 million commercially available compounds from 28 different suppliers. A user-friendly web-based interface for EDULISS (available at http://eduliss.bch.ed.ac.uk/) has been established providing a number of data-mining possibilities. For each compound a single 3D conformer is stored along with over 1600 calculated descriptor values (molecular properties). A very efficient method for unique compound recognition, especially for a large scale database, is demonstrated by making use of small subgroups of the descriptors. Many of the shape and distance descriptors are held as pre-calculated bit strings permitting fast and efficient similarity and pharmacophore searches which can be used to identify families of related compounds for biological testing. Two ligand searching applications are given to demonstrate how EDULISS can be used to extract families of molecules with selected structural and biophysical features. PMID:21051336
NALDB: nucleic acid ligand database for small molecules targeting nucleic acid

PubMed Central

Kumar Mishra, Subodh; Kumar, Amit

2016-01-01

Nucleic acid ligand database (NALDB) is a unique database that provides detailed information about the experimental data of small molecules that were reported to target several types of nucleic acid structures. NALDB is the first ligand database that contains ligand information for all type of nucleic acid. NALDB contains more than 3500 ligand entries with detailed pharmacokinetic and pharmacodynamic information such as target name, target sequence, ligand 2D/3D structure, SMILES, molecular formula, molecular weight, net-formal charge, AlogP, number of rings, number of hydrogen bond donor and acceptor, potential energy along with their Ki, Kd, IC50 values. All these details at single platform would be helpful for the development and betterment of novel ligands targeting nucleic acids that could serve as a potential target in different diseases including cancers and neurological disorders. With maximum 255 conformers for each ligand entry, our database is a multi-conformer database and can facilitate the virtual screening process. NALDB provides powerful web-based search tools that make database searching efficient and simplified using option for text as well as for structure query. NALDB also provides multi-dimensional advanced search tool which can screen the database molecules on the basis of molecular properties of ligand provided by database users. A 3D structure visualization tool has also been included for 3D structure representation of ligands. NALDB offers an inclusive pharmacological information and the structurally flexible set of small molecules with their three-dimensional conformers that can accelerate the virtual screening and other modeling processes and eventually complement the nucleic acid-based drug discovery research. NALDB can be routinely updated and freely available on bsbe.iiti.ac.in/bsbe/naldb/HOME.php. Database URL: http://bsbe.iiti.ac.in/bsbe/naldb/HOME.php PMID:26896846
SMMRNA: a database of small molecule modulators of RNA

PubMed Central

Mehta, Ankita; Sonam, Surabhi; Gouri, Isha; Loharch, Saurabh; Sharma, Deepak K.; Parkesh, Raman

2014-01-01

We have developed SMMRNA, an interactive database, available at http://www.smmrna.org, with special focus on small molecule ligands targeting RNA. Currently, SMMRNA consists of ∼770 unique ligands along with structural images of RNA molecules. Each ligand in the SMMRNA contains information such as Kd, Ki, IC50, ΔTm, molecular weight (MW), hydrogen donor and acceptor count, XlogP, number of rotatable bonds, number of aromatic rings and 2D and 3D structures. These parameters can be explored using text search, advanced search, substructure and similarity-based analysis tools that are embedded in SMMRNA. A structure editor is provided for 3D visualization of ligands. Advance analysis can be performed using substructure and OpenBabel-based chemical similarity fingerprints. Upload facility for both RNA and ligands is also provided. The physicochemical properties of the ligands were further examined using OpenBabel descriptors, hierarchical clustering, binning partition and multidimensional scaling. We have also generated a 3D conformation database of ligands to support the structure and ligand-based screening. SMMRNA provides comprehensive resource for further design, development and refinement of small molecule modulators for selective targeting of RNA molecules. PMID:24163098
Chembank | Office of Cancer Genomics

Cancer.gov

Funded in large part by the Initiative for Chemical Genetics (ICG), Chembank is an interactive database for small molecules. It contains data from hundreds of biomedically relevant small molecule screens that involved hundreds-of-thousands of compounds. Chembank also provides analysis tools to facilitate data mining.
NALDB: nucleic acid ligand database for small molecules targeting nucleic acid.

PubMed

Kumar Mishra, Subodh; Kumar, Amit

2016-01-01

Nucleic acid ligand database (NALDB) is a unique database that provides detailed information about the experimental data of small molecules that were reported to target several types of nucleic acid structures. NALDB is the first ligand database that contains ligand information for all type of nucleic acid. NALDB contains more than 3500 ligand entries with detailed pharmacokinetic and pharmacodynamic information such as target name, target sequence, ligand 2D/3D structure, SMILES, molecular formula, molecular weight, net-formal charge, AlogP, number of rings, number of hydrogen bond donor and acceptor, potential energy along with their Ki, Kd, IC50 values. All these details at single platform would be helpful for the development and betterment of novel ligands targeting nucleic acids that could serve as a potential target in different diseases including cancers and neurological disorders. With maximum 255 conformers for each ligand entry, our database is a multi-conformer database and can facilitate the virtual screening process. NALDB provides powerful web-based search tools that make database searching efficient and simplified using option for text as well as for structure query. NALDB also provides multi-dimensional advanced search tool which can screen the database molecules on the basis of molecular properties of ligand provided by database users. A 3D structure visualization tool has also been included for 3D structure representation of ligands. NALDB offers an inclusive pharmacological information and the structurally flexible set of small molecules with their three-dimensional conformers that can accelerate the virtual screening and other modeling processes and eventually complement the nucleic acid-based drug discovery research. NALDB can be routinely updated and freely available on bsbe.iiti.ac.in/bsbe/naldb/HOME.php. Database URL: http://bsbe.iiti.ac.in/bsbe/naldb/HOME.php. © The Author(s) 2016. Published by Oxford University Press.
SInCRe—structural interactome computational resource for Mycobacterium tuberculosis

PubMed Central

Metri, Rahul; Hariharaputran, Sridhar; Ramakrishnan, Gayatri; Anand, Praveen; Raghavender, Upadhyayula S.; Ochoa-Montaño, Bernardo; Higueruelo, Alicia P.; Sowdhamini, Ramanathan; Chandra, Nagasuma R.; Blundell, Tom L.; Srinivasan, Narayanaswamy

2015-01-01

We have developed an integrated database for Mycobacterium tuberculosis H37Rv (Mtb) that collates information on protein sequences, domain assignments, functional annotation and 3D structural information along with protein–protein and protein–small molecule interactions. SInCRe (Structural Interactome Computational Resource) is developed out of CamBan (Cambridge and Bangalore) collaboration. The motivation for development of this database is to provide an integrated platform to allow easily access and interpretation of data and results obtained by all the groups in CamBan in the field of Mtb informatics. In-house algorithms and databases developed independently by various academic groups in CamBan are used to generate Mtb-specific datasets and are integrated in this database to provide a structural dimension to studies on tuberculosis. The SInCRe database readily provides information on identification of functional domains, genome-scale modelling of structures of Mtb proteins and characterization of the small-molecule binding sites within Mtb. The resource also provides structure-based function annotation, information on small-molecule binders including FDA (Food and Drug Administration)-approved drugs, protein–protein interactions (PPIs) and natural compounds that bind to pathogen proteins potentially and result in weakening or elimination of host–pathogen protein–protein interactions. Together they provide prerequisites for identification of off-target binding. Database URL: http://proline.biochem.iisc.ernet.in/sincre PMID:26130660
Small molecule mimics of DFTamP1, a database designed anti-Staphylococcal peptide

PubMed Central

Dong, Yuxiang; Lushnikova, Tamara; Golla, Radha M.; Wang, Xiaofang; Wang, Guangshun

2017-01-01

Antimicrobial peptides (AMPs) are important templates for developing new antimicrobial agents. Previously, we developed a database filtering technology that enabled us to design a potent anti-Staphylococcal peptide DFTamP1. Using this same design approach, we now report the discovery of a new class of bis-indole diimidazolines as AMP small molecule mimics. The best compound killed multiple S. aureus clinical strains in both planktonic and biofilm forms. The compound appeared to target bacterial membranes with antimicrobial activity and membrane permeation ability similar to daptomycin. PMID:28011203
DG-AMMOS: a new tool to generate 3d conformation of small molecules using distance geometry and automated molecular mechanics optimization for in silico screening.

PubMed

Lagorce, David; Pencheva, Tania; Villoutreix, Bruno O; Miteva, Maria A

2009-11-13

Discovery of new bioactive molecules that could enter drug discovery programs or that could serve as chemical probes is a very complex and costly endeavor. Structure-based and ligand-based in silico screening approaches are nowadays extensively used to complement experimental screening approaches in order to increase the effectiveness of the process and facilitating the screening of thousands or millions of small molecules against a biomolecular target. Both in silico screening methods require as input a suitable chemical compound collection and most often the 3D structure of the small molecules has to be generated since compounds are usually delivered in 1D SMILES, CANSMILES or in 2D SDF formats. Here, we describe the new open source program DG-AMMOS which allows the generation of the 3D conformation of small molecules using Distance Geometry and their energy minimization via Automated Molecular Mechanics Optimization. The program is validated on the Astex dataset, the ChemBridge Diversity database and on a number of small molecules with known crystal structures extracted from the Cambridge Structural Database. A comparison with the free program Balloon and the well-known commercial program Omega generating the 3D of small molecules is carried out. The results show that the new free program DG-AMMOS is a very efficient 3D structure generator engine. DG-AMMOS provides fast, automated and reliable access to the generation of 3D conformation of small molecules and facilitates the preparation of a compound collection prior to high-throughput virtual screening computations. The validation of DG-AMMOS on several different datasets proves that generated structures are generally of equal quality or sometimes better than structures obtained by other tested methods.

A structural examination and collision cross section database for over 500 metabolites and xenobiotics using drift tube ion mobility spectrometry† †Electronic supplementary information (ESI) available. See DOI: 10.1039/c7sc03464d

PubMed Central

Zheng, Xueyun; Aly, Noor A.; Zhou, Yuxuan; Dupuis, Kevin T.; Bilbao, Aivett; Paurus, Vanessa L.; Orton, Daniel J.; Wilson, Ryan; Payne, Samuel H.; Smith, Richard D.

2017-01-01

The confident identification of metabolites and xenobiotics in biological and environmental studies is an analytical challenge due to their immense dynamic range, vast chemical space and structural diversity. Ion mobility spectrometry (IMS) is widely used for small molecule analyses since it can separate isomeric species and be easily coupled with front end separations and mass spectrometry for multidimensional characterizations. However, to date IMS metabolomic and exposomic studies have been limited by an inadequate number of accurate collision cross section (CCS) values for small molecules, causing features to be detected but not confidently identified. In this work, we utilized drift tube IMS (DTIMS) to directly measure CCS values for over 500 small molecules including primary metabolites, secondary metabolites and xenobiotics. Since DTIMS measurements do not need calibrant ions or calibration like some other IMS techniques, they avoid calibration errors which can cause problems in distinguishing structurally similar molecules. All measurements were performed in triplicate in both positive and negative polarities with nitrogen gas and seven different electric fields, so that relative standard deviations (RSD) could be assessed for each molecule and structural differences studied. The primary metabolites analyzed to date have come from key metabolism pathways such as glycolysis, the pentose phosphate pathway and the tricarboxylic acid cycle, while the secondary metabolites consisted of classes such as terpenes and flavonoids, and the xenobiotics represented a range of molecules from antibiotics to polycyclic aromatic hydrocarbons. Different CCS trends were observed for several of the diverse small molecule classes and when urine features were matched to the database, the addition of the IMS dimension greatly reduced the possible number of candidate molecules. This CCS database and structural information are freely available for download at http://panomics.pnnl.gov/metabolites/ with new molecules being added frequently. PMID:29568436
Psmir: a database of potential associations between small molecules and miRNAs.

PubMed

Meng, Fanlin; Wang, Jing; Dai, Enyu; Yang, Feng; Chen, Xiaowen; Wang, Shuyuan; Yu, Xuexin; Liu, Dianming; Jiang, Wei

2016-01-13

miRNAs are key post-transcriptional regulators of many essential biological processes, and their dysregulation has been validated in almost all human cancers. Restoring aberrantly expressed miRNAs might be a novel therapeutics. Recently, many studies have demonstrated that small molecular compounds can affect miRNA expression. Thus, prediction of associations between small molecules and miRNAs is important for investigation of miRNA-targeted drugs. Here, we analyzed 39 miRNA-perturbed gene expression profiles, and then calculated the similarity of transcription responses between miRNA perturbation and drug treatment to predict drug-miRNA associations. At the significance level of 0.05, we obtained 6501 candidate associations between 1295 small molecules and 25 miRNAs, which included 624 FDA approved drugs. Finally, we constructed the Psmir database to store all potential associations and the related materials. In a word, Psmir served as a valuable resource for dissecting the biological significance in small molecules' effects on miRNA expression, which will facilitate developing novel potential therapeutic targets or treatments for human cancers. Psmir is supported by all major browsers, and is freely available at http://www.bio-bigdata.com/Psmir/.
cMapper: gene-centric connectivity mapper for EBI-RDF platform.

PubMed

Shoaib, Muhammad; Ansari, Adnan Ahmad; Ahn, Sung-Min

2017-01-15

In this era of biological big data, data integration has become a common task and a challenge for biologists. The Resource Description Framework (RDF) was developed to enable interoperability of heterogeneous datasets. The EBI-RDF platform enables an efficient data integration of six independent biological databases using RDF technologies and shared ontologies. However, to take advantage of this platform, biologists need to be familiar with RDF technologies and SPARQL query language. To overcome this practical limitation of the EBI-RDF platform, we developed cMapper, a web-based tool that enables biologists to search the EBI-RDF databases in a gene-centric manner without a thorough knowledge of RDF and SPARQL. cMapper allows biologists to search data entities in the EBI-RDF platform that are connected to genes or small molecules of interest in multiple biological contexts. The input to cMapper consists of a set of genes or small molecules, and the output are data entities in six independent EBI-RDF databases connected with the given genes or small molecules in the user's query. cMapper provides output to users in the form of a graph in which nodes represent data entities and the edges represent connections between data entities and inputted set of genes or small molecules. Furthermore, users can apply filters based on database, taxonomy, organ and pathways in order to focus on a core connectivity graph of their interest. Data entities from multiple databases are differentiated based on background colors. cMapper also enables users to investigate shared connections between genes or small molecules of interest. Users can view the output graph on a web browser or download it in either GraphML or JSON formats. cMapper is available as a web application with an integrated MySQL database. The web application was developed using Java and deployed on Tomcat server. We developed the user interface using HTML5, JQuery and the Cytoscape Graph API. cMapper can be accessed at http://cmapper.ewostech.net Readers can download the development manual from the website http://cmapper.ewostech.net/docs/cMapperDocumentation.pdf. Source Code is available at https://github.com/muhammadshoaib/cmapperContact:smahn@gachon.ac.krSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Mapping small molecule binding data to structural domains

PubMed Central

2012-01-01

Background Large-scale bioactivity/SAR Open Data has recently become available, and this has allowed new analyses and approaches to be developed to help address the productivity and translational gaps of current drug discovery. One of the current limitations of these data is the relative sparsity of reported interactions per protein target, and complexities in establishing clear relationships between bioactivity and targets using bioinformatics tools. We detail in this paper the indexing of targets by the structural domains that bind (or are likely to bind) the ligand within a full-length protein. Specifically, we present a simple heuristic to map small molecule binding to Pfam domains. This profiling can be applied to all proteins within a genome to give some indications of the potential pharmacological modulation and regulation of all proteins. Results In this implementation of our heuristic, ligand binding to protein targets from the ChEMBL database was mapped to structural domains as defined by profiles contained within the Pfam-A database. Our mapping suggests that the majority of assay targets within the current version of the ChEMBL database bind ligands through a small number of highly prevalent domains, and conversely the majority of Pfam domains sampled by our data play no currently established role in ligand binding. Validation studies, carried out firstly against Uniprot entries with expert binding-site annotation and secondly against entries in the wwPDB repository of crystallographic protein structures, demonstrate that our simple heuristic maps ligand binding to the correct domain in about 90 percent of all assessed cases. Using the mappings obtained with our heuristic, we have assembled ligand sets associated with each Pfam domain. Conclusions Small molecule binding has been mapped to Pfam-A domains of protein targets in the ChEMBL bioactivity database. The result of this mapping is an enriched annotation of small molecule bioactivity data and a grouping of activity classes following the Pfam-A specifications of protein domains. This is valuable for data-focused approaches in drug discovery, for example when extrapolating potential targets of a small molecule with known activity against one or few targets, or in the assessment of a potential target for drug discovery or screening studies. PMID:23282026
Psmir: a database of potential associations between small molecules and miRNAs

PubMed Central

Meng, Fanlin; Wang, Jing; Dai, Enyu; Yang, Feng; Chen, Xiaowen; Wang, Shuyuan; Yu, Xuexin; Liu, Dianming; Jiang, Wei

2016-01-01

miRNAs are key post-transcriptional regulators of many essential biological processes, and their dysregulation has been validated in almost all human cancers. Restoring aberrantly expressed miRNAs might be a novel therapeutics. Recently, many studies have demonstrated that small molecular compounds can affect miRNA expression. Thus, prediction of associations between small molecules and miRNAs is important for investigation of miRNA-targeted drugs. Here, we analyzed 39 miRNA-perturbed gene expression profiles, and then calculated the similarity of transcription responses between miRNA perturbation and drug treatment to predict drug-miRNA associations. At the significance level of 0.05, we obtained 6501 candidate associations between 1295 small molecules and 25 miRNAs, which included 624 FDA approved drugs. Finally, we constructed the Psmir database to store all potential associations and the related materials. In a word, Psmir served as a valuable resource for dissecting the biological significance in small molecules’ effects on miRNA expression, which will facilitate developing novel potential therapeutic targets or treatments for human cancers. Psmir is supported by all major browsers, and is freely available at http://www.bio-bigdata.com/Psmir/. PMID:26759061
Using the gini coefficient to measure the chemical diversity of small-molecule libraries.

PubMed

Weidlich, Iwona E; Filippov, Igor V

2016-08-15

Modern databases of small organic molecules contain tens of millions of structures. The size of theoretically available chemistry is even larger. However, despite the large amount of chemical information, the "big data" moment for chemistry has not yet provided the corresponding payoff of cheaper computer-predicted medicine or robust machine-learning models for the determination of efficacy and toxicity. Here, we present a study of the diversity of chemical datasets using a measure that is commonly used in socioeconomic studies. We demonstrate the use of this diversity measure on several datasets that were constructed to contain various congeneric subsets of molecules as well as randomly selected molecules. We also apply our method to a number of well-known databases that are frequently used for structure-activity relationship modeling. Our results show the poor diversity of the common sources of potential lead compounds compared to actual known drugs. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
The Biomolecular Interaction Network Database and related tools 2005 update

PubMed Central

Alfarano, C.; Andrade, C. E.; Anthony, K.; Bahroos, N.; Bajec, M.; Bantoft, K.; Betel, D.; Bobechko, B.; Boutilier, K.; Burgess, E.; Buzadzija, K.; Cavero, R.; D'Abreo, C.; Donaldson, I.; Dorairajoo, D.; Dumontier, M. J.; Dumontier, M. R.; Earles, V.; Farrall, R.; Feldman, H.; Garderman, E.; Gong, Y.; Gonzaga, R.; Grytsan, V.; Gryz, E.; Gu, V.; Haldorsen, E.; Halupa, A.; Haw, R.; Hrvojic, A.; Hurrell, L.; Isserlin, R.; Jack, F.; Juma, F.; Khan, A.; Kon, T.; Konopinsky, S.; Le, V.; Lee, E.; Ling, S.; Magidin, M.; Moniakis, J.; Montojo, J.; Moore, S.; Muskat, B.; Ng, I.; Paraiso, J. P.; Parker, B.; Pintilie, G.; Pirone, R.; Salama, J. J.; Sgro, S.; Shan, T.; Shu, Y.; Siew, J.; Skinner, D.; Snyder, K.; Stasiuk, R.; Strumpf, D.; Tuekam, B.; Tao, S.; Wang, Z.; White, M.; Willis, R.; Wolting, C.; Wong, S.; Wrong, A.; Xin, C.; Yao, R.; Yates, B.; Zhang, S.; Zheng, K.; Pawson, T.; Ouellette, B. F. F.; Hogue, C. W. V.

2005-01-01

The Biomolecular Interaction Network Database (BIND) (http://bind.ca) archives biomolecular interaction, reaction, complex and pathway information. Our aim is to curate the details about molecular interactions that arise from published experimental research and to provide this information, as well as tools to enable data analysis, freely to researchers worldwide. BIND data are curated into a comprehensive machine-readable archive of computable information and provides users with methods to discover interactions and molecular mechanisms. BIND has worked to develop new methods for visualization that amplify the underlying annotation of genes and proteins to facilitate the study of molecular interaction networks. BIND has maintained an open database policy since its inception in 1999. Data growth has proceeded at a tremendous rate, approaching over 100 000 records. New services provided include a new BIND Query and Submission interface, a Standard Object Access Protocol service and the Small Molecule Interaction Database (http://smid.blueprint.org) that allows users to determine probable small molecule binding sites of new sequences and examine conserved binding residues. PMID:15608229
SuperSweet—a resource on natural and artificial sweetening agents

PubMed Central

Ahmed, Jessica; Preissner, Saskia; Dunkel, Mathias; Worth, Catherine L.; Eckert, Andreas; Preissner, Robert

2011-01-01

A vast number of sweet tasting molecules are known, encompassing small compounds, carbohydrates, d-amino acids and large proteins. Carbohydrates play a particularly big role in human diet. The replacement of sugars in food with artificial sweeteners is common and is a general approach to prevent cavities, obesity and associated diseases such as diabetes and hyperlipidemia. Knowledge about the molecular basis of taste may reveal new strategies to overcome diet-induced diseases. In this context, the design of safe, low-calorie sweeteners is particularly important. Here, we provide a comprehensive collection of carbohydrates, artificial sweeteners and other sweet tasting agents like proteins and peptides. Additionally, structural information and properties such as number of calories, therapeutic annotations and a sweetness-index are stored in SuperSweet. Currently, the database consists of more than 8000 sweet molecules. Moreover, the database provides a modeled 3D structure of the sweet taste receptor and binding poses of the small sweet molecules. These binding poses provide hints for the design of new sweeteners. A user-friendly graphical interface allows similarity searching, visualization of docked sweeteners into the receptor etc. A sweetener classification tree and browsing features allow quick requests to be made to the database. The database is freely available at: http://bioinformatics.charite.de/sweet/. PMID:20952410
SuperSweet--a resource on natural and artificial sweetening agents.

PubMed

Ahmed, Jessica; Preissner, Saskia; Dunkel, Mathias; Worth, Catherine L; Eckert, Andreas; Preissner, Robert

2011-01-01

A vast number of sweet tasting molecules are known, encompassing small compounds, carbohydrates, d-amino acids and large proteins. Carbohydrates play a particularly big role in human diet. The replacement of sugars in food with artificial sweeteners is common and is a general approach to prevent cavities, obesity and associated diseases such as diabetes and hyperlipidemia. Knowledge about the molecular basis of taste may reveal new strategies to overcome diet-induced diseases. In this context, the design of safe, low-calorie sweeteners is particularly important. Here, we provide a comprehensive collection of carbohydrates, artificial sweeteners and other sweet tasting agents like proteins and peptides. Additionally, structural information and properties such as number of calories, therapeutic annotations and a sweetness-index are stored in SuperSweet. Currently, the database consists of more than 8000 sweet molecules. Moreover, the database provides a modeled 3D structure of the sweet taste receptor and binding poses of the small sweet molecules. These binding poses provide hints for the design of new sweeteners. A user-friendly graphical interface allows similarity searching, visualization of docked sweeteners into the receptor etc. A sweetener classification tree and browsing features allow quick requests to be made to the database. The database is freely available at: http://bioinformatics.charite.de/sweet/.
Crystallography Open Database – an open-access collection of crystal structures

PubMed Central

Gražulis, Saulius; Chateigner, Daniel; Downs, Robert T.; Yokochi, A. F. T.; Quirós, Miguel; Lutterotti, Luca; Manakova, Elena; Butkus, Justas; Moeck, Peter; Le Bail, Armel

2009-01-01

The Crystallography Open Database (COD), which is a project that aims to gather all available inorganic, metal–organic and small organic molecule structural data in one database, is described. The database adopts an open-access model. The COD currently contains ∼80 000 entries in crystallographic information file format, with nearly full coverage of the International Union of Crystallography publications, and is growing in size and quality. PMID:22477773
The NCBI BioSystems database.

PubMed

Geer, Lewis Y; Marchler-Bauer, Aron; Geer, Renata C; Han, Lianyi; He, Jane; He, Siqian; Liu, Chunlei; Shi, Wenyao; Bryant, Stephen H

2010-01-01

The NCBI BioSystems database, found at http://www.ncbi.nlm.nih.gov/biosystems/, centralizes and cross-links existing biological systems databases, increasing their utility and target audience by integrating their pathways and systems into NCBI resources. This integration allows users of NCBI's Entrez databases to quickly categorize proteins, genes and small molecules by metabolic pathway, disease state or other BioSystem type, without requiring time-consuming inference of biological relationships from the literature or multiple experimental datasets.
CASMI 2013: Identification of Small Molecules by Tandem Mass Spectrometry Combined with Database and Literature Mining

PubMed Central

Newsome, Andrew G.; Nikolic, Dejan

2014-01-01

The Critical Assessment of Small Molecule Identification (CASMI) contest was initiated in 2012 to evaluate manual and automated strategies for the identification of small molecules from raw mass spectrometric data. The authors participated in both category 1 (molecular formula determination) and category 2 (molecular structure determination) of the second annual CASMI contest (CASMI 2013) using slow but effective manual methods. The provided high resolution mass spectrometric data were interpreted manually using a combination of molecular formula calculators, fragment and neutral loss analysis, literature consultation, manual database searches, deductive logic, and experience. The authors submitted correct formulas as lead candidates for 16 of 16 challenges and submitted correct structure solutions as lead candidates for 14 of 16 challenges. One structure submission (Challenge 3) was very close but not exact (N2-acetylglutaminylisoleucinamide instead of the correct N2-acetylglutaminylleucinamide). A solution for one (Challenge 13) was not submitted due to an inability to reconcile the provided fragmentation pattern with any known structures with the provided molecular composition. PMID:26819877
Analysis of commercial and public bioactivity databases.

PubMed

Tiikkainen, Pekka; Franke, Lutz

2012-02-27

Activity data for small molecules are invaluable in chemoinformatics. Various bioactivity databases exist containing detailed information of target proteins and quantitative binding data for small molecules extracted from journals and patents. In the current work, we have merged several public and commercial bioactivity databases into one bioactivity metabase. The molecular presentation, target information, and activity data of the vendor databases were standardized. The main motivation of the work was to create a single relational database which allows fast and simple data retrieval by in-house scientists. Second, we wanted to know the amount of overlap between databases by commercial and public vendors to see whether the former contain data complementing the latter. Third, we quantified the degree of inconsistency between data sources by comparing data points derived from the same scientific article cited by more than one vendor. We found that each data source contains unique data which is due to different scientific articles cited by the vendors. When comparing data derived from the same article we found that inconsistencies between the vendors are common. In conclusion, using databases of different vendors is still useful since the data overlap is not complete. It should be noted that this can be partially explained by the inconsistencies and errors in the source data.
Size-independent neural networks based first-principles method for accurate prediction of heat of formation of fuels

NASA Astrophysics Data System (ADS)

Yang, GuanYa; Wu, Jiang; Chen, ShuGuang; Zhou, WeiJun; Sun, Jian; Chen, GuanHua

2018-06-01

Neural network-based first-principles method for predicting heat of formation (HOF) was previously demonstrated to be able to achieve chemical accuracy in a broad spectrum of target molecules [L. H. Hu et al., J. Chem. Phys. 119, 11501 (2003)]. However, its accuracy deteriorates with the increase in molecular size. A closer inspection reveals a systematic correlation between the prediction error and the molecular size, which appears correctable by further statistical analysis, calling for a more sophisticated machine learning algorithm. Despite the apparent difference between simple and complex molecules, all the essential physical information is already present in a carefully selected set of small molecule representatives. A model that can capture the fundamental physics would be able to predict large and complex molecules from information extracted only from a small molecules database. To this end, a size-independent, multi-step multi-variable linear regression-neural network-B3LYP method is developed in this work, which successfully improves the overall prediction accuracy by training with smaller molecules only. And in particular, the calculation errors for larger molecules are drastically reduced to the same magnitudes as those of the smaller molecules. Specifically, the method is based on a 164-molecule database that consists of molecules made of hydrogen and carbon elements. 4 molecular descriptors were selected to encode molecule's characteristics, among which raw HOF calculated from B3LYP and the molecular size are also included. Upon the size-independent machine learning correction, the mean absolute deviation (MAD) of the B3LYP/6-311+G(3df,2p)-calculated HOF is reduced from 16.58 to 1.43 kcal/mol and from 17.33 to 1.69 kcal/mol for the training and testing sets (small molecules), respectively. Furthermore, the MAD of the testing set (large molecules) is reduced from 28.75 to 1.67 kcal/mol.
The NCBI BioSystems database

PubMed Central

Geer, Lewis Y.; Marchler-Bauer, Aron; Geer, Renata C.; Han, Lianyi; He, Jane; He, Siqian; Liu, Chunlei; Shi, Wenyao; Bryant, Stephen H.

2010-01-01

The NCBI BioSystems database, found at http://www.ncbi.nlm.nih.gov/biosystems/, centralizes and cross-links existing biological systems databases, increasing their utility and target audience by integrating their pathways and systems into NCBI resources. This integration allows users of NCBI’s Entrez databases to quickly categorize proteins, genes and small molecules by metabolic pathway, disease state or other BioSystem type, without requiring time-consuming inference of biological relationships from the literature or multiple experimental datasets. PMID:19854944
The LncRNA Connectivity Map: Using LncRNA Signatures to Connect Small Molecules, LncRNAs, and Diseases.

PubMed

Yang, Haixiu; Shang, Desi; Xu, Yanjun; Zhang, Chunlong; Feng, Li; Sun, Zeguo; Shi, Xinrui; Zhang, Yunpeng; Han, Junwei; Su, Fei; Li, Chunquan; Li, Xia

2017-07-27

Well characterized the connections among diseases, long non-coding RNAs (lncRNAs) and drugs are important for elucidating the key roles of lncRNAs in biological mechanisms in various biological states. In this study, we constructed a database called LNCmap (LncRNA Connectivity Map), available at http://www.bio-bigdata.com/LNCmap/ , to establish the correlations among diseases, physiological processes, and the action of small molecule therapeutics by attempting to describe all biological states in terms of lncRNA signatures. By reannotating the microarray data from the Connectivity Map database, the LNCmap obtained 237 lncRNA signatures of 5916 instances corresponding to 1262 small molecular drugs. We provided a user-friendly interface for the convenient browsing, retrieval and download of the database, including detailed information and the associations of drugs and corresponding affected lncRNAs. Additionally, we developed two enrichment analysis methods for users to identify candidate drugs for a particular disease by inputting the corresponding lncRNA expression profiles or an associated lncRNA list and then comparing them to the lncRNA signatures in our database. Overall, LNCmap could significantly improve our understanding of the biological roles of lncRNAs and provide a unique resource to reveal the connections among drugs, lncRNAs and diseases.
Re-education begins at home: an overview of the discovery of in vivo-active small molecule modulators of endogenous stem cells.

PubMed

Um, JungIn; Lee, Ji-Hyung; Jung, Da-Woon; Williams, Darren R

2018-04-01

Degenerative diseases, such as Alzheimer's disease, heart disease and arthritis cause great suffering and are major socioeconomic burdens. An attractive treatment approach is stem cell transplantation to regenerate damaged or destroyed tissues. However, this can be problematic. For example, donor cells may not functionally integrate into the host tissue. An alternative methodology is to deliver bioactive agents, such as small molecules, directly into the diseased tissue to enhance the regenerative potential of endogenous stem cells. Areas covered: In this review, the authors discuss the necessity of developing these small molecules to treat degenerative diseases and survey progress in their application as therapeutics. They describe both the successes and caveats of developing small molecules that target endogenous stem cells to induce tissue regeneration. This article is based on literature searches which encompass databases for biomedical research and clinical trials. These small molecules are also categorized per their target disease and mechanism of action. Expert opinion: The development of small molecules targeting endogenous stem cells is a high-profile research area. Some compounds have made the successful transition to the clinic. Novel approaches, such as modulating the stem cell niche or targeted delivery to disease sites, should increase the likelihood of future successes in this field.
Prediction of small molecule binding property of protein domains with Bayesian classifiers based on Markov chains.

PubMed

Bulashevska, Alla; Stein, Martin; Jackson, David; Eils, Roland

2009-12-01

Accurate computational methods that can help to predict biological function of a protein from its sequence are of great interest to research biologists and pharmaceutical companies. One approach to assume the function of proteins is to predict the interactions between proteins and other molecules. In this work, we propose a machine learning method that uses a primary sequence of a domain to predict its propensity for interaction with small molecules. By curating the Pfam database with respect to the small molecule binding ability of its component domains, we have constructed a dataset of small molecule binding and non-binding domains. This dataset was then used as training set to learn a Bayesian classifier, which should distinguish members of each class. The domain sequences of both classes are modelled with Markov chains. In a Jack-knife test, our classification procedure achieved the predictive accuracies of 77.2% and 66.7% for binding and non-binding classes respectively. We demonstrate the applicability of our classifier by using it to identify previously unknown small molecule binding domains. Our predictions are available as supplementary material and can provide very useful information to drug discovery specialists. Given the ubiquitous and essential role small molecules play in biological processes, our method is important for identifying pharmaceutically relevant components of complete proteomes. The software is available from the author upon request.
Electron-Impact Ionization Cross Section Database

National Institute of Standards and Technology Data Gateway

SRD 107 Electron-Impact Ionization Cross Section Database (Web, free access) This is a database primarily of total ionization cross sections of molecules by electron impact. The database also includes cross sections for a small number of atoms and energy distributions of ejected electrons for H, He, and H2. The cross sections were calculated using the Binary-Encounter-Bethe (BEB) model, which combines the Mott cross section with the high-incident energy behavior of the Bethe cross section. Selected experimental data are included.
Drug search for leishmaniasis: a virtual screening approach by grid computing

NASA Astrophysics Data System (ADS)

Ochoa, Rodrigo; Watowich, Stanley J.; Flórez, Andrés; Mesa, Carol V.; Robledo, Sara M.; Muskus, Carlos

2016-07-01

The trypanosomatid protozoa Leishmania is endemic in 100 countries, with infections causing 2 million new cases of leishmaniasis annually. Disease symptoms can include severe skin and mucosal ulcers, fever, anemia, splenomegaly, and death. Unfortunately, therapeutics approved to treat leishmaniasis are associated with potentially severe side effects, including death. Furthermore, drug-resistant Leishmania parasites have developed in most endemic countries. To address an urgent need for new, safe and inexpensive anti-leishmanial drugs, we utilized the IBM World Community Grid to complete computer-based drug discovery screens (Drug Search for Leishmaniasis) using unique leishmanial proteins and a database of 600,000 drug-like small molecules. Protein structures from different Leishmania species were selected for molecular dynamics (MD) simulations, and a series of conformational "snapshots" were chosen from each MD trajectory to simulate the protein's flexibility. A Relaxed Complex Scheme methodology was used to screen 2000 MD conformations against the small molecule database, producing >1 billion protein-ligand structures. For each protein target, a binding spectrum was calculated to identify compounds predicted to bind with highest average affinity to all protein conformations. Significantly, four different Leishmania protein targets were predicted to strongly bind small molecules, with the strongest binding interactions predicted to occur for dihydroorotate dehydrogenase (LmDHODH; PDB:3MJY). A number of predicted tight-binding LmDHODH inhibitors were tested in vitro and potent selective inhibitors of Leishmania panamensis were identified. These promising small molecules are suitable for further development using iterative structure-based optimization and in vitro/in vivo validation assays.

Drug search for leishmaniasis: a virtual screening approach by grid computing.

PubMed

Ochoa, Rodrigo; Watowich, Stanley J; Flórez, Andrés; Mesa, Carol V; Robledo, Sara M; Muskus, Carlos

2016-07-01

The trypanosomatid protozoa Leishmania is endemic in ~100 countries, with infections causing ~2 million new cases of leishmaniasis annually. Disease symptoms can include severe skin and mucosal ulcers, fever, anemia, splenomegaly, and death. Unfortunately, therapeutics approved to treat leishmaniasis are associated with potentially severe side effects, including death. Furthermore, drug-resistant Leishmania parasites have developed in most endemic countries. To address an urgent need for new, safe and inexpensive anti-leishmanial drugs, we utilized the IBM World Community Grid to complete computer-based drug discovery screens (Drug Search for Leishmaniasis) using unique leishmanial proteins and a database of 600,000 drug-like small molecules. Protein structures from different Leishmania species were selected for molecular dynamics (MD) simulations, and a series of conformational "snapshots" were chosen from each MD trajectory to simulate the protein's flexibility. A Relaxed Complex Scheme methodology was used to screen ~2000 MD conformations against the small molecule database, producing >1 billion protein-ligand structures. For each protein target, a binding spectrum was calculated to identify compounds predicted to bind with highest average affinity to all protein conformations. Significantly, four different Leishmania protein targets were predicted to strongly bind small molecules, with the strongest binding interactions predicted to occur for dihydroorotate dehydrogenase (LmDHODH; PDB:3MJY). A number of predicted tight-binding LmDHODH inhibitors were tested in vitro and potent selective inhibitors of Leishmania panamensis were identified. These promising small molecules are suitable for further development using iterative structure-based optimization and in vitro/in vivo validation assays.
Small molecules as therapy for uveitis: a selected perspective of new and developing agents.

PubMed

Pleyer, Uwe; Algharably, Engi Abdel-Hady; Feist, Eugen; Kreutz, Reinhold

2017-09-01

Intraocular inflammation (uveitis) remains a significant burden of legal blindness. Because of its immune mediated and chronic recurrent nature, common therapy includes corticosteroids, disease-modifying anti-rheumatic drugs and more recently biologics as immune modulatory agents. The purpose of this article is to identify the role of new treatment approaches focusing on small molecules as therapeutic option in uveitis. Areas covered: A MEDLINE database search was conducted through February 2017 using the terms 'uveitis' and 'small molecule'. To provide ongoing and future perspectives in treatment options, also clinical trials as registered at ClinicalTrials.gov were included. Both, results from experimental as well as clinical research in this field were included. Since this field is rapidly evolving, a selection of promising agents had to be made. Expert opinion: Small molecules may interfere at different steps of the inflammatory cascade and appear as an interesting option in the treatment algorithm of uveitis. Because of their highly targeted molecular effects and their favorable bioavailability with the potential of topical application small molecules hold great promise. Nevertheless, a careful evaluation of these agents has to be made, since current experience is almost exclusively based on experimental uveitis models and few registered trials.
Searching molecular structure databases with tandem mass spectra using CSI:FingerID

PubMed Central

Dührkop, Kai; Shen, Huibin; Meusel, Marvin; Rousu, Juho; Böcker, Sebastian

2015-01-01

Metabolites provide a direct functional signature of cellular state. Untargeted metabolomics experiments usually rely on tandem MS to identify the thousands of compounds in a biological sample. Today, the vast majority of metabolites remain unknown. We present a method for searching molecular structure databases using tandem MS data of small molecules. Our method computes a fragmentation tree that best explains the fragmentation spectrum of an unknown molecule. We use the fragmentation tree to predict the molecular structure fingerprint of the unknown compound using machine learning. This fingerprint is then used to search a molecular structure database such as PubChem. Our method is shown to improve on the competing methods for computational metabolite identification by a considerable margin. PMID:26392543
Using more than 801 296 small-molecule crystal structures to aid in protein structure refinement and analysis

PubMed Central

Cole, Jason C.

2017-01-01

The Cambridge Structural Database (CSD) is the worldwide resource for the dissemination of all published three-dimensional structures of small-molecule organic and metal–organic compounds. This paper briefly describes how this collection of crystal structures can be used en masse in the context of macromolecular crystallography. Examples highlight how the CSD and associated software aid protein–ligand complex validation, and show how the CSD could be further used in the generation of geometrical restraints for protein structure refinement. PMID:28291758
Computational Thermochemistry of Jet Fuels and Rocket Propellants

NASA Technical Reports Server (NTRS)

Crawford, T. Daniel

2002-01-01

The design of new high-energy density molecules as candidates for jet and rocket fuels is an important goal of modern chemical thermodynamics. The NASA Glenn Research Center is home to a database of thermodynamic data for over 2000 compounds related to this goal, in the form of least-squares fits of heat capacities, enthalpies, and entropies as functions of temperature over the range of 300 - 6000 K. The chemical equilibrium with applications (CEA) program written and maintained by researchers at NASA Glenn over the last fifty years, makes use of this database for modeling the performance of potential rocket propellants. During its long history, the NASA Glenn database has been developed based on experimental results and data published in the scientific literature such as the standard JANAF tables. The recent development of efficient computational techniques based on quantum chemical methods provides an alternative source of information for expansion of such databases. For example, it is now possible to model dissociation or combustion reactions of small molecules to high accuracy using techniques such as coupled cluster theory or density functional theory. Unfortunately, the current applicability of reliable computational models is limited to relatively small molecules containing only around a dozen (non-hydrogen) atoms. We propose to extend the applicability of coupled cluster theory- often referred to as the 'gold standard' of quantum chemical methods- to molecules containing 30-50 non-hydrogen atoms. The centerpiece of this work is the concept of local correlation, in which the description of the electron interactions- known as electron correlation effects- are reduced to only their most important localized components. Such an advance has the potential to greatly expand the current reach of computational thermochemistry and thus to have a significant impact on the theoretical study of jet and rocket propellants.
CREDO: a structural interactomics database for drug discovery

PubMed Central

Schreyer, Adrian M.; Blundell, Tom L.

2013-01-01

CREDO is a unique relational database storing all pairwise atomic interactions of inter- as well as intra-molecular contacts between small molecules and macromolecules found in experimentally determined structures from the Protein Data Bank. These interactions are integrated with further chemical and biological data. The database implements useful data structures and algorithms such as cheminformatics routines to create a comprehensive analysis platform for drug discovery. The database can be accessed through a web-based interface, downloads of data sets and web services at http://www-cryst.bioc.cam.ac.uk/credo. Database URL: http://www-cryst.bioc.cam.ac.uk/credo PMID:23868908
BioM2MetDisease: a manually curated database for associations between microRNAs, metabolites, small molecules and metabolic diseases

PubMed Central

Xu, Yanjun; Yang, Haixiu; Wu, Tan; Dong, Qun; Sun, Zeguo; Shang, Desi; Li, Feng; Xu, Yingqi; Su, Fei; Liu, Siyao

2017-01-01

Abstract BioM2MetDisease is a manually curated database that aims to provide a comprehensive and experimentally supported resource of associations between metabolic diseases and various biomolecules. Recently, metabolic diseases such as diabetes have become one of the leading threats to people’s health. Metabolic disease associated with alterations of multiple types of biomolecules such as miRNAs and metabolites. An integrated and high-quality data source that collection of metabolic disease associated biomolecules is essential for exploring the underlying molecular mechanisms and discovering novel therapeutics. Here, we developed the BioM2MetDisease database, which currently documents 2681 entries of relationships between 1147 biomolecules (miRNAs, metabolites and small molecules/drugs) and 78 metabolic diseases across 14 species. Each entry includes biomolecule category, species, biomolecule name, disease name, dysregulation pattern, experimental technique, a brief description of metabolic disease-biomolecule relationships, the reference, additional annotation information etc. BioM2MetDisease provides a user-friendly interface to explore and retrieve all data conveniently. A submission page was also offered for researchers to submit new associations between biomolecules and metabolic diseases. BioM2MetDisease provides a comprehensive resource for studying biology molecules act in metabolic diseases, and it is helpful for understanding the molecular mechanisms and developing novel therapeutics for metabolic diseases. Database URL: http://www.bio-bigdata.com/BioM2MetDisease/ PMID:28605773
Compilation of small ribosomal subunit RNA structures.

PubMed Central

Neefs, J M; Van de Peer, Y; De Rijk, P; Chapelle, S; De Wachter, R

1993-01-01

The database on small ribosomal subunit RNA structure contained 1804 nucleotide sequences on April 23, 1993. This number comprises 365 eukaryotic, 65 archaeal, 1260 bacterial, 30 plastidial, and 84 mitochondrial sequences. These are stored in the form of an alignment in order to facilitate the use of the database as input for comparative studies on higher-order structure and for reconstruction of phylogenetic trees. The elements of the postulated secondary structure for each molecule are indicated by special symbols. The database is available on-line directly from the authors by ftp and can also be obtained from the EMBL nucleotide sequence library by electronic mail, ftp, and on CD ROM disk. PMID:8332525
ChemNet: A Transferable and Generalizable Deep Neural Network for Small-Molecule Property Prediction

DOE Office of Scientific and Technical Information (OSTI.GOV)

Goh, Garrett B.; Siegel, Charles M.; Vishnu, Abhinav

With access to large datasets, deep neural networks through representation learning have been able to identify patterns from raw data, achieving human-level accuracy in image and speech recognition tasks. However, in chemistry, availability of large standardized and labelled datasets is scarce, and with a multitude of chemical properties of interest, chemical data is inherently small and fragmented. In this work, we explore transfer learning techniques in conjunction with the existing Chemception CNN model, to create a transferable and generalizable deep neural network for small-molecule property prediction. Our latest model, ChemNet learns in a semi-supervised manner from inexpensive labels computed frommore » the ChEMBL database. When fine-tuned to the Tox21, HIV and FreeSolv dataset, which are 3 separate chemical tasks that ChemNet was not originally trained on, we demonstrate that ChemNet exceeds the performance of existing Chemception models, contemporary MLP models that trains on molecular fingerprints, and it matches the performance of the ConvGraph algorithm, the current state-of-the-art. Furthermore, as ChemNet has been pre-trained on a large diverse chemical database, it can be used as a universal “plug-and-play” deep neural network, which accelerates the deployment of deep neural networks for the prediction of novel small-molecule chemical properties.« less
MARS: bringing the automation of small-molecule bioanalytical sample preparations to a new frontier.

PubMed

Li, Ming; Chou, Judy; Jing, Jing; Xu, Hui; Costa, Aldo; Caputo, Robin; Mikkilineni, Rajesh; Flannelly-King, Shane; Rohde, Ellen; Gan, Lawrence; Klunk, Lewis; Yang, Liyu

2012-06-01

In recent years, there has been a growing interest in automating small-molecule bioanalytical sample preparations specifically using the Hamilton MicroLab(®) STAR liquid-handling platform. In the most extensive work reported thus far, multiple small-molecule sample preparation assay types (protein precipitation extraction, SPE and liquid-liquid extraction) have been integrated into a suite that is composed of graphical user interfaces and Hamilton scripts. Using that suite, bioanalytical scientists have been able to automate various sample preparation methods to a great extent. However, there are still areas that could benefit from further automation, specifically, the full integration of analytical standard and QC sample preparation with study sample extraction in one continuous run, real-time 2D barcode scanning on the Hamilton deck and direct Laboratory Information Management System database connectivity. We developed a new small-molecule sample-preparation automation system that improves in all of the aforementioned areas. The improved system presented herein further streamlines the bioanalytical workflow, simplifies batch run design, reduces analyst intervention and eliminates sample-handling error.
Crystallography Open Database (COD): an open-access collection of crystal structures and platform for world-wide collaboration

PubMed Central

Gražulis, Saulius; Daškevič, Adriana; Merkys, Andrius; Chateigner, Daniel; Lutterotti, Luca; Quirós, Miguel; Serebryanaya, Nadezhda R.; Moeck, Peter; Downs, Robert T.; Le Bail, Armel

2012-01-01

Using an open-access distribution model, the Crystallography Open Database (COD, http://www.crystallography.net) collects all known ‘small molecule / small to medium sized unit cell’ crystal structures and makes them available freely on the Internet. As of today, the COD has aggregated ∼150 000 structures, offering basic search capabilities and the possibility to download the whole database, or parts thereof using a variety of standard open communication protocols. A newly developed website provides capabilities for all registered users to deposit published and so far unpublished structures as personal communications or pre-publication depositions. Such a setup enables extension of the COD database by many users simultaneously. This increases the possibilities for growth of the COD database, and is the first step towards establishing a world wide Internet-based collaborative platform dedicated to the collection and curation of structural knowledge. PMID:22070882
Evaluation of post-authorization safety studies in the first cohort of EU Risk Management Plans at time of regulatory approval.

PubMed

Giezen, Thijs J; Mantel-Teeuwisse, Aukje K; Straus, Sabine M J M; Egberts, Toine C G; Blackburn, Stella; Persson, Ingemar; Leufkens, Hubert G M

2009-01-01

Since November 2005, an EU Risk Management Plan (EU-RMP) has had to be submitted as part of a marketing application for all new chemical entities in the EU. In the EU-RMP, the safety profile of the medicine has to be described and pharmacovigilance activities should be proposed to study further safety concerns during use of the drug in the real-world setting. These activities include, for example, collection of spontaneously reported adverse events and post-authorization safety studies (PASS). Since the submission of an EU-RMP is a relatively new requirement, there is limited knowledge on the quality and completeness of the study protocols of PASS at the time of approval and there are no data on the influence of certain drug characteristics on the proposed pharmacovigilance activities. To examine the types of proposed pharmacovigilance activities in a sample of EU-RMPs, describe and evaluate the methodology of PASS, identify problems and propose remedies, and compare characteristics between biologicals and small molecules. Eighteen EU-RMPs (nine for biologicals, nine for small molecules) given a positive decision regarding the marketing application by the Committee for Medicinal Products for Human Use between November 2005 and May 2007 were included in this descriptive cohort study. The EU-RMPs were selected over time and different therapeutic areas. Classification of the safety concerns ('important identified risks', 'important potential risks', 'important missing information' within the EU-RMP was studied. For PASS, data source (registry, population-based database, sponsor-owned clinical trial database), source of study population to be included in PASS and comprehensiveness of study protocol (full protocol, limited protocol, study synopsis, short description, commitment without further information) were studied. Compared to small molecules, safety concerns for biologicals were less frequently classified as important identified risks (relative risk [RR] 0.6; 95% CI 0.3, 1.0) and more frequently as important missing information (RR 1.6; 95% CI 1.0, 2.7). Forty-seven PASS were proposed; 31 for biologicals and 16 for small molecules. Compared with studies proposed in population-based databases (4 for biologicals, 8 for small molecules), studies in registries (18 for biologicals, 4 for small molecules) were more frequently proposed for biologicals than for small molecules (RR 2.5; 95% CI 1.1, 5.7). About 60% of the proposed PASS will include EU inhabitants. No full study protocols were submitted; 26% involved a limited study protocol, 33% a study synopsis, 37% a short description and 4% a commitment without further information. Approximately 40% of the study proposals for PASS were classified as a short description or a commitment to perform a study without further information, precluding an adequate scientific assessment. Studying non-EU populations may give rise to difficulties with generalizability of the results to the EU due to differences in patient characteristics, differences in the indication for the medicine and different healthcare systems. This study emphasizes the need for more complete study proposals to be submitted earlier on in the evaluation period and for the inclusion of EU inhabitants in PASS. In addition, differences in the characteristics between biologicals and small molecules, e.g. in the data source proposed, support the need for individualized tailored PASS depending on the type of drug.
A theoretical-electron-density databank using a model of real and virtual spherical atoms.

PubMed

Nassour, Ayoub; Domagala, Slawomir; Guillot, Benoit; Leduc, Theo; Lecomte, Claude; Jelsch, Christian

2017-08-01

A database describing the electron density of common chemical groups using combinations of real and virtual spherical atoms is proposed, as an alternative to the multipolar atom modelling of the molecular charge density. Theoretical structure factors were computed from periodic density functional theory calculations on 38 crystal structures of small molecules and the charge density was subsequently refined using a density model based on real spherical atoms and additional dummy charges on the covalent bonds and on electron lone-pair sites. The electron-density parameters of real and dummy atoms present in a similar chemical environment were averaged on all the molecules studied to build a database of transferable spherical atoms. Compared with the now-popular databases of transferable multipolar parameters, the spherical charge modelling needs fewer parameters to describe the molecular electron density and can be more easily incorporated in molecular modelling software for the computation of electrostatic properties. The construction method of the database is described. In order to analyse to what extent this modelling method can be used to derive meaningful molecular properties, it has been applied to the urea molecule and to biotin/streptavidin, a protein/ligand complex.
High-Resolution Photoionization, Photoelectron and Photodissociation Studies. Determination of Accurate Energetic and Spectroscopic Database for Combustion Radicals and Molecules

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ng, Cheuk-Yiu

2016-04-25

The main goal of this research program was to obtain accurate thermochemical and spectroscopic data, such as ionization energies (IEs), 0 K bond dissociation energies, 0 K heats of formation, and spectroscopic constants for radicals and molecules and their ions of relevance to combustion chemistry. Two unique, generally applicable vacuum ultraviolet (VUV) laser photoion-photoelectron apparatuses have been developed in our group, which have used for high-resolution photoionization, photoelectron, and photodissociation studies for many small molecules of combustion relevance.
AN OVERVIEW OF COMPUTATIONAL LIFE SCIENCE DATABASES & EXCHANGE FORMATS OF RELEVANCE TO CHEMICAL BIOLOGY RESEARCH

PubMed Central

Hall, Aaron Smalter; Shan, Yunfeng; Lushington, Gerald; Visvanathan, Mahesh

2016-01-01

Databases and exchange formats describing biological entities such as chemicals and proteins, along with their relationships, are a critical component of research in life sciences disciplines, including chemical biology wherein small information about small molecule properties converges with cellular and molecular biology. Databases for storing biological entities are growing not only in size, but also in type, with many similarities between them and often subtle differences. The data formats available to describe and exchange these entities are numerous as well. In general, each format is optimized for a particular purpose or database, and hence some understanding of these formats is required when choosing one for research purposes. This paper reviews a selection of different databases and data formats with the goal of summarizing their purposes, features, and limitations. Databases are reviewed under the categories of 1) protein interactions, 2) metabolic pathways, 3) chemical interactions, and 4) drug discovery. Representation formats will be discussed according to those describing chemical structures, and those describing genomic/proteomic entities. PMID:22934944
An overview of computational life science databases & exchange formats of relevance to chemical biology research.

PubMed

Smalter Hall, Aaron; Shan, Yunfeng; Lushington, Gerald; Visvanathan, Mahesh

2013-03-01

Databases and exchange formats describing biological entities such as chemicals and proteins, along with their relationships, are a critical component of research in life sciences disciplines, including chemical biology wherein small information about small molecule properties converges with cellular and molecular biology. Databases for storing biological entities are growing not only in size, but also in type, with many similarities between them and often subtle differences. The data formats available to describe and exchange these entities are numerous as well. In general, each format is optimized for a particular purpose or database, and hence some understanding of these formats is required when choosing one for research purposes. This paper reviews a selection of different databases and data formats with the goal of summarizing their purposes, features, and limitations. Databases are reviewed under the categories of 1) protein interactions, 2) metabolic pathways, 3) chemical interactions, and 4) drug discovery. Representation formats will be discussed according to those describing chemical structures, and those describing genomic/proteomic entities.
Simulating electric field interactions with polar molecules using spectroscopic databases

NASA Astrophysics Data System (ADS)

Owens, Alec; Zak, Emil J.; Chubb, Katy L.; Yurchenko, Sergei N.; Tennyson, Jonathan; Yachmenev, Andrey

2017-03-01

Ro-vibrational Stark-associated phenomena of small polyatomic molecules are modelled using extensive spectroscopic data generated as part of the ExoMol project. The external field Hamiltonian is built from the computed ro-vibrational line list of the molecule in question. The Hamiltonian we propose is general and suitable for any polar molecule in the presence of an electric field. By exploiting precomputed data, the often prohibitively expensive computations associated with high accuracy simulations of molecule-field interactions are avoided. Applications to strong terahertz field-induced ro-vibrational dynamics of PH3 and NH3, and spontaneous emission data for optoelectrical Sisyphus cooling of H2CO and CH3Cl are discussed.
Design of a small molecule against an oncogenic noncoding RNA.

PubMed

Velagapudi, Sai Pradeep; Cameron, Michael D; Haga, Christopher L; Rosenberg, Laura H; Lafitte, Marie; Duckett, Derek R; Phinney, Donald G; Disney, Matthew D

2016-05-24

The design of precision, preclinical therapeutics from sequence is difficult, but advances in this area, particularly those focused on rational design, could quickly transform the sequence of disease-causing gene products into lead modalities. Herein, we describe the use of Inforna, a computational approach that enables the rational design of small molecules targeting RNA to quickly provide a potent modulator of oncogenic microRNA-96 (miR-96). We mined the secondary structure of primary microRNA-96 (pri-miR-96) hairpin precursor against a database of RNA motif-small molecule interactions, which identified modules that bound RNA motifs nearby and in the Drosha processing site. Precise linking of these modules together provided Targaprimir-96 (3), which selectively modulates miR-96 production in cancer cells and triggers apoptosis. Importantly, the compound is ineffective on healthy breast cells, and exogenous overexpression of pri-miR-96 reduced compound potency in breast cancer cells. Chemical Cross-Linking and Isolation by Pull-Down (Chem-CLIP), a small-molecule RNA target validation approach, shows that 3 directly engages pri-miR-96 in breast cancer cells. In vivo, 3 has a favorable pharmacokinetic profile and decreases tumor burden in a mouse model of triple-negative breast cancer. Thus, rational design can quickly produce precision, in vivo bioactive lead small molecules against hard-to-treat cancers by targeting oncogenic noncoding RNAs, advancing a disease-to-gene-to-drug paradigm.
ForceGen 3D structure and conformer generation: from small lead-like molecules to macrocyclic drugs

NASA Astrophysics Data System (ADS)

Cleves, Ann E.; Jain, Ajay N.

2017-05-01

We introduce the ForceGen method for 3D structure generation and conformer elaboration of drug-like small molecules. ForceGen is novel, avoiding use of distance geometry, molecular templates, or simulation-oriented stochastic sampling. The method is primarily driven by the molecular force field, implemented using an extension of MMFF94s and a partial charge estimator based on electronegativity-equalization. The force field is coupled to algorithms for direct sampling of realistic physical movements made by small molecules. Results are presented on a standard benchmark from the Cambridge Crystallographic Database of 480 drug-like small molecules, including full structure generation from SMILES strings. Reproduction of protein-bound crystallographic ligand poses is demonstrated on four carefully curated data sets: the ConfGen Set (667 ligands), the PINC cross-docking benchmark (1062 ligands), a large set of macrocyclic ligands (182 total with typical ring sizes of 12-23 atoms), and a commonly used benchmark for evaluating macrocycle conformer generation (30 ligands total). Results compare favorably to alternative methods, and performance on macrocyclic compounds approaches that observed on non-macrocycles while yielding a roughly 100-fold speed improvement over alternative MD-based methods with comparable performance.
sscMap: an extensible Java application for connecting small-molecule drugs using gene-expression signatures.

PubMed

Zhang, Shu-Dong; Gant, Timothy W

2009-07-31

Connectivity mapping is a process to recognize novel pharmacological and toxicological properties in small molecules by comparing their gene expression signatures with others in a database. A simple and robust method for connectivity mapping with increased specificity and sensitivity was recently developed, and its utility demonstrated using experimentally derived gene signatures. This paper introduces sscMap (statistically significant connections' map), a Java application designed to undertake connectivity mapping tasks using the recently published method. The software is bundled with a default collection of reference gene-expression profiles based on the publicly available dataset from the Broad Institute Connectivity Map 02, which includes data from over 7000 Affymetrix microarrays, for over 1000 small-molecule compounds, and 6100 treatment instances in 5 human cell lines. In addition, the application allows users to add their custom collections of reference profiles and is applicable to a wide range of other 'omics technologies. The utility of sscMap is two fold. First, it serves to make statistically significant connections between a user-supplied gene signature and the 6100 core reference profiles based on the Broad Institute expanded dataset. Second, it allows users to apply the same improved method to custom-built reference profiles which can be added to the database for future referencing. The software can be freely downloaded from http://purl.oclc.org/NET/sscMap.

Development and Mining of a Volatile Organic Compound Database

PubMed Central

Abdullah, Azian Azamimi; Ono, Naoaki; Sugiura, Tadao; Morita, Aki Hirai; Katsuragi, Tetsuo; Muto, Ai; Nishioka, Takaaki; Kanaya, Shigehiko

2015-01-01

Volatile organic compounds (VOCs) are small molecules that exhibit high vapor pressure under ambient conditions and have low boiling points. Although VOCs contribute only a small proportion of the total metabolites produced by living organisms, they play an important role in chemical ecology specifically in the biological interactions between organisms and ecosystems. VOCs are also important in the health care field as they are presently used as a biomarker to detect various human diseases. Information on VOCs is scattered in the literature until now; however, there is still no available database describing VOCs and their biological activities. To attain this purpose, we have developed KNApSAcK Metabolite Ecology Database, which contains the information on the relationships between VOCs and their emitting organisms. The KNApSAcK Metabolite Ecology is also linked with the KNApSAcK Core and KNApSAcK Metabolite Activity Database to provide further information on the metabolites and their biological activities. The VOC database can be accessed online. PMID:26495281
BioM2MetDisease: a manually curated database for associations between microRNAs, metabolites, small molecules and metabolic diseases.

PubMed

Xu, Yanjun; Yang, Haixiu; Wu, Tan; Dong, Qun; Sun, Zeguo; Shang, Desi; Li, Feng; Xu, Yingqi; Su, Fei; Liu, Siyao; Zhang, Yunpeng; Li, Xia

2017-01-01

BioM2MetDisease is a manually curated database that aims to provide a comprehensive and experimentally supported resource of associations between metabolic diseases and various biomolecules. Recently, metabolic diseases such as diabetes have become one of the leading threats to people’s health. Metabolic disease associated with alterations of multiple types of biomolecules such as miRNAs and metabolites. An integrated and high-quality data source that collection of metabolic disease associated biomolecules is essential for exploring the underlying molecular mechanisms and discovering novel therapeutics. Here, we developed the BioM2MetDisease database, which currently documents 2681 entries of relationships between 1147 biomolecules (miRNAs, metabolites and small molecules/drugs) and 78 metabolic diseases across 14 species. Each entry includes biomolecule category, species, biomolecule name, disease name, dysregulation pattern, experimental technique, a brief description of metabolic disease-biomolecule relationships, the reference, additional annotation information etc. BioM2MetDisease provides a user-friendly interface to explore and retrieve all data conveniently. A submission page was also offered for researchers to submit new associations between biomolecules and metabolic diseases. BioM2MetDisease provides a comprehensive resource for studying biology molecules act in metabolic diseases, and it is helpful for understanding the molecular mechanisms and developing novel therapeutics for metabolic diseases. http://www.bio-bigdata.com/BioM2MetDisease/. © The Author(s) 2017. Published by Oxford University Press.
Design of a small molecule against an oncogenic noncoding RNA

PubMed Central

Velagapudi, Sai Pradeep; Cameron, Michael D.; Haga, Christopher L.; Rosenberg, Laura H.; Lafitte, Marie; Duckett, Derek R.; Phinney, Donald G.; Disney, Matthew D.

2016-01-01

The design of precision, preclinical therapeutics from sequence is difficult, but advances in this area, particularly those focused on rational design, could quickly transform the sequence of disease-causing gene products into lead modalities. Herein, we describe the use of Inforna, a computational approach that enables the rational design of small molecules targeting RNA to quickly provide a potent modulator of oncogenic microRNA-96 (miR-96). We mined the secondary structure of primary microRNA-96 (pri-miR-96) hairpin precursor against a database of RNA motif–small molecule interactions, which identified modules that bound RNA motifs nearby and in the Drosha processing site. Precise linking of these modules together provided Targaprimir-96 (3), which selectively modulates miR-96 production in cancer cells and triggers apoptosis. Importantly, the compound is ineffective on healthy breast cells, and exogenous overexpression of pri-miR-96 reduced compound potency in breast cancer cells. Chemical Cross-Linking and Isolation by Pull-Down (Chem-CLIP), a small-molecule RNA target validation approach, shows that 3 directly engages pri-miR-96 in breast cancer cells. In vivo, 3 has a favorable pharmacokinetic profile and decreases tumor burden in a mouse model of triple-negative breast cancer. Thus, rational design can quickly produce precision, in vivo bioactive lead small molecules against hard-to-treat cancers by targeting oncogenic noncoding RNAs, advancing a disease-to-gene-to-drug paradigm. PMID:27170187
The use of small-molecule structures to complement protein–ligand crystal structures in drug discovery

PubMed Central

Cole, Jason C.

2017-01-01

Many ligand-discovery stories tell of the use of structures of protein–ligand complexes, but the contribution of structural chemistry is such a core part of finding and improving ligands that it is often overlooked. More than 800 000 crystal structures are available to the community through the Cambridge Structural Database (CSD). Individually, these structures can be of tremendous value and the collection of crystal structures is even more helpful. This article provides examples of how small-molecule crystal structures have been used to complement those of protein–ligand complexes to address challenges ranging from affinity, selectivity and bioavailability though to solubility. PMID:28291759
PPDMs-a resource for mapping small molecule bioactivities from ChEMBL to Pfam-A protein domains.

PubMed

Kruger, Felix A; Gaulton, Anna; Nowotka, Michal; Overington, John P

2015-03-01

PPDMs is a resource that maps small molecule bioactivities to protein domains from the Pfam-A collection of protein families. Small molecule bioactivities mapped to protein domains add important precision to approaches that use protein sequence searches alignments to assist applications in computational drug discovery and systems and chemical biology. We have previously proposed a mapping heuristic for a subset of bioactivities stored in ChEMBL with the Pfam-A domain most likely to mediate small molecule binding. We have since refined this mapping using a manual procedure. Here, we present a resource that provides up-to-date mappings and the possibility to review assigned mappings as well as to participate in their assignment and curation. We also describe how mappings provided through the PPDMs resource are made accessible through the main schema of the ChEMBL database. The PPDMs resource and curation interface is available at https://www.ebi.ac.uk/chembl/research/ppdms/pfam_maps. The source-code for PPDMs is available under the Apache license at https://github.com/chembl/pfam_maps. Source code is available at https://github.com/chembl/pfam_map_loader to demonstrate the integration process with the main schema of ChEMBL. © The Author 2014. Published by Oxford University Press.
Sachem: a chemical cartridge for high-performance substructure search.

PubMed

Kratochvíl, Miroslav; Vondrášek, Jiří; Galgonek, Jakub

2018-05-23

Structure search is one of the valuable capabilities of small-molecule databases. Fingerprint-based screening methods are usually employed to enhance the search performance by reducing the number of calls to the verification procedure. In substructure search, fingerprints are designed to capture important structural aspects of the molecule to aid the decision about whether the molecule contains a given substructure. Currently available cartridges typically provide acceptable search performance for processing user queries, but do not scale satisfactorily with dataset size. We present Sachem, a new open-source chemical cartridge that implements two substructure search methods: The first is a performance-oriented reimplementation of substructure indexing based on the OrChem fingerprint, and the second is a novel method that employs newly designed fingerprints stored in inverted indices. We assessed the performance of both methods on small, medium, and large datasets containing 1, 10, and 94 million compounds, respectively. Comparison of Sachem with other freely available cartridges revealed improvements in overall performance, scaling potential and screen-out efficiency. The Sachem cartridge allows efficient substructure searches in databases of all sizes. The sublinear performance scaling of the second method and the ability to efficiently query large amounts of pre-extracted information may together open the door to new applications for substructure searches.
Representability of algebraic topology for biomolecules in machine learning based scoring and virtual screening

PubMed Central

Mu, Lin

2018-01-01

This work introduces a number of algebraic topology approaches, including multi-component persistent homology, multi-level persistent homology, and electrostatic persistence for the representation, characterization, and description of small molecules and biomolecular complexes. In contrast to the conventional persistent homology, multi-component persistent homology retains critical chemical and biological information during the topological simplification of biomolecular geometric complexity. Multi-level persistent homology enables a tailored topological description of inter- and/or intra-molecular interactions of interest. Electrostatic persistence incorporates partial charge information into topological invariants. These topological methods are paired with Wasserstein distance to characterize similarities between molecules and are further integrated with a variety of machine learning algorithms, including k-nearest neighbors, ensemble of trees, and deep convolutional neural networks, to manifest their descriptive and predictive powers for protein-ligand binding analysis and virtual screening of small molecules. Extensive numerical experiments involving 4,414 protein-ligand complexes from the PDBBind database and 128,374 ligand-target and decoy-target pairs in the DUD database are performed to test respectively the scoring power and the discriminatory power of the proposed topological learning strategies. It is demonstrated that the present topological learning outperforms other existing methods in protein-ligand binding affinity prediction and ligand-decoy discrimination. PMID:29309403
Building an R&D chemical registration system.

PubMed

Martin, Elyette; Monge, Aurélien; Duret, Jacques-Antoine; Gualandi, Federico; Peitsch, Manuel C; Pospisil, Pavel

2012-05-31

Small molecule chemistry is of central importance to a number of R&D companies in diverse areas such as the pharmaceutical, nutraceutical, food flavoring, and cosmeceutical industries. In order to store and manage thousands of chemical compounds in such an environment, we have built a state-of-the-art master chemical database with unique structure identifiers. Here, we present the concept and methodology we used to build the system that we call the Unique Compound Database (UCD). In the UCD, each molecule is registered only once (uniqueness), structures with alternative representations are entered in a uniform way (normalization), and the chemical structure drawings are recognizable to chemists and to a cartridge. In brief, structural molecules are entered as neutral entities which can be associated with a salt. The salts are listed in a dictionary and bound to the molecule with the appropriate stoichiometric coefficient in an entity called "substance". The substances are associated with batches. Once a molecule is registered, some properties (e.g., ADMET prediction, IUPAC name, chemical properties) are calculated automatically. The UCD has both automated and manual data controls. Moreover, the UCD concept enables the management of user errors in the structure entry by reassigning or archiving the batches. It also allows updating of the records to include newly discovered properties of individual structures. As our research spans a wide variety of scientific fields, the database enables registration of mixtures of compounds, enantiomers, tautomers, and compounds with unknown stereochemistries.
Database and Related Activities in Japan

DOE Office of Scientific and Technical Information (OSTI.GOV)

Murakami, Izumi; Kato, Daiji; Kato, Masatoshi

2011-05-11

We have constructed and made available atomic and molecular (AM) numerical databases on collision processes such as electron-impact excitation and ionization, recombination and charge transfer of atoms and molecules relevant for plasma physics, fusion research, astrophysics, applied-science plasma, and other related areas. The retrievable data is freely accessible via the internet. We also work on atomic data evaluation and constructing collisional-radiative models for spectroscopic plasma diagnostics. Recently we have worked on Fe ions and W ions theoretically and experimentally. The atomic data and collisional-radiative models for these ions are examined and applied to laboratory plasmas. A visible M1 transition ofmore » W{sup 26+} ion is identified at 389.41 nm by EBIT experiments and theoretical calculations. We have small non-retrievable databases in addition to our main database. Recently we evaluated photo-absorption cross sections for 9 atoms and 23 molecules and we present them as a new database. We established a new association ''Forum of Atomic and Molecular Data and Their Applications'' to exchange information among AM data producers, data providers and data users in Japan and we hope this will help to encourage AM data activities in Japan.« less
Database and Related Activities in Japan

NASA Astrophysics Data System (ADS)

Murakami, Izumi; Kato, Daiji; Kato, Masatoshi; Sakaue, Hiroyuki A.; Kato, Takako; Ding, Xiaobin; Morita, Shigeru; Kitajima, Masashi; Koike, Fumihiro; Nakamura, Nobuyuki; Sakamoto, Naoki; Sasaki, Akira; Skobelev, Igor; Tsuchida, Hidetsugu; Ulantsev, Artemiy; Watanabe, Tetsuya; Yamamoto, Norimasa

2011-05-01

We have constructed and made available atomic and molecular (AM) numerical databases on collision processes such as electron-impact excitation and ionization, recombination and charge transfer of atoms and molecules relevant for plasma physics, fusion research, astrophysics, applied-science plasma, and other related areas. The retrievable data is freely accessible via the internet. We also work on atomic data evaluation and constructing collisional-radiative models for spectroscopic plasma diagnostics. Recently we have worked on Fe ions and W ions theoretically and experimentally. The atomic data and collisional-radiative models for these ions are examined and applied to laboratory plasmas. A visible M1 transition of W26+ ion is identified at 389.41 nm by EBIT experiments and theoretical calculations. We have small non-retrievable databases in addition to our main database. Recently we evaluated photo-absorption cross sections for 9 atoms and 23 molecules and we present them as a new database. We established a new association "Forum of Atomic and Molecular Data and Their Applications" to exchange information among AM data producers, data providers and data users in Japan and we hope this will help to encourage AM data activities in Japan.
Autophagic compound database: A resource connecting autophagy-modulating compounds, their potential targets and relevant diseases.

PubMed

Deng, Yiqi; Zhu, Lingjuan; Cai, Haoyang; Wang, Guan; Liu, Bo

2018-06-01

Autophagy, a highly conserved lysosomal degradation process in eukaryotic cells, can digest long-lived proteins and damaged organelles through vesicular trafficking pathways. Nowadays, mechanisms of autophagy have been gradually elucidated and thus the discovery of small-molecule drugs targeting autophagy has always been drawing much attention. So far, some autophagy-related web servers have been available online to facilitate scientists to obtain the information relevant to autophagy conveniently, such as HADb, CTLPScanner, iLIR server and ncRDeathDB. However, to the best of our knowledge, there is not any web server available about the autophagy-modulating compounds. According to published articles, all the compounds and their relations with autophagy were anatomized. Subsequently, an online Autophagic Compound Database (ACDB) (http://www.acdbliulab.com/) was constructed, which contained information of 357 compounds with 164 corresponding signalling pathways and potential targets in different diseases. We achieved a great deal of information of autophagy-modulating compounds, including compounds, targets/pathways and diseases. ACDB is a valuable resource for users to access to more than 300 curated small-molecule compounds correlated with autophagy. Autophagic compound database will facilitate to the discovery of more novel therapeutic drugs in the near future. © 2017 John Wiley & Sons Ltd.
Virtual Exploration of the Ring Systems Chemical Universe.

PubMed

Visini, Ricardo; Arús-Pous, Josep; Awale, Mahendra; Reymond, Jean-Louis

2017-11-27

Here, we explore the chemical space of all virtually possible organic molecules focusing on ring systems, which represent the cyclic cores of organic molecules obtained by removing all acyclic bonds and converting all remaining atoms to carbon. This approach circumvents the combinatorial explosion encountered when enumerating the molecules themselves. We report the chemical universe database GDB4c containing 916 130 ring systems up to four saturated or aromatic rings and maximum ring size of 14 atoms and GDB4c3D containing the corresponding 6 555 929 stereoisomers. Almost all (98.6%) of these ring systems are unknown and represent chiral 3D-shaped macrocycles containing small rings and quaternary centers reminiscent of polycyclic natural products. We envision that GDB4c can serve to select new ring systems from which to design analogs of such natural products. The database is available for download at www.gdb.unibe.ch together with interactive visualization and search tools as a resource for molecular design.
A Chemoinformatics Approach to the Discovery of Lead-Like Molecules from Marine and Microbial Sources En Route to Antitumor and Antibiotic Drugs

PubMed Central

Pereira, Florbela; Latino, Diogo A. R. S.; Gaudêncio, Susana P.

2014-01-01

The comprehensive information of small molecules and their biological activities in the PubChem database allows chemoinformatic researchers to access and make use of large-scale biological activity data to improve the precision of drug profiling. A Quantitative Structure–Activity Relationship approach, for classification, was used for the prediction of active/inactive compounds relatively to overall biological activity, antitumor and antibiotic activities using a data set of 1804 compounds from PubChem. Using the best classification models for antibiotic and antitumor activities a data set of marine and microbial natural products from the AntiMarin database were screened—57 and 16 new lead compounds for antibiotic and antitumor drug design were proposed, respectively. All compounds proposed by our approach are classified as non-antibiotic and non-antitumor compounds in the AntiMarin database. Recently several of the lead-like compounds proposed by us were reported as being active in the literature. PMID:24473174
Identification and characterization of small molecule inhibitors of the calcium-dependent S100B-p53 tumor suppressor interaction.

PubMed

Markowitz, Joseph; Chen, Ijen; Gitti, Rossi; Baldisseri, Donna M; Pan, Yongping; Udan, Ryan; Carrier, France; MacKerell, Alexander D; Weber, David J

2004-10-07

The binding of S100B to p53 down-regulates wild-type p53 tumor suppressor activity in cancer cells such as malignant melanoma, so a search for small molecules that bind S100B and prevent S100B-p53 complex formation was undertaken. Chemical databases were computationally searched for potential inhibitors of S100B, and 60 compounds were selected for testing on the basis of energy scoring, commercial availability, and chemical similarity clustering. Seven of these compounds bound to S100B as determined by steady state fluorescence spectroscopy (1.0 microM < or = K(D) < or = 120 microM) and five inhibited the growth of primary malignant melanoma cells (C8146A) at comparable concentrations (1.0 microM < or = IC(50) < or = 50 microM). Additionally, saturation transfer difference (STD) NMR experiments confirmed binding and qualitatively identified protons from the small molecule at the small molecule-S100B interface. Heteronuclear single quantum coherence (HSQC) NMR titrations indicate that these compounds interact with the p53 binding site on S100B. An NMR-docked model of one such inhibitor, pentamidine, bound to Ca(2+)-loaded S100B was calculated using intermolecular NOE data between S100B and the drug, and indicates that pentamidine binds into the p53 binding site on S100B defined by helices 3 and 4 and loop 2 (termed the hinge region).
ZINC: A Free Tool to Discover Chemistry for Biology

PubMed Central

2012-01-01

ZINC is a free public resource for ligand discovery. The database contains over twenty million commercially available molecules in biologically relevant representations that may be downloaded in popular ready-to-dock formats and subsets. The Web site also enables searches by structure, biological activity, physical property, vendor, catalog number, name, and CAS number. Small custom subsets may be created, edited, shared, docked, downloaded, and conveyed to a vendor for purchase. The database is maintained and curated for a high purchasing success rate and is freely available at zinc.docking.org. PMID:22587354
Chemoinformatic Analysis of Combinatorial Libraries, Drugs, Natural Products and Molecular Libraries Small Molecule Repository

PubMed Central

Singh, Narender; Guha, Rajarshi; Giulianotti, Marc; Pinilla, Clemencia; Houghten, Richard; Medina-Franco, Jose L.

2009-01-01

A multiple criteria approach is presented, that is used to perform a comparative analysis of four recently developed combinatorial libraries to drugs, Molecular Libraries Small Molecule Repository (MLSMR) and natural products. The compound databases were assessed in terms of physicochemical properties, scaffolds and fingerprints. The approach enables the analysis of property space coverage, degree of overlap between collections, scaffold and structural diversity and overall structural novelty. The degree of overlap between combinatorial libraries and drugs was assessed using the R-NN curve methodology, which measures the density of chemical space around a query molecule embedded in the chemical space of a target collection. The combinatorial libraries studied in this work exhibit scaffolds that were not observed in the drug, MLSMR and natural products collections. The fingerprint-based comparisons indicate that these combinatorial libraries are structurally different to current drugs. The R-NN curve methodology revealed that a proportion of molecules in the combinatorial libraries are located within the property space of the drugs. However, the R-NN analysis also showed that there are a significant number of molecules in several combinatorial libraries that are located in sparse regions of the drug space. PMID:19301827
Incorporating Virtual Reactions into a Logic-based Ligand-based Virtual Screening Method to Discover New Leads

PubMed Central

Reynolds, Christopher R; Muggleton, Stephen H; Sternberg, Michael J E

2015-01-01

The use of virtual screening has become increasingly central to the drug development pipeline, with ligand-based virtual screening used to screen databases of compounds to predict their bioactivity against a target. These databases can only represent a small fraction of chemical space, and this paper describes a method of exploring synthetic space by applying virtual reactions to promising compounds within a database, and generating focussed libraries of predicted derivatives. A ligand-based virtual screening tool Investigational Novel Drug Discovery by Example (INDDEx) is used as the basis for a system of virtual reactions. The use of virtual reactions is estimated to open up a potential space of 1.21×1012 potential molecules. A de novo design algorithm known as Partial Logical-Rule Reactant Selection (PLoRRS) is introduced and incorporated into the INDDEx methodology. PLoRRS uses logical rules from the INDDEx model to select reactants for the de novo generation of potentially active products. The PLoRRS method is found to increase significantly the likelihood of retrieving molecules similar to known actives with a p-value of 0.016. Case studies demonstrate that the virtual reactions produce molecules highly similar to known actives, including known blockbuster drugs. PMID:26583052
Ambiguity of non-systematic chemical identifiers within and between small-molecule databases.

PubMed

Akhondi, Saber A; Muresan, Sorel; Williams, Antony J; Kors, Jan A

2015-01-01

A wide range of chemical compound databases are currently available for pharmaceutical research. To retrieve compound information, including structures, researchers can query these chemical databases using non-systematic identifiers. These are source-dependent identifiers (e.g., brand names, generic names), which are usually assigned to the compound at the point of registration. The correctness of non-systematic identifiers (i.e., whether an identifier matches the associated structure) can only be assessed manually, which is cumbersome, but it is possible to automatically check their ambiguity (i.e., whether an identifier matches more than one structure). In this study we have quantified the ambiguity of non-systematic identifiers within and between eight widely used chemical databases. We also studied the effect of chemical structure standardization on reducing the ambiguity of non-systematic identifiers. The ambiguity of non-systematic identifiers within databases varied from 0.1 to 15.2 % (median 2.5 %). Standardization reduced the ambiguity only to a small extent for most databases. A wide range of ambiguity existed for non-systematic identifiers that are shared between databases (17.7-60.2 %, median of 40.3 %). Removing stereochemistry information provided the largest reduction in ambiguity across databases (median reduction 13.7 percentage points). Ambiguity of non-systematic identifiers within chemical databases is generally low, but ambiguity of non-systematic identifiers that are shared between databases, is high. Chemical structure standardization reduces the ambiguity to a limited extent. Our findings can help to improve database integration, curation, and maintenance.
Screening for small molecule inhibitors of Toxoplasma gondii.

PubMed

Kortagere, Sandhya

2012-12-01

Toxoplasma gondii, the agent that causes toxoplasmosis, is an opportunistic parasite that infects many mammalian species. It is an obligate intracellular parasite that causes severe congenital neurological and ocular disease mostly in immunocompromised humans. The current regimen of therapy includes only a few medications that often lead to hypersensitivity and toxicity. In addition, there are no vaccines available to prevent the transmission of this agent. Therefore, safer and more effective medicines to treat toxoplasmosis are urgently needed. The author presents in silico and in vitro strategies that are currently used to screen for novel targets and unique chemotypes against T. gondii. Furthermore, this review highlights the screening technologies and characterization of some novel targets and new chemical entities that could be developed into highly efficacious treatments for toxoplasmosis. A number of diverse methods are being used to design inhibitors against T. gondii. These include ligand-based methods, in which drugs that have been shown to be efficacious against other Apicomplexa parasites can be repurposed to identify lead molecules against T. gondii. In addition, structure-based methods use currently available repertoire of structural information in various databases to rationally design small-molecule inhibitors of T. gondii. Whereas the screening methods have their advantages and limitations, a combination of methods is ideally suited to design small-molecule inhibitors of complex parasites such as T. gondii.
Signaling gateway molecule pages—a data model perspective

PubMed Central

Dinasarapu, Ashok Reddy; Saunders, Brian; Ozerlat, Iley; Azam, Kenan; Subramaniam, Shankar

2011-01-01

Summary: The Signaling Gateway Molecule Pages (SGMP) database provides highly structured data on proteins which exist in different functional states participating in signal transduction pathways. A molecule page starts with a state of a native protein, without any modification and/or interactions. New states are formed with every post-translational modification or interaction with one or more proteins, small molecules or class molecules and with each change in cellular location. State transitions are caused by a combination of one or more modifications, interactions and translocations which then might be associated with one or more biological processes. In a characterized biological state, a molecule can function as one of several entities or their combinations, including channel, receptor, enzyme, transcription factor and transporter. We have also exported SGMP data to the Biological Pathway Exchange (BioPAX) and Systems Biology Markup Language (SBML) as well as in our custom XML. Availability: SGMP is available at www.signaling-gateway.org/molecule. Contact: shankar@ucsd.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:21505029

Increasing rigor in NMR-based metabolomics through validated and open source tools

PubMed Central

Eghbalnia, Hamid R; Romero, Pedro R; Westler, William M; Baskaran, Kumaran; Ulrich, Eldon L; Markley, John L

2016-01-01

The metabolome, the collection of small molecules associated with an organism, is a growing subject of inquiry, with the data utilized for data-intensive systems biology, disease diagnostics, biomarker discovery, and the broader characterization of small molecules in mixtures. Owing to their close proximity to the functional endpoints that govern an organism’s phenotype, metabolites are highly informative about functional states. The field of metabolomics identifies and quantifies endogenous and exogenous metabolites in biological samples. Information acquired from nuclear magnetic spectroscopy (NMR), mass spectrometry (MS), and the published literature, as processed by statistical approaches, are driving increasingly wider applications of metabolomics. This review focuses on the role of databases and software tools in advancing the rigor, robustness, reproducibility, and validation of metabolomics studies. PMID:27643760
In silico screening for Plasmodium falciparum enoyl-ACP reductase inhibitors

NASA Astrophysics Data System (ADS)

Lindert, Steffen; Tallorin, Lorillee; Nguyen, Quynh G.; Burkart, Michael D.; McCammon, J. Andrew

2015-01-01

The need for novel therapeutics against Plasmodium falciparum is urgent due to recent emergence of multi-drug resistant malaria parasites. Since fatty acids are essential for both the liver and blood stages of the malarial parasite, targeting fatty acid biosynthesis is a promising strategy for combatting P. falciparum. We present a combined computational and experimental study to identify novel inhibitors of enoyl-acyl carrier protein reductase ( PfENR) in the fatty acid biosynthesis pathway. A small-molecule database from ChemBridge was docked into three distinct PfENR crystal structures that provide multiple receptor conformations. Two different docking algorithms were used to generate a consensus score in order to rank possible small molecule hits. Our studies led to the identification of five low-micromolar pyrimidine dione inhibitors of PfENR.
Increasing rigor in NMR-based metabolomics through validated and open source tools.

PubMed

Eghbalnia, Hamid R; Romero, Pedro R; Westler, William M; Baskaran, Kumaran; Ulrich, Eldon L; Markley, John L

2017-02-01

The metabolome, the collection of small molecules associated with an organism, is a growing subject of inquiry, with the data utilized for data-intensive systems biology, disease diagnostics, biomarker discovery, and the broader characterization of small molecules in mixtures. Owing to their close proximity to the functional endpoints that govern an organism's phenotype, metabolites are highly informative about functional states. The field of metabolomics identifies and quantifies endogenous and exogenous metabolites in biological samples. Information acquired from nuclear magnetic spectroscopy (NMR), mass spectrometry (MS), and the published literature, as processed by statistical approaches, are driving increasingly wider applications of metabolomics. This review focuses on the role of databases and software tools in advancing the rigor, robustness, reproducibility, and validation of metabolomics studies. Copyright © 2016. Published by Elsevier Ltd.
Identification of DNA primase inhibitors via a combined fragment-based and virtual screening

NASA Astrophysics Data System (ADS)

Ilic, Stefan; Akabayov, Sabine R.; Arthanari, Haribabu; Wagner, Gerhard; Richardson, Charles C.; Akabayov, Barak

2016-11-01

The structural differences between bacterial and human primases render the former an excellent target for drug design. Here we describe a technique for selecting small molecule inhibitors of the activity of T7 DNA primase, an ideal model for bacterial primases due to their common structural and functional features. Using NMR screening, fragment molecules that bind T7 primase were identified and then exploited in virtual filtration to select larger molecules from the ZINC database. The molecules were docked to the primase active site using the available primase crystal structure and ranked based on their predicted binding energies to identify the best candidates for functional and structural investigations. Biochemical assays revealed that some of the molecules inhibit T7 primase-dependent DNA replication. The binding mechanism was delineated via NMR spectroscopy. Our approach, which combines fragment based and virtual screening, is rapid and cost effective and can be applied to other targets.
Chemical annotation of small and peptide-like molecules at the Protein Data Bank

PubMed Central

Young, Jasmine Y.; Feng, Zukang; Dimitropoulos, Dimitris; Sala, Raul; Westbrook, John; Zhuravleva, Marina; Shao, Chenghua; Quesada, Martha; Peisach, Ezra; Berman, Helen M.

2013-01-01

Over the past decade, the number of polymers and their complexes with small molecules in the Protein Data Bank archive (PDB) has continued to increase significantly. To support scientific advancements and ensure the best quality and completeness of the data files over the next 10 years and beyond, the Worldwide PDB partnership that manages the PDB archive is developing a new deposition and annotation system. This system focuses on efficient data capture across all supported experimental methods. The new deposition and annotation system is composed of four major modules that together support all of the processing requirements for a PDB entry. In this article, we describe one such module called the Chemical Component Annotation Tool. This tool uses information from both the Chemical Component Dictionary and Biologically Interesting molecule Reference Dictionary to aid in annotation. Benchmark studies have shown that the Chemical Component Annotation Tool provides significant improvements in processing efficiency and data quality. Database URL: http://wwpdb.org PMID:24291661
Chemical annotation of small and peptide-like molecules at the Protein Data Bank.

PubMed

Young, Jasmine Y; Feng, Zukang; Dimitropoulos, Dimitris; Sala, Raul; Westbrook, John; Zhuravleva, Marina; Shao, Chenghua; Quesada, Martha; Peisach, Ezra; Berman, Helen M

2013-01-01

Over the past decade, the number of polymers and their complexes with small molecules in the Protein Data Bank archive (PDB) has continued to increase significantly. To support scientific advancements and ensure the best quality and completeness of the data files over the next 10 years and beyond, the Worldwide PDB partnership that manages the PDB archive is developing a new deposition and annotation system. This system focuses on efficient data capture across all supported experimental methods. The new deposition and annotation system is composed of four major modules that together support all of the processing requirements for a PDB entry. In this article, we describe one such module called the Chemical Component Annotation Tool. This tool uses information from both the Chemical Component Dictionary and Biologically Interesting molecule Reference Dictionary to aid in annotation. Benchmark studies have shown that the Chemical Component Annotation Tool provides significant improvements in processing efficiency and data quality. Database URL: http://wwpdb.org.
FRASS: the web-server for RNA structural comparison

PubMed Central

2010-01-01

Background The impressive increase of novel RNA structures, during the past few years, demands automated methods for structure comparison. While many algorithms handle only small motifs, few techniques, developed in recent years, (ARTS, DIAL, SARA, SARSA, and LaJolla) are available for the structural comparison of large and intact RNA molecules. Results The FRASS web-server represents a RNA chain with its Gauss integrals and allows one to compare structures of RNA chains and to find similar entries in a database derived from the Protein Data Bank. We observed that FRASS scores correlate well with the ARTS and LaJolla similarity scores. Moreover, the-web server can also reproduce satisfactorily the DARTS classification of RNA 3D structures and the classification of the SCOR functions that was obtained by the SARA method. Conclusions The FRASS web-server can be easily used to detect relationships among RNA molecules and to scan efficiently the rapidly enlarging structural databases. PMID:20553602
TR-DB: an open-access database of compounds affecting the ethylene-induced triple response in Arabidopsis.

PubMed

Hu, Yuming; Callebert, Pieter; Vandemoortel, Ilse; Nguyen, Long; Audenaert, Dominique; Verschraegen, Luc; Vandenbussche, Filip; Van Der Straeten, Dominique

2014-02-01

Small molecules which act as hormone agonists or antagonists represent useful tools in fundamental research and are widely applied in agriculture to control hormone effects. High-throughput screening of large chemical compound libraries has yielded new findings in plant biology, with possible future applications in agriculture and horticulture. To further understand ethylene biosynthesis/signaling and its crosstalk with other hormones, we screened a 12,000 compound chemical library based on an ethylene-related bioassay of dark-grown Arabidopsis thaliana (L.) Heynh. seedlings. From the initial screening, 1313 (∼11%) biologically active small molecules altering the phenotype triggered by the ethylene precursor 1-aminocyclopropane-1-carboxylic acid (ACC), were identified. Selection and sorting in classes were based on the angle of curvature of the apical hook, the length and width of the hypocotyl and the root. A MySQL-database was constructed (https://chaos.ugent.be/WE15/) including basic chemical information on the compounds, images illustrating the phenotypes, phenotype descriptions and classification. The research perspectives for different classes of hit compounds will be evaluated, and some general screening tips for customized high-throughput screening and pitfalls will be discussed. Copyright © 2013 Elsevier Masson SAS. All rights reserved.
UPLC-MS-ELSD-PDA as a powerful dereplication tool to facilitate compound identification from small-molecule natural product libraries.

PubMed

Yang, Jin; Liang, Qian; Wang, Mei; Jeffries, Cynthia; Smithson, David; Tu, Ying; Boulos, Nidal; Jacob, Melissa R; Shelat, Anang A; Wu, Yunshan; Ravu, Ranga Rao; Gilbertson, Richard; Avery, Mitchell A; Khan, Ikhlas A; Walker, Larry A; Guy, R Kiplin; Li, Xing-Cong

2014-04-25

The generation of natural product libraries containing column fractions, each with only a few small molecules, using a high-throughput, automated fractionation system, has made it possible to implement an improved dereplication strategy for selection and prioritization of leads in a natural product discovery program. Analysis of databased UPLC-MS-ELSD-PDA information of three leads from a biological screen employing the ependymoma cell line EphB2-EPD generated details on the possible structures of active compounds present. The procedure allows the rapid identification of known compounds and guides the isolation of unknown compounds of interest. Three previously known flavanone-type compounds, homoeriodictyol (1), hesperetin (2), and sterubin (3), were identified in a selected fraction derived from the leaves of Eriodictyon angustifolium. The lignan compound deoxypodophyllotoxin (8) was confirmed to be an active constituent in two lead fractions derived from the bark and leaves of Thuja occidentalis. In addition, two new but inactive labdane-type diterpenoids with an uncommon triol side chain were also identified as coexisting with deoxypodophyllotoxin in a lead fraction from the bark of T. occidentalis. Both diterpenoids were isolated in acetylated form, and their structures were determined as 14S,15-diacetoxy-13R-hydroxylabd-8(17)-en-19-oic acid (9) and 14R,15-diacetoxy-13S-hydroxylabd-8(17)-en-19-oic acid (10), respectively, by spectroscopic data interpretation and X-ray crystallography. This work demonstrates that a UPLC-MS-ELSD-PDA database produced during fractionation may be used as a powerful dereplication tool to facilitate compound identification from chromatographically tractable small-molecule natural product libraries.
A Small-molecule Inhibitor, 5′-O-Tritylthymidine, targets FAK and Mdm-2 Interaction, and Blocks Breast and Colon Tumorigenesis in vivo

PubMed Central

Golubovskaya, Vita; Palma, Nadia L.; Zheng, Min; Ho, Baotran; Magis, Andrew; Ostrov, David; Cance, William G.

2013-01-01

Focal Adhesion Kinase (FAK) is overexpressed in many types of tumors and plays an important role in survival. We developed a novel approach, targeting FAK-protein interactions by computer modeling and screening of NCI small molecule drug database. In this report we targeted FAK and Mdm-2 protein interaction to decrease tumor growth. By macromolecular modeling we found a model of FAK and Mdm-2 interaction and performed screening of >200,000 small molecule compounds from NCI database with drug-like characteristics, targeting the FAK-Mdm-2 interaction. We identified 5′-O-Tritylthymidine, called M13 compound that significantly decreased viability in different cancer cells. M13 was docked into the pocket of FAK and Mdm-2 interaction and was directly bound to the FAK-N terminal domain by ForteBio Octet assay. In addition, M13 compound affected FAK and Mdm-2 levels and decreased complex of FAK and Mdm-2 proteins in breast and colon cancer cells. M13 re-activated p53 activity inhibited by FAK with Mdm-2 promoter. M13 decreased viability, clonogenicity, increased detachment and apoptosis in a dose-dependent manner in BT474 breast and in HCT116 colon cancer cells in vitro. M13 decreased FAK, activated p53 and caspase-8 in both cell lines. In addition, M13 decreased breast and colon tumor growth in vivo. M13 activated p53 and decreased FAK in tumor samples consistent with decreased tumor growth. The data demonstrate a novel approach for targeting FAK and Mdm-2 protein interaction, provide a model of FAK and Mdm-2 interaction, identify M13 compound targeting this interaction and decreasing tumor growth that is critical for future targeted therapeutics. PMID:22292771
Pharmacogenomic identification of small molecules for lineage specific manipulation of subventricular zone germinal activity.

PubMed

Azim, Kasum; Angonin, Diane; Marcy, Guillaume; Pieropan, Francesca; Rivera, Andrea; Donega, Vanessa; Cantù, Claudio; Williams, Gareth; Berninger, Benedikt; Butt, Arthur M; Raineteau, Olivier

2017-03-01

Strategies for promoting neural regeneration are hindered by the difficulty of manipulating desired neural fates in the brain without complex genetic methods. The subventricular zone (SVZ) is the largest germinal zone of the forebrain and is responsible for the lifelong generation of interneuron subtypes and oligodendrocytes. Here, we have performed a bioinformatics analysis of the transcriptome of dorsal and lateral SVZ in early postnatal mice, including neural stem cells (NSCs) and their immediate progenies, which generate distinct neural lineages. We identified multiple signaling pathways that trigger distinct downstream transcriptional networks to regulate the diversity of neural cells originating from the SVZ. Next, we used a novel in silico genomic analysis, searchable platform-independent expression database/connectivity map (SPIED/CMAP), to generate a catalogue of small molecules that can be used to manipulate SVZ microdomain-specific lineages. Finally, we demonstrate that compounds identified in this analysis promote the generation of specific cell lineages from NSCs in vivo, during postnatal life and adulthood, as well as in regenerative contexts. This study unravels new strategies for using small bioactive molecules to direct germinal activity in the SVZ, which has therapeutic potential in neurodegenerative diseases.
Discovery of small molecules binding to the normal conformation of prion by combining virtual screening and multiple biological activity evaluation methods

NASA Astrophysics Data System (ADS)

Li, Lanlan; Wei, Wei; Jia, Wen-Juan; Zhu, Yongchang; Zhang, Yan; Chen, Jiang-Huai; Tian, Jiaqi; Liu, Huanxiang; He, Yong-Xing; Yao, Xiaojun

2017-12-01

Conformational conversion of the normal cellular prion protein, PrPC, into the misfolded isoform, PrPSc, is considered to be a central event in the development of fatal neurodegenerative diseases. Stabilization of prion protein at the normal cellular form (PrPC) with small molecules is a rational and efficient strategy for treatment of prion related diseases. However, few compounds have been identified as potent prion inhibitors by binding to the normal conformation of prion. In this work, to rational screening of inhibitors capable of stabilizing cellular form of prion protein, multiple approaches combining docking-based virtual screening, steady-state fluorescence quenching, surface plasmon resonance and thioflavin T fluorescence assay were used to discover new compounds interrupting PrPC to PrPSc conversion. Compound 3253-0207 that can bind to PrPC with micromolar affinity and inhibit prion fibrillation was identified from small molecule databases. Molecular dynamics simulation indicated that compound 3253-0207 can bind to the hotspot residues in the binding pocket composed by β1, β2 and α2, which are significant structure moieties in conversion from PrPC to PrPSc.
PubChem BioAssay: 2017 update

PubMed Central

Wang, Yanli; Bryant, Stephen H.; Cheng, Tiejun; Wang, Jiyao; Gindulyte, Asta; Shoemaker, Benjamin A.; Thiessen, Paul A.; He, Siqian; Zhang, Jian

2017-01-01

PubChem's BioAssay database (https://pubchem.ncbi.nlm.nih.gov) has served as a public repository for small-molecule and RNAi screening data since 2004 providing open access of its data content to the community. PubChem accepts data submission from worldwide researchers at academia, industry and government agencies. PubChem also collaborates with other chemical biology database stakeholders with data exchange. With over a decade's development effort, it becomes an important information resource supporting drug discovery and chemical biology research. To facilitate data discovery, PubChem is integrated with all other databases at NCBI. In this work, we provide an update for the PubChem BioAssay database describing several recent development including added sources of research data, redesigned BioAssay record page, new BioAssay classification browser and new features in the Upload system facilitating data sharing. PMID:27899599
Value of shared preclinical safety studies - The eTOX database.

PubMed

Briggs, Katharine; Barber, Chris; Cases, Montserrat; Marc, Philippe; Steger-Hartmann, Thomas

2015-01-01

A first analysis of a database of shared preclinical safety data for 1214 small molecule drugs and drug candidates extracted from 3970 reports donated by thirteen pharmaceutical companies for the eTOX project (www.etoxproject.eu) is presented. Species, duration of exposure and administration route data were analysed to assess if large enough subsets of homogenous data are available for building in silico predictive models. Prevalence of treatment related effects for the different types of findings recorded were analysed. The eTOX ontology was used to determine the most common treatment-related clinical chemistry and histopathology findings reported in the database. The data were then mined to evaluate sensitivity of established in vivo biomarkers for liver toxicity risk assessment. The value of the database to inform other drug development projects during early drug development is illustrated by a case study.
PharmMapper server: a web server for potential drug target identification using pharmacophore mapping approach

PubMed Central

Liu, Xiaofeng; Ouyang, Sisheng; Yu, Biao; Liu, Yabo; Huang, Kai; Gong, Jiayu; Zheng, Siyuan; Li, Zhihua; Li, Honglin; Jiang, Hualiang

2010-01-01

In silico drug target identification, which includes many distinct algorithms for finding disease genes and proteins, is the first step in the drug discovery pipeline. When the 3D structures of the targets are available, the problem of target identification is usually converted to finding the best interaction mode between the potential target candidates and small molecule probes. Pharmacophore, which is the spatial arrangement of features essential for a molecule to interact with a specific target receptor, is an alternative method for achieving this goal apart from molecular docking method. PharmMapper server is a freely accessed web server designed to identify potential target candidates for the given small molecules (drugs, natural products or other newly discovered compounds with unidentified binding targets) using pharmacophore mapping approach. PharmMapper hosts a large, in-house repertoire of pharmacophore database (namely PharmTargetDB) annotated from all the targets information in TargetBank, BindingDB, DrugBank and potential drug target database, including over 7000 receptor-based pharmacophore models (covering over 1500 drug targets information). PharmMapper automatically finds the best mapping poses of the query molecule against all the pharmacophore models in PharmTargetDB and lists the top N best-fitted hits with appropriate target annotations, as well as respective molecule’s aligned poses are presented. Benefited from the highly efficient and robust triangle hashing mapping method, PharmMapper bears high throughput ability and only costs 1 h averagely to screen the whole PharmTargetDB. The protocol was successful in finding the proper targets among the top 300 pharmacophore candidates in the retrospective benchmarking test of tamoxifen. PharmMapper is available at http://59.78.96.61/pharmmapper. PMID:20430828
News from Online: What's New with Chime?

NASA Astrophysics Data System (ADS)

Dorland, Liz

2002-07-01

The Chime plugin (pronounced like the bells) provides a simple route to presenting interactive molecular structures to students via the Internet or in classroom presentations. Small inorganic molecules, ionic structures, organic molecules and giant macromolecules can all be viewed in several formats including ball and stick and spacefilling. Extensive Chime resources on the Internet allow chemistry and biochemistry instructors to create their own Web pages or to use some of the many tutorials for students already online. This article describes about twenty Chime-based Web sites in three categories: Chime Resources, Materials for Student and Classroom Use, and Structure Databases. A list of links is provided.
Creating and virtually screening databases of fluorescently-labelled compounds for the discovery of target-specific molecular probes

NASA Astrophysics Data System (ADS)

Kamstra, Rhiannon L.; Dadgar, Saedeh; Wigg, John; Chowdhury, Morshed A.; Phenix, Christopher P.; Floriano, Wely B.

2014-11-01

Our group has recently demonstrated that virtual screening is a useful technique for the identification of target-specific molecular probes. In this paper, we discuss some of our proof-of-concept results involving two biologically relevant target proteins, and report the development of a computational script to generate large databases of fluorescence-labelled compounds for computer-assisted molecular design. The virtual screening of a small library of 1,153 fluorescently-labelled compounds against two targets, and the experimental testing of selected hits reveal that this approach is efficient at identifying molecular probes, and that the screening of a labelled library is preferred over the screening of base compounds followed by conjugation of confirmed hits. The automated script for library generation explores the known reactivity of commercially available dyes, such as NHS-esters, to create large virtual databases of fluorescence-tagged small molecules that can be easily synthesized in a laboratory. A database of 14,862 compounds, each tagged with the ATTO680 fluorophore was generated with the automated script reported here. This library is available for downloading and it is suitable for virtual ligand screening aiming at the identification of target-specific fluorescent molecular probes.
vSDC: a method to improve early recognition in virtual screening when limited experimental resources are available.

PubMed

Chaput, Ludovic; Martinez-Sanz, Juan; Quiniou, Eric; Rigolet, Pascal; Saettel, Nicolas; Mouawad, Liliane

2016-01-01

In drug design, one may be confronted to the problem of finding hits for targets for which no small inhibiting molecules are known and only low-throughput experiments are available (like ITC or NMR studies), two common difficulties encountered in a typical academic setting. Using a virtual screening strategy like docking can alleviate some of the problems and save a considerable amount of time by selecting only top-ranking molecules, but only if the method is very efficient, i.e. when a good proportion of actives are found in the 1-10 % best ranked molecules. The use of several programs (in our study, Gold, Surflex, FlexX and Glide were considered) shows a divergence of the results, which presents a difficulty in guiding the experiments. To overcome this divergence and increase the yield of the virtual screening, we created the standard deviation consensus (SDC) and variable SDC (vSDC) methods, consisting of the intersection of molecule sets from several virtual screening programs, based on the standard deviations of their ranking distributions. SDC allowed us to find hits for two new protein targets by testing only 9 and 11 small molecules from a chemical library of circa 15,000 compounds. Furthermore, vSDC, when applied to the 102 proteins of the DUD-E benchmarking database, succeeded in finding more hits than any of the four isolated programs for 13-60 % of the targets. In addition, when only 10 molecules of each of the 102 chemical libraries were considered, vSDC performed better in the number of hits found, with an improvement of 6-24 % over the 10 best-ranked molecules given by the individual docking programs.Graphical abstractIn drug design, for a given target and a given chemical library, the results obtained with different virtual screening programs are divergent. So how to rationally guide the experimental tests, especially when only a few number of experiments can be made? The variable Standard Deviation Consensus (vSDC) method was developed to answer this issue. Left panel the vSDC principle consists of intersecting molecule sets, chosen on the basis of the standard deviations of their ranking distributions, obtained from various virtual screening programs. In this study Glide, Gold, FlexX and Surflex were used and tested on the 102 targets of the DUD-E database. Right panel Comparison of the average percentage of hits found with vSDC and each of the four programs, when only 10 molecules from each of the 102 chemical libraries of the DUD-E database were considered. On average, vSDC was capable of finding 38 % of the findable hits, against 34 % for Glide, 32 % for Gold, 16 % for FlexX and 14 % for Surflex, showing that with vSDC, it was possible to overcome the unpredictability of the virtual screening results and to improve them.
Exploring the role of water in molecular recognition: predicting protein ligandability using a combinatorial search of surface hydration sites.

PubMed

Vukovic, Sinisa; Brennan, Paul E; Huggins, David J

2016-09-01

The interaction between any two biological molecules must compete with their interaction with water molecules. This makes water the most important molecule in medicine, as it controls the interactions of every therapeutic with its target. A small molecule binding to a protein is able to recognize a unique binding site on a protein by displacing bound water molecules from specific hydration sites. Quantifying the interactions of these water molecules allows us to estimate the potential of the protein to bind a small molecule. This is referred to as ligandability. In the study, we describe a method to predict ligandability by performing a search of all possible combinations of hydration sites on protein surfaces. We predict ligandability as the summed binding free energy for each of the constituent hydration sites, computed using inhomogeneous fluid solvation theory. We compared the predicted ligandability with the maximum observed binding affinity for 20 proteins in the human bromodomain family. Based on this comparison, it was determined that effective inhibitors have been developed for the majority of bromodomains, in the range from 10 to 100 nM. However, we predict that more potent inhibitors can be developed for the bromodomains BPTF and BRD7 with relative ease, but that further efforts to develop inhibitors for ATAD2 will be extremely challenging. We have also made predictions for the 14 bromodomains with no reported small molecule K d values by isothermal titration calorimetry. The calculations predict that PBRM1(1) will be a challenging target, while others such as TAF1L(2), PBRM1(4) and TAF1(2), should be highly ligandable. As an outcome of this work, we assembled a database of experimental maximal K d that can serve as a community resource assisting medicinal chemistry efforts focused on BRDs. Effective prediction of ligandability would be a very useful tool in the drug discovery process.
Exploring the role of water in molecular recognition: predicting protein ligandability using a combinatorial search of surface hydration sites

NASA Astrophysics Data System (ADS)

Vukovic, Sinisa; Brennan, Paul E.; Huggins, David J.

2016-09-01

The interaction between any two biological molecules must compete with their interaction with water molecules. This makes water the most important molecule in medicine, as it controls the interactions of every therapeutic with its target. A small molecule binding to a protein is able to recognize a unique binding site on a protein by displacing bound water molecules from specific hydration sites. Quantifying the interactions of these water molecules allows us to estimate the potential of the protein to bind a small molecule. This is referred to as ligandability. In the study, we describe a method to predict ligandability by performing a search of all possible combinations of hydration sites on protein surfaces. We predict ligandability as the summed binding free energy for each of the constituent hydration sites, computed using inhomogeneous fluid solvation theory. We compared the predicted ligandability with the maximum observed binding affinity for 20 proteins in the human bromodomain family. Based on this comparison, it was determined that effective inhibitors have been developed for the majority of bromodomains, in the range from 10 to 100 nM. However, we predict that more potent inhibitors can be developed for the bromodomains BPTF and BRD7 with relative ease, but that further efforts to develop inhibitors for ATAD2 will be extremely challenging. We have also made predictions for the 14 bromodomains with no reported small molecule K d values by isothermal titration calorimetry. The calculations predict that PBRM1(1) will be a challenging target, while others such as TAF1L(2), PBRM1(4) and TAF1(2), should be highly ligandable. As an outcome of this work, we assembled a database of experimental maximal K d that can serve as a community resource assisting medicinal chemistry efforts focused on BRDs. Effective prediction of ligandability would be a very useful tool in the drug discovery process.

Vibrational Spectroscopy and Astrobiology

NASA Technical Reports Server (NTRS)

Chaban, Galina M.; Kwak, D. (Technical Monitor)

2001-01-01

Role of vibrational spectroscopy in solving problems related to astrobiology will be discussed. Vibrational (infrared) spectroscopy is a very sensitive tool for identifying molecules. Theoretical approach used in this work is based on direct computation of anharmonic vibrational frequencies and intensities from electronic structure codes. One of the applications of this computational technique is possible identification of biological building blocks (amino acids, small peptides, DNA bases) in the interstellar medium (ISM). Identifying small biological molecules in the ISM is very important from the point of view of origin of life. Hybrid (quantum mechanics/molecular mechanics) theoretical techniques will be discussed that may allow to obtain accurate vibrational spectra of biomolecular building blocks and to create a database of spectroscopic signatures that can assist observations of these molecules in space. Another application of the direct computational spectroscopy technique is to help to design and analyze experimental observations of ice surfaces of one of the Jupiter's moons, Europa, that possibly contains hydrated salts. The presence of hydrated salts on the surface can be an indication of a subsurface ocean and the possible existence of life forms inhabiting such an ocean.
Antibiotics and specialized metabolites from the human microbiota.

PubMed

Mousa, Walaa K; Athar, Bilal; Merwin, Nishanth J; Magarvey, Nathan A

2017-11-15

Covering: 2000 to 2017Decades of research on human microbiota have revealed much of their taxonomic diversity and established their direct link to health and disease. However, the breadth of bioactive natural products secreted by our microbial partners remains unknown. Of particular interest are antibiotics produced by our microbiota to ward off invasive pathogens. Members of the human microbiota exclusively produce evolved small molecules with selective antimicrobial activity against human pathogens. Herein, we expand upon the current knowledge concerning antibiotics derived from human microbiota and their distribution across body sites. We analyze, using our in-house chem-bioinformatic tools and natural products database, the encoded antibiotic potential of the human microbiome. This compilation of information may create a foundation for the continued exploration of this intriguing resource of chemical diversity and expose challenges and future perspectives to accelerate the discovery rate of small molecules from the human microbiota.
pKa prediction of monoprotic small molecules the SMARTS way.

PubMed

Lee, Adam C; Yu, Jing-Yu; Crippen, Gordon M

2008-10-01

Realizing favorable absorption, distribution, metabolism, elimination, and toxicity profiles is a necessity due to the high attrition rate of lead compounds in drug development today. The ability to accurately predict bioavailability can help save time and money during the screening and optimization processes. As several robust programs already exist for predicting logP, we have turned our attention to the fast and robust prediction of pK(a) for small molecules. Using curated data from the Beilstein Database and Lange's Handbook of Chemistry, we have created a decision tree based on a novel set of SMARTS strings that can accurately predict the pK(a) for monoprotic compounds with R(2) of 0.94 and root mean squared error of 0.68. Leave-some-out (10%) cross-validation achieved Q(2) of 0.91 and root mean squared error of 0.80.
Small molecule inhibitors of mesotrypsin from a structure-based docking screen

DOE PAGES

Kayode, Olumide; Huang, Zunnan; Soares, Alexei S.; ...

2017-05-02

PRSS3/mesotrypsin is an atypical isoform of trypsin, the upregulation of which has been implicated in promoting tumor progression. To date there are no mesotrypsin-selective pharmacological inhibitors which could serve as tools for deciphering the pathological role of this enzyme, and could potentially form the basis for novel therapeutic strategies targeting mesotrypsin. A virtual screen of the Natural Product Database (NPD) and Food and Drug Administration (FDA) approved Drug Database was conducted by high-throughput molecular docking utilizing crystal structures of mesotrypsin. Twelve high-scoring compounds were selected for testing based on lowest free energy docking scores, interaction with key mesotrypsin active sitemore » residues, and commercial availability. Diminazene (C1D22956468), along with two similar compounds presenting the bis-benzamidine substructure, was validated as a competitive inhibitor of mesotrypsin and other human trypsin isoforms. Diminazene is the most potent small molecule inhibitor of mesotrypsin reported to date with an inhibitory constant (K i) of 3.6±0.3 pM. Diminazene was subsequently co-crystalized with mesotrypsin and the crystal structure was solved and refined to 1.25 Å resolution. This high resolution crystal structure can now offer a foundation for structure-guided efforts to develop novel and potentially more selective mesotrypsin inhibitors based on similar molecular substructures.« less
Small molecule inhibitors of mesotrypsin from a structure-based docking screen

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kayode, Olumide; Huang, Zunnan; Soares, Alexei S.

PRSS3/mesotrypsin is an atypical isoform of trypsin, the upregulation of which has been implicated in promoting tumor progression. To date there are no mesotrypsin-selective pharmacological inhibitors which could serve as tools for deciphering the pathological role of this enzyme, and could potentially form the basis for novel therapeutic strategies targeting mesotrypsin. A virtual screen of the Natural Product Database (NPD) and Food and Drug Administration (FDA) approved Drug Database was conducted by high-throughput molecular docking utilizing crystal structures of mesotrypsin. Twelve high-scoring compounds were selected for testing based on lowest free energy docking scores, interaction with key mesotrypsin active sitemore » residues, and commercial availability. Diminazene (C1D22956468), along with two similar compounds presenting the bis-benzamidine substructure, was validated as a competitive inhibitor of mesotrypsin and other human trypsin isoforms. Diminazene is the most potent small molecule inhibitor of mesotrypsin reported to date with an inhibitory constant (K i) of 3.6±0.3 pM. Diminazene was subsequently co-crystalized with mesotrypsin and the crystal structure was solved and refined to 1.25 Å resolution. This high resolution crystal structure can now offer a foundation for structure-guided efforts to develop novel and potentially more selective mesotrypsin inhibitors based on similar molecular substructures.« less
R.E.DD.B.: A database for RESP and ESP atomic charges, and force field libraries

PubMed Central

Dupradeau, François-Yves; Cézard, Christine; Lelong, Rodolphe; Stanislawiak, Élodie; Pêcher, Julien; Delepine, Jean Charles; Cieplak, Piotr

2008-01-01

The web-based RESP ESP charge DataBase (R.E.DD.B., http://q4md-forcefieldtools.org/REDDB) is a free and new source of RESP and ESP atomic charge values and force field libraries for model systems and/or small molecules. R.E.DD.B. stores highly effective and reproducible charge values and molecular structures in the Tripos mol2 file format, information about the charge derivation procedure, scripts to integrate the charges and molecular topology in the most common molecular dynamics packages. Moreover, R.E.DD.B. allows users to freely store and distribute RESP or ESP charges and force field libraries to the scientific community, via a web interface. The first version of R.E.DD.B., released in January 2006, contains force field libraries for molecules as well as molecular fragments for standard residues and their analogs (amino acids, monosaccharides, nucleotides and ligands), hence covering a vast area of relevant biological applications. PMID:17962302
Molecule database framework: a framework for creating database applications with chemical structure search capability

PubMed Central

2013-01-01

Background Research in organic chemistry generates samples of novel chemicals together with their properties and other related data. The involved scientists must be able to store this data and search it by chemical structure. There are commercial solutions for common needs like chemical registration systems or electronic lab notebooks. However for specific requirements of in-house databases and processes no such solutions exist. Another issue is that commercial solutions have the risk of vendor lock-in and may require an expensive license of a proprietary relational database management system. To speed up and simplify the development for applications that require chemical structure search capabilities, I have developed Molecule Database Framework. The framework abstracts the storing and searching of chemical structures into method calls. Therefore software developers do not require extensive knowledge about chemistry and the underlying database cartridge. This decreases application development time. Results Molecule Database Framework is written in Java and I created it by integrating existing free and open-source tools and frameworks. The core functionality includes: • Support for multi-component compounds (mixtures) • Import and export of SD-files • Optional security (authorization) For chemical structure searching Molecule Database Framework leverages the capabilities of the Bingo Cartridge for PostgreSQL and provides type-safe searching, caching, transactions and optional method level security. Molecule Database Framework supports multi-component chemical compounds (mixtures). Furthermore the design of entity classes and the reasoning behind it are explained. By means of a simple web application I describe how the framework could be used. I then benchmarked this example application to create some basic performance expectations for chemical structure searches and import and export of SD-files. Conclusions By using a simple web application it was shown that Molecule Database Framework successfully abstracts chemical structure searches and SD-File import and export to simple method calls. The framework offers good search performance on a standard laptop without any database tuning. This is also due to the fact that chemical structure searches are paged and cached. Molecule Database Framework is available for download on the projects web page on bitbucket: https://bitbucket.org/kienerj/moleculedatabaseframework. PMID:24325762
Molecule database framework: a framework for creating database applications with chemical structure search capability.

PubMed

Kiener, Joos

2013-12-11

Research in organic chemistry generates samples of novel chemicals together with their properties and other related data. The involved scientists must be able to store this data and search it by chemical structure. There are commercial solutions for common needs like chemical registration systems or electronic lab notebooks. However for specific requirements of in-house databases and processes no such solutions exist. Another issue is that commercial solutions have the risk of vendor lock-in and may require an expensive license of a proprietary relational database management system. To speed up and simplify the development for applications that require chemical structure search capabilities, I have developed Molecule Database Framework. The framework abstracts the storing and searching of chemical structures into method calls. Therefore software developers do not require extensive knowledge about chemistry and the underlying database cartridge. This decreases application development time. Molecule Database Framework is written in Java and I created it by integrating existing free and open-source tools and frameworks. The core functionality includes:•Support for multi-component compounds (mixtures)•Import and export of SD-files•Optional security (authorization)For chemical structure searching Molecule Database Framework leverages the capabilities of the Bingo Cartridge for PostgreSQL and provides type-safe searching, caching, transactions and optional method level security. Molecule Database Framework supports multi-component chemical compounds (mixtures).Furthermore the design of entity classes and the reasoning behind it are explained. By means of a simple web application I describe how the framework could be used. I then benchmarked this example application to create some basic performance expectations for chemical structure searches and import and export of SD-files. By using a simple web application it was shown that Molecule Database Framework successfully abstracts chemical structure searches and SD-File import and export to simple method calls. The framework offers good search performance on a standard laptop without any database tuning. This is also due to the fact that chemical structure searches are paged and cached. Molecule Database Framework is available for download on the projects web page on bitbucket: https://bitbucket.org/kienerj/moleculedatabaseframework.
Large-scale annotation of small-molecule libraries using public databases.

PubMed

Zhou, Yingyao; Zhou, Bin; Chen, Kaisheng; Yan, S Frank; King, Frederick J; Jiang, Shumei; Winzeler, Elizabeth A

2007-01-01

While many large publicly accessible databases provide excellent annotation for biological macromolecules, the same is not true for small chemical compounds. Commercial data sources also fail to encompass an annotation interface for large numbers of compounds and tend to be cost prohibitive to be widely available to biomedical researchers. Therefore, using annotation information for the selection of lead compounds from a modern day high-throughput screening (HTS) campaign presently occurs only under a very limited scale. The recent rapid expansion of the NIH PubChem database provides an opportunity to link existing biological databases with compound catalogs and provides relevant information that potentially could improve the information garnered from large-scale screening efforts. Using the 2.5 million compound collection at the Genomics Institute of the Novartis Research Foundation (GNF) as a model, we determined that approximately 4% of the library contained compounds with potential annotation in such databases as PubChem and the World Drug Index (WDI) as well as related databases such as the Kyoto Encyclopedia of Genes and Genomes (KEGG) and ChemIDplus. Furthermore, the exact structure match analysis showed 32% of GNF compounds can be linked to third party databases via PubChem. We also showed annotations such as MeSH (medical subject headings) terms can be applied to in-house HTS databases in identifying signature biological inhibition profiles of interest as well as expediting the assay validation process. The automated annotation of thousands of screening hits in batch is becoming feasible and has the potential to play an essential role in the hit-to-lead decision making process.
The active site of hen egg-white lysozyme: flexibility and chemical bonding

DOE Office of Scientific and Technical Information (OSTI.GOV)

Held, Jeanette, E-mail: jeanette.netzel@uni-bayreuth.de; Smaalen, Sander van

Chemical bonding at the active site of lysozyme is analyzed on the basis of a multipole model employing transferable multipole parameters from a database. Large B factors at low temperatures reflect frozen-in disorder, but therefore prevent a meaningful free refinement of multipole parameters. Chemical bonding at the active site of hen egg-white lysozyme (HEWL) is analyzed on the basis of Bader’s quantum theory of atoms in molecules [QTAIM; Bader (1994 ▶), Atoms in Molecules: A Quantum Theory. Oxford University Press] applied to electron-density maps derived from a multipole model. The observation is made that the atomic displacement parameters (ADPs) ofmore » HEWL at a temperature of 100 K are larger than ADPs in crystals of small biological molecules at 298 K. This feature shows that the ADPs in the cold crystals of HEWL reflect frozen-in disorder rather than thermal vibrations of the atoms. Directly generalizing the results of multipole studies on small-molecule crystals, the important consequence for electron-density analysis of protein crystals is that multipole parameters cannot be independently varied in a meaningful way in structure refinements. Instead, a multipole model for HEWL has been developed by refinement of atomic coordinates and ADPs against the X-ray diffraction data of Wang and coworkers [Wang et al. (2007), Acta Cryst. D63, 1254–1268], while multipole parameters were fixed to the values for transferable multipole parameters from the ELMAM2 database [Domagala et al. (2012), Acta Cryst. A68, 337–351] . Static and dynamic electron densities based on this multipole model are presented. Analysis of their topological properties according to the QTAIM shows that the covalent bonds possess similar properties to the covalent bonds of small molecules. Hydrogen bonds of intermediate strength are identified for the Glu35 and Asp52 residues, which are considered to be essential parts of the active site of HEWL. Furthermore, a series of weak C—H⋯O hydrogen bonds are identified by means of the existence of bond critical points (BCPs) in the multipole electron density. It is proposed that these weak interactions might be important for defining the tertiary structure and activity of HEWL. The deprotonated state of Glu35 prevents a distinction between the Phillips and Koshland mechanisms.« less
RNA targeting by small molecule alkaloids: Studies on the binding of berberine and palmatine to polyribonucleotides and comparison to ethidium

NASA Astrophysics Data System (ADS)

Islam, Md. Maidul; Suresh Kumar, Gopinatha

2008-03-01

The binding affinity, energetics and conformational aspects of the interaction of isoquinoline alkaloids berberine and palmatine to four single stranded polyribonucleotides polyguanylic acid [poly(G)], polyinosinic acid [poly(I)], polycytidylic acid [poly(C)] and polyuridylic acid [poly(U)] were studied by absorption, fluorescence, isothermal titration calorimetry and circular dichroism spectroscopy and compared with ethidium. Berberine, palmatine and ethidium binds strongly with poly(G) and poly(I) with affinity in the order 10 5 M -1 while their binding to poly(C) and poly(U) were very weak or practically nil. The same conclusions have also emerged from isothermal titration calorimetric studies. The binding of all the three compounds to poly(C) and poly(I) was exothermic and favored by both negative enthalpy change and positive entropy change. Conformational change in the polymer associated with the binding was observed in poly(I) with all the three molecules and poly(U) with ethidium but not in poly(G) and poly(C) revealing differences in the orientation of the bound molecules in the hitherto different helical organization of these polymers. These fundamental results may be useful and serve as database for the development of futuristic RNA based small molecule therapeutics.
Database resources of the National Center for Biotechnology Information

PubMed Central

2015-01-01

The National Center for Biotechnology Information (NCBI) provides a large suite of online resources for biological information and data, including the GenBank® nucleic acid sequence database and the PubMed database of citations and abstracts for published life science journals. Additional NCBI resources focus on literature (Bookshelf, PubMed Central (PMC) and PubReader); medical genetics (ClinVar, dbMHC, the Genetic Testing Registry, HIV-1/Human Protein Interaction Database and MedGen); genes and genomics (BioProject, BioSample, dbSNP, dbVar, Epigenomics, Gene, Gene Expression Omnibus (GEO), Genome, HomoloGene, the Map Viewer, Nucleotide, PopSet, Probe, RefSeq, Sequence Read Archive, the Taxonomy Browser, Trace Archive and UniGene); and proteins and chemicals (Biosystems, COBALT, the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART), the Molecular Modeling Database (MMDB), Protein Clusters, Protein and the PubChem suite of small molecule databases). The Entrez system provides search and retrieval operations for many of these databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of these resources can be accessed through the NCBI home page at http://www.ncbi.nlm.nih.gov. PMID:25398906
Database resources of the National Center for Biotechnology Information

PubMed Central

2016-01-01

The National Center for Biotechnology Information (NCBI) provides a large suite of online resources for biological information and data, including the GenBank® nucleic acid sequence database and the PubMed database of citations and abstracts for published life science journals. Additional NCBI resources focus on literature (PubMed Central (PMC), Bookshelf and PubReader), health (ClinVar, dbGaP, dbMHC, the Genetic Testing Registry, HIV-1/Human Protein Interaction Database and MedGen), genomes (BioProject, Assembly, Genome, BioSample, dbSNP, dbVar, Epigenomics, the Map Viewer, Nucleotide, Probe, RefSeq, Sequence Read Archive, the Taxonomy Browser and the Trace Archive), genes (Gene, Gene Expression Omnibus (GEO), HomoloGene, PopSet and UniGene), proteins (Protein, the Conserved Domain Database (CDD), COBALT, Conserved Domain Architecture Retrieval Tool (CDART), the Molecular Modeling Database (MMDB) and Protein Clusters) and chemicals (Biosystems and the PubChem suite of small molecule databases). The Entrez system provides search and retrieval operations for most of these databases. Augmenting many of the web applications are custom implementations of the BLAST program optimized to search specialized datasets. All of these resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov. PMID:26615191
Database resources of the National Center for Biotechnology Information

PubMed Central

Wheeler, David L.; Barrett, Tanya; Benson, Dennis A.; Bryant, Stephen H.; Canese, Kathi; Chetvernin, Vyacheslav; Church, Deanna M.; DiCuccio, Michael; Edgar, Ron; Federhen, Scott; Geer, Lewis Y.; Helmberg, Wolfgang; Kapustin, Yuri; Kenton, David L.; Khovayko, Oleg; Lipman, David J.; Madden, Thomas L.; Maglott, Donna R.; Ostell, James; Pruitt, Kim D.; Schuler, Gregory D.; Schriml, Lynn M.; Sequeira, Edwin; Sherry, Stephen T.; Sirotkin, Karl; Souvorov, Alexandre; Starchenko, Grigory; Suzek, Tugba O.; Tatusov, Roman; Tatusova, Tatiana A.; Wagner, Lukas; Yaschenko, Eugene

2006-01-01

In addition to maintaining the GenBank(R) nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through NCBI's Web site. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR, OrfFinder, Spidey, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups, Retroviral Genotyping Tools, HIV-1, Human Protein Interaction Database, SAGEmap, Gene Expression Omnibus, Entrez Probe, GENSAT, Online Mendelian Inheritance in Man, Online Mendelian Inheritance in Animals, the Molecular Modeling Database, the Conserved Domain Database, the Conserved Domain Architecture Retrieval Tool and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized datasets. All of the resources can be accessed through the NCBI home page at: . PMID:16381840
Database resources of the National Center for Biotechnology Information.

PubMed

Sayers, Eric W; Barrett, Tanya; Benson, Dennis A; Bolton, Evan; Bryant, Stephen H; Canese, Kathi; Chetvernin, Vyacheslav; Church, Deanna M; Dicuccio, Michael; Federhen, Scott; Feolo, Michael; Fingerman, Ian M; Geer, Lewis Y; Helmberg, Wolfgang; Kapustin, Yuri; Krasnov, Sergey; Landsman, David; Lipman, David J; Lu, Zhiyong; Madden, Thomas L; Madej, Tom; Maglott, Donna R; Marchler-Bauer, Aron; Miller, Vadim; Karsch-Mizrachi, Ilene; Ostell, James; Panchenko, Anna; Phan, Lon; Pruitt, Kim D; Schuler, Gregory D; Sequeira, Edwin; Sherry, Stephen T; Shumway, Martin; Sirotkin, Karl; Slotta, Douglas; Souvorov, Alexandre; Starchenko, Grigory; Tatusova, Tatiana A; Wagner, Lukas; Wang, Yanli; Wilbur, W John; Yaschenko, Eugene; Ye, Jian

2012-01-01

In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI Website. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central (PMC), Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Primer-BLAST, COBALT, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, dbVar, Epigenomics, Genome and related tools, the Map Viewer, Model Maker, Evidence Viewer, Trace Archive, Sequence Read Archive, BioProject, BioSample, Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus (GEO), Probe, Online Mendelian Inheritance in Animals (OMIA), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART), Biosystems, Protein Clusters and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of these resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov.
Facilitating quality control for spectra assignments of small organic molecules: nmrshiftdb2--a free in-house NMR database with integrated LIMS for academic service laboratories.

PubMed

Kuhn, Stefan; Schlörer, Nils E

2015-08-01

nmrshiftdb2 supports with its laboratory information management system the integration of an electronic lab administration and management into academic NMR facilities. Also, it offers the setup of a local database, while full access to nmrshiftdb2's World Wide Web database is granted. This freely available system allows on the one hand the submission of orders for measurement, transfers recorded data automatically or manually, and enables download of spectra via web interface, as well as the integrated access to prediction, search, and assignment tools of the NMR database for lab users. On the other hand, for the staff and lab administration, flow of all orders can be supervised; administrative tools also include user and hardware management, a statistic functionality for accounting purposes, and a 'QuickCheck' function for assignment control, to facilitate quality control of assignments submitted to the (local) database. Laboratory information management system and database are based on a web interface as front end and are therefore independent of the operating system in use. Copyright © 2015 John Wiley & Sons, Ltd.
Database resources of the National Center for Biotechnology Information

PubMed Central

Acland, Abigail; Agarwala, Richa; Barrett, Tanya; Beck, Jeff; Benson, Dennis A.; Bollin, Colleen; Bolton, Evan; Bryant, Stephen H.; Canese, Kathi; Church, Deanna M.; Clark, Karen; DiCuccio, Michael; Dondoshansky, Ilya; Federhen, Scott; Feolo, Michael; Geer, Lewis Y.; Gorelenkov, Viatcheslav; Hoeppner, Marilu; Johnson, Mark; Kelly, Christopher; Khotomlianski, Viatcheslav; Kimchi, Avi; Kimelman, Michael; Kitts, Paul; Krasnov, Sergey; Kuznetsov, Anatoliy; Landsman, David; Lipman, David J.; Lu, Zhiyong; Madden, Thomas L.; Madej, Tom; Maglott, Donna R.; Marchler-Bauer, Aron; Karsch-Mizrachi, Ilene; Murphy, Terence; Ostell, James; O'Sullivan, Christopher; Panchenko, Anna; Phan, Lon; Pruitt, Don Preussm Kim D.; Rubinstein, Wendy; Sayers, Eric W.; Schneider, Valerie; Schuler, Gregory D.; Sequeira, Edwin; Sherry, Stephen T.; Shumway, Martin; Sirotkin, Karl; Siyan, Karanjit; Slotta, Douglas; Soboleva, Alexandra; Soussov, Vladimir; Starchenko, Grigory; Tatusova, Tatiana A.; Trawick, Bart W.; Vakatov, Denis; Wang, Yanli; Ward, Minghong; John Wilbur, W.; Yaschenko, Eugene; Zbicz, Kerry

2014-01-01

In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI, http://www.ncbi.nlm.nih.gov) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI Web site. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central, PubReader, Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link, Primer-BLAST, COBALT, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, dbVar, Epigenomics, the Genetic Testing Registry, Genome and related tools, the Map Viewer, Trace Archive, Sequence Read Archive, BioProject, BioSample, ClinVar, MedGen, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus, Probe, Online Mendelian Inheritance in Animals, the Molecular Modeling Database, the Conserved Domain Database, the Conserved Domain Architecture Retrieval Tool, Biosystems, Protein Clusters and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All these resources can be accessed through the NCBI home page. PMID:24259429
5SRNAdb: an information resource for 5S ribosomal RNAs.

PubMed

Szymanski, Maciej; Zielezinski, Andrzej; Barciszewski, Jan; Erdmann, Volker A; Karlowski, Wojciech M

2016-01-04

Ribosomal 5S RNA (5S rRNA) is the ubiquitous RNA component found in the large subunit of ribosomes in all known organisms. Due to its small size, abundance and evolutionary conservation 5S rRNA for many years now is used as a model molecule in studies on RNA structure, RNA-protein interactions and molecular phylogeny. 5SRNAdb (http://combio.pl/5srnadb/) is the first database that provides a high quality reference set of ribosomal 5S RNAs (5S rRNA) across three domains of life. Here, we give an overview of new developments in the database and associated web tools since 2002, including updates to database content, curation processes and user web interfaces. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
sRNAdb: A small non-coding RNA database for gram-positive bacteria

PubMed Central

2012-01-01

Background The class of small non-coding RNA molecules (sRNA) regulates gene expression by different mechanisms and enables bacteria to mount a physiological response due to adaptation to the environment or infection. Over the last decades the number of sRNAs has been increasing rapidly. Several databases like Rfam or fRNAdb were extended to include sRNAs as a class of its own. Furthermore new specialized databases like sRNAMap (gram-negative bacteria only) and sRNATarBase (target prediction) were established. To the best of the authors’ knowledge no database focusing on sRNAs from gram-positive bacteria is publicly available so far. Description In order to understand sRNA’s functional and phylogenetic relationships we have developed sRNAdb and provide tools for data analysis and visualization. The data compiled in our database is assembled from experiments as well as from bioinformatics analyses. The software enables comparison and visualization of gene loci surrounding the sRNAs of interest. To accomplish this, we use a client–server based approach. Offline versions of the database including analyses and visualization tools can easily be installed locally on the user’s computer. This feature facilitates customized local addition of unpublished sRNA candidates and related information such as promoters or terminators using tab-delimited files. Conclusion sRNAdb allows a user-friendly and comprehensive comparative analysis of sRNAs from available sequenced gram-positive prokaryotic replicons. Offline versions including analysis and visualization tools facilitate complex user specific bioinformatics analyses. PMID:22883983
Elasticity and Stability of Clathrate Hydrate: Role of Guest Molecule Motions.

PubMed

Jia, Jihui; Liang, Yunfeng; Tsuji, Takeshi; Murata, Sumihiko; Matsuoka, Toshifumi

2017-05-02

Molecular dynamic simulations were performed to determine the elastic constants of carbon dioxide (CO 2 ) and methane (CH 4 ) hydrates at one hundred pressure-temperature data points, respectively. The conditions represent marine sediments and permafrost zones where gas hydrates occur. The shear modulus and Young's modulus of the CO 2 hydrate increase anomalously with increasing temperature, whereas those of the CH 4 hydrate decrease regularly with increase in temperature. We ascribe this anomaly to the kinetic behavior of the linear CO 2 molecule, especially those in the small cages. The cavity space of the cage limits free rotational motion of the CO 2 molecule at low temperature. With increase in temperature, the CO 2 molecule can rotate easily, and enhance the stability and rigidity of the CO 2 hydrate. Our work provides a key database for the elastic properties of gas hydrates, and molecular insights into stability changes of CO 2 hydrate from high temperature of ~5 °C to low decomposition temperature of ~-150 °C.

Three dimensional model of severe acute respiratory syndrome coronavirus helicase ATPase catalytic domain and molecular design of severe acute respiratory syndrome coronavirus helicase inhibitors

NASA Astrophysics Data System (ADS)

Hoffmann, Marcin; Eitner, Krystian; von Grotthuss, Marcin; Rychlewski, Leszek; Banachowicz, Ewa; Grabarkiewicz, Tomasz; Szkoda, Tomasz; Kolinski, Andrzej

2006-05-01

The modeling of the severe acute respiratory syndrome coronavirus helicase ATPase catalytic domain was performed using the protein structure prediction Meta Server and the 3D Jury method for model selection, which resulted in the identification of 1JPR, 1UAA and 1W36 PDB structures as suitable templates for creating a full atom 3D model. This model was further utilized to design small molecules that are expected to block an ATPase catalytic pocket thus inhibit the enzymatic activity. Binding sites for various functional groups were identified in a series of molecular dynamics calculation. Their positions in the catalytic pocket were used as constraints in the Cambridge structural database search for molecules having the pharmacophores that interacted most strongly with the enzyme in a desired position. The subsequent MD simulations followed by calculations of binding energies of the designed molecules were compared to ATP identifying the most successful candidates, for likely inhibitors—molecules possessing two phosphonic acid moieties at distal ends of the molecule.
Discovery and study of novel protein tyrosine phosphatase 1B inhibitors

NASA Astrophysics Data System (ADS)

Zhang, Qian; Chen, Xi; Feng, Changgen

2017-10-01

Protein tyrosine phosphatase 1B (PTP1B) is considered to be a target for therapy of type II diabetes and obesity. So it is of great significance to take advantage of a computer aided drug design protocol involving the structured-based virtual screening with docking simulations for fast searching small molecule PTP1B inhibitors. Based on optimized complex structure of PTP1B bound with specific inhibitor of IX1, structured-based virtual screening against a library of natural products containing 35308 molecules, which was constructed based on Traditional Chinese Medicine database@ Taiwan (TCM database@ Taiwan), was conducted to determine the occurrence of PTP1B inhibitors using the Lubbock module and CDOCKER module from Discovery Studio 3.1 software package. The results were further filtered by predictive ADME simulation and predictive toxic simulation. As a result, 2 good drug-like molecules, namely para-benzoquinone compound 1 and Clavepictine analogue 2 were identified ultimately with the dock score of original inhibitor (IX1) and the receptor as a threshold. Binding model analyses revealed that these two candidate compounds have good interactions with PTP1B. The PTP1B inhibitory activity of compound 2 hasn't been reported before. The optimized compound 2 has higher scores and deserves further study.
Database resources of the National Center for Biotechnology Information

PubMed Central

Wheeler, David L.; Barrett, Tanya; Benson, Dennis A.; Bryant, Stephen H.; Canese, Kathi; Chetvernin, Vyacheslav; Church, Deanna M.; DiCuccio, Michael; Edgar, Ron; Federhen, Scott; Feolo, Michael; Geer, Lewis Y.; Helmberg, Wolfgang; Kapustin, Yuri; Khovayko, Oleg; Landsman, David; Lipman, David J.; Madden, Thomas L.; Maglott, Donna R.; Miller, Vadim; Ostell, James; Pruitt, Kim D.; Schuler, Gregory D.; Shumway, Martin; Sequeira, Edwin; Sherry, Steven T.; Sirotkin, Karl; Souvorov, Alexandre; Starchenko, Grigory; Tatusov, Roman L.; Tatusova, Tatiana A.; Wagner, Lukas; Yaschenko, Eugene

2008-01-01

In addition to maintaining the GenBank(R) nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data available through NCBI's web site. NCBI resources include Entrez, the Entrez Programming Utilities, My NCBI, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link, Electronic PCR, OrfFinder, Spidey, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genome, Genome Project and related tools, the Trace, Assembly, and Short Read Archives, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups, Influenza Viral Resources, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus, Entrez Probe, GENSAT, Database of Genotype and Phenotype, Online Mendelian Inheritance in Man, Online Mendelian Inheritance in Animals, the Molecular Modeling Database, the Conserved Domain Database, the Conserved Domain Architecture Retrieval Tool and the PubChem suite of small molecule databases. Augmenting the web applications are custom implementations of the BLAST program optimized to search specialized data sets. These resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov. PMID:18045790
Identification and Biochemical Characterization of Small-Molecule Inhibitors of Clostridium Botulinum Neurotoxin Serotype A

DTIC Science & Technology

2009-08-01

the BoNT/A protease activity were selected. Database search queries of the best candidate hit [7-((4-nitro- anilino)(phenyl) methyl )-8-quinolinol (NSC...therapeutic challenges. Trends Mol. Med. 9:291–299. 18. Gershon, H., and R. Parmegiani. 1963. Antimicrobial activity of 8-quinoli- nol, its salts with salicylic ...Parmegiani. 1962. Antimicrobial activity of 8-quinoli- nols, salicylic acids, hydroxynaphthoic acids, and salts of selected quinolinols with selected
Small-molecule inhibitors of hepatitis C virus (HCV) non-structural protein 5A (NS5A): a patent review (2010-2015).

PubMed

Ivanenkov, Yan A; Aladinskiy, Vladimir A; Bushkov, Nikolay A; Ayginin, Andrey A; Majouga, Alexander G; Ivachtchenko, Alexandre V

2017-04-01

Non-structural 5A (NS5A) protein has achieved a considerable attention as an attractive target for the treatment of hepatitis C (HCV). A number of novel NS5A inhibitors have been reported to date. Several drugs having favorable ADME properties and mild side effects were launched into the pharmaceutical market. For instance, daclatasvir was launched in 2014, elbasvir is currently undergoing registration, ledipasvir was launched in 2014 as a fixed-dose combination with sofosbuvir (NS5B inhibitor). Areas covered: Thomson integrity database and SciFinder database were used as a valuable source to collect the patents on small-molecule NS5A inhibitors. All the structures were ranked by the date of priority. Patent holder and antiviral activity for each scaffold claimed were summarized and presented in a convenient manner. A particular focus was placed on the best-in-class bis-pyrrolidine-containing NS5A inhibitors. Expert opinion: Several first generation NS5A inhibitors have recently progressed into advanced clinical trials and showed superior efficacy in reducing viral load in infected subjects. Therapy schemes of using these agents in combination with other established antiviral drugs with complementary mechanisms of action can address the emergence of resistance and poor therapeutic outcome frequently attributed to antiviral drugs.
PoSSuM v.2.0: data update and a new function for investigating ligand analogs and target proteins of small-molecule drugs.

PubMed

Ito, Jun-ichi; Ikeda, Kazuyoshi; Yamada, Kazunori; Mizuguchi, Kenji; Tomii, Kentaro

2015-01-01

PoSSuM (http://possum.cbrc.jp/PoSSuM/) is a database for detecting similar small-molecule binding sites on proteins. Since its initial release in 2011, PoSSuM has grown to provide information related to 49 million pairs of similar binding sites discovered among 5.5 million known and putative binding sites. This enlargement of the database is expected to enhance opportunities for biological and pharmaceutical applications, such as predictions of new functions and drug discovery. In this release, we have provided a new service named PoSSuM drug search (PoSSuMds) at http://possum.cbrc.jp/PoSSuM/drug_search/, in which we selected 194 approved drug compounds retrieved from ChEMBL, and detected their known binding pockets and pockets that are similar to them. Users can access and download all of the search results via a new web interface, which is useful for finding ligand analogs as well as potential target proteins. Furthermore, PoSSuMds enables users to explore the binding pocket universe within PoSSuM. Additionally, we have improved the web interface with new functions, including sortable tables and a viewer for visualizing and downloading superimposed pockets. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Analyzing gene expression profiles in dilated cardiomyopathy via bioinformatics methods.

PubMed

Wang, Liming; Zhu, L; Luan, R; Wang, L; Fu, J; Wang, X; Sui, L

2016-10-10

Dilated cardiomyopathy (DCM) is characterized by ventricular dilatation, and it is a common cause of heart failure and cardiac transplantation. This study aimed to explore potential DCM-related genes and their underlying regulatory mechanism using methods of bioinformatics. The gene expression profiles of GSE3586 were downloaded from Gene Expression Omnibus database, including 15 normal samples and 13 DCM samples. The differentially expressed genes (DEGs) were identified between normal and DCM samples using Limma package in R language. Pathway enrichment analysis of DEGs was then performed. Meanwhile, the potential transcription factors (TFs) and microRNAs (miRNAs) of these DEGs were predicted based on their binding sequences. In addition, DEGs were mapped to the cMap database to find the potential small molecule drugs. A total of 4777 genes were identified as DEGs by comparing gene expression profiles between DCM and control samples. DEGs were significantly enriched in 26 pathways, such as lymphocyte TarBase pathway and androgen receptor signaling pathway. Furthermore, potential TFs (SP1, LEF1, and NFAT) were identified, as well as potential miRNAs (miR-9, miR-200 family, and miR-30 family). Additionally, small molecules like isoflupredone and trihexyphenidyl were found to be potential therapeutic drugs for DCM. The identified DEGs (PRSS12 and FOXG1), potential TFs, as well as potential miRNAs, might be involved in DCM.
Analyzing gene expression profiles in dilated cardiomyopathy via bioinformatics methods

PubMed Central

Wang, Liming; Zhu, L.; Luan, R.; Wang, L.; Fu, J.; Wang, X.; Sui, L.

2016-01-01

Dilated cardiomyopathy (DCM) is characterized by ventricular dilatation, and it is a common cause of heart failure and cardiac transplantation. This study aimed to explore potential DCM-related genes and their underlying regulatory mechanism using methods of bioinformatics. The gene expression profiles of GSE3586 were downloaded from Gene Expression Omnibus database, including 15 normal samples and 13 DCM samples. The differentially expressed genes (DEGs) were identified between normal and DCM samples using Limma package in R language. Pathway enrichment analysis of DEGs was then performed. Meanwhile, the potential transcription factors (TFs) and microRNAs (miRNAs) of these DEGs were predicted based on their binding sequences. In addition, DEGs were mapped to the cMap database to find the potential small molecule drugs. A total of 4777 genes were identified as DEGs by comparing gene expression profiles between DCM and control samples. DEGs were significantly enriched in 26 pathways, such as lymphocyte TarBase pathway and androgen receptor signaling pathway. Furthermore, potential TFs (SP1, LEF1, and NFAT) were identified, as well as potential miRNAs (miR-9, miR-200 family, and miR-30 family). Additionally, small molecules like isoflupredone and trihexyphenidyl were found to be potential therapeutic drugs for DCM. The identified DEGs (PRSS12 and FOXG1), potential TFs, as well as potential miRNAs, might be involved in DCM. PMID:27737314
Identification of antipsychotic drug fluspirilene as a potential p53-MDM2 inhibitor: a combined computational and experimental study

NASA Astrophysics Data System (ADS)

Patil, Sachin P.; Pacitti, Michael F.; Gilroy, Kevin S.; Ruggiero, John C.; Griffin, Jonathan D.; Butera, Joseph J.; Notarfrancesco, Joseph M.; Tran, Shawn; Stoddart, John W.

2015-02-01

The inhibition of tumor suppressor p53 protein due to its direct interaction with oncogenic murine double minute 2 (MDM2) protein, plays a central role in almost 50 % of all human tumor cells. Therefore, pharmacological inhibition of the p53-binding pocket on MDM2, leading to p53 activation, presents an important therapeutic target against these cancers expressing wild-type p53. In this context, the present study utilized an integrated virtual and experimental screening approach to screen a database of approved drugs for potential p53-MDM2 interaction inhibitors. Specifically, using an ensemble rigid-receptor docking approach with four MDM2 protein crystal structures, six drug molecules were identified as possible p53-MDM2 inhibitors. These drug molecules were then subjected to further molecular modeling investigation through flexible-receptor docking followed by Prime/MM-GBSA binding energy analysis. These studies identified fluspirilene, an approved antipsychotic drug, as a top hit with MDM2 binding mode and energy similar to that of a native MDM2 crystal ligand. The molecular dynamics simulations suggested stable binding of fluspirilene to the p53-binding pocket on MDM2 protein. The experimental testing of fluspirilene showed significant growth inhibition of human colon tumor cells in a p53-dependent manner. Fluspirilene also inhibited growth of several other human tumor cell lines in the NCI60 cell line panel. Taken together, these computational and experimental data suggest a potentially novel role of fluspirilene in inhibiting the p53-MDM2 interaction. It is noteworthy here that fluspirilene has a long history of safe human use, thus presenting immediate clinical potential as a cancer therapeutic. Furthermore, fluspirilene could also serve as a structurally-novel lead molecule for the development of more potent, small-molecule p53-MDM2 inhibitors against several types of cancer. Importantly, the combined computational and experimental screening protocol presented in this study may also prove useful for screening other commercially-available compound databases for identification of novel, small molecule p53-MDM2 inhibitors.
The Molecule Pages database

PubMed Central

Saunders, Brian; Lyon, Stephen; Day, Matthew; Riley, Brenda; Chenette, Emily; Subramaniam, Shankar

2008-01-01

The UCSD-Nature Signaling Gateway Molecule Pages (http://www.signaling-gateway.org/molecule) provides essential information on more than 3800 mammalian proteins involved in cellular signaling. The Molecule Pages contain expert-authored and peer-reviewed information based on the published literature, complemented by regularly updated information derived from public data source references and sequence analysis. The expert-authored data includes both a full-text review about the molecule, with citations, and highly structured data for bioinformatics interrogation, including information on protein interactions and states, transitions between states and protein function. The expert-authored pages are anonymously peer reviewed by the Nature Publishing Group. The Molecule Pages data is present in an object-relational database format and is freely accessible to the authors, the reviewers and the public from a web browser that serves as a presentation layer. The Molecule Pages are supported by several applications that along with the database and the interfaces form a multi-tier architecture. The Molecule Pages and the Signaling Gateway are routinely accessed by a very large research community. PMID:17965093
The Molecule Pages database.

PubMed

Saunders, Brian; Lyon, Stephen; Day, Matthew; Riley, Brenda; Chenette, Emily; Subramaniam, Shankar; Vadivelu, Ilango

2008-01-01

The UCSD-Nature Signaling Gateway Molecule Pages (http://www.signaling-gateway.org/molecule) provides essential information on more than 3800 mammalian proteins involved in cellular signaling. The Molecule Pages contain expert-authored and peer-reviewed information based on the published literature, complemented by regularly updated information derived from public data source references and sequence analysis. The expert-authored data includes both a full-text review about the molecule, with citations, and highly structured data for bioinformatics interrogation, including information on protein interactions and states, transitions between states and protein function. The expert-authored pages are anonymously peer reviewed by the Nature Publishing Group. The Molecule Pages data is present in an object-relational database format and is freely accessible to the authors, the reviewers and the public from a web browser that serves as a presentation layer. The Molecule Pages are supported by several applications that along with the database and the interfaces form a multi-tier architecture. The Molecule Pages and the Signaling Gateway are routinely accessed by a very large research community.
Search for β2 Adrenergic Receptor Ligands by Virtual Screening via Grid Computing and Investigation of Binding Modes by Docking and Molecular Dynamics Simulations

PubMed Central

Bai, Qifeng; Shao, Yonghua; Pan, Dabo; Zhang, Yang; Liu, Huanxiang; Yao, Xiaojun

2014-01-01

We designed a program called MolGridCal that can be used to screen small molecule database in grid computing on basis of JPPF grid environment. Based on MolGridCal program, we proposed an integrated strategy for virtual screening and binding mode investigation by combining molecular docking, molecular dynamics (MD) simulations and free energy calculations. To test the effectiveness of MolGridCal, we screened potential ligands for β2 adrenergic receptor (β2AR) from a database containing 50,000 small molecules. MolGridCal can not only send tasks to the grid server automatically, but also can distribute tasks using the screensaver function. As for the results of virtual screening, the known agonist BI-167107 of β2AR is ranked among the top 2% of the screened candidates, indicating MolGridCal program can give reasonable results. To further study the binding mode and refine the results of MolGridCal, more accurate docking and scoring methods are used to estimate the binding affinity for the top three molecules (agonist BI-167107, neutral antagonist alprenolol and inverse agonist ICI 118,551). The results indicate agonist BI-167107 has the best binding affinity. MD simulation and free energy calculation are employed to investigate the dynamic interaction mechanism between the ligands and β2AR. The results show that the agonist BI-167107 also has the lowest binding free energy. This study can provide a new way to perform virtual screening effectively through integrating molecular docking based on grid computing, MD simulations and free energy calculations. The source codes of MolGridCal are freely available at http://molgridcal.codeplex.com. PMID:25229694
Advances in computational metabolomics and databases deepen the understanding of metabolisms.

PubMed

Tsugawa, Hiroshi

2018-01-29

Mass spectrometry (MS)-based metabolomics is the popular platform for metabolome analyses. Computational techniques for the processing of MS raw data, for example, feature detection, peak alignment, and the exclusion of false-positive peaks, have been established. The next stage of untargeted metabolomics would be to decipher the mass fragmentation of small molecules for the global identification of human-, animal-, plant-, and microbiota metabolomes, resulting in a deeper understanding of metabolisms. This review is an update on the latest computational metabolomics including known/expected structure databases, chemical ontology classifications, and mass spectrometry cheminformatics for the interpretation of mass fragmentations and for the elucidation of unknown metabolites. The importance of metabolome 'databases' and 'repositories' is also discussed because novel biological discoveries are often attributable to the accumulation of data, to relational databases, and to their statistics. Lastly, a practical guide for metabolite annotations is presented as the summary of this review. Copyright © 2018 Elsevier Ltd. All rights reserved.
Data-Driven High-Throughput Prediction of the 3D Structure of Small Molecules: Review and Progress

PubMed Central

Andronico, Alessio; Randall, Arlo; Benz, Ryan W.; Baldi, Pierre

2011-01-01

Accurate prediction of the 3D structure of small molecules is essential in order to understand their physical, chemical, and biological properties including how they interact with other molecules. Here we survey the field of high-throughput methods for 3D structure prediction and set up new target specifications for the next generation of methods. We then introduce COSMOS, a novel data-driven prediction method that utilizes libraries of fragment and torsion angle parameters. We illustrate COSMOS using parameters extracted from the Cambridge Structural Database (CSD) by analyzing their distribution and then evaluating the system’s performance in terms of speed, coverage, and accuracy. Results show that COSMOS represents a significant improvement when compared to the state-of-the-art, particularly in terms of coverage of complex molecular structures, including metal-organics. COSMOS can predict structures for 96.4% of the molecules in the CSD [99.6% organic, 94.6% metal-organic] whereas the widely used commercial method CORINA predicts structures for 68.5% [98.5% organic, 51.6% metal-organic]. On the common subset of molecules predicted by both methods COSMOS makes predictions with an average speed per molecule of 0.15s [0.10s organic, 0.21s metal-organic], and an average RMSD of 1.57Å [1.26Å organic, 1.90Å metal-organic], and CORINA makes predictions with an average speed per molecule of 0.13s [0.18s organic, 0.08s metal-organic], and an average RMSD of 1.60Å [1.13Å organic, 2.11Å metal-organic]. COSMOS is available through the ChemDB chemoinformatics web portal at: http://cdb.ics.uci.edu/. PMID:21417267
BioSpider: a web server for automating metabolome annotations.

PubMed

Knox, Craig; Shrivastava, Savita; Stothard, Paul; Eisner, Roman; Wishart, David S

2007-01-01

One of the growing challenges in life science research lies in finding useful, descriptive or quantitative data about newly reported biomolecules (genes, proteins, metabolites and drugs). An even greater challenge is finding information that connects these genes, proteins, drugs or metabolites to each other. Much of this information is scattered through hundreds of different databases, abstracts or books and almost none of it is particularly well integrated. While some efforts are being undertaken at the NCBI and EBI to integrate many different databases together, this still falls short of the goal of having some kind of human-readable synopsis that summarizes the state of knowledge about a given biomolecule - especially small molecules. To address this shortfall, we have developed BioSpider. BioSpider is essentially an automated report generator designed specifically to tabulate and summarize data on biomolecules - both large and small. Specifically, BioSpider allows users to type in almost any kind of biological or chemical identifier (protein/gene name, sequence, accession number, chemical name, brand name, SMILES string, InCHI string, CAS number, etc.) and it returns an in-depth synoptic report (approximately 3-30 pages in length) about that biomolecule and any other biomolecule it may target. This summary includes physico-chemical parameters, images, models, data files, descriptions and predictions concerning the query molecule. BioSpider uses a web-crawler to scan through dozens of public databases and employs a variety of specially developed text mining tools and locally developed prediction tools to find, extract and assemble data for its reports. Because of its breadth, depth and comprehensiveness, we believe BioSpider will prove to be a particularly valuable tool for researchers in metabolomics. BioSpider is available at: www.biospider.ca
Investigation of drift gas selectivity in high resolution ion mobility spectrometry with mass spectrometry detection.

PubMed

Matz, Laura M; Hill, Herbert H; Beegle, Luther W; Kanik, Isik

2002-04-01

Recent studies in electrospray ionization (ESI)/ion mobility spectrometry (IMS) have focussed on employing different drift gases to alter separation efficiency for some molecules. This study investigates four structurally similar classes of molecules (cocaine and metabolites, amphetamines, benzodiazepines, and small peptides) to determine the effect of structure on relative mobility changes in four drift gases (helium, nitrogen, argon, carbon dioxide). Collision cross sections were plotted against drift gas polarizability and a linear relationship was found for the nineteen compounds evaluated in the study. Based on the reduced mobility database, all nineteen compounds could be separated in one of the four drift gases, however, the drift gas that provided optimal separation was specific for the two compounds.
Update of KDBI: Kinetic Data of Bio-molecular Interaction database

PubMed Central

Kumar, Pankaj; Han, B. C.; Shi, Z.; Jia, J.; Wang, Y. P.; Zhang, Y. T.; Liang, L.; Liu, Q. F.; Ji, Z. L.; Chen, Y. Z.

2009-01-01

Knowledge of the kinetics of biomolecular interactions is important for facilitating the study of cellular processes and underlying molecular events, and is essential for quantitative study and simulation of biological systems. Kinetic Data of Bio-molecular Interaction database (KDBI) has been developed to provide information about experimentally determined kinetic data of protein–protein, protein–nucleic acid, protein–ligand, nucleic acid–ligand binding or reaction events described in the literature. To accommodate increasing demand for studying and simulating biological systems, numerous improvements and updates have been made to KDBI, including new ways to access data by pathway and molecule names, data file in System Biology Markup Language format, more efficient search engine, access to published parameter sets of simulation models of 63 pathways, and 2.3-fold increase of data (19 263 entries of 10 532 distinctive biomolecular binding and 11 954 interaction events, involving 2635 proteins/protein complexes, 847 nucleic acids, 1603 small molecules and 45 multi-step processes). KDBI is publically available at http://bidd.nus.edu.sg/group/kdbi/kdbi.asp. PMID:18971255
Molecular docking based screening of compounds against VP40 from Ebola virus.

PubMed

M Alam El-Din, Hanaa; A Loutfy, Samah; Fathy, Nasra; H Elberry, Mostafa; M Mayla, Ahmed; Kassem, Sara; Naqvi, Asif

2016-01-01

Ebola virus causes severe and often fatal hemorrhagic fevers in humans. The 2014 Ebola epidemic affected multiple countries. The virus matrix protein (VP40) plays a central role in virus assembly and budding. Since there is no FDA-approved vaccine or medicine against Ebola viral infection, discovering new compounds with different binding patterns against it is required. Therefore, we aim to identify small molecules that target the Arg 134 RNA binding and active site of VP40 protein. 1800 molecules were retrieved from PubChem compound database based on Structure Similarity and Conformers of pyrimidine-2, 4-dione. Molecular docking approach using Lamarckian Genetic Algorithm was carried out to find the potent inhibitors for VP40 based on calculated ligand-protein pairwise interaction energies. The grid maps representing the protein were calculated using auto grid and grid size was set to 60*60*60 points with grid spacing of 0.375 Ǻ. Ten independent docking runs were carried out for each ligand and results were clustered according to the 1.0 Ǻ RMSD criteria. The post-docking analysis showed that binding energies ranged from -8.87 to 0.6 Kcal/mol. We report 7 molecules, which showed promising ADMET results, LD-50, as well as H-bond interaction in the binding pocket. The small molecules discovered could act as potential inhibitors for VP40 and could interfere with virus assembly and budding process.
Molecular docking based screening of compounds against VP40 from Ebola virus

PubMed Central

M Alam El-Din, Hanaa; A. Loutfy, Samah; Fathy, Nasra; H Elberry, Mostafa; M Mayla, Ahmed; Kassem, Sara; Naqvi, Asif

2016-01-01

Ebola virus causes severe and often fatal hemorrhagic fevers in humans. The 2014 Ebola epidemic affected multiple countries. The virus matrix protein (VP40) plays a central role in virus assembly and budding. Since there is no FDA-approved vaccine or medicine against Ebola viral infection, discovering new compounds with different binding patterns against it is required. Therefore, we aim to identify small molecules that target the Arg 134 RNA binding and active site of VP40 protein. 1800 molecules were retrieved from PubChem compound database based on Structure Similarity and Conformers of pyrimidine-2, 4-dione. Molecular docking approach using Lamarckian Genetic Algorithm was carried out to find the potent inhibitors for VP40 based on calculated ligand-protein pairwise interaction energies. The grid maps representing the protein were calculated using auto grid and grid size was set to 60*60*60 points with grid spacing of 0.375 Ǻ. Ten independent docking runs were carried out for each ligand and results were clustered according to the 1.0 Ǻ RMSD criteria. The post-docking analysis showed that binding energies ranged from -8.87 to 0.6 Kcal/mol. We report 7 molecules, which showed promising ADMET results, LD-50, as well as H-bond interaction in the binding pocket. The small molecules discovered could act as potential inhibitors for VP40 and could interfere with virus assembly and budding process. PMID:28149054
Database resources of the National Center for Biotechnology Information

PubMed Central

Sayers, Eric W.; Barrett, Tanya; Benson, Dennis A.; Bolton, Evan; Bryant, Stephen H.; Canese, Kathi; Chetvernin, Vyacheslav; Church, Deanna M.; DiCuccio, Michael; Federhen, Scott; Feolo, Michael; Fingerman, Ian M.; Geer, Lewis Y.; Helmberg, Wolfgang; Kapustin, Yuri; Krasnov, Sergey; Landsman, David; Lipman, David J.; Lu, Zhiyong; Madden, Thomas L.; Madej, Tom; Maglott, Donna R.; Marchler-Bauer, Aron; Miller, Vadim; Karsch-Mizrachi, Ilene; Ostell, James; Panchenko, Anna; Phan, Lon; Pruitt, Kim D.; Schuler, Gregory D.; Sequeira, Edwin; Sherry, Stephen T.; Shumway, Martin; Sirotkin, Karl; Slotta, Douglas; Souvorov, Alexandre; Starchenko, Grigory; Tatusova, Tatiana A.; Wagner, Lukas; Wang, Yanli; Wilbur, W. John; Yaschenko, Eugene; Ye, Jian

2012-01-01

In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI Website. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central (PMC), Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Primer-BLAST, COBALT, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, dbVar, Epigenomics, Genome and related tools, the Map Viewer, Model Maker, Evidence Viewer, Trace Archive, Sequence Read Archive, BioProject, BioSample, Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus (GEO), Probe, Online Mendelian Inheritance in Animals (OMIA), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART), Biosystems, Protein Clusters and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of these resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov. PMID:22140104

Database resources of the National Center for Biotechnology Information

PubMed Central

2013-01-01

In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI, http://www.ncbi.nlm.nih.gov) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI web site. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central, Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Primer-BLAST, COBALT, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, dbVar, Epigenomics, the Genetic Testing Registry, Genome and related tools, the Map Viewer, Model Maker, Evidence Viewer, Trace Archive, Sequence Read Archive, BioProject, BioSample, Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus, Probe, Online Mendelian Inheritance in Animals, the Molecular Modeling Database, the Conserved Domain Database, the Conserved Domain Architecture Retrieval Tool, Biosystems, Protein Clusters and the PubChem suite of small molecule databases. Augmenting many of the web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of these resources can be accessed through the NCBI home page. PMID:23193264
Database resources of the National Center for Biotechnology Information.

PubMed

Wheeler, David L; Barrett, Tanya; Benson, Dennis A; Bryant, Stephen H; Canese, Kathi; Chetvernin, Vyacheslav; Church, Deanna M; DiCuccio, Michael; Edgar, Ron; Federhen, Scott; Geer, Lewis Y; Kapustin, Yuri; Khovayko, Oleg; Landsman, David; Lipman, David J; Madden, Thomas L; Maglott, Donna R; Ostell, James; Miller, Vadim; Pruitt, Kim D; Schuler, Gregory D; Sequeira, Edwin; Sherry, Steven T; Sirotkin, Karl; Souvorov, Alexandre; Starchenko, Grigory; Tatusov, Roman L; Tatusova, Tatiana A; Wagner, Lukas; Yaschenko, Eugene

2007-01-01

In addition to maintaining the GenBank nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through NCBI's Web site. NCBI resources include Entrez, the Entrez Programming Utilities, My NCBI, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link(BLink), Electronic PCR, OrfFinder, Spidey, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genome, Genome Project and related tools, the Trace and Assembly Archives, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups (COGs), Viral Genotyping Tools, Influenza Viral Resources, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus (GEO), Entrez Probe, GENSAT, Online Mendelian Inheritance in Man (OMIM), Online Mendelian Inheritance in Animals (OMIA), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART) and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. These resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov.
Database resources of the National Center for Biotechnology Information.

PubMed

Sayers, Eric W; Barrett, Tanya; Benson, Dennis A; Bryant, Stephen H; Canese, Kathi; Chetvernin, Vyacheslav; Church, Deanna M; DiCuccio, Michael; Edgar, Ron; Federhen, Scott; Feolo, Michael; Geer, Lewis Y; Helmberg, Wolfgang; Kapustin, Yuri; Landsman, David; Lipman, David J; Madden, Thomas L; Maglott, Donna R; Miller, Vadim; Mizrachi, Ilene; Ostell, James; Pruitt, Kim D; Schuler, Gregory D; Sequeira, Edwin; Sherry, Stephen T; Shumway, Martin; Sirotkin, Karl; Souvorov, Alexandre; Starchenko, Grigory; Tatusova, Tatiana A; Wagner, Lukas; Yaschenko, Eugene; Ye, Jian

2009-01-01

In addition to maintaining the GenBank nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI web site. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR, OrfFinder, Spidey, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups (COGs), Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus (GEO), Entrez Probe, GENSAT, Online Mendelian Inheritance in Man (OMIM), Online Mendelian Inheritance in Animals (OMIA), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART) and the PubChem suite of small molecule databases. Augmenting many of the web applications is custom implementation of the BLAST program optimized to search specialized data sets. All of the resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov.
Combining Metabolite-Based Pharmacophores with Bayesian Machine Learning Models for Mycobacterium tuberculosis Drug Discovery.

PubMed

Ekins, Sean; Madrid, Peter B; Sarker, Malabika; Li, Shao-Gang; Mittal, Nisha; Kumar, Pradeep; Wang, Xin; Stratton, Thomas P; Zimmerman, Matthew; Talcott, Carolyn; Bourbon, Pauline; Travers, Mike; Yadav, Maneesh; Freundlich, Joel S

2015-01-01

Integrated computational approaches for Mycobacterium tuberculosis (Mtb) are useful to identify new molecules that could lead to future tuberculosis (TB) drugs. Our approach uses information derived from the TBCyc pathway and genome database, the Collaborative Drug Discovery TB database combined with 3D pharmacophores and dual event Bayesian models of whole-cell activity and lack of cytotoxicity. We have prioritized a large number of molecules that may act as mimics of substrates and metabolites in the TB metabolome. We computationally searched over 200,000 commercial molecules using 66 pharmacophores based on substrates and metabolites from Mtb and further filtering with Bayesian models. We ultimately tested 110 compounds in vitro that resulted in two compounds of interest, BAS 04912643 and BAS 00623753 (MIC of 2.5 and 5 μg/mL, respectively). These molecules were used as a starting point for hit-to-lead optimization. The most promising class proved to be the quinoxaline di-N-oxides, evidenced by transcriptional profiling to induce mRNA level perturbations most closely resembling known protonophores. One of these, SRI58 exhibited an MIC = 1.25 μg/mL versus Mtb and a CC50 in Vero cells of >40 μg/mL, while featuring fair Caco-2 A-B permeability (2.3 x 10-6 cm/s), kinetic solubility (125 μM at pH 7.4 in PBS) and mouse metabolic stability (63.6% remaining after 1 h incubation with mouse liver microsomes). Despite demonstration of how a combined bioinformatics/cheminformatics approach afforded a small molecule with promising in vitro profiles, we found that SRI58 did not exhibit quantifiable blood levels in mice.
Combining Metabolite-Based Pharmacophores with Bayesian Machine Learning Models for Mycobacterium tuberculosis Drug Discovery

PubMed Central

Sarker, Malabika; Li, Shao-Gang; Mittal, Nisha; Kumar, Pradeep; Wang, Xin; Stratton, Thomas P.; Zimmerman, Matthew; Talcott, Carolyn; Bourbon, Pauline; Travers, Mike; Yadav, Maneesh

2015-01-01

Integrated computational approaches for Mycobacterium tuberculosis (Mtb) are useful to identify new molecules that could lead to future tuberculosis (TB) drugs. Our approach uses information derived from the TBCyc pathway and genome database, the Collaborative Drug Discovery TB database combined with 3D pharmacophores and dual event Bayesian models of whole-cell activity and lack of cytotoxicity. We have prioritized a large number of molecules that may act as mimics of substrates and metabolites in the TB metabolome. We computationally searched over 200,000 commercial molecules using 66 pharmacophores based on substrates and metabolites from Mtb and further filtering with Bayesian models. We ultimately tested 110 compounds in vitro that resulted in two compounds of interest, BAS 04912643 and BAS 00623753 (MIC of 2.5 and 5 μg/mL, respectively). These molecules were used as a starting point for hit-to-lead optimization. The most promising class proved to be the quinoxaline di-N-oxides, evidenced by transcriptional profiling to induce mRNA level perturbations most closely resembling known protonophores. One of these, SRI58 exhibited an MIC = 1.25 μg/mL versus Mtb and a CC50 in Vero cells of >40 μg/mL, while featuring fair Caco-2 A-B permeability (2.3 x 10−6 cm/s), kinetic solubility (125 μM at pH 7.4 in PBS) and mouse metabolic stability (63.6% remaining after 1 h incubation with mouse liver microsomes). Despite demonstration of how a combined bioinformatics/cheminformatics approach afforded a small molecule with promising in vitro profiles, we found that SRI58 did not exhibit quantifiable blood levels in mice. PMID:26517557
The HITRAN 2008 Molecular Spectroscopic Database

NASA Technical Reports Server (NTRS)

Rothman, Laurence S.; Gordon, Iouli E.; Barbe, Alain; Benner, D. Chris; Bernath, Peter F.; Birk, Manfred; Boudon, V.; Brown, Linda R.; Campargue, Alain; Champion, J.-P.;

2009-01-01

This paper describes the status of the 2008 edition of the HITRAN molecular spectroscopic database. The new edition is the first official public release since the 2004 edition, although a number of crucial updates had been made available online since 2004. The HITRAN compilation consists of several components that serve as input for radiative-transfer calculation codes: individual line parameters for the microwave through visible spectra of molecules in the gas phase; absorption cross-sections for molecules having dense spectral features, i.e., spectra in which the individual lines are not resolved; individual line parameters and absorption cross sections for bands in the ultra-violet; refractive indices of aerosols, tables and files of general properties associated with the database; and database management software. The line-by-line portion of the database contains spectroscopic parameters for forty-two molecules including many of their isotopologues.

AceDRG: a stereochemical description generator for ligands

PubMed Central

Emsley, Paul; Gražulis, Saulius; Merkys, Andrius; Vaitkus, Antanas

2017-01-01

The program AceDRG is designed for the derivation of stereochemical information about small molecules. It uses local chemical and topological environment-based atom typing to derive and organize bond lengths and angles from a small-molecule database: the Crystallography Open Database (COD). Information about the hybridization states of atoms, whether they belong to small rings (up to seven-membered rings), ring aromaticity and nearest-neighbour information is encoded in the atom types. All atoms from the COD have been classified according to the generated atom types. All bonds and angles have also been classified according to the atom types and, in a certain sense, bond types. Derived data are tabulated in a machine-readable form that is freely available from CCP4. AceDRG can also generate stereochemical information, provided that the basic bonding pattern of a ligand is known. The basic bonding pattern is perceived from one of the computational chemistry file formats, including SMILES, mmCIF, SDF MOL and SYBYL MOL2 files. Using the bonding chemistry, atom types, and bond and angle tables generated from the COD, AceDRG derives the ‘ideal’ bond lengths, angles, plane groups, aromatic rings and chirality information, and writes them to an mmCIF file that can be used by the refinement program REFMAC5 and the model-building program Coot. Other refinement and model-building programs such as PHENIX and BUSTER can also use these files. AceDRG also generates one or more coordinate sets corresponding to the most favourable conformation(s) of a given ligand. AceDRG employs RDKit for chemistry perception and for initial conformation generation, as well as for the interpretation of SMILES strings, SDF MOL and SYBYL MOL2 files. PMID:28177307
An integrated one-step system to extract, analyze and annotate all relevant information from image-based cell screening of chemical libraries.

PubMed

Rabal, Obdulia; Link, Wolfgang; Serelde, Beatriz G; Bischoff, James R; Oyarzabal, Julen

2010-04-01

Here we report the development and validation of a complete solution to manage and analyze the data produced by image-based phenotypic screening campaigns of small-molecule libraries. In one step initial crude images are analyzed for multiple cytological features, statistical analysis is performed and molecules that produce the desired phenotypic profile are identified. A naïve Bayes classifier, integrating chemical and phenotypic spaces, is built and utilized during the process to assess those images initially classified as "fuzzy"-an automated iterative feedback tuning. Simultaneously, all this information is directly annotated in a relational database containing the chemical data. This novel fully automated method was validated by conducting a re-analysis of results from a high-content screening campaign involving 33 992 molecules used to identify inhibitors of the PI3K/Akt signaling pathway. Ninety-two percent of confirmed hits identified by the conventional multistep analysis method were identified using this integrated one-step system as well as 40 new hits, 14.9% of the total, originally false negatives. Ninety-six percent of true negatives were properly recognized too. A web-based access to the database, with customizable data retrieval and visualization tools, facilitates the posterior analysis of annotated cytological features which allows identification of additional phenotypic profiles; thus, further analysis of original crude images is not required.
Functional Analysis of OMICs Data and Small Molecule Compounds in an Integrated "Knowledge-Based" Platform.

PubMed

Dubovenko, Alexey; Nikolsky, Yuri; Rakhmatulin, Eugene; Nikolskaya, Tatiana

2017-01-01

Analysis of NGS and other sequencing data, gene variants, gene expression, proteomics, and other high-throughput (OMICs) data is challenging because of its biological complexity and high level of technical and biological noise. One way to deal with both problems is to perform analysis with a high fidelity annotated knowledgebase of protein interactions, pathways, and functional ontologies. This knowledgebase has to be structured in a computer-readable format and must include software tools for managing experimental data, analysis, and reporting. Here, we present MetaCore™ and Key Pathway Advisor (KPA), an integrated platform for functional data analysis. On the content side, MetaCore and KPA encompass a comprehensive database of molecular interactions of different types, pathways, network models, and ten functional ontologies covering human, mouse, and rat genes. The analytical toolkit includes tools for gene/protein list enrichment analysis, statistical "interactome" tool for the identification of over- and under-connected proteins in the dataset, and a biological network analysis module made up of network generation algorithms and filters. The suite also features Advanced Search, an application for combinatorial search of the database content, as well as a Java-based tool called Pathway Map Creator for drawing and editing custom pathway maps. Applications of MetaCore and KPA include molecular mode of action of disease research, identification of potential biomarkers and drug targets, pathway hypothesis generation, analysis of biological effects for novel small molecule compounds and clinical applications (analysis of large cohorts of patients, and translational and personalized medicine).
Identification of sumoylation activating enzyme 1 inhibitors by structure-based virtual screening.

PubMed

Kumar, Ashutosh; Ito, Akihiro; Hirohama, Mikako; Yoshida, Minoru; Zhang, Kam Y J

2013-04-22

SUMO activating enzyme 1 (SUMO E1) is responsible for the activation of SUMO in the first step of the sumoylation cascade. SUMO E1 is linked to many human diseases including cancer, thus making it a potential therapeutic target. There are few reported SUMO E1 inhibitors including several natural products. To identify small molecule inhibitors of SUMO E1 with better drug-like properties for potential therapeutic studies, we have used structure-based virtual screening to identify hits from the Maybridge small molecule library for biological assay. Our virtual screening protocol involves fast docking of the entire small molecule library with rigid protein and ligands followed by redocking of top hits using a method that incorporates both ligand and protein flexibility. Subsequently, the top-ranking compounds were prioritized using the molecular dynamics simulation-based binding free energy calculation. Out of 24 compounds that were acquired and tested using in vitro sumoylation assay, four of them showed more than 85% inhibition of sumoylation with the most active compound showing an IC50 of 14.4 μM. A similarity search with the most active compound in the ZINC database has identified three more compounds with improved potency. These compounds share a common phenyl urea scaffold and have been confirmed to inhibit SUMO E1 by in vitro SUMO-1 thioester bond formation assay. Our study suggests that these phenyl urea compounds could be used as a starting point for the development of novel therapeutic agents.
A Mapping of Drug Space from the Viewpoint of Small Molecule Metabolism

PubMed Central

Basuino, Li; Chambers, Henry F.; Lee, Deok-Sun; Wiest, Olaf G.; Babbitt, Patricia C.

2009-01-01

Small molecule drugs target many core metabolic enzymes in humans and pathogens, often mimicking endogenous ligands. The effects may be therapeutic or toxic, but are frequently unexpected. A large-scale mapping of the intersection between drugs and metabolism is needed to better guide drug discovery. To map the intersection between drugs and metabolism, we have grouped drugs and metabolites by their associated targets and enzymes using ligand-based set signatures created to quantify their degree of similarity in chemical space. The results reveal the chemical space that has been explored for metabolic targets, where successful drugs have been found, and what novel territory remains. To aid other researchers in their drug discovery efforts, we have created an online resource of interactive maps linking drugs to metabolism. These maps predict the “effect space” comprising likely target enzymes for each of the 246 MDDR drug classes in humans. The online resource also provides species-specific interactive drug-metabolism maps for each of the 385 model organisms and pathogens in the BioCyc database collection. Chemical similarity links between drugs and metabolites predict potential toxicity, suggest routes of metabolism, and reveal drug polypharmacology. The metabolic maps enable interactive navigation of the vast biological data on potential metabolic drug targets and the drug chemistry currently available to prosecute those targets. Thus, this work provides a large-scale approach to ligand-based prediction of drug action in small molecule metabolism. PMID:19701464
Surfing the Protein-Protein Interaction Surface Using Docking Methods: Application to the Design of PPI Inhibitors.

PubMed

Sable, Rushikesh; Jois, Seetharama

2015-06-23

Blocking protein-protein interactions (PPI) using small molecules or peptides modulates biochemical pathways and has therapeutic significance. PPI inhibition for designing drug-like molecules is a new area that has been explored extensively during the last decade. Considering the number of available PPI inhibitor databases and the limited number of 3D structures available for proteins, docking and scoring methods play a major role in designing PPI inhibitors as well as stabilizers. Docking methods are used in the design of PPI inhibitors at several stages of finding a lead compound, including modeling the protein complex, screening for hot spots on the protein-protein interaction interface and screening small molecules or peptides that bind to the PPI interface. There are three major challenges to the use of docking on the relatively flat surfaces of PPI. In this review we will provide some examples of the use of docking in PPI inhibitor design as well as its limitations. The combination of experimental and docking methods with improved scoring function has thus far resulted in few success stories of PPI inhibitors for therapeutic purposes. Docking algorithms used for PPI are in the early stages, however, and as more data are available docking will become a highly promising area in the design of PPI inhibitors or stabilizers.
CHIPMUNK: A Virtual Synthesizable Small-Molecule Library for Medicinal Chemistry, Exploitable for Protein-Protein Interaction Modulators.

PubMed

Humbeck, Lina; Weigang, Sebastian; Schäfer, Till; Mutzel, Petra; Koch, Oliver

2018-03-20

A common issue during drug design and development is the discovery of novel scaffolds for protein targets. On the one hand the chemical space of purchasable compounds is rather limited; on the other hand artificially generated molecules suffer from a grave lack of accessibility in practice. Therefore, we generated a novel virtual library of small molecules which are synthesizable from purchasable educts, called CHIPMUNK (CHemically feasible In silico Public Molecular UNiverse Knowledge base). Altogether, CHIPMUNK covers over 95 million compounds and encompasses regions of the chemical space that are not covered by existing databases. The coverage of CHIPMUNK exceeds the chemical space spanned by the Lipinski rule of five to foster the exploration of novel and difficult target classes. The analysis of the generated property space reveals that CHIPMUNK is well suited for the design of protein-protein interaction inhibitors (PPIIs). Furthermore, a recently developed structural clustering algorithm (StruClus) for big data was used to partition the sub-libraries into meaningful subsets and assist scientists to process the large amount of data. These clustered subsets also contain the target space based on ChEMBL data which was included during clustering. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Combined use of computational chemistry and chemoinformatics methods for chemical discovery

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sugimoto, Manabu, E-mail: sugimoto@kumamoto-u.ac.jp; Institute for Molecular Science, 38 Nishigo-Naka, Myodaiji, Okazaki 444-8585; CREST, Japan Science and Technology Agency, 4-1-8 Honcho, Kawaguchi, Saitama 332-0012

2015-12-31

Data analysis on numerical data by the computational chemistry calculations is carried out to obtain knowledge information of molecules. A molecular database is developed to systematically store chemical, electronic-structure, and knowledge-based information. The database is used to find molecules related to a keyword of “cancer”. Then the electronic-structure calculations are performed to quantitatively evaluate quantum chemical similarity of the molecules. Among the 377 compounds registered in the database, 24 molecules are found to be “cancer”-related. This set of molecules includes both carcinogens and anticancer drugs. The quantum chemical similarity analysis, which is carried out by using numerical results of themore » density-functional theory calculations, shows that, when some energy spectra are referred to, carcinogens are reasonably distinguished from the anticancer drugs. Therefore these spectral properties are considered of as important measures for classification.« less
A crystallographic perspective on sharing data and knowledge

NASA Astrophysics Data System (ADS)

Bruno, Ian J.; Groom, Colin R.

2014-10-01

The crystallographic community is in many ways an exemplar of the benefits and practices of sharing data. Since the inception of the technique, virtually every published crystal structure has been made available to others. This has been achieved through the establishment of several specialist data centres, including the Cambridge Crystallographic Data Centre, which produces the Cambridge Structural Database. Containing curated structures of small organic molecules, some containing a metal, the database has been produced for almost 50 years. This has required the development of complex informatics tools and an environment allowing expert human curation. As importantly, a financial model has evolved which has, to date, ensured the sustainability of the resource. However, the opportunities afforded by technological changes and changing attitudes to sharing data make it an opportune moment to review current practices.
The Cambridge Structural Database

PubMed Central

Groom, Colin R.; Bruno, Ian J.; Lightfoot, Matthew P.; Ward, Suzanna C.

2016-01-01

The Cambridge Structural Database (CSD) contains a complete record of all published organic and metal–organic small-molecule crystal structures. The database has been in operation for over 50 years and continues to be the primary means of sharing structural chemistry data and knowledge across disciplines. As well as structures that are made public to support scientific articles, it includes many structures published directly as CSD Communications. All structures are processed both computationally and by expert structural chemistry editors prior to entering the database. A key component of this processing is the reliable association of the chemical identity of the structure studied with the experimental data. This important step helps ensure that data is widely discoverable and readily reusable. Content is further enriched through selective inclusion of additional experimental data. Entries are available to anyone through free CSD community web services. Linking services developed and maintained by the CCDC, combined with the use of standard identifiers, facilitate discovery from other resources. Data can also be accessed through CCDC and third party software applications and through an application programming interface. PMID:27048719
Chemical Informatics and the Drug Discovery Knowledge Pyramid

PubMed Central

Lushington, Gerald H.; Dong, Yinghua; Theertham, Bhargav

2012-01-01

The magnitude of the challenges in preclinical drug discovery is evident in the large amount of capital invested in such efforts in pursuit of a small static number of eventually successful marketable therapeutics. An explosion in the availability of potentially drug-like compounds and chemical biology data on these molecules can provide us with the means to improve the eventual success rates for compounds being considered at the preclinical level, but only if the community is able to access available information in an efficient and meaningful way. Thus, chemical database resources are critical to any serious drug discovery effort. This paper explores the basic principles underlying the development and implementation of chemical databases, and examines key issues of how molecular information may be encoded within these databases so as to enhance the likelihood that users will be able to extract meaningful information from data queries. In addition to a broad survey of conventional data representation and query strategies, key enabling technologies such as new context-sensitive chemical similarity measures and chemical cartridges are examined, with recommendations on how such resources may be integrated into a practical database environment. PMID:23782037
The Cambridge Structural Database.

PubMed

Groom, Colin R; Bruno, Ian J; Lightfoot, Matthew P; Ward, Suzanna C

2016-04-01

The Cambridge Structural Database (CSD) contains a complete record of all published organic and metal-organic small-molecule crystal structures. The database has been in operation for over 50 years and continues to be the primary means of sharing structural chemistry data and knowledge across disciplines. As well as structures that are made public to support scientific articles, it includes many structures published directly as CSD Communications. All structures are processed both computationally and by expert structural chemistry editors prior to entering the database. A key component of this processing is the reliable association of the chemical identity of the structure studied with the experimental data. This important step helps ensure that data is widely discoverable and readily reusable. Content is further enriched through selective inclusion of additional experimental data. Entries are available to anyone through free CSD community web services. Linking services developed and maintained by the CCDC, combined with the use of standard identifiers, facilitate discovery from other resources. Data can also be accessed through CCDC and third party software applications and through an application programming interface.
Database resources of the National Center for Biotechnology Information.

PubMed

2016-01-04

The National Center for Biotechnology Information (NCBI) provides a large suite of online resources for biological information and data, including the GenBank(®) nucleic acid sequence database and the PubMed database of citations and abstracts for published life science journals. Additional NCBI resources focus on literature (PubMed Central (PMC), Bookshelf and PubReader), health (ClinVar, dbGaP, dbMHC, the Genetic Testing Registry, HIV-1/Human Protein Interaction Database and MedGen), genomes (BioProject, Assembly, Genome, BioSample, dbSNP, dbVar, Epigenomics, the Map Viewer, Nucleotide, Probe, RefSeq, Sequence Read Archive, the Taxonomy Browser and the Trace Archive), genes (Gene, Gene Expression Omnibus (GEO), HomoloGene, PopSet and UniGene), proteins (Protein, the Conserved Domain Database (CDD), COBALT, Conserved Domain Architecture Retrieval Tool (CDART), the Molecular Modeling Database (MMDB) and Protein Clusters) and chemicals (Biosystems and the PubChem suite of small molecule databases). The Entrez system provides search and retrieval operations for most of these databases. Augmenting many of the web applications are custom implementations of the BLAST program optimized to search specialized datasets. All of these resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov. Published by Oxford University Press on behalf of Nucleic Acids Research 2015. This work is written by (a) US Government employee(s) and is in the public domain in the US.
Database resources of the National Center for Biotechnology Information.

PubMed

2015-01-01

The National Center for Biotechnology Information (NCBI) provides a large suite of online resources for biological information and data, including the GenBank(®) nucleic acid sequence database and the PubMed database of citations and abstracts for published life science journals. Additional NCBI resources focus on literature (Bookshelf, PubMed Central (PMC) and PubReader); medical genetics (ClinVar, dbMHC, the Genetic Testing Registry, HIV-1/Human Protein Interaction Database and MedGen); genes and genomics (BioProject, BioSample, dbSNP, dbVar, Epigenomics, Gene, Gene Expression Omnibus (GEO), Genome, HomoloGene, the Map Viewer, Nucleotide, PopSet, Probe, RefSeq, Sequence Read Archive, the Taxonomy Browser, Trace Archive and UniGene); and proteins and chemicals (Biosystems, COBALT, the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART), the Molecular Modeling Database (MMDB), Protein Clusters, Protein and the PubChem suite of small molecule databases). The Entrez system provides search and retrieval operations for many of these databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of these resources can be accessed through the NCBI home page at http://www.ncbi.nlm.nih.gov. Published by Oxford University Press on behalf of Nucleic Acids Research 2014. This work is written by (a) US Government employee(s) and is in the public domain in the US.

Plant Reactome: a resource for plant pathways and comparative analysis

PubMed Central

Naithani, Sushma; Preece, Justin; D'Eustachio, Peter; Gupta, Parul; Amarasinghe, Vindhya; Dharmawardhana, Palitha D.; Wu, Guanming; Fabregat, Antonio; Elser, Justin L.; Weiser, Joel; Keays, Maria; Fuentes, Alfonso Munoz-Pomer; Petryszak, Robert; Stein, Lincoln D.; Ware, Doreen; Jaiswal, Pankaj

2017-01-01

Plant Reactome (http://plantreactome.gramene.org/) is a free, open-source, curated plant pathway database portal, provided as part of the Gramene project. The database provides intuitive bioinformatics tools for the visualization, analysis and interpretation of pathway knowledge to support genome annotation, genome analysis, modeling, systems biology, basic research and education. Plant Reactome employs the structural framework of a plant cell to show metabolic, transport, genetic, developmental and signaling pathways. We manually curate molecular details of pathways in these domains for reference species Oryza sativa (rice) supported by published literature and annotation of well-characterized genes. Two hundred twenty-two rice pathways, 1025 reactions associated with 1173 proteins, 907 small molecules and 256 literature references have been curated to date. These reference annotations were used to project pathways for 62 model, crop and evolutionarily significant plant species based on gene homology. Database users can search and browse various components of the database, visualize curated baseline expression of pathway-associated genes provided by the Expression Atlas and upload and analyze their Omics datasets. The database also offers data access via Application Programming Interfaces (APIs) and in various standardized pathway formats, such as SBML and BioPAX. PMID:27799469
Advanced SPARQL querying in small molecule databases.

PubMed

Galgonek, Jakub; Hurt, Tomáš; Michlíková, Vendula; Onderka, Petr; Schwarz, Jan; Vondrášek, Jiří

2016-01-01

In recent years, the Resource Description Framework (RDF) and the SPARQL query language have become more widely used in the area of cheminformatics and bioinformatics databases. These technologies allow better interoperability of various data sources and powerful searching facilities. However, we identified several deficiencies that make usage of such RDF databases restrictive or challenging for common users. We extended a SPARQL engine to be able to use special procedures inside SPARQL queries. This allows the user to work with data that cannot be simply precomputed and thus cannot be directly stored in the database. We designed an algorithm that checks a query against data ontology to identify possible user errors. This greatly improves query debugging. We also introduced an approach to visualize retrieved data in a user-friendly way, based on templates describing visualizations of resource classes. To integrate all of our approaches, we developed a simple web application. Our system was implemented successfully, and we demonstrated its usability on the ChEBI database transformed into RDF form. To demonstrate procedure call functions, we employed compound similarity searching based on OrChem. The application is publicly available at https://bioinfo.uochb.cas.cz/projects/chemRDF.
Electronic spectra from TDDFT and machine learning in chemical space

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ramakrishnan, Raghunathan; Hartmann, Mia; Tapavicza, Enrico

Due to its favorable computational efficiency, time-dependent (TD) density functional theory (DFT) enables the prediction of electronic spectra in a high-throughput manner across chemical space. Its predictions, however, can be quite inaccurate. We resolve this issue with machine learning models trained on deviations of reference second-order approximate coupled-cluster (CC2) singles and doubles spectra from TDDFT counterparts, or even from DFT gap. We applied this approach to low-lying singlet-singlet vertical electronic spectra of over 20 000 synthetically feasible small organic molecules with up to eight CONF atoms. The prediction errors decay monotonously as a function of training set size. For amore » training set of 10 000 molecules, CC2 excitation energies can be reproduced to within +/- 0.1 eV for the remaining molecules. Analysis of our spectral database via chromophore counting suggests that even higher accuracies can be achieved. Based on the evidence collected, we discuss open challenges associated with data-driven modeling of high-lying spectra and transition intensities.« less
Classification of ligand molecules in PDB with fast heuristic graph match algorithm COMPLIG.

PubMed

Saito, Mihoko; Takemura, Naomi; Shirai, Tsuyoshi

2012-12-14

A fast heuristic graph-matching algorithm, COMPLIG, was devised to classify the small-molecule ligands in the Protein Data Bank (PDB), which are currently not properly classified on structure basis. By concurrently classifying proteins and ligands, we determined the most appropriate parameter for categorizing ligands to be more than 60% identity of atoms and bonds between molecules, and we classified 11,585 types of ligands into 1946 clusters. Although the large clusters were composed of nucleotides or amino acids, a significant presence of drug compounds was also observed. Application of the system to classify the natural ligand status of human proteins in the current database suggested that, at most, 37% of the experimental structures of human proteins were in complex with natural ligands. However, protein homology- and/or ligand similarity-based modeling was implied to provide models of natural interactions for an additional 28% of the total, which might be used to increase the knowledge of intrinsic protein-metabolite interactions. Copyright © 2012 Elsevier Ltd. All rights reserved.
ChemoPy: freely available python package for computational biology and chemoinformatics.

PubMed

Cao, Dong-Sheng; Xu, Qing-Song; Hu, Qian-Nan; Liang, Yi-Zeng

2013-04-15

Molecular representation for small molecules has been routinely used in QSAR/SAR, virtual screening, database search, ranking, drug ADME/T prediction and other drug discovery processes. To facilitate extensive studies of drug molecules, we developed a freely available, open-source python package called chemoinformatics in python (ChemoPy) for calculating the commonly used structural and physicochemical features. It computes 16 drug feature groups composed of 19 descriptors that include 1135 descriptor values. In addition, it provides seven types of molecular fingerprint systems for drug molecules, including topological fingerprints, electro-topological state (E-state) fingerprints, MACCS keys, FP4 keys, atom pairs fingerprints, topological torsion fingerprints and Morgan/circular fingerprints. By applying a semi-empirical quantum chemistry program MOPAC, ChemoPy can also compute a large number of 3D molecular descriptors conveniently. The python package, ChemoPy, is freely available via http://code.google.com/p/pychem/downloads/list, and it runs on Linux and MS-Windows. Supplementary data are available at Bioinformatics online.
Computational Chemistry Comparison and Benchmark Database

National Institute of Standards and Technology Data Gateway

SRD 101 NIST Computational Chemistry Comparison and Benchmark Database (Web, free access) The NIST Computational Chemistry Comparison and Benchmark Database is a collection of experimental and ab initio thermochemical properties for a selected set of molecules. The goals are to provide a benchmark set of molecules for the evaluation of ab initio computational methods and allow the comparison between different ab initio computational methods for the prediction of thermochemical properties.
Open, Cross Platform Chemistry Application Unifying Structure Manipulation, External Tools, Databases and Visualization

DTIC Science & Technology

2014-05-30

mol.addBond(o1, h2, 1); Avogadro ::Core::Bond b2 = mol.addBond(o1, h3, 1); The QtGui::Molecule class inherits from Core::Molecule and Qt’s QObject...populated as an input (although they are all implemented in terms of the Core::Molecule class. The third is QtGui::RWMolecule which inherits from just...shown in Figure 16. The use of molecule fingerprinting techniques gives the database the ability to be searched by similarity to a desired structure, as
Creating and Using a Consumer Chemical Molecular Graphics Database: The "Molecule of the Day" - A Great Way To Begin Your Lecture

NASA Astrophysics Data System (ADS)

Scharberg, Maureen A.; Cox, Oran E.; Barelli, Carl A.

1997-07-01

"The Molecule of the Day" consumer chemical database has been created to allow introductory chemistry students to explore molecular structures of chemicals in household products, and to provide opportunities in molecular modeling for undergraduate chemistry students. Before class begins, an overhead transparency is displayed which shows a three-dimensional molecular structure of a household chemical, and lists relevant features and uses of this chemical. Within answers to questionnaires, students have commented that this molecular graphics database has helped them to visually connect the microscopic structure of a molecule with its physical and chemical properties, as well as its uses in consumer products. It is anticipated that this database will be incorporated into a navigational software package such as Netscape.
Ligandbook: an online repository for small and drug-like molecule force field parameters.

PubMed

Domanski, Jan; Beckstein, Oliver; Iorga, Bogdan I

2017-06-01

Ligandbook is a public database and archive for force field parameters of small and drug-like molecules. It is a repository for parameter sets that are part of published work but are not easily available to the community otherwise. Parameter sets can be downloaded and immediately used in molecular dynamics simulations. The sets of parameters are versioned with full histories and carry unique identifiers to facilitate reproducible research. Text-based search on rich metadata and chemical substructure search allow precise identification of desired compounds or functional groups. Ligandbook enables the rapid set up of reproducible molecular dynamics simulations of ligands and protein-ligand complexes. Ligandbook is available online at https://ligandbook.org and supports all modern browsers. Parameters can be searched and downloaded without registration, including access through a programmatic RESTful API. Deposition of files requires free user registration. Ligandbook is implemented in the PHP Symfony2 framework with TCL scripts using the CACTVS toolkit. oliver.beckstein@asu.edu or bogdan.iorga@cnrs.fr ; contact@ligandbook.org . Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.
NMReDATA, a standard to report the NMR assignment and parameters of organic compounds.

PubMed

Pupier, Marion; Nuzillard, Jean-Marc; Wist, Julien; Schlörer, Nils E; Kuhn, Stefan; Erdelyi, Mate; Steinbeck, Christoph; Williams, Antony J; Butts, Craig; Claridge, Tim D W; Mikhova, Bozhana; Robien, Wolfgang; Dashti, Hesam; Eghbalnia, Hamid R; Farès, Christophe; Adam, Christian; Kessler, Pavel; Moriaud, Fabrice; Elyashberg, Mikhail; Argyropoulos, Dimitris; Pérez, Manuel; Giraudeau, Patrick; Gil, Roberto R; Trevorrow, Paul; Jeannerat, Damien

2018-04-14

Even though NMR has found countless applications in the field of small molecule characterization, there is no standard file format available for the NMR data relevant to structure characterization of small molecules. A new format is therefore introduced to associate the NMR parameters extracted from 1D and 2D spectra of organic compounds to the proposed chemical structure. These NMR parameters, which we shall call NMReDATA (for nuclear magnetic resonance extracted data), include chemical shift values, signal integrals, intensities, multiplicities, scalar coupling constants, lists of 2D correlations, relaxation times, and diffusion rates. The file format is an extension of the existing Structure Data Format, which is compatible with the commonly used MOL format. The association of an NMReDATA file with the raw and spectral data from which it originates constitutes an NMR record. This format is easily readable by humans and computers and provides a simple and efficient way for disseminating results of structural chemistry investigations, allowing automatic verification of published results, and for assisting the constitution of highly needed open-source structural databases. Copyright © 2018 John Wiley & Sons, Ltd.
Database resources of the National Center for Biotechnology Information.

PubMed

Sayers, Eric W; Barrett, Tanya; Benson, Dennis A; Bolton, Evan; Bryant, Stephen H; Canese, Kathi; Chetvernin, Vyacheslav; Church, Deanna M; DiCuccio, Michael; Federhen, Scott; Feolo, Michael; Fingerman, Ian M; Geer, Lewis Y; Helmberg, Wolfgang; Kapustin, Yuri; Landsman, David; Lipman, David J; Lu, Zhiyong; Madden, Thomas L; Madej, Tom; Maglott, Donna R; Marchler-Bauer, Aron; Miller, Vadim; Mizrachi, Ilene; Ostell, James; Panchenko, Anna; Phan, Lon; Pruitt, Kim D; Schuler, Gregory D; Sequeira, Edwin; Sherry, Stephen T; Shumway, Martin; Sirotkin, Karl; Slotta, Douglas; Souvorov, Alexandre; Starchenko, Grigory; Tatusova, Tatiana A; Wagner, Lukas; Wang, Yanli; Wilbur, W John; Yaschenko, Eugene; Ye, Jian

2011-01-01

In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI Web site. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central (PMC), Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Primer-BLAST, COBALT, Electronic PCR, OrfFinder, Splign, ProSplign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, dbVar, Epigenomics, Cancer Chromosomes, Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Trace Archive, Sequence Read Archive, Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus (GEO), Entrez Probe, GENSAT, Online Mendelian Inheritance in Man (OMIM), Online Mendelian Inheritance in Animals (OMIA), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART), IBIS, Biosystems, Peptidome, OMSSA, Protein Clusters and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of these resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov.
A nearest neighbor approach for automated transporter prediction and categorization from protein sequences.

PubMed

Li, Haiquan; Dai, Xinbin; Zhao, Xuechun

2008-05-01

Membrane transport proteins play a crucial role in the import and export of ions, small molecules or macromolecules across biological membranes. Currently, there are a limited number of published computational tools which enable the systematic discovery and categorization of transporters prior to costly experimental validation. To approach this problem, we utilized a nearest neighbor method which seamlessly integrates homologous search and topological analysis into a machine-learning framework. Our approach satisfactorily distinguished 484 transporter families in the Transporter Classification Database, a curated and representative database for transporters. A five-fold cross-validation on the database achieved a positive classification rate of 72.3% on average. Furthermore, this method successfully detected transporters in seven model and four non-model organisms, ranging from archaean to mammalian species. A preliminary literature-based validation has cross-validated 65.8% of our predictions on the 11 organisms, including 55.9% of our predictions overlapping with 83.6% of the predicted transporters in TransportDB.
Interactive and Versatile Navigation of Structural Databases.

PubMed

Korb, Oliver; Kuhn, Bernd; Hert, Jérôme; Taylor, Neil; Cole, Jason; Groom, Colin; Stahl, Martin

2016-05-12

We present CSD-CrossMiner, a novel tool for pharmacophore-based searches in crystal structure databases. Intuitive pharmacophore queries describing, among others, protein-ligand interaction patterns, ligand scaffolds, or protein environments can be built and modified interactively. Matching crystal structures are overlaid onto the query and visualized as soon as they are available, enabling the researcher to quickly modify a hypothesis on the fly. We exemplify the utility of the approach by showing applications relevant to real-world drug discovery projects, including the identification of novel fragments for a specific protein environment or scaffold hopping. The ability to concurrently search protein-ligand binding sites extracted from the Protein Data Bank (PDB) and small organic molecules from the Cambridge Structural Database (CSD) using the same pharmacophore query further emphasizes the flexibility of CSD-CrossMiner. We believe that CSD-CrossMiner closes an important gap in mining structural data and will allow users to extract more value from the growing number of available crystal structures.
Small Molecules-Big Data.

PubMed

Császár, Attila G; Furtenbacher, Tibor; Árendás, Péter

2016-11-17

Quantum mechanics builds large-scale graphs (networks): the vertices are the discrete energy levels the quantum system possesses, and the edges are the (quantum-mechanically allowed) transitions. Parts of the complete quantum mechanical networks can be probed experimentally via high-resolution, energy-resolved spectroscopic techniques. The complete rovibronic line list information for a given molecule can only be obtained through sophisticated quantum-chemical computations. Experiments as well as computations yield what we call spectroscopic networks (SN). First-principles SNs of even small, three to five atomic molecules can be huge, qualifying for the big data description. Besides helping to interpret high-resolution spectra, the network-theoretical view offers several ideas for improving the accuracy and robustness of the increasingly important information systems containing line-by-line spectroscopic data. For example, the smallest number of measurements necessary to perform to obtain the complete list of energy levels is given by the minimum-weight spanning tree of the SN and network clustering studies may call attention to "weakest links" of a spectroscopic database. A present-day application of spectroscopic networks is within the MARVEL (Measured Active Rotational-Vibrational Energy Levels) approach, whereby the transitions information on a measured SN is turned into experimental energy levels via a weighted linear least-squares refinement. MARVEL has been used successfully for 15 molecules and allowed to validate most of the transitions measured and come up with energy levels with well-defined and realistic uncertainties. Accurate knowledge of the energy levels with computed transition intensities allows the realistic prediction of spectra under many different circumstances, e.g., for widely different temperatures. Detailed knowledge of the energy level structure of a molecule coming from a MARVEL analysis is important for a considerable number of modeling efforts in chemistry, physics, and engineering.
Molecular targets for small-molecule modulators of circadian clocks

PubMed Central

He, Baokun; Chen, Zheng

2016-01-01

Background Circadian clocks are endogenous timing systems that regulate various aspects of mammalian metabolism, physiology and behavior. Traditional chronotherapy refers to the administration of drugs in a defined circadian time window to achieve optimal pharmacokinetic and therapeutic efficacies. In recent years, substantial efforts have been dedicated to developing novel small-molecule modulators of circadian clocks. Methods Here, we review the recent progress in the identification of molecular targets of small-molecule clock modulators and their efficacies in clock-related disorders. Specifically, we examine the clock components and regulatory factors as possible molecular targets of small molecules, and we review several key clock-related disorders as promising venues for testing the preventive/therapeutic efficacies of these small molecules. Finally, we also discuss circadian regulation of drug metabolism. Results Small molecules can modulate the period, phase and/or amplitude of the circadian cycle. Core clock proteins, nuclear hormone receptors, and clock-related kinases and other epigenetic regulators are promising molecular targets for small molecules. Through these targets small molecules exert protective effects against clock-related disorders including the metabolic syndrome, immune disorders, sleep disorders and cancer. Small molecules can also modulate circadian drug metabolism and response to existing therapeutics. Conclusion Small-molecule clock modulators target clock components or diverse cellular pathways that functionally impinge upon the clock. Target identification of new small-molecule modulators will deepen our understanding of key regulatory nodes in the circadian network. Studies of clock modulators will facilitate their therapeutic applications, alone or in combination, for clock-related diseases. PMID:26750111
Biomedical Requirements for High Productivity Computing Systems

DTIC Science & Technology

2005-04-01

server at http://www.ncbi.nlm.nih.gov/BLAST/. There are many variants of BLAST, including: 1. BLASTN - Compares a DNA query to a DNA database. Searches ...database (3 reading frames from each strand of the DNA) searching . 13 4. TBLASTN - Compares a protein query to a DNA database, in the 6 possible...the molecular during this phase. After eliminating molecules that could not match the query , an atom-by-atom search for the molecules in conducted
Recent advances in developing small molecules targeting RNA.

PubMed

Guan, Lirui; Disney, Matthew D

2012-01-20

RNAs are underexploited targets for small molecule drugs or chemical probes of function. This may be due, in part, to a fundamental lack of understanding of the types of small molecules that bind RNA specifically and the types of RNA motifs that specifically bind small molecules. In this review, we describe recent advances in the development and design of small molecules that bind to RNA and modulate function that aim to fill this void.
AtlasCBS: a web server to map and explore chemico-biological space

NASA Astrophysics Data System (ADS)

Cortés-Cabrera, Álvaro; Morreale, Antonio; Gago, Federico; Abad-Zapatero, Celerino

2012-09-01

New approaches are needed that can help decrease the unsustainable failure in small-molecule drug discovery. Ligand Efficiency Indices (LEI) are making a great impact on early-stage compound selection and prioritization. Given a target-ligand database with chemical structures and associated biological affinities/activities for a target, the AtlasCBS server generates two-dimensional, dynamical representations of its contents in terms of LEI. These variables allow an effective decoupling of the chemical (angular) and biological (radial) components. BindingDB, PDBBind and ChEMBL databases are currently implemented. Proprietary datasets can also be uploaded and compared. The utility of this atlas-like representation in the future of drug design is highlighted with some examples. The web server can be accessed at http://ub.cbm.uam.es/atlascbs and https://www.ebi.ac.uk/chembl/atlascbs.
AtlasCBS: a web server to map and explore chemico-biological space.

PubMed

Cortés-Cabrera, Alvaro; Morreale, Antonio; Gago, Federico; Abad-Zapatero, Celerino

2012-09-01

New approaches are needed that can help decrease the unsustainable failure in small-molecule drug discovery. Ligand Efficiency Indices (LEI) are making a great impact on early-stage compound selection and prioritization. Given a target-ligand database with chemical structures and associated biological affinities/activities for a target, the AtlasCBS server generates two-dimensional, dynamical representations of its contents in terms of LEI. These variables allow an effective decoupling of the chemical (angular) and biological (radial) components. BindingDB, PDBBind and ChEMBL databases are currently implemented. Proprietary datasets can also be uploaded and compared. The utility of this atlas-like representation in the future of drug design is highlighted with some examples. The web server can be accessed at http://ub.cbm.uam.es/atlascbs and https://www.ebi.ac.uk/chembl/atlascbs.
Small-Molecule “BRCA1-Mimetics” Are Antagonists of Estrogen Receptor-α

PubMed Central

Ma, Yongxian; Tomita, York; Preet, Anju; Clarke, Robert; Englund, Erikah; Grindrod, Scott; Nathan, Shyam; De Oliveira, Eliseu; Brown, Milton L.

2014-01-01

Context: Resistance to conventional antiestrogens is a major cause of treatment failure and, ultimately, death in breast cancer. Objective: The objective of the study was to identify small-molecule estrogen receptor (ER)-α antagonists that work differently from tamoxifen and other selective estrogen receptor modulators. Design: Based on in silico screening of a pharmacophore database using a computed model of the BRCA1-ER-α complex (with ER-α liganded to 17β-estradiol), we identified a candidate group of small-molecule compounds predicted to bind to a BRCA1-binding interface separate from the ligand-binding pocket and the coactivator binding site of ER-α. Among 40 candidate compounds, six inhibited estradiol-stimulated ER-α activity by at least 50% in breast carcinoma cells, with IC50 values ranging between 3 and 50 μM. These ER-α inhibitory compounds were further studied by molecular and cell biological techniques. Results: The compounds strongly inhibited ER-α activity at concentrations that yielded little or no nonspecific toxicity, but they produced only a modest inhibition of progesterone receptor activity. Importantly, the compounds blocked proliferation and inhibited ER-α activity about equally well in antiestrogen-sensitive and antiestrogen-resistant breast cancer cells. Representative compounds disrupted the interaction of BRCA1 and ER-α in the cultured cells and blocked the interaction of ER-α with the estrogen response element. However, the compounds had no effect on the total cellular ER-α levels. Conclusions: These findings suggest that we have identified a new class of ER-α antagonists that work differently from conventional antiestrogens (eg, tamoxifen and fulvestrant). PMID:25264941

Rationally designed small molecules targeting the RNA that causes myotonic dystrophy type 1 are potently bioactive.

PubMed

Childs-Disney, Jessica L; Hoskins, Jason; Rzuczek, Suzanne G; Thornton, Charles A; Disney, Matthew D

2012-05-18

RNA is an important drug target, but it is difficult to design or discover small molecules that modulate RNA function. In the present study, we report that rationally designed, modularly assembled small molecules that bind the RNA that causes myotonic dystrophy type 1 (DM1) are potently bioactive in cell culture models. DM1 is caused when an expansion of r(CUG) repeats, or r(CUG)(exp), is present in the 3' untranslated region (UTR) of the dystrophia myotonica protein kinase (DMPK) mRNA. r(CUG)(exp) folds into a hairpin with regularly repeating 5'CUG/3'GUC motifs and sequesters muscleblind-like 1 protein (MBNL1). A variety of defects are associated with DM1, including (i) formation of nuclear foci, (ii) decreased translation of DMPK mRNA due to its nuclear retention, and (iii) pre-mRNA splicing defects due to inactivation of MBNL1, which controls the alternative splicing of various pre-mRNAs. Previously, modularly assembled ligands targeting r(CUG)(exp) were designed using information in an RNA motif-ligand database. These studies showed that a bis-benzimidazole (H) binds the 5'CUG/3'GUC motif in r(CUG)(exp.) Therefore, we designed multivalent ligands to bind simultaneously multiple copies of this motif in r(CUG)(exp). Herein, we report that the designed compounds improve DM1-associated defects including improvement of translational and pre-mRNA splicing defects and the disruption of nuclear foci. These studies may establish a foundation to exploit other RNA targets in genomic sequence.
Substrate-Driven Mapping of the Degradome by Comparison of Sequence Logos

PubMed Central

Fuchs, Julian E.; von Grafenstein, Susanne; Huber, Roland G.; Kramer, Christian; Liedl, Klaus R.

2013-01-01

Sequence logos are frequently used to illustrate substrate preferences and specificity of proteases. Here, we employed the compiled substrates of the MEROPS database to introduce a novel metric for comparison of protease substrate preferences. The constructed similarity matrix of 62 proteases can be used to intuitively visualize similarities in protease substrate readout via principal component analysis and construction of protease specificity trees. Since our new metric is solely based on substrate data, we can engraft the protease tree including proteolytic enzymes of different evolutionary origin. Thereby, our analyses confirm pronounced overlaps in substrate recognition not only between proteases closely related on sequence basis but also between proteolytic enzymes of different evolutionary origin and catalytic type. To illustrate the applicability of our approach we analyze the distribution of targets of small molecules from the ChEMBL database in our substrate-based protease specificity trees. We observe a striking clustering of annotated targets in tree branches even though these grouped targets do not necessarily share similarity on protein sequence level. This highlights the value and applicability of knowledge acquired from peptide substrates in drug design of small molecules, e.g., for the prediction of off-target effects or drug repurposing. Consequently, our similarity metric allows to map the degradome and its associated drug target network via comparison of known substrate peptides. The substrate-driven view of protein-protein interfaces is not limited to the field of proteases but can be applied to any target class where a sufficient amount of known substrate data is available. PMID:24244149
Identification of RNA molecules by specific enzyme digestion and mass spectrometry: software for and implementation of RNA mass mapping

PubMed Central

Matthiesen, Rune; Kirpekar, Finn

2009-01-01

The idea of identifying or characterizing an RNA molecule based on a mass spectrum of specifically generated RNA fragments has been used in various forms for well over a decade. We have developed software—named RRM for ‘RNA mass mapping’—which can search whole prokaryotic genomes or RNA FASTA sequence databases to identify the origin of a given RNA based on a mass spectrum of RNA fragments. As input, the program uses the masses of specific RNase cleavage of the RNA under investigation. RNase T1 digestion is used here as a demonstration of the usability of the method for RNA identification. The concept for identification is that the masses of the digestion products constitute a specific fingerprint, which characterize the given RNA. The search algorithm is based on the same principles as those used in peptide mass fingerprinting, but has here been extended to work for both RNA sequence databases and for genome searches. A simple and powerful probability model for ranking RNA matches is proposed. We demonstrate viability of the entire setup by identifying the DNA template of a series of RNAs of biological and of in vitro transcriptional origin in complete microbial genomes and by identifying authentic 16S ribosomal RNAs in a ‘small ribosomal subunit RNA’ database. Thus, we present a new tool for a rapid identification of unknown RNAs using only a few picomoles of starting material. PMID:19264806
Prospecting for Novel Plant-Derived Molecules of Rauvolfia serpentina as Inhibitors of Aldose Reductase, a Potent Drug Target for Diabetes and Its Complications

PubMed Central

Pathania, Shivalika; Randhawa, Vinay; Bagler, Ganesh

2013-01-01

Aldose Reductase (AR) is implicated in the development of secondary complications of diabetes, providing an interesting target for therapeutic intervention. Extracts of Rauvolfia serpentina, a medicinal plant endemic to the Himalayan mountain range, have been known to be effective in alleviating diabetes and its complications. In this study, we aim to prospect for novel plant-derived inhibitors from R. serpentina and to understand structural basis of their interactions. An extensive library of R. serpentina molecules was compiled and computationally screened for inhibitory action against AR. The stability of complexes, with docked leads, was verified using molecular dynamics simulations. Two structurally distinct plant-derived leads were identified as inhibitors: indobine and indobinine. Further, using these two leads as templates, 16 more leads were identified through ligand-based screening of their structural analogs, from a small molecules database. Thus, we obtained plant-derived indole alkaloids, and their structural analogs, as potential AR inhibitors from a manually curated dataset of R. serpentina molecules. Indole alkaloids reported herein, as a novel structural class unreported hitherto, may provide better insights for designing potential AR inhibitors with improved efficacy and fewer side effects. PMID:23613832
Prospecting for novel plant-derived molecules of Rauvolfia serpentina as inhibitors of Aldose Reductase, a potent drug target for diabetes and its complications.

PubMed

Pathania, Shivalika; Randhawa, Vinay; Bagler, Ganesh

2013-01-01

Aldose Reductase (AR) is implicated in the development of secondary complications of diabetes, providing an interesting target for therapeutic intervention. Extracts of Rauvolfia serpentina, a medicinal plant endemic to the Himalayan mountain range, have been known to be effective in alleviating diabetes and its complications. In this study, we aim to prospect for novel plant-derived inhibitors from R. serpentina and to understand structural basis of their interactions. An extensive library of R. serpentina molecules was compiled and computationally screened for inhibitory action against AR. The stability of complexes, with docked leads, was verified using molecular dynamics simulations. Two structurally distinct plant-derived leads were identified as inhibitors: indobine and indobinine. Further, using these two leads as templates, 16 more leads were identified through ligand-based screening of their structural analogs, from a small molecules database. Thus, we obtained plant-derived indole alkaloids, and their structural analogs, as potential AR inhibitors from a manually curated dataset of R. serpentina molecules. Indole alkaloids reported herein, as a novel structural class unreported hitherto, may provide better insights for designing potential AR inhibitors with improved efficacy and fewer side effects.
Correcting ligands, metabolites, and pathways

PubMed Central

Ott, Martin A; Vriend, Gert

2006-01-01

Background A wide range of research areas in bioinformatics, molecular biology and medicinal chemistry require precise chemical structure information about molecules and reactions, e.g. drug design, ligand docking, metabolic network reconstruction, and systems biology. Most available databases, however, treat chemical structures more as illustrations than as a datafield in its own right. Lack of chemical accuracy impedes progress in the areas mentioned above. We present a database of metabolites called BioMeta that augments the existing pathway databases by explicitly assessing the validity, correctness, and completeness of chemical structure and reaction information. Description The main bulk of the data in BioMeta were obtained from the KEGG Ligand database. We developed a tool for chemical structure validation which assesses the chemical validity and stereochemical completeness of a molecule description. The validation tool was used to examine the compounds in BioMeta, showing that a relatively small number of compounds had an incorrect constitution (connectivity only, not considering stereochemistry) and that a considerable number (about one third) had incomplete or even incorrect stereochemistry. We made a large effort to correct the errors and to complete the structural descriptions. A total of 1468 structures were corrected and/or completed. We also established the reaction balance of the reactions in BioMeta and corrected 55% of the unbalanced (stoichiometrically incorrect) reactions in an automatic procedure. The BioMeta database was implemented in PostgreSQL and provided with a web-based interface. Conclusion We demonstrate that the validation of metabolite structures and reactions is a feasible and worthwhile undertaking, and that the validation results can be used to trigger corrections and improvements to BioMeta, our metabolite database. BioMeta provides some tools for rational drug design, reaction searches, and visualization. It is freely available at provided that the copyright notice of all original data is cited. The database will be useful for querying and browsing biochemical pathways, and to obtain reference information for identifying compounds. However, these applications require that the underlying data be correct, and that is the focus of BioMeta. PMID:17132165
MOSAIC: a chemical-genetic interaction data repository and web resource for exploring chemical modes of action.

PubMed

Nelson, Justin; Simpkins, Scott W; Safizadeh, Hamid; Li, Sheena C; Piotrowski, Jeff S; Hirano, Hiroyuki; Yashiroda, Yoko; Osada, Hiroyuki; Yoshida, Minoru; Boone, Charles; Myers, Chad L

2018-04-01

Chemical-genomic approaches that map interactions between small molecules and genetic perturbations offer a promising strategy for functional annotation of uncharacterized bioactive compounds. We recently developed a new high-throughput platform for mapping chemical-genetic (CG) interactions in yeast that can be scaled to screen large compound collections, and we applied this system to generate CG interaction profiles for more than 13 000 compounds. When integrated with the existing global yeast genetic interaction network, CG interaction profiles can enable mode-of-action prediction for previously uncharacterized compounds as well as discover unexpected secondary effects for known drugs. To facilitate future analysis of these valuable data, we developed a public database and web interface named MOSAIC. The website provides a convenient interface for querying compounds, bioprocesses (Gene Ontology terms) and genes for CG information including direct CG interactions, bioprocesses and gene-level target predictions. MOSAIC also provides access to chemical structure information of screened molecules, chemical-genomic profiles and the ability to search for compounds sharing structural and functional similarity. This resource will be of interest to chemical biologists for discovering new small molecule probes with specific modes-of-action as well as computational biologists interested in analysing CG interaction networks. MOSAIC is available at http://mosaic.cs.umn.edu. hisyo@riken.jp, yoshidam@riken.jp, charlie.boone@utoronto.ca or chadm@umn.edu. Supplementary data are available at Bioinformatics online.
Spectroscopy for Industrial Applications: High-Temperature Processes

NASA Astrophysics Data System (ADS)

Fateev, Alexander; Grosch, Helge; Clausen, Sonnik; Barton, Emma J.; Yurchenko, Sergei N.; Tennyson, Jonathan

2014-06-01

The continuous development of the spectroscopic databases brings new perspectives in the environmental and industrial on-line process control, monitoring and stimulates further optical sensor developments. This is because no calibration gases are needed and, in general, temperature-dependent spectral absorption features gases of interest for a specific instrument can in principle be calculated by knowing only the gas temperature and pressure in the process under investigation/monitoring. The latest HITRAN-2012 database contains IR/UV spectral data for 47 molecules and it is still growing. However use of HITRAN is limited to low-temperature processes (< 400 K) and therefor can be used for absorption spectra calculations at limited temperature/pressure ranges. For higher temperatures, the HITEMP-2010 database is available. Only a few molecules CO2, H2O, CO and NO are those of interest for e.g. various combustion and astronomical applications are included. In the recent few years, several efforts towards a development of hot line lists have been made; those have been implemented in the latest HITRAN2012 database1. High-resolution absorption measurements of NH3 (IR, 0.1 cm-1) and phenol (UV, 0.019 nm) on a flow gas cell2 up to 800 K are presented. Molecules are of great interest in various high-temperature environments including exoplanets, combustion and gasification. Measured NH3 hot lines have been assigned and spectra have been compared with that obtained by calculations based on the BYTe hot line list1. High-temperature NH3 absorption spectra have been used in the analysis of in situ high-resolution IR absorption measurements on the producer gas in low-temperature gasification process on a large scale. High-resolution UV temperature-dependent absorption cross-sections of phenol are reported for the first time. All UV data have been calibrated by relevant GC/MS measurements. Use of the data is demonstrated by the analysis of in situ UV absorption measurements on a small-scale low-temperature gasifier. A comparison between in situ, gas extraction and conventional gas sampling measurements is presented. Overall the presentation shows an example of successful industrial and academic partnerships within the framework of national and international ongoing projects.
SerpentinaDB: a database of plant-derived molecules of Rauvolfia serpentina.

PubMed

Pathania, Shivalika; Ramakrishnan, Sai Mukund; Randhawa, Vinay; Bagler, Ganesh

2015-08-04

Plant-derived molecules (PDMs) are known to be a rich source of diverse scaffolds that could serve as a basis for rational drug design. Structured compilation of phytochemicals from traditional medicinal plants can facilitate prospection for novel PDMs and their analogs as therapeutic agents. Rauvolfia serpentina is an important medicinal plant, endemic to Himalayan mountain ranges of Indian subcontinent, reported to be of immense therapeutic value against various diseases. We present SerpentinaDB, a structured compilation of 147 R. serpentina PDMs, inclusive of their plant part source, chemical classification, IUPAC, SMILES, physicochemical properties, and 3D chemical structures with associated references. It also provides refined search option for identification of analogs of natural molecules against ZINC database at user-defined cut-off. SerpentinaDB is an exhaustive resource of R. serpentina molecules facilitating prospection for therapeutic molecules from a medicinally important source of natural products. It also provides refined search option to explore the neighborhood of chemical space against ZINC database to identify analogs of natural molecules obtained as leads. In a previous study, we have demonstrated the utility of this resource by identifying novel aldose reductase inhibitors towards intervention of complications of diabetes.
PDB-wide collection of binding data: current status of the PDBbind database.

PubMed

Liu, Zhihai; Li, Yan; Han, Li; Li, Jie; Liu, Jie; Zhao, Zhixiong; Nie, Wei; Liu, Yuchen; Wang, Renxiao

2015-02-01

Molecular recognition between biological macromolecules and organic small molecules plays an important role in various life processes. Both structural information and binding data of biomolecular complexes are indispensable for depicting the underlying mechanism in such an event. The PDBbind database was created to collect experimentally measured binding data for the biomolecular complexes throughout the Protein Data Bank (PDB). It thus provides the linkage between structural information and energetic properties of biomolecular complexes, which is especially desirable for computational studies or statistical analyses. Since its first public release in 2004, the PDBbind database has been updated on an annual basis. The latest release (version 2013) provides experimental binding affinity data for 10,776 biomolecular complexes in PDB, including 8302 protein-ligand complexes and 2474 other types of complexes. In this article, we will describe the current methods used for compiling PDBbind and the updated status of this database. We will also review some typical applications of PDBbind published in the scientific literature. All contents of this database are freely accessible at the PDBbind-CN Web server at http://www.pdbbind-cn.org/. wangrx@mail.sioc.ac.cn. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Plant Reactome: a resource for plant pathways and comparative analysis.

PubMed

Naithani, Sushma; Preece, Justin; D'Eustachio, Peter; Gupta, Parul; Amarasinghe, Vindhya; Dharmawardhana, Palitha D; Wu, Guanming; Fabregat, Antonio; Elser, Justin L; Weiser, Joel; Keays, Maria; Fuentes, Alfonso Munoz-Pomer; Petryszak, Robert; Stein, Lincoln D; Ware, Doreen; Jaiswal, Pankaj

2017-01-04

Plant Reactome (http://plantreactome.gramene.org/) is a free, open-source, curated plant pathway database portal, provided as part of the Gramene project. The database provides intuitive bioinformatics tools for the visualization, analysis and interpretation of pathway knowledge to support genome annotation, genome analysis, modeling, systems biology, basic research and education. Plant Reactome employs the structural framework of a plant cell to show metabolic, transport, genetic, developmental and signaling pathways. We manually curate molecular details of pathways in these domains for reference species Oryza sativa (rice) supported by published literature and annotation of well-characterized genes. Two hundred twenty-two rice pathways, 1025 reactions associated with 1173 proteins, 907 small molecules and 256 literature references have been curated to date. These reference annotations were used to project pathways for 62 model, crop and evolutionarily significant plant species based on gene homology. Database users can search and browse various components of the database, visualize curated baseline expression of pathway-associated genes provided by the Expression Atlas and upload and analyze their Omics datasets. The database also offers data access via Application Programming Interfaces (APIs) and in various standardized pathway formats, such as SBML and BioPAX. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
ASDB: a resource for probing protein functions with small molecules.

PubMed

Liu, Zhihong; Ding, Peng; Yan, Xin; Zheng, Minghao; Zhou, Huihao; Xu, Yuehua; Du, Yunfei; Gu, Qiong; Xu, Jun

2016-06-01

: Identifying chemical probes or seeking scaffolds for a specific biological target is important for protein function studies. Therefore, we create the Annotated Scaffold Database (ASDB), a computer-readable and systematic target-annotated scaffold database, to serve such needs. The scaffolds in ASDB were derived from public databases including ChEMBL, DrugBank and TCMSP, with a scaffold-based classification approach. Each scaffold was assigned with an InChIKey as its unique identifier, energy-minimized 3D conformations, and other calculated properties. A scaffold is also associated with drugs, natural products, drug targets and medical indications. The database can be retrieved through text or structure query tools. ASDB collects 333 601 scaffolds, which are associated with 4368 targets. The scaffolds consist of 3032 scaffolds derived from drugs and 5163 scaffolds derived from natural products. For given scaffolds, scaffold-target networks can be generated from the database to demonstrate the relations of scaffolds and targets. ASDB is freely available at http://www.rcdd.org.cn/asdb/with the major web browsers. junxu@biochemomes.com or xujun9@mail.sysu.edu.cn Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Interspecies scaling and prediction of human clearance: comparison of small- and macro-molecule drugs

PubMed Central

Huh, Yeamin; Smith, David E.; Feng, Meihau Rose

2014-01-01

Human clearance prediction for small- and macro-molecule drugs was evaluated and compared using various scaling methods and statistical analysis.Human clearance is generally well predicted using single or multiple species simple allometry for macro- and small-molecule drugs excreted renally.The prediction error is higher for hepatically eliminated small-molecules using single or multiple species simple allometry scaling, and it appears that the prediction error is mainly associated with drugs with low hepatic extraction ratio (Eh). The error in human clearance prediction for hepatically eliminated small-molecules was reduced using scaling methods with a correction of maximum life span (MLP) or brain weight (BRW).Human clearance of both small- and macro-molecule drugs is well predicted using the monkey liver blood flow method. Predictions using liver blood flow from other species did not work as well, especially for the small-molecule drugs. PMID:21892879
Using the QCM Biosensor-Based T7 Phage Display Combined with Bioinformatics Analysis for Target Identification of Bioactive Small Molecule.

PubMed

Takakusagi, Yoichi; Takakusagi, Kaori; Sugawara, Fumio; Sakaguchi, Kengo

2018-01-01

Identification of target proteins that directly bind to bioactive small molecule is of great interest in terms of clarifying the mode of action of the small molecule as well as elucidating the biological phenomena at the molecular level. Of the experimental technologies available, T7 phage display allows comprehensive screening of small molecule-recognizing amino acid sequence from the peptide libraries displayed on the T7 phage capsid. Here, we describe the T7 phage display strategy that is combined with quartz-crystal microbalance (QCM) biosensor for affinity selection platform and bioinformatics analysis for small molecule-recognizing short peptides. This method dramatically enhances efficacy and throughput of the screening for small molecule-recognizing amino acid sequences without repeated rounds of selection. Subsequent execution of bioinformatics programs allows combinatorial and comprehensive target protein discovery of small molecules with its binding site, regardless of protein sample insolubility, instability, or inaccessibility of the fixed small molecules to internally located binding site on larger target proteins when conventional proteomics approaches are used.
Drug development for breast, colorectal, and non-small cell lung cancers from 1979 to 2014.

PubMed

Nixon, Nancy A; Khan, Omar F; Imam, Hasiba; Tang, Patricia A; Monzon, Jose; Li, Haocheng; Sun, Gavin; Ezeife, Doreen; Parimi, Sunil; Dowden, Scot; Tam, Vincent C

2017-12-01

Understanding the drug development pathway is critical for streamlining the development of effective cancer treatments. The objective of the current study was to delineate the drug development timeline and attrition rate of different drug classes for common cancer disease sites. Drugs entering clinical trials for breast, colorectal, and non-small cell lung cancer were identified using a pharmaceutical business intelligence database. Data regarding drug characteristics, clinical trials, and approval dates were obtained from the database, clinical trial registries, PubMed, and regulatory Web sites. A total of 411 drugs met the inclusion criteria for breast cancer, 246 drugs met the inclusion criteria for colorectal cancer, and 315 drugs met the inclusion criteria for non-small cell lung cancer. Attrition rates were 83.9% for breast cancer, 87.0% for colorectal cancer, and 92.0% for non-small cell lung cancer drugs. In the case of non-small cell lung cancer, there was a trend toward higher attrition rates for targeted monoclonal antibodies compared with other agents. No tumor site-specific differences were noted with regard to cytotoxic chemotherapy, immunomodulatory, or small molecule kinase inhibitor drugs. Drugs classified as "others" in breast cancer had lower attrition rates, primarily due to the higher success of hormonal medications. Mean drug development times were 8.9 years for breast cancer, 6.7 years for colorectal cancer, and 6.6 years for non-small cell lung cancer. Overall oncologic drug attrition rates remain high, and drugs are more likely to fail in later-stage clinical trials. The refinement of early-phase trial design may permit the selection of drugs that are more likely to succeed in the phase 3 setting. Cancer 2017;123:4672-4679. © 2017 American Cancer Society. © 2017 American Cancer Society.
Facilities for small-molecule crystallography at synchrotron sources.

PubMed

Barnett, Sarah A; Nowell, Harriott; Warren, Mark R; Wilcox, Andrian; Allan, David R

2016-01-01

Although macromolecular crystallography is a widely supported technique at synchrotron radiation facilities throughout the world, there are, in comparison, only very few beamlines dedicated to small-molecule crystallography. This limited provision is despite the increasing demand for beamtime from the chemical crystallography community and the ever greater overlap between systems that can be classed as either small macromolecules or large small molecules. In this article, a very brief overview of beamlines that support small-molecule single-crystal diffraction techniques will be given along with a more detailed description of beamline I19, a dedicated facility for small-molecule crystallography at Diamond Light Source.
Arabidopsis Hormone Database: a comprehensive genetic and phenotypic information database for plant hormone research in Arabidopsis

PubMed Central

Peng, Zhi-yu; Zhou, Xin; Li, Linchuan; Yu, Xiangchun; Li, Hongjiang; Jiang, Zhiqiang; Cao, Guangyu; Bai, Mingyi; Wang, Xingchun; Jiang, Caifu; Lu, Haibin; Hou, Xianhui; Qu, Lijia; Wang, Zhiyong; Zuo, Jianru; Fu, Xiangdong; Su, Zhen; Li, Songgang; Guo, Hongwei

2009-01-01

Plant hormones are small organic molecules that influence almost every aspect of plant growth and development. Genetic and molecular studies have revealed a large number of genes that are involved in responses to numerous plant hormones, including auxin, gibberellin, cytokinin, abscisic acid, ethylene, jasmonic acid, salicylic acid, and brassinosteroid. Here, we develop an Arabidopsis hormone database, which aims to provide a systematic and comprehensive view of genes participating in plant hormonal regulation, as well as morphological phenotypes controlled by plant hormones. Based on data from mutant studies, transgenic analysis and gene ontology (GO) annotation, we have identified a total of 1026 genes in the Arabidopsis genome that participate in plant hormone functions. Meanwhile, a phenotype ontology is developed to precisely describe myriad hormone-regulated morphological processes with standardized vocabularies. A web interface (http://ahd.cbi.pku.edu.cn) would allow users to quickly get access to information about these hormone-related genes, including sequences, functional category, mutant information, phenotypic description, microarray data and linked publications. Several applications of this database in studying plant hormonal regulation and hormone cross-talk will be presented and discussed. PMID:19015126
Arabidopsis Hormone Database: a comprehensive genetic and phenotypic information database for plant hormone research in Arabidopsis.

PubMed

Peng, Zhi-yu; Zhou, Xin; Li, Linchuan; Yu, Xiangchun; Li, Hongjiang; Jiang, Zhiqiang; Cao, Guangyu; Bai, Mingyi; Wang, Xingchun; Jiang, Caifu; Lu, Haibin; Hou, Xianhui; Qu, Lijia; Wang, Zhiyong; Zuo, Jianru; Fu, Xiangdong; Su, Zhen; Li, Songgang; Guo, Hongwei

2009-01-01

Plant hormones are small organic molecules that influence almost every aspect of plant growth and development. Genetic and molecular studies have revealed a large number of genes that are involved in responses to numerous plant hormones, including auxin, gibberellin, cytokinin, abscisic acid, ethylene, jasmonic acid, salicylic acid, and brassinosteroid. Here, we develop an Arabidopsis hormone database, which aims to provide a systematic and comprehensive view of genes participating in plant hormonal regulation, as well as morphological phenotypes controlled by plant hormones. Based on data from mutant studies, transgenic analysis and gene ontology (GO) annotation, we have identified a total of 1026 genes in the Arabidopsis genome that participate in plant hormone functions. Meanwhile, a phenotype ontology is developed to precisely describe myriad hormone-regulated morphological processes with standardized vocabularies. A web interface (http://ahd.cbi.pku.edu.cn) would allow users to quickly get access to information about these hormone-related genes, including sequences, functional category, mutant information, phenotypic description, microarray data and linked publications. Several applications of this database in studying plant hormonal regulation and hormone cross-talk will be presented and discussed.
Systems Based Study of the Therapeutic Potential of Small Charged Molecules for the Inhibition of IL-1 Mediated Cartilage Degradation

PubMed Central

Kar, Saptarshi; Smith, David W.; Gardiner, Bruce S.; Grodzinsky, Alan J.

2016-01-01

Inflammatory cytokines are key drivers of cartilage degradation in post-traumatic osteoarthritis. Cartilage degradation mediated by these inflammatory cytokines has been extensively investigated using in vitro experimental systems. Based on one such study, we have developed a computational model to quantitatively assess the impact of charged small molecules intended to inhibit IL-1 mediated cartilage degradation. We primarily focus on the simplest possible computational model of small molecular interaction with the IL-1 system—direct binding of the small molecule to the active site on the IL-1 molecule itself. We first use the model to explore the uptake and release kinetics of the small molecule inhibitor by cartilage tissue. Our results show that negatively charged small molecules are excluded from the negatively charged cartilage tissue and have uptake kinetics in the order of hours. In contrast, the positively charged small molecules are drawn into the cartilage with uptake and release timescales ranging from hours to days. Using our calibrated computational model, we subsequently explore the effect of small molecule charge and binding constant on the rate of cartilage degradation. The results from this analysis indicate that the small molecules are most effective in inhibiting cartilage degradation if they are either positively charged and/or bind strongly to IL-1α, or both. Furthermore, our results showed that the cartilage structural homeostasis can be restored by the small molecule if administered within six days following initial tissue exposure to IL-1α. We finally extended the scope of the computational model by simulating the competitive inhibition of cartilage degradation by the small molecule. Results from this model show that small molecules are more efficient in inhibiting cartilage degradation by binding directly to IL-1α rather than binding to IL-1α receptors. The results from this study can be used as a template for the design and development of more pharmacologically effective osteoarthritis drugs, and to investigate possible therapeutic options. PMID:27977731
Antibody-enabled small-molecule drug discovery.

PubMed

Lawson, Alastair D G

2012-06-29

Although antibody-based therapeutics have become firmly established as medicines for serious diseases, the value of antibodies as tools in the early stages of small-molecule drug discovery is only beginning to be realized. In particular, antibodies may provide information to reduce risk in small-molecule drug discovery by enabling the validation of targets and by providing insights into the design of small-molecule screening assays. Moreover, antibodies can act as guides in the quest for small molecules that have the ability to modulate protein-protein interactions, which have traditionally only been considered to be tractable targets for biological drugs. The development of small molecules that have similar therapeutic effects to current biologics has the potential to benefit a broader range of patients at earlier stages of disease.

The origin of intermediary metabolism

NASA Technical Reports Server (NTRS)

Morowitz, H. J.; Kostelnik, J. D.; Yang, J.; Cody, G. D.

2000-01-01

The core of intermediary metabolism in autotrophs is the citric acid cycle. In a certain group of chemoautotrophs, the reductive citric acid cycle is an engine of synthesis, taking in CO(2) and synthesizing the molecules of the cycle. We have examined the chemistry of a model system of C, H, and O that starts with carbon dioxide and reductants and uses redox couples as the energy source. To inquire into the reaction networks that might emerge, we start with the largest available database of organic molecules, Beilstein on-line, and prune by a set of physical and chemical constraints applicable to the model system. From the 3.5 million entries in Beilstein we emerge with 153 molecules that contain all 11 members of the reductive citric acid cycle. A small number of selection rules generates a very constrained subset, suggesting that this is the type of reaction model that will prove useful in the study of biogenesis. The model indicates that the metabolism shown in the universal chart of pathways may be central to the origin of life, is emergent from organic chemistry, and may be unique.
MMpI: A WideRange of Available Compounds of Matrix Metalloproteinase Inhibitors

PubMed Central

Muvva, Charuvaka; Patra, Sanjukta; Venkatesan, Subramanian

2016-01-01

Matrix metalloproteinases (MMPs) are a family of zinc-dependent proteinases involved in the regulation of the extracellular signaling and structural matrix environment of cells and tissues. MMPs are considered as promising targets for the treatment of many diseases. Therefore, creation of database on the inhibitors of MMP would definitely accelerate the research activities in this area due to its implication in above-mentioned diseases and associated limitations in the first and second generation inhibitors. In this communication, we report the development of a new MMpI database which provides resourceful information for all researchers working in this field. It is a web-accessible, unique resource that contains detailed information on the inhibitors of MMP including small molecules, peptides and MMP Drug Leads. The database contains entries of ~3000 inhibitors including ~72 MMP Drug Leads and ~73 peptide based inhibitors. This database provides the detailed molecular and structural details which are necessary for the drug discovery and development. The MMpI database contains physical properties, 2D and 3D structures (mol2 and pdb format files) of inhibitors of MMP. Other data fields are hyperlinked to PubChem, ChEMBL, BindingDB, DrugBank, PDB, MEROPS and PubMed. The database has extensive searching facility with MMpI ID, IUPAC name, chemical structure and with the title of research article. The MMP inhibitors provided in MMpI database are optimized using Python-based Hierarchical Environment for Integrated Xtallography (Phenix) software. MMpI Database is unique and it is the only public database that contains and provides the complete information on the inhibitors of MMP. Database URL: http://clri.res.in/subramanian/databases/mmpi/index.php. PMID:27509041
Sequence-structure relationships in RNA loops: establishing the basis for loop homology modeling.

PubMed

Schudoma, Christian; May, Patrick; Nikiforova, Viktoria; Walther, Dirk

2010-01-01

The specific function of RNA molecules frequently resides in their seemingly unstructured loop regions. We performed a systematic analysis of RNA loops extracted from experimentally determined three-dimensional structures of RNA molecules. A comprehensive loop-structure data set was created and organized into distinct clusters based on structural and sequence similarity. We detected clear evidence of the hallmark of homology present in the sequence-structure relationships in loops. Loops differing by <25% in sequence identity fold into very similar structures. Thus, our results support the application of homology modeling for RNA loop model building. We established a threshold that may guide the sequence divergence-based selection of template structures for RNA loop homology modeling. Of all possible sequences that are, under the assumption of isosteric relationships, theoretically compatible with actual sequences observed in RNA structures, only a small fraction is contained in the Rfam database of RNA sequences and classes implying that the actual RNA loop space may consist of a limited number of unique loop structures and conserved sequences. The loop-structure data sets are made available via an online database, RLooM. RLooM also offers functionalities for the modeling of RNA loop structures in support of RNA engineering and design efforts.
Small Molecule Chemical Probes of MicroRNA Function

PubMed Central

Velagapudi, Sai Pradeep; Vummidi, Balayeshwanth R.; Disney, Matthew D.

2015-01-01

MicroRNAs (miRNAs) are small, non-coding RNAs that control protein expression. Aberrant miRNA expression has been linked to various human diseases, and thus miRNAs have been explored as diagnostic markers and therapeutic targets. Although it is challenging to target RNA with small molecules in general, there have been successful campaigns that have identified small molecule modulators of miRNA function by targeting various pathways. For example, small molecules that modulate transcription and target nuclease processing sites in miRNA precursors have been identified. Herein, we describe challenges in developing chemical probes that target miRNAs and highlight aspects of miRNA cellular biology elucidated by using small molecule chemical probes. We expect that this area will expand dramatically in the near future as strides are made to understand small molecule recognition of RNA from a fundamental perspective. PMID:25500006
Medium-Bandgap Small-Molecule Donors Compatible with Both Fullerene and Nonfullerene Acceptors.

PubMed

Huo, Yong; Yan, Cenqi; Kan, Bin; Liu, Xiao-Fei; Chen, Li-Chuan; Hu, Chen-Xia; Lau, Tsz-Ki; Lu, Xinhui; Sun, Chun-Lin; Shao, Xiangfeng; Chen, Yongsheng; Zhan, Xiaowei; Zhang, Hao-Li

2018-03-21

Much effort has been devoted to the development of new donor materials for small-molecule organic solar cells due to their inherent advantages of well-defined molecular weight, easy purification, and good reproducibility in photovoltaic performance. Herein, we report two small-molecule donors that are compatible with both fullerene and nonfullerene acceptors. Both molecules consist of an (E)-1,2-di(thiophen-2-yl)ethane-substituted (TVT-substituted) benzo[1,2-b:4,5-b']dithiophene (BDT) as the central unit, and two rhodanine units as the terminal electron-withdrawing groups. The central units are modified with either alkyl side chains (DRBDT-TVT) or alkylthio side chains (DRBDT-STVT). Both molecules exhibit a medium bandgap with complementary absorption and proper energy level offset with typical acceptors like PC 71 BM and IDIC. The optimized devices show a decent power conversion efficiency (PCE) of 6.87% for small-molecule organic solar cells and 6.63% for nonfullerene all small-molecule organic solar cells. Our results reveal that rationally designed medium-bandgap small-molecule donors can be applied in high-performance small-molecule organic solar cells with different types of acceptors.
The Cambridge Structural Database: a quarter of a million crystal structures and rising.

PubMed

Allen, Frank H

2002-06-01

The Cambridge Structural Database (CSD) now contains data for more than a quarter of a million small-molecule crystal structures. The information content of the CSD, together with methods for data acquisition, processing and validation, are summarized, with particular emphasis on the chemical information added by CSD editors. Nearly 80% of new structural data arrives electronically, mostly in CIF format, and the CCDC acts as the official crystal structure data depository for 51 major journals. The CCDC now maintains both a CIF archive (more than 73,000 CIFs dating from 1996), as well as the distributed binary CSD archive; the availability of data in both archives is discussed. A statistical survey of the CSD is also presented and projections concerning future accession rates indicate that the CSD will contain at least 500,000 crystal structures by the year 2010.
Identification and Correction of Additive and Multiplicative Spatial Biases in Experimental High-Throughput Screening.

PubMed

Mazoure, Bogdan; Caraus, Iurie; Nadon, Robert; Makarenkov, Vladimir

2018-06-01

Data generated by high-throughput screening (HTS) technologies are prone to spatial bias. Traditionally, bias correction methods used in HTS assume either a simple additive or, more recently, a simple multiplicative spatial bias model. These models do not, however, always provide an accurate correction of measurements in wells located at the intersection of rows and columns affected by spatial bias. The measurements in these wells depend on the nature of interaction between the involved biases. Here, we propose two novel additive and two novel multiplicative spatial bias models accounting for different types of bias interactions. We describe a statistical procedure that allows for detecting and removing different types of additive and multiplicative spatial biases from multiwell plates. We show how this procedure can be applied by analyzing data generated by the four HTS technologies (homogeneous, microorganism, cell-based, and gene expression HTS), the three high-content screening (HCS) technologies (area, intensity, and cell-count HCS), and the only small-molecule microarray technology available in the ChemBank small-molecule screening database. The proposed methods are included in the AssayCorrector program, implemented in R, and available on CRAN.
Mapping the Small Molecule Interactome by Mass Spectrometry.

PubMed

Flaxman, Hope A; Woo, Christina M

2018-01-16

Mapping small molecule interactions throughout the proteome provides the critical structural basis for functional analysis of their impact on biochemistry. However, translation of mass spectrometry-based proteomics methods to directly profile the interaction between a small molecule and the whole proteome is challenging because of the substoichiometric nature of many interactions, the diversity of covalent and noncovalent interactions involved, and the subsequent computational complexity associated with their spectral assignment. Recent advances in chemical proteomics have begun fill this gap to provide a structural basis for the breadth of small molecule-protein interactions in the whole proteome. Innovations enabling direct characterization of the small molecule interactome include faster, more sensitive instrumentation coupled to chemical conjugation, enrichment, and labeling methods that facilitate detection and assignment. These methods have started to measure molecular interaction hotspots due to inherent differences in local amino acid reactivity and binding affinity throughout the proteome. Measurement of the small molecule interactome is producing structural insights and methods for probing and engineering protein biochemistry. Direct structural characterization of the small molecule interactome is a rapidly emerging area pushing new frontiers in biochemistry at the interface of small molecules and the proteome.
ChemProt-2.0: visual navigation in a disease chemical biology database

PubMed Central

Kim Kjærulff, Sonny; Wich, Louis; Kringelum, Jens; Jacobsen, Ulrik P.; Kouskoumvekaki, Irene; Audouze, Karine; Lund, Ole; Brunak, Søren; Oprea, Tudor I.; Taboureau, Olivier

2013-01-01

ChemProt-2.0 (http://www.cbs.dtu.dk/services/ChemProt-2.0) is a public available compilation of multiple chemical–protein annotation resources integrated with diseases and clinical outcomes information. The database has been updated to >1.15 million compounds with 5.32 millions bioactivity measurements for 15 290 proteins. Each protein is linked to quality-scored human protein–protein interactions data based on more than half a million interactions, for studying diseases and biological outcomes (diseases, pathways and GO terms) through protein complexes. In ChemProt-2.0, therapeutic effects as well as adverse drug reactions have been integrated allowing for suggesting proteins associated to clinical outcomes. New chemical structure fingerprints were computed based on the similarity ensemble approach. Protein sequence similarity search was also integrated to evaluate the promiscuity of proteins, which can help in the prediction of off-target effects. Finally, the database was integrated into a visual interface that enables navigation of the pharmacological space for small molecules. Filtering options were included in order to facilitate and to guide dynamic search of specific queries. PMID:23185041
TDR Targets: a chemogenomics resource for neglected diseases.

PubMed

Magariños, María P; Carmona, Santiago J; Crowther, Gregory J; Ralph, Stuart A; Roos, David S; Shanmugam, Dhanasekaran; Van Voorhis, Wesley C; Agüero, Fernán

2012-01-01

The TDR Targets Database (http://tdrtargets.org) has been designed and developed as an online resource to facilitate the rapid identification and prioritization of molecular targets for drug development, focusing on pathogens responsible for neglected human diseases. The database integrates pathogen specific genomic information with functional data (e.g. expression, phylogeny, essentiality) for genes collected from various sources, including literature curation. This information can be browsed and queried using an extensive web interface with functionalities for combining, saving, exporting and sharing the query results. Target genes can be ranked and prioritized using numerical weights assigned to the criteria used for querying. In this report we describe recent updates to the TDR Targets database, including the addition of new genomes (specifically helminths), and integration of chemical structure, property and bioactivity information for biological ligands, drugs and inhibitors and cheminformatic tools for querying and visualizing these chemical data. These changes greatly facilitate exploration of linkages (both known and predicted) between genes and small molecules, yielding insight into whether particular proteins may be druggable, effectively allowing the navigation of chemical space in a genomics context.
TDR Targets: a chemogenomics resource for neglected diseases

PubMed Central

Magariños, María P.; Carmona, Santiago J.; Crowther, Gregory J.; Ralph, Stuart A.; Roos, David S.; Shanmugam, Dhanasekaran; Van Voorhis, Wesley C.; Agüero, Fernán

2012-01-01

The TDR Targets Database (http://tdrtargets.org) has been designed and developed as an online resource to facilitate the rapid identification and prioritization of molecular targets for drug development, focusing on pathogens responsible for neglected human diseases. The database integrates pathogen specific genomic information with functional data (e.g. expression, phylogeny, essentiality) for genes collected from various sources, including literature curation. This information can be browsed and queried using an extensive web interface with functionalities for combining, saving, exporting and sharing the query results. Target genes can be ranked and prioritized using numerical weights assigned to the criteria used for querying. In this report we describe recent updates to the TDR Targets database, including the addition of new genomes (specifically helminths), and integration of chemical structure, property and bioactivity information for biological ligands, drugs and inhibitors and cheminformatic tools for querying and visualizing these chemical data. These changes greatly facilitate exploration of linkages (both known and predicted) between genes and small molecules, yielding insight into whether particular proteins may be druggable, effectively allowing the navigation of chemical space in a genomics context. PMID:22116064
Connecting proteins with drug-like compounds: Open source drug discovery workflows with BindingDB and KNIME

PubMed Central

Berthold, Michael R.; Hedrick, Michael P.; Gilson, Michael K.

2015-01-01

Today’s large, public databases of protein–small molecule interaction data are creating important new opportunities for data mining and integration. At the same time, new graphical user interface-based workflow tools offer facile alternatives to custom scripting for informatics and data analysis. Here, we illustrate how the large protein-ligand database BindingDB may be incorporated into KNIME workflows as a step toward the integration of pharmacological data with broader biomolecular analyses. Thus, we describe a collection of KNIME workflows that access BindingDB data via RESTful webservices and, for more intensive queries, via a local distillation of the full BindingDB dataset. We focus in particular on the KNIME implementation of knowledge-based tools to generate informed hypotheses regarding protein targets of bioactive compounds, based on notions of chemical similarity. A number of variants of this basic approach are tested for seven existing drugs with relatively ill-defined therapeutic targets, leading to replication of some previously confirmed results and discovery of new, high-quality hits. Implications for future development are discussed. Database URL: www.bindingdb.org PMID:26384374
The European Bioinformatics Institute's data resources: towards systems biology.

PubMed

Brooksbank, Catherine; Cameron, Graham; Thornton, Janet

2005-01-01

Genomic and post-genomic biological research has provided fine-grain insights into the molecular processes of life, but also threatens to drown biomedical researchers in data. Moreover, as new high-throughput technologies are developed, the types of data that are gathered en masse are diversifying. The need to collect, store and curate all this information in ways that allow its efficient retrieval and exploitation is greater than ever. The European Bioinformatics Institute's (EBI's) databases and tools have evolved to meet the changing needs of molecular biologists: since we last wrote about our services in the 2003 issue of Nucleic Acids Research, we have launched new databases covering protein-protein interactions (IntAct), pathways (Reactome) and small molecules (ChEBI). Our existing core databases have continued to evolve to meet the changing needs of biomedical researchers, and we have developed new data-access tools that help biologists to move intuitively through the different data types, thereby helping them to put the parts together to understand biology at the systems level. The EBI's data resources are all available on our website at http://www.ebi.ac.uk.
The European Bioinformatics Institute's data resources: towards systems biology

PubMed Central

Brooksbank, Catherine; Cameron, Graham; Thornton, Janet

2005-01-01

Genomic and post-genomic biological research has provided fine-grain insights into the molecular processes of life, but also threatens to drown biomedical researchers in data. Moreover, as new high-throughput technologies are developed, the types of data that are gathered en masse are diversifying. The need to collect, store and curate all this information in ways that allow its efficient retrieval and exploitation is greater than ever. The European Bioinformatics Institute's (EBI's) databases and tools have evolved to meet the changing needs of molecular biologists: since we last wrote about our services in the 2003 issue of Nucleic Acids Research, we have launched new databases covering protein–protein interactions (IntAct), pathways (Reactome) and small molecules (ChEBI). Our existing core databases have continued to evolve to meet the changing needs of biomedical researchers, and we have developed new data-access tools that help biologists to move intuitively through the different data types, thereby helping them to put the parts together to understand biology at the systems level. The EBI's data resources are all available on our website at http://www.ebi.ac.uk. PMID:15608238
Virtual High-Throughput Screening To Identify Novel Activin Antagonists

PubMed Central

Zhu, Jie; Mishra, Rama K.; Schiltz, Gary E.; Makanji, Yogeshwar; Scheidt, Karl A.; Mazar, Andrew P.; Woodruff, Teresa K.

2015-01-01

Activin belongs to the TGFβ superfamily, which is associated with several disease conditions, including cancer-related cachexia, preterm labor with delivery, and osteoporosis. Targeting activin and its related signaling pathways holds promise as a therapeutic approach to these diseases. A small-molecule ligand-binding groove was identified in the interface between the two activin βA subunits and was used for a virtual high-throughput in silico screening of the ZINC database to identify hits. Thirty-nine compounds without significant toxicity were tested in two well-established activin assays: FSHβ transcription and HepG2 cell apoptosis. This screening workflow resulted in two lead compounds: NUCC-474 and NUCC-555. These potential activin antagonists were then shown to inhibit activin A-mediated cell proliferation in ex vivo ovary cultures. In vivo testing showed that our most potent compound (NUCC-555) caused a dose-dependent decrease in FSH levels in ovariectomized mice. The Blitz competition binding assay confirmed target binding of NUCC-555 to the activin A:ActRII that disrupts the activin A:ActRII complex’s binding with ALK4-ECD-Fc in a dose-dependent manner. The NUCC-555 also specifically binds to activin A compared with other TGFβ superfamily member myostatin (GDF8). These data demonstrate a new in silico-based strategy for identifying small-molecule activin antagonists. Our approach is the first to identify a first-in-class small-molecule antagonist of activin binding to ALK4, which opens a completely new approach to inhibiting the activity of TGFβ receptor superfamily members. in addition, the lead compound can serve as a starting point for lead optimization toward the goal of a compound that may be effective in activin-mediated diseases. PMID:26098096
Pressure for drug development in lysosomal storage disorders - a quantitative analysis thirty years beyond the US orphan drug act.

PubMed

Mechler, Konstantin; Mountford, William K; Hoffmann, Georg F; Ries, Markus

2015-04-18

Lysosomal storage disorders are a heterogeneous group of approximately 50 monogenically inherited orphan conditions. A defect leads to the storage of complex molecules in the lysosome, and patients develop a complex multisystemic phenotype of high morbidity often associated with premature death. More than 30 years ago the Orphan Drug Act of 1983 passed the United States legislation intended to facilitate the development of drugs for rare disorders. We directed our efforts in assessing which lysosomal diseases had drug development pressure and what distinguished those with successful development and approvals from diseases not treated or without orphan drug designation. Analysis of the FDA database for orphan drug designations through descriptive and comparative statistics. Between 1983 and 2013, fourteen drugs for seven conditions received FDA approval. Overall, orphan drug status was designated 70 times for 20 conditions. Approved therapies were enzyme replacement therapies (N = 10), substrate reduction therapies (N = 1), small molecules facilitating lysosomal substrate transportation (N = 3). FDA approval was significantly associated with a disease prevalence higher than 0.5/100,000 (p = 0.00742) and clinical development programs that did not require a primary neurological endpoint (p = 0.00059). Orphan drug status was designated for enzymes, modified enzymes, fusion proteins, chemical chaperones, small molecules leading to substrate reduction, or facilitating subcellular substrate transport, stem cells as well as gene therapies. Drug development focused on more common diseases. Primarily neurological diseases were neglected. Small clinical trials with either somatic or biomarker endpoints were successful. Enzyme replacement therapy was the most successful technology. Four factors played a key role in successful orphan drug development or orphan drug designations: 1) prevalence of disease 2) endpoints 3) regulatory precedent, and 4) technology platform. Successful development seeded further innovation.
Small molecule chemical probes of microRNA function.

PubMed

Velagapudi, Sai Pradeep; Vummidi, Balayeshwanth R; Disney, Matthew D

2015-02-01

MicroRNAs (miRNAs) are small, non-coding RNAs that control protein expression. Aberrant miRNA expression has been linked to various human diseases, and thus miRNAs have been explored as diagnostic markers and therapeutic targets. Although it is challenging to target RNA with small molecules in general, there have been successful campaigns that have identified small molecule modulators of miRNA function by targeting various pathways. For example, small molecules that modulate transcription and target nuclease processing sites in miRNA precursors have been identified. Herein, we describe challenges in developing chemical probes that target miRNAs and highlight aspects of miRNA cellular biology elucidated by using small molecule chemical probes. We expect that this area will expand dramatically in the near future as progress is made in understanding small molecule recognition of RNA. Copyright © 2014. Published by Elsevier Ltd.
Identifying key genes in rheumatoid arthritis by weighted gene co-expression network analysis.

PubMed

Ma, Chunhui; Lv, Qi; Teng, Songsong; Yu, Yinxian; Niu, Kerun; Yi, Chengqin

2017-08-01

This study aimed to identify rheumatoid arthritis (RA) related genes based on microarray data using the WGCNA (weighted gene co-expression network analysis) method. Two gene expression profile datasets GSE55235 (10 RA samples and 10 healthy controls) and GSE77298 (16 RA samples and seven healthy controls) were downloaded from Gene Expression Omnibus database. Characteristic genes were identified using metaDE package. WGCNA was used to find disease-related networks based on gene expression correlation coefficients, and module significance was defined as the average gene significance of all genes used to assess the correlation between the module and RA status. Genes in the disease-related gene co-expression network were subject to functional annotation and pathway enrichment analysis using Database for Annotation Visualization and Integrated Discovery. Characteristic genes were also mapped to the Connectivity Map to screen small molecules. A total of 599 characteristic genes were identified. For each dataset, characteristic genes in the green, red and turquoise modules were most closely associated with RA, with gene numbers of 54, 43 and 79, respectively. These genes were enriched in totally enriched in 17 Gene Ontology terms, mainly related to immune response (CD97, FYB, CXCL1, IKBKE, CCR1, etc.), inflammatory response (CD97, CXCL1, C3AR1, CCR1, LYZ, etc.) and homeostasis (C3AR1, CCR1, PLN, CCL19, PPT1, etc.). Two small-molecule drugs sanguinarine and papaverine were predicted to have a therapeutic effect against RA. Genes related to immune response, inflammatory response and homeostasis presumably have critical roles in RA pathogenesis. Sanguinarine and papaverine have a potential therapeutic effect against RA. © 2017 Asia Pacific League of Associations for Rheumatology and John Wiley & Sons Australia, Ltd.
An update on the Enzyme Portal: an integrative approach for exploring enzyme knowledge

PubMed Central

Onwubiko, J.; Zaru, R.; Rosanoff, S.; Antunes, R.; Bingley, M.; Watkins, X.; O'Donovan, C.; Martin, M. J.

2017-01-01

Abstract Enzymes are a key part of life processes and are increasingly important for various areas of research such as medicine, biotechnology, bioprocessing and drug research. The goal of the Enzyme Portal is to provide an interface to all European Bioinformatics Institute (EMBL-EBI) data about enzymes (de Matos, P., et al., (2013), BMC Bioinformatics, 14 (1), 103). These data include enzyme function, sequence features and family classification, protein structure, reactions, pathways, small molecules, diseases and the associated literature. The sources of enzyme data are: the UniProt Knowledgebase (UniProtKB) (UniProt Consortium, 2015), the Protein Data Bank in Europe (PDBe), (Valenkar, S., et al., Nucleic Acids Res.2016; 44, D385–D395) Rhea—a database of enzyme-catalysed reactions (Morgat, A., et al., Nucleic Acids Res. 2015; 43, D459-D464), Reactome—a database of biochemical pathways (Fabregat, A., et al., Nucleic Acids Res. 2016; 44, D481–D487), IntEnz—a resource with enzyme nomenclature information (Fleischmann, A., et al., Nucleic Acids Res. 2004 32, D434–D437) and ChEBI (Hastings, J., et al., Nucleic Acids Res. 2013) and ChEMBL (Bento, A. P., et al., Nucleic Acids Res. 201442, 1083–1090)—resources which contain information about small-molecule chemistry and bioactivity. This article describes the redesign of Enzyme Portal and the increased functionality added to maximise integration and interpretation of these data. Use case examples of the Enzyme Portal and the versatile workflows its supports are illustrated. We welcome the suggestion of new resources for integration. PMID:28158609
Design of a bioactive small molecule that targets the myotonic dystrophy type 1 RNA via an RNA motif-ligand database and chemical similarity searching.

PubMed

Parkesh, Raman; Childs-Disney, Jessica L; Nakamori, Masayuki; Kumar, Amit; Wang, Eric; Wang, Thomas; Hoskins, Jason; Tran, Tuan; Housman, David; Thornton, Charles A; Disney, Matthew D

2012-03-14

Myotonic dystrophy type 1 (DM1) is a triplet repeating disorder caused by expanded CTG repeats in the 3'-untranslated region of the dystrophia myotonica protein kinase (DMPK) gene. The transcribed repeats fold into an RNA hairpin with multiple copies of a 5'CUG/3'GUC motif that binds the RNA splicing regulator muscleblind-like 1 protein (MBNL1). Sequestration of MBNL1 by expanded r(CUG) repeats causes splicing defects in a subset of pre-mRNAs including the insulin receptor, the muscle-specific chloride ion channel, sarco(endo)plasmic reticulum Ca(2+) ATPase 1, and cardiac troponin T. Based on these observations, the development of small-molecule ligands that target specifically expanded DM1 repeats could be of use as therapeutics. In the present study, chemical similarity searching was employed to improve the efficacy of pentamidine and Hoechst 33258 ligands that have been shown previously to target the DM1 triplet repeat. A series of in vitro inhibitors of the RNA-protein complex were identified with low micromolar IC(50)'s, which are >20-fold more potent than the query compounds. Importantly, a bis-benzimidazole identified from the Hoechst query improves DM1-associated pre-mRNA splicing defects in cell and mouse models of DM1 (when dosed with 1 mM and 100 mg/kg, respectively). Since Hoechst 33258 was identified as a DM1 binder through analysis of an RNA motif-ligand database, these studies suggest that lead ligands targeting RNA with improved biological activity can be identified by using a synergistic approach that combines analysis of known RNA-ligand interactions with chemical similarity searching.

Design of a Bioactive Small Molecule that Targets the Myotonic Dystrophy Type 1 RNA Via an RNA Motif-Ligand Database & Chemical Similarity Searching

PubMed Central

Parkesh, Raman; Childs-Disney, Jessica L.; Nakamori, Masayuki; Kumar, Amit; Wang, Eric; Wang, Thomas; Hoskins, Jason; Tran, Tuan; Housman, David; Thornton, Charles A.; Disney, Matthew D.

2012-01-01

Myotonic dystrophy type 1 (DM1) is a triplet repeating disorder caused by expanded CTG repeats in the 3′ untranslated region of the dystrophia myotonica protein kinase (DMPK) gene. The transcribed repeats fold into an RNA hairpin with multiple copies of a 5′CUG/3′GUC motif that binds the RNA splicing regulator muscleblind-like 1 protein (MBNL1). Sequestration of MBNL1 by expanded r(CUG) repeats causes splicing defects in a subset of pre-mRNAs including the insulin receptor, the muscle-specific chloride ion channel, Sarco(endo)plasmic reticulum Ca2+ ATPase 1 (Serca1/Atp2a1), and cardiac troponin T (cTNT). Based on these observations, the development of small molecule ligands that target specifically expanded DM1 repeats could serve as therapeutics. In the present study, computational screening was employed to improve the efficacy of pentamidine and Hoechst 33258 ligands that have been shown previously to target the DM1 triplet repeat. A series of inhibitors of the RNA-protein complex with low micromolar IC50’s, which are >20-fold more potent than the query compounds, were identified. Importantly, a bis-benzimidazole identified from the Hoechst query improves DM1-associated pre-mRNA splicing defects in cell and mouse models of DM1 (when dosed with 1 mM and 100 mg/kg, respectively). Since Hoechst 33258 was identified as a DM1 binder through analysis of an RNA motif-ligand database, these studies suggest that lead ligands targeting RNA with improved biological activity can be identified by using a synergistic approach that combines analysis of known RNA-ligand interactions with virtual screening. PMID:22300544
An update on the Enzyme Portal: an integrative approach for exploring enzyme knowledge.

PubMed

Pundir, S; Onwubiko, J; Zaru, R; Rosanoff, S; Antunes, R; Bingley, M; Watkins, X; O'Donovan, C; Martin, M J

2017-03-01

Enzymes are a key part of life processes and are increasingly important for various areas of research such as medicine, biotechnology, bioprocessing and drug research. The goal of the Enzyme Portal is to provide an interface to all European Bioinformatics Institute (EMBL-EBI) data about enzymes (de Matos, P., et al. , (2013), BMC Bioinformatics , (1), 103). These data include enzyme function, sequence features and family classification, protein structure, reactions, pathways, small molecules, diseases and the associated literature. The sources of enzyme data are: the UniProt Knowledgebase (UniProtKB) (UniProt Consortium, 2015), the Protein Data Bank in Europe (PDBe), (Valenkar, S., et al ., Nucleic Acids Res. 2016; , D385-D395) Rhea-a database of enzyme-catalysed reactions (Morgat, A., et al ., Nucleic Acids Res. 2015; , D459-D464), Reactome-a database of biochemical pathways (Fabregat, A., et al ., Nucleic Acids Res. 2016; , D481-D487), IntEnz-a resource with enzyme nomenclature information (Fleischmann, A., et al ., Nucleic Acids Res. 2004 , D434-D437) and ChEBI (Hastings, J., et al ., Nucleic Acids Res. 2013) and ChEMBL (Bento, A. P., et al ., Nucleic Acids Res. 2014 , 1083-1090)-resources which contain information about small-molecule chemistry and bioactivity. This article describes the redesign of Enzyme Portal and the increased functionality added to maximise integration and interpretation of these data. Use case examples of the Enzyme Portal and the versatile workflows its supports are illustrated. We welcome the suggestion of new resources for integration. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com
DrugBank 3.0: a comprehensive resource for ‘Omics’ research on drugs

PubMed Central

Knox, Craig; Law, Vivian; Jewison, Timothy; Liu, Philip; Ly, Son; Frolkis, Alex; Pon, Allison; Banco, Kelly; Mak, Christine; Neveu, Vanessa; Djoumbou, Yannick; Eisner, Roman; Guo, An Chi; Wishart, David S.

2011-01-01

DrugBank (http://www.drugbank.ca) is a richly annotated database of drug and drug target information. It contains extensive data on the nomenclature, ontology, chemistry, structure, function, action, pharmacology, pharmacokinetics, metabolism and pharmaceutical properties of both small molecule and large molecule (biotech) drugs. It also contains comprehensive information on the target diseases, proteins, genes and organisms on which these drugs act. First released in 2006, DrugBank has become widely used by pharmacists, medicinal chemists, pharmaceutical researchers, clinicians, educators and the general public. Since its last update in 2008, DrugBank has been greatly expanded through the addition of new drugs, new targets and the inclusion of more than 40 new data fields per drug entry (a 40% increase in data ‘depth’). These data field additions include illustrated drug-action pathways, drug transporter data, drug metabolite data, pharmacogenomic data, adverse drug response data, ADMET data, pharmacokinetic data, computed property data and chemical classification data. DrugBank 3.0 also offers expanded database links, improved search tools for drug–drug and food–drug interaction, new resources for querying and viewing drug pathways and hundreds of new drug entries with detailed patent, pricing and manufacturer data. These additions have been complemented by enhancements to the quality and quantity of existing data, particularly with regard to drug target, drug description and drug action data. DrugBank 3.0 represents the result of 2 years of manual annotation work aimed at making the database much more useful for a wide range of ‘omics’ (i.e. pharmacogenomic, pharmacoproteomic, pharmacometabolomic and even pharmacoeconomic) applications. PMID:21059682
Defining RNA-Small Molecule Affinity Landscapes Enables Design of a Small Molecule Inhibitor of an Oncogenic Noncoding RNA.

PubMed

Velagapudi, Sai Pradeep; Luo, Yiling; Tran, Tuan; Haniff, Hafeez S; Nakai, Yoshio; Fallahi, Mohammad; Martinez, Gustavo J; Childs-Disney, Jessica L; Disney, Matthew D

2017-03-22

RNA drug targets are pervasive in cells, but methods to design small molecules that target them are sparse. Herein, we report a general approach to score the affinity and selectivity of RNA motif-small molecule interactions identified via selection. Named High Throughput Structure-Activity Relationships Through Sequencing (HiT-StARTS), HiT-StARTS is statistical in nature and compares input nucleic acid sequences to selected library members that bind a ligand via high throughput sequencing. The approach allowed facile definition of the fitness landscape of hundreds of thousands of RNA motif-small molecule binding partners. These results were mined against folded RNAs in the human transcriptome and identified an avid interaction between a small molecule and the Dicer nuclease-processing site in the oncogenic microRNA (miR)-18a hairpin precursor, which is a member of the miR-17-92 cluster. Application of the small molecule, Targapremir-18a, to prostate cancer cells inhibited production of miR-18a from the cluster, de-repressed serine/threonine protein kinase 4 protein (STK4), and triggered apoptosis. Profiling the cellular targets of Targapremir-18a via Chemical Cross-Linking and Isolation by Pull Down (Chem-CLIP), a covalent small molecule-RNA cellular profiling approach, and other studies showed specific binding of the compound to the miR-18a precursor, revealing broadly applicable factors that govern small molecule drugging of noncoding RNAs.
EDCs DataBank: 3D-Structure database of endocrine disrupting chemicals.

PubMed

Montes-Grajales, Diana; Olivero-Verbel, Jesus

2015-01-02

Endocrine disrupting chemicals (EDCs) are a group of compounds that affect the endocrine system, frequently found in everyday products and epidemiologically associated with several diseases. The purpose of this work was to develop EDCs DataBank, the only database of EDCs with three-dimensional structures. This database was built on MySQL using the EU list of potential endocrine disruptors and TEDX list. It contains the three-dimensional structures available on PubChem, as well as a wide variety of information from different databases and text mining tools, useful for almost any kind of research regarding EDCs. The web platform was developed employing HTML, CSS and PHP languages, with dynamic contents in a graphic environment, facilitating information analysis. Currently EDCs DataBank has 615 molecules, including pesticides, natural and industrial products, cosmetics, drugs and food additives, among other low molecular weight xenobiotics. Therefore, this database can be used to study the toxicological effects of these molecules, or to develop pharmaceuticals targeting hormone receptors, through docking studies, high-throughput virtual screening and ligand-protein interaction analysis. EDCs DataBank is totally user-friendly and the 3D-structures of the molecules can be downloaded in several formats. This database is freely available at http://edcs.unicartagena.edu.co. Copyright © 2014. Published by Elsevier Ireland Ltd.
PoLi: A Virtual Screening Pipeline Based On Template Pocket And Ligand Similarity

PubMed Central

Roy, Ambrish; Srinivasan, Bharath; Skolnick, Jeffrey

2015-01-01

Often in pharmaceutical research, the goal is to identify small molecules that can interact with and appropriately modify the biological behavior of a new protein target. Unfortunately, most proteins lack both known structures and small molecule binders, prerequisites of many virtual screening, VS, approaches. For such proteins, ligand homology modeling, LHM, that copies ligands from homologous and perhaps evolutionarily distant template proteins, has been shown to be a powerful VS approach to identify possible binding ligands. However, if we want to target a specific pocket for which there is no homologous holo template protein structure, then LHM will not work. To address this issue, in a new pocket based approach, PoLi, we generalize LHM by exploiting the fact that the number of distinct small molecule ligand binding pockets in proteins is small. PoLi identifies similar ligand binding pockets in a holo-template protein library, selectively copies relevant parts of template ligands and uses them for VS. In practice, PoLi is a hybrid structure and ligand based VS algorithm that integrates 2D fingerprint-based and 3D shape-based similarity metrics for improved virtual screening performance. On standard DUD and DUD-E benchmark databases, using modeled receptor structures, PoLi achieves an average enrichment factor of 13.4 and 9.6 respectively, in the top 1% of the screened library. In contrast, traditional docking based VS using AutoDock Vina and homology-based VS using FINDSITEfilt have an average enrichment of 1.6 (3.0) and 9.0 (7.9) on the DUD (DUD-E) sets respectively. Experimental validation of PoLi predictions on dihydrofolate reductase, DHFR, using differential scanning fluorimetry, DSF, identifies multiple ligands with diverse molecular scaffolds, thus demonstrating the advantage of PoLi over current state-of-the-art VS methods. PMID:26225536
Annular tautomerism: experimental observations and quantum mechanics calculations.

PubMed

Cruz-Cabeza, Aurora J; Schreyer, Adrian; Pitt, William R

2010-06-01

The use of MP2 level quantum mechanical (QM) calculations on isolated heteroaromatic ring systems for the prediction of the tautomeric propensities of whole molecules in a crystalline environment was examined. A Polarisable Continuum Model was used in the calculations to account for environment effects on the tautomeric relative stabilities. The calculated relative energies of tautomers were compared to relative abundances within the Cambridge Structural Database (CSD) and the Protein Data Bank (PDB). The work was focussed on 84 annular tautomeric forms of 34 common ring systems. Good agreement was found between the calculations and the experimental data even if the quantity of these data was limited in many cases. The QM results were compared to those produced by much faster semiempirical calculations. In a search for other sources of the useful experimental data, the relative numbers of known compounds in which prototropic positions were often substituted by heavy atoms were also analysed. A scheme which groups all annular tautomeric transformations into 10 classes was developed. The scheme was designed to encompass a comprehensive set of known and theoretically possible tautomeric ring systems generated as part of a previous study. General trends across analogous ring systems were detected as a result. The calculations and statistics collected on crystallographic data as well as the general trends observed should be useful for the better modelling of annular tautomerism in the applications such as computer-aided drug design, small molecule crystal structure prediction, the naming of compounds and the interpretation of protein-small molecule crystal structures.
Annular tautomerism: experimental observations and quantum mechanics calculations

NASA Astrophysics Data System (ADS)

Cruz-Cabeza, Aurora J.; Schreyer, Adrian; Pitt, William R.

2010-06-01

The use of MP2 level quantum mechanical (QM) calculations on isolated heteroaromatic ring systems for the prediction of the tautomeric propensities of whole molecules in a crystalline environment was examined. A Polarisable Continuum Model was used in the calculations to account for environment effects on the tautomeric relative stabilities. The calculated relative energies of tautomers were compared to relative abundances within the Cambridge Structural Database (CSD) and the Protein Data Bank (PDB). The work was focussed on 84 annular tautomeric forms of 34 common ring systems. Good agreement was found between the calculations and the experimental data even if the quantity of these data was limited in many cases. The QM results were compared to those produced by much faster semiempirical calculations. In a search for other sources of the useful experimental data, the relative numbers of known compounds in which prototropic positions were often substituted by heavy atoms were also analysed. A scheme which groups all annular tautomeric transformations into 10 classes was developed. The scheme was designed to encompass a comprehensive set of known and theoretically possible tautomeric ring systems generated as part of a previous study. General trends across analogous ring systems were detected as a result. The calculations and statistics collected on crystallographic data as well as the general trends observed should be useful for the better modelling of annular tautomerism in the applications such as computer-aided drug design, small molecule crystal structure prediction, the naming of compounds and the interpretation of protein—small molecule crystal structures.
Small Molecule Inhibitors of AI-2 Signaling in Bacteria: State-of-the-Art and Future Perspectives for Anti-Quorum Sensing Agents

PubMed Central

Guo, Min; Gamby, Sonja; Zheng, Yue; Sintim, Herman O.

2013-01-01

Bacteria respond to different small molecules that are produced by other neighboring bacteria. These molecules, called autoinducers, are classified as intraspecies (i.e., molecules produced and perceived by the same bacterial species) or interspecies (molecules that are produced and sensed between different bacterial species). AI-2 has been proposed as an interspecies autoinducer and has been shown to regulate different bacterial physiology as well as affect virulence factor production and biofilm formation in some bacteria, including bacteria of clinical relevance. Several groups have embarked on the development of small molecules that could be used to perturb AI-2 signaling in bacteria, with the ultimate goal that these molecules could be used to inhibit bacterial virulence and biofilm formation. Additionally, these molecules have the potential to be used in synthetic biology applications whereby these small molecules are used as inputs to switch on and off AI-2 receptors. In this review, we highlight the state-of-the-art in the development of small molecules that perturb AI-2 signaling in bacteria and offer our perspective on the future development and applications of these classes of molecules. PMID:23994835
Organic small molecule semiconducting chromophores for use in organic electronic devices

DOE Office of Scientific and Technical Information (OSTI.GOV)

Welch, Gregory C.; Hoven, Corey V.; Nguyen, Thuc-Quyen

Small organic molecule semi-conducting chromophores containing a pyridalthiadiazole, pyridaloxadiazole, or pyridaltriazole core structure are disclosed. Such compounds can be used in organic heterojunction devices, such as organic small molecule solar cells and transistors.
Structure-guided Discovery of Dual-recognition Chemibodies.

PubMed

Cheng, Alan C; Doherty, Elizabeth M; Johnstone, Sheree; DiMauro, Erin F; Dao, Jennifer; Luthra, Abhinav; Ye, Jay; Tang, Jie; Nixey, Thomas; Min, Xiaoshan; Tagari, Philip; Miranda, Les P; Wang, Zhulun

2018-05-15

Small molecules and antibodies each have advantages and limitations as therapeutics. Here, we present for the first time to our knowledge, the structure-guided design of "chemibodies" as small molecule-antibody hybrids that offer dual recognition of a single target by both a small molecule and an antibody, using DPP-IV enzyme as a proof of concept study. Biochemical characterization demonstrates that the chemibodies present superior DPP-IV inhibition compared to either small molecule or antibody component alone. We validated our design by successfully solving a co-crystal structure of a chemibody in complex with DPP-IV, confirming specific binding of the small molecule portion at the interior catalytic site and the Fab portion at the protein surface. The discovery of chemibodies presents considerable potential for novel therapeutics that harness the power of both small molecule and antibody modalities to achieve superior specificity, potency, and pharmacokinetic properties.
Rational design of chemical genetic probes of RNA function and lead therapeutics targeting repeating transcripts.

PubMed

Disney, Matthew D

2013-12-01

RNA is an important yet vastly underexploited target for small molecule chemical probes or lead therapeutics. Small molecules have been used successfully to modulate the function of the bacterial ribosome, viral RNAs and riboswitches. These RNAs are either highly expressed or can be targeted using substrate mimicry, a mainstay in the design of enzyme inhibitors. However, most cellular RNAs are neither highly expressed nor have a lead small molecule inhibitor, a significant challenge for drug discovery efforts. Herein, I describe the design of small molecules targeting expanded repeating transcripts that cause myotonic muscular dystrophy (DM). These test cases illustrate the challenges of designing small molecules that target RNA and the advantages of targeting repeating transcripts. Lastly, I discuss how small molecules might be more advantageous than oligonucleotides for targeting RNA. Copyright © 2013 Elsevier Ltd. All rights reserved.
Advancing Biological Understanding and Therapeutics Discovery with Small Molecule Probes

PubMed Central

Schreiber, Stuart L.; Kotz, Joanne D.; Li, Min; Aubé, Jeffrey; Austin, Christopher P.; Reed, John C.; Rosen, Hugh; White, E. Lucile; Sklar, Larry A.; Lindsley, Craig W.; Alexander, Benjamin R.; Bittker, Joshua A.; Clemons, Paul A.; de Souza, Andrea; Foley, Michael A.; Palmer, Michelle; Shamji, Alykhan F.; Wawer, Mathias J.; McManus, Owen; Wu, Meng; Zou, Beiyan; Yu, Haibo; Golden, Jennifer E.; Schoenen, Frank J.; Simeonov, Anton; Jadhav, Ajit; Jackson, Michael R.; Pinkerton, Anthony B.; Chung, Thomas D.Y.; Griffin, Patrick R.; Cravatt, Benjamin F.; Hodder, Peter S.; Roush, William R.; Roberts, Edward; Chung, Dong-Hoon; Jonsson, Colleen B.; Noah, James W.; Severson, William E.; Ananthan, Subramaniam; Edwards, Bruce; Oprea, Tudor I.; Conn, P. Jeffrey; Hopkins, Corey R.; Wood, Michael R.; Stauffer, Shaun R.; Emmitte, Kyle A.

2015-01-01

Small-molecule probes can illuminate biological processes and aid in the assessment of emerging therapeutic targets by perturbing biological systems in a manner distinct from other experimental approaches. Despite the tremendous promise of chemical tools for investigating biology and disease, small-molecule probes were unavailable for most targets and pathways as recently as a decade ago. In 2005, the U.S. National Institutes of Health launched the decade-long Molecular Libraries Program with the intent of innovating in and broadening access to small-molecule science. This Perspective describes how novel small-molecule probes identified through the program are enabling the exploration of biological pathways and therapeutic hypotheses not otherwise testable. These experiences illustrate how small-molecule probes can help bridge the chasm between biological research and the development of medicines, but also highlight the need to innovate the science of therapeutic discovery. PMID:26046436
High mobility high efficiency organic films based on pure organic materials

DOEpatents

Salzman, Rhonda F [Ann Arbor, MI; Forrest, Stephen R [Ann Arbor, MI

2009-01-27

A method of purifying small molecule organic material, performed as a series of operations beginning with a first sample of the organic small molecule material. The first step is to purify the organic small molecule material by thermal gradient sublimation. The second step is to test the purity of at least one sample from the purified organic small molecule material by spectroscopy. The third step is to repeat the first through third steps on the purified small molecule material if the spectroscopic testing reveals any peaks exceeding a threshold percentage of a magnitude of a characteristic peak of a target organic small molecule. The steps are performed at least twice. The threshold percentage is at most 10%. Preferably the threshold percentage is 5% and more preferably 2%. The threshold percentage may be selected based on the spectra of past samples that achieved target performance characteristics in finished devices.
SPLICE: A program to assemble partial query solutions from three-dimensional database searches into novel ligands

NASA Astrophysics Data System (ADS)

Ho, Chris M. W.; Marshall, Garland R.

1993-12-01

SPLICE is a program that processes partial query solutions retrieved from 3D, structural databases to generate novel, aggregate ligands. It is designed to interface with the database searching program FOUNDATION, which retrieves fragments containing any combination of a user-specified minimum number of matching query elements. SPLICE eliminates aspects of structures that are physically incapable of binding within the active site. Then, a systematic rule-based procedure is performed upon the remaining fragments to ensure receptor complementarity. All modifications are automated and remain transparent to the user. Ligands are then assembled by linking components into composite structures through overlapping bonds. As a control experiment, FOUNDATION and SPLICE were used to reconstruct a know HIV-1 protease inhibitor after it had been fragmented, reoriented, and added to a sham database of fifty different small molecules. To illustrate the capabilities of this program, a 3D search query containing the pharmacophoric elements of an aspartic proteinase-inhibitor crystal complex was searched using FOUNDATION against a subset of the Cambridge Structural Database. One hundred thirty-one compounds were retrieved, each containing any combination of at least four query elements. Compounds were automatically screened and edited for receptor complementarity. Numerous combinations of fragments were discovered that could be linked to form novel structures, containing a greater number of pharmacophoric elements than any single retrieved fragment.
GPCR & company: databases and servers for GPCRs and interacting partners.

PubMed

Kowalsman, Noga; Niv, Masha Y

2014-01-01

G-protein-coupled receptors (GPCRs) are a large superfamily of membrane receptors that are involved in a wide range of signaling pathways. To fulfill their tasks, GPCRs interact with a variety of partners, including small molecules, lipids and proteins. They are accompanied by different proteins during all phases of their life cycle. Therefore, GPCR interactions with their partners are of great interest in basic cell-signaling research and in drug discovery.Due to the rapid development of computers and internet communication, knowledge and data can be easily shared within the worldwide research community via freely available databases and servers. These provide an abundance of biological, chemical and pharmacological information.This chapter describes the available web resources for investigating GPCR interactions. We review about 40 freely available databases and servers, and provide a few sentences about the essence and the data they supply. For simplification, the databases and servers were grouped under the following topics: general GPCR-ligand interactions; particular families of GPCRs and their ligands; GPCR oligomerization; GPCR interactions with intracellular partners; and structural information on GPCRs. In conclusion, a multitude of useful tools are currently available. Summary tables are provided to ease navigation between the numerous and partially overlapping resources. Suggestions for future enhancements of the online tools include the addition of links from general to specialized databases and enabling usage of user-supplied template for GPCR structural modeling.
Characterizing protein domain associations by Small-molecule ligand binding

PubMed Central

Li, Qingliang; Cheng, Tiejun; Wang, Yanli; Bryant, Stephen H.

2012-01-01

Background Protein domains are evolutionarily conserved building blocks for protein structure and function, which are conventionally identified based on protein sequence or structure similarity. Small molecule binding domains are of great importance for the recognition of small molecules in biological systems and drug development. Many small molecules, including drugs, have been increasingly identified to bind to multiple targets, leading to promiscuous interactions with protein domains. Thus, a large scale characterization of the protein domains and their associations with respect to small-molecule binding is of particular interest to system biology research, drug target identification, as well as drug repurposing. Methods We compiled a collection of 13,822 physical interactions of small molecules and protein domains derived from the Protein Data Bank (PDB) structures. Based on the chemical similarity of these small molecules, we characterized pairwise associations of the protein domains and further investigated their global associations from a network point of view. Results We found that protein domains, despite lack of similarity in sequence and structure, were comprehensively associated through binding the same or similar small-molecule ligands. Moreover, we identified modules in the domain network that consisted of closely related protein domains by sharing similar biochemical mechanisms, being involved in relevant biological pathways, or being regulated by the same cognate cofactors. Conclusions A novel protein domain relationship was identified in the context of small-molecule binding, which is complementary to those identified by traditional sequence-based or structure-based approaches. The protein domain network constructed in the present study provides a novel perspective for chemogenomic study and network pharmacology, as well as target identification for drug repurposing. PMID:23745168
Proteoform-specific protein binding of small molecules in complex matrices

USDA-ARS?s Scientific Manuscript database

Characterizing the specific binding between protein targets and small molecules is critically important for drug discovery. Conventional assays require isolation and purification of small molecules from complex matrices through multistep chromatographic fractionation, which may alter their original ...
Prospective virtual screening for novel p53-MDM2 inhibitors using ultrafast shape recognition

NASA Astrophysics Data System (ADS)

Patil, Sachin P.; Ballester, Pedro J.; Kerezsi, Cassidy R.

2014-02-01

The p53 protein, known as the guardian of genome, is mutated or deleted in approximately 50 % of human tumors. In the rest of the cancers, p53 is expressed in its wild-type form, but its function is inhibited by direct binding with the murine double minute 2 (MDM2) protein. Therefore, inhibition of the p53-MDM2 interaction, leading to the activation of tumor suppressor p53 protein presents a fundamentally novel therapeutic strategy against several types of cancers. The present study utilized ultrafast shape recognition (USR), a virtual screening technique based on ligand-receptor 3D shape complementarity, to screen DrugBank database for novel p53-MDM2 inhibitors. Specifically, using 3D shape of one of the most potent crystal ligands of MDM2, MI-63, as the query molecule, six compounds were identified as potential p53-MDM2 inhibitors. These six USR hits were then subjected to molecular modeling investigations through flexible receptor docking followed by comparative binding energy analysis. These studies suggested a potential role of the USR-selected molecules as p53-MDM2 inhibitors. This was further supported by experimental tests showing that the treatment of human colon tumor cells with the top USR hit, telmisartan, led to a dose-dependent cell growth inhibition in a p53-dependent manner. It is noteworthy that telmisartan has a long history of safe human use as an approved anti-hypertension drug and thus may present an immediate clinical potential as a cancer therapeutic. Furthermore, it could also serve as a structurally-novel lead molecule for the development of more potent, small-molecule p53-MDM2 inhibitors against variety of cancers. Importantly, the present study demonstrates that the adopted USR-based virtual screening protocol is a useful tool for hit identification in the domain of small molecule p53-MDM2 inhibitors.
Connecting synthetic chemistry decisions to cell and genome biology using small-molecule phenotypic profiling

PubMed Central

Wagner, Bridget K.; Clemons, Paul A.

2009-01-01

Discovering small-molecule modulators for thousands of gene products requires multiple stages of biological testing, specificity evaluation, and chemical optimization. Many cellular profiling methods, including cellular sensitivity, gene-expression, and cellular imaging, have emerged as methods to assess the functional consequences of biological perturbations. Cellular profiling methods applied to small-molecule science provide opportunities to use complex phenotypic information to prioritize and optimize small-molecule structures simultaneously against multiple biological endpoints. As throughput increases and cost decreases for such technologies, we see an emerging paradigm of using more information earlier in probe- and drug-discovery efforts. Moreover, increasing access to public datasets makes possible the construction of “virtual” profiles of small-molecule performance, even when multiplexed measurements were not performed or when multidimensional profiling was not the original intent. We review some key conceptual advances in small-molecule phenotypic profiling, emphasizing connections to other information, such as protein-binding measurements, genetic perturbations, and cell states. We argue that to maximally leverage these measurements in probe and drug discovery requires a fundamental connection to synthetic chemistry, allowing the consequences of synthetic decisions to be described in terms of changes in small-molecule profiles. Mining such data in the context of chemical structure and synthesis strategies can inform decisions about chemistry procurement and library development, leading to optimal small-molecule screening collections. PMID:19825513

Small Molecule based Musculoskeletal Regenerative Engineering

PubMed Central

Lo, Kevin W.-H.; Jiang, Tao; Gagnon, Keith A.; Nelson, Clarke; Laurencin, Cato T.

2014-01-01

Clinicians and scientists working in the field of regenerative engineering are actively investigating a wide range of methods to promote musculoskeletal tissue regeneration. Small molecule-mediated tissue regeneration is emerging as a promising strategy for regenerating various musculoskeletal tissues and a large number of small molecule compounds have been recently discovered as potential bioactive molecules for musculoskeletal tissue repair and regeneration. In this review, we summarize the recent literature encompassing the past four years in the area of small bioactive molecule for promoting repair and regeneration of various musculoskeletal tissues including bone, muscle, cartilage, tendon, and nerve. PMID:24405851
New glycoproteomics software, GlycoPep Evaluator, generates decoy glycopeptides de novo and enables accurate false discovery rate analysis for small data sets.

PubMed

Zhu, Zhikai; Su, Xiaomeng; Go, Eden P; Desaire, Heather

2014-09-16

Glycoproteins are biologically significant large molecules that participate in numerous cellular activities. In order to obtain site-specific protein glycosylation information, intact glycopeptides, with the glycan attached to the peptide sequence, are characterized by tandem mass spectrometry (MS/MS) methods such as collision-induced dissociation (CID) and electron transfer dissociation (ETD). While several emerging automated tools are developed, no consensus is present in the field about the best way to determine the reliability of the tools and/or provide the false discovery rate (FDR). A common approach to calculate FDRs for glycopeptide analysis, adopted from the target-decoy strategy in proteomics, employs a decoy database that is created based on the target protein sequence database. Nonetheless, this approach is not optimal in measuring the confidence of N-linked glycopeptide matches, because the glycopeptide data set is considerably smaller compared to that of peptides, and the requirement of a consensus sequence for N-glycosylation further limits the number of possible decoy glycopeptides tested in a database search. To address the need to accurately determine FDRs for automated glycopeptide assignments, we developed GlycoPep Evaluator (GPE), a tool that helps to measure FDRs in identifying glycopeptides without using a decoy database. GPE generates decoy glycopeptides de novo for every target glycopeptide, in a 1:20 target-to-decoy ratio. The decoys, along with target glycopeptides, are scored against the ETD data, from which FDRs can be calculated accurately based on the number of decoy matches and the ratio of the number of targets to decoys, for small data sets. GPE is freely accessible for download and can work with any search engine that interprets ETD data of N-linked glycopeptides. The software is provided at https://desairegroup.ku.edu/research.
Defining RNA–Small Molecule Affinity Landscapes Enables Design of a Small Molecule Inhibitor of an Oncogenic Noncoding RNA

PubMed Central

2017-01-01

RNA drug targets are pervasive in cells, but methods to design small molecules that target them are sparse. Herein, we report a general approach to score the affinity and selectivity of RNA motif–small molecule interactions identified via selection. Named High Throughput Structure–Activity Relationships Through Sequencing (HiT-StARTS), HiT-StARTS is statistical in nature and compares input nucleic acid sequences to selected library members that bind a ligand via high throughput sequencing. The approach allowed facile definition of the fitness landscape of hundreds of thousands of RNA motif–small molecule binding partners. These results were mined against folded RNAs in the human transcriptome and identified an avid interaction between a small molecule and the Dicer nuclease-processing site in the oncogenic microRNA (miR)-18a hairpin precursor, which is a member of the miR-17-92 cluster. Application of the small molecule, Targapremir-18a, to prostate cancer cells inhibited production of miR-18a from the cluster, de-repressed serine/threonine protein kinase 4 protein (STK4), and triggered apoptosis. Profiling the cellular targets of Targapremir-18a via Chemical Cross-Linking and Isolation by Pull Down (Chem-CLIP), a covalent small molecule–RNA cellular profiling approach, and other studies showed specific binding of the compound to the miR-18a precursor, revealing broadly applicable factors that govern small molecule drugging of noncoding RNAs. PMID:28386598
HBVPathDB: a database of HBV infection-related molecular interaction network.

PubMed

Zhang, Yi; Bo, Xiao-Chen; Yang, Jing; Wang, Sheng-Qi

2005-03-21

To describe molecules or genes interaction between hepatitis B viruses (HBV) and host, for understanding how virus' and host's genes and molecules are networked to form a biological system and for perceiving mechanism of HBV infection. The knowledge of HBV infection-related reactions was organized into various kinds of pathways with carefully drawn graphs in HBVPathDB. Pathway information is stored with relational database management system (DBMS), which is currently the most efficient way to manage large amounts of data and query is implemented with powerful Structured Query Language (SQL). The search engine is written using Personal Home Page (PHP) with SQL embedded and web retrieval interface is developed for searching with Hypertext Markup Language (HTML). We present the first version of HBVPathDB, which is a HBV infection-related molecular interaction network database composed of 306 pathways with 1 050 molecules involved. With carefully drawn graphs, pathway information stored in HBVPathDB can be browsed in an intuitive way. We develop an easy-to-use interface for flexible accesses to the details of database. Convenient software is implemented to query and browse the pathway information of HBVPathDB. Four search page layout options-category search, gene search, description search, unitized search-are supported by the search engine of the database. The database is freely available at http://www.bio-inf.net/HBVPathDB/HBV/. The conventional perspective HBVPathDB have already contained a considerable amount of pathway information with HBV infection related, which is suitable for in-depth analysis of molecular interaction network of virus and host. HBVPathDB integrates pathway data-sets with convenient software for query, browsing, visualization, that provides users more opportunity to identify regulatory key molecules as potential drug targets and to explore the possible mechanism of HBV infection based on gene expression datasets.
Understanding the Halogenation Effects in Diketopyrrolopyrrole-Based Small Molecule Photovoltaics.

PubMed

Sun, Shi-Xin; Huo, Yong; Li, Miao-Miao; Hu, Xiaowen; Zhang, Hai-Jun; Zhang, You-Wen; Zhang, You-Dan; Chen, Xiao-Long; Shi, Zi-Fa; Gong, Xiong; Chen, Yongsheng; Zhang, Hao-Li

2015-09-16

Two molecules containing a central diketopyrrolopyrrole and two oligothiophene units have been designed and synthesized. Comparisons between the molecules containing terminal F (FDPP) and Cl (CDPP) atoms allowed us to evaluate the effects of halogenation on the photovoltaic properties of the small molecule organic solar cells (OSCs). The OSCs devices employing FDPP:PC71BM films showed power conversion efficiencies up to 4.32%, suggesting that fluorination is an efficient method for constructing small molecules for OSCs.
LS-align: an atom-level, flexible ligand structural alignment algorithm for high-throughput virtual screening.

PubMed

Hu, Jun; Liu, Zi; Yu, Dong-Jun; Zhang, Yang

2018-02-15

Sequence-order independent structural comparison, also called structural alignment, of small ligand molecules is often needed for computer-aided virtual drug screening. Although many ligand structure alignment programs are proposed, most of them build the alignments based on rigid-body shape comparison which cannot provide atom-specific alignment information nor allow structural variation; both abilities are critical to efficient high-throughput virtual screening. We propose a novel ligand comparison algorithm, LS-align, to generate fast and accurate atom-level structural alignments of ligand molecules, through an iterative heuristic search of the target function that combines inter-atom distance with mass and chemical bond comparisons. LS-align contains two modules of Rigid-LS-align and Flexi-LS-align, designed for rigid-body and flexible alignments, respectively, where a ligand-size independent, statistics-based scoring function is developed to evaluate the similarity of ligand molecules relative to random ligand pairs. Large-scale benchmark tests are performed on prioritizing chemical ligands of 102 protein targets involving 1,415,871 candidate compounds from the DUD-E (Database of Useful Decoys: Enhanced) database, where LS-align achieves an average enrichment factor (EF) of 22.0 at the 1% cutoff and the AUC score of 0.75, which are significantly higher than other state-of-the-art methods. Detailed data analyses show that the advanced performance is mainly attributed to the design of the target function that combines structural and chemical information to enhance the sensitivity of recognizing subtle difference of ligand molecules and the introduces of structural flexibility that help capture the conformational changes induced by the ligand-receptor binding interactions. These data demonstrate a new avenue to improve the virtual screening efficiency through the development of sensitive ligand structural alignments. http://zhanglab.ccmb.med.umich.edu/LS-align/. njyudj@njust.edu.cn or zhng@umich.edu. Supplementary data are available at Bioinformatics online.
Cyndi: a multi-objective evolution algorithm based method for bioactive molecular conformational generation.

PubMed

Liu, Xiaofeng; Bai, Fang; Ouyang, Sisheng; Wang, Xicheng; Li, Honglin; Jiang, Hualiang

2009-03-31

Conformation generation is a ubiquitous problem in molecule modelling. Many applications require sampling the broad molecular conformational space or perceiving the bioactive conformers to ensure success. Numerous in silico methods have been proposed in an attempt to resolve the problem, ranging from deterministic to non-deterministic and systemic to stochastic ones. In this work, we described an efficient conformation sampling method named Cyndi, which is based on multi-objective evolution algorithm. The conformational perturbation is subjected to evolutionary operation on the genome encoded with dihedral torsions. Various objectives are designated to render the generated Pareto optimal conformers to be energy-favoured as well as evenly scattered across the conformational space. An optional objective concerning the degree of molecular extension is added to achieve geometrically extended or compact conformations which have been observed to impact the molecular bioactivity (J Comput -Aided Mol Des 2002, 16: 105-112). Testing the performance of Cyndi against a test set consisting of 329 small molecules reveals an average minimum RMSD of 0.864 A to corresponding bioactive conformations, indicating Cyndi is highly competitive against other conformation generation methods. Meanwhile, the high-speed performance (0.49 +/- 0.18 seconds per molecule) renders Cyndi to be a practical toolkit for conformational database preparation and facilitates subsequent pharmacophore mapping or rigid docking. The copy of precompiled executable of Cyndi and the test set molecules in mol2 format are accessible in Additional file 1. On the basis of MOEA algorithm, we present a new, highly efficient conformation generation method, Cyndi, and report the results of validation and performance studies comparing with other four methods. The results reveal that Cyndi is capable of generating geometrically diverse conformers and outperforms other four multiple conformer generators in the case of reproducing the bioactive conformations against 329 structures. The speed advantage indicates Cyndi is a powerful alternative method for extensive conformational sampling and large-scale conformer database preparation.
Bias-Free Chemically Diverse Test Sets from Machine Learning.

PubMed

Swann, Ellen T; Fernandez, Michael; Coote, Michelle L; Barnard, Amanda S

2017-08-14

Current benchmarking methods in quantum chemistry rely on databases that are built using a chemist's intuition. It is not fully understood how diverse or representative these databases truly are. Multivariate statistical techniques like archetypal analysis and K-means clustering have previously been used to summarize large sets of nanoparticles however molecules are more diverse and not as easily characterized by descriptors. In this work, we compare three sets of descriptors based on the one-, two-, and three-dimensional structure of a molecule. Using data from the NIST Computational Chemistry Comparison and Benchmark Database and machine learning techniques, we demonstrate the functional relationship between these structural descriptors and the electronic energy of molecules. Archetypes and prototypes found with topological or Coulomb matrix descriptors can be used to identify smaller, statistically significant test sets that better capture the diversity of chemical space. We apply this same method to find a diverse subset of organic molecules to demonstrate how the methods can easily be reapplied to individual research projects. Finally, we use our bias-free test sets to assess the performance of density functional theory and quantum Monte Carlo methods.
Harnessing Connectivity in a Large-Scale Small-Molecule Sensitivity Dataset | Office of Cancer Genomics

Cancer.gov

Identifying genetic alterations that prime a cancer cell to respond to a particular therapeutic agent can facilitate the development of precision cancer medicines. Cancer cell-line (CCL) profiling of small-molecule sensitivity has emerged as an unbiased method to assess the relationships between genetic or cellular features of CCLs and small-molecule response. Here, we developed annotated cluster multidimensional enrichment analysis to explore the associations between groups of small molecules and groups of CCLs in a new, quantitative sensitivity dataset.
A Nonfullerene Small Molecule Acceptor with 3D Interlocking Geometry Enabling Efficient Organic Solar Cells.

PubMed

Lee, Jaewon; Singh, Ranbir; Sin, Dong Hun; Kim, Heung Gyu; Song, Kyu Chan; Cho, Kilwon

2016-01-06

A new 3D nonfullerene small-molecule acceptor is reported. The 3D interlocking geometry of the small-molecule acceptor enables uniform molecular conformation and strong intermolecular connectivity, facilitating favorable nanoscale phase separation and electron charge transfer. By employing both a novel polymer donor and a nonfullerene small-molecule acceptor in the solution-processed organic solar cells, a high-power conversion efficiency of close to 6% is demonstrated. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
A "roller-wheel" Pt-containing small molecule that outperforms its polymer analogs in organic solar cells

DOE PAGES

He, Wenhan; Wu, Qin; Livshits, Maksim Y.; ...

2016-05-23

A novel Pt-bisacetylide small molecule (Pt-SM) featuring “roller-wheel” geometry was synthesized and characterized. When compared with conventional Pt-containing polymers and small molecules having “dumbbell” shaped structures, Pt-SM displays enhanced crystallinity and intermolecular π–π interactions, as well as favorable panchromatic absorption behaviors. Furthermore, organic solar cells (OSCs) employing Pt-SM achieve power conversion efficiencies (PCEs) up to 5.9%, the highest reported so far for Pt-containing polymers and small molecules.
A "roller-wheel" Pt-containing small molecule that outperforms its polymer analogs in organic solar cells

DOE Office of Scientific and Technical Information (OSTI.GOV)

He, Wenhan; Wu, Qin; Livshits, Maksim Y.

A novel Pt-bisacetylide small molecule (Pt-SM) featuring “roller-wheel” geometry was synthesized and characterized. When compared with conventional Pt-containing polymers and small molecules having “dumbbell” shaped structures, Pt-SM displays enhanced crystallinity and intermolecular π–π interactions, as well as favorable panchromatic absorption behaviors. Furthermore, organic solar cells (OSCs) employing Pt-SM achieve power conversion efficiencies (PCEs) up to 5.9%, the highest reported so far for Pt-containing polymers and small molecules.
Developing a Multi-Dimensional Hydrodynamics Code with Astrochemical Reactions

NASA Astrophysics Data System (ADS)

Kwak, Kyujin; Yang, Seungwon

2015-08-01

The Atacama Large Millimeter/submillimeter Array (ALMA) revealed high resolution molecular lines some of which are still unidentified yet. Because formation of these astrochemical molecules has been seldom studied in traditional chemistry, observations of new molecular lines drew a lot of attention from not only astronomers but also chemists both experimental and theoretical. Theoretical calculations for the formation of these astrochemical molecules have been carried out providing reaction rates for some important molecules, and some of theoretical predictions have been measured in laboratories. The reaction rates for the astronomically important molecules are now collected to form databases some of which are publically available. By utilizing these databases, we develop a multi-dimensional hydrodynamics code that includes the reaction rates of astrochemical molecules. Because this type of hydrodynamics code is able to trace the molecular formation in a non-equilibrium fashion, it is useful to study the formation history of these molecules that affects the spatial distribution of some specific molecules. We present the development procedure of this code and some test problems in order to verify and validate the developed code.
miRSponge: a manually curated database for experimentally supported miRNA sponges and ceRNAs.

PubMed

Wang, Peng; Zhi, Hui; Zhang, Yunpeng; Liu, Yue; Zhang, Jizhou; Gao, Yue; Guo, Maoni; Ning, Shangwei; Li, Xia

2015-01-01

In this study, we describe miRSponge, a manually curated database, which aims at providing an experimentally supported resource for microRNA (miRNA) sponges. Recent evidence suggests that miRNAs are themselves regulated by competing endogenous RNAs (ceRNAs) or 'miRNA sponges' that contain miRNA binding sites. These competitive molecules can sequester miRNAs to prevent them interacting with their natural targets to play critical roles in various biological and pathological processes. It has become increasingly important to develop a high quality database to record and store ceRNA data to support future studies. To this end, we have established the experimentally supported miRSponge database that contains data on 599 miRNA-sponge interactions and 463 ceRNA relationships from 11 species following manual curating from nearly 1200 published articles. Database classes include endogenously generated molecules including coding genes, pseudogenes, long non-coding RNAs and circular RNAs, along with exogenously introduced molecules including viral RNAs and artificial engineered sponges. Approximately 70% of the interactions were identified experimentally in disease states. miRSponge provides a user-friendly interface for convenient browsing, retrieval and downloading of dataset. A submission page is also included to allow researchers to submit newly validated miRNA sponge data. Database URL: http://www.bio-bigdata.net/miRSponge. © The Author(s) 2015. Published by Oxford University Press.
miRSponge: a manually curated database for experimentally supported miRNA sponges and ceRNAs

PubMed Central

Wang, Peng; Zhi, Hui; Zhang, Yunpeng; Liu, Yue; Zhang, Jizhou; Gao, Yue; Guo, Maoni; Ning, Shangwei; Li, Xia

2015-01-01

In this study, we describe miRSponge, a manually curated database, which aims at providing an experimentally supported resource for microRNA (miRNA) sponges. Recent evidence suggests that miRNAs are themselves regulated by competing endogenous RNAs (ceRNAs) or ‘miRNA sponges’ that contain miRNA binding sites. These competitive molecules can sequester miRNAs to prevent them interacting with their natural targets to play critical roles in various biological and pathological processes. It has become increasingly important to develop a high quality database to record and store ceRNA data to support future studies. To this end, we have established the experimentally supported miRSponge database that contains data on 599 miRNA-sponge interactions and 463 ceRNA relationships from 11 species following manual curating from nearly 1200 published articles. Database classes include endogenously generated molecules including coding genes, pseudogenes, long non-coding RNAs and circular RNAs, along with exogenously introduced molecules including viral RNAs and artificial engineered sponges. Approximately 70% of the interactions were identified experimentally in disease states. miRSponge provides a user-friendly interface for convenient browsing, retrieval and downloading of dataset. A submission page is also included to allow researchers to submit newly validated miRNA sponge data. Database URL: http://www.bio-bigdata.net/miRSponge. PMID:26424084
A-π-D-π-A Electron-Donating Small Molecules for Solution-Processed Organic Solar Cells: A Review.

PubMed

Wang, Zhen; Zhu, Lingyun; Shuai, Zhigang; Wei, Zhixiang

2017-11-01

Organic solar cells based on semiconducting polymers and small molecules have attracted considerable attention in the last two decades. Moreover, the power conversion efficiencies for solution-processed solar cells containing A-π-D-π-A-type small molecules and fullerenes have reached 11%. However, the method for designing high-performance, photovoltaic small molecules still remains unclear. In this review, recent studies on A-π-D-π-A electron-donating small molecules for organic solar cells are introduced. Moreover, the relationships between molecular properties and device performances are summarized, from which inspiration for the future design of high performance organic solar cells may be obtained. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
A reactive, scalable, and transferable model for molecular energies from a neural network approach based on local information

NASA Astrophysics Data System (ADS)

Unke, Oliver T.; Meuwly, Markus

2018-06-01

Despite the ever-increasing computer power, accurate ab initio calculations for large systems (thousands to millions of atoms) remain infeasible. Instead, approximate empirical energy functions are used. Most current approaches are either transferable between different chemical systems, but not particularly accurate, or they are fine-tuned to a specific application. In this work, a data-driven method to construct a potential energy surface based on neural networks is presented. Since the total energy is decomposed into local atomic contributions, the evaluation is easily parallelizable and scales linearly with system size. With prediction errors below 0.5 kcal mol-1 for both unknown molecules and configurations, the method is accurate across chemical and configurational space, which is demonstrated by applying it to datasets from nonreactive and reactive molecular dynamics simulations and a diverse database of equilibrium structures. The possibility to use small molecules as reference data to predict larger structures is also explored. Since the descriptor only uses local information, high-level ab initio methods, which are computationally too expensive for large molecules, become feasible for generating the necessary reference data used to train the neural network.
Tools for in silico target fishing.

PubMed

Cereto-Massagué, Adrià; Ojeda, María José; Valls, Cristina; Mulero, Miquel; Pujadas, Gerard; Garcia-Vallve, Santiago

2015-01-01

Computational target fishing methods are designed to identify the most probable target of a query molecule. This process may allow the prediction of the bioactivity of a compound, the identification of the mode of action of known drugs, the detection of drug polypharmacology, drug repositioning or the prediction of the adverse effects of a compound. The large amount of information regarding the bioactivity of thousands of small molecules now allows the development of these types of methods. In recent years, we have witnessed the emergence of many methods for in silico target fishing. Most of these methods are based on the similarity principle, i.e., that similar molecules might bind to the same targets and have similar bioactivities. However, the difficult validation of target fishing methods hinders comparisons of the performance of each method. In this review, we describe the different methods developed for target prediction, the bioactivity databases most frequently used by these methods, and the publicly available programs and servers that enable non-specialist users to obtain these types of predictions. It is expected that target prediction will have a large impact on drug development and on the functional food industry. Copyright © 2014 Elsevier Inc. All rights reserved.
A general electrochemical method for label-free screening of protein–small molecule interactions†

PubMed Central

Cash, Kevin J.; Ricci, Francesco

2010-01-01

Here we report a versatile method by which the interaction between a protein and a small molecule, and the disruption of that interaction by competition with other small molecules, can be monitored electrochemically directly in complex sample matrices. PMID:19826675
Toward Generalization of Iterative Small Molecule Synthesis

PubMed Central

Lehmann, Jonathan W.; Blair, Daniel J.; Burke, Martin D.

2018-01-01

Small molecules have extensive untapped potential to benefit society, but access to this potential is too often restricted by limitations inherent to the customized approach currently used to synthesize this class of chemical matter. In contrast, the “building block approach”, i.e., generalized iterative assembly of interchangeable parts, has now proven to be a highly efficient and flexible way to construct things ranging all the way from skyscrapers to macromolecules to artificial intelligence algorithms. The structural redundancy found in many small molecules suggests that they possess a similar capacity for generalized building block-based construction. It is also encouraging that many customized iterative synthesis methods have been developed that improve access to specific classes of small molecules. There has also been substantial recent progress toward the iterative assembly of many different types of small molecules, including complex natural products, pharmaceuticals, biological probes, and materials, using common building blocks and coupling chemistry. Collectively, these advances suggest that a generalized building block approach for small molecule synthesis may be within reach. PMID:29696152

Screening of Small Molecule Interactor Library by Using In-Cell NMR Spectroscopy (SMILI-NMR)

PubMed Central

Xie, Jingjing; Thapa, Rajiv; Reverdatto, Sergey; Burz, David S.; Shekhtman, Alexander

2011-01-01

We developed an in-cell NMR assay for screening small molecule interactor libraries (SMILI-NMR) for compounds capable of disrupting or enhancing specific interactions between two or more components of a biomolecular complex. The method relies on the formation of a well-defined biocomplex and utilizes in-cell NMR spectroscopy to identify the molecular surfaces involved in the interaction at atomic scale resolution. Changes in the interaction surface caused by a small molecule interfering with complex formation are used as a read-out of the assay. The in-cell nature of the experimental protocol insures that the small molecule is capable of penetrating the cell membrane and specifically engaging the target molecule(s). Utility of the method was demonstrated by screening a small dipeptide library against the FKBP–FRB protein complex involved in cell cycle arrest. The dipeptide identified by SMILI-NMR showed biological activity in a functional assay in yeast. PMID:19422228
Interactions of quercetin, curcumin, epigallocatechin gallate and folic acid with gelatin.

PubMed

Yang, Tingting; Yang, Huiru; Fan, Yan; Li, Bafang; Hou, Hu

2018-06-15

Some small bioactive molecules from food show the potential health benefits, but with poor chemical stability and bioavailability. The interactions between small molecules and gelatin were investigated. Fluorescence experiments demonstrated that the bimolecular quenching constants (k q ) of complexes (gelatin-quercetin, gelatin-curcumin, gelatin-epigallocatechin gallate, gelatin-folic acid) were 3.7 × 10 12  L·mol -1 ·s -1 , 1.4 × 10 12  L·mol -1 ·s -1 , 2.7 × 10 12  L·mol -1 ·s -1 and 8.5 × 10 12  L·mol -1 ·s -1 , indicating that fluorescence quenching did not arise from a dynamical mechanism, but from gelatin-small molecules binding. Furthermore, the affinity with gelatin was ranked in the order of folic acid > quercetin > epigallocatechin gallate > curcumin. Fluorescence spectroscopy, ultraviolet and visible absorption spectroscopy, FTIR and circular dichroism showed that the interactions between small molecules and gelatin did not significantly alter the conformation and secondary structure of gelatin. Non-covalent interactions may result in the binding of gelatin with small molecules. The interactions were considered to be through two modes: (1) small molecules bound within the hydrophobic pockets of gelatin; (2) small molecules surrounded the gelatin molecule mainly through hydrogen bonds and hydrophobic interactions. Copyright © 2018 Elsevier B.V. All rights reserved.
Survey of phosphorylation near drug binding sites in the Protein Data Bank (PDB) and their effects.

PubMed

Smith, Kyle P; Gifford, Kathleen M; Waitzman, Joshua S; Rice, Sarah E

2015-01-01

While it is currently estimated that 40 to 50% of eukaryotic proteins are phosphorylated, little is known about the frequency and local effects of phosphorylation near pharmaceutical inhibitor binding sites. In this study, we investigated how frequently phosphorylation may affect the binding of drug inhibitors to target proteins. We examined the 453 non-redundant structures of soluble mammalian drug target proteins bound to inhibitors currently available in the Protein Data Bank (PDB). We cross-referenced these structures with phosphorylation data available from the PhosphoSitePlus database. Three hundred twenty-two of 453 (71%) of drug targets have evidence of phosphorylation that has been validated by multiple methods or labs. For 132 of 453 (29%) of those, the phosphorylation site is within 12 Å of the small molecule-binding site, where it would likely alter small molecule binding affinity. We propose a framework for distinguishing between drug-phosphorylation site interactions that are likely to alter the efficacy of drugs versus those that are not. In addition we highlight examples of well-established drug targets, such as estrogen receptor alpha, for which phosphorylation may affect drug affinity and clinical efficacy. Our data suggest that phosphorylation may affect drug binding and efficacy for a significant fraction of drug target proteins. © 2014 Wiley Periodicals, Inc.
Reflectance spectroscopy of organic compounds: 1. Alkanes

NASA Astrophysics Data System (ADS)

Clark, Roger N.; Curchin, John M.; Hoefen, Todd M.; Swayze, Gregg A.

2009-03-01

Reflectance spectra of the organic compounds comprising the alkane series are presented from the ultraviolet to midinfrared, 0.35 to 15.5 μm. Alkanes are hydrocarbon molecules containing only single carbon-carbon bonds, and are found naturally on the Earth and in the atmospheres of the giant planets and Saturn's moon, Titan. This paper presents the spectral properties of the alkanes as the first in a series of papers to build a spectral database of organic compounds for use in remote sensing studies. Applications range from mapping the environment on the Earth, to the search for organic molecules and life in the solar system and throughout the universe. We show that the spectral reflectance properties of organic compounds are rich, with major diagnostic spectral features throughout the spectral range studied. Little to no spectral change was observed as a function of temperature and only small shifts and changes in the width of absorption bands were observed between liquids and solids, making remote detection of spectral properties throughout the solar system simpler. Some high molecular weight organic compounds contain single-bonded carbon chains and have spectra similar to alkanes even when they fall into other families. Small spectral differences are often present allowing discrimination among some compounds, further illustrating the need to catalog spectral properties for accurate remote sensing identification with spectroscopy.
Reflectance spectroscopy of organic compounds: 1. Alkanes

USGS Publications Warehouse

Clark, R.N.; Curchin, J.M.; Hoefen, T.M.; Swayze, G.A.

2009-01-01

Reflectance spectra of the organic compounds comprising the alkane series are presented from the ultraviolet to midinfrared, 0.35 to 15.5 /??m. Alkanes are hydrocarbon molecules containing only single carbon-carbon bonds, and are found naturally on the Earth and in the atmospheres of the giant planets and Saturn's moon, Titan. This paper presents the spectral properties of the alkanes as the first in a series of papers to build a spectral database of organic compounds for use in remote sensing studies. Applications range from mapping the environment on the Earth, to the search for organic molecules and life in the solar system and throughout the. universe. We show that the spectral reflectance properties of organic compounds are rich, with major diagnostic spectral features throughout the spectral range studied. Little to no spectral change was observed as a function of temperature and only small shifts and changes in the width of absorption bands were observed between liquids and solids, making remote detection of spectral properties throughout the solar system simpler. Some high molecular weight organic compounds contain single-bonded carbon chains and have spectra similar to alkanes even ' when they fall into other families. Small spectral differences are often present allowing discrimination among some compounds, further illustrating the need to catalog spectral properties for accurate remote sensing identification with spectroscopy.
Gene-centric Meta-analysis in 87,736 Individuals of European Ancestry Identifies Multiple Blood-Pressure-Related Loci

PubMed Central

Tragante, Vinicius; Barnes, Michael R.; Ganesh, Santhi K.; Lanktree, Matthew B.; Guo, Wei; Franceschini, Nora; Smith, Erin N.; Johnson, Toby; Holmes, Michael V.; Padmanabhan, Sandosh; Karczewski, Konrad J.; Almoguera, Berta; Barnard, John; Baumert, Jens; Chang, Yen-Pei Christy; Elbers, Clara C.; Farrall, Martin; Fischer, Mary E.; Gaunt, Tom R.; Gho, Johannes M.I.H.; Gieger, Christian; Goel, Anuj; Gong, Yan; Isaacs, Aaron; Kleber, Marcus E.; Leach, Irene Mateo; McDonough, Caitrin W.; Meijs, Matthijs F.L.; Melander, Olle; Nelson, Christopher P.; Nolte, Ilja M.; Pankratz, Nathan; Price, Tom S.; Shaffer, Jonathan; Shah, Sonia; Tomaszewski, Maciej; van der Most, Peter J.; Van Iperen, Erik P.A.; Vonk, Judith M.; Witkowska, Kate; Wong, Caroline O.L.; Zhang, Li; Beitelshees, Amber L.; Berenson, Gerald S.; Bhatt, Deepak L.; Brown, Morris; Burt, Amber; Cooper-DeHoff, Rhonda M.; Connell, John M.; Cruickshanks, Karen J.; Curtis, Sean P.; Davey-Smith, George; Delles, Christian; Gansevoort, Ron T.; Guo, Xiuqing; Haiqing, Shen; Hastie, Claire E.; Hofker, Marten H.; Hovingh, G. Kees; Kim, Daniel S.; Kirkland, Susan A.; Klein, Barbara E.; Klein, Ronald; Li, Yun R.; Maiwald, Steffi; Newton-Cheh, Christopher; O’Brien, Eoin T.; Onland-Moret, N. Charlotte; Palmas, Walter; Parsa, Afshin; Penninx, Brenda W.; Pettinger, Mary; Vasan, Ramachandran S.; Ranchalis, Jane E.; M Ridker, Paul; Rose, Lynda M.; Sever, Peter; Shimbo, Daichi; Steele, Laura; Stolk, Ronald P.; Thorand, Barbara; Trip, Mieke D.; van Duijn, Cornelia M.; Verschuren, W. Monique; Wijmenga, Cisca; Wyatt, Sharon; Young, J. Hunter; Zwinderman, Aeilko H.; Bezzina, Connie R.; Boerwinkle, Eric; Casas, Juan P.; Caulfield, Mark J.; Chakravarti, Aravinda; Chasman, Daniel I.; Davidson, Karina W.; Doevendans, Pieter A.; Dominiczak, Anna F.; FitzGerald, Garret A.; Gums, John G.; Fornage, Myriam; Hakonarson, Hakon; Halder, Indrani; Hillege, Hans L.; Illig, Thomas; Jarvik, Gail P.; Johnson, Julie A.; Kastelein, John J.P.; Koenig, Wolfgang; Kumari, Meena; März, Winfried; Murray, Sarah S.; O’Connell, Jeffery R.; Oldehinkel, Albertine J.; Pankow, James S.; Rader, Daniel J.; Redline, Susan; Reilly, Muredach P.; Schadt, Eric E.; Kottke-Marchant, Kandice; Snieder, Harold; Snyder, Michael; Stanton, Alice V.; Tobin, Martin D.; Uitterlinden, André G.; van der Harst, Pim; van der Schouw, Yvonne T.; Samani, Nilesh J.; Watkins, Hugh; Johnson, Andrew D.; Reiner, Alex P.; Zhu, Xiaofeng; de Bakker, Paul I.W.; Levy, Daniel; Asselbergs, Folkert W.; Munroe, Patricia B.; Keating, Brendan J.

2014-01-01

Blood pressure (BP) is a heritable risk factor for cardiovascular disease. To investigate genetic associations with systolic BP (SBP), diastolic BP (DBP), mean arterial pressure (MAP), and pulse pressure (PP), we genotyped ∼50,000 SNPs in up to 87,736 individuals of European ancestry and combined these in a meta-analysis. We replicated findings in an independent set of 68,368 individuals of European ancestry. Our analyses identified 11 previously undescribed associations in independent loci containing 31 genes including PDE1A, HLA-DQB1, CDK6, PRKAG2, VCL, H19, NUCB2, RELA, HOXC@ complex, FBN1, and NFAT5 at the Bonferroni-corrected array-wide significance threshold (p < 6 × 10−7) and confirmed 27 previously reported associations. Bioinformatic analysis of the 11 loci provided support for a putative role in hypertension of several genes, such as CDK6 and NUCB2. Analysis of potential pharmacological targets in databases of small molecules showed that ten of the genes are predicted to be a target for small molecules. In summary, we identified previously unknown loci associated with BP. Our findings extend our understanding of genes involved in BP regulation, which may provide new targets for therapeutic intervention or drug response stratification. PMID:24560520
PrenDB, a Substrate Prediction Database to Enable Biocatalytic Use of Prenyltransferases.

PubMed

Gunera, Jakub; Kindinger, Florian; Li, Shu-Ming; Kolb, Peter

2017-03-10

Prenyltransferases of the dimethylallyltryptophan synthase (DMATS) superfamily catalyze the attachment of prenyl or prenyl-like moieties to diverse acceptor compounds. These acceptor molecules are generally aromatic in nature and mostly indole or indole-like. Their catalytic transformation represents a major skeletal diversification step in the biosynthesis of secondary metabolites, including the indole alkaloids. DMATS enzymes thus contribute significantly to the biological and pharmacological diversity of small molecule metabolites. Understanding the substrate specificity of these enzymes could create opportunities for their biocatalytic use in preparing complex synthetic scaffolds. However, there has been no framework to achieve this in a rational way. Here, we report a chemoinformatic pipeline to enable prenyltransferase substrate prediction. We systematically catalogued 32 unique prenyltransferases and 167 unique substrates to create possible reaction matrices and compiled these data into a browsable database named PrenDB. We then used a newly developed algorithm based on molecular fragmentation to automatically extract reactive chemical epitopes. The analysis of the collected data sheds light on the thus far explored substrate space of DMATS enzymes. To assess the predictive performance of our virtual reaction extraction tool, 38 potential substrates were tested as prenyl acceptors in assays with three prenyltransferases, and we were able to detect turnover in >55% of the cases. The database, PrenDB (www.kolblab.org/prendb.php), enables the prediction of potential substrates for chemoenzymatic synthesis through substructure similarity and virtual chemical transformation techniques. It aims at making prenyltransferases and their highly regio- and stereoselective reactions accessible to the research community for integration in synthetic work flows. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
ANI-1, A data set of 20 million calculated off-equilibrium conformations for organic molecules

NASA Astrophysics Data System (ADS)

Smith, Justin S.; Isayev, Olexandr; Roitberg, Adrian E.

2017-12-01

One of the grand challenges in modern theoretical chemistry is designing and implementing approximations that expedite ab initio methods without loss of accuracy. Machine learning (ML) methods are emerging as a powerful approach to constructing various forms of transferable atomistic potentials. They have been successfully applied in a variety of applications in chemistry, biology, catalysis, and solid-state physics. However, these models are heavily dependent on the quality and quantity of data used in their fitting. Fitting highly flexible ML potentials, such as neural networks, comes at a cost: a vast amount of reference data is required to properly train these models. We address this need by providing access to a large computational DFT database, which consists of more than 20 M off equilibrium conformations for 57,462 small organic molecules. We believe it will become a new standard benchmark for comparison of current and future methods in the ML potential community.
Machine learning of molecular electronic properties in chemical compound space

NASA Astrophysics Data System (ADS)

Montavon, Grégoire; Rupp, Matthias; Gobre, Vivekanand; Vazquez-Mayagoitia, Alvaro; Hansen, Katja; Tkatchenko, Alexandre; Müller, Klaus-Robert; Anatole von Lilienfeld, O.

2013-09-01

The combination of modern scientific computing with electronic structure theory can lead to an unprecedented amount of data amenable to intelligent data analysis for the identification of meaningful, novel and predictive structure-property relationships. Such relationships enable high-throughput screening for relevant properties in an exponentially growing pool of virtual compounds that are synthetically accessible. Here, we present a machine learning model, trained on a database of ab initio calculation results for thousands of organic molecules, that simultaneously predicts multiple electronic ground- and excited-state properties. The properties include atomization energy, polarizability, frontier orbital eigenvalues, ionization potential, electron affinity and excitation energies. The machine learning model is based on a deep multi-task artificial neural network, exploiting the underlying correlations between various molecular properties. The input is identical to ab initio methods, i.e. nuclear charges and Cartesian coordinates of all atoms. For small organic molecules, the accuracy of such a ‘quantum machine’ is similar, and sometimes superior, to modern quantum-chemical methods—at negligible computational cost.
Selection and Biosensor Application of Aptamers for Small Molecules

PubMed Central

Pfeiffer, Franziska; Mayer, Günter

2016-01-01

Small molecules play a major role in the human body and as drugs, toxins, and chemicals. Tools to detect and quantify them are therefore in high demand. This review will give an overview about aptamers interacting with small molecules and their selection. We discuss the current state of the field, including advantages as well as problems associated with their use and possible solutions to tackle these. We then discuss different kinds of small molecule aptamer-based sensors described in literature and their applications, ranging from detecting drinking water contaminations to RNA imaging. PMID:27379229
Methods to enable the design of bioactive small molecules targeting RNA

PubMed Central

Disney, Matthew D.; Yildirim, Ilyas; Childs-Disney, Jessica L.

2014-01-01

RNA is an immensely important target for small molecule therapeutics or chemical probes of function. However, methods that identify, annotate, and optimize RNA-small molecule interactions that could enable the design of compounds that modulate RNA function are in their infancies. This review describes recent approaches that have been developed to understand and optimize RNA motif-small molecule interactions, including Structure-Activity Relationships Through Sequencing (StARTS), quantitative structure-activity relationships (QSAR), chemical similarity searching, structure-based design and docking, and molecular dynamics (MD) simulations. Case studies described include the design of small molecules targeting RNA expansions, the bacterial A-site, viral RNAs, and telomerase RNA. These approaches can be combined to afford a synergistic method to exploit the myriad of RNA targets in the transcriptome. PMID:24357181
Methods to enable the design of bioactive small molecules targeting RNA.

PubMed

Disney, Matthew D; Yildirim, Ilyas; Childs-Disney, Jessica L

2014-02-21

RNA is an immensely important target for small molecule therapeutics or chemical probes of function. However, methods that identify, annotate, and optimize RNA-small molecule interactions that could enable the design of compounds that modulate RNA function are in their infancies. This review describes recent approaches that have been developed to understand and optimize RNA motif-small molecule interactions, including structure-activity relationships through sequencing (StARTS), quantitative structure-activity relationships (QSAR), chemical similarity searching, structure-based design and docking, and molecular dynamics (MD) simulations. Case studies described include the design of small molecules targeting RNA expansions, the bacterial A-site, viral RNAs, and telomerase RNA. These approaches can be combined to afford a synergistic method to exploit the myriad of RNA targets in the transcriptome.
Harnessing Connectivity in a Large-Scale Small-Molecule Sensitivity Dataset.

PubMed

Seashore-Ludlow, Brinton; Rees, Matthew G; Cheah, Jaime H; Cokol, Murat; Price, Edmund V; Coletti, Matthew E; Jones, Victor; Bodycombe, Nicole E; Soule, Christian K; Gould, Joshua; Alexander, Benjamin; Li, Ava; Montgomery, Philip; Wawer, Mathias J; Kuru, Nurdan; Kotz, Joanne D; Hon, C Suk-Yee; Munoz, Benito; Liefeld, Ted; Dančík, Vlado; Bittker, Joshua A; Palmer, Michelle; Bradner, James E; Shamji, Alykhan F; Clemons, Paul A; Schreiber, Stuart L

2015-11-01

Identifying genetic alterations that prime a cancer cell to respond to a particular therapeutic agent can facilitate the development of precision cancer medicines. Cancer cell-line (CCL) profiling of small-molecule sensitivity has emerged as an unbiased method to assess the relationships between genetic or cellular features of CCLs and small-molecule response. Here, we developed annotated cluster multidimensional enrichment analysis to explore the associations between groups of small molecules and groups of CCLs in a new, quantitative sensitivity dataset. This analysis reveals insights into small-molecule mechanisms of action, and genomic features that associate with CCL response to small-molecule treatment. We are able to recapitulate known relationships between FDA-approved therapies and cancer dependencies and to uncover new relationships, including for KRAS-mutant cancers and neuroblastoma. To enable the cancer community to explore these data, and to generate novel hypotheses, we created an updated version of the Cancer Therapeutic Response Portal (CTRP v2). We present the largest CCL sensitivity dataset yet available, and an analysis method integrating information from multiple CCLs and multiple small molecules to identify CCL response predictors robustly. We updated the CTRP to enable the cancer research community to leverage these data and analyses. ©2015 American Association for Cancer Research.
[Hyponatremia associated with SSRI/NRSI: Descriptive and comparative epidemiological study of the incidence rates of the notified cases from the data of the French National Pharmacovigilance Database and the French National Health Insurance].

PubMed

Revol, R; Rault, C; Polard, E; Bellet, F; Guy, C

2018-06-01

Selective Serotonin Reuptake Inhibitors (SSRIs) and Serotonin-Norepinephrine Reuptake Inhibitors (SNRIs) are frequently prescribed. These antidepressants can potentially induce serious hyponatremia through the SIADH syndrome. That seems to concern all molecules of these classes but the individual risk of each molecule is not well known. The aims of the study were to compare the incidence rate of each molecule in order to identify the existence of molecules more at risk of inducing hyponatremia and to characterize a profile of patients at risk for hyponatremia during a treatment with a SSRI or a SNRI. The cases of hyponatremia under SSRI/SNRI were extracted from the French pharmacovigilance database (BPNV). The exposition to the different SSRIs/SNRIs in the French population was estimated from the French National Health Insurance database (SNIIRAM) using a sampled database (Echantillon Généralistes des Bénéficiaires). The study ran from 01/01/2011 to 31/12/2013. The primary study endpoint was the incidence rate of notifications of the hyponatremia cases in patients treated by SSRI/SNRI and recorded into the BNPV database, related to the average annual number of corresponding treatments initiated during the same period. The number of cases of hyponatremia included in the study was 169 for 3 749 800 adult patients initiating treatment. The incidence rate of cases was 1.64 for 100 000 persons per year (PY). The standardized incidence rates between the different molecules showed no difference except for duloxetine (2.79/100 000 PY p > 0.03). Identified risk factors were age, with a large increase of incidence rate from 75 years old (incidence 12.5 higher) and female gender. Comparison of the incidence rates from spontaneous reports indicates a greater risk of hyponatremia for duloxetine for 2011-2013. This result needs to be confirmed by other studies. The advanced age and female sex are risk factors, irrespective of the molecule. Copyright © 2017 L'Encéphale, Paris. Published by Elsevier Masson SAS. All rights reserved.
Analysis and hit filtering of a very large library of compounds screened against Mycobacterium tuberculosis.

PubMed

Ekins, Sean; Kaneko, Takushi; Lipinski, Christopher A; Bradford, Justin; Dole, Krishna; Spektor, Anna; Gregory, Kellan; Blondeau, David; Ernst, Sylvia; Yang, Jeremy; Goncharoff, Nicko; Hohman, Moses M; Bunin, Barry A

2010-11-01

There is an urgent need for new drugs against tuberculosis which annually claims 1.7-1.8 million lives. One approach to identify potential leads is to screen in vitro small molecules against Mycobacterium tuberculosis (Mtb). Until recently there was no central repository to collect information on compounds screened. Consequently, it has been difficult to analyze molecular properties of compounds that inhibit the growth of Mtb in vitro. We have collected data from publically available sources on over 300 000 small molecules deposited in the Collaborative Drug Discovery TB Database. A cheminformatics analysis on these compounds indicates that inhibitors of the growth of Mtb have statistically higher mean logP, rule of 5 alerts, while also having lower HBD count, atom count and lower PSA (ChemAxon descriptors), compared to compounds that are classed as inactive. Additionally, Bayesian models for selecting Mtb active compounds were evaluated with over 100 000 compounds and, they demonstrated 10 fold enrichment over random for the top ranked 600 compounds. This represents a promising approach for finding compounds active against Mtb in whole cells screened under the same in vitro conditions. Various sets of Mtb hit molecules were also examined by various filtering rules used widely in the pharmaceutical industry to identify compounds with potentially reactive moieties. We found differences between the number of compounds flagged by these rules in Mtb datasets, malaria hits, FDA approved drugs and antibiotics. Combining these approaches may enable selection of compounds with increased probability of inhibition of whole cell Mtb activity.
Discovering H-bonding rules in crystals with inductive logic programming.

PubMed

Ando, Howard Y; Dehaspe, Luc; Luyten, Walter; Van Craenenbroeck, Elke; Vandecasteele, Henk; Van Meervelt, Luc

2006-01-01

In the domain of crystal engineering, various schemes have been proposed for the classification of hydrogen bonding (H-bonding) patterns observed in 3D crystal structures. In this study, the aim is to complement these schemes with rules that predict H-bonding in crystals from 2D structural information only. Modern computational power and the advances in inductive logic programming (ILP) can now provide computational chemistry with the opportunity for extracting structure-specific rules from large databases that can be incorporated into expert systems. ILP technology is here applied to H-bonding in crystals to develop a self-extracting expert system utilizing data in the Cambridge Structural Database of small molecule crystal structures. A clear increase in performance was observed when the ILP system DMax was allowed to refer to the local structural environment of the possible H-bond donor/acceptor pairs. This ability distinguishes ILP from more traditional approaches that build rules on the basis of global molecular properties.
FragariaCyc: A Metabolic Pathway Database for Woodland Strawberry Fragaria vesca

PubMed Central

Naithani, Sushma; Partipilo, Christina M.; Raja, Rajani; Elser, Justin L.; Jaiswal, Pankaj

2016-01-01

FragariaCyc is a strawberry-specific cellular metabolic network based on the annotated genome sequence of Fragaria vesca L. ssp. vesca, accession Hawaii 4. It was built on the Pathway-Tools platform using MetaCyc as the reference. The experimental evidences from published literature were used for supporting/editing existing entities and for the addition of new pathways, enzymes, reactions, compounds, and small molecules in the database. To date, FragariaCyc comprises 66 super-pathways, 488 unique pathways, 2348 metabolic reactions, 3507 enzymes, and 2134 compounds. In addition to searching and browsing FragariaCyc, researchers can compare pathways across various plant metabolic networks and analyze their data using Omics Viewer tool. We view FragariaCyc as a resource for the community of researchers working with strawberry and related fruit crops. It can help understanding the regulation of overall metabolism of strawberry plant during development and in response to diseases and abiotic stresses. FragariaCyc is available online at http://pathways.cgrb.oregonstate.edu. PMID:26973684
The Cambridge Structural Database in retrospect and prospect.

PubMed

Groom, Colin R; Allen, Frank H

2014-01-13

The Cambridge Crystallographic Data Centre (CCDC) was established in 1965 to record numerical, chemical and bibliographic data relating to published organic and metal-organic crystal structures. The Cambridge Structural Database (CSD) now stores data for nearly 700,000 structures and is a comprehensive and fully retrospective historical archive of small-molecule crystallography. Nearly 40,000 new structures are added each year. As X-ray crystallography celebrates its centenary as a subject, and the CCDC approaches its own 50th year, this article traces the origins of the CCDC as a publicly funded organization and its onward development into a self-financing charitable institution. Principally, however, we describe the growth of the CSD and its extensive associated software system, and summarize its impact and value as a basis for research in structural chemistry, materials science and the life sciences, including drug discovery and drug development. Finally, the article considers the CCDC's funding model in relation to open access and open data paradigms. Copyright © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Novel hits for acetylcholinesterase inhibition derived by docking-based screening on ZINC database.

PubMed

Doytchinova, Irini; Atanasova, Mariyana; Valkova, Iva; Stavrakov, Georgi; Philipova, Irena; Zhivkova, Zvetanka; Zheleva-Dimitrova, Dimitrina; Konstantinov, Spiro; Dimitrov, Ivan

2018-12-01

The inhibition of the enzyme acetylcholinesterase (AChE) increases the levels of the neurotransmitter acetylcholine and symptomatically improves the affected cognitive function. In the present study, we searched for novel AChE inhibitors by docking-based virtual screening of the standard lead-like set of ZINC database containing more than 6 million small molecules using GOLD software. The top 10 best-scored hits were tested in vitro for AChE affinity, neurotoxicity, GIT and BBB permeability. The main pharmacokinetic parameters like volume of distribution, free fraction in plasma, total clearance, and half-life were predicted by previously derived models. Nine of the compounds bind to the enzyme with affinities from 0.517 to 0.735 µM, eight of them are non-toxic. All hits permeate GIT and BBB and bind extensively to plasma proteins. Most of them are low-clearance compounds. In total, seven of the 10 hits are promising for further lead optimisation. These are structures with ZINC IDs: 00220177, 44455618, 66142300, 71804814, 72065926, 96007907, and 97159977.
New spectroscopy in the HITRAN2016 database and its impact on atmospheric retrievals

NASA Astrophysics Data System (ADS)

Gordon, I.; Rothman, L. S.; Kochanov, R. V.; Tan, Y.; Toon, G. C.

2017-12-01

The HITRAN spectroscopic database is a backbone of the interpretation of spectral atmospheric retrievals and is an important input to the radiative transfer codes. The database is serving the atmospheric community for nearly half-a-century with every new edition being released every four years. The most recent release of the database is HITRAN2016 [1]. It consists of line-by-line lists, experimental absorption cross-sections, collision-induced absorption data and aerosol indices of refraction. In this presentation it will be stressed the importance of using the most recent edition of the database in the radiative transfer codes. The line-by-line lists for most of the HITRAN molecules were updated (and two new molecules added) in comparison with the previous compilation HITRAN2012 [2] that has been in use, along with some intermediate updates, since 2012. The extent of the updates ranges from updating a few lines of certain molecules to complete replacements of the lists and introduction of additional isotopologues. In addition, the amount of molecules in cross-sectional part of the database has increased dramatically from nearly 50 to over 300. The molecules covered by the HITRAN database are important in planetary remote sensing, environment monitoring (in particular, biomass burning detection), climate applications, industrial pollution tracking, atrophysics, and more. Taking advantage of the new structure and interface available at www.hitran.org [3] and the HITRAN Application Programming Interface [4] the amount of parameters has also been significantly increased, now incorporating, for instance, non-Voigt line profiles [5]; broadening by gases other than air and "self" [6]; and other phenomena, including line mixing. This is a very important novelty that needs to be properly introduced in the radiative transfer codes in order to advance accurate interpretation of the remote sensing retrievals. This work is supported by the NASA PDART (NNX16AG51G) and AURA (NNX 17AI78G) programs. References[1] I.E. Gordon et al, JQSRT in press (2017) http://doi.org/10.1016/j.jqsrt.2017.06.038. [2] L.S. Rothman et al, JQSRT 130, 4 (2013). [3] C. Hill et al, JQSRT 177, 4 (2016). [4] R.V. Kochanov et al, JQSRT 177, 15 (2016). [5] P. Wcisło et al., JQSRT 177, 75 (2016). [6] J. S. Wilzewski et al., JQSRT 168, 193 (2016).

Database for chemical weapons detection: first results

NASA Astrophysics Data System (ADS)

Bellecci, C.; Gaudio, P.; Gelfusa, M.; Martellucci, S.; Richetta, M.; Ventura, P.; Antonucci, A.; Pasquino, F.; Ricci, V.; Sassolini, A.

2008-10-01

The quick increase of terrorism and asymmetric war is leading towards new needs involving defense and security. Nowadays we have to fight several kind of threats and use of chemical weapons against civil or military objectives is one of the most dangerous. For this reason it is necessary to find equipment, know-how and information that are useful in order to detect and identify dangerous molecules as quickly and far away as possible, so to minimize damage. Lidar/Dial are some of the most powerful optical technologies. Dial technology use two different wavelengths, in order to measure concentration profile of an investigated molecule. For this reason it is needed a "fingerprint" database which consists of an exhaustive collection of absorption coefficients data so to identify each molecule avoiding confusion with interfering ones. Nowadays there is not such a collection of data in scientific and technical literature. We used an FT-IR spectrometer and a CO2 laser source for absorption spectroscopy measurements using cells filled with the investigated molecules. The CO2 source is the transmitter of our DIAL facility. In this way we can make a proper "fingerprint" database necessary to identify dangerous molecules. The CO2 laser has been chosen because it is eye safe and, mainly, because it covers a spectral band where there is good absorption for this kind of molecules. In this paper IR spectra of mustard will be presented and compared to other substances which may interfere producing a false alarm. Methodology, experimental setup and first results are described.
Identification of a Broad-Spectrum Antiviral Small Molecule against Severe Acute Respiratory Syndrome Coronavirus and Ebola, Hendra, and Nipah Viruses by Using a Novel High-Throughput Screening Assay

PubMed Central

Elshabrawy, Hatem A.; Fan, Jilao; Haddad, Christine S.; Ratia, Kiira; Broder, Christopher C.; Caffrey, Michael

2014-01-01

ABSTRACT Severe acute respiratory syndrome coronavirus (SARS-CoV) and Ebola, Hendra, and Nipah viruses are members of different viral families and are known causative agents of fatal viral diseases. These viruses depend on cathepsin L for entry into their target cells. The viral glycoproteins need to be primed by protease cleavage, rendering them active for fusion with the host cell membrane. In this study, we developed a novel high-throughput screening assay based on peptides, derived from the glycoproteins of the aforementioned viruses, which contain the cathepsin L cleavage site. We screened a library of 5,000 small molecules and discovered a small molecule that can inhibit the cathepsin L cleavage of all viral peptides with minimal inhibition of cleavage of a host protein-derived peptide (pro-neuropeptide Y). The small molecule inhibited the entry of all pseudotyped viruses in vitro and the cleavage of SARS-CoV spike glycoprotein in an in vitro cleavage assay. In addition, the Hendra and Nipah virus fusion glycoproteins were not cleaved in the presence of the small molecule in a cell-based cleavage assay. Furthermore, we demonstrate that the small molecule is a mixed inhibitor of cathepsin L. Our broad-spectrum antiviral small molecule appears to be an ideal candidate for future optimization and development into a potent antiviral against SARS-CoV and Ebola, Hendra, and Nipah viruses. IMPORTANCE We developed a novel high-throughput screening assay to identify small molecules that can prevent cathepsin L cleavage of viral glycoproteins derived from SARS-CoV and Ebola, Hendra, and Nipah viruses that are required for their entry into the host cell. We identified a novel broad-spectrum small molecule that could block cathepsin L-mediated cleavage and thus inhibit the entry of pseudotypes bearing the glycoprotein derived from SARS-CoV or Ebola, Hendra, or Nipah virus. The small molecule can be further optimized and developed into a potent broad-spectrum antiviral drug. PMID:24501399
Identification of a broad-spectrum antiviral small molecule against severe acute respiratory syndrome coronavirus and Ebola, Hendra, and Nipah viruses by using a novel high-throughput screening assay.

PubMed

Elshabrawy, Hatem A; Fan, Jilao; Haddad, Christine S; Ratia, Kiira; Broder, Christopher C; Caffrey, Michael; Prabhakar, Bellur S

2014-04-01

Severe acute respiratory syndrome coronavirus (SARS-CoV) and Ebola, Hendra, and Nipah viruses are members of different viral families and are known causative agents of fatal viral diseases. These viruses depend on cathepsin L for entry into their target cells. The viral glycoproteins need to be primed by protease cleavage, rendering them active for fusion with the host cell membrane. In this study, we developed a novel high-throughput screening assay based on peptides, derived from the glycoproteins of the aforementioned viruses, which contain the cathepsin L cleavage site. We screened a library of 5,000 small molecules and discovered a small molecule that can inhibit the cathepsin L cleavage of all viral peptides with minimal inhibition of cleavage of a host protein-derived peptide (pro-neuropeptide Y). The small molecule inhibited the entry of all pseudotyped viruses in vitro and the cleavage of SARS-CoV spike glycoprotein in an in vitro cleavage assay. In addition, the Hendra and Nipah virus fusion glycoproteins were not cleaved in the presence of the small molecule in a cell-based cleavage assay. Furthermore, we demonstrate that the small molecule is a mixed inhibitor of cathepsin L. Our broad-spectrum antiviral small molecule appears to be an ideal candidate for future optimization and development into a potent antiviral against SARS-CoV and Ebola, Hendra, and Nipah viruses. We developed a novel high-throughput screening assay to identify small molecules that can prevent cathepsin L cleavage of viral glycoproteins derived from SARS-CoV and Ebola, Hendra, and Nipah viruses that are required for their entry into the host cell. We identified a novel broad-spectrum small molecule that could block cathepsin L-mediated cleavage and thus inhibit the entry of pseudotypes bearing the glycoprotein derived from SARS-CoV or Ebola, Hendra, or Nipah virus. The small molecule can be further optimized and developed into a potent broad-spectrum antiviral drug.
In Silico Identification and Experimental Validation of Novel Anti-Alzheimer's Multitargeted Ligands from a Marine Source Featuring a "2-Aminoimidazole plus Aromatic Group" Scaffold.

PubMed

Vitale, Rosa Maria; Rispoli, Vincenzo; Desiderio, Doriana; Sgammato, Roberta; Thellung, Stefano; Canale, Claudio; Vassalli, Massimo; Carbone, Marianna; Ciavatta, Maria Letizia; Mollo, Ernesto; Felicità, Vera; Arcone, Rosaria; Gavagnin Capoggiani, Margherita; Masullo, Mariorosario; Florio, Tullio; Amodeo, Pietro

2018-03-07

Multitargeting or polypharmacological approaches, looking for single chemical entities retaining the ability to bind two or more molecular targets, are a potentially powerful strategy to fight complex, multifactorial pathologies. Unfortunately, the search for multiligand agents is challenging because only a small subset of molecules contained in molecular databases are bioactive and even fewer are active on a preselected set of multiple targets. However, collections of natural compounds feature a significantly higher fraction of bioactive molecules than synthetic ones. In this view, we searched our library of 1175 natural compounds from marine sources for molecules including a 2-aminoimidazole+aromatic group motif, found in known compounds active on single relevant targets for Alzheimer's disease (AD). This identified two molecules, a pseudozoanthoxanthin (1) and a bromo-pyrrole alkaloid (2), which were predicted by a computational approach to possess interesting multitarget profiles on AD target proteins. Biochemical assays experimentally confirmed their biological activities. The two compounds inhibit acetylcholinesterase, butyrylcholinesterase, and β-secretase enzymes in high- to sub-micromolar range. They are also able to prevent and revert β-amyloid (Aβ) aggregation of both Aβ 1-40 and Aβ 1-42 peptides, with 1 being more active than 2. Preliminary in vivo studies suggest that compound 1 is able to restore cholinergic cortico-hippocampal functional connectivity.
The Mid-Infrared Absorption Spectra of Neutral PAHs in Dense Interstellar Clouds

NASA Technical Reports Server (NTRS)

Bernstein, M. P.; Sandford, S. A.; Allamandola, L. J.

2005-01-01

Polycyclic aromatic hydrocarbons (PAHs) are common throughout the universe and are expected to be present in dense interstellar clouds. In these environments, some P.4Hs may be present in the gas phase, but most should be frozen into ice mantles or adsorbed onto dust grains and their spectral features are expected to be seen in absorption. Here we extend our previous work on the infrared spectral properties of the small PAH naphthalene (C10H8) in several media to include the full mid-infrared laboratory spectra of 11 other PAHs and related aromatic species frozen in H2O ices. These include the molecules 1,2-dihydronaphthalene, anthracene, 9,1O-dihydroanthracene, phenanthrene, pyrene, benzo[e]pyrene, perylene, benzo(k)fluoranthene, pentacene, benzo[ghi]perylene, and coronene. These results demonstrate that PAHs and related molecules, as a class, show the same spectral behaviors as naphthalene when incorporated into H2O-rich matrices. When compared to the spectra of these same molecules isolated in inert matrices (e.g., Ar or N2), the absorption bands produced when they are frozen in H2O matrices are broader (factors of 3-10), show small position shifts in either direction (usually < 4/cm, always < 10/cm), and show variable changes in relative band strengths (typically factors of 1-3). There is no evidence of systematic increases or decreases in the absolute strengths of the bands of these molecules when they are incorporated in H2O matrices. In H2O-rich ices, their absorption bands are relatively insensitive to concentration over the range of 10 < H2O/PAH < 200): The absorption bands of these molecules are also insensitive to temperature over the 10 K < T < 125 K range, although the spectra can show dramatic changes as the ices are warmed through the temperature range in which amorphous H2O ice converts to its cubic and hexagonal crystalline forms (T > 125 Kj. Given the small observed band shifts cause by H2O, the current database of spectra from Ar matrix-isolated neutral PAHs and related molecules should be useful for the search for these species in dense clouds on the basis of observed absorption band positions. Furthermore, these data permit determination of column densities to better than a factor of 3 for PAHs in dense clouds. Column density determination of detected aromatics to better than a factor of 3 will, however, require good knowledge about the nature of the matrix in which the PAH is embedded and laboratory studies of relevant samples.
Recent Developments in β-Cell Differentiation of Pluripotent Stem Cells Induced by Small and Large Molecules

PubMed Central

Kumar, S. Suresh; Alarfaj, Abdullah A.; Munusamy, Murugan A.; Singh, A. J. A. Ranjith; Peng, I-Chia; Priya, Sivan Padma; Hamat, Rukman Awang; Higuchi, Akon

2014-01-01

Human pluripotent stem cells, including human embryonic stem cells (hESCs) and human induced pluripotent stem cells (hiPSCs), hold promise as novel therapeutic tools for diabetes treatment because of their self-renewal capacity and ability to differentiate into beta (β)-cells. Small and large molecules play important roles in each stage of β-cell differentiation from both hESCs and hiPSCs. The small and large molecules that are described in this review have significantly advanced efforts to cure diabetic disease. Lately, effective protocols have been implemented to induce hESCs and human mesenchymal stem cells (hMSCs) to differentiate into functional β-cells. Several small molecules, proteins, and growth factors promote pancreatic differentiation from hESCs and hMSCs. These small molecules (e.g., cyclopamine, wortmannin, retinoic acid, and sodium butyrate) and large molecules (e.g. activin A, betacellulin, bone morphogentic protein (BMP4), epidermal growth factor (EGF), fibroblast growth factor (FGF), keratinocyte growth factor (KGF), hepatocyte growth factor (HGF), noggin, transforming growth factor (TGF-α), and WNT3A) are thought to contribute from the initial stages of definitive endoderm formation to the final stages of maturation of functional endocrine cells. We discuss the importance of such small and large molecules in uniquely optimized protocols of β-cell differentiation from stem cells. A global understanding of various small and large molecules and their functions will help to establish an efficient protocol for β-cell differentiation. PMID:25526563
High Throughput, Label-free Screening Small Molecule Compound Libraries for Protein-Ligands using Combination of Small Molecule Microarrays and a Special Ellipsometry-based Optical Scanner.

PubMed

Landry, James P; Fei, Yiyan; Zhu, X D

2011-12-01

Small-molecule compounds remain the major source of therapeutic and preventative drugs. Developing new drugs against a protein target often requires screening large collections of compounds with diverse structures for ligands or ligand fragments that exhibit sufficiently affinity and desirable inhibition effect on the target before further optimization and development. Since the number of small molecule compounds is large, high-throughput screening (HTS) methods are needed. Small-molecule microarrays (SMM) on a solid support in combination with a suitable binding assay form a viable HTS platform. We demonstrate that by combining an oblique-incidence reflectivity difference optical scanner with SMM we can screen 10,000 small-molecule compounds on a single glass slide for protein ligands without fluorescence labeling. Furthermore using such a label-free assay platform we can simultaneously acquire binding curves of a solution-phase protein to over 10,000 immobilized compounds, thus enabling full characterization of protein-ligand interactions over a wide range of affinity constants.
PeTMbase: A Database of Plant Endogenous Target Mimics (eTMs).

PubMed

Karakülah, Gökhan; Yücebilgili Kurtoğlu, Kuaybe; Unver, Turgay

2016-01-01

MicroRNAs (miRNA) are small endogenous RNA molecules, which regulate target gene expression at post-transcriptional level. Besides, miRNA activity can be controlled by a newly discovered regulatory mechanism called endogenous target mimicry (eTM). In target mimicry, eTMs bind to the corresponding miRNAs to block the binding of specific transcript leading to increase mRNA expression. Thus, miRNA-eTM-target-mRNA regulation modules involving a wide range of biological processes; an increasing need for a comprehensive eTM database arose. Except miRSponge with limited number of Arabidopsis eTM data no available database and/or repository was developed and released for plant eTMs yet. Here, we present an online plant eTM database, called PeTMbase (http://petmbase.org), with a highly efficient search tool. To establish the repository a number of identified eTMs was obtained utilizing from high-throughput RNA-sequencing data of 11 plant species. Each transcriptome libraries is first mapped to corresponding plant genome, then long non-coding RNA (lncRNA) transcripts are characterized. Furthermore, additional lncRNAs retrieved from GREENC and PNRD were incorporated into the lncRNA catalog. Then, utilizing the lncRNA and miRNA sources a total of 2,728 eTMs were successfully predicted. Our regularly updated database, PeTMbase, provides high quality information regarding miRNA:eTM modules and will aid functional genomics studies particularly, on miRNA regulatory networks.
Delivery of small molecules for bone regenerative engineering: preclinical studies and potential clinical applications.

PubMed

Laurencin, Cato T; Ashe, Keshia M; Henry, Nicole; Kan, Ho Man; Lo, Kevin W-H

2014-06-01

Stimulation of bone regeneration using growth factors is a promising approach for musculoskeletal regenerative engineering. However, common limitations with protein growth factors, such as high manufacturing costs, protein instability, contamination issues, and unwanted immunogenic responses of the host reduce potential clinical applications. New strategies for bone regeneration that involve inexpensive and stable small molecules can obviate these problems and have a significant impact on the treatment of skeletal injury and diseases. Over the past decade, a large number of small molecules with the potential of regenerating skeletal tissue have been reported in the literature. Here, we review this literature, paying specific attention to the prospects for small molecule-based bone-regenerative engineering. We also review the preclinical study of small molecules associated with bone regeneration. Copyright © 2014 Elsevier Ltd. All rights reserved.
The 2015 edition of the GEISA spectroscopic database

NASA Astrophysics Data System (ADS)

Jacquinet-Husson, N.; Armante, R.; Scott, N. A.; Chédin, A.; Crépeau, L.; Boutammine, C.; Bouhdaoui, A.; Crevoisier, C.; Capelle, V.; Boonne, C.; Poulet-Crovisier, N.; Barbe, A.; Chris Benner, D.; Boudon, V.; Brown, L. R.; Buldyreva, J.; Campargue, A.; Coudert, L. H.; Devi, V. M.; Down, M. J.; Drouin, B. J.; Fayt, A.; Fittschen, C.; Flaud, J.-M.; Gamache, R. R.; Harrison, J. J.; Hill, C.; Hodnebrog, Ø.; Hu, S.-M.; Jacquemart, D.; Jolly, A.; Jiménez, E.; Lavrentieva, N. N.; Liu, A.-W.; Lodi, L.; Lyulin, O. M.; Massie, S. T.; Mikhailenko, S.; Müller, H. S. P.; Naumenko, O. V.; Nikitin, A.; Nielsen, C. J.; Orphal, J.; Perevalov, V. I.; Perrin, A.; Polovtseva, E.; Predoi-Cross, A.; Rotger, M.; Ruth, A. A.; Yu, S. S.; Sung, K.; Tashkun, S. A.; Tennyson, J.; Tyuterev, Vl. G.; Vander Auwera, J.; Voronin, B. A.; Makie, A.

2016-09-01

The GEISA database (Gestion et Etude des Informations Spectroscopiques Atmosphériques: Management and Study of Atmospheric Spectroscopic Information) has been developed and maintained by the http://ara.abct.lmd.polytechnique.fr. The "line parameters database" contains 52 molecular species (118 isotopologues) and transitions in the spectral range from 10-6 to 35,877.031 cm-1, representing 5,067,351 entries, against 3,794,297 in GEISA-2011. Among the previously existing molecules, 20 molecular species have been updated. A new molecule (SO3) has been added. HDO, isotopologue of H2O, is now identified as an independent molecular species. Seven new isotopologues have been added to the GEISA-2015 database. The "cross section sub-database" has been enriched by the addition of 43 new molecular species in its infrared part, 4 molecules (ethane, propane, acetone, acetonitrile) are also updated; they represent 3% of the update. A new section is added, in the near-infrared spectral region, involving 7 molecular species: CH3CN, CH3I, CH3O2, H2CO, HO2, HONO, NH3. The "microphysical and optical properties of atmospheric aerosols sub-database" has been updated for the first time since 2003. It contains more than 40 species originating from NCAR and 20 from the http://eodg.atm.ox.ac.uk/ARIA/introduction_nocol.html. As for the previous versions, this new release of GEISA and associated management software facilities are implemented and freely accessible on the http://cds-espri.ipsl.fr/etherTypo/?id=950.
Small molecule annotation for the Protein Data Bank

PubMed Central

Sen, Sanchayita; Young, Jasmine; Berrisford, John M.; Chen, Minyu; Conroy, Matthew J.; Dutta, Shuchismita; Di Costanzo, Luigi; Gao, Guanghua; Ghosh, Sutapa; Hudson, Brian P.; Igarashi, Reiko; Kengaku, Yumiko; Liang, Yuhe; Peisach, Ezra; Persikova, Irina; Mukhopadhyay, Abhik; Narayanan, Buvaneswari Coimbatore; Sahni, Gaurav; Sato, Junko; Sekharan, Monica; Shao, Chenghua; Tan, Lihua; Zhuravleva, Marina A.

2014-01-01

The Protein Data Bank (PDB) is the single global repository for three-dimensional structures of biological macromolecules and their complexes, and its more than 100 000 structures contain more than 20 000 distinct ligands or small molecules bound to proteins and nucleic acids. Information about these small molecules and their interactions with proteins and nucleic acids is crucial for our understanding of biochemical processes and vital for structure-based drug design. Small molecules present in a deposited structure may be attached to a polymer or may occur as a separate, non-covalently linked ligand. During curation of a newly deposited structure by wwPDB annotation staff, each molecule is cross-referenced to the PDB Chemical Component Dictionary (CCD). If the molecule is new to the PDB, a dictionary description is created for it. The information about all small molecule components found in the PDB is distributed via the ftp archive as an external reference file. Small molecule annotation in the PDB also includes information about ligand-binding sites and about covalent and other linkages between ligands and macromolecules. During the remediation of the peptide-like antibiotics and inhibitors present in the PDB archive in 2011, it became clear that additional annotation was required for consistent representation of these molecules, which are quite often composed of several sequential subcomponents including modified amino acids and other chemical groups. The connectivity information of the modified amino acids is necessary for correct representation of these biologically interesting molecules. The combined information is made available via a new resource called the Biologically Interesting molecules Reference Dictionary, which is complementary to the CCD and is now routinely used for annotation of peptide-like antibiotics and inhibitors. PMID:25425036
Small molecule annotation for the Protein Data Bank.

PubMed

Sen, Sanchayita; Young, Jasmine; Berrisford, John M; Chen, Minyu; Conroy, Matthew J; Dutta, Shuchismita; Di Costanzo, Luigi; Gao, Guanghua; Ghosh, Sutapa; Hudson, Brian P; Igarashi, Reiko; Kengaku, Yumiko; Liang, Yuhe; Peisach, Ezra; Persikova, Irina; Mukhopadhyay, Abhik; Narayanan, Buvaneswari Coimbatore; Sahni, Gaurav; Sato, Junko; Sekharan, Monica; Shao, Chenghua; Tan, Lihua; Zhuravleva, Marina A

2014-01-01

The Protein Data Bank (PDB) is the single global repository for three-dimensional structures of biological macromolecules and their complexes, and its more than 100,000 structures contain more than 20,000 distinct ligands or small molecules bound to proteins and nucleic acids. Information about these small molecules and their interactions with proteins and nucleic acids is crucial for our understanding of biochemical processes and vital for structure-based drug design. Small molecules present in a deposited structure may be attached to a polymer or may occur as a separate, non-covalently linked ligand. During curation of a newly deposited structure by wwPDB annotation staff, each molecule is cross-referenced to the PDB Chemical Component Dictionary (CCD). If the molecule is new to the PDB, a dictionary description is created for it. The information about all small molecule components found in the PDB is distributed via the ftp archive as an external reference file. Small molecule annotation in the PDB also includes information about ligand-binding sites and about covalent and other linkages between ligands and macromolecules. During the remediation of the peptide-like antibiotics and inhibitors present in the PDB archive in 2011, it became clear that additional annotation was required for consistent representation of these molecules, which are quite often composed of several sequential subcomponents including modified amino acids and other chemical groups. The connectivity information of the modified amino acids is necessary for correct representation of these biologically interesting molecules. The combined information is made available via a new resource called the Biologically Interesting molecules Reference Dictionary, which is complementary to the CCD and is now routinely used for annotation of peptide-like antibiotics and inhibitors. © The Author(s) 2014. Published by Oxford University Press.
Developing an Efficient and General Strategy for Immobilization of Small Molecules onto Microarrays Using Isocyanate Chemistry.

PubMed

Zhu, Chenggang; Zhu, Xiangdong; Landry, James P; Cui, Zhaomeng; Li, Quanfu; Dang, Yongjun; Mi, Lan; Zheng, Fengyun; Fei, Yiyan

2016-03-16

Small-molecule microarray (SMM) is an effective platform for identifying lead compounds from large collections of small molecules in drug discovery, and efficient immobilization of molecular compounds is a pre-requisite for the success of such a platform. On an isocyanate functionalized surface, we studied the dependence of immobilization efficiency on chemical residues on molecular compounds, terminal residues on isocyanate functionalized surface, lengths of spacer molecules, and post-printing treatment conditions, and we identified a set of optimized conditions that enable us to immobilize small molecules with significantly improved efficiencies, particularly for those molecules with carboxylic acid residues that are known to have low isocyanate reactivity. We fabricated microarrays of 3375 bioactive compounds on isocyanate functionalized glass slides under these optimized conditions and confirmed that immobilization percentage is over 73%.
Cell-targetable DNA nanocapsules for spatiotemporal release of caged bioactive small molecules

NASA Astrophysics Data System (ADS)

Veetil, Aneesh T.; Chakraborty, Kasturi; Xiao, Kangni; Minter, Myles R.; Sisodia, Sangram S.; Krishnan, Yamuna

2017-12-01

Achieving triggered release of small molecules with spatial and temporal precision at designated cells within an organism remains a challenge. By combining a cell-targetable, icosahedral DNA-nanocapsule loaded with photoresponsive polymers, we show cytosolic delivery of small molecules with the spatial resolution of single endosomes in specific cells in Caenorhabditis elegans. Our technology can report on the extent of small molecules released after photoactivation as well as pinpoint the location at which uncaging of the molecules occurred. We apply this technology to release dehydroepiandrosterone (DHEA), a neurosteroid that promotes neurogenesis and neuron survival, and determined the timescale of neuronal activation by DHEA, using light-induced release of DHEA from targeted DNA nanocapsules. Importantly, sequestration inside the DNA capsule prevents photocaged DHEA from activating neurons prematurely. Our methodology can in principle be generalized to diverse neurostimulatory molecules.
Exploring biology with small organic molecules

PubMed Central

Stockwell, Brent R.

2011-01-01

Small organic molecules have proven to be invaluable tools for investigating biological systems, but there is still much to learn from their use. To discover and to use more effectively new chemical tools to understand biology, strategies are needed that allow us to systematically explore ‘biological-activity space’. Such strategies involve analysing both protein binding of, and phenotypic responses to, small organic molecules. The mapping of biological-activity space using small molecules is akin to mapping the stars — uncharted territory is explored using a system of coordinates that describes where each new feature lies. PMID:15602550
Morphological study on small molecule acceptor-based organic solar cells with efficiencies beyond 7% (Presentation Recording)

NASA Astrophysics Data System (ADS)

Ma, Wei; Yan, He

2015-10-01

Despite the essential role of fullerenes in achieving best-performance organic solar cells (OSCs), fullerene acceptors have several drawbacks including poor light absorption, high-cost production and purification. For this reason, small molecule acceptor (SMA)-based OSCs have attracted much attention due to the easy tunability of electronic and optical properties of SMA materials. In this study, polymers with temperature dependent aggregation behaviors are combined with various small molecule acceptor materials, which lead to impressive power conversion efficiencies of up to 7.3%. The morphological and aggregation properties of the polymer:small molecule blends are studied in details. It is found that the temperature-dependent aggregation behavior of polymers allows for the processing of the polymer solutions at moderately elevated temperature, and more importantly, controlled aggregation and strong crystallization of the polymer during the film cooling and drying process. This results in a well-controlled and near-ideal polymer:small molecule morphology that is controlled by polymer aggregation during warm casting and thus insensitive to the choice of small molecules. As a result, several cases of highly efficient (PCE between 6-7.3%) SMA OSCs are achieved. The second part of this presentation will describe the morphology of a new small molecule acceptor with a unique 3D structure. The relationship between molecular structure and morphology is revealed.
Identification of small molecule inhibitors of cytokinesis and single cell wound repair

PubMed Central

Clark, Andrew G.; Sider, Jenny R.; Verbrugghe, Koen; Fenteany, Gabriel; von Dassow, George; Bement, William M.

2013-01-01

Screening of small molecule libraries offers the potential to identify compounds that inhibit specific biological processes and, ultimately, to identify macromolecules that are important players in such processes. To date, however, most screens of small molecule libraries have focused on identification of compounds that inhibit known proteins or particular steps in a given process, and have emphasized automated primary screens. Here we have used “low tech” in vivo primary screens to identify small molecules that inhibit both cytokinesis and single cell wound repair, two complex cellular processes that possess many common features. The “diversity set”, an ordered array of 1990 compounds available from the National Cancer Institute, was screened in parallel to identify compounds that inhibit cytokinesis in D. excentricus (sand dollar) embryos and single cell wound repair in X. laevis (frog) oocytes. Two small molecules were thus identified: Sph1 and Sph2. Sph1 reduces Rho activation in wound repair and suppresses formation of the spindle midzone during cytokinesis. Sph2 also reduces Rho activation in wound repair and may inhibit cytokinesis by blocking membrane fusion. The results identify two small molecules of interest for analysis of wound repair and cytokinesis, reveal that these processes are more similar than often realized and reveal the potential power of low tech screens of small molecule libraries for analysis of complex cellular processes. PMID:23125193
Manipulation of Cell Physiology Enables Gene Silencing in Well-differentiated Airway Epithelia

PubMed Central

Krishnamurthy, Sateesh; Behlke, Mark A; Ramachandran, Shyam; Salem, Aliasger K; McCray Jr, Paul B; Davidson, Beverly L

2012-01-01

The application of RNA interference-based gene silencing to the airway surface epithelium holds great promise to manipulate host and pathogen gene expression for therapeutic purposes. However, well-differentiated airway epithelia display significant barriers to double-stranded small-interfering RNA (siRNA) delivery despite testing varied classes of nonviral reagents. In well-differentiated primary pig airway epithelia (PAE) or human airway epithelia (HAE) grown at the air–liquid interface (ALI), the delivery of a Dicer-substrate small-interfering RNA (DsiRNA) duplex against hypoxanthine–guanine phosphoribosyltransferase (HPRT) with several nonviral reagents showed minimal uptake and no knockdown of the target. In contrast, poorly differentiated cells (2–5-day post-seeding) exhibited significant oligonucleotide internalization and target knockdown. This finding suggested that during differentiation, the barrier properties of the epithelium are modified to an extent that impedes oligonucleotide uptake. We used two methods to overcome this inefficiency. First, we tested the impact of epidermal growth factor (EGF), a known enhancer of macropinocytosis. Treatment of the cells with EGF improved oligonucleotide uptake resulting in significant but modest levels of target knockdown. Secondly, we used the connectivity map (Cmap) database to correlate gene expression changes during small molecule treatments on various cells types with genes that change upon mucociliary differentiation. Several different drug classes were identified from this correlative assessment. Well-differentiated epithelia treated with DsiRNAs and LY294002, a PI3K inhibitor, significantly improved gene silencing and concomitantly reduced target protein levels. These novel findings reveal that well-differentiated airway epithelia, normally resistant to siRNA delivery, can be pretreated with small molecules to improve uptake of synthetic oligonucleotide and RNA interference (RNAi) responses. PMID:23344182
Adsorption structures and energetics of molecules on metal surfaces: Bridging experiment and theory

NASA Astrophysics Data System (ADS)

Maurer, Reinhard J.; Ruiz, Victor G.; Camarillo-Cisneros, Javier; Liu, Wei; Ferri, Nicola; Reuter, Karsten; Tkatchenko, Alexandre

2016-05-01

Adsorption geometry and stability of organic molecules on surfaces are key parameters that determine the observable properties and functions of hybrid inorganic/organic systems (HIOSs). Despite many recent advances in precise experimental characterization and improvements in first-principles electronic structure methods, reliable databases of structures and energetics for large adsorbed molecules are largely amiss. In this review, we present such a database for a range of molecules adsorbed on metal single-crystal surfaces. The systems we analyze include noble-gas atoms, conjugated aromatic molecules, carbon nanostructures, and heteroaromatic compounds adsorbed on five different metal surfaces. The overall objective is to establish a diverse benchmark dataset that enables an assessment of current and future electronic structure methods, and motivates further experimental studies that provide ever more reliable data. Specifically, the benchmark structures and energetics from experiment are here compared with the recently developed van der Waals (vdW) inclusive density-functional theory (DFT) method, DFT + vdWsurf. In comparison to 23 adsorption heights and 17 adsorption energies from experiment we find a mean average deviation of 0.06 Å and 0.16 eV, respectively. This confirms the DFT + vdWsurf method as an accurate and efficient approach to treat HIOSs. A detailed discussion identifies remaining challenges to be addressed in future development of electronic structure methods, for which the here presented benchmark database may serve as an important reference.
High-throughput identification and rational design of synergistic small-molecule pairs for combating and bypassing antibiotic resistance.

PubMed

Wambaugh, Morgan A; Shakya, Viplendra P S; Lewis, Adam J; Mulvey, Matthew A; Brown, Jessica C S

2017-06-01

Antibiotic-resistant infections kill approximately 23,000 people and cost $20,000,000,000 each year in the United States alone despite the widespread use of small-molecule antimicrobial combination therapy. Antibiotic combinations typically have an additive effect: the efficacy of the combination matches the sum of the efficacies of each antibiotic when used alone. Small molecules can also act synergistically when the efficacy of the combination is greater than the additive efficacy. However, synergistic combinations are rare and have been historically difficult to identify. High-throughput identification of synergistic pairs is limited by the scale of potential combinations: a modest collection of 1,000 small molecules involves 1 million pairwise combinations. Here, we describe a high-throughput method for rapid identification of synergistic small-molecule pairs, the overlap2 method (O2M). O2M extracts patterns from chemical-genetic datasets, which are created when a collection of mutants is grown in the presence of hundreds of different small molecules, producing a precise set of phenotypes induced by each small molecule across the mutant set. The identification of mutants that show the same phenotype when treated with known synergistic molecules allows us to pinpoint additional molecule combinations that also act synergistically. As a proof of concept, we focus on combinations with the antibiotics trimethoprim and sulfamethizole, which had been standard treatment against urinary tract infections until widespread resistance decreased efficacy. Using O2M, we screened a library of 2,000 small molecules and identified several that synergize with the antibiotic trimethoprim and/or sulfamethizole. The most potent of these synergistic interactions is with the antiviral drug azidothymidine (AZT). We then demonstrate that understanding the molecular mechanism underlying small-molecule synergistic interactions allows the rational design of additional combinations that bypass drug resistance. Trimethoprim and sulfamethizole are both folate biosynthesis inhibitors. We find that this activity disrupts nucleotide homeostasis, which blocks DNA replication in the presence of AZT. Building on these data, we show that other small molecules that disrupt nucleotide homeostasis through other mechanisms (hydroxyurea and floxuridine) also act synergistically with AZT. These novel combinations inhibit the growth and virulence of trimethoprim-resistant clinical Escherichia coli and Klebsiella pneumoniae isolates, suggesting that they may be able to be rapidly advanced into clinical use. In sum, we present a generalizable method to screen for novel synergistic combinations, to identify particular mechanisms resulting in synergy, and to use the mechanistic knowledge to rationally design new combinations that bypass drug resistance.

High-throughput identification and rational design of synergistic small-molecule pairs for combating and bypassing antibiotic resistance

PubMed Central

Lewis, Adam J.; Mulvey, Matthew A.

2017-01-01

Antibiotic-resistant infections kill approximately 23,000 people and cost $20,000,000,000 each year in the United States alone despite the widespread use of small-molecule antimicrobial combination therapy. Antibiotic combinations typically have an additive effect: the efficacy of the combination matches the sum of the efficacies of each antibiotic when used alone. Small molecules can also act synergistically when the efficacy of the combination is greater than the additive efficacy. However, synergistic combinations are rare and have been historically difficult to identify. High-throughput identification of synergistic pairs is limited by the scale of potential combinations: a modest collection of 1,000 small molecules involves 1 million pairwise combinations. Here, we describe a high-throughput method for rapid identification of synergistic small-molecule pairs, the overlap2 method (O2M). O2M extracts patterns from chemical-genetic datasets, which are created when a collection of mutants is grown in the presence of hundreds of different small molecules, producing a precise set of phenotypes induced by each small molecule across the mutant set. The identification of mutants that show the same phenotype when treated with known synergistic molecules allows us to pinpoint additional molecule combinations that also act synergistically. As a proof of concept, we focus on combinations with the antibiotics trimethoprim and sulfamethizole, which had been standard treatment against urinary tract infections until widespread resistance decreased efficacy. Using O2M, we screened a library of 2,000 small molecules and identified several that synergize with the antibiotic trimethoprim and/or sulfamethizole. The most potent of these synergistic interactions is with the antiviral drug azidothymidine (AZT). We then demonstrate that understanding the molecular mechanism underlying small-molecule synergistic interactions allows the rational design of additional combinations that bypass drug resistance. Trimethoprim and sulfamethizole are both folate biosynthesis inhibitors. We find that this activity disrupts nucleotide homeostasis, which blocks DNA replication in the presence of AZT. Building on these data, we show that other small molecules that disrupt nucleotide homeostasis through other mechanisms (hydroxyurea and floxuridine) also act synergistically with AZT. These novel combinations inhibit the growth and virulence of trimethoprim-resistant clinical Escherichia coli and Klebsiella pneumoniae isolates, suggesting that they may be able to be rapidly advanced into clinical use. In sum, we present a generalizable method to screen for novel synergistic combinations, to identify particular mechanisms resulting in synergy, and to use the mechanistic knowledge to rationally design new combinations that bypass drug resistance. PMID:28632788
X-ray characterization of solid small molecule organic materials

DOEpatents

Billinge, Simon; Shankland, Kenneth; Shankland, Norman; Florence, Alastair

2014-06-10

The present invention provides, inter alia, methods of characterizing a small molecule organic material, e.g., a drug or a drug product. This method includes subjecting the solid small molecule organic material to x-ray total scattering analysis at a short wavelength, collecting data generated thereby, and mathematically transforming the data to provide a refined set of data.
Group specific internal standard technology (GSIST) for simultaneous identification and quantification of small molecules

DOEpatents

Adamec, Jiri; Yang, Wen-Chu; Regnier, Fred E

2014-01-14

Reagents and methods are provided that permit simultaneous analysis of multiple diverse small molecule analytes present in a complex mixture. Samples are labeled with chemically identical but isotopically distince forms of the labeling reagent, and analyzed using mass spectrometry. A single reagent simultaneously derivatizes multiple small molecule analytes having different reactive functional groups.
Crossing borders to bind proteins--a new concept in protein recognition based on the conjugation of small organic molecules or short peptides to polypeptides from a designed set.

PubMed

Baltzer, Lars

2011-06-01

A new concept for protein recognition and binding is highlighted. The conjugation of small organic molecules or short peptides to polypeptides from a designed set provides binder molecules that bind proteins with high affinities, and with selectivities that are equal to those of antibodies. The small organic molecules or peptides need to bind the protein targets but only with modest affinities and selectivities, because conjugation to the polypeptides results in molecules with dramatically improved binder performance. The polypeptides are selected from a set of only sixteen sequences designed to bind, in principle, any protein. The small number of polypeptides used to prepare high-affinity binders contrasts sharply with the huge libraries used in binder technologies based on selection or immunization. Also, unlike antibodies and engineered proteins, the polypeptides have unordered three-dimensional structures and adapt to the proteins to which they bind. Binder molecules for the C-reactive protein, human carbonic anhydrase II, acetylcholine esterase, thymidine kinase 1, phosphorylated proteins, the D-dimer, and a number of antibodies are used as examples to demonstrate that affinities are achieved that are higher than those of the small molecules or peptides by as much as four orders of magnitude. Evaluation by pull-down experiments and ELISA-based tests in human serum show selectivities to be equal to those of antibodies. Small organic molecules and peptides are readily available from pools of endogenous ligands, enzyme substrates, inhibitors or products, from screened small molecule libraries, from phage display, and from mRNA display. The technology is an alternative to established binder concepts for applications in drug development, diagnostics, medical imaging, and protein separation.
Membrane Fusion Induced by Small Molecules and Ions

PubMed Central

Mondal Roy, Sutapa; Sarkar, Munna

2011-01-01

Membrane fusion is a key event in many biological processes. These processes are controlled by various fusogenic agents of which proteins and peptides from the principal group. The fusion process is characterized by three major steps, namely, inter membrane contact, lipid mixing forming the intermediate step, pore opening and finally mixing of inner contents of the cells/vesicles. These steps are governed by energy barriers, which need to be overcome to complete fusion. Structural reorganization of big molecules like proteins/peptides, supplies the required driving force to overcome the energy barrier of the different intermediate steps. Small molecules/ions do not share this advantage. Hence fusion induced by small molecules/ions is expected to be different from that induced by proteins/peptides. Although several reviews exist on membrane fusion, no recent review is devoted solely to small moleculs/ions induced membrane fusion. Here we intend to present, how a variety of small molecules/ions act as independent fusogens. The detailed mechanism of some are well understood but for many it is still an unanswered question. Clearer understanding of how a particular small molecule can control fusion will open up a vista to use these moleucles instead of proteins/peptides to induce fusion both in vivo and in vitro fusion processes. PMID:21660306
The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases

PubMed Central

Caspi, Ron; Altman, Tomer; Dale, Joseph M.; Dreher, Kate; Fulcher, Carol A.; Gilham, Fred; Kaipa, Pallavi; Karthikeyan, Athikkattuvalasu S.; Kothari, Anamika; Krummenacker, Markus; Latendresse, Mario; Mueller, Lukas A.; Paley, Suzanne; Popescu, Liviu; Pujar, Anuradha; Shearer, Alexander G.; Zhang, Peifen; Karp, Peter D.

2010-01-01

The MetaCyc database (MetaCyc.org) is a comprehensive and freely accessible resource for metabolic pathways and enzymes from all domains of life. The pathways in MetaCyc are experimentally determined, small-molecule metabolic pathways and are curated from the primary scientific literature. With more than 1400 pathways, MetaCyc is the largest collection of metabolic pathways currently available. Pathways reactions are linked to one or more well-characterized enzymes, and both pathways and enzymes are annotated with reviews, evidence codes, and literature citations. BioCyc (BioCyc.org) is a collection of more than 500 organism-specific Pathway/Genome Databases (PGDBs). Each BioCyc PGDB contains the full genome and predicted metabolic network of one organism. The network, which is predicted by the Pathway Tools software using MetaCyc as a reference, consists of metabolites, enzymes, reactions and metabolic pathways. BioCyc PGDBs also contain additional features, such as predicted operons, transport systems, and pathway hole-fillers. The BioCyc Web site offers several tools for the analysis of the PGDBs, including Omics Viewers that enable visualization of omics datasets on two different genome-scale diagrams and tools for comparative analysis. The BioCyc PGDBs generated by SRI are offered for adoption by any party interested in curation of metabolic, regulatory, and genome-related information about an organism. PMID:19850718
MIRNA-DISTILLER: A Stand-Alone Application to Compile microRNA Data from Databases.

PubMed

Rieger, Jessica K; Bodan, Denis A; Zanger, Ulrich M

2011-01-01

MicroRNAs (miRNA) are small non-coding RNA molecules of ∼22 nucleotides which regulate large numbers of genes by binding to seed sequences at the 3'-untranslated region of target gene transcripts. The target mRNA is then usually degraded or translation is inhibited, although thus resulting in posttranscriptional down regulation of gene expression at the mRNA and/or protein level. Due to the bioinformatic difficulties in predicting functional miRNA binding sites, several publically available databases have been developed that predict miRNA binding sites based on different algorithms. The parallel use of different databases is currently indispensable, but highly uncomfortable and time consuming, especially when working with numerous genes of interest. We have therefore developed a new stand-alone program, termed MIRNA-DISTILLER, which allows to compile miRNA data for given target genes from public databases. Currently implemented are TargetScan, microCosm, and miRDB, which may be queried independently, pairwise, or together to calculate the respective intersections. Data are stored locally for application of further analysis tools including freely definable biological parameter filters, customized output-lists for both miRNAs and target genes, and various graphical facilities. The software, a data example file and a tutorial are freely available at http://www.ikp-stuttgart.de/content/language1/html/10415.asp.
MIRNA-DISTILLER: A Stand-Alone Application to Compile microRNA Data from Databases

PubMed Central

Rieger, Jessica K.; Bodan, Denis A.; Zanger, Ulrich M.

2011-01-01

MicroRNAs (miRNA) are small non-coding RNA molecules of ∼22 nucleotides which regulate large numbers of genes by binding to seed sequences at the 3′-untranslated region of target gene transcripts. The target mRNA is then usually degraded or translation is inhibited, although thus resulting in posttranscriptional down regulation of gene expression at the mRNA and/or protein level. Due to the bioinformatic difficulties in predicting functional miRNA binding sites, several publically available databases have been developed that predict miRNA binding sites based on different algorithms. The parallel use of different databases is currently indispensable, but highly uncomfortable and time consuming, especially when working with numerous genes of interest. We have therefore developed a new stand-alone program, termed MIRNA-DISTILLER, which allows to compile miRNA data for given target genes from public databases. Currently implemented are TargetScan, microCosm, and miRDB, which may be queried independently, pairwise, or together to calculate the respective intersections. Data are stored locally for application of further analysis tools including freely definable biological parameter filters, customized output-lists for both miRNAs and target genes, and various graphical facilities. The software, a data example file and a tutorial are freely available at http://www.ikp-stuttgart.de/content/language1/html/10415.asp PMID:22303335
Perylene-Diimide Based Donor-Acceptor-Donor Type Small-Molecule Acceptors for Solution-Processable Organic Solar Cells

NASA Astrophysics Data System (ADS)

Ganesamoorthy, Ramasamy; Vijayaraghavan, Rajagopalan; Sakthivel, Pachagounder

2017-12-01

Development of nonfullerene acceptors plays an important role in the commercial availability of plastic solar cells. We report herein synthesis of bay-substituted donor-acceptor-donor (D-A-D)-type perylene diimide (PDI)-based small molecules (SM-1 to SM-4) by Suzuki coupling method and their use as acceptors in bulk heterojunction organic solar cells (BHJ-OSCs) with poly(3-hexylthiophene) (P3HT) polymer donor. We varied the number of electron-rich thiophene units and the solubilizing side chains and also evaluated the optical and electrochemical properties of the small molecules. The synthesized small molecules were confirmed by Fourier-transform infrared (FT-IR) spectroscopy, nuclear magnetic resonance (NMR) spectroscopy, and high-resolution mass spectroscopy (HR-MS). The small molecules showed extensive and strong absorption in the ultraviolet-visible (UV-Vis) region up to 750 nm, with bandgap (E_{{g}}^{{opt}} ) reduced below <2 eV. The energy levels of small molecules SM-1 to SM-4 were suitable for use as electron-accepting materials. The small molecules showed good thermal stability up to 300°C. BHJ-OSCs with SM-1 and P3HT polymer donor showed maximum power conversion efficiency (PCE) of 0.19% with V oc of 0.30 V, J sc of 1.72 mA cm-2, and fill factor (FF) of 37%. The PCE decreased with the number of thiophene units. The PCE of SM-2 was lower than that of SM-1. This difference in PCE can be explained by the higher aggregation tendency of the bithiophene compared with the thiophene unit. Introduction of the solubilizing group in the bay position increased the aggregation property, leading to much lower PCE than for the small molecules without solubilizing group.
Hierarchical virtual screening approaches in small molecule drug discovery.

PubMed

Kumar, Ashutosh; Zhang, Kam Y J

2015-01-01

Virtual screening has played a significant role in the discovery of small molecule inhibitors of therapeutic targets in last two decades. Various ligand and structure-based virtual screening approaches are employed to identify small molecule ligands for proteins of interest. These approaches are often combined in either hierarchical or parallel manner to take advantage of the strength and avoid the limitations associated with individual methods. Hierarchical combination of ligand and structure-based virtual screening approaches has received noteworthy success in numerous drug discovery campaigns. In hierarchical virtual screening, several filters using ligand and structure-based approaches are sequentially applied to reduce a large screening library to a number small enough for experimental testing. In this review, we focus on different hierarchical virtual screening strategies and their application in the discovery of small molecule modulators of important drug targets. Several virtual screening studies are discussed to demonstrate the successful application of hierarchical virtual screening in small molecule drug discovery. Copyright © 2014 Elsevier Inc. All rights reserved.
Systematic development of small molecules to inhibit specific microscopic steps of Aβ42 aggregation in Alzheimer's disease.

PubMed

Habchi, Johnny; Chia, Sean; Limbocker, Ryan; Mannini, Benedetta; Ahn, Minkoo; Perni, Michele; Hansson, Oskar; Arosio, Paolo; Kumita, Janet R; Challa, Pavan Kumar; Cohen, Samuel I A; Linse, Sara; Dobson, Christopher M; Knowles, Tuomas P J; Vendruscolo, Michele

2017-01-10

The aggregation of the 42-residue form of the amyloid-β peptide (Aβ42) is a pivotal event in Alzheimer's disease (AD). The use of chemical kinetics has recently enabled highly accurate quantifications of the effects of small molecules on specific microscopic steps in Aβ42 aggregation. Here, we exploit this approach to develop a rational drug discovery strategy against Aβ42 aggregation that uses as a read-out the changes in the nucleation and elongation rate constants caused by candidate small molecules. We thus identify a pool of compounds that target specific microscopic steps in Aβ42 aggregation. We then test further these small molecules in human cerebrospinal fluid and in a Caenorhabditis elegans model of AD. Our results show that this strategy represents a powerful approach to identify systematically small molecule lead compounds, thus offering an appealing opportunity to reduce the attrition problem in drug discovery.
Synthesis of many different types of organic small molecules using one automated process.

PubMed

Li, Junqi; Ballmer, Steven G; Gillis, Eric P; Fujii, Seiko; Schmidt, Michael J; Palazzolo, Andrea M E; Lehmann, Jonathan W; Morehouse, Greg F; Burke, Martin D

2015-03-13

Small-molecule synthesis usually relies on procedures that are highly customized for each target. A broadly applicable automated process could greatly increase the accessibility of this class of compounds to enable investigations of their practical potential. Here we report the synthesis of 14 distinct classes of small molecules using the same fully automated process. This was achieved by strategically expanding the scope of a building block-based synthesis platform to include even C(sp3)-rich polycyclic natural product frameworks and discovering a catch-and-release chromatographic purification protocol applicable to all of the corresponding intermediates. With thousands of compatible building blocks already commercially available, many small molecules are now accessible with this platform. More broadly, these findings illuminate an actionable roadmap to a more general and automated approach for small-molecule synthesis. Copyright © 2015, American Association for the Advancement of Science.
Antibody-Mediated Small Molecule Detection Using Programmable DNA-Switches.

PubMed

Rossetti, Marianna; Ippodrino, Rudy; Marini, Bruna; Palleschi, Giuseppe; Porchetta, Alessandro

2018-06-13

The development of rapid, cost-effective, and single-step methods for the detection of small molecules is crucial for improving the quality and efficiency of many applications ranging from life science to environmental analysis. Unfortunately, current methodologies still require multiple complex, time-consuming washing and incubation steps, which limit their applicability. In this work we present a competitive DNA-based platform that makes use of both programmable DNA-switches and antibodies to detect small target molecules. The strategy exploits both the advantages of proximity-based methods and structure-switching DNA-probes. The platform is modular and versatile and it can potentially be applied for the detection of any small target molecule that can be conjugated to a nucleic acid sequence. Here the rational design of programmable DNA-switches is discussed, and the sensitive, rapid, and single-step detection of different environmentally relevant small target molecules is demonstrated.
Large scale nanoparticle screening for small molecule analysis in laser desorption ionization mass spectrometry

DOE PAGES

Yagnik, Gargey B.; Hansen, Rebecca L.; Korte, Andrew R.; ...

2016-08-30

Nanoparticles (NPs) have been suggested as efficient matrixes for small molecule profiling and imaging by laser-desorption ionization mass spectrometry (LDI-MS), but so far there has been no systematic study comparing different NPs in the analysis of various classes of small molecules. Here, we present a large scale screening of 13 NPs for the analysis of two dozen small metabolite molecules. Many NPs showed much higher LDI efficiency than organic matrixes in positive mode and some NPs showed comparable efficiencies for selected analytes in negative mode. Our results suggest that a thermally driven desorption process is a key factor for metalmore » oxide NPs, but chemical interactions are also very important, especially for other NPs. Furthermore, the screening results provide a useful guideline for the selection of NPs in the LDI-MS analysis of small molecules.« less
Large scale nanoparticle screening for small molecule analysis in laser desorption ionization mass spectrometry

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yagnik, Gargey B.; Hansen, Rebecca L.; Korte, Andrew R.

Nanoparticles (NPs) have been suggested as efficient matrixes for small molecule profiling and imaging by laser-desorption ionization mass spectrometry (LDI-MS), but so far there has been no systematic study comparing different NPs in the analysis of various classes of small molecules. Here, we present a large scale screening of 13 NPs for the analysis of two dozen small metabolite molecules. Many NPs showed much higher LDI efficiency than organic matrixes in positive mode and some NPs showed comparable efficiencies for selected analytes in negative mode. Our results suggest that a thermally driven desorption process is a key factor for metalmore » oxide NPs, but chemical interactions are also very important, especially for other NPs. Furthermore, the screening results provide a useful guideline for the selection of NPs in the LDI-MS analysis of small molecules.« less
FlexAID: Revisiting Docking on Non-Native-Complex Structures.

PubMed

Gaudreault, Francis; Najmanovich, Rafael J

2015-07-27

Small-molecule protein docking is an essential tool in drug design and to understand molecular recognition. In the present work we introduce FlexAID, a small-molecule docking algorithm that accounts for target side-chain flexibility and utilizes a soft scoring function, i.e. one that is not highly dependent on specific geometric criteria, based on surface complementarity. The pairwise energy parameters were derived from a large dataset of true positive poses and negative decoys from the PDBbind database through an iterative process using Monte Carlo simulations. The prediction of binding poses is tested using the widely used Astex dataset as well as the HAP2 dataset, while performance in virtual screening is evaluated using a subset of the DUD dataset. We compare FlexAID to AutoDock Vina, FlexX, and rDock in an extensive number of scenarios to understand the strengths and limitations of the different programs as well as to reported results for Glide, GOLD, and DOCK6 where applicable. The most relevant among these scenarios is that of docking on flexible non-native-complex structures where as is the case in reality, the target conformation in the bound form is not known a priori. We demonstrate that FlexAID, unlike other programs, is robust against increasing structural variability. FlexAID obtains equivalent sampling success as GOLD and performs better than AutoDock Vina or FlexX in all scenarios against non-native-complex structures. FlexAID is better than rDock when there is at least one critical side-chain movement required upon ligand binding. In virtual screening, FlexAID results are lower on average than those of AutoDock Vina and rDock. The higher accuracy in flexible targets where critical movements are required, intuitive PyMOL-integrated graphical user interface and free source code as well as precompiled executables for Windows, Linux, and Mac OS make FlexAID a welcome addition to the arsenal of existing small-molecule protein docking methods.
Small Molecule Signaling Agents: The Integrated Chemistry and Biochemistry of Nitrogen Oxides, Oxides of Carbon, Dioxygen, Hydrogen Sulfide, and Their Derived Species

PubMed Central

Fukuto, Jon M.; Carrington, Samantha J.; Tantillo, Dean J.; Harrison, Jason G.; Ignarro, Louis J.; Freeman, Bruce A.; Chen, Andrew; Wink, David A.

2014-01-01

Several small molecule species formally known primarily as toxic gases have, over the past 20 years, been shown to be endogenously generated signaling molecules. The biological signaling associated with the small molecules NO, CO, H2S (and the nonendogenously generated O2), and their derived species have become a topic of extreme interest. It has become increasingly clear that these small molecule signaling agents form an integrated signaling web that affects/regulates numerous physiological processes. The chemical interactions between these species and each other or biological targets is an important factor in their roles as signaling agents. Thus, a fundamental understanding of the chemistry of these molecules is essential to understanding their biological/physiological utility. This review focuses on this chemistry and attempts to establish the chemical basis for their signaling functions. PMID:22263838
Fluorination-enabled optimal morphology leads to over 11% efficiency for inverted small-molecule organic solar cells

PubMed Central

Deng, Dan; Zhang, Yajie; Zhang, Jianqi; Wang, Zaiyu; Zhu, Lingyun; Fang, Jin; Xia, Benzheng; Wang, Zhen; Lu, Kun; Ma, Wei; Wei, Zhixiang

2016-01-01

Solution-processable small molecules for organic solar cells have attracted intense attention for their advantages of definite molecular structures compared with their polymer counterparts. However, the device efficiencies based on small molecules are still lower than those of polymers, especially for inverted devices, the highest efficiency of which is <9%. Here we report three novel solution-processable small molecules, which contain π-bridges with gradient-decreased electron density and end acceptors substituted with various fluorine atoms (0F, 1F and 2F, respectively). Fluorination leads to an optimal active layer morphology, including an enhanced domain purity, the formation of hierarchical domain size and a directional vertical phase gradation. The optimal morphology balances charge separation and transfer, and facilitates charge collection. As a consequence, fluorinated molecules exhibit excellent inverted device performance, and an average power conversion efficiency of 11.08% is achieved for a two-fluorine atom substituted molecule. PMID:27991486
[Effect of annealing temperature on the crystallization and spectroscopic response of a small-molecule semiconductor doped in polymer film].

PubMed

Yin, Ming; Zhang, Xin-Ping; Liu, Hong-Mei

2012-11-01

The crystallization properties of the perylene (EPPTC) molecules doped in the solid film of the derivative of polyfluorene (F8BT) at different annealing temperatures, as well as the consequently induced spectroscopic response of the exciplex emission in the heterojunction structures, were studied in the present paper. Experimental results showed that the phase separation between the small and the polymer molecules in the blend film is enhanced with increasing the annealing temperature, which leads to the crystallization of the EPPTC molecules due to the strong pi-pi stacking. The size of the crystal phase increases with increasing the annealing temperature. However, this process weakens the mechanisms of the heterojunction configuration, thus, the total interfacial area between the small and the polymer molecules and the amount of exciplex are reduced significantly in the blend film. Meanwhile, the energy transfer from the polymer to the small molecules is also reduced. As a result, the emission from the exciplex becomes weaker with increasing the annealing temperature, whereas the stronger emission from the polymer molecules and from the crystal phase of the small molecules can be observed. These experimental results are very important for understanding and tailoring the organic heterojunction structures. Furthermore, this provides photophysics for improving the performance of photovoltaic or solar cell devices.
The structure and dipole moment of globular proteins in solution and crystalline states: use of NMR and X-ray databases for the numerical calculation of dipole moment.

PubMed

Takashima, S

2001-04-05

The large dipole moment of globular proteins has been well known because of the detailed studies using dielectric relaxation and electro-optical methods. The search for the origin of these dipolemoments, however, must be based on the detailed knowledge on protein structure with atomic resolutions. At present, we have two sources of information on the structure of protein molecules: (1) x-ray databases obtained in crystalline state; (2) NMR databases obtained in solution state. While x-ray databases consist of only one model, NMR databases, because of the fluctuation of the protein folding in solution, consist of a number of models, thus enabling the computation of dipole moment repeated for all these models. The aim of this work, using these databases, is the detailed investigation on the interdependence between the structure and dipole moment of protein molecules. The dipole moment of protein molecules has roughly two components: one dipole moment is due to surface charges and the other, core dipole moment, is due to polar groups such as N--H and C==O bonds. The computation of surface charge dipole moment consists of two steps: (A) calculation of the pK shifts of charged groups for electrostatic interactions and (B) calculation of the dipole moment using the pK corrected for electrostatic shifts. The dipole moments of several proteins were computed using both NMR and x-ray databases. The dipole moments of these two sets of calculations are, with a few exceptions, in good agreement with one another and also with measured dipole moments.

Hippocampal and cortical neuronal growth mediated by the small molecule natural product clovanemagnolol.

PubMed

Khaing, Zin; Kang, Danby; Camelio, Andrew M; Schmidt, Christine E; Siegel, Dionicio

2011-08-15

The use of small molecule surrogates of growth factors that directly or indirectly promote growth represents an attractive approach to regenerative medicine. With synthetic access to clovanemagnolol, a small molecule initially isolated from the bark of the Bigleaf Magnolia tree, we have examined the small molecule's ability to promote growth of embryonic hippocampal and cortical neurons in serum-free medium. Comparisons with magnolol, a known promoter of growth, reveals that clovanmagnolol is a potent neurotrophic agent, promoting neuronal growth at concentrations of 10 nM. In addition, both clovanemagnolol and magnolol promote growth through a biphasic dose response. Copyright © 2011 Elsevier Ltd. All rights reserved.
Second-generation DNA-templated macrocycle libraries for the discovery of bioactive small molecules.

PubMed

Usanov, Dmitry L; Chan, Alix I; Maianti, Juan Pablo; Liu, David R

2018-07-01

DNA-encoded libraries have emerged as a widely used resource for the discovery of bioactive small molecules, and offer substantial advantages compared with conventional small-molecule libraries. Here, we have developed and streamlined multiple fundamental aspects of DNA-encoded and DNA-templated library synthesis methodology, including computational identification and experimental validation of a 20 × 20 × 20 × 80 set of orthogonal codons, chemical and computational tools for enhancing the structural diversity and drug-likeness of library members, a highly efficient polymerase-mediated template library assembly strategy, and library isolation and purification methods. We have integrated these improved methods to produce a second-generation DNA-templated library of 256,000 small-molecule macrocycles with improved drug-like physical properties. In vitro selection of this library for insulin-degrading enzyme affinity resulted in novel insulin-degrading enzyme inhibitors, including one of unusual potency and novel macrocycle stereochemistry (IC 50 = 40 nM). Collectively, these developments enable DNA-templated small-molecule libraries to serve as more powerful, accessible, streamlined and cost-effective tools for bioactive small-molecule discovery.
Use of mRNA expression signatures to discover small molecule inhibitors of skeletal muscle atrophy

PubMed Central

Adams, Christopher M.; Ebert, Scott M.; Dyle, Michael C.

2017-01-01

Purpose of review Here, we discuss a recently developed experimental strategy for discovering small molecules with potential to prevent and treat skeletal muscle atrophy. Recent findings Muscle atrophy involves and requires widespread changes in skeletal muscle gene expression, which generate complex but measurable patterns of positive and negative changes in skeletal muscle mRNA levels (a.k.a. mRNA expression signatures of muscle atrophy). Many bioactive small molecules generate their own characteristic mRNA expression signatures, and by identifying small molecules whose signatures approximate mirror images of muscle atrophy signatures, one may identify small molecules with potential to prevent and/or reverse muscle atrophy. Unlike a conventional drug discovery approach, this strategy does not rely on a predefined molecular target but rather exploits the complexity of muscle atrophy to identify small molecules that counter the entire spectrum of pathological changes in atrophic muscle. We discuss how this strategy has been used to identify two natural compounds, ursolic acid and tomatidine, that reduce muscle atrophy and improve skeletal muscle function. Summary Discovery strategies based on mRNA expression signatures can elucidate new approaches for preserving and restoring muscle mass and function. PMID:25807353
Use of mRNA expression signatures to discover small molecule inhibitors of skeletal muscle atrophy.

PubMed

Adams, Christopher M; Ebert, Scott M; Dyle, Michael C

2015-05-01

Here, we discuss a recently developed experimental strategy for discovering small molecules with potential to prevent and treat skeletal muscle atrophy. Muscle atrophy involves and requires widespread changes in skeletal muscle gene expression, which generate complex but measurable patterns of positive and negative changes in skeletal muscle mRNA levels (a.k.a. mRNA expression signatures of muscle atrophy). Many bioactive small molecules generate their own characteristic mRNA expression signatures, and by identifying small molecules whose signatures approximate mirror images of muscle atrophy signatures, one may identify small molecules with potential to prevent and/or reverse muscle atrophy. Unlike a conventional drug discovery approach, this strategy does not rely on a predefined molecular target but rather exploits the complexity of muscle atrophy to identify small molecules that counter the entire spectrum of pathological changes in atrophic muscle. We discuss how this strategy has been used to identify two natural compounds, ursolic acid and tomatidine, that reduce muscle atrophy and improve skeletal muscle function. Discovery strategies based on mRNA expression signatures can elucidate new approaches for preserving and restoring muscle mass and function.
Mass amplifying probe for sensitive fluorescence anisotropy detection of small molecules in complex biological samples.

PubMed

Cui, Liang; Zou, Yuan; Lin, Ninghang; Zhu, Zhi; Jenkins, Gareth; Yang, Chaoyong James

2012-07-03

Fluorescence anisotropy (FA) is a reliable and excellent choice for fluorescence sensing. One of the key factors influencing the FA value for any molecule is the molar mass of the molecule being measured. As a result, the FA method with functional nucleic acid aptamers has been limited to macromolecules such as proteins and is generally not applicable for the analysis of small molecules because their molecular masses are relatively too small to produce observable FA value changes. We report here a molecular mass amplifying strategy to construct anisotropy aptamer probes for small molecules. The probe is designed in such a way that only when a target molecule binds to the probe does it activate its binding ability to an anisotropy amplifier (a high molecular mass molecule such as protein), thus significantly increasing the molecular mass and FA value of the probe/target complex. Specifically, a mass amplifying probe (MAP) consists of a targeting aptamer domain against a target molecule and molecular mass amplifying aptamer domain for the amplifier protein. The probe is initially rendered inactive by a small blocking strand partially complementary to both target aptamer and amplifier protein aptamer so that the mass amplifying aptamer domain would not bind to the amplifier protein unless the probe has been activated by the target. In this way, we prepared two probes that constitute a target (ATP and cocaine respectively) aptamer, a thrombin (as the mass amplifier) aptamer, and a fluorophore. Both probes worked well against their corresponding small molecule targets, and the detection limits for ATP and cocaine were 0.5 μM and 0.8 μM, respectively. More importantly, because FA is less affected by environmental interferences, ATP in cell media and cocaine in urine were directly detected without any tedious sample pretreatment. Our results established that our molecular mass amplifying strategy can be used to design aptamer probes for rapid, sensitive, and selective detection of small molecules by means of FA in complex biological samples.
MATCH: An Atom- Typing Toolset for Molecular Mechanics Force Fields

PubMed Central

Yesselman, Joseph D.; Price, Daniel J.; Knight, Jennifer L.; Brooks, Charles L.

2011-01-01

We introduce a toolset of program libraries collectively titled MATCH (Multipurpose Atom-Typer for CHARMM) for the automated assignment of atom types and force field parameters for molecular mechanics simulation of organic molecules. The toolset includes utilities for the conversion from multiple chemical structure file formats into a molecular graph. A general chemical pattern-matching engine using this graph has been implemented whereby assignment of molecular mechanics atom types, charges and force field parameters is achieved by comparison against a customizable list of chemical fragments. While initially designed to complement the CHARMM simulation package and force fields by generating the necessary input topology and atom-type data files, MATCH can be expanded to any force field and program, and has core functionality that makes it extendable to other applications such as fragment-based property prediction. In the present work, we demonstrate the accurate construction of atomic parameters of molecules within each force field included in CHARMM36 through exhaustive cross validation studies illustrating that bond increment rules derived from one force field can be transferred to another. In addition, using leave-one-out substitution it is shown that it is also possible to substitute missing intra and intermolecular parameters with ones included in a force field to complete the parameterization of novel molecules. Finally, to demonstrate the robustness of MATCH and the coverage of chemical space offered by the recent CHARMM CGENFF force field (Vanommeslaeghe, et al., JCC., 2010, 31, 671–690), one million molecules from the PubChem database of small molecules are typed, parameterized and minimized. PMID:22042689
Profiling the NIH Small Molecule Repository for Compounds That Generate H2O2 by Redox Cycling in Reducing Environments

PubMed Central

2010-01-01

We have screened the Library of Pharmacologically Active Compounds (LOPAC) and the National Institutes of Health (NIH) Small Molecule Repository (SMR) libraries in a horseradish peroxidase–phenol red (HRP-PR) H2O2 detection assay to identify redox cycling compounds (RCCs) capable of generating H2O2 in buffers containing dithiothreitol (DTT). Two RCCs were identified in the LOPAC set, the ortho-naphthoquinone β-lapachone and the para-naphthoquinone NSC 95397. Thirty-seven (0.02%) concentration-dependent RCCs were identified from 195,826 compounds in the NIH SMR library; 3 singleton structures, 9 ortho-quinones, 2 para-quinones, 4 pyrimidotriazinediones, 15 arylsulfonamides, 2 nitrothiophene-2-carboxylates, and 2 tolyl hydrazides. Sixty percent of the ortho-quinones and 80% of the pyrimidotriazinediones in the library were confirmed as RCCs. In contrast, only 3.9% of the para-quinones were confirmed as RCCs. Fifteen of the 251 arylsulfonamides in the library were confirmed as RCCs, and since we screened 17,868 compounds with a sulfonamide functional group we conclude that the redox cycling activity of the arylsulfonamide RCCs is due to peripheral reactive enone, aromatic, or heterocyclic functions. Cross-target queries of the University of Pittsburgh Drug Discovery Institute (UPDDI) and PubChem databases revealed that the RCCs exhibited promiscuous bioactivity profiles and have populated both screening databases with significantly higher numbers of active flags than non-RCCs. RCCs were promiscuously active against protein targets known to be susceptible to oxidation, but were also active in cell growth inhibition assays, and against other targets thought to be insensitive to oxidation. Profiling compound libraries or the hits from screening campaigns in the HRP-PR H2O2 detection assay significantly reduce the timelines and resources required to identify and eliminate promiscuous nuisance RCCs from the candidates for lead optimization. PMID:20070233
Influence of thermocleavable functionality on organic field-effect transistor performance of small molecules

NASA Astrophysics Data System (ADS)

Mahale, Rajashree Y.; Dharmapurikar, Satej S.; Chini, Mrinmoy Kumar; Venugopalan, Vijay

2017-06-01

Diketopyrrolopyrrole based donor-acceptor-donor conjugated small molecules using ethylene dioxythiophene as a donor was synthesized. Electron deficient diketopyrrolopyrrole unit was substituted with thermocleavable (tert-butyl acetate) side chains. The thermal treatment of the molecules at 160 °C eliminated the tert-butyl ester group results in the formation of corresponding acid. Optical and theoretical studies revealed that the molecules adopted a change in molecular arrangement after thermolysis. The conjugated small molecules possessed p-channel charge transport characteristics in organic field effect transistors. The charge carrier mobility was increased after thermolysis of tert-butyl ester group to 5.07 × 10-5 cm2/V s.
A graph-based approach to construct target-focused libraries for virtual screening.

PubMed

Naderi, Misagh; Alvin, Chris; Ding, Yun; Mukhopadhyay, Supratik; Brylinski, Michal

2016-01-01

Due to exorbitant costs of high-throughput screening, many drug discovery projects commonly employ inexpensive virtual screening to support experimental efforts. However, the vast majority of compounds in widely used screening libraries, such as the ZINC database, will have a very low probability to exhibit the desired bioactivity for a given protein. Although combinatorial chemistry methods can be used to augment existing compound libraries with novel drug-like compounds, the broad chemical space is often too large to be explored. Consequently, the trend in library design has shifted to produce screening collections specifically tailored to modulate the function of a particular target or a protein family. Assuming that organic compounds are composed of sets of rigid fragments connected by flexible linkers, a molecule can be decomposed into its building blocks tracking their atomic connectivity. On this account, we developed eSynth, an exhaustive graph-based search algorithm to computationally synthesize new compounds by reconnecting these building blocks following their connectivity patterns. We conducted a series of benchmarking calculations against the Directory of Useful Decoys, Enhanced database. First, in a self-benchmarking test, the correctness of the algorithm is validated with the objective to recover a molecule from its building blocks. Encouragingly, eSynth can efficiently rebuild more than 80 % of active molecules from their fragment components. Next, the capability to discover novel scaffolds is assessed in a cross-benchmarking test, where eSynth successfully reconstructed 40 % of the target molecules using fragments extracted from chemically distinct compounds. Despite an enormous chemical space to be explored, eSynth is computationally efficient; half of the molecules are rebuilt in less than a second, whereas 90 % take only about a minute to be generated. eSynth can successfully reconstruct chemically feasible molecules from molecular fragments. Furthermore, in a procedure mimicking the real application, where one expects to discover novel compounds based on a small set of already developed bioactives, eSynth is capable of generating diverse collections of molecules with the desired activity profiles. Thus, we are very optimistic that our effort will contribute to targeted drug discovery. eSynth is freely available to the academic community at www.brylinski.org/content/molecular-synthesis.Graphical abstractAssuming that organic compounds are composed of sets of rigid fragments connected by flexible linkers, a molecule can be decomposed into its building blocks tracking their atomic connectivity. Here, we developed eSynth, an automated method to synthesize new compounds by reconnecting these building blocks following the connectivity patterns via an exhaustive graph-based search algorithm. eSynth opens up a possibility to rapidly construct virtual screening libraries for targeted drug discovery.
Discovery of Small Molecules that Inhibit the Disordered Protein, p27Kip1

PubMed Central

Iconaru, Luigi I.; Ban, David; Bharatham, Kavitha; Ramanathan, Arvind; Zhang, Weixing; Shelat, Anang A.; Zuo, Jian; Kriwacki, Richard W.

2015-01-01

Disordered proteins are highly prevalent in biological systems, they control myriad signaling and regulatory processes, and their levels and/or cellular localization are often altered in human disease. In contrast to folded proteins, disordered proteins, due to conformational heterogeneity and dynamics, are not considered viable drug targets. We challenged this paradigm by identifying through NMR-based screening small molecules that bound specifically, albeit weakly, to the disordered cell cycle regulator, p27Kip1 (p27). Two groups of molecules bound to sites created by transient clusters of aromatic residues within p27. Conserved chemical features within these two groups of small molecules exhibited complementarity to their binding sites within p27, establishing structure-activity relationships for small molecule:disordered protein interactions. Finally, one compound counteracted the Cdk2/cyclin A inhibitory function of p27 in vitro, providing proof-of-principle that small molecules can inhibit the function of a disordered protein (p27) through sequestration in a conformation incapable of folding and binding to a natural regulatory target (Cdk2/cyclin A). PMID:26507530
Discovery of Small Molecules that Inhibit the Disordered Protein, p27 Kip1

DOE PAGES

Iconaru, Luigi I.; Ban, David; Bharatham, Kavitha; ...

2015-10-28

In disordered proteins we see that they are highly prevalent in biological systems. They control myriad signaling and regulatory processes, and their levels and/or cellular localization are often altered in human disease. In contrast to folded proteins, disordered proteins, due to conformational heterogeneity and dynamics, are not considered viable drug targets. We challenged this paradigm by identifying through NMR-based screening small molecules that bound specifically, albeit weakly, to the disordered cell cycle regulator, p27 Kip1 (p27). Moreover, two groups of molecules bound to sites created by transient clusters of aromatic residues within p27. Conserved chemical features within these two groupsmore » of small molecules exhibited complementarity to their binding sites within p27, establishing structure-activity relationships for small molecule: disordered protein interactions. Finally, one compound counteracted the Cdk2/cyclin A inhibitory function of p27 in vitro, providing proof-of- principle that small molecules can inhibit the function of a disordered protein (p27) through sequestration in a conformation incapable of folding and binding to a natural regulatory target (Cdk2/cyclin A).« less
Strategy to discover diverse optimal molecules in the small molecule universe.

PubMed

Rupakheti, Chetan; Virshup, Aaron; Yang, Weitao; Beratan, David N

2015-03-23

The small molecule universe (SMU) is defined as a set of over 10(60) synthetically feasible organic molecules with molecular weight less than ∼500 Da. Exhaustive enumerations and evaluation of all SMU molecules for the purpose of discovering favorable structures is impossible. We take a stochastic approach and extend the ACSESS framework ( Virshup et al. J. Am. Chem. Soc. 2013 , 135 , 7296 - 7303 ) to develop diversity oriented molecular libraries that can generate a set of compounds that is representative of the small molecule universe and that also biases the library toward favorable physical property values. We show that the approach is efficient compared to exhaustive enumeration and to existing evolutionary algorithms for generating such libraries by testing in the NKp fitness landscape model and in the fully enumerated GDB-9 chemical universe containing 3 × 10(5) molecules.
Strategy To Discover Diverse Optimal Molecules in the Small Molecule Universe

PubMed Central

2015-01-01

The small molecule universe (SMU) is defined as a set of over 1060 synthetically feasible organic molecules with molecular weight less than ∼500 Da. Exhaustive enumerations and evaluation of all SMU molecules for the purpose of discovering favorable structures is impossible. We take a stochastic approach and extend the ACSESS framework (Virshup et al. J. Am. Chem. Soc.2013, 135, 7296–730323548177) to develop diversity oriented molecular libraries that can generate a set of compounds that is representative of the small molecule universe and that also biases the library toward favorable physical property values. We show that the approach is efficient compared to exhaustive enumeration and to existing evolutionary algorithms for generating such libraries by testing in the NKp fitness landscape model and in the fully enumerated GDB-9 chemical universe containing 3 × 105 molecules. PMID:25594586
Possible association between Helicobacter pylori infection and nonalcoholic fatty liver disease.

PubMed

Chen, Chang-Xi; Mao, Yu-Shan; Foster, Parker; Zhu, Zhong-Wei; Du, Juan; Guo, Chuan-Yong

2017-03-01

Possible association between Helicobacter pylori infection (HPI) and nonalcoholic fatty liver disease (NAFLD) has been proposed by several studies with inconsistent conclusions. Here, we studied the association between HPI and NAFLD at 3 levels: (i) genetic level; (ii) small molecular level; and (iii) clinical level. Relation data between diseases, genes, and small molecules were acquired from Pathway Studio ResNet Mammalian database. Clinical data were acquired from 2263 elderly South Chinese subjects, including 603 NAFLD patients and 1660 subjects without NAFLD. Results showed that HPI and NAFLD present significantly shared genetic bases (95 genes, p value = 2.5E-72), demonstrating multiple common genetic pathways (enrichment p value ≤ 4.38E-20 for the top 10 pathways). Genetic network analysis suggested that mutual regulation may exist between HPI and NAFLD through 21 out of 95 genes. Furthermore, 85 out of the 95 genes manifested strong interaction with 12 small molecules/drugs that demonstrate effectiveness in treating both diseases. Clinical results showed that HPI rate in the NAFLD group was significantly higher than that in the group without NAFLD (51.9% vs. 43.6%; p value = 4.9E-4). Multivariate logistic regression results supported the observations and suggested that HPI served as a risk factor for NAFLD in the experiment data studied (odds ratio: 1.387, p value = 0.018). Results from this study support the hypothesis that complex biological association may exist between HPI and NAFLD, which partially explains the significant clinical co-incidence in the elderly population of south China.
A combined pharmacophore modeling, 3D-QSAR and molecular docking study of substituted bicyclo-[3.3.0]oct-2-enes as liver receptor homolog-1 (LRH-1) agonists

NASA Astrophysics Data System (ADS)

Lalit, Manisha; Gangwal, Rahul P.; Dhoke, Gaurao V.; Damre, Mangesh V.; Khandelwal, Kanchan; Sangamwar, Abhay T.

2013-10-01

A combined pharmacophore modelling, 3D-QSAR and molecular docking approach was employed to reveal structural and chemical features essential for the development of small molecules as LRH-1 agonists. The best HypoGen pharmacophore hypothesis (Hypo1) consists of one hydrogen-bond donor (HBD), two general hydrophobic (H), one hydrophobic aromatic (HYAr) and one hydrophobic aliphatic (HYA) feature. It has exhibited high correlation coefficient of 0.927, cost difference of 85.178 bit and low RMS value of 1.411. This pharmacophore hypothesis was cross-validated using test set, decoy set and Cat-Scramble methodology. Subsequently, validated pharmacophore hypothesis was used in the screening of small chemical databases. Further, 3D-QSAR models were developed based on the alignment obtained using substructure alignment. The best CoMFA and CoMSIA model has exhibited excellent rncv2 values of 0.991 and 0.987, and rcv2 values of 0.767 and 0.703, respectively. CoMFA predicted rpred2 of 0.87 and CoMSIA predicted rpred2 of 0.78 showed that the predicted values were in good agreement with the experimental values. Molecular docking analysis reveals that π-π interaction with His390 and hydrogen bond interaction with His390/Arg393 is essential for LRH-1 agonistic activity. The results from pharmacophore modelling, 3D-QSAR and molecular docking are complementary to each other and could serve as a powerful tool for the discovery of potent small molecules as LRH-1 agonists.
Small Molecule Protection of Bone Marrow Hematopoietic Stem Cells

DTIC Science & Technology

2015-10-01

several recently identified small molecules can protect hematopoietic stem cells (HSCs) from damage or killing by endogenous aldehydes . Proof-of-concept...anemia bone marrow failure CD34+ hematopoietic stem cells aldehydes formaldehyde DNA damage DNA base adduct DNA-protein crosslink mass...below. Revised Specific Aim 1: Small molecule protection of human cells from aldehyde - induced killing (in vitro studies - no mice or human subjects
Cheminformatics-aided discovery of small-molecule Protein-Protein Interaction (PPI) dual inhibitors of Tumor Necrosis Factor (TNF) and Receptor Activator of NF-κB Ligand (RANKL).

PubMed

Melagraki, Georgia; Ntougkos, Evangelos; Rinotas, Vagelis; Papaneophytou, Christos; Leonis, Georgios; Mavromoustakos, Thomas; Kontopidis, George; Douni, Eleni; Afantitis, Antreas; Kollias, George

2017-04-01

We present an in silico drug discovery pipeline developed and applied for the identification and virtual screening of small-molecule Protein-Protein Interaction (PPI) compounds that act as dual inhibitors of TNF and RANKL through the trimerization interface. The cheminformatics part of the pipeline was developed by combining structure-based with ligand-based modeling using the largest available set of known TNF inhibitors in the literature (2481 small molecules). To facilitate virtual screening, the consensus predictive model was made freely available at: http://enalos.insilicotox.com/TNFPubChem/. We thus generated a priority list of nine small molecules as candidates for direct TNF function inhibition. In vitro evaluation of these compounds led to the selection of two small molecules that act as potent direct inhibitors of TNF function, with IC50 values comparable to those of a previously-described direct inhibitor (SPD304), but with significantly reduced toxicity. These molecules were also identified as RANKL inhibitors and validated in vitro with respect to this second functionality. Direct binding of the two compounds was confirmed both for TNF and RANKL, as well as their ability to inhibit the biologically-active trimer forms. Molecular dynamics calculations were also carried out for the two small molecules in each protein to offer additional insight into the interactions that govern TNF and RANKL complex formation. To our knowledge, these compounds, namely T8 and T23, constitute the second and third published examples of dual small-molecule direct function inhibitors of TNF and RANKL, and could serve as lead compounds for the development of novel treatments for inflammatory and autoimmune diseases.
13 CFR 121.109 - What must a concern do in order to be identified as a small business concern in any Federal...

Code of Federal Regulations, 2014 CFR

2014-01-01

... be identified as a small business concern in any Federal procurement databases? 121.109 Section 121... order to be identified as a small business concern in any Federal procurement databases? (a) In order to be identified as a small business concern in the System for Award Management (SAM) database (or any...
Analysis of secondary structural elements in human microRNA hairpin precursors.

PubMed

Liu, Biao; Childs-Disney, Jessica L; Znosko, Brent M; Wang, Dan; Fallahi, Mohammad; Gallo, Steven M; Disney, Matthew D

2016-03-01

MicroRNAs (miRNAs) regulate gene expression by targeting complementary mRNAs for destruction or translational repression. Aberrant expression of miRNAs has been associated with various diseases including cancer, thus making them interesting therapeutic targets. The composite of secondary structural elements that comprise miRNAs could aid the design of small molecules that modulate their function. We analyzed the secondary structural elements, or motifs, present in all human miRNA hairpin precursors and compared them to highly expressed human RNAs with known structures and other RNAs from various organisms. Amongst human miRNAs, there are 3808 are unique motifs, many residing in processing sites. Further, we identified motifs in miRNAs that are not present in other highly expressed human RNAs, desirable targets for small molecules. MiRNA motifs were incorporated into a searchable database that is freely available. We also analyzed the most frequently occurring bulges and internal loops for each RNA class and found that the smallest loops possible prevail. However, the distribution of loops and the preferred closing base pairs were unique to each class. Collectively, we have completed a broad survey of motifs found in human miRNA precursors, highly expressed human RNAs, and RNAs from other organisms. Interestingly, unique motifs were identified in human miRNA processing sites, binding to which could inhibit miRNA maturation and hence function.
Gene-centric meta-analysis in 87,736 individuals of European ancestry identifies multiple blood-pressure-related loci.

PubMed

Tragante, Vinicius; Barnes, Michael R; Ganesh, Santhi K; Lanktree, Matthew B; Guo, Wei; Franceschini, Nora; Smith, Erin N; Johnson, Toby; Holmes, Michael V; Padmanabhan, Sandosh; Karczewski, Konrad J; Almoguera, Berta; Barnard, John; Baumert, Jens; Chang, Yen-Pei Christy; Elbers, Clara C; Farrall, Martin; Fischer, Mary E; Gaunt, Tom R; Gho, Johannes M I H; Gieger, Christian; Goel, Anuj; Gong, Yan; Isaacs, Aaron; Kleber, Marcus E; Mateo Leach, Irene; McDonough, Caitrin W; Meijs, Matthijs F L; Melander, Olle; Nelson, Christopher P; Nolte, Ilja M; Pankratz, Nathan; Price, Tom S; Shaffer, Jonathan; Shah, Sonia; Tomaszewski, Maciej; van der Most, Peter J; Van Iperen, Erik P A; Vonk, Judith M; Witkowska, Kate; Wong, Caroline O L; Zhang, Li; Beitelshees, Amber L; Berenson, Gerald S; Bhatt, Deepak L; Brown, Morris; Burt, Amber; Cooper-DeHoff, Rhonda M; Connell, John M; Cruickshanks, Karen J; Curtis, Sean P; Davey-Smith, George; Delles, Christian; Gansevoort, Ron T; Guo, Xiuqing; Haiqing, Shen; Hastie, Claire E; Hofker, Marten H; Hovingh, G Kees; Kim, Daniel S; Kirkland, Susan A; Klein, Barbara E; Klein, Ronald; Li, Yun R; Maiwald, Steffi; Newton-Cheh, Christopher; O'Brien, Eoin T; Onland-Moret, N Charlotte; Palmas, Walter; Parsa, Afshin; Penninx, Brenda W; Pettinger, Mary; Vasan, Ramachandran S; Ranchalis, Jane E; M Ridker, Paul; Rose, Lynda M; Sever, Peter; Shimbo, Daichi; Steele, Laura; Stolk, Ronald P; Thorand, Barbara; Trip, Mieke D; van Duijn, Cornelia M; Verschuren, W Monique; Wijmenga, Cisca; Wyatt, Sharon; Young, J Hunter; Zwinderman, Aeilko H; Bezzina, Connie R; Boerwinkle, Eric; Casas, Juan P; Caulfield, Mark J; Chakravarti, Aravinda; Chasman, Daniel I; Davidson, Karina W; Doevendans, Pieter A; Dominiczak, Anna F; FitzGerald, Garret A; Gums, John G; Fornage, Myriam; Hakonarson, Hakon; Halder, Indrani; Hillege, Hans L; Illig, Thomas; Jarvik, Gail P; Johnson, Julie A; Kastelein, John J P; Koenig, Wolfgang; Kumari, Meena; März, Winfried; Murray, Sarah S; O'Connell, Jeffery R; Oldehinkel, Albertine J; Pankow, James S; Rader, Daniel J; Redline, Susan; Reilly, Muredach P; Schadt, Eric E; Kottke-Marchant, Kandice; Snieder, Harold; Snyder, Michael; Stanton, Alice V; Tobin, Martin D; Uitterlinden, André G; van der Harst, Pim; van der Schouw, Yvonne T; Samani, Nilesh J; Watkins, Hugh; Johnson, Andrew D; Reiner, Alex P; Zhu, Xiaofeng; de Bakker, Paul I W; Levy, Daniel; Asselbergs, Folkert W; Munroe, Patricia B; Keating, Brendan J

2014-03-06

Blood pressure (BP) is a heritable risk factor for cardiovascular disease. To investigate genetic associations with systolic BP (SBP), diastolic BP (DBP), mean arterial pressure (MAP), and pulse pressure (PP), we genotyped ~50,000 SNPs in up to 87,736 individuals of European ancestry and combined these in a meta-analysis. We replicated findings in an independent set of 68,368 individuals of European ancestry. Our analyses identified 11 previously undescribed associations in independent loci containing 31 genes including PDE1A, HLA-DQB1, CDK6, PRKAG2, VCL, H19, NUCB2, RELA, HOXC@ complex, FBN1, and NFAT5 at the Bonferroni-corrected array-wide significance threshold (p < 6 × 10(-7)) and confirmed 27 previously reported associations. Bioinformatic analysis of the 11 loci provided support for a putative role in hypertension of several genes, such as CDK6 and NUCB2. Analysis of potential pharmacological targets in databases of small molecules showed that ten of the genes are predicted to be a target for small molecules. In summary, we identified previously unknown loci associated with BP. Our findings extend our understanding of genes involved in BP regulation, which may provide new targets for therapeutic intervention or drug response stratification. Copyright © 2014 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

Structural insight into exosite binding and discovery of novel exosite inhibitors of botulinum neurotoxin serotype A through in silico screening

NASA Astrophysics Data System (ADS)

Hu, Xin; Legler, Patricia M.; Southall, Noel; Maloney, David J.; Simeonov, Anton; Jadhav, Ajit

2014-07-01

Botulinum neurotoxin serotype A (BoNT/A) is the most lethal toxin among the Tier 1 Select Agents. Development of potent and selective small molecule inhibitors against BoNT/A zinc metalloprotease remains a challenging problem due to its exceptionally large substrate binding surface and conformational plasticity. The exosites of the catalytic domain of BoNT/A are intriguing alternative sites for small molecule intervention, but their suitability for inhibitor design remains largely unexplored. In this study, we employed two recently identified exosite inhibitors, D-chicoric acid and lomofungin, to probe the structural features of the exosites and molecular mechanisms of synergistic inhibition. The results showed that D-chicoric acid favors binding at the α-exosite, whereas lomofungin preferentially binds at the β-exosite by mimicking the substrate β-sheet binding interaction. Molecular dynamics simulations and binding interaction analysis of the exosite inhibitors with BoNT/A revealed key elements and hotspots that likely contribute to the inhibitor binding and synergistic inhibition. Finally, we performed database virtual screening for novel inhibitors of BoNT/A targeting the exosites. Hits C1 and C2 showed non-competitive inhibition and likely target the α- and β-exosites, respectively. The identified exosite inhibitors may provide novel candidates for structure-based development of therapeutics against BoNT/A intoxication.
Structural insight into exosite binding and discovery of novel exosite inhibitors of botulinum neurotoxin serotype A through in silico screening.

PubMed

Hu, Xin; Legler, Patricia M; Southall, Noel; Maloney, David J; Simeonov, Anton; Jadhav, Ajit

2014-07-01

Botulinum neurotoxin serotype A (BoNT/A) is the most lethal toxin among the Tier 1 Select Agents. Development of potent and selective small molecule inhibitors against BoNT/A zinc metalloprotease remains a challenging problem due to its exceptionally large substrate binding surface and conformational plasticity. The exosites of the catalytic domain of BoNT/A are intriguing alternative sites for small molecule intervention, but their suitability for inhibitor design remains largely unexplored. In this study, we employed two recently identified exosite inhibitors, D-chicoric acid and lomofungin, to probe the structural features of the exosites and molecular mechanisms of synergistic inhibition. The results showed that D-chicoric acid favors binding at the α-exosite, whereas lomofungin preferentially binds at the β-exosite by mimicking the substrate β-sheet binding interaction. Molecular dynamics simulations and binding interaction analysis of the exosite inhibitors with BoNT/A revealed key elements and hotspots that likely contribute to the inhibitor binding and synergistic inhibition. Finally, we performed database virtual screening for novel inhibitors of BoNT/A targeting the exosites. Hits C1 and C2 showed non-competitive inhibition and likely target the α- and β-exosites, respectively. The identified exosite inhibitors may provide novel candidates for structure-based development of therapeutics against BoNT/A intoxication.
A Computational Approach to Finding Novel Targets for Existing Drugs

PubMed Central

Li, Yvonne Y.; An, Jianghong; Jones, Steven J. M.

2011-01-01

Repositioning existing drugs for new therapeutic uses is an efficient approach to drug discovery. We have developed a computational drug repositioning pipeline to perform large-scale molecular docking of small molecule drugs against protein drug targets, in order to map the drug-target interaction space and find novel interactions. Our method emphasizes removing false positive interaction predictions using criteria from known interaction docking, consensus scoring, and specificity. In all, our database contains 252 human protein drug targets that we classify as reliable-for-docking as well as 4621 approved and experimental small molecule drugs from DrugBank. These were cross-docked, then filtered through stringent scoring criteria to select top drug-target interactions. In particular, we used MAPK14 and the kinase inhibitor BIM-8 as examples where our stringent thresholds enriched the predicted drug-target interactions with known interactions up to 20 times compared to standard score thresholds. We validated nilotinib as a potent MAPK14 inhibitor in vitro (IC50 40 nM), suggesting a potential use for this drug in treating inflammatory diseases. The published literature indicated experimental evidence for 31 of the top predicted interactions, highlighting the promising nature of our approach. Novel interactions discovered may lead to the drug being repositioned as a therapeutic treatment for its off-target's associated disease, added insight into the drug's mechanism of action, and added insight into the drug's side effects. PMID:21909252
A dictionary to identify small molecules and drugs in free text.

PubMed

Hettne, Kristina M; Stierum, Rob H; Schuemie, Martijn J; Hendriksen, Peter J M; Schijvenaars, Bob J A; Mulligen, Erik M van; Kleinjans, Jos; Kors, Jan A

2009-11-15

From the scientific community, a lot of effort has been spent on the correct identification of gene and protein names in text, while less effort has been spent on the correct identification of chemical names. Dictionary-based term identification has the power to recognize the diverse representation of chemical information in the literature and map the chemicals to their database identifiers. We developed a dictionary for the identification of small molecules and drugs in text, combining information from UMLS, MeSH, ChEBI, DrugBank, KEGG, HMDB and ChemIDplus. Rule-based term filtering, manual check of highly frequent terms and disambiguation rules were applied. We tested the combined dictionary and the dictionaries derived from the individual resources on an annotated corpus, and conclude the following: (i) each of the different processing steps increase precision with a minor loss of recall; (ii) the overall performance of the combined dictionary is acceptable (precision 0.67, recall 0.40 (0.80 for trivial names); (iii) the combined dictionary performed better than the dictionary in the chemical recognizer OSCAR3; (iv) the performance of a dictionary based on ChemIDplus alone is comparable to the performance of the combined dictionary. The combined dictionary is freely available as an XML file in Simple Knowledge Organization System format on the web site http://www.biosemantics.org/chemlist.
Role of Chemical Reactivity and Transition State Modeling for Virtual Screening.

PubMed

Karthikeyan, Muthukumarasamy; Vyas, Renu; Tambe, Sanjeev S; Radhamohan, Deepthi; Kulkarni, Bhaskar D

2015-01-01

Every drug discovery research program involves synthesis of a novel and potential drug molecule utilizing atom efficient, economical and environment friendly synthetic strategies. The current work focuses on the role of the reactivity based fingerprints of compounds as filters for virtual screening using a tool ChemScore. A reactant-like (RLS) and a product- like (PLS) score can be predicted for a given compound using the binary fingerprints derived from the numerous known organic reactions which capture the molecule-molecule interactions in the form of addition, substitution, rearrangement, elimination and isomerization reactions. The reaction fingerprints were applied to large databases in biology and chemistry, namely ChEMBL, KEGG, HMDB, DSSTox, and the Drug Bank database. A large network of 1113 synthetic reactions was constructed to visualize and ascertain the reactant product mappings in the chemical reaction space. The cumulative reaction fingerprints were computed for 4000 molecules belonging to 29 therapeutic classes of compounds, and these were found capable of discriminating between the cognition disorder related and anti-allergy compounds with reasonable accuracy of 75% and AUC 0.8. In this study, the transition state based fingerprints were also developed and used effectively for virtual screening in drug related databases. The methodology presented here provides an efficient handle for the rapid scoring of molecular libraries for virtual screening.
Profiling protein function with small molecule microarrays

PubMed Central

Winssinger, Nicolas; Ficarro, Scott; Schultz, Peter G.; Harris, Jennifer L.

2002-01-01

The regulation of protein function through posttranslational modification, local environment, and protein–protein interaction is critical to cellular function. The ability to analyze on a genome-wide scale protein functional activity rather than changes in protein abundance or structure would provide important new insights into complex biological processes. Herein, we report the application of a spatially addressable small molecule microarray to an activity-based profile of proteases in crude cell lysates. The potential of this small molecule-based profiling technology is demonstrated by the detection of caspase activation upon induction of apoptosis, characterization of the activated caspase, and inhibition of the caspase-executed apoptotic phenotype using the small molecule inhibitor identified in the microarray-based profile. PMID:12167675
Identification of Thiotetronic Acid Antibiotic Biosynthetic Pathways by Target-directed Genome Mining.

PubMed

Tang, Xiaoyu; Li, Jie; Millán-Aguiñaga, Natalie; Zhang, Jia Jia; O'Neill, Ellis C; Ugalde, Juan A; Jensen, Paul R; Mantovani, Simone M; Moore, Bradley S

2015-12-18

Recent genome sequencing efforts have led to the rapid accumulation of uncharacterized or "orphaned" secondary metabolic biosynthesis gene clusters (BGCs) in public databases. This increase in DNA-sequenced big data has given rise to significant challenges in the applied field of natural product genome mining, including (i) how to prioritize the characterization of orphan BGCs and (ii) how to rapidly connect genes to biosynthesized small molecules. Here, we show that by correlating putative antibiotic resistance genes that encode target-modified proteins with orphan BGCs, we predict the biological function of pathway specific small molecules before they have been revealed in a process we call target-directed genome mining. By querying the pan-genome of 86 Salinispora bacterial genomes for duplicated house-keeping genes colocalized with natural product BGCs, we prioritized an orphan polyketide synthase-nonribosomal peptide synthetase hybrid BGC (tlm) with a putative fatty acid synthase resistance gene. We employed a new synthetic double-stranded DNA-mediated cloning strategy based on transformation-associated recombination to efficiently capture tlm and the related ttm BGCs directly from genomic DNA and to heterologously express them in Streptomyces hosts. We show the production of a group of unusual thiotetronic acid natural products, including the well-known fatty acid synthase inhibitor thiolactomycin that was first described over 30 years ago, yet never at the genetic level in regards to biosynthesis and autoresistance. This finding not only validates the target-directed genome mining strategy for the discovery of antibiotic producing gene clusters without a priori knowledge of the molecule synthesized but also paves the way for the investigation of novel enzymology involved in thiotetronic acid natural product biosynthesis.
Free-standing few-layered graphene oxide films: selective, steady and lasting permeation of organic molecules with adjustable speeds

NASA Astrophysics Data System (ADS)

Huang, Tao; An, Qi; Luan, Xinglong; Zhang, Qian; Zhang, Yihe

2016-01-01

A variety of small molecules with diameters around 1 nm possess a range of functions, such as antibiotic, antimicrobic, anticoagulant, pesticidal and chemotherapy effects, making these molecules especially useful in various applications ranging from medical treatment to environmental microbiological control. However, the long-term steady delivery (release or permeation) of these small molecules with adjustable and controllable speeds has remained an especially challenging task. In this study, we prepared covalently cross-linked free-standing few-layered GO films using a layer-by-layer technique in combination with photochemical cross-linkages, and achieved a controlled release of positively charged, negatively charged, and zwitterionic small molecules with adjustable and controllable speeds. The steady delivery of the small molecule lasted up to 9 days. Other functionalities, such as graphene-enhanced Raman spectra and electrochemical properties that could also be integrated or employed in delivery systems, were also studied for our films. We expect the special molecular delivery properties of our films to lead to new possibilities in drug/fertilizer delivery and environmental microbiological control applications.A variety of small molecules with diameters around 1 nm possess a range of functions, such as antibiotic, antimicrobic, anticoagulant, pesticidal and chemotherapy effects, making these molecules especially useful in various applications ranging from medical treatment to environmental microbiological control. However, the long-term steady delivery (release or permeation) of these small molecules with adjustable and controllable speeds has remained an especially challenging task. In this study, we prepared covalently cross-linked free-standing few-layered GO films using a layer-by-layer technique in combination with photochemical cross-linkages, and achieved a controlled release of positively charged, negatively charged, and zwitterionic small molecules with adjustable and controllable speeds. The steady delivery of the small molecule lasted up to 9 days. Other functionalities, such as graphene-enhanced Raman spectra and electrochemical properties that could also be integrated or employed in delivery systems, were also studied for our films. We expect the special molecular delivery properties of our films to lead to new possibilities in drug/fertilizer delivery and environmental microbiological control applications. Electronic supplementary information (ESI) available: AFM images of GO and GO films, UV-vis spectra of delayed release, and permeation fidelities. See DOI: 10.1039/c5nr08129g
Metabolonote: A Wiki-Based Database for Managing Hierarchical Metadata of Metabolome Analyses

PubMed Central

Ara, Takeshi; Enomoto, Mitsuo; Arita, Masanori; Ikeda, Chiaki; Kera, Kota; Yamada, Manabu; Nishioka, Takaaki; Ikeda, Tasuku; Nihei, Yoshito; Shibata, Daisuke; Kanaya, Shigehiko; Sakurai, Nozomu

2015-01-01

Metabolomics – technology for comprehensive detection of small molecules in an organism – lags behind the other “omics” in terms of publication and dissemination of experimental data. Among the reasons for this are difficulty precisely recording information about complicated analytical experiments (metadata), existence of various databases with their own metadata descriptions, and low reusability of the published data, resulting in submitters (the researchers who generate the data) being insufficiently motivated. To tackle these issues, we developed Metabolonote, a Semantic MediaWiki-based database designed specifically for managing metabolomic metadata. We also defined a metadata and data description format, called “Togo Metabolome Data” (TogoMD), with an ID system that is required for unique access to each level of the tree-structured metadata such as study purpose, sample, analytical method, and data analysis. Separation of the management of metadata from that of data and permission to attach related information to the metadata provide advantages for submitters, readers, and database developers. The metadata are enriched with information such as links to comparable data, thereby functioning as a hub of related data resources. They also enhance not only readers’ understanding and use of data but also submitters’ motivation to publish the data. The metadata are computationally shared among other systems via APIs, which facilitate the construction of novel databases by database developers. A permission system that allows publication of immature metadata and feedback from readers also helps submitters to improve their metadata. Hence, this aspect of Metabolonote, as a metadata preparation tool, is complementary to high-quality and persistent data repositories such as MetaboLights. A total of 808 metadata for analyzed data obtained from 35 biological species are published currently. Metabolonote and related tools are available free of cost at http://metabolonote.kazusa.or.jp/. PMID:25905099
Metabolonote: a wiki-based database for managing hierarchical metadata of metabolome analyses.

PubMed

Ara, Takeshi; Enomoto, Mitsuo; Arita, Masanori; Ikeda, Chiaki; Kera, Kota; Yamada, Manabu; Nishioka, Takaaki; Ikeda, Tasuku; Nihei, Yoshito; Shibata, Daisuke; Kanaya, Shigehiko; Sakurai, Nozomu

2015-01-01

Metabolomics - technology for comprehensive detection of small molecules in an organism - lags behind the other "omics" in terms of publication and dissemination of experimental data. Among the reasons for this are difficulty precisely recording information about complicated analytical experiments (metadata), existence of various databases with their own metadata descriptions, and low reusability of the published data, resulting in submitters (the researchers who generate the data) being insufficiently motivated. To tackle these issues, we developed Metabolonote, a Semantic MediaWiki-based database designed specifically for managing metabolomic metadata. We also defined a metadata and data description format, called "Togo Metabolome Data" (TogoMD), with an ID system that is required for unique access to each level of the tree-structured metadata such as study purpose, sample, analytical method, and data analysis. Separation of the management of metadata from that of data and permission to attach related information to the metadata provide advantages for submitters, readers, and database developers. The metadata are enriched with information such as links to comparable data, thereby functioning as a hub of related data resources. They also enhance not only readers' understanding and use of data but also submitters' motivation to publish the data. The metadata are computationally shared among other systems via APIs, which facilitate the construction of novel databases by database developers. A permission system that allows publication of immature metadata and feedback from readers also helps submitters to improve their metadata. Hence, this aspect of Metabolonote, as a metadata preparation tool, is complementary to high-quality and persistent data repositories such as MetaboLights. A total of 808 metadata for analyzed data obtained from 35 biological species are published currently. Metabolonote and related tools are available free of cost at http://metabolonote.kazusa.or.jp/.
The electric dipole moment of DNA-binding HU protein calculated by the use of an NMR database.

PubMed

Takashima, S; Yamaoka, K

1999-08-30

Electric birefringence measurements indicated the presence of a large permanent dipole moment in HU protein-DNA complex. In order to substantiate this observation, numerical computation of the dipole moment of HU protein homodimer was carried out by using NMR protein databases. The dipole moments of globular proteins have hitherto been calculated with X-ray databases and NMR data have never been used before. The advantages of NMR databases are: (a) NMR data are obtained, unlike X-ray databases, using protein solutions. Accordingly, this method eliminates the bothersome question as to the possible alteration of the protein structure due to the transition from the crystalline state to the solution state. This question is particularly important for proteins such as HU protein which has some degree of internal flexibility; (b) the three-dimensional coordinates of hydrogen atoms in protein molecules can be determined with a sufficient resolution and this enables the N-H as well as C = O bond moments to be calculated. Since the NMR database of HU protein from Bacillus stearothermophilus consists of 25 models, the surface charge as well as the core dipole moments were computed for each of these structures. The results of these calculations show that the net permanent dipole moments of HU protein homodimer is approximately 500-530 D (1 D = 3.33 x 10(-30) Cm) at pH 7.5 and 600-630 D at the isoelectric point (pH 10.5). These permanent dipole moments are unusually large for a small protein of the size of 19.5 kDa. Nevertheless, the result of numerical calculations is compatible with the electro-optical observation, confirming a very large dipole moment in this protein.
Fast 3D shape screening of large chemical databases through alignment-recycling

PubMed Central

Fontaine, Fabien; Bolton, Evan; Borodina, Yulia; Bryant, Stephen H

2007-01-01

Background Large chemical databases require fast, efficient, and simple ways of looking for similar structures. Although such tasks are now fairly well resolved for graph-based similarity queries, they remain an issue for 3D approaches, particularly for those based on 3D shape overlays. Inspired by a recent technique developed to compare molecular shapes, we designed a hybrid methodology, alignment-recycling, that enables efficient retrieval and alignment of structures with similar 3D shapes. Results Using a dataset of more than one million PubChem compounds of limited size (< 28 heavy atoms) and flexibility (< 6 rotatable bonds), we obtained a set of a few thousand diverse structures covering entirely the 3D shape space of the conformers of the dataset. Transformation matrices gathered from the overlays between these diverse structures and the 3D conformer dataset allowed us to drastically (100-fold) reduce the CPU time required for shape overlay. The alignment-recycling heuristic produces results consistent with de novo alignment calculation, with better than 80% hit list overlap on average. Conclusion Overlay-based 3D methods are computationally demanding when searching large databases. Alignment-recycling reduces the CPU time to perform shape similarity searches by breaking the alignment problem into three steps: selection of diverse shapes to describe the database shape-space; overlay of the database conformers to the diverse shapes; and non-optimized overlay of query and database conformers using common reference shapes. The precomputation, required by the first two steps, is a significant cost of the method; however, once performed, querying is two orders of magnitude faster. Extensions and variations of this methodology, for example, to handle more flexible and larger small-molecules are discussed. PMID:17880744
Ligand solvation in molecular docking.

PubMed

Shoichet, B K; Leach, A R; Kuntz, I D

1999-01-01

Solvation plays an important role in ligand-protein association and has a strong impact on comparisons of binding energies for dissimilar molecules. When databases of such molecules are screened for complementarity to receptors of known structure, as often occurs in structure-based inhibitor discovery, failure to consider ligand solvation often leads to putative ligands that are too highly charged or too large. To correct for the different charge states and sizes of the ligands, we calculated electrostatic and non-polar solvation free energies for molecules in a widely used molecular database, the Available Chemicals Directory (ACD). A modified Born equation treatment was used to calculate the electrostatic component of ligand solvation. The non-polar component of ligand solvation was calculated based on the surface area of the ligand and parameters derived from the hydration energies of apolar ligands. These solvation energies were subtracted from the ligand-receptor interaction energies. We tested the usefulness of these corrections by screening the ACD for molecules that complemented three proteins of known structure, using a molecular docking program. Correcting for ligand solvation improved the rankings of known ligands and discriminated against molecules with inappropriate charge states and sizes.
A Fragment-Based Method of Creating Small-Molecule Libraries to Target the Aggregation of Intrinsically Disordered Proteins.

PubMed

Joshi, Priyanka; Chia, Sean; Habchi, Johnny; Knowles, Tuomas P J; Dobson, Christopher M; Vendruscolo, Michele

2016-03-14

The aggregation process of intrinsically disordered proteins (IDPs) has been associated with a wide range of neurodegenerative disorders, including Alzheimer's and Parkinson's diseases. Currently, however, no drug in clinical use targets IDP aggregation. To facilitate drug discovery programs in this important and challenging area, we describe a fragment-based approach of generating small-molecule libraries that target specific IDPs. The method is based on the use of molecular fragments extracted from compounds reported in the literature to inhibit of the aggregation of IDPs. These fragments are used to screen existing large generic libraries of small molecules to form smaller libraries specific for given IDPs. We illustrate this approach by describing three distinct small-molecule libraries to target, Aβ, tau, and α-synuclein, which are three IDPs implicated in Alzheimer's and Parkinson's diseases. The strategy described here offers novel opportunities for the identification of effective molecular scaffolds for drug discovery for neurodegenerative disorders and to provide insights into the mechanism of small-molecule binding to IDPs.
Systematic development of small molecules to inhibit specific microscopic steps of Aβ42 aggregation in Alzheimer’s disease

PubMed Central

Habchi, Johnny; Chia, Sean; Limbocker, Ryan; Mannini, Benedetta; Ahn, Minkoo; Perni, Michele; Hansson, Oskar; Arosio, Paolo; Kumita, Janet R.; Challa, Pavan Kumar; Cohen, Samuel I. A.; Dobson, Christopher M.; Knowles, Tuomas P. J.; Vendruscolo, Michele

2017-01-01

The aggregation of the 42-residue form of the amyloid-β peptide (Aβ42) is a pivotal event in Alzheimer’s disease (AD). The use of chemical kinetics has recently enabled highly accurate quantifications of the effects of small molecules on specific microscopic steps in Aβ42 aggregation. Here, we exploit this approach to develop a rational drug discovery strategy against Aβ42 aggregation that uses as a read-out the changes in the nucleation and elongation rate constants caused by candidate small molecules. We thus identify a pool of compounds that target specific microscopic steps in Aβ42 aggregation. We then test further these small molecules in human cerebrospinal fluid and in a Caenorhabditis elegans model of AD. Our results show that this strategy represents a powerful approach to identify systematically small molecule lead compounds, thus offering an appealing opportunity to reduce the attrition problem in drug discovery. PMID:28011763
Genome-Scale Architecture of Small Molecule Regulatory Networks and the Fundamental Trade-Off between Regulation and Enzymatic Activity

DOE Office of Scientific and Technical Information (OSTI.GOV)

Reznik, Ed; Christodoulou, Dimitris; Goldford, Joshua E.

Metabolic flux is in part regulated by endogenous small molecules that modulate the catalytic activity of an enzyme, e.g., allosteric inhibition. In contrast to transcriptional regulation of enzymes, technical limitations have hindered the production of a genome-scale atlas of small molecule-enzyme regulatory interactions. Here, we develop a framework leveraging the vast, but fragmented, biochemical literature to reconstruct and analyze the small molecule regulatory network (SMRN) of the model organism Escherichia coli, including the primary metabolite regulators and enzyme targets. Using metabolic control analysis, we prove a fundamental trade-off between regulation and enzymatic activity, and we combine it with metabolomic measurementsmore » and the SMRN to make inferences on the sensitivity of enzymes to their regulators. By generalizing the analysis to other organisms, we identify highly conserved regulatory interactions across evolutionarily divergent species, further emphasizing a critical role for small molecule interactions in the maintenance of metabolic homeostasis.« less
Simultaneous optimization of biomolecular energy function on features from small molecules and macromolecules

PubMed Central

Park, Hahnbeom; Bradley, Philip; Greisen, Per; Liu, Yuan; Mulligan, Vikram Khipple; Kim, David E.; Baker, David; DiMaio, Frank

2017-01-01

Most biomolecular modeling energy functions for structure prediction, sequence design, and molecular docking, have been parameterized using existing macromolecular structural data; this contrasts molecular mechanics force fields which are largely optimized using small-molecule data. In this study, we describe an integrated method that enables optimization of a biomolecular modeling energy function simultaneously against small-molecule thermodynamic data and high-resolution macromolecular structural data. We use this approach to develop a next-generation Rosetta energy function that utilizes a new anisotropic implicit solvation model, and an improved electrostatics and Lennard-Jones model, illustrating how energy functions can be considerably improved in their ability to describe large-scale energy landscapes by incorporating both small-molecule and macromolecule data. The energy function improves performance in a wide range of protein structure prediction challenges, including monomeric structure prediction, protein-protein and protein-ligand docking, protein sequence design, and prediction of the free energy changes by mutation, while reasonably recapitulating small-molecule thermodynamic properties. PMID:27766851
Identifying the preferred RNA motifs and chemotypes that interact by probing millions of combinations.

PubMed

Tran, Tuan; Disney, Matthew D

2012-01-01

RNA is an important therapeutic target but information about RNA-ligand interactions is limited. Here, we report a screening method that probes over 3,000,000 combinations of RNA motif-small molecule interactions to identify the privileged RNA structures and chemical spaces that interact. Specifically, a small molecule library biased for binding RNA was probed for binding to over 70,000 unique RNA motifs in a high throughput solution-based screen. The RNA motifs that specifically bind each small molecule were identified by microarray-based selection. In this library-versus-library or multidimensional combinatorial screening approach, hairpin loops (among a variety of RNA motifs) were the preferred RNA motif space that binds small molecules. Furthermore, it was shown that indole, 2-phenyl indole, 2-phenyl benzimidazole and pyridinium chemotypes allow for specific recognition of RNA motifs. As targeting RNA with small molecules is an extremely challenging area, these studies provide new information on RNA-ligand interactions that has many potential uses.
Design of a bioactive small molecule that targets r(AUUCU) repeats in spinocerebellar ataxia 10.

PubMed

Yang, Wang-Yong; Gao, Rui; Southern, Mark; Sarkar, Partha S; Disney, Matthew D

2016-06-01

RNA is an important target for chemical probes of function and lead therapeutics; however, it is difficult to target with small molecules. One approach to tackle this problem is to identify compounds that target RNA structures and utilize them to multivalently target RNA. Here we show that small molecules can be identified to selectively bind RNA base pairs by probing a library of RNA-focused small molecules. A small molecule that selectively binds AU base pairs informed design of a dimeric compound (2AU-2) that targets the pathogenic RNA, expanded r(AUUCU) repeats, that causes spinocerebellar ataxia type 10 (SCA10) in patient-derived cells. Indeed, 2AU-2 (50 nM) ameliorates various aspects of SCA10 pathology including improvement of mitochondrial dysfunction, reduced activation of caspase 3, and reduction of nuclear foci. These studies provide a first-in-class chemical probe to study SCA10 RNA toxicity and potentially define broadly applicable compounds targeting RNA AU base pairs in cells.
Identifying the Preferred RNA Motifs and Chemotypes that Interact by Probing Millions of Combinations

PubMed Central

Tran, Tuan; Disney, Matthew D.

2012-01-01

RNA is an important therapeutic target but information about RNA-ligand interactions is limited. Here we report a screening method that probes over 3,000,000 combinations of RNA motif-small molecule interactions to identify the privileged RNA structures and chemical spaces that interact. Specifically, a small molecule library biased for binding RNA was probed for binding to over 70,000 unique RNA motifs in a high throughput solution-based screen. The RNA motifs that specifically bind each small molecule were identified by microarray-based selection. In this library-versus-library or multidimensional combinatorial screening approach, hairpin loops (amongst a variety of RNA motifs) were the preferred RNA motif space that binds small molecules. Furthermore, it was shown that indole, 2-phenyl indole, 2-phenyl benzimidazole, and pyridinium chemotypes allow for specific recognition of RNA motifs. Since targeting RNA with small molecules is an extremely challenging area, these studies provide new information on RNA-ligand interactions that has many potential uses. PMID:23047683

Small-Molecule-Directed Hepatocyte-Like Cell Differentiation of Human Pluripotent Stem Cells.

PubMed

Mathapati, Santosh; Siller, Richard; Impellizzeri, Agata A R; Lycke, Max; Vegheim, Karianne; Almaas, Runar; Sullivan, Gareth J

2016-08-17

Hepatocyte-like cells (HLCs) generated in vitro from human pluripotent stem cells (hPSCs) provide an invaluable resource for basic research, regenerative medicine, drug screening, toxicology, and modeling of liver disease and development. This unit describes a small-molecule-driven protocol for in vitro differentiation of hPSCs into HLCs without the use of growth factors. hPSCs are coaxed through a developmentally relevant route via the primitive streak to definitive endoderm (DE) using the small molecule CHIR99021 (a Wnt agonist), replacing the conventional growth factors Wnt3A and activin A. The small-molecule-derived DE is then differentiated to hepatoblast-like cells in the presence of dimethyl sulfoxide. The resulting hepatoblasts are then differentiated to HLCs with N-hexanoic-Tyr, Ile-6 aminohexanoic amide (Dihexa, a hepatocyte growth factor agonist) and dexamethasone. The protocol provides an efficient and reproducible procedure for differentiation of hPSCs into HLCs utilizing small molecules. © 2016 by John Wiley & Sons, Inc. Copyright © 2016 John Wiley & Sons, Inc.
Bioorthogonal cyclization-mediated in situ self-assembly of small-molecule probes for imaging caspase activity in vivo

NASA Astrophysics Data System (ADS)

Ye, Deju; Shuhendler, Adam J.; Cui, Lina; Tong, Ling; Tee, Sui Seng; Tikhomirov, Grigory; Felsher, Dean W.; Rao, Jianghong

2014-06-01

Directed self-assembly of small molecules in living systems could enable a myriad of applications in biology and medicine, and already this has been used widely to synthesize supramolecules and nano/microstructures in solution and in living cells. However, controlling the self-assembly of synthetic small molecules in living animals is challenging because of the complex and dynamic in vivo physiological environment. Here we employ an optimized first-order bioorthogonal cyclization reaction to control the self-assembly of a fluorescent small molecule, and demonstrate its in vivo applicability by imaging caspase-3/7 activity in human tumour xenograft mouse models of chemotherapy. The fluorescent nanoparticles assembled in situ were imaged successfully in both apoptotic cells and tumour tissues using three-dimensional structured illumination microscopy. This strategy combines the advantages offered by small molecules with those of nanomaterials and should find widespread use for non-invasive imaging of enzyme activity in vivo.
9,10-Azaboraphenanthrene-containing small molecules and conjugated polymers: synthesis and their application in chemodosimeters for the ratiometric detection of fluoride ions.

PubMed

Zhang, Weidong; Li, Guoping; Xu, Letian; Zhuo, Yue; Wan, Wenming; Yan, Ni; He, Gang

2018-05-21

The introduction of main group elements into conjugated scaffolds is emerging as a key route to novel optoelectronic materials. Herein, an efficient and versatile way to synthesize polymerizable 9,10-azaboraphenanthrene ( BNP )-containing monomers by aromaticity-driven ring expansion reactions between highly antiaromatic borafluorene and azides is reported, and the corresponding conjugated small molecules and polymers are developed as well. The BNP -containing small molecules and conjugated polymers showed good air/moisture stability and notable fluorescence properties. Addition of fluoride ions to the BNP -based small molecules and polymers induced a rapid change in the emission color from blue to green/yellow, respectively, accompanied by strong intensity changes. The conjugated polymers showed better ratiometric sensing performance than small molecules due to the exciton migration along the conjugated chains. Further experiments showed that the sensing process is fully reversible. The films prepared by solution-deposition of BNP -based compounds in the presence of polycaprolactone also showed good ratiometric sensing for fluoride ions.
Small molecules enhance CRISPR genome editing in pluripotent stem cells.

PubMed

Yu, Chen; Liu, Yanxia; Ma, Tianhua; Liu, Kai; Xu, Shaohua; Zhang, Yu; Liu, Honglei; La Russa, Marie; Xie, Min; Ding, Sheng; Qi, Lei S

2015-02-05

The bacterial CRISPR-Cas9 system has emerged as an effective tool for sequence-specific gene knockout through non-homologous end joining (NHEJ), but it remains inefficient for precise editing of genome sequences. Here we develop a reporter-based screening approach for high-throughput identification of chemical compounds that can modulate precise genome editing through homology-directed repair (HDR). Using our screening method, we have identified small molecules that can enhance CRISPR-mediated HDR efficiency, 3-fold for large fragment insertions and 9-fold for point mutations. Interestingly, we have also observed that a small molecule that inhibits HDR can enhance frame shift insertion and deletion (indel) mutations mediated by NHEJ. The identified small molecules function robustly in diverse cell types with minimal toxicity. The use of small molecules provides a simple and effective strategy to enhance precise genome engineering applications and facilitates the study of DNA repair mechanisms in mammalian cells. Copyright © 2015 Elsevier Inc. All rights reserved.
Genome-Scale Architecture of Small Molecule Regulatory Networks and the Fundamental Trade-Off between Regulation and Enzymatic Activity

DOE PAGES

Reznik, Ed; Christodoulou, Dimitris; Goldford, Joshua E.; ...

2017-09-12

Metabolic flux is in part regulated by endogenous small molecules that modulate the catalytic activity of an enzyme, e.g., allosteric inhibition. In contrast to transcriptional regulation of enzymes, technical limitations have hindered the production of a genome-scale atlas of small molecule-enzyme regulatory interactions. Here, we develop a framework leveraging the vast, but fragmented, biochemical literature to reconstruct and analyze the small molecule regulatory network (SMRN) of the model organism Escherichia coli, including the primary metabolite regulators and enzyme targets. Using metabolic control analysis, we prove a fundamental trade-off between regulation and enzymatic activity, and we combine it with metabolomic measurementsmore » and the SMRN to make inferences on the sensitivity of enzymes to their regulators. By generalizing the analysis to other organisms, we identify highly conserved regulatory interactions across evolutionarily divergent species, further emphasizing a critical role for small molecule interactions in the maintenance of metabolic homeostasis.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)

Chang, Yung-Ting; Department of Electrical Engineering, Graduate Institute of Photonics and Optoelectronics, National Taiwan University, Taipei, Taiwan 10617, Taiwan; Liu, Shun-Wei

Single-layer blue phosphorescence organic light emitting diodes (OLEDs) with either small-molecule or polymer hosts are fabricated using solution process and the performances of devices with different hosts are investigated. The small-molecule device exhibits luminous efficiency of 14.7 cd/A and maximum power efficiency of 8.39 lm/W, which is the highest among blue phosphorescence OLEDs with single-layer solution process and small molecular hosts. Using the same solution process for all devices, comparison of light out-coupling enhancement, with brightness enhancement film (BEF), between small-molecule and polymer based OLEDs is realized. Due to different dipole orientation and anisotropic refractive index, polymer-based OLEDs would trap less lightmore » than small molecule-based OLEDs internally, about 37% better based simulation results. In spite of better electrical and spectroscopic characteristics, including ambipolar characteristics, higher carrier mobility, higher photoluminescence quantum yield, and larger triplet state energy, the overall light out-coupling efficiency of small molecule-based devices is worse than that of polymer-based devices without BEF. However, with BEF for light out-coupling enhancement, the improved ratio in luminous flux and luminous efficiency for small molecule based device is 1.64 and 1.57, respectively, which are significantly better than those of PVK (poly-9-vinylcarbazole) devices. In addition to the theoretical optical simulation, the experimental data also confirm the origins of differential light-outcoupling enhancement. The maximum luminous efficiency and power efficiency are enhanced from 14.7 cd/A and 8.39 lm/W to 23 cd/A and 13.2 lm/W, respectively, with laminated BEF, which are both the highest so far for single-layer solution-process blue phosphorescence OLEDs with small molecule hosts.« less
Plasmonic Aptamer-Gold Nanoparticle Sensors for Small Molecule Fingerprint Identification

DTIC Science & Technology

2014-08-01

AFRL-RH-WP-TR-2014-0107 PLASMONIC APTAMER -GOLD NANOPARTICLE SENSORS FOR SMALL MOLECULE FINGERPRINT IDENTIFICATION Jorge Chávez Grant Slusher...Plasmonic Aptamer -Gold Nanoparticle Sensors for Small Molecule Fingerprint Identification 5a. CONTRACT NUMBER N/A 5b. GRANT NUMBER 5c. PROGRAM...The utilization of the plasmonic response of aptamer -gold nanoparticle conjugates (Apt-AuNPs) to design cross- reactive arrays for fingerprint
Covalent small-molecule-RNA complex formation enables cellular profiling of small-molecule-RNA interactions.

PubMed

Guan, Lirui; Disney, Matthew D

2013-09-16

Won't let you go! A strategy is described to design small molecules that react with their cellular RNA targets. This approach not only improves the activity of compounds targeting RNA in cell culture by a factor of about 2500 but also enables cell-wide profiling of its RNA targets. Copyright © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Bottom-up design of small molecules that stimulate exon 10 skipping in mutant MAPT pre-mRNA.

PubMed

Luo, Yiling; Disney, Matthew D

2014-09-22

One challenge in chemical biology is to develop small molecules that control cellular protein content. The amount and identity of proteins are influenced by the RNAs that encode them; thus, protein content in a cell could be affected by targeting mRNA. However, RNA has been traditionally difficult to target with small molecules. In this report, we describe controlling the protein products of the mutated microtubule-associated protein tau (MAPT) mature mRNA with a small molecule. MAPT mutations in exon 10 are associated with inherited frontotemporal dementia and Parkinsonism linked to chromosome 17 (FTDP-17), an incurable disease that is directly caused by increased inclusion of exon 10 in MAPT mRNA. Recent studies have shown that mutations within a hairpin at the MAPT exon 10-intron junction decrease the thermodynamic stability of the RNA, increasing binding to U1 snRNP and thus exon 10 inclusion. Therefore, we designed small molecules that bind and stabilize a mutant MAPT by using Inforna, a computational approach based on information about RNA-small-molecule interactions. The optimal compound selectively bound the mutant MAPT hairpin and thermodynamically stabilized its folding, facilitating exon 10 exclusion. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Small Molecule Supplements Improve Cultured Megakaryocyte Polyploidization by Modulating Multiple Cell Cycle Regulators.

PubMed

Zou, Xiaojing; Qu, Mingyi; Fang, Fang; Fan, Zeng; Chen, Lin; Yue, Wen; Xie, Xiaoyan; Pei, Xuetao

2017-01-01

Platelets (PLTs) are produced by megakaryocytes (MKs) that completed differentiation and endomitosis. Endomitosis is an important process in which the cell replicates its DNA without cytokinesis and develops highly polyploid MK. In this study, to gain a better PLTs production, four small molecules (Rho-Rock inhibitor (RRI), nicotinamide (NIC), Src inhibitor (SI), and Aurora B inhibitor (ABI)) and their combinations were surveyed as MK culture supplements for promoting polyploidization. Three leukemia cell lines as well as primary mononuclear cells were chosen in the function and mechanism studies of the small molecules. In an optimal culture method, cells were treated with different small molecules and their combinations. The impact of the small molecules on megakaryocytic surface marker expression, polyploidy, proliferation, and apoptosis was examined for the best MK polyploidization supplement. The elaborate analysis confirmed that the combination of SI and RRI together with our MK induction system might result in efficient ploidy promotion. Our experiments demonstrated that, besides direct downregulation on the expression of cytoskeleton protein actin, SI and RRI could significantly enhance the level of cyclins through the suppression of p53 and p21. The verified small molecule combination might be further used in the in vitro PLT manufacture and clinical applications.
Small Molecule Supplements Improve Cultured Megakaryocyte Polyploidization by Modulating Multiple Cell Cycle Regulators

PubMed Central

Fang, Fang; Chen, Lin; Yue, Wen

2017-01-01

Platelets (PLTs) are produced by megakaryocytes (MKs) that completed differentiation and endomitosis. Endomitosis is an important process in which the cell replicates its DNA without cytokinesis and develops highly polyploid MK. In this study, to gain a better PLTs production, four small molecules (Rho-Rock inhibitor (RRI), nicotinamide (NIC), Src inhibitor (SI), and Aurora B inhibitor (ABI)) and their combinations were surveyed as MK culture supplements for promoting polyploidization. Three leukemia cell lines as well as primary mononuclear cells were chosen in the function and mechanism studies of the small molecules. In an optimal culture method, cells were treated with different small molecules and their combinations. The impact of the small molecules on megakaryocytic surface marker expression, polyploidy, proliferation, and apoptosis was examined for the best MK polyploidization supplement. The elaborate analysis confirmed that the combination of SI and RRI together with our MK induction system might result in efficient ploidy promotion. Our experiments demonstrated that, besides direct downregulation on the expression of cytoskeleton protein actin, SI and RRI could significantly enhance the level of cyclins through the suppression of p53 and p21. The verified small molecule combination might be further used in the in vitro PLT manufacture and clinical applications. PMID:29201898
Small Molecules Affect Human Dental Pulp Stem Cell Properties Via Multiple Signaling Pathways

PubMed Central

Al-Habib, Mey; Yu, Zongdong

2013-01-01

One fundamental issue regarding stem cells for regenerative medicine is the maintenance of stem cell stemness. The purpose of the study was to test whether small molecules can enhance stem cell properties of mesenchymal stem cells (MSCs) derived from human dental pulp (hDPSCs), which have potential for multiple clinical applications. We identified the effects of small molecules (Pluripotin (SC1), 6-bromoindirubin-3-oxime and rapamycin) on the maintenance of hDPSC properties in vitro and the mechanisms involved in exerting the effects. Primary cultures of hDPSCs were exposed to optimal concentrations of these small molecules. Treated hDPSCs were analyzed for their proliferation, the expression levels of pluripotent and MSC markers, differentiation capacities, and intracellular signaling activations. We found that small molecule treatments decreased cell proliferation and increased the expression of STRO-1, NANOG, OCT4, and SOX2, while diminishing cell differentiation into odonto/osteogenic, adipogenic, and neurogenic lineages in vitro. These effects involved Ras-GAP-, ERK1/2-, and mTOR-signaling pathways, which may preserve the cell self-renewal capacity, while suppressing differentiation. We conclude that small molecules appear to enhance the immature state of hDPSCs in culture, which may be used as a strategy for adult stem cell maintenance and extend their capacity for regenerative applications. PMID:23573877
Methodologies for Studying B. subtilis Biofilms as a Model for Characterizing Small Molecule Biofilm Inhibitors.

PubMed

Bucher, Tabitha; Kartvelishvily, Elena; Kolodkin-Gal, Ilana

2016-10-09

This work assesses different methodologies to study the impact of small molecule biofilm inhibitors, such as D-amino acids, on the development and resilience of Bacillus subtilis biofilms. First, methods are presented that select for small molecule inhibitors with biofilm-specific targets in order to separate the effect of the small molecule inhibitors on planktonic growth from their effect on biofilm formation. Next, we focus on how inoculation conditions affect the sensitivity of multicellular, floating B. subtilis cultures to small molecule inhibitors. The results suggest that discrepancies in the reported effects of such inhibitors such as D-amino acids are due to inconsistent pre-culture conditions. Furthermore, a recently developed protocol is described for evaluating the contribution of small molecule treatments towards biofilm resistance to antibacterial substances. Lastly, scanning electron microscopy (SEM) techniques are presented to analyze the three-dimensional spatial arrangement of cells and their surrounding extracellular matrix in a B. subtilis biofilm. SEM facilitates insight into the three-dimensional biofilm architecture and the matrix texture. A combination of the methods described here can greatly assist the study of biofilm development in the presence and absence of biofilm inhibitors, and shed light on the mechanism of action of these inhibitors.
77 FR 36034 - Notice of Funding Availability for the Small Business Transportation Resource Center Program

Federal Register 2010, 2011, 2012, 2013, 2014

2012-06-15

... construct a database of regional small businesses that currently or may in the future participate in DOT direct and DOT funded transportation related contracts, and make this database available to OSDBU, upon request. 2. Utilize the database of regional transportation-related small businesses to match...
Real-Time Ligand Binding Pocket Database Search Using Local Surface Descriptors

PubMed Central

Chikhi, Rayan; Sael, Lee; Kihara, Daisuke

2010-01-01

Due to the increasing number of structures of unknown function accumulated by ongoing structural genomics projects, there is an urgent need for computational methods for characterizing protein tertiary structures. As functions of many of these proteins are not easily predicted by conventional sequence database searches, a legitimate strategy is to utilize structure information in function characterization. Of a particular interest is prediction of ligand binding to a protein, as ligand molecule recognition is a major part of molecular function of proteins. Predicting whether a ligand molecule binds a protein is a complex problem due to the physical nature of protein-ligand interactions and the flexibility of both binding sites and ligand molecules. However, geometric and physicochemical complementarity is observed between the ligand and its binding site in many cases. Therefore, ligand molecules which bind to a local surface site in a protein can be predicted by finding similar local pockets of known binding ligands in the structure database. Here, we present two representations of ligand binding pockets and utilize them for ligand binding prediction by pocket shape comparison. These representations are based on mapping of surface properties of binding pockets, which are compactly described either by the two dimensional pseudo-Zernike moments or the 3D Zernike descriptors. These compact representations allow a fast real-time pocket searching against a database. Thorough benchmark study employing two different datasets show that our representations are competitive with the other existing methods. Limitations and potentials of the shape-based methods as well as possible improvements are discussed. PMID:20455259
Real-time ligand binding pocket database search using local surface descriptors.

PubMed

Chikhi, Rayan; Sael, Lee; Kihara, Daisuke

2010-07-01

Because of the increasing number of structures of unknown function accumulated by ongoing structural genomics projects, there is an urgent need for computational methods for characterizing protein tertiary structures. As functions of many of these proteins are not easily predicted by conventional sequence database searches, a legitimate strategy is to utilize structure information in function characterization. Of particular interest is prediction of ligand binding to a protein, as ligand molecule recognition is a major part of molecular function of proteins. Predicting whether a ligand molecule binds a protein is a complex problem due to the physical nature of protein-ligand interactions and the flexibility of both binding sites and ligand molecules. However, geometric and physicochemical complementarity is observed between the ligand and its binding site in many cases. Therefore, ligand molecules which bind to a local surface site in a protein can be predicted by finding similar local pockets of known binding ligands in the structure database. Here, we present two representations of ligand binding pockets and utilize them for ligand binding prediction by pocket shape comparison. These representations are based on mapping of surface properties of binding pockets, which are compactly described either by the two-dimensional pseudo-Zernike moments or the three-dimensional Zernike descriptors. These compact representations allow a fast real-time pocket searching against a database. Thorough benchmark studies employing two different datasets show that our representations are competitive with the other existing methods. Limitations and potentials of the shape-based methods as well as possible improvements are discussed.
miRWalk--database: prediction of possible miRNA binding sites by "walking" the genes of three genomes.

PubMed

Dweep, Harsh; Sticht, Carsten; Pandey, Priyanka; Gretz, Norbert

2011-10-01

MicroRNAs are small, non-coding RNA molecules that can complementarily bind to the mRNA 3'-UTR region to regulate the gene expression by transcriptional repression or induction of mRNA degradation. Increasing evidence suggests a new mechanism by which miRNAs may regulate target gene expression by binding in promoter and amino acid coding regions. Most of the existing databases on miRNAs are restricted to mRNA 3'-UTR region. To address this issue, we present miRWalk, a comprehensive database on miRNAs, which hosts predicted as well as validated miRNA binding sites, information on all known genes of human, mouse and rat. All mRNAs, mitochondrial genes and 10 kb upstream flanking regions of all known genes of human, mouse and rat were analyzed by using a newly developed algorithm named 'miRWalk' as well as with eight already established programs for putative miRNA binding sites. An automated and extensive text-mining search was performed on PubMed database to extract validated information on miRNAs. Combined information was put into a MySQL database. miRWalk presents predicted and validated information on miRNA-target interaction. Such a resource enables researchers to validate new targets of miRNA not only on 3'-UTR, but also on the other regions of all known genes. The 'Validated Target module' is updated every month and the 'Predicted Target module' is updated every 6 months. miRWalk is freely available at http://mirwalk.uni-hd.de/. Copyright © 2011 Elsevier Inc. All rights reserved.
Design and Development of a Technology Platform for DNA-Encoded Library Production and Affinity Selection.

PubMed

Castañón, Jesús; Román, José Pablo; Jessop, Theodore C; de Blas, Jesús; Haro, Rubén

2018-06-01

DNA-encoded libraries (DELs) have emerged as an efficient and cost-effective drug discovery tool for the exploration and screening of very large chemical space using small-molecule collections of unprecedented size. Herein, we report an integrated automation and informatics system designed to enhance the quality, efficiency, and throughput of the production and affinity selection of these libraries. The platform is governed by software developed according to a database-centric architecture to ensure data consistency, integrity, and availability. Through its versatile protocol management functionalities, this application captures the wide diversity of experimental processes involved with DEL technology, keeps track of working protocols in the database, and uses them to command robotic liquid handlers for the synthesis of libraries. This approach provides full traceability of building-blocks and DNA tags in each split-and-pool cycle. Affinity selection experiments and high-throughput sequencing reads are also captured in the database, and the results are automatically deconvoluted and visualized in customizable representations. Researchers can compare results of different experiments and use machine learning methods to discover patterns in data. As of this writing, the platform has been validated through the generation and affinity selection of various libraries, and it has become the cornerstone of the DEL production effort at Lilly.
Drug development and nonclinical to clinical translational databases: past and current efforts.

PubMed

Monticello, Thomas M

2015-01-01

The International Consortium for Innovation and Quality (IQ) in Pharmaceutical Development is a science-focused organization of pharmaceutical and biotechnology companies. The mission of the Preclinical Safety Leadership Group (DruSafe) of the IQ is to advance science-based standards for nonclinical development of pharmaceutical products and to promote high-quality and effective nonclinical safety testing that can enable human risk assessment. DruSafe is creating an industry-wide database to determine the accuracy with which the interpretation of nonclinical safety assessments in animal models correctly predicts human risk in the early clinical development of biopharmaceuticals. This initiative aligns with the 2011 Food and Drug Administration strategic plan to advance regulatory science and modernize toxicology to enhance product safety. Although similar in concept to the initial industry-wide concordance data set conducted by International Life Sciences Institute's Health and Environmental Sciences Institute (HESI/ILSI), the DruSafe database will proactively track concordance, include exposure data and large and small molecules, and will continue to expand with longer duration nonclinical and clinical study comparisons. The output from this work will help identify actual human and animal adverse event data to define both the reliability and the potential limitations of nonclinical data and testing paradigms in predicting human safety in phase 1 clinical trials. © 2014 by The Author(s).
Causal biological network database: a comprehensive platform of causal biological network models focused on the pulmonary and vascular systems

PubMed Central

Boué, Stéphanie; Talikka, Marja; Westra, Jurjen Willem; Hayes, William; Di Fabio, Anselmo; Park, Jennifer; Schlage, Walter K.; Sewer, Alain; Fields, Brett; Ansari, Sam; Martin, Florian; Veljkovic, Emilija; Kenney, Renee; Peitsch, Manuel C.; Hoeng, Julia

2015-01-01

With the wealth of publications and data available, powerful and transparent computational approaches are required to represent measured data and scientific knowledge in a computable and searchable format. We developed a set of biological network models, scripted in the Biological Expression Language, that reflect causal signaling pathways across a wide range of biological processes, including cell fate, cell stress, cell proliferation, inflammation, tissue repair and angiogenesis in the pulmonary and cardiovascular context. This comprehensive collection of networks is now freely available to the scientific community in a centralized web-based repository, the Causal Biological Network database, which is composed of over 120 manually curated and well annotated biological network models and can be accessed at http://causalbionet.com. The website accesses a MongoDB, which stores all versions of the networks as JSON objects and allows users to search for genes, proteins, biological processes, small molecules and keywords in the network descriptions to retrieve biological networks of interest. The content of the networks can be visualized and browsed. Nodes and edges can be filtered and all supporting evidence for the edges can be browsed and is linked to the original articles in PubMed. Moreover, networks may be downloaded for further visualization and evaluation. Database URL: http://causalbionet.com PMID:25887162

IDAAPM: integrated database of ADMET and adverse effects of predictive modeling based on FDA approved drug data.

PubMed

Legehar, Ashenafi; Xhaard, Henri; Ghemtio, Leo

2016-01-01

The disposition of a pharmaceutical compound within an organism, i.e. its Absorption, Distribution, Metabolism, Excretion, Toxicity (ADMET) properties and adverse effects, critically affects late stage failure of drug candidates and has led to the withdrawal of approved drugs. Computational methods are effective approaches to reduce the number of safety issues by analyzing possible links between chemical structures and ADMET or adverse effects, but this is limited by the size, quality, and heterogeneity of the data available from individual sources. Thus, large, clean and integrated databases of approved drug data, associated with fast and efficient predictive tools are desirable early in the drug discovery process. We have built a relational database (IDAAPM) to integrate available approved drug data such as drug approval information, ADMET and adverse effects, chemical structures and molecular descriptors, targets, bioactivity and related references. The database has been coupled with a searchable web interface and modern data analytics platform (KNIME) to allow data access, data transformation, initial analysis and further predictive modeling. Data were extracted from FDA resources and supplemented from other publicly available databases. Currently, the database contains information regarding about 19,226 FDA approval applications for 31,815 products (small molecules and biologics) with their approval history, 2505 active ingredients, together with as many ADMET properties, 1629 molecular structures, 2.5 million adverse effects and 36,963 experimental drug-target bioactivity data. IDAAPM is a unique resource that, in a single relational database, provides detailed information on FDA approved drugs including their ADMET properties and adverse effects, the corresponding targets with bioactivity data, coupled with a data analytics platform. It can be used to perform basic to complex drug-target ADMET or adverse effects analysis and predictive modeling. IDAAPM is freely accessible at http://idaapm.helsinki.fi and can be exploited through a KNIME workflow connected to the database.Graphical abstractFDA approved drug data integration for predictive modeling.
Similar compounds searching system by using the gene expression microarray database.

PubMed

Toyoshiba, Hiroyoshi; Sawada, Hiroshi; Naeshiro, Ichiro; Horinouchi, Akira

2009-04-10

Numbers of microarrays have been examined and several public and commercial databases have been developed. However, it is not easy to compare in-house microarray data with those in a database because of insufficient reproducibility due to differences in the experimental conditions. As one of the approach to use these databases, we developed the similar compounds searching system (SCSS) on a toxicogenomics database. The datasets of 55 compounds administered to rats in the Toxicogenomics Project (TGP) database in Japan were used in this study. Using the fold-change ranking method developed by Lamb et al. [Lamb, J., Crawford, E.D., Peck, D., Modell, J.W., Blat, I.C., Wrobel, M.J., Lerner, J., Brunet, J.P., Subramanian, A., Ross, K.N., Reich, M., Hieronymus, H., Wei, G., Armstrong, S.A., Haggarty, S.J., Clemons, P.A., Wei, R., Carr, S.A., Lander, E.S., Golub, T.R., 2006. The connectivity map: using gene-expression signatures to connect small molecules, genes, and disease. Science 313, 1929-1935] and criteria called hit ratio, the system let us compare in-house microarray data and those in the database. In-house generated data for clofibrate, phenobarbital, and a proprietary compound were tested to evaluate the performance of the SCSS method. Phenobarbital and clofibrate, which were included in the TGP database, scored highest by the SCSS method. Other high scoring compounds had effects similar to either phenobarbital (a cytochrome P450s inducer) or clofibrate (a peroxisome proliferator). Some of high scoring compounds identified using the proprietary compound-administered rats have been known to cause similar toxicological changes in different species. Our results suggest that the SCSS method could be used in drug discovery and development. Moreover, this method may be a powerful tool to understand the mechanisms by which biological systems respond to various chemical compounds and may also predict adverse effects of new compounds.
Electronic Structure of Small Lanthanide Containing Molecules

NASA Astrophysics Data System (ADS)

Kafader, Jared O.; Ray, Manisha; Topolski, Josey E.; Chick Jarrold, Caroline

2016-06-01

Lanthanide-based materials have unusual electronic properties because of the high number of electronic degrees of freedom arising from partial occupation of 4f orbitals, which make these materials optimal for their utilization in many applications including electronics and catalysis. Electronic spectroscopy of small lanthanide molecules helps us understand the role of these 4f electrons, which are generally considered core-like because of orbital contraction, but are energetically similar to valence electrons. The spectroscopy of small lanthanide-containing molecules is relatively unexplored and to broaden this understanding we have completed the characterization of small cerium, praseodymium, and europium molecules using photoelectron spectroscopy coupled with DFT calculations. The characterization of PrO, EuH, EuO/EuOH, and CexOy molecules have allowed for the determination of their electron affinity, the assignment of numerous anion to neutral state transitions, modeling of anion/neutral structures and electron orbital occupation.
System dynamics of subcellular transport.

PubMed

Chen, Vivien Y; Khersonsky, Sonya M; Shedden, Kerby; Chang, Young Tae; Rosania, Gus R

2004-01-01

In pharmacokinetic experiments, interpretations often hinge on treating cells as a "black box": a single, lumped compartment or boundary. Here, a combinatorial library of fluorescent small molecules was used to visualize subcellular transport pathways in living cells, using a kinetic, high content imaging system to monitor spatiotemporal variations of intracellular probe distribution. Most probes accumulate in cytoplasmic vesicles and probe kinetics conform to a nested, two-compartment dynamical system. At steady state, probes preferentially partition from the extracellular medium to the cytosol, and from the cytosol to cytoplasmic vesicles, with hydrophobic molecules favoring sequestration. Altogether, these results point to a general organizing principle underlying the system dynamics of subcellular, small molecule transport. In addition to plasma membrane permeability, subcellular transport phenomena can determine the active concentration of small molecules in the cytosol and the efflux of small molecules from cells. Fundamentally, direct observation of intracellular probe distribution challenges the simple boundary model of classical pharmacokinetics, which considers cells as static permeability barriers.
Cloning and analysis of fetal ovary microRNAs in cattle.

PubMed

Tripurani, Swamy K; Xiao, Caide; Salem, Mohamed; Yao, Jianbo

2010-07-01

Ovarian folliculogenesis and early embryogenesis are complex processes, which require tightly regulated expression and interaction of a multitude of genes. Small endogenous RNA molecules, termed microRNAs (miRNAs), are involved in the regulation of gene expression during folliculogenesis and early embryonic development. To identify miRNAs in bovine oocytes/ovaries, a bovine fetal ovary miRNA library was constructed. Sequence analysis of random clones from the library identified 679 miRNA sequences, which represent 58 distinct bovine miRNAs. Of these distinct miRNAs, 42 are known bovine miRNAs present in the miRBase database and the remaining 16 miRNAs include 15 new bovine miRNAs that are homologous to miRNAs identified in other species, and one novel miRNA, which does not match any miRNAs in the database. The precursor sequences for 14 of the new 15 miRNAs as well as the novel miRNA were identified from the bovine genome database and their hairpin structures were predicted. Expression analysis of the 58 miRNAs in fetal ovaries in comparison to somatic tissue pools identified 8 miRNAs predominantly expressed in fetal ovaries. Further analysis of the eight miRNAs in germinal vesicle (GV) stage oocytes identified two miRNAs (bta-mir424 and bta-mir-10b), that are highly abundant in GV oocytes. Both miRNAs show similar expression patterns during oocyte maturation and preimplantation development of bovine embryos, being abundant in GV and MII stage oocytes, as well as in early stage embryos (until 16-cell stage). The amount of the novel miRNA is relatively small in oocytes and early cleavage embryos but greater in blastocysts, suggesting a role of this miRNA in blastocyst cell differentiation. Copyright 2010 Elsevier B.V. All rights reserved.
Activation of Polymine Catabolism as a Novel Strategy for Treating and/or Preventing Human Prostate Cancer

DTIC Science & Technology

2006-03-01

strategy against prostate cancer and thus, worthy of small molecule discovery and development. On the basis of findings obtained over the past 3...support for the discovery and development of specific small molecule inducers of SSAT as a novel therapeutic strategy targeting prostate cancer. This...D. Unscheduled Findings. Findings under Tasks 1 and 3 provided genetic evidence for the discovery and development of small molecule inducers of
A-D-A small molecules for solution-processed organic photovoltaic cells.

PubMed

Ni, Wang; Wan, Xiangjian; Li, Miaomiao; Wang, Yunchuang; Chen, Yongsheng

2015-03-25

A-D-A small molecules have drawn more and more attention in solution-processed organic solar cells due to the advantages of a diversity of structures, easy control of energy levels, etc. Recently, a power conversion efficiency of nearly 10% has been achieved through careful material design and device optimization. This feature article reviews recent representative progress in the design and application of A-D-A small molecules in organic photovoltaic cells.
Delivery of small molecules for bone regenerative engineering: preclinical studies and potential clinical applications

PubMed Central

Laurencin, Cato T.; Ashe, Keshia M.; Henry, Nicole; Kan, Ho Man; Lo, Kevin W-H.

2014-01-01

Stimulation of bone regeneration using growth factors is a promising approach for musculoskeletal regenerative engineering. Common limitations with protein growth factors are high manufacturing costs, protein instability, contamination issues, and unwanted immunogenic responses of the host. New strategies for bone regeneration that obviate these problems can have a significant impact on the treatment of skeletal injury and diseases. Over the past decade, a large number of small molecules with the potential of regenerating skeletal tissue have been reported in the literature. Here, we review this literature, paying specific attention to the prospects for small molecule-based bone-regenerative engineering. We also review the preclinical study of small molecules associated with bone regeneration. PMID:24508820
Cobalt coated substrate for matrix-free analysis of small molecules by laser desorption/ionization mass spectrometry

NASA Astrophysics Data System (ADS)

Yalcin, Talat; Li, Liang

2009-12-01

Small molecule analysis is one of the most challenging issues in matrix-assisted laser desorption/ionization (MALDI) mass spectrometry. We have developed a cobalt coated substrate as a target for matrix-free analysis of small molecules in laser desorption/ionization mass spectrometry. Cobalt coating of 60-70 nm thickness has been characterized by scanning electron microscopy, energy dispersive X-ray analysis, X-ray diffraction, and laser induced breakdown spectroscopy. This target facilitates hundreds of samples to be spotted and analyzed without mixing any matrices, in a very short time. This can save a lot of time and money and can be a very practical approach for the analysis of small molecules by laser desorption/ionization mass spectrometry.
Combinatorics of feedback in cellular uptake and metabolism of small molecules.

PubMed

Krishna, Sandeep; Semsey, Szabolcs; Sneppen, Kim

2007-12-26

We analyze the connection between structure and function for regulatory motifs associated with cellular uptake and usage of small molecules. Based on the boolean logic of the feedback we suggest four classes: the socialist, consumer, fashion, and collector motifs. We find that the socialist motif is good for homeostasis of a useful but potentially poisonous molecule, whereas the consumer motif is optimal for nutrition molecules. Accordingly, examples of these motifs are found in, respectively, the iron homeostasis system in various organisms and in the uptake of sugar molecules in bacteria. The remaining two motifs have no obvious analogs in small molecule regulation, but we illustrate their behavior using analogies to fashion and obesity. These extreme motifs could inspire construction of synthetic systems that exhibit bistable, history-dependent states, and homeostasis of flux (rather than concentration).
An algorithm to identify functional groups in organic molecules.

PubMed

Ertl, Peter

2017-06-07

The concept of functional groups forms a basis of organic chemistry, medicinal chemistry, toxicity assessment, spectroscopy and also chemical nomenclature. All current software systems to identify functional groups are based on a predefined list of substructures. We are not aware of any program that can identify all functional groups in a molecule automatically. The algorithm presented in this article is an attempt to solve this scientific challenge. An algorithm to identify functional groups in a molecule based on iterative marching through its atoms is described. The procedure is illustrated by extracting functional groups from the bioactive portion of the ChEMBL database, resulting in identification of 3080 unique functional groups. A new algorithm to identify all functional groups in organic molecules is presented. The algorithm is relatively simple and full details with examples are provided, therefore implementation in any cheminformatics toolkit should be relatively easy. The new method allows the analysis of functional groups in large chemical databases in a way that was not possible using previous approaches. Graphical abstract .
QSAR modeling and chemical space analysis of antimalarial compounds

NASA Astrophysics Data System (ADS)

Sidorov, Pavel; Viira, Birgit; Davioud-Charvet, Elisabeth; Maran, Uko; Marcou, Gilles; Horvath, Dragos; Varnek, Alexandre

2017-05-01

Generative topographic mapping (GTM) has been used to visualize and analyze the chemical space of antimalarial compounds as well as to build predictive models linking structure of molecules with their antimalarial activity. For this, a database, including 3000 molecules tested in one or several of 17 anti- Plasmodium activity assessment protocols, has been compiled by assembling experimental data from in-house and ChEMBL databases. GTM classification models built on subsets corresponding to individual bioassays perform similarly to the earlier reported SVM models. Zones preferentially populated by active and inactive molecules, respectively, clearly emerge in the class landscapes supported by the GTM model. Their analysis resulted in identification of privileged structural motifs of potential antimalarial compounds. Projection of marketed antimalarial drugs on this map allowed us to delineate several areas in the chemical space corresponding to different mechanisms of antimalarial activity. This helped us to make a suggestion about the mode of action of the molecules populating these zones.
QSAR modeling and chemical space analysis of antimalarial compounds.

PubMed

Sidorov, Pavel; Viira, Birgit; Davioud-Charvet, Elisabeth; Maran, Uko; Marcou, Gilles; Horvath, Dragos; Varnek, Alexandre

2017-05-01

Generative topographic mapping (GTM) has been used to visualize and analyze the chemical space of antimalarial compounds as well as to build predictive models linking structure of molecules with their antimalarial activity. For this, a database, including ~3000 molecules tested in one or several of 17 anti-Plasmodium activity assessment protocols, has been compiled by assembling experimental data from in-house and ChEMBL databases. GTM classification models built on subsets corresponding to individual bioassays perform similarly to the earlier reported SVM models. Zones preferentially populated by active and inactive molecules, respectively, clearly emerge in the class landscapes supported by the GTM model. Their analysis resulted in identification of privileged structural motifs of potential antimalarial compounds. Projection of marketed antimalarial drugs on this map allowed us to delineate several areas in the chemical space corresponding to different mechanisms of antimalarial activity. This helped us to make a suggestion about the mode of action of the molecules populating these zones.
Parametrization of an Orbital-Based Linear-Scaling Quantum Force Field for Noncovalent Interactions

PubMed Central

2015-01-01

We parametrize a linear-scaling quantum mechanical force field called mDC for the accurate reproduction of nonbonded interactions. We provide a new benchmark database of accurate ab initio interactions between sulfur-containing molecules. A variety of nonbond databases are used to compare the new mDC method with other semiempirical, molecular mechanical, ab initio, and combined semiempirical quantum mechanical/molecular mechanical methods. It is shown that the molecular mechanical force field significantly and consistently reproduces the benchmark results with greater accuracy than the semiempirical models and our mDC model produces errors twice as small as the molecular mechanical force field. The comparisons between the methods are extended to the docking of drug candidates to the Cyclin-Dependent Kinase 2 protein receptor. We correlate the protein–ligand binding energies to their experimental inhibition constants and find that the mDC produces the best correlation. Condensed phase simulation of mDC water is performed and shown to produce O–O radial distribution functions similar to TIP4P-EW. PMID:24803856
SmallSat Database

NASA Technical Reports Server (NTRS)

Petropulos, Dolores; Bittner, David; Murawski, Robert; Golden, Bert

2015-01-01

The SmallSat has an unrealized potential in both the private industry and in the federal government. Currently over 70 companies, 50 universities and 17 governmental agencies are involved in SmallSat research and development. In 1994, the U.S. Army Missile and Defense mapped the moon using smallSat imagery. Since then Smart Phones have introduced this imagery to the people of the world as diverse industries watched this trend. The deployment cost of smallSats is also greatly reduced compared to traditional satellites due to the fact that multiple units can be deployed in a single mission. Imaging payloads have become more sophisticated, smaller and lighter. In addition, the growth of small technology obtained from private industries has led to the more widespread use of smallSats. This includes greater revisit rates in imagery, significantly lower costs, the ability to update technology more frequently and the ability to decrease vulnerability of enemy attacks. The popularity of smallSats show a changing mentality in this fast paced world of tomorrow. What impact has this created on the NASA communication networks now and in future years? In this project, we are developing the SmallSat Relational Database which can support a simulation of smallSats within the NASA SCaN Compatability Environment for Networks and Integrated Communications (SCENIC) Modeling and Simulation Lab. The NASA Space Communications and Networks (SCaN) Program can use this modeling to project required network support needs in the next 10 to 15 years. The SmallSat Rational Database could model smallSats just as the other SCaN databases model the more traditional larger satellites, with a few exceptions. One being that the smallSat Database is designed to be built-to-order. The SmallSat database holds various hardware configurations that can be used to model a smallSat. It will require significant effort to develop as the research material can only be populated by hand to obtain the unique data required. When completed it will interface with the SCENIC environment to allow modeling of smallSats. The SmallSat Relational Database can also be integrated with the SCENIC Simulation modeling system that is currently in development. The SmallSat Relational Database simulation will be of great significance in assisting the NASA SCaN group to understand the impact the smallSats have made which have populated the lower orbit around our mother earth. What I have created and worked on this summer session 2015, is the basis for a tool that will be of value to the NASA SCaN SCENIC Simulation Environment for years to come.
Hydrocarbon Spectral Database

National Institute of Standards and Technology Data Gateway

SRD 115 Hydrocarbon Spectral Database (Web, free access) All of the rotational spectral lines observed and reported in the open literature for 91 hydrocarbon molecules have been tabulated. The isotopic molecular species, assigned quantum numbers, observed frequency, estimated measurement uncertainty and reference are given for each transition reported.
Diatomic Spectral Database

National Institute of Standards and Technology Data Gateway

SRD 114 Diatomic Spectral Database (Web, free access) All of the rotational spectral lines observed and reported in the open literature for 121 diatomic molecules have been tabulated. The isotopic molecular species, assigned quantum numbers, observed frequency, estimated measurement uncertainty, and reference are given for each transition reported.
Triatomic Spectral Database

National Institute of Standards and Technology Data Gateway

SRD 117 Triatomic Spectral Database (Web, free access) All of the rotational spectral lines observed and reported in the open literature for 55 triatomic molecules have been tabulated. The isotopic molecular species, assigned quantum numbers, observed frequency, estimated measurement uncertainty and reference are given for each transition reported.
Atmospheric Precipitations, Hailstone and Rainwater, as a Novel Source of Streptomyces Producing Bioactive Natural Products.

PubMed

Sarmiento-Vizcaíno, Aida; Espadas, Julia; Martín, Jesús; Braña, Alfredo F; Reyes, Fernando; García, Luis A; Blanco, Gloria

2018-01-01

A cultivation-dependent approach revealed that highly diverse populations of Streptomyces were present in atmospheric precipitations from a hailstorm event sampled in February 2016 in the Cantabrian Sea coast, North of Spain. A total of 29 bioactive Streptomyces strains isolated from small samples of hailstone and rainwater, collected from this hailstorm event, were studied here. Taxonomic identification by 16S rRNA sequencing revealed more than 20 different Streptomyces species, with their closest homologs displaying mainly oceanic but also terrestrial origins. Backward trajectory analysis revealed that the air-mass sources of the hailstorm event, with North Western winds, were originated in the Arctic Ocean (West Greenland and North Iceland) and Canada (Labrador), depending on the altitude. After traveling across the North Atlantic Ocean during 4 days the air mass reached Europe and precipitated as hailstone and rain water at the sampling place in Spain. The finding of Streptomyces species able to survive and disperse through the atmosphere increases our knowledge of the biogeography of genus Streptomyces on Earth, and reinforces our previous dispersion model, suggesting a generalized feature for the genus which could have been essential in his evolution. This unique atmospheric-derived Streptomyces collection was screened for production of bioactive secondary metabolites. Analyses of isolates ethyl acetate extracts by LC-UV-MS and further database comparison revealed an extraordinary diversity of bioactive natural products. One hundred molecules were identified, mostly displaying contrasted antibiotic and antitumor/cytotoxic activities, but also antiparasitic, antiviral, anti-inflammatory, neuroprotector, and insecticide properties. More interestingly, 38 molecules not identified in natural products databases might represent new natural products. Our results revealed for the first time an extraordinary diversity of Streptomyc es species in the atmosphere able to produce an extraordinary repertoire of bioactive molecules, thus providing a very promising source for the discovery of novel pharmaceutical natural products.
Atmospheric Precipitations, Hailstone and Rainwater, as a Novel Source of Streptomyces Producing Bioactive Natural Products

PubMed Central

Sarmiento-Vizcaíno, Aida; Espadas, Julia; Martín, Jesús; Braña, Alfredo F.; Reyes, Fernando; García, Luis A.; Blanco, Gloria

2018-01-01

A cultivation-dependent approach revealed that highly diverse populations of Streptomyces were present in atmospheric precipitations from a hailstorm event sampled in February 2016 in the Cantabrian Sea coast, North of Spain. A total of 29 bioactive Streptomyces strains isolated from small samples of hailstone and rainwater, collected from this hailstorm event, were studied here. Taxonomic identification by 16S rRNA sequencing revealed more than 20 different Streptomyces species, with their closest homologs displaying mainly oceanic but also terrestrial origins. Backward trajectory analysis revealed that the air-mass sources of the hailstorm event, with North Western winds, were originated in the Arctic Ocean (West Greenland and North Iceland) and Canada (Labrador), depending on the altitude. After traveling across the North Atlantic Ocean during 4 days the air mass reached Europe and precipitated as hailstone and rain water at the sampling place in Spain. The finding of Streptomyces species able to survive and disperse through the atmosphere increases our knowledge of the biogeography of genus Streptomyces on Earth, and reinforces our previous dispersion model, suggesting a generalized feature for the genus which could have been essential in his evolution. This unique atmospheric-derived Streptomyces collection was screened for production of bioactive secondary metabolites. Analyses of isolates ethyl acetate extracts by LC-UV-MS and further database comparison revealed an extraordinary diversity of bioactive natural products. One hundred molecules were identified, mostly displaying contrasted antibiotic and antitumor/cytotoxic activities, but also antiparasitic, antiviral, anti-inflammatory, neuroprotector, and insecticide properties. More interestingly, 38 molecules not identified in natural products databases might represent new natural products. Our results revealed for the first time an extraordinary diversity of Streptomyces species in the atmosphere able to produce an extraordinary repertoire of bioactive molecules, thus providing a very promising source for the discovery of novel pharmaceutical natural products. PMID:29740412

SCRIPDB: a portal for easy access to syntheses, chemicals and reactions in patents

PubMed Central

Heifets, Abraham; Jurisica, Igor

2012-01-01

The patent literature is a rich catalog of biologically relevant chemicals; many public and commercial molecular databases contain the structures disclosed in patent claims. However, patents are an equally rich source of metadata about bioactive molecules, including mechanism of action, disease class, homologous experimental series, structural alternatives, or the synthetic pathways used to produce molecules of interest. Unfortunately, this metadata is discarded when chemical structures are deposited separately in databases. SCRIPDB is a chemical structure database designed to make this metadata accessible. SCRIPDB provides the full original patent text, reactions and relationships described within any individual patent, in addition to the molecular files common to structural databases. We discuss how such information is valuable in medical text mining, chemical image analysis, reaction extraction and in silico pharmaceutical lead optimization. SCRIPDB may be searched by exact chemical structure, substructure or molecular similarity and the results may be restricted to patents describing synthetic routes. SCRIPDB is available at http://dcv.uhnres.utoronto.ca/SCRIPDB. PMID:22067445
Chemical Space: Big Data Challenge for Molecular Diversity.

PubMed

Awale, Mahendra; Visini, Ricardo; Probst, Daniel; Arús-Pous, Josep; Reymond, Jean-Louis

2017-10-25

Chemical space describes all possible molecules as well as multi-dimensional conceptual spaces representing the structural diversity of these molecules. Part of this chemical space is available in public databases ranging from thousands to billions of compounds. Exploiting these databases for drug discovery represents a typical big data problem limited by computational power, data storage and data access capacity. Here we review recent developments of our laboratory, including progress in the chemical universe databases (GDB) and the fragment subset FDB-17, tools for ligand-based virtual screening by nearest neighbor searches, such as our multi-fingerprint browser for the ZINC database to select purchasable screening compounds, and their application to discover potent and selective inhibitors for calcium channel TRPV6 and Aurora A kinase, the polypharmacology browser (PPB) for predicting off-target effects, and finally interactive 3D-chemical space visualization using our online tools WebDrugCS and WebMolCS. All resources described in this paper are available for public use at www.gdb.unibe.ch.
[Construction of chemical information database based on optical structure recognition technique].

PubMed

Lv, C Y; Li, M N; Zhang, L R; Liu, Z M

2018-04-18

To create a protocol that could be used to construct chemical information database from scientific literature quickly and automatically. Scientific literature, patents and technical reports from different chemical disciplines were collected and stored in PDF format as fundamental datasets. Chemical structures were transformed from published documents and images to machine-readable data by using the name conversion technology and optical structure recognition tool CLiDE. In the process of molecular structure information extraction, Markush structures were enumerated into well-defined monomer molecules by means of QueryTools in molecule editor ChemDraw. Document management software EndNote X8 was applied to acquire bibliographical references involving title, author, journal and year of publication. Text mining toolkit ChemDataExtractor was adopted to retrieve information that could be used to populate structured chemical database from figures, tables, and textual paragraphs. After this step, detailed manual revision and annotation were conducted in order to ensure the accuracy and completeness of the data. In addition to the literature data, computing simulation platform Pipeline Pilot 7.5 was utilized to calculate the physical and chemical properties and predict molecular attributes. Furthermore, open database ChEMBL was linked to fetch known bioactivities, such as indications and targets. After information extraction and data expansion, five separate metadata files were generated, including molecular structure data file, molecular information, bibliographical references, predictable attributes and known bioactivities. Canonical simplified molecular input line entry specification as primary key, metadata files were associated through common key nodes including molecular number and PDF number to construct an integrated chemical information database. A reasonable construction protocol of chemical information database was created successfully. A total of 174 research articles and 25 reviews published in Marine Drugs from January 2015 to June 2016 collected as essential data source, and an elementary marine natural product database named PKU-MNPD was built in accordance with this protocol, which contained 3 262 molecules and 19 821 records. This data aggregation protocol is of great help for the chemical information database construction in accuracy, comprehensiveness and efficiency based on original documents. The structured chemical information database can facilitate the access to medical intelligence and accelerate the transformation of scientific research achievements.
Chemical correction of pre-mRNA splicing defects associated with sequestration of muscleblind-like 1 protein by expanded r(CAG)-containing transcripts.

PubMed

Kumar, Amit; Parkesh, Raman; Sznajder, Lukasz J; Childs-Disney, Jessica L; Sobczak, Krzysztof; Disney, Matthew D

2012-03-16

Recently, it was reported that expanded r(CAG) triplet repeats (r(CAG)(exp)) associated with untreatable neurological diseases cause pre-mRNA mis-splicing likely due to sequestration of muscleblind-like 1 (MBNL1) splicing factor. Bioactive small molecules that bind the 5'CAG/3'GAC motif found in r(CAG)(exp) hairpin structure were identified by using RNA binding studies and virtual screening/chemical similarity searching. Specifically, a benzylguanidine-containing small molecule was found to improve pre-mRNA alternative splicing of MBNL1-sensitive exons in cells expressing the toxic r(CAG)(exp). The compound was identified by first studying the binding of RNA 1 × 1 nucleotide internal loops to small molecules known to have affinity for nucleic acids. Those studies identified 4',6-diamidino-2-phenylindole (DAPI) as a specific binder to RNAs with the 5'CAG/3'GAC motif. DAPI was then used as a query molecule in a shape- and chemistry alignment-based virtual screen to identify compounds with improved properties, which identified 4-guanidinophenyl 4-guanidinobenzoate, a small molecule that improves pre-mRNA splicing defects associated with the r(CAG)(exp)-MBNL1 complex. This compound may facilitate the development of therapeutics to treat diseases caused by r(CAG)(exp) and could serve as a useful chemical tool to dissect the mechanisms of r(CAG)(exp) toxicity. The approach used in these studies, defining the small RNA motifs that bind small molecules with known affinity for nucleic acids and then using virtual screening to optimize them for bioactivity, may be generally applicable for designing small molecules that target other RNAs in the human genomic sequence.
A Computational Investigation of Small-Molecule Engagement of Hot Spots at Protein-Protein Interaction Interfaces.

PubMed

Xu, David; Si, Yubing; Meroueh, Samy O

2017-09-25

The binding affinity of a protein-protein interaction is concentrated at amino acids known as hot spots. It has been suggested that small molecules disrupt protein-protein interactions by either (i) engaging receptor protein hot spots or (ii) mimicking hot spots of the protein ligand. Yet, no systematic studies have been done to explore how effectively existing small-molecule protein-protein interaction inhibitors mimic or engage hot spots at protein interfaces. Here, we employ explicit-solvent molecular dynamics simulations and end-point MM-GBSA free energy calculations to explore this question. We select 36 compounds for which high-quality binding affinity and cocrystal structures are available. Five complexes that belong to three classes of protein-protein interactions (primary, secondary, and tertiary) were considered, namely, BRD4•H4, XIAP•Smac, MDM2•p53, Bcl-xL•Bak, and IL-2•IL-2Rα. Computational alanine scanning using MM-GBSA identified hot-spot residues at the interface of these protein interactions. Decomposition energies compared the interaction of small molecules with individual receptor hot spots to those of the native protein ligand. Pharmacophore analysis was used to investigate how effectively small molecules mimic the position of hot spots of the protein ligand. Finally, we study whether small molecules mimic the effects of the native protein ligand on the receptor dynamics. Our results show that, in general, existing small-molecule inhibitors of protein-protein interactions do not optimally mimic protein-ligand hot spots, nor do they effectively engage protein receptor hot spots. The more effective use of hot spots in future drug design efforts may result in smaller compounds with higher ligand efficiencies that may lead to greater success in clinical trials.
Cancer Theranostic Nanoparticles Self-Assembled from Amphiphilic Small Molecules with Equilibrium Shift-Induced Renal Clearance

PubMed Central

Ma, Yuan; Mou, Quanbing; Sun, Mo; Yu, Chunyang; Li, Jianqi; Huang, Xiaohua; Zhu, Xinyuan; Yan, Deyue; Shen, Jian

2016-01-01

Nano drug delivery systems have emerged as promising candidates for cancer therapy, whereas their uncertainly complete elimination from the body within specific timescales restricts their clinical translation. Compared with hepatic clearance of nanoparticles, renal excretion of small molecules is preferred to minimize the agent-induced toxicity. Herein, we construct in vivo renal-clearable nanoparticles, which are self-assembled from amphiphilic small molecules holding the capabilities of magnetic resonance imaging (MRI) and chemotherapy. The assembled nanoparticles can accumulate in tumor tissues for their nano-characteristics, while the small molecules dismantled from the nanoparticles can be efficiently cleared by kidneys. The renal-clearable nanoparticles exhibit excellent tumor-inhibition performance as well as low side effects and negligible chronic toxicity. These results demonstrate a potential strategy for small molecular nano drug delivery systems with obvious anticancer effect and low-toxic metabolism pathway for clinical applications. PMID:27446502
Effects of endogenous small molecular compounds on the rheological properties, texture and microstructure of soymilk coagulum: Removal of phytate using ultrafiltration.

PubMed

Wang, Ruican; Guo, Shuntang

2016-11-15

This study aims to clarify the roles played by endogenous small molecular components in soymilk coagulation process and the properties of gels. Soymilk samples with decreasing levels of small molecules were prepared by ultrafiltration, to reduce the amount of phytate and salts. CaSO4-induced coagulation process was analyzed using rheological methods. Results showed that removal of free small molecules decreased the activation energy of protein coagulation, resulting in accelerated reaction and increased gel strength. However, too fast a reaction led to the drop in storage modulus (G'). Microscopic observation suggested that accelerated coagulation generated a coarse and non-uniform gel network with large pores. This network could not hold much water, leading to serious syneresis. Endogenous small molecules in soymilk were vital in the fine gel structure. Coagulation rate could be controlled by adjusting the amount of small molecules to obtain tofu products with the optimal texture. Copyright © 2016 Elsevier Ltd. All rights reserved.
Induction and reversal of myotonic dystrophy type 1 pre-mRNA splicing defects by small molecules.

PubMed

Childs-Disney, Jessica L; Stepniak-Konieczna, Ewa; Tran, Tuan; Yildirim, Ilyas; Park, HaJeung; Chen, Catherine Z; Hoskins, Jason; Southall, Noel; Marugan, Juan J; Patnaik, Samarjit; Zheng, Wei; Austin, Chris P; Schatz, George C; Sobczak, Krzysztof; Thornton, Charles A; Disney, Matthew D

2013-01-01

The ability to control pre-mRNA splicing with small molecules could facilitate the development of therapeutics or cell-based circuits that control gene function. Myotonic dystrophy type 1 is caused by the dysregulation of alternative pre-mRNA splicing due to sequestration of muscleblind-like 1 protein (MBNL1) by expanded, non-coding r(CUG) repeats (r(CUG)(exp)). Here we report two small molecules that induce or ameliorate alternative splicing dysregulation. A thiophene-containing small molecule (1) inhibits the interaction of MBNL1 with its natural pre-mRNA substrates. Compound (2), a substituted naphthyridine, binds r(CUG)(exp) and displaces MBNL1. Structural models show that 1 binds MBNL1 in the Zn-finger domain and that 2 interacts with UU loops in r(CUG)(exp). This study provides a structural framework for small molecules that target MBNL1 by mimicking r(CUG)(exp) and shows that targeting MBNL1 causes dysregulation of alternative splicing, suggesting that MBNL1 is thus not a suitable therapeutic target for the treatment of myotonic dystrophy type 1.
Induction and Reversal of Myotonic Dystrophy Type 1 Pre-mRNA Splicing Defects by Small Molecules

PubMed Central

Childs-Disney, Jessica L.; Stepniak-Konieczna, Ewa; Tran, Tuan; Yildirim, Ilyas; Park, HaJeung; Chen, Catherine Z.; Hoskins, Jason; Southall, Noel; Marugan, Juan J.; Patnaik, Samarjit; Zheng, Wei; Austin, Chris P.; Schatz, George C.; Sobczak, Krzysztof; Thornton, Charles A.; Disney, Matthew D.

2013-01-01

The ability to control pre-mRNA splicing with small molecules could facilitate the development of therapeutics or cell-based circuits that control gene function. Myotonic dystrophy type 1 (DM1) is caused by the dysregulation of alternative pre-mRNA splicing due to sequestration of muscleblind-like 1 protein (MBNL1) by expanded, non-coding r(CUG) repeats (r(CUG)exp). Here we report two small molecules that induce or ameliorate alternative splicing dysregulation. The thiophene-containing small molecule (1) inhibits the interaction of MBNL1 with its natural pre-mRNA substrates. Compound (2), a substituted naphthyridine, binds r(CUG)exp and displaces MBNL1. Structural models show that 1 binds MBNL1 in the Zn-finger domain and that 2 interacts with UU loops in r(CUG)exp. This study provides a structural framework for small molecules that target MBNL1 by mimicking r(CUG)exp and shows that targeting MBNL1 causes dysregulation of alternative splicing, suggesting that MBNL1 is thus not a suitable therapeutic target for the treatment of DM1. PMID:23806903
Selecting, Acquiring, and Using Small Molecule Libraries for High-Throughput Screening

PubMed Central

Dandapani, Sivaraman; Rosse, Gerard; Southall, Noel; Salvino, Joseph M.; Thomas, Craig J.

2015-01-01

The selection, acquisition and use of high quality small molecule libraries for screening is an essential aspect of drug discovery and chemical biology programs. Screening libraries continue to evolve as researchers gain a greater appreciation of the suitability of small molecules for specific biological targets, processes and environments. The decisions surrounding the make-up of any given small molecule library is informed by a multitude of variables and opinions vary on best-practices. The fitness of any collection relies upon upfront filtering to avoiding problematic compounds, assess appropriate physicochemical properties, install the ideal level of structural uniqueness and determine the desired extent of molecular complexity. These criteria are under constant evaluation and revision as academic and industrial organizations seek out collections that yield ever improving results from their screening portfolios. Practical questions including cost, compound management, screening sophistication and assay objective also play a significant role in the choice of library composition. This overview attempts to offer advice to all organizations engaged in small molecule screening based upon current best practices and theoretical considerations in library selection and acquisition. PMID:26705509
Selecting, Acquiring, and Using Small Molecule Libraries for High-Throughput Screening.

PubMed

Dandapani, Sivaraman; Rosse, Gerard; Southall, Noel; Salvino, Joseph M; Thomas, Craig J

The selection, acquisition and use of high quality small molecule libraries for screening is an essential aspect of drug discovery and chemical biology programs. Screening libraries continue to evolve as researchers gain a greater appreciation of the suitability of small molecules for specific biological targets, processes and environments. The decisions surrounding the make-up of any given small molecule library is informed by a multitude of variables and opinions vary on best-practices. The fitness of any collection relies upon upfront filtering to avoiding problematic compounds, assess appropriate physicochemical properties, install the ideal level of structural uniqueness and determine the desired extent of molecular complexity. These criteria are under constant evaluation and revision as academic and industrial organizations seek out collections that yield ever improving results from their screening portfolios. Practical questions including cost, compound management, screening sophistication and assay objective also play a significant role in the choice of library composition. This overview attempts to offer advice to all organizations engaged in small molecule screening based upon current best practices and theoretical considerations in library selection and acquisition.
Simulation-based cheminformatic analysis of organelle-targeted molecules: lysosomotropic monobasic amines.

PubMed

Zhang, Xinyuan; Zheng, Nan; Rosania, Gus R

2008-09-01

Cell-based molecular transport simulations are being developed to facilitate exploratory cheminformatic analysis of virtual libraries of small drug-like molecules. For this purpose, mathematical models of single cells are built from equations capturing the transport of small molecules across membranes. In turn, physicochemical properties of small molecules can be used as input to simulate intracellular drug distribution, through time. Here, with mathematical equations and biological parameters adjusted so as to mimic a leukocyte in the blood, simulations were performed to analyze steady state, relative accumulation of small molecules in lysosomes, mitochondria, and cytosol of this target cell, in the presence of a homogenous extracellular drug concentration. Similarly, with equations and parameters set to mimic an intestinal epithelial cell, simulations were also performed to analyze steady state, relative distribution and transcellular permeability in this non-target cell, in the presence of an apical-to-basolateral concentration gradient. With a test set of ninety-nine monobasic amines gathered from the scientific literature, simulation results helped analyze relationships between the chemical diversity of these molecules and their intracellular distributions.
Thermal Degradation of Small Molecules: A Global Metabolomic Investigation.

PubMed

Fang, Mingliang; Ivanisevic, Julijana; Benton, H Paul; Johnson, Caroline H; Patti, Gary J; Hoang, Linh T; Uritboonthai, Winnie; Kurczy, Michael E; Siuzdak, Gary

2015-11-03

Thermal processes are widely used in small molecule chemical analysis and metabolomics for derivatization, vaporization, chromatography, and ionization, especially in gas chromatography mass spectrometry (GC/MS). In this study the effect of heating was examined on a set of 64 small molecule standards and, separately, on human plasma metabolite extracts. The samples, either derivatized or underivatized, were heated at three different temperatures (60, 100, and 250 °C) at different exposure times (30 s, 60 s, and 300 s). All the samples were analyzed by liquid chromatography coupled to electrospray ionization mass spectrometry (LC/MS) and the data processed by XCMS Online ( xcmsonline.scripps.edu ). The results showed that heating at an elevated temperature of 100 °C had an appreciable effect on both the underivatized and derivatized molecules, and heating at 250 °C created substantial changes in the profile. For example, over 40% of the molecular peaks were altered in the plasma metabolite analysis after heating (250 °C, 300s) with a significant formation of degradation and transformation products. The analysis of 64 small molecule standards validated the temperature-induced changes observed on the plasma metabolites, where most of the small molecules degraded at elevated temperatures even after minimal exposure times (30 s). For example, tri- and diorganophosphates (e.g., adenosine triphosphate and adenosine diphosphate) were readily degraded into a mono-organophosphate (e.g., adenosine monophosphate) during heating. Nucleosides and nucleotides (e.g., inosine and inosine monophosphate) were also found to be transformed into purine derivatives (e.g., hypoxanthine). A newly formed transformation product, oleoyl ethyl amide, was identified in both the underivatized and derivatized forms of the plasma extracts and small molecule standard mixture, and was likely generated from oleic acid. Overall these analyses show that small molecules and metabolites undergo significant time-sensitive alterations when exposed to elevated temperatures, especially those conditions that mimic sample preparation and analysis in GC/MS experiments.
Protein Scaffolding for Small Molecule Catalysts

DOE Office of Scientific and Technical Information (OSTI.GOV)

Baker, David

We aim to design hybrid catalysts for energy production and storage that combine the high specificity, affinity, and tunability of proteins with the potent chemical reactivities of small organometallic molecules. The widely used Rosetta and RosettaDesign methodologies will be extended to model novel protein / small molecule catalysts in which one or many small molecule active centers are supported and coordinated by protein scaffolding. The promise of such hybrid molecular systems will be demonstrated with the nickel-phosphine hydrogenase of DuBois et. al.We will enhance the hydrogenase activity of the catalyst by designing protein scaffolds that incorporate proton relays and systematicallymore » modulate the local environment of the catalyticcenter. In collaboration with DuBois and Shaw, the designs will be experimentally synthesized and characterized.« less
SPLINTS: small-molecule protein ligand interface stabilizers.

PubMed

Fischer, Eric S; Park, Eunyoung; Eck, Michael J; Thomä, Nicolas H

2016-04-01

Regulatory protein-protein interactions are ubiquitous in biology, and small molecule protein-protein interaction inhibitors are an important focus in drug discovery. Remarkably little attention has been given to the opposite strategy-stabilization of protein-protein interactions, despite the fact that several well-known therapeutics act through this mechanism. From a structural perspective, we consider representative examples of small molecules that induce or stabilize the association of protein domains to inhibit, or alter, signaling for nuclear hormone, GTPase, kinase, phosphatase, and ubiquitin ligase pathways. These SPLINTS (small-molecule protein ligand interface stabilizers) drive interactions that are in some cases physiologically relevant, and in others entirely adventitious. The diverse structural mechanisms employed suggest approaches for a broader and systematic search for such compounds in drug discovery. Copyright © 2016 Elsevier Ltd. All rights reserved.
Selective small-molecule inhibitors as chemical tools to define the roles of matrix metalloproteinases in disease.

PubMed

Meisel, Jayda E; Chang, Mayland

2017-11-01

The focus of this article is to highlight novel inhibitors and current examples where the use of selective small-molecule inhibitors has been critical in defining the roles of matrix metalloproteinases (MMPs) in disease. Selective small-molecule inhibitors are surgical chemical tools that can inhibit the targeted enzyme; they are the method of choice to ascertain the roles of MMPs and complement studies with knockout animals. This strategy can identify targets for therapeutic development as exemplified by the use of selective small-molecule MMP inhibitors in diabetic wound healing, spinal cord injury, stroke, traumatic brain injury, cancer metastasis, and viral infection. This article is part of a Special Issue entitled: Matrix Metalloproteinases edited by Rafael Fridman. Copyright © 2017 Elsevier B.V. All rights reserved.
Visualization of molecular structures using HoloLens-based augmented reality

PubMed Central

Hoffman, MA; Provance, JB

2017-01-01

Biological molecules and biologically active small molecules are complex three dimensional structures. Current flat screen monitors are limited in their ability to convey the full three dimensional characteristics of these molecules. Augmented reality devices, including the Microsoft HoloLens, offer an immersive platform to change how we interact with molecular visualizations. We describe a process to incorporate the three dimensional structures of small molecules and complex proteins into the Microsoft HoloLens using aspirin and the human leukocyte antigen (HLA) as examples. Small molecular structures can be introduced into the HoloStudio application, which provides native support for rotating, resizing and performing other interactions with these molecules. Larger molecules can be imported through the Unity gaming development platform and then Microsoft Visual Developer. The processes described here can be modified to import a wide variety of molecular structures into augmented reality systems and improve our comprehension of complex structural features. PMID:28815109
Structures, electronic properties and reaction paths from Fe(CO)5 molecule to small Fe clusters

NASA Astrophysics Data System (ADS)

Li, Zhi; Zhao, Zhen

2018-04-01

The geometries, electrical characters and reaction paths from Fe(CO)5 molecule to small Fe clusters were investigated by using all-electron density functional theory. The results show that in the decomposition process of pentacarbonyl-iron, Fe(CO)5 molecule prefers to remove a carbon monoxide and adsorb another Fe(CO)5 molecule to produce nonacarbonyldiiron Fe2(CO)9 then Fe2(CO)9 gradually removes carbon monoxide to produce small Fe clusters. As It can be seen from the highest occupied molecule orbital-lowest unoccupied molecule orbital gap curves, the Fe(CO)n=3, and 5 and Fe2(CO)n=3, 7 and 9 intermediates have higher chemical stability than their neighbors. The local magnetic moment of the carbon monoxide is aligning anti-ferromagnetic. The effect of external magnetic field to the initial decomposition products of Fe(CO)5 can be ignored.
Directed Chemical Evolution with an Outsized Genetic Code

PubMed Central

Krusemark, Casey J.; Tilmans, Nicolas P.; Brown, Patrick O.; Harbury, Pehr B.

2016-01-01

The first demonstration that macromolecules could be evolved in a test tube was reported twenty-five years ago. That breakthrough meant that billions of years of chance discovery and refinement could be compressed into a few weeks, and provided a powerful tool that now dominates all aspects of protein engineering. A challenge has been to extend this scientific advance into synthetic chemical space: to enable the directed evolution of abiotic molecules. The problem has been tackled in many ways. These include expanding the natural genetic code to include unnatural amino acids, engineering polyketide and polypeptide synthases to produce novel products, and tagging combinatorial chemistry libraries with DNA. Importantly, there is still no small-molecule analog of directed protein evolution, i.e. a substantiated approach for optimizing complex (≥ 10^9 diversity) populations of synthetic small molecules over successive generations. We present a key advance towards this goal: a tool for genetically-programmed synthesis of small-molecule libraries from large chemical alphabets. The approach accommodates alphabets that are one to two orders of magnitude larger than any in Nature, and facilitates evolution within the chemical spaces they create. This is critical for small molecules, which are built up from numerous and highly varied chemical fragments. We report a proof-of-concept chemical evolution experiment utilizing an outsized genetic code, and demonstrate that fitness traits can be passed from an initial small-molecule population through to the great-grandchildren of that population. The results establish the practical feasibility of engineering synthetic small molecules through accelerated evolution. PMID:27508294
Metathesis depolymerizable surfactants

DOEpatents

Jamison, Gregory M [Albuquerque, NM; Wheeler, David R [Albuquerque, NM; Loy, Douglas A [Tucson, AZ; Simmons, Blake A [San Francisco, CA; Long, Timothy M [Evanston, IL; McElhanon, James R [Manteca, CA; Rahimian, Kamyar [Albuquerque, NM; Staiger, Chad L [Albuquerque, NM

2008-04-15

A class of surfactant molecules whose structure includes regularly spaced unsaturation in the tail group and thus, can be readily decomposed by ring-closing metathesis, and particularly by the action of a transition metal catalyst, to form small molecule products. These small molecules are designed to have increased volatility and/or enhanced solubility as compared to the original surfactant molecule and are thus easily removed by solvent extraction or vacuum extraction at low temperature. By producing easily removable decomposition products, the surfactant molecules become particularly desirable as template structures for preparing meso- and microstructural materials with tailored properties.

Concentration-related response potentiometric titrations to study the interaction of small molecules with large biomolecules.

PubMed

Hamidi-Asl, Ezat; Daems, Devin; De Wael, Karolien; Van Camp, Guy; Nagels, Luc J

2014-12-16

In the present paper, the utility of a special potentiometric titration approach for recognition and calculation of biomolecule/small-molecule interactions is reported. This approach is fast, sensitive, reproducible, and inexpensive in comparison to the other methods for the determination of the association constant values (Ka) and the interaction energies (ΔG). The potentiometric titration measurement is based on the use of a classical polymeric membrane indicator electrode in a solution of the small-molecule ligand. The biomolecule is used as a titrant. The potential is measured versus a reference electrode and transformed into a concentration-related signal over the entire concentration interval, also at low concentrations, where the millivolt (y-axis) versus log canalyte (x-axis) potentiometric calibration curve is not linear. In the procedure, Ka is calculated for the interaction of cocaine with a cocaine binding aptamer and with an anticocaine antibody. To study the selectivity and cross-reactivity, other oligonucleotides and aptamers are tested, as well as other small ligand molecules such as tetrakis(4-chlorophenyl)borate, metergoline, lidocaine, and bromhexine. The calculated Ka compared favorably to the value reported in the literature using surface plasmon resonance. The potentiometric titration approach called "concentration-related response potentiometry" is used to study molecular interaction for seven macromolecular target molecules and four small-molecule ligands.
Challenges and Opportunities for Small-Molecule Fluorescent Probes in Redox Biology Applications.

PubMed

Jiang, Xiqian; Wang, Lingfei; Carroll, Shaina L; Chen, Jianwei; Wang, Meng C; Wang, Jin

2018-02-16

The concentrations of reactive oxygen/nitrogen species (ROS/RNS) are critical to various biochemical processes. Small-molecule fluorescent probes have been widely used to detect and/or quantify ROS/RNS in many redox biology studies and serve as an important complementary to protein-based sensors with unique applications. Recent Advances: New sensing reactions have emerged in probe development, allowing more selective and quantitative detection of ROS/RNS, especially in live cells. Improvements have been made in sensing reactions, fluorophores, and bioavailability of probe molecules. In this review, we will not only summarize redox-related small-molecule fluorescent probes but also lay out the challenges of designing probes to help redox biologists independently evaluate the quality of reported small-molecule fluorescent probes, especially in the chemistry literature. We specifically highlight the advantages of reversibility in sensing reactions and its applications in ratiometric probe design for quantitative measurements in living cells. In addition, we compare the advantages and disadvantages of small-molecule probes and protein-based probes. The low physiological relevant concentrations of most ROS/RNS call for new sensing reactions with better selectivity, kinetics, and reversibility; fluorophores with high quantum yield, wide wavelength coverage, and Stokes shifts; and structural design with good aqueous solubility, membrane permeability, low protein interference, and organelle specificity. Antioxid. Redox Signal. 00, 000-000.
Design strategy for photoinduced electron transfer-based small-molecule fluorescent probes of biomacromolecules.

PubMed

Zhang, Wei; Ma, Zhao; Du, Lupei; Li, Minyong

2014-06-07

As the cardinal support of innumerable biological processes, biomacromolecules such as proteins, nucleic acids and polysaccharides are of importance to living systems. The key to understanding biological processes is to realize the role of these biomacromolecules in thte localization, distribution, conformation and interaction with other molecules. With the current development and adaptation of fluorescent technologies in biomedical and pharmaceutical fields, the fluorescence imaging (FLI) approach of using small-molecule fluorescent probes is becoming an up-to-the-minute method for the detection and monitoring of these imperative biomolecules in life sciences. However, conventional small-molecule fluorescent probes may provide undesirable results because of their intrinsic deficiencies such as low signal-to-noise ratio (SNR) and false-positive errors. Recently, small-molecule fluorescent probes with a photoinduced electron transfer (PET) "on/off" switch for biomacromolecules have been thoroughly considered. When recognized by the biomacromolecules, these probes turn on/off the PET switch and change the fluorescence intensity to present a high SNR result. It should be emphasized that these PET-based fluorescent probes could be advantageous for understanding the pathogenesis of various diseases caused by abnormal expression of biomacromolecules. The discussion of this successful strategy involved in this review will be a valuable guide for the further development of new PET-based small-molecule fluorescent probes for biomacromolecules.
Inhibition of HIF-2.alpha. heterodimerization with HIF1.beta. (ARNT)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bruick, Richard K.; Caldwell, Charles G.; Frantz, Doug E.

2017-09-12

Provided is a method of inhibiting heterodimerization of HIF-2.alpha. to HIF1.beta. (ARNT) comprising binding certain small molecules to the HIF-2.alpha. PAS-B domain cavity but not to HIF1.alpha. and inhibiting HIF-2.alpha. heterodimerization to HIF1.beta. (ARNT) but not inhibiting HIF1.alpha. heterodimerization to HIF1.beta. (ARNT). Those certain small molecules are also referenced synonymously as HIF2-HDI and HIF2.alpha. heterodimerization inhibitors and also simply as certain small molecules.
Fullerene-free small molecule organic solar cells with a high open circuit voltage of 1.15 V.

PubMed

Ni, Wang; Li, Miaomiao; Kan, Bin; Liu, Feng; Wan, Xiangjian; Zhang, Qian; Zhang, Hongtao; Russell, Thomas P; Chen, Yongsheng

2016-01-11

A new small molecule named DTBTF with thiobarbituric acid as a terminal group was designed and synthesized as an acceptor for organic photovoltaic applications. DTBTF exhibits strong absorption in the visible region, and a relatively high lying LUMO energy level (-3.62 eV). All-small-molecule organic solar cells based on DR3TSBDT:DTBTF blend films show a considerable PCE of 3.84% with a high V(oc) of 1.15 V.
A theoretical study in extracting the essential features and dynamics of molecular motions: Intrinsic geometry methods for PF(5) pseudorotations and statistical methods for argon clusters

NASA Astrophysics Data System (ADS)

Panahi, Nima S.

We studied the problem of understanding and computing the essential features and dynamics of molecular motions through the development of two theories for two different systems. First, we studied the process of the Berry Pseudorotation of PF5 and the rotations it induces in the molecule through its natural and intrinsic geometric nature by setting it in the language of fiber bundles and graph theory. With these tools, we successfully extracted the essentials of the process' loops and induced rotations. The infinite number of pseudorotation loops were broken down into a small set of essential loops called "super loops", with their intrinsic properties and link to the physical movements of the molecule extensively studied. In addition, only the three "self-edge loops" generated any induced rotations, and then only a finite number of classes of them. Second, we studied applying the statistical methods of Principal Components Analysis (PCA) and Principal Coordinate Analysis (PCO) to capture only the most important changes in Argon clusters so as to reduce computational costs and graph the potential energy surface (PES) in three dimensions respectively. Both methods proved successful, but PCA was only partially successful since one will only see advantages for PES database systems much larger than those both currently being studied and those that can be computationally studied in the next few decades to come. In addition, PCA is only needed for the very rare case of a PES database that does not already include Hessian eigenvalues.
Design of two-photon molecular tandem architectures for solar cells by ab initio theory

DOE PAGES

Ornso, Kristian B.; Garcia-Lastra, Juan M.; De La Torre, Gema; ...

2015-03-04

An extensive database of spectroscopic properties of molecules from ab initio calculations is used to design molecular complexes for use in tandem solar cells that convert two photons into a single electron–hole pair, thereby increasing the output voltage while covering a wider spectral range. Three different architectures are considered: the first two involve a complex consisting of two dye molecules with appropriately matched frontier orbitals, connected by a molecular diode. Optimized combinations of dye molecules are determined by taking advantage of our computational database of the structural and energetic properties of several thousand porphyrin dyes. The third design is amore » molecular analogy of the intermediate band solar cell, and involves a single dye molecule with strong intersystem crossing to ensure a long lifetime of the intermediate state. Based on the calculated energy levels and molecular orbitals, energy diagrams are presented for the individual steps in the operation of such tandem solar cells. We find that theoretical open circuit voltages of up to 1.8 V can be achieved using these tandem designs. Questions about the practical implementation of prototypical devices, such as the synthesis of the tandem molecules and potential loss mechanisms, are addressed.« less
A size selective porous silicon grating-coupled Bloch surface and sub-surface wave biosensor.

PubMed

Rodriguez, Gilberto A; Ryckman, Judson D; Jiao, Yang; Weiss, Sharon M

2014-03-15

A porous silicon (PSi) grating-coupled Bloch surface and sub-surface wave (BSW/BSSW) biosensor is demonstrated to size selectively detect the presence of both large and small molecules. The BSW is used to sense large immobilized analytes at the surface of the structure while the BSSW that is confined inside but near the top of the structure is used to sensitively detect small molecules. Functionality of the BSW and BSSW modes is theoretically described by dispersion relations, field confinements, and simulated refractive index shifts within the structure. The theoretical results are experimentally verified by detecting two different small chemical molecules and one large 40 base DNA oligonucleotide. The PSi-BSW/BSSW structure is benchmarked against current porous silicon technology and is shown to have a 6-fold higher sensitivity in detecting large molecules and a 33% improvement in detecting small molecules. This is the first report of a grating-coupled BSW biosensor and the first report of a BSSW propagating mode. © 2013 Published by Elsevier B.V.
[Innovative application of small molecules to influence -pathogenicity of dental plaque].

PubMed

Janus, M M; Volgenant, C M C; Krom, B P

2018-05-01

Current preventive measures against infectious oral diseases are mainly focussed on plaque removal and promoting a healthy lifestyle. This in vitro study investigated a third preventive method: maintaining healthy dental plaque with the use of small molecules. As a model of dental plaque, in vitro biofilms were cultivated under conditions that induce pathogenic characteristics. The effect of erythritol and other small molecules on the pathogenic characteristics and bacterial composition of the biofilm was evaluated. The artificial sweetener erythritol and the molecule 3-Oxo-N-(2-oxycyclohexyl)dodecanamide (3-Oxo-N) had no clinically relevant effect on total biofilm formation. Erythritol did, however, lower the gingivitis related protease activity of the biofilm, while 3-Oxo-N blocked the caries related lactic acid accumulation. Furthermore, both substances ensured the biofilm maintained a young, non-pathogenic microbial composition. This shows it is possible to influence the dental plaque in a positive manner in vitro with the help of small molecules. Further research is necessary before this manipulation of dental plaque can be applied.
Molecular scaffold analysis of natural products databases in the public domain.

PubMed

Yongye, Austin B; Waddell, Jacob; Medina-Franco, José L

2012-11-01

Natural products represent important sources of bioactive compounds in drug discovery efforts. In this work, we compiled five natural products databases available in the public domain and performed a comprehensive chemoinformatic analysis focused on the content and diversity of the scaffolds with an overview of the diversity based on molecular fingerprints. The natural products databases were compared with each other and with a set of molecules obtained from in-house combinatorial libraries, and with a general screening commercial library. It was found that publicly available natural products databases have different scaffold diversity. In contrast to the common concept that larger libraries have the largest scaffold diversity, the largest natural products collection analyzed in this work was not the most diverse. The general screening library showed, overall, the highest scaffold diversity. However, considering the most frequent scaffolds, the general reference library was the least diverse. In general, natural products databases in the public domain showed low molecule overlap. In addition to benzene and acyclic compounds, flavones, coumarins, and flavanones were identified as the most frequent molecular scaffolds across the different natural products collections. The results of this work have direct implications in the computational and experimental screening of natural product databases for drug discovery. © 2012 John Wiley & Sons A/S.
A Prospective Method to Guide Small Molecule Drug Design

ERIC Educational Resources Information Center

Johnson, Alan T.

2015-01-01

At present, small molecule drug design follows a retrospective path when considering what analogs are to be made around a current hit or lead molecule with the focus often on identifying a compound with higher intrinsic potency. What this approach overlooks is the simultaneous need to also improve the physicochemical (PC) and pharmacokinetic (PK)…
Demonstration of sub-femtomole sensitivity for small molecules with microsphere ring resonator sensors

NASA Astrophysics Data System (ADS)

White, Ian M.; Oveys, Hesam; Fan, Xudong

2006-02-01

Optical microsphere resonators can function as highly sensitive bio/chemical sensors due to the large Q-factor, which leads to high light-matter interaction. The whispering gallery modes (WGM) arise at the surface of the microsphere, creating a highly enhanced optical field that interacts with matter on or near the microsphere surface. As a result, the spectral position of the WGM is extremely sensitive to refractive index changes near the surface, such as when bio/chemical molecules bind to the sphere. We show the potential feasibility of a microsphere ring resonator as a sensor for small molecules by demonstrating detection of sub-femtomole changes in SiO II molecules at the surface of the microsphere. In this experiment, the silica molecules act as an excellent model for small molecule analytes because of their 60 Dalton molecular weight, and because we know nearly the exact quantity of molecules at the surface, which enables a sensitivity characterization. We measure the spectral shifts in the WGMs when low concentrations of hydrofluoric acid (HF) are added to a solution that is being probed by the microsphere. As the HF molecules break apart the SiO II molecules at the sphere surface, the WGMs shift due to the sub-nano-scale decrease in the size of the microsphere. These calculations show that the sensitivity of this microsphere resonator is on the order of 500 attomoles. Our results will lead to the utilization of optical microspheres for detection of trace quantities of small molecules for such applications as drug discovery, environmental monitoring, and enzyme detection using peptide cleavage.
Small Molecule Targeted Recruitment of a Nuclease to RNA.

PubMed

Costales, Matthew G; Matsumoto, Yasumasa; Velagapudi, Sai Pradeep; Disney, Matthew D

2018-06-06

The choreography between RNA synthesis and degradation is a key determinant in biology. Engineered systems such as CRISPR have been developed to rid a cell of RNAs. Here, we show that a small molecule can recruit a nuclease to a specific transcript, triggering its destruction. A small molecule that selectively binds the oncogenic microRNA(miR)-96 hairpin precursor was appended with a short 2'-5' poly(A) oligonucleotide. The conjugate locally activated endogenous, latent ribonuclease (RNase L), which selectively cleaved the miR-96 precursor in cancer cells in a catalytic and sub-stoichiometric fashion. Silencing miR-96 derepressed pro-apoptotic FOXO1 transcription factor, triggering apoptosis in breast cancer, but not healthy breast, cells. These results demonstrate that small molecules can be programmed to selectively cleave RNA via nuclease recruitment and has broad implications.
Small-molecule control of protein function through Staudinger reduction

NASA Astrophysics Data System (ADS)

Luo, Ji; Liu, Qingyang; Morihiro, Kunihiko; Deiters, Alexander

2016-11-01

Using small molecules to control the function of proteins in live cells with complete specificity is highly desirable, but challenging. Here we report a small-molecule switch that can be used to control protein activity. The approach uses a phosphine-mediated Staudinger reduction to activate protein function. Genetic encoding of an ortho-azidobenzyloxycarbonyl amino acid using a pyrrolysyl transfer RNA synthetase/tRNACUA pair in mammalian cells enables the site-specific introduction of a small-molecule-removable protecting group into the protein of interest. Strategic placement of this group renders the protein inactive until deprotection through a bioorthogonal Staudinger reduction delivers the active wild-type protein. This developed methodology was applied to the conditional control of several cellular processes, including bioluminescence (luciferase), fluorescence (enhanced green fluorescent protein), protein translocation (nuclear localization sequence), DNA recombination (Cre) and gene editing (Cas9).
RISC-Target Interaction: Cleavage and Translational Suppression

PubMed Central

van den Berg, Arjen; Mols, Johann; Han, Jiahuai

2008-01-01

Summary Small RNA molecules have been known and utilized to suppress gene expression for more than a decade. The discovery that these small RNA molecules are endogenously expressed in many organisms and have a critical role in controlling gene expression have led to the arising of a whole new field of research. Termed small interfering RNA (siRNA) or microRNA (miRNA) these ~22 nt RNA molecules have the capability to suppress gene expression through various mechanisms once they are incorporated in the multi-protein RNA-Induced Silencing Complex (RISC) and interact with their target mRNA. This review introduces siRNAs and microRNAs in a historical perspective and focuses on the key molecules in RISC, structural properties and mechanisms underlying the process of small RNA regulated post-transcriptional suppression of gene expression. PMID:18692607
Inkjet-Printed Small-Molecule Organic Light-Emitting Diodes: Halogen-Free Inks, Printing Optimization, and Large-Area Patterning.

PubMed

Zhou, Lu; Yang, Lei; Yu, Mengjie; Jiang, Yi; Liu, Cheng-Fang; Lai, Wen-Yong; Huang, Wei

2017-11-22

Manufacturing small-molecule organic light-emitting diodes (OLEDs) via inkjet printing is rather attractive for realizing high-efficiency and long-life-span devices, yet it is challenging. In this paper, we present our efforts on systematical investigation and optimization of the ink properties and the printing process to enable facile inkjet printing of conjugated light-emitting small molecules. Various factors on influencing the inkjet-printed film quality during the droplet generation, the ink spreading on the substrates, and its solidification processes have been systematically investigated and optimized. Consequently, halogen-free inks have been developed and large-area patterning inkjet printing on flexible substrates with efficient blue emission has been successfully demonstrated. Moreover, OLEDs manufactured by inkjet printing the light-emitting small molecules manifested superior performance as compared with their corresponding spin-cast counterparts.
Biased and unbiased strategies to identify biologically active small molecules.

PubMed

Abet, Valentina; Mariani, Angelica; Truscott, Fiona R; Britton, Sébastien; Rodriguez, Raphaël

2014-08-15

Small molecules are central players in chemical biology studies. They promote the perturbation of cellular processes underlying diseases and enable the identification of biological targets that can be validated for therapeutic intervention. Small molecules have been shown to accurately tune a single function of pluripotent proteins in a reversible manner with exceptional temporal resolution. The identification of molecular probes and drugs remains a worthy challenge that can be addressed by the use of biased and unbiased strategies. Hypothesis-driven methodologies employs a known biological target to synthesize complementary hits while discovery-driven strategies offer the additional means of identifying previously unanticipated biological targets. This review article provides a general overview of recent synthetic frameworks that gave rise to an impressive arsenal of biologically active small molecules with unprecedented cellular mechanisms. Copyright © 2014. Published by Elsevier Ltd.
Screening small-molecule compound microarrays for protein ligands without fluorescence labeling with a high-throughput scanning microscope.

PubMed

Fei, Yiyan; Landry, James P; Sun, Yungshin; Zhu, Xiangdong; Wang, Xiaobing; Luo, Juntao; Wu, Chun-Yi; Lam, Kit S

2010-01-01

We describe a high-throughput scanning optical microscope for detecting small-molecule compound microarrays on functionalized glass slides. It is based on measurements of oblique-incidence reflectivity difference and employs a combination of a y-scan galvometer mirror and an x-scan translation stage with an effective field of view of 2 cm x 4 cm. Such a field of view can accommodate a printed small-molecule compound microarray with as many as 10,000 to 20,000 targets. The scanning microscope is capable of measuring kinetics as well as endpoints of protein-ligand reactions simultaneously. We present the experimental results on solution-phase protein reactions with small-molecule compound microarrays synthesized from one-bead, one-compound combinatorial chemistry and immobilized on a streptavidin-functionalized glass slide.
Screening small-molecule compound microarrays for protein ligands without fluorescence labeling with a high-throughput scanning microscope

PubMed Central

Fei, Yiyan; Landry, James P.; Sun, Yungshin; Zhu, Xiangdong; Wang, Xiaobing; Luo, Juntao; Wu, Chun-Yi; Lam, Kit S.

2010-01-01

We describe a high-throughput scanning optical microscope for detecting small-molecule compound microarrays on functionalized glass slides. It is based on measurements of oblique-incidence reflectivity difference and employs a combination of a y-scan galvometer mirror and an x-scan translation stage with an effective field of view of 2 cm×4 cm. Such a field of view can accommodate a printed small-molecule compound microarray with as many as 10,000 to 20,000 targets. The scanning microscope is capable of measuring kinetics as well as endpoints of protein-ligand reactions simultaneously. We present the experimental results on solution-phase protein reactions with small-molecule compound microarrays synthesized from one-bead, one-compound combinatorial chemistry and immobilized on a streptavidin-functionalized glass slide. PMID:20210464
Synthetic Small Molecule Inhibitors of Hh Signaling As Anti-Cancer Chemotherapeutics

PubMed Central

Maschinot, C.A.; Pace, J.R.; Hadden, M.K.

2016-01-01

The hedgehog (Hh) pathway is a developmental signaling pathway that is essential to the proper embryonic development of many vertebrate systems. Dysregulation of Hh signaling has been implicated as a causative factor in the development and progression of several forms of human cancer. As such, the development of small molecule inhibitors of Hh signaling as potential anti-cancer chemotherapeutics has been a major area of research interest in both academics and industry over the past ten years. Through these efforts, synthetic small molecules that target multiple components of the Hh pathway have been identified and advanced to preclinical or clinical development. The goal of this review is to provide an update on the current status of several synthetic small molecule Hh pathway inhibitors and explore the potential of several recently disclosed inhibitory scaffolds. PMID:26310919

Simple re-instantiation of small databases using cloud computing.

PubMed

Tan, Tin Wee; Xie, Chao; De Silva, Mark; Lim, Kuan Siong; Patro, C Pawan K; Lim, Shen Jean; Govindarajan, Kunde Ramamoorthy; Tong, Joo Chuan; Choo, Khar Heng; Ranganathan, Shoba; Khan, Asif M

2013-01-01

Small bioinformatics databases, unlike institutionally funded large databases, are vulnerable to discontinuation and many reported in publications are no longer accessible. This leads to irreproducible scientific work and redundant effort, impeding the pace of scientific progress. We describe a Web-accessible system, available online at http://biodb100.apbionet.org, for archival and future on demand re-instantiation of small databases within minutes. Depositors can rebuild their databases by downloading a Linux live operating system (http://www.bioslax.com), preinstalled with bioinformatics and UNIX tools. The database and its dependencies can be compressed into an ".lzm" file for deposition. End-users can search for archived databases and activate them on dynamically re-instantiated BioSlax instances, run as virtual machines over the two popular full virtualization standard cloud-computing platforms, Xen Hypervisor or vSphere. The system is adaptable to increasing demand for disk storage or computational load and allows database developers to use the re-instantiated databases for integration and development of new databases. Herein, we demonstrate that a relatively inexpensive solution can be implemented for archival of bioinformatics databases and their rapid re-instantiation should the live databases disappear.
Simple re-instantiation of small databases using cloud computing

PubMed Central

2013-01-01

Background Small bioinformatics databases, unlike institutionally funded large databases, are vulnerable to discontinuation and many reported in publications are no longer accessible. This leads to irreproducible scientific work and redundant effort, impeding the pace of scientific progress. Results We describe a Web-accessible system, available online at http://biodb100.apbionet.org, for archival and future on demand re-instantiation of small databases within minutes. Depositors can rebuild their databases by downloading a Linux live operating system (http://www.bioslax.com), preinstalled with bioinformatics and UNIX tools. The database and its dependencies can be compressed into an ".lzm" file for deposition. End-users can search for archived databases and activate them on dynamically re-instantiated BioSlax instances, run as virtual machines over the two popular full virtualization standard cloud-computing platforms, Xen Hypervisor or vSphere. The system is adaptable to increasing demand for disk storage or computational load and allows database developers to use the re-instantiated databases for integration and development of new databases. Conclusions Herein, we demonstrate that a relatively inexpensive solution can be implemented for archival of bioinformatics databases and their rapid re-instantiation should the live databases disappear. PMID:24564380
Collisional excitation of molecules in dense interstellar clouds

NASA Technical Reports Server (NTRS)

Green, S.

1985-01-01

State transitions which permit the identification of the molecular species in dense interstellar clouds are reviewed, along with the techniques used to calculate the transition energies, the database on known molecular transitions and the accuracy of the values. The transition energies cannot be measured directly and therefore must be modeled analytically. Scattering theory is used to determine the intermolecular forces on the basis of quantum mechanics. The nuclear motions can also be modeled with classical mechanics. Sample rate constants are provided for molecular systems known to inhabit dense interstellar clouds. The values serve as a database for interpreting microwave and RF astrophysical data on the transitions undergone by interstellar molecules.
Discovery of non-peptidic small molecule inhibitors of cyclophilin D as neuroprotective agents in Aβ-induced mitochondrial dysfunction

NASA Astrophysics Data System (ADS)

Park, Insun; Londhe, Ashwini M.; Lim, Ji Woong; Park, Beoung-Geon; Jung, Seo Yun; Lee, Jae Yeol; Lim, Sang Min; No, Kyoung Tai; Lee, Jiyoun; Pae, Ae Nim

2017-10-01

Cyclophilin D (CypD) is a mitochondria-specific cyclophilin that is known to play a pivotal role in the formation of the mitochondrial permeability transition pore (mPTP).The formation and opening of the mPTP disrupt mitochondrial homeostasis, cause mitochondrial dysfunction and eventually lead to cell death. Several recent studies have found that CypD promotes the formation of the mPTP upon binding to β amyloid (Aβ) peptides inside brain mitochondria, suggesting that neuronal CypD has a potential to be a promising therapeutic target for Alzheimer's disease (AD). In this study, we generated an energy-based pharmacophore model by using the crystal structure of CypD—cyclosporine A (CsA) complex and performed virtual screening of ChemDiv database, which yielded forty-five potential hit compounds with novel scaffolds. We further tested those compounds using mitochondrial functional assays in neuronal cells and identified fifteen compounds with excellent protective effects against Aβ-induced mitochondrial dysfunction. To validate whether these effects derived from binding to CypD, we performed surface plasmon resonance (SPR)—based direct binding assays with selected compounds and discovered compound 29 was found to have the equilibrium dissociation constants (KD) value of 88.2 nM. This binding affinity value and biological activity correspond well with our predicted binding mode. We believe that this study offers new insights into the rational design of small molecule CypD inhibitors, and provides a promising lead for future therapeutic development.
Structure Guided Chemical Modifications of Propylthiouracil Reveal Novel Small Molecule Inhibitors of Cytochrome b5 Reductase 3 That Increase Nitric Oxide Bioavailability*

PubMed Central

Rahaman, Md. Mizanur; Reinders, Fabio G.; Koes, David; Nguyen, Anh T.; Mutchler, Stephanie M.; Sparacino-Watkins, Courtney; Alvarez, Roger A.; Miller, Megan P.; Cheng, Dongmei; Chen, Bill B.; Jackson, Edwin K.; Camacho, Carlos J.; Straub, Adam C.

2015-01-01

NADH cytochrome b5 reductase 3 (CYB5R3) is critical for reductive reactions such as fatty acid elongation, cholesterol biosynthesis, drug metabolism, and methemoglobin reduction. Although the physiological and metabolic importance of CYB5R3 has been established in hepatocytes and erythrocytes, emerging investigations suggest that CYB5R3 is critical for nitric oxide signaling and vascular function. However, advancement toward fully understanding CYB5R3 function has been limited due to a lack of potent small molecule inhibitors. Because of this restriction, we modeled the binding mode of propylthiouracil, a weak inhibitor of CYB5R3 (IC50 = ∼275 μm), and used it as a guide to predict thiouracil-biased inhibitors from the set of commercially available compounds in the ZINC database. Using this approach, we validated two new potent derivatives of propylthiouracil, ZINC05626394 (IC50 = 10.81 μm) and ZINC39395747 (IC50 = 9.14 μm), both of which inhibit CYB5R3 activity in cultured cells. Moreover, we found that ZINC39395747 significantly increased NO bioavailability in renal vascular cells, augmented renal blood flow, and decreased systemic blood pressure in response to vasoconstrictors in spontaneously hypertensive rats. These compounds will serve as a new tool to examine the biological functions of CYB5R3 in physiology and disease and also as a platform for new drug development. PMID:26001785
Market Exclusivity Time for Top Selling Originator Drugs in Canada: A Cohort Study.

PubMed

Lexchin, Joel

2017-09-01

This study looks at market exclusivity time for the top selling originator drugs in Canada. Total sales for drugs without competition were also calculated. A list of the top selling originator drugs by dollar sales from 2009 to 2015 inclusive, except for 2010, was compiled along with their annual sales. Health Canada databases were used to extract the following information: generic name, date of Notice of Compliance (NOC, date of marketing authorization), whether the product was a small molecule drug or a biologic, and date of NOC for a generic or biosimilar. Market exclusivity time was calculated in days for drugs. A total of 121 drugs were identified. There were 96 small molecule drugs (63 with a generic competitor and 33 with no generic competitor) and 25 biologics (none with a biosimilar competitor). The 63 drugs with a competitor had a mean market exclusivity time of 4478 days (12.3 years) (95% CI 4159-4798). The 58 drugs without competition had total annual sales of Can$8.59 billion and were on the market for a median of 5357 days (14.7 years) (interquartile range 3291-6679) as of January 31, 2017. Top selling originator drugs in Canada have a considerably longer period of market exclusivity than the 8 to 10 years that the research-based pharmaceutical industry claims. Copyright © 2017 International Society for Pharmacoeconomics and Outcomes Research (ISPOR). Published by Elsevier Inc. All rights reserved.
Platelet Activating Factor Receptor Activation Improves siRNA Uptake and RNAi Responses in Well-differentiated Airway Epithelia.

PubMed

Krishnamurthy, Sateesh; Behlke, Mark A; Apicella, Michael A; McCray, Paul B; Davidson, Beverly L

2014-07-15

Well-differentiated human airway epithelia present formidable barriers to efficient siRNA delivery. We previously reported that treatment of airway epithelia with specific small molecules improves oligonucleotide uptake and facilitates RNAi responses. Here, we exploited the platelet activating factor receptor (PAFR) pathway, utilized by specific bacteria to transcytose into epithelia, as a trigger for internalization of Dicer-substrate siRNAs (DsiRNA). PAFR is a G-protein coupled receptor which can be engaged and activated by phosphorylcholine residues on the lipooligosaccharide (LOS) of nontypeable Haemophilus influenzae and the teichoic acid of Streptococcus pneumoniae as well as by its natural ligand, platelet activating factor (PAF). When well-differentiated airway epithelia were simultaneously treated with either nontypeable Haemophilus influenzae LOS or PAF and transduced with DsiRNA formulated with the peptide transductin, we observed silencing of both endogenous and exogenous targets. PAF receptor antagonists prevented LOS or PAF-assisted DsiRNA silencing, demonstrating that ligand engagement of PAFR is essential for this process. Additionally, PAF-assisted DsiRNA transfection decreased CFTR protein expression and function and reduced exogenous viral protein levels and titer in human airway epithelia. Treatment with spiperone, a small molecule identified using the Connectivity map database to correlate gene expression changes in response to drug treatment with those associated with PAFR stimulation, also induced silencing. These results suggest that the signaling pathway activated by PAFR binding can be manipulated to facilitate siRNA entry and function in difficult to transfect well-differentiated airway epithelial cells.
Design and synthesis of type-III mimetics of ShK toxin

NASA Astrophysics Data System (ADS)

Baell, Jonathan B.; Harvey, Andrew J.; Norton, Raymond S.

2002-04-01

ShK toxin is a structurally defined, 35-residue polypeptide which blocks the voltage-gated Kv1.3 potassium channel in T-lymphocytes and has been identified as a possible immunosuppressant. Our interest lies in the rational design and synthesis of type-III mimetics of protein and polypeptide structure and function. ShK toxin is a challenging target for mimetic design as its binding epitope consists of relatively weakly binding residues, some of which are discontinuous. We discuss here our investigations into the design and synthesis of 1st generation, small molecule mimetics of ShK toxin and highlight any principles relevant to the generic design of type-III mimetics of continuous and discontinuous binding epitopes. We complement our approach with attempted pharmacophore-based database mining.
Prediction, Detection, and Validation of Isotope Clusters in Mass Spectrometry Data

PubMed Central

Treutler, Hendrik; Neumann, Steffen

2016-01-01

Mass spectrometry is a key analytical platform for metabolomics. The precise quantification and identification of small molecules is a prerequisite for elucidating the metabolism and the detection, validation, and evaluation of isotope clusters in LC-MS data is important for this task. Here, we present an approach for the improved detection of isotope clusters using chemical prior knowledge and the validation of detected isotope clusters depending on the substance mass using database statistics. We find remarkable improvements regarding the number of detected isotope clusters and are able to predict the correct molecular formula in the top three ranks in 92% of the cases. We make our methodology freely available as part of the Bioconductor packages xcms version 1.50.0 and CAMERA version 1.30.0. PMID:27775610
HippDB: a database of readily targeted helical protein-protein interactions.

PubMed

Bergey, Christina M; Watkins, Andrew M; Arora, Paramjit S

2013-11-01

HippDB catalogs every protein-protein interaction whose structure is available in the Protein Data Bank and which exhibits one or more helices at the interface. The Web site accepts queries on variables such as helix length and sequence, and it provides computational alanine scanning and change in solvent-accessible surface area values for every interfacial residue. HippDB is intended to serve as a starting point for structure-based small molecule and peptidomimetic drug development. HippDB is freely available on the web at http://www.nyu.edu/projects/arora/hippdb. The Web site is implemented in PHP, MySQL and Apache. Source code freely available for download at http://code.google.com/p/helidb, implemented in Perl and supported on Linux. arora@nyu.edu.
New Small Molecule Agonists to the Thyrotropin Receptor

PubMed Central

Ali, M. Rejwan; Ma, Risheng; David, Martine; Morshed, Syed A.; Ohlmeyer, Michael; Felsenfeld, Dan P.; Lau, Zerlina; Mezei, Mihaly; Davies, Terry F.

2015-01-01

Background Novel small molecular ligands (SMLs) to the thyrotropin receptor (TSHR) have potential as improved molecular probes and as therapeutic agents for the treatment of thyroid dysfunction and thyroid cancer. Methods To identify novel SMLs to the TSHR, we developed a transcription-based luciferase-cAMP high-throughput screening system and we screened 48,224 compounds from a 100K library in duplicate. Results We obtained 62 hits using the cut-off criteria of the mean±three standard deviations above the baseline. Twenty molecules with the greatest activity were rescreened against the parent CHO-luciferase cell for nonspecific activation, and we selected two molecules (MS437 and MS438) with the highest potency for further study. These lead molecules demonstrated no detectible cross-reactivity with homologous receptors when tested against luteinizing hormone (LH)/human chorionic gonadotropin receptor and follicle stimulating hormone receptor–expressing cells. Molecule MS437 had a TSHR-stimulating potency with an EC50 of 13×10−8 M, and molecule MS438 had an EC50 of 5.3×10−8 M. The ability of these small molecule agonists to bind to the transmembrane domain of the receptor and initiate signal transduction was suggested by their activation of a chimeric receptor consisting of an LHR ectodomain and a TSHR transmembrane. Molecular modeling demonstrated that these molecules bound to residues S505 and E506 for MS438 and T501 for MS437 in the intrahelical region of transmembrane helix 3. We also examined the G protein activating ability of these molecules using CHO cells co-expressing TSHRs transfected with luciferase reporter vectors in order to measure Gsα, Gβγ, Gαq, and Gα12 activation quantitatively. The MS437 and MS438 molecules showed potent activation of Gsα, Gαq, and Gα12 similar to TSH, but neither the small molecule agonists nor TSH showed activation of the Gβγ pathway. The small molecules MS437 and MS438 also showed upregulation of thyroglobulin (Tg), sodium iodine symporter (NIS), and TSHR gene expression. Conclusions Pharmacokinetic analysis of MS437 and MS438 indicated their pharmacotherapeutic potential, and their intraperitoneal administration to normal female mice resulted in significantly increased serum thyroxine levels, which could be maintained by repeated treatments. These molecules can therefore serve as lead molecules for further development of powerful TSH agonists. PMID:25333622
Targeting RNA in mammalian systems with small molecules.

PubMed

Donlic, Anita; Hargrove, Amanda E

2018-05-03

The recognition of RNA functions beyond canonical protein synthesis has challenged the central dogma of molecular biology. Indeed, RNA is now known to directly regulate many important cellular processes, including transcription, splicing, translation, and epigenetic modifications. The misregulation of these processes in disease has led to an appreciation of RNA as a therapeutic target. This potential was first recognized in bacteria and viruses, but discoveries of new RNA classes following the sequencing of the human genome have invigorated exploration of its disease-related functions in mammals. As stable structure formation is evolving as a hallmark of mammalian RNAs, the prospect of utilizing small molecules to specifically probe the function of RNA structural domains and their interactions is gaining increased recognition. To date, researchers have discovered bioactive small molecules that modulate phenotypes by binding to expanded repeats, microRNAs, G-quadruplex structures, and RNA splice sites in neurological disorders, cancers, and other diseases. The lessons learned from achieving these successes both call for additional studies and encourage exploration of the plethora of mammalian RNAs whose precise mechanisms of action remain to be elucidated. Efforts toward understanding fundamental principles of small molecule-RNA recognition combined with advances in methodology development should pave the way toward targeting emerging RNA classes such as long noncoding RNAs. Together, these endeavors can unlock the full potential of small molecule-based probing of RNA-regulated processes and enable us to discover new biology and underexplored avenues for therapeutic intervention in human disease. This article is categorized under: RNA Methods > RNA Analyses In Vitro and In Silico RNA Interactions with Proteins and Other Molecules > Small Molecule-RNA Interactions RNA in Disease and Development > RNA in Disease. © 2018 Wiley Periodicals, Inc.
The HITRAN2016 molecular spectroscopic database

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gordon, I. E.; Rothman, L. S.; Hill, C.

This paper describes the contents of the 2016 edition of the HITRAN molecular spectroscopic compilation. The new edition replaces the previous HITRAN edition of 2012 and its updates during the intervening years. The HITRAN molecular absorption compilation is comprised of five major components: the traditional line-by-line spectroscopic parameters required for high-resolution radiative-transfer codes, infrared absorption cross-sections for molecules not yet amenable to representation in a line-by-line form, collision-induced absorption data, aerosol indices of refraction, and general tables such as partition sums that apply globally to the data. The new HITRAN is greatly extended in terms of accuracy, spectral coverage, additionalmore » absorption phenomena, added line-shape formalisms, and validity. Moreover, molecules, isotopologues, and perturbing gases have been added that address the issues of atmospheres beyond the Earth. Of considerable note, experimental IR cross-sections for almost 200 additional significant molecules have been added to the database.« less
PhAST: pharmacophore alignment search tool.

PubMed

Hähnke, Volker; Hofmann, Bettina; Grgat, Tomislav; Proschak, Ewgenij; Steinhilber, Dieter; Schneider, Gisbert

2009-04-15

We present a ligand-based virtual screening technique (PhAST) for rapid hit and lead structure searching in large compound databases. Molecules are represented as strings encoding the distribution of pharmacophoric features on the molecular graph. In contrast to other text-based methods using SMILES strings, we introduce a new form of text representation that describes the pharmacophore of molecules. This string representation opens the opportunity for revealing functional similarity between molecules by sequence alignment techniques in analogy to homology searching in protein or nucleic acid sequence databases. We favorably compared PhAST with other current ligand-based virtual screening methods in a retrospective analysis using the BEDROC metric. In a prospective application, PhAST identified two novel inhibitors of 5-lipoxygenase product formation with minimal experimental effort. This outcome demonstrates the applicability of PhAST to drug discovery projects and provides an innovative concept of sequence-based compound screening with substantial scaffold hopping potential. 2008 Wiley Periodicals, Inc.
Improving Photoconductance of Fluorinated Donors with Fluorinated Acceptors

DOE Office of Scientific and Technical Information (OSTI.GOV)

Garner, Logan E.; Larson, Bryon; Oosterhout, Stefan

2016-11-21

This work investigates the influence of fluorination of both donor and acceptor materials on the generation of free charge carriers in small molecule donor/fullerene acceptor BHJ OPV active layers. A fluorinated and non-fluorinated small molecule analogue were synthesized and their optoelectronic properties characterized. The intrinsic photoconductance of blends of these small molecule donors was investigated using time-resolved microwave conductivity. Blends of the two donor molecules with a traditional non-fluorinated fullerene (PC70BM) as well as a fluorinated fullerene (C60(CF3)2-1) were investigated using 5% and 50% fullerene loading. We demonstrate for the first time that photoconductance in a 50:50 donor:acceptor BHJ blendmore » using a fluorinated fullerene can actually be improved relative to a traditional non-fluorinated fullerene by fluorinating the donor molecule as well.« less
Comparison of small molecules and oligonucleotides that target a toxic, non-coding RNA.

PubMed

Costales, Matthew G; Rzuczek, Suzanne G; Disney, Matthew D

2016-06-01

Potential RNA targets for chemical probes and therapeutic modalities are pervasive in the transcriptome. Oligonucleotide-based therapeutics are commonly used to target RNA sequence. Small molecules are emerging as a modality to target RNA structures selectively, but their development is still in its infancy. In this work, we compare the activity of oligonucleotides and several classes of small molecules that target the non-coding r(CCUG) repeat expansion (r(CCUG)(exp)) that causes myotonic dystrophy type 2 (DM2), an incurable disease that is the second-most common cause of adult onset muscular dystrophy. Small molecule types investigated include monomers, dimers, and multivalent compounds synthesized on-site by using RNA-templated click chemistry. Oligonucleotides investigated include phosphorothioates that cleave their target and vivo-morpholinos that modulate target RNA activity via binding. We show that compounds assembled on-site that recognize structure have the highest potencies amongst small molecules and are similar in potency to a vivo-morpholino modified oligonucleotide that targets sequence. These studies are likely to impact the design of therapeutic modalities targeting other repeats expansions that cause fragile X syndrome and amyotrophic lateral sclerosis, for example. Copyright © 2016. Published by Elsevier Ltd.
In Situ Oxidation Synthesis of p-Type Composite with Narrow-Bandgap Small Organic Molecule Coating on Single-Walled Carbon Nanotube: Flexible Film and Thermoelectric Performance.

PubMed

Gao, Caiyan; Chen, Guangming

2018-03-01

Although composites of organic polymers or n-type small molecule/carbon nanotube (CNT) have achieved significant advances in thermoelectric (TE) applications, p-type TE composites of small organic molecules as thick surface coating layers on the surfaces of inorganic nanoparticles still remain a great challenge. Taking advantage of in situ oxidation reaction of thieno[3,4-b]pyrazine (TP) into TP di-N-oxide (TPNO) on single-walled CNT (SWCNT) surface, a novel synthesis strategy is proposed to achieve flexible films of TE composites with narrow-bandgap (1.19 eV) small molecule coating on SWCNT surface. The TE performance can be effectively enhanced and conveniently tuned by poly(sodium-p-styrenesulfonate) content, TPNO/SWCNT mass ratio, and posttreatment by various polar solvents. The maximum of the composite power factor at room temperature is 29.4 ± 1.0 µW m -1 K -2 . The work presents a way to achieve flexible films of p-type small organic molecule/inorganic composites with clear surface coating morphology for TE application. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Measurement of Small Molecular Dopant F4TCNQ and C 60F 36 Diffusion in Organic Bilayer Architectures

DOE Office of Scientific and Technical Information (OSTI.GOV)

Li, Jun; Rochester, Chris W.; Jacobs, Ian E.

2015-12-03

The diffusion of molecules through and between organic layers is a serious stability concern in organic electronic devices. In this paper, the temperature-dependent diffusion of molecular dopants through small molecule hole transport layers is observed. Specifically we investigate bilayer stacks of small molecules used for hole transport (MeO-TPD) and p-type dopants (F4TCNQ and C 60F 36) used in hole injection layers for organic light emitting diodes and hole collection electrodes for organic photovoltaics. With the use of absorbance spectroscopy, photoluminescence spectroscopy, neutron reflectometry, and near-edge X-ray absorption fine structure spectroscopy, we are able to obtain a comprehensive picture of themore » diffusion of fluorinated small molecules through MeO-TPD layers. F4TCNQ spontaneously diffuses into the MeO-TPD material even at room temperature, while C 60F 36, a much bulkier molecule, is shown to have a substantially higher morphological stability. Finally, this study highlights that the differences in size/geometry and thermal properties of small molecular dopants can have a significant impact on their diffusion in organic device architectures.« less
A semantic web ontology for small molecules and their biological targets.

PubMed

Choi, Jooyoung; Davis, Melissa J; Newman, Andrew F; Ragan, Mark A

2010-05-24

A wide range of data on sequences, structures, pathways, and networks of genes and gene products is available for hypothesis testing and discovery in biological and biomedical research. However, data describing the physical, chemical, and biological properties of small molecules have not been well-integrated with these resources. Semantically rich representations of chemical data, combined with Semantic Web technologies, have the potential to enable the integration of small molecule and biomolecular data resources, expanding the scope and power of biomedical and pharmacological research. We employed the Semantic Web technologies Resource Description Framework (RDF) and Web Ontology Language (OWL) to generate a Small Molecule Ontology (SMO) that represents concepts and provides unique identifiers for biologically relevant properties of small molecules and their interactions with biomolecules, such as proteins. We instanced SMO using data from three public data sources, i.e., DrugBank, PubChem and UniProt, and converted to RDF triples. Evaluation of SMO by use of predetermined competency questions implemented as SPARQL queries demonstrated that data from chemical and biomolecular data sources were effectively represented and that useful knowledge can be extracted. These results illustrate the potential of Semantic Web technologies in chemical, biological, and pharmacological research and in drug discovery.
DBDA as a Novel Matrix for the Analyses of Small Molecules and Quantification of Fatty Acids by Negative Ion MALDI-TOF MS.

PubMed

Ling, Ling; Li, Ying; Wang, Sheng; Guo, Liming; Xiao, Chunsheng; Chen, Xuesi; Guo, Xinhua

2018-04-01

Matrix interference ions in low mass range has always been a concern when using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) to analyze small molecules (<500 Da). In this work, a novel matrix, N1,N4-dibenzylidenebenzene-1,4-diamine (DBDA) was synthesized for the analyses of small molecules by negative ion MALDI-TOF MS. Notably, only neat ions ([M-H] - ) of fatty acids without matrix interference appeared in the mass spectra and the limit of detection (LOD) reached 0.3 fmol. DBDA also has great performance towards other small molecules such as amino acids, peptides, and nucleotide. Furthermore, with this novel matrix, the free fatty acids in serum were quantitatively analyzed based on the correlation curves with correlation coefficient of 0.99. In addition, UV-Vis experiments and molecular orbital calculations were performed to explore mechanism about DBDA used as matrix in the negative ion mode. The present work shows that the DBDA matrix is a highly sensitive matrix with few interference ions for analysis of small molecules. Meanwhile, DBDA is able to precisely quantify the fatty acids in real biological samples. Graphical Abstract ᅟ.

Exporters for Production of Amino Acids and Other Small Molecules.

PubMed

Eggeling, Lothar

Microbes are talented catalysts to synthesize valuable small molecules in their cytosol. However, to make full use of their skills - and that of metabolic engineers - the export of intracellularly synthesized molecules to the culture medium has to be considered. This step is as essential as is each step for the synthesis of the favorite molecule of the metabolic engineer, but is frequently not taken into account. To export small molecules via the microbial cell envelope, a range of different types of carrier proteins is recognized to be involved, which are primary active carriers, secondary active carriers, or proteins increasing diffusion. Relevant export may require just one carrier as is the case with L-lysine export by Corynebacterium glutamicum or involve up to four carriers as known for L-cysteine excretion by Escherichia coli. Meanwhile carriers for a number of small molecules of biotechnological interest are recognized, like for production of peptides, nucleosides, diamines, organic acids, or biofuels. In addition to carriers involved in amino acid excretion, such carriers and their impact on product formation are described, as well as the relatedness of export carriers which may serve as a hint to identify further carriers required to improve product formation by engineering export.
Reaction-based small-molecule fluorescent probes for dynamic detection of ROS and transient redox changes in living cells and small animals.

PubMed

Lü, Rui

2017-09-01

Dynamic detection of transient redox changes in living cells and animals has broad implications for human health and disease diagnosis, because intracellular redox homeostasis regulated by reactive oxygen species (ROS) plays important role in cell functions, normal physiological functions and some serious human diseases (e.g., cancer, Alzheimer's disease, diabetes, etc.) usually have close relationship with the intracellular redox status. Small-molecule ROS-responsive fluorescent probes can act as powerful tools for dynamic detection of ROS and redox changes in living cells and animals through fluorescence imaging techniques; and great advances have been achieved recently in the design and synthesis of small-molecule ROS-responsive fluorescent probes. This article highlights up-to-date achievements in designing and using the reaction-based small-molecule fluorescent probes (with high sensitivity and selectivity to ROS and redox cycles) in the dynamic detection of ROS and transient redox changes in living cells and animals through fluorescence imaging. Copyright © 2017. Published by Elsevier Ltd.
The Endoplasmic Reticulum Membrane Is Permeable to Small Molecules

PubMed Central

Le Gall, Sylvie; Neuhof, Andrea; Rapoport, Tom

2004-01-01

The lumen of the endoplasmic reticulum (ER) differs from the cytosol in its content of ions and other small molecules, but it is unclear whether the ER membrane is as impermeable as other membranes in the cell. Here, we have tested the permeability of the ER membrane to small, nonphysiological molecules. We report that isolated ER vesicles allow different chemical modification reagents to pass from the outside into the lumen with little hindrance. In permeabilized cells, the ER membrane allows the passage of a small, charged modification reagent that is unable to cross the plasma membrane or the lysosomal and trans-Golgi membranes. A larger polar reagent of ∼5 kDa is unable to pass through the ER membrane. Permeation of the small molecules is passive because it occurs at low temperature in the absence of energy. These data indicate that the ER membrane is significantly more leaky than other cellular membranes, a property that may be required for protein folding and other functions of the ER. PMID:14617815
Cucurbituril mediated single molecule detection and identification via recognition tunneling.

PubMed

Xiao, Bohuai; Liang, Feng; Liu, Simin; Im, JongOne; Li, Yunchuan; Liu, Jing; Zhang, Bintian; Zhou, Jianghao; He, Jin; Chang, Shuai

2018-06-08

Recognition tunneling (RT) is an emerging technique for investigating single molecules in a tunnel junction. We have previously demonstrated its capability of single molecule detection and identification, as well as probing the dynamics of intermolecular bonding at the single molecule level. Here by introducing cucurbituril as a new class of recognition molecule, we demonstrate a powerful platform for electronically investigating the host-guest chemistry at single molecule level. In this report, we first investigated the single molecule electrical properties of cucurbituril in a tunnel junction. Then we studied two model guest molecules, aminoferrocene and amantadine, which were encapsulated by cucurbituril. Small differences in conductance and lifetime can be recognized between the host-guest complexes with the inclusion of different guest molecules. By using a machine learning algorithm to classify the RT signals in a hyper dimensional space, the accuracy of guest molecule recognition can be significantly improved, suggesting the possibility of using cucurbituril molecule for single molecule identification. This work enables a new class of recognition molecule for RT technique and opens the door for detecting a vast variety of small molecules by electrical measurements.
Target gene screening and evaluation of prognostic values in non-small cell lung cancers by bioinformatics analysis.

PubMed

Piao, Junjie; Sun, Jie; Yang, Yang; Jin, Tiefeng; Chen, Liyan; Lin, Zhenhua

2018-03-20

Non-small cell lung cancer (NSCLC) is the major leading cause of cancer-related deaths worldwide. This study aims to explore molecular mechanism of NSCLC. Microarray dataset was obtained from the Gene Expression Omnibus (GEO) database, and analyzed by using GEO2R. Functional and pathway enrichment analysis were performed based on Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) database. Then, STRING, Cytoscape and MCODE were applied to construct the Protein-protein interaction (PPI) network and screen hub genes. Following, overall survival (OS) analysis of hub genes was performed by using the Kaplan-Meier plotter online tool. Moreover, miRecords was also applied to predict the targets of the differentially expressed microRNAs (DEMs). A total of 228 DEGs were identified, and they were mainly enriched in the terms of cell adhesion molecules, leukocyte transendothelial migration and ECM-receptor interaction. A PPI network was constructed, and 16 hub genes were identified, including TEK, ANGPT1, MMP9, VWF, CDH5, EDN1, ESAM, CCNE1, CDC45, PRC1, CCNB2, AURKA, MELK, CDC20, TOP2A and PTTG1. Among the genes, expressions of 14 hub genes were associated with prognosis of NSCLC patients. Additionally, a total of 11 DEMs were also identified. Our results provide some potential underlying biomarkers for NSCLC. Further studies are required to elucidate the pathogenesis of NSCLC. Copyright © 2018 Elsevier B.V. All rights reserved.
Insilico profiling of microRNAs in Korean ginseng (Panax ginseng Meyer)

PubMed Central

Mathiyalagan, Ramya; Subramaniyam, Sathiyamoorthy; Natarajan, Sathishkumar; Kim, Yeon Ju; Sun, Myung Suk; Kim, Se Young; Kim, Yu-Jin; Yang, Deok Chun

2013-01-01

MicroRNAs (miRNAs) are a class of recently discovered non-coding small RNA molecules, on average approximately 21 nucleotides in length, which underlie numerous important biological roles in gene regulation in various organisms. The miRNA database (release 18) has 18,226 miRNAs, which have been deposited from different species. Although miRNAs have been identified and validated in many plant species, no studies have been reported on discovering miRNAs in Panax ginseng Meyer, which is a traditionally known medicinal plant in oriental medicine, also known as Korean ginseng. It has triterpene ginseng saponins called ginsenosides, which are responsible for its various pharmacological activities. Predicting conserved miRNAs by homology-based analysis with available expressed sequence tag (EST) sequences can be powerful, if the species lacks whole genome sequence information. In this study by using the EST based computational approach, 69 conserved miRNAs belonging to 44 miRNA families were identified in Korean ginseng. The digital gene expression patterns of predicted conserved miRNAs were analyzed by deep sequencing using small RNA sequences of flower buds, leaves, and lateral roots. We have found that many of the identified miRNAs showed tissue specific expressions. Using the insilico method, 346 potential targets were identified for the predicted 69 conserved miRNAs by searching the ginseng EST database, and the predicted targets were mainly involved in secondary metabolic processes, responses to biotic and abiotic stress, and transcription regulator activities, as well as a variety of other metabolic processes. PMID:23717176
Development of small molecule biosensors by coupling the recognition of the bacterial allosteric transcription factor with isothermal strand displacement amplification.

PubMed

Yao, Yongpeng; Li, Shanshan; Cao, Jiaqian; Liu, Weiwei; Fan, Keqiang; Xiang, Wensheng; Yang, Keqian; Kong, Deming; Wang, Weishan

2018-05-08

Here, we demonstrate an easy-to-implement and general biosensing strategy by coupling the small-molecule recognition of the bacterial allosteric transcription factor (aTF) with isothermal strand displacement amplification (SDA) in vitro. Based on this strategy, we developed two biosensors for the detection of an antiseptic, p-hydroxybenzoic acid, and a disease marker, uric acid, using bacterial aTF HosA and HucR, respectively, highlighting the great potential of this strategy for the development of small-molecule biosensors.
A structural biology perspective on bioactive small molecules and their plant targets.

PubMed

Kumari, Selva; van der Hoorn, Renier A L

2011-10-01

Structural biology efforts in recent years have generated numerous co-crystal structures of bioactive small molecules interacting with their plant targets. These studies include the targets of various phytohormones, pathogen-derived effectors, herbicides and other bioactive compounds. Here we discuss that this collection of structures contains excellent examples of nine collective observations: molecular glues, allostery, inhibitors, molecular mimicry, promiscuous binding sites, unexpected electron densities, natural selection at atomic resolution, and applications in structure-guided mutagenesis and small molecule design. Copyright © 2011 Elsevier Ltd. All rights reserved.
New developments in microbial interspecies signaling.

PubMed

Shank, Elizabeth Anne; Kolter, Roberto

2009-04-01

There is a growing appreciation that in addition to well-documented intraspecies quorum sensing systems, small molecules act as signals between microbes of different species. This review will focus on how bacterial small molecules modulate these interspecies interactions. We will particularly emphasize complex relationships such as those between microbes and insects, interactions resulting in non-antagonistic outcomes (i.e. developmental and morphological processes), how co-culture can lead to the discovery of new small molecules, and the use of known compounds to evoke unexpected responses and mediate crosstalk between microbes.
Tailoring the interface using thiophene small molecules in TiO2/P3HT hybrid solar cells.

PubMed

Freitas, Flavio S; Clifford, John N; Palomares, Emilio; Nogueira, Ana F

2012-09-14

In this paper we focus on the effect of carboxylated thiophene small molecules as interface modifiers in TiO(2)/P3HT hybrid solar cells. Our results show that small differences in the chemical structure of these molecules, for example, the presence of the -CH(2)- group in the 2-thiopheneacetic acid (TAA), can greatly increase the TiO(2) surface wettability, improving the TiO(2)/polymer contact. This effect is important to enhance exciton splitting and charge separation.
Janus Kinase Antagonists and Other Novel Small Molecules for the Treatment of Crohn's Disease.

PubMed

Boland, Brigid S; Vermeire, Séverine

2017-09-01

There is an ongoing, unmet need for effective therapies for Crohn's disease. Treatments for Crohn's disease continue to evolve from the traditional biologics to novel small molecules, with targeted mechanisms directed toward pathways that are dysregulated in Crohn's disease. There are multiple emerging mechanisms of action, including Janus kinase inhibition, Smad7 inhibition, and sphingosine-1-phosphate receptor modulators, that are administered as oral medications, and small molecules represent the next generation of therapies for Crohn's disease. Copyright © 2017 Elsevier Inc. All rights reserved.
PLMItRNA, a database for mitochondrial tRNA genes and tRNAs in photosynthetic eukaryotes.

PubMed

Damiano, F; Gallerani, R; Liuni, S; Licciulli, F; Ceci, L R

2001-01-01

The PLMItRNA database for mitochondrial tRNA molecules and genes in VIRIDIPLANTAE: (green plants) [Volpetti,V., Gallerani,R., DeBenedetto,C., Liuni,S., Licciulli,F. and Ceci,L.R. (2000) Nucleic Acids Res., 28, 159-162] has been enlarged to include algae. The database now contains 436 genes and 16 tRNA entries relative to 25 higher plants, eight green algae, four red algae (RHODOPHYTAE:) and two STRAMENOPILES: The PLMItRNA database is accessible via the WWW at http://bio-www.ba.cnr.it:8000/PLMItRNA.
Theoretical Investigation of Single-Molecule Sensing Using Nanotube-Enhanced Circular Dichroism.

PubMed

Silva, Jaime; Milne, Bruce F; Nogueira, Fernando

2018-06-19

First-principles calculations have been used to investigate the potential use of circular dichroism (CD) spectroscopy in single-molecule sensing. Using a real-space implementation of time-dependent density functional theory (TDDFT), several systems involving single-walled carbon nanotubes (SWCNT) and small molecules have been studied to evaluate their CD response. Large induced CD (ICD) effects, differing for each test molecule, were observed in all SWCNT-molecule complexes. As the SWCNT used in this study shows no intrinsic CD response, the ICD spectra are the result of interaction with the small molecules. This finding is general and independent of the (a)chiral nature of the adsorbed molecule. Our results indicate that it is possible to design a system that uses SWCNT for detection of molecules using the change in CD spectrum of the system induced by adsorption of the molecule onto the SWCNT surface.
2016 White Paper on recent issues in bioanalysis: focus on biomarker assay validation (BAV) (Part 1 - small molecules, peptides and small molecule biomarkers by LCMS).

PubMed

Yang, Eric; Welink, Jan; Cape, Stephanie; Woolf, Eric; Sydor, Jens; James, Christopher; Goykhman, Dina; Arnold, Mark; Addock, Neil; Bauer, Ronald; Buonarati, Michael; Ciccimaro, Eugene; Dodda, Raj; Evans, Christopher; Garofolo, Fabio; Hughes, Nicola; Islam, Rafiq; Nehls, Corey; Wilson, Amanda; Briscoe, Chad; Bustard, Mark; Coppola, Laura; Croft, Stephanie; Drexler, Dieter; Ferrari, Luca; Fraier, Daniela; Jenkins, Rand; Kadavil, John; King, Lloyd; Li, Wenkui; Lima Santos, Gustavo Mendes; Musuku, Adrien; Ramanathan, Ragu; Saito, Yoshiro; Savoie, Natasha; Summerfield, Scott; Sun, Rachel; Tampal, Nilufer; Vinter, Steve; Wakelin-Smith, Jason; Yue, Qin

2016-10-07

The 2016 10 th Workshop on Recent Issues in Bioanalysis (10 th WRIB) took place in Orlando, Florida with participation of close to 700 professionals from pharmaceutical/biopharmaceutical companies, biotechnology companies, contract research organizations, and regulatory agencies worldwide. WRIB was once again a 5-day, weeklong event - A Full Immersion Week of Bioanalysis including Biomarkers and Immunogenicity. As usual, it was specifically designed to facilitate sharing, reviewing, discussing and agreeing on approaches to address the most current issues of interest including both small and large molecule analysis involving LCMS, hybrid LBA/LCMS, and LBA approaches, with the focus on biomarkers and immunogenicity. This 2016 White Paper encompasses recommendations emerging from the extensive discussions held during the workshop, and is aimed to provide the bioanalytical community with key information and practical solutions on topics and issues addressed, in an effort to enable advances in scientific excellence, improved quality and better regulatory compliance. This white paper is published in 3 parts due to length. This part (Part 1) discusses the recommendations for small molecules, peptides and small molecule biomarkers by LCMS. Part 2 (Hybrid LBA/LCMS and regulatory inputs from major global health authorities) and Part 3 (large molecule bioanalysis using LBA, biomarkers and immunogenicity) will be published in the Bioanalysis journal, issue 23.
Diffusion of small molecules into medaka embryos improved by electroporation

PubMed Central

2013-01-01

Background Diffusion of small molecules into fish embryos is essential for many experimental procedures in developmental biology and toxicology. Since we observed a weak uptake of lithium into medaka eggs we started a detailed analysis of its diffusion properties using small fluorescent molecules. Results Contrary to our expectations, not the rigid outer chorion but instead membrane systems surrounding the embryo/yolk turned out to be the limiting factor for diffusion into medaka eggs. The consequence is a bi-phasic uptake of small molecules first reaching the pervitelline space with a diffusion half-time in the range of a few minutes. This is followed by a slow second phase (half-time in the range of several hours) during which accumulation in the embryo/yolk takes place. Treatment with detergents improved the uptake, but strongly affected the internal distribution of the molecules. Testing electroporation we could establish conditions to overcome the diffusion barrier. Applying this method to lithium chloride we observed anterior truncations in medaka embryos in agreement with its proposed activation of Wnt signalling. Conclusions The diffusion of small molecules into medaka embryos is slow, caused by membrane systems underneath the chorion. These results have important implications for pharmacologic/toxicologic techniques like the fish embryo test, which therefore require extended incubation times in order to reach sufficient concentrations in the embryos. PMID:23815821
Causal biological network database: a comprehensive platform of causal biological network models focused on the pulmonary and vascular systems.

PubMed

Boué, Stéphanie; Talikka, Marja; Westra, Jurjen Willem; Hayes, William; Di Fabio, Anselmo; Park, Jennifer; Schlage, Walter K; Sewer, Alain; Fields, Brett; Ansari, Sam; Martin, Florian; Veljkovic, Emilija; Kenney, Renee; Peitsch, Manuel C; Hoeng, Julia

2015-01-01

With the wealth of publications and data available, powerful and transparent computational approaches are required to represent measured data and scientific knowledge in a computable and searchable format. We developed a set of biological network models, scripted in the Biological Expression Language, that reflect causal signaling pathways across a wide range of biological processes, including cell fate, cell stress, cell proliferation, inflammation, tissue repair and angiogenesis in the pulmonary and cardiovascular context. This comprehensive collection of networks is now freely available to the scientific community in a centralized web-based repository, the Causal Biological Network database, which is composed of over 120 manually curated and well annotated biological network models and can be accessed at http://causalbionet.com. The website accesses a MongoDB, which stores all versions of the networks as JSON objects and allows users to search for genes, proteins, biological processes, small molecules and keywords in the network descriptions to retrieve biological networks of interest. The content of the networks can be visualized and browsed. Nodes and edges can be filtered and all supporting evidence for the edges can be browsed and is linked to the original articles in PubMed. Moreover, networks may be downloaded for further visualization and evaluation. Database URL: http://causalbionet.com © The Author(s) 2015. Published by Oxford University Press.
Small molecules targeting LapB protein prevent Listeria attachment to catfish muscle

PubMed Central

Das, Bhaskar; Lawrence, Mark

2017-01-01

Listeria monocytogenes is a Gram-positive foodborne pathogen and the causative agent of listeriosis. L. monocytogenes lapB gene encodes a cell wall surface anchor protein, and mutation of this gene causes Listeria attenuation in mice. In this work, the potential role of Listeria LapB protein in catfish fillet attachment was investigated. To achieve this, boron-based small molecules designed to interfere with the active site of the L. monocytogenes LapB protein were developed, and their ability to prevent L. monocytogenes attachment to fish fillet was tested. Results indicated that seven out of nine different small molecules were effective in reducing the Listeria attachment to catfish fillets. Of these, three small molecules (SM3, SM5, and SM7) were highly effective in blocking Listeria attachment to catfish fillets. This study suggests an alternative strategy for reduction of L. monocytogenes contamination in fresh and frozen fish products. PMID:29253892
Incorporation of ionic liquid into porous polymer monoliths to enhance the separation of small molecules in reversed-phase high-performance liquid chromatography.

PubMed

Wang, Jiafei; Bai, Ligai; Wei, Zhen; Qin, Junxiao; Ma, Yamin; Liu, Haiyan

2015-06-01

An ionic liquid was incorporated into the porous polymer monoliths to afford stationary phases with enhanced chromatographic performance for small molecules in reversed-phase high-performance liquid chromatography. The effect of the ionic liquid in the polymerization mixture on the performance of the monoliths was studied in detail. While monoliths without ionic liquid exhibited poor resolution and low efficiency, the addition of ionic liquid to the polymerization mixture provides highly increased resolution and high efficiency. The chromatographic performances of the monoliths were demonstrated by the separations of various small molecules including aromatic hydrocarbons, isomers, and homologues using a binary polar mobile phase. The present column efficiency reached 27 000 plates/m, which showed that the ionic liquid monoliths are alternative stationary phases in the separation of small molecules by high-performance liquid chromatography. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Precise small molecule recognition of a toxic CUG RNA repeat expansion

PubMed Central

Rzuczek, Suzanne G; Colgan, Lesley A; Nakai, Yoshio; Cameron, Michael D; Furling, Denis; Yasuda, Ryohei; Disney, Matthew D

2017-01-01

Excluding the ribosome and riboswitches, developing small molecules that selectively target RNA is a longstanding problem in chemical biology. A typical cellular RNA is difficult to target because it has little tertiary, but abundant secondary structure. We designed allele-selective compounds that target such an RNA, the toxic noncoding repeat expansion (r(CUG)exp) that causes myotonic dystrophy type 1 (DM1). We developed several strategies to generate allele-selective small molecules, including non-covalent binding, covalent binding, cleavage and on-site probe synthesis. Covalent binding and cleavage enabled target profiling in cells derived from individuals with DM1, showing precise recognition of r(CUG)exp. In the on-site probe synthesis approach, small molecules bound adjacent sites in r(CUG)exp and reacted to afford picomolar inhibitors via a proximity-based click reaction only in DM1-affected cells. We expanded this approach to image r(CUG)exp in its natural context. PMID:27941760
Precise small-molecule recognition of a toxic CUG RNA repeat expansion.

PubMed

Rzuczek, Suzanne G; Colgan, Lesley A; Nakai, Yoshio; Cameron, Michael D; Furling, Denis; Yasuda, Ryohei; Disney, Matthew D

2017-02-01

Excluding the ribosome and riboswitches, developing small molecules that selectively target RNA is a longstanding problem in chemical biology. A typical cellular RNA is difficult to target because it has little tertiary, but abundant secondary structure. We designed allele-selective compounds that target such an RNA, the toxic noncoding repeat expansion (r(CUG) exp ) that causes myotonic dystrophy type 1 (DM1). We developed several strategies to generate allele-selective small molecules, including non-covalent binding, covalent binding, cleavage and on-site probe synthesis. Covalent binding and cleavage enabled target profiling in cells derived from individuals with DM1, showing precise recognition of r(CUG) exp . In the on-site probe synthesis approach, small molecules bound adjacent sites in r(CUG) exp and reacted to afford picomolar inhibitors via a proximity-based click reaction only in DM1-affected cells. We expanded this approach to image r(CUG) exp in its natural context.

Small molecule alteration of RNA sequence in cells and animals.

PubMed

Guan, Lirui; Luo, Yiling; Ja, William W; Disney, Matthew D

2017-10-18

RNA regulation and maintenance are critical for proper cell function. Small molecules that specifically alter RNA sequence would be exceptionally useful as probes of RNA structure and function or as potential therapeutics. Here, we demonstrate a photochemical approach for altering the trinucleotide expanded repeat causative of myotonic muscular dystrophy type 1 (DM1), r(CUG) exp . The small molecule, 2H-4-Ru, binds to r(CUG) exp and converts guanosine residues to 8-oxo-7,8-dihydroguanosine upon photochemical irradiation. We demonstrate targeted modification upon irradiation in cell culture and in Drosophila larvae provided a diet containing 2H-4-Ru. Our results highlight a general chemical biology approach for altering RNA sequence in vivo by using small molecules and photochemistry. Furthermore, these studies show that addition of 8-oxo-G lesions into RNA 3' untranslated regions does not affect its steady state levels. Copyright © 2017 Elsevier Ltd. All rights reserved.
High-Throughput RT-PCR for small-molecule screening assays

PubMed Central

Bittker, Joshua A.

2012-01-01

Quantitative measurement of the levels of mRNA expression using real-time reverse transcription polymerase chain reaction (RT-PCR) has long been used for analyzing expression differences in tissue or cell lines of interest. This method has been used somewhat less frequently to measure the changes in gene expression due to perturbagens such as small molecules or siRNA. The availability of new instrumentation for liquid handling and real-time PCR analysis as well as the commercial availability of start-to-finish kits for RT-PCR has enabled the use of this method for high-throughput small-molecule screening on a scale comparable to traditional high-throughput screening (HTS) assays. This protocol focuses on the special considerations necessary for using quantitative RT-PCR as a primary small-molecule screening assay, including the different methods available for mRNA isolation and analysis. PMID:23487248
Selective inhibition of c-Myc/Max dimerization and DNA binding by small molecules.

PubMed

Kiessling, Anke; Sperl, Bianca; Hollis, Angela; Eick, Dirk; Berg, Thorsten

2006-07-01

bZip and bHLHZip protein family members comprise a large fraction of eukaryotic transcription factors and need to bind DNA in order to exert most of their fundamental biological roles. Their binding to DNA requires homo- or heterodimerization via alpha-helical domains, which generally do not contain obvious binding sites for small molecules. We have identified two small molecules, dubbed Mycro1 and Mycro2, which inhibit the protein-protein interactions between the bHLHZip proteins c-Myc and Max. Mycros are the first inhibitors of c-Myc/Max dimerization, which have been demonstrated to inhibit DNA binding of c-Myc with preference over other dimeric transcription factors in vitro. Mycros inhibit c-Myc-dependent proliferation, gene transcription, and oncogenic transformation in the low micromolar concentration range. Our data support the idea that dimeric transcription factors can be druggable even in the absence of obvious small-molecule binding pockets.
Small-molecule pheromones and hormones controlling nematode development.

PubMed

Butcher, Rebecca A

2017-05-17

The existence of small-molecule signals that influence development in Caenorhabditis elegans has been known for several decades, but only in recent years have the chemical structures of several of these signals been established. The identification of these signals has enabled connections to be made between these small molecules and fundamental signaling pathways in C. elegans that influence not only development but also metabolism, fertility, and lifespan. Spurred by these important discoveries and aided by recent advances in comparative metabolomics and NMR spectroscopy, the field of nematode chemistry has the potential to expand dramatically in the coming years. This Perspective will focus on small-molecule pheromones and hormones that influence developmental events in the nematode life cycle (ascarosides, dafachronic acids, and nemamides), will cover more recent work regarding the biosynthesis of these signals, and will explore how the discovery of these signals is transforming our understanding of nematode development and physiology.
A small molecule chemical chaperone optimizes its unfolded state contraction and denaturant like properties

NASA Astrophysics Data System (ADS)

Sharma, Sunny; Sarkar, Suparna; Paul, Simanta Sarani; Roy, Syamal; Chattopadhyay, Krishnananda

2013-12-01

Protein aggregation is believed to occur through the formation of misfolded conformations. It is expected that, in order to minimize aggregation, an effective small molecule chaperone would destabilize these intermediates. To study the mechanism of a chemical chaperone, we have designed a series of mutant proteins in which a tryptophan residue experiences different local environments and solvent exposures. We show that these mutants correspond to a series of conformationally altered proteins with varying degree of misfolding stress and aggregation propensities. Using arginine as a model small molecule, we show that a combination of unfolded state contraction and denaturant like properties results in selective targeting and destabilization of the partially folded proteins. In comparison, the effect of arginine towards the folded like control mutant, which is not aggregation prone, is significantly less. Other small molecules, lacking either of the above two properties, do not offer any specificity towards the misfolded proteins.
Side-chain Engineering of Benzo[1,2-b:4,5-b’]dithiophene Core-structured Small Molecules for High-Performance Organic Solar Cells

PubMed Central

Yin, Xinxing; An, Qiaoshi; Yu, Jiangsheng; Guo, Fengning; Geng, Yongliang; Bian, Linyi; Xu, Zhongsheng; Zhou, Baojing; Xie, Linghai; Zhang, Fujun; Tang, Weihua

2016-01-01

Three novel small molecules have been developed by side-chain engineering on benzo[1,2-b:4,5-b’]dithiophene (BDT) core. The typical acceptor-donor-acceptor (A-D-A) structure is adopted with 4,8-functionalized BDT moieties as core, dioctylterthiophene as π bridge and 3-ethylrhodanine as electron-withdrawing end group. Side-chain engineering on BDT core exhibits small but measurable effect on the optoelectronic properties of small molecules. Theoretical simulation and X-ray diffraction study reveal the subtle tuning of interchain distance between conjugated backbones has large effect on the charge transport and thus the photovoltaic performance of these molecules. Bulk-heterojunction solar cells fabricated with a configuration of ITO/PEDOT:PSS/SM:PC71BM/PFN/Al exhibit a highest power conversion efficiency (PCE) of 6.99% after solvent vapor annealing. PMID:27140224
Side-chain Engineering of Benzo[1,2-b:4,5-b']dithiophene Core-structured Small Molecules for High-Performance Organic Solar Cells.

PubMed

Yin, Xinxing; An, Qiaoshi; Yu, Jiangsheng; Guo, Fengning; Geng, Yongliang; Bian, Linyi; Xu, Zhongsheng; Zhou, Baojing; Xie, Linghai; Zhang, Fujun; Tang, Weihua

2016-05-03

Three novel small molecules have been developed by side-chain engineering on benzo[1,2-b:4,5-b']dithiophene (BDT) core. The typical acceptor-donor-acceptor (A-D-A) structure is adopted with 4,8-functionalized BDT moieties as core, dioctylterthiophene as π bridge and 3-ethylrhodanine as electron-withdrawing end group. Side-chain engineering on BDT core exhibits small but measurable effect on the optoelectronic properties of small molecules. Theoretical simulation and X-ray diffraction study reveal the subtle tuning of interchain distance between conjugated backbones has large effect on the charge transport and thus the photovoltaic performance of these molecules. Bulk-heterojunction solar cells fabricated with a configuration of ITO/PEDOT:PSS/SM:PC71BM/PFN/Al exhibit a highest power conversion efficiency (PCE) of 6.99% after solvent vapor annealing.
Small molecule solution-processed bulk heterojunction solar cells with inverted structure using porphyrin donor

NASA Astrophysics Data System (ADS)

Yamamoto, Takaki; Hatano, Junichi; Nakagawa, Takafumi; Yamaguchi, Shigeru; Matsuo, Yutaka

2013-01-01

Utilizing tetraethynyl porphyrin derivative (TE-Por) as a small molecule donor material, we fabricated a small molecule solution-processed bulk heterojunction (BHJ) solar cell with inverted structure, which exhibited 1.6% power conversion efficiency (JSC (short-circuit current) = 4.6 mA/cm2, VOC (open-circuit voltage) = 0.90 V, and FF (fill factor) = 0.39) in the device configuration indium tin oxide/TiOx (titanium sub-oxide)/[6,6]-phenyl-C61-butyric acid methyl ester:TE-Por (5:1)/MoOx (molybdenum sub-oxide)/Au under AM1.5 G illumination at 100 mW/cm2. Without encapsulation, the small molecule solution-processed inverted BHJ solar cell also showed remarkable durability to air, where it kept over 73% of its initial power conversion efficiency after storage for 28 days under ambient atmosphere in the dark.
Multivalent small molecule pan-RAS inhibitors

PubMed Central

Welsch, Matthew E.; Kaplan, Anna; Chambers, Jennifer M.; Stokes, Michael E.; Bos, Pieter H.; Zask, Arie; Zhang, Yan; Sanchez-Martin, Marta; Badgley, Michael A.; Huang, Christine S.; Tran, Timothy H.; Akkiraju, Hemanth; Brown, Lewis M.; Nandakumar, Renu; Cremers, Serge; Yang, Wan S.; Tong, Liang; Olive, Kenneth P.; Ferrando, Adolfo; Stockwell, Brent R.

2017-01-01

SUMMARY Design of small molecules that disrupt protein-protein interactions, including the interaction of RAS proteins and their effectors, have potential use as chemical probes and therapeutic agents. We describe here the synthesis and testing of potential small molecule pan-RAS ligands, which were designed to interact with adjacent sites on the surface of oncogenic KRAS. One compound, termed 3144, was found to bind to RAS proteins using microscale thermophoresis, nuclear magnetic resonance spectroscopy and isothermal titration calorimetry, and to exhibit lethality in cells partially dependent on expression of RAS proteins. This compound was metabolically stable in liver microsomes and displayed anti-tumor activity in xenograft mouse cancer models. These findings suggest that pan-RAS inhibition may be an effective therapeutic strategy for some cancers, and that structure-based design of small molecules targeting multiple adjacent sites to create multivalent inhibitors may be effective for some proteins. PMID:28235199
Complex small-molecule architectures regulate phenotypic plasticity in a nematode.

PubMed

Bose, Neelanjan; Ogawa, Akira; von Reuss, Stephan H; Yim, Joshua J; Ragsdale, Erik J; Sommer, Ralf J; Schroeder, Frank C

2012-12-07

Chemistry the worm's way: The nematode Pristionchus pacificus constructs elaborate small molecules from modified building blocks of primary metabolism, including an unusual xylopyranose-based nucleoside (see scheme). These compounds act as signaling molecules to control adult phenotypic plasticity and dauer development and provide examples of modular generation of structural diversity in metazoans. Copyright © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Inhibiting prolyl isomerase activity by hybrid organic-inorganic molecules containing rhodium(II) fragments.

PubMed

Coughlin, Jane M; Kundu, Rituparna; Cooper, Julian C; Ball, Zachary T

2014-11-15

A small molecule containing a rhodium(II) tetracarboxylate fragment is shown to be a potent inhibitor of the prolyl isomerase FKBP12. The use of small molecules conjugates of rhodium(II) is presented as a general strategy for developing new protein inhibitors based on distinct structural and sequence features of the enzyme active site. Copyright © 2014 Elsevier Ltd. All rights reserved.
Affinity modulation of small-molecule ligands by borrowing endogenous protein surfaces

PubMed Central

Briesewitz, Roger; Ray, Gregory T.; Wandless, Thomas J.; Crabtree, Gerald R.

1999-01-01

A general strategy is described for improving the binding properties of small-molecule ligands to protein targets. A bifunctional molecule is created by chemically linking a ligand of interest to another small molecule that binds tightly to a second protein. When the ligand of interest is presented to the target protein by the second protein, additional protein–protein interactions outside of the ligand-binding sites serve either to increase or decrease the affinity of the binding event. We have applied this approach to an intractable target, the SH2 domain, and demonstrate a 3-fold enhancement over the natural peptide. This approach provides a way to modulate the potency and specificity of biologically active compounds. PMID:10051576
Large-Scale Validation of Mixed-Solvent Simulations to Assess Hotspots at Protein-Protein Interaction Interfaces.

PubMed

Ghanakota, Phani; van Vlijmen, Herman; Sherman, Woody; Beuming, Thijs

2018-04-23

The ability to target protein-protein interactions (PPIs) with small molecule inhibitors offers great promise in expanding the druggable target space and addressing a broad range of untreated diseases. However, due to their nature and function of interacting with protein partners, PPI interfaces tend to extend over large surfaces without the typical pockets of enzymes and receptors. These features present unique challenges for small molecule inhibitor design. As such, determining whether a particular PPI of interest could be pursued with a small molecule discovery strategy requires an understanding of the characteristics of the PPI interface and whether it has hotspots that can be leveraged by small molecules to achieve desired potency. Here, we assess the ability of mixed-solvent molecular dynamic (MSMD) simulations to detect hotspots at PPI interfaces. MSMD simulations using three cosolvents (acetonitrile, isopropanol, and pyrimidine) were performed on a large test set of 21 PPI targets that have been experimentally validated by small molecule inhibitors. We compare MSMD, which includes explicit solvent and full protein flexibility, to a simpler approach that does not include dynamics or explicit solvent (SiteMap) and find that MSMD simulations reveal additional information about the characteristics of these targets and the ability for small molecules to inhibit the PPI interface. In the few cases were MSMD simulations did not detect hotspots, we explore the shortcomings of this technique and propose future improvements. Finally, using Interleukin-2 as an example, we highlight the advantage of the MSMD approach for detecting transient cryptic druggable pockets that exists at PPI interfaces.
Inhibitor of PI3K/Akt Signaling Pathway Small Molecule Promotes Motor Neuron Differentiation of Human Endometrial Stem Cells Cultured on Electrospun Biocomposite Polycaprolactone/Collagen Scaffolds.

PubMed

Ebrahimi-Barough, Somayeh; Hoveizi, Elham; Yazdankhah, Meysam; Ai, Jafar; Khakbiz, Mehrdad; Faghihi, Faezeh; Tajerian, Roksana; Bayat, Neda

2017-05-01

Small molecules as useful chemical tools can affect cell differentiation and even change cell fate. It is demonstrated that LY294002, a small molecule inhibitor of phosphatidylinositol 3-kinase (PI3K)/Akt signal pathway, can inhibit proliferation and promote neuronal differentiation of mesenchymal stem cells (MSCs). The purpose of this study was to investigate the differentiation effect of Ly294002 small molecule on the human endometrial stem cells (hEnSCs) into motor neuron-like cells on polycaprolactone (PCL)/collagen scaffolds. hEnSCs were cultured in a neurogenic inductive medium containing 1 μM LY294002 on the surface of PCL/collagen electrospun fibrous scaffolds. Cell attachment and viability of cells on scaffolds were characterized by scanning electron microscope (SEM) and 3-(4,5-dimethylthiazoyl-2-yl)2,5-diphenyltetrazolium bromide (MTT) assay. The expression of neuron-specific markers was assayed by real-time PCR and immunocytochemistry analysis after 15 days post induction. Results showed that attachment and differentiation of hEnSCs into motor neuron-like cells on the scaffolds with Ly294002 small molecule were higher than that of the cells on tissue culture plates as control group. In conclusion, PCL/collagen electrospun scaffolds with Ly294002 have potential for being used in neural tissue engineering because of its bioactive and three-dimensional structure which enhances viability and differentiation of hEnSCs into neurons through inhibition of the PI3K/Akt pathway. Thus, manipulation of this pathway by small molecules can enhance neural differentiation.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Iconaru, Luigi I.; Ban, David; Bharatham, Kavitha

In disordered proteins we see that they are highly prevalent in biological systems. They control myriad signaling and regulatory processes, and their levels and/or cellular localization are often altered in human disease. In contrast to folded proteins, disordered proteins, due to conformational heterogeneity and dynamics, are not considered viable drug targets. We challenged this paradigm by identifying through NMR-based screening small molecules that bound specifically, albeit weakly, to the disordered cell cycle regulator, p27 Kip1 (p27). Moreover, two groups of molecules bound to sites created by transient clusters of aromatic residues within p27. Conserved chemical features within these two groupsmore » of small molecules exhibited complementarity to their binding sites within p27, establishing structure-activity relationships for small molecule: disordered protein interactions. Finally, one compound counteracted the Cdk2/cyclin A inhibitory function of p27 in vitro, providing proof-of- principle that small molecules can inhibit the function of a disordered protein (p27) through sequestration in a conformation incapable of folding and binding to a natural regulatory target (Cdk2/cyclin A).« less
[Status of libraries and databases for natural products at abroad].

PubMed

Zhao, Li-Mei; Tan, Ning-Hua

2015-01-01

For natural products are one of the important sources for drug discovery, libraries and databases of natural products are significant for the development and research of natural products. At present, most of compound libraries at abroad are synthetic or combinatorial synthetic molecules, resulting to access natural products difficult; for information of natural products are scattered with different standards, it is difficult to construct convenient, comprehensive and large-scale databases for natural products. This paper reviewed the status of current accessing libraries and databases for natural products at abroad and provided some important information for the development of libraries and database for natural products.
Tulane/Xavier Vaccine Development/Engineering Project

DTIC Science & Technology

2009-02-01

spectroscopic studies with polar dyes (e.g. proflavine ) have verified these compounds’ ability to encapsulate and solvate small polar dye molecules in...systems. Fluorescent microscopy studies verify that they significantly enhance the transport of polar small molecules ( proflavin dye) through
Electronegativity Equalization Method: Parameterization and Validation for Large Sets of Organic, Organohalogene and Organometal Molecule

PubMed Central

Vařeková, Radka Svobodová; Jiroušková, Zuzana; Vaněk, Jakub; Suchomel, Šimon; Koča, Jaroslav

2007-01-01

The Electronegativity Equalization Method (EEM) is a fast approach for charge calculation. A challenging part of the EEM is the parameterization, which is performed using ab initio charges obtained for a set of molecules. The goal of our work was to perform the EEM parameterization for selected sets of organic, organohalogen and organometal molecules. We have performed the most robust parameterization published so far. The EEM parameterization was based on 12 training sets selected from a database of predicted 3D structures (NCI DIS) and from a database of crystallographic structures (CSD). Each set contained from 2000 to 6000 molecules. We have shown that the number of molecules in the training set is very important for quality of the parameters. We have improved EEM parameters (STO-3G MPA charges) for elements that were already parameterized, specifically: C, O, N, H, S, F and Cl. The new parameters provide more accurate charges than those published previously. We have also developed new parameters for elements that were not parameterized yet, specifically for Br, I, Fe and Zn. We have also performed crossover validation of all obtained parameters using all training sets that included relevant elements and confirmed that calculated parameters provide accurate charges.
How Database Management Systems Can Be Used To Evaluate Program Effectiveness in Small School Districts.

ERIC Educational Resources Information Center

Hoffman, Tony

Sophisticated database management systems (DBMS) for microcomputers are becoming increasingly easy to use, allowing small school districts to develop their own autonomous databases for tracking enrollment and student progress in special education. DBMS applications can be designed for maintenance by district personnel with little technical…
Mass action at the single-molecule level.

PubMed

Shon, Min Ju; Cohen, Adam E

2012-09-05

We developed a system to reversibly encapsulate small numbers of molecules in an array of nanofabricated "dimples". This system enables highly parallel, long-term, and attachment-free studies of molecular dynamics via single-molecule fluorescence. In studies of bimolecular reactions of small numbers of confined molecules, we see phenomena that, while expected from basic statistical mechanics, are not observed in bulk chemistry. Statistical fluctuations in the occupancy of sealed reaction chambers lead to steady-state fluctuations in reaction equilibria and rates. These phenomena are likely to be important whenever reactions happen in confined geometries.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.