Sample records for complex molecular machine

  1. Ferrocene-containing non-interlocked molecular machines.

    PubMed

    Scottwell, Synøve Ø; Crowley, James D

    2016-02-11

    Ferrocene is the prototypical organometallic sandwich complex and despite over 60 years passing since the discovery and elucidation of ferrocene's structure, research into ferrocene-containing compounds continues to grow as potential new applications in catalysis, biology and the material sciences are found. Ferrocene is chemically robust and readily functionalized which enables its facile incorporation into more complex molecular systems. This coupled with ferrocene's reversible redox properties and ability function as a "molecular ball bearing" has led to the use of ferrocene as a component in wide range of interlocked and non-interlocked synthetic molecular machine systems. This review will focus on the exploitation of ferrocene (and related sandwich complexes) for the development of non-interlocked synthetic molecular machines.

  2. Modeling Stochastic Kinetics of Molecular Machines at Multiple Levels: From Molecules to Modules

    PubMed Central

    Chowdhury, Debashish

    2013-01-01

    A molecular machine is either a single macromolecule or a macromolecular complex. In spite of the striking superficial similarities between these natural nanomachines and their man-made macroscopic counterparts, there are crucial differences. Molecular machines in a living cell operate stochastically in an isothermal environment far from thermodynamic equilibrium. In this mini-review we present a catalog of the molecular machines and an inventory of the essential toolbox for theoretically modeling these machines. The tool kits include 1), nonequilibrium statistical-physics techniques for modeling machines and machine-driven processes; and 2), statistical-inference methods for reverse engineering a functional machine from the empirical data. The cell is often likened to a microfactory in which the machineries are organized in modular fashion; each module consists of strongly coupled multiple machines, but different modules interact weakly with each other. This microfactory has its own automated supply chain and delivery system. Buoyed by the success achieved in modeling individual molecular machines, we advocate integration of these models in the near future to develop models of functional modules. A system-level description of the cell from the perspective of molecular machinery (the mechanome) is likely to emerge from further integrations that we envisage here. PMID:23746505

  3. Modeling stochastic kinetics of molecular machines at multiple levels: from molecules to modules.

    PubMed

    Chowdhury, Debashish

    2013-06-04

    A molecular machine is either a single macromolecule or a macromolecular complex. In spite of the striking superficial similarities between these natural nanomachines and their man-made macroscopic counterparts, there are crucial differences. Molecular machines in a living cell operate stochastically in an isothermal environment far from thermodynamic equilibrium. In this mini-review we present a catalog of the molecular machines and an inventory of the essential toolbox for theoretically modeling these machines. The tool kits include 1), nonequilibrium statistical-physics techniques for modeling machines and machine-driven processes; and 2), statistical-inference methods for reverse engineering a functional machine from the empirical data. The cell is often likened to a microfactory in which the machineries are organized in modular fashion; each module consists of strongly coupled multiple machines, but different modules interact weakly with each other. This microfactory has its own automated supply chain and delivery system. Buoyed by the success achieved in modeling individual molecular machines, we advocate integration of these models in the near future to develop models of functional modules. A system-level description of the cell from the perspective of molecular machinery (the mechanome) is likely to emerge from further integrations that we envisage here. Copyright © 2013 Biophysical Society. Published by Elsevier Inc. All rights reserved.

  4. Stereodivergent synthesis with a programmable molecular machine

    NASA Astrophysics Data System (ADS)

    Kassem, Salma; Lee, Alan T. L.; Leigh, David A.; Marcos, Vanesa; Palmer, Leoni I.; Pisano, Simone

    2017-09-01

    It has been convincingly argued that molecular machines that manipulate individual atoms, or highly reactive clusters of atoms, with Ångström precision are unlikely to be realized. However, biological molecular machines routinely position rather less reactive substrates in order to direct chemical reaction sequences, from sequence-specific synthesis by the ribosome to polyketide synthases, where tethered molecules are passed from active site to active site in multi-enzyme complexes. Artificial molecular machines have been developed for tasks that include sequence-specific oligomer synthesis and the switching of product chirality, a photo-responsive host molecule has been described that is able to mechanically twist a bound molecular guest, and molecular fragments have been selectively transported in either direction between sites on a molecular platform through a ratchet mechanism. Here we detail an artificial molecular machine that moves a substrate between different activating sites to achieve different product outcomes from chemical synthesis. This molecular robot can be programmed to stereoselectively produce, in a sequential one-pot operation, an excess of any one of four possible diastereoisomers from the addition of a thiol and an alkene to an α,β-unsaturated aldehyde in a tandem reaction process. The stereodivergent synthesis includes diastereoisomers that cannot be selectively synthesized through conventional iminium-enamine organocatalysis. We anticipate that future generations of programmable molecular machines may have significant roles in chemical synthesis and molecular manufacturing.

  5. Advances in molecular labeling, high throughput imaging and machine intelligence portend powerful functional cellular biochemistry tools.

    PubMed

    Price, Jeffrey H; Goodacre, Angela; Hahn, Klaus; Hodgson, Louis; Hunter, Edward A; Krajewski, Stanislaw; Murphy, Robert F; Rabinovich, Andrew; Reed, John C; Heynen, Susanne

    2002-01-01

    Cellular behavior is complex. Successfully understanding systems at ever-increasing complexity is fundamental to advances in modern science and unraveling the functional details of cellular behavior is no exception. We present a collection of prospectives to provide a glimpse of the techniques that will aid in collecting, managing and utilizing information on complex cellular processes via molecular imaging tools. These include: 1) visualizing intracellular protein activity with fluorescent markers, 2) high throughput (and automated) imaging of multilabeled cells in statistically significant numbers, and 3) machine intelligence to analyze subcellular image localization and pattern. Although not addressed here, the importance of combining cell-image-based information with detailed molecular structure and ligand-receptor binding models cannot be overlooked. Advanced molecular imaging techniques have the potential to impact cellular diagnostics for cancer screening, clinical correlations of tissue molecular patterns for cancer biology, and cellular molecular interactions for accelerating drug discovery. The goal of finally understanding all cellular components and behaviors will be achieved by advances in both instrumentation engineering (software and hardware) and molecular biochemistry. Copyright 2002 Wiley-Liss, Inc.

  6. MoleculeNet: a benchmark for molecular machine learning† †Electronic supplementary information (ESI) available. See DOI: 10.1039/c7sc02664a

    PubMed Central

    Wu, Zhenqin; Ramsundar, Bharath; Feinberg, Evan N.; Gomes, Joseph; Geniesse, Caleb; Pappu, Aneesh S.; Leswing, Karl

    2017-01-01

    Molecular machine learning has been maturing rapidly over the last few years. Improved methods and the presence of larger datasets have enabled machine learning algorithms to make increasingly accurate predictions about molecular properties. However, algorithmic progress has been limited due to the lack of a standard benchmark to compare the efficacy of proposed methods; most new algorithms are benchmarked on different datasets making it challenging to gauge the quality of proposed methods. This work introduces MoleculeNet, a large scale benchmark for molecular machine learning. MoleculeNet curates multiple public datasets, establishes metrics for evaluation, and offers high quality open-source implementations of multiple previously proposed molecular featurization and learning algorithms (released as part of the DeepChem open source library). MoleculeNet benchmarks demonstrate that learnable representations are powerful tools for molecular machine learning and broadly offer the best performance. However, this result comes with caveats. Learnable representations still struggle to deal with complex tasks under data scarcity and highly imbalanced classification. For quantum mechanical and biophysical datasets, the use of physics-aware featurizations can be more important than choice of particular learning algorithm. PMID:29629118

  7. The Design, Synthesis, and Study of Solid-State Molecular Rotors: Structure/Function Relationships for Condensed-Phase Anisotropic Dynamics

    NASA Astrophysics Data System (ADS)

    Vogelsberg, Cortnie Sue

    Amphidynamic crystals are an extremely promising platform for the development of artificial molecular machines and stimuli-responsive materials. In analogy to skeletal muscle, their function will rely upon the collective operation of many densely packed molecular machines (i.e. actin-bound myosin) that are self-assembled in a highly organized anisotropic medium. By choosing lattice-forming elements and moving "parts" with specific functionalities, individual molecular machines may be synthesized and self-assembled in order to carry out desirable functions. In recent years, efforts in the design of amphidynamic materials based on molecular gyroscopes and compasses have shown that a certain amount of free volume is essential to facilitate internal rotation and reorientation within a crystal. In order to further establish structure/function relationships to advance the development of increasingly complex molecular machinery, molecular rotors and a molecular "spinning" top were synthesized and incorporated into a variety of solid-state architectures with different degrees of periodicity, dimensionality, and free volume. Specifically, lamellar molecular crystals, hierarchically ordered periodic mesoporous organosilicas, and metal-organic frameworks were targeted for the development of solid-state molecular machines. Using an array of solid-state nuclear magnetic resonance spectroscopy techniques, the dynamic properties of these novel molecular machine assemblies were determined and correlated with their corresponding structural features. It was found that architecture type has a profound influence on functional dynamics. The study of layered molecular crystals, composed of either molecular rotors or "spinning" tops, probed functional dynamics within dense, highly organized environments. From their study, it was discovered that: 1) crystallographically distinct sites may be utilized to differentiate machine function, 2) halogen bonding interactions are sufficiently strong to direct an assembly of molecular machines, 3) the relative flexibility of the crystal environment proximate to a dynamic component may have a significant effect on its function, and, 4) molecular machines, which possess both solid-state photochemical reactivity and dynamics may show complex reaction kinetics if the correlation time of the dynamic process and the lifetime of the excited state occur on the same time scale and the dynamic moiety inherently participates as a reaction intermediate. The study of periodic mesoporous organosilica with hierarchical order probed molecular dynamics within 2D layers of molecular rotors, organized in only one dimension and with ca. 50% exposed to the mesopore free volume. From their study, it was discovered that: 1) molecular rotors, which comprise the layers of the mesopore walls, form a 2D rotational glass, 2) rotator dynamics within the 2D rotational glass undergo a transition to a 2D rotational fluid, and, 3) a 2D rotational glass transition may be exploited to develop hyper-sensitive thermally activated molecular machines. The study of a metal-organic framework assembled from molecular rotors probed dynamics in a periodic three-dimensional free-volume environment, without the presence of close contacts. From the study of this solid-state material, it was determined that: 1) the intrinsic electronic barrier is one of the few factors, which may affect functional dynamics in a true free-volume environment, and, 2) molecular machines with dynamic barriers <

  8. Signalling networks and dynamics of allosteric transitions in bacterial chaperonin GroEL: implications for iterative annealing of misfolded proteins.

    PubMed

    Thirumalai, D; Hyeon, Changbong

    2018-06-19

    Signal transmission at the molecular level in many biological complexes occurs through allosteric transitions. Allostery describes the responses of a complex to binding of ligands at sites that are spatially well separated from the binding region. We describe the structural perturbation method, based on phonon propagation in solids, which can be used to determine the signal-transmitting allostery wiring diagram (AWD) in large but finite-sized biological complexes. Application to the bacterial chaperonin GroEL-GroES complex shows that the AWD determined from structures also drives the allosteric transitions dynamically. From both a structural and dynamical perspective these transitions are largely determined by formation and rupture of salt-bridges. The molecular description of allostery in GroEL provides insights into its function, which is quantitatively described by the iterative annealing mechanism. Remarkably, in this complex molecular machine, a deep connection is established between the structures, reaction cycle during which GroEL undergoes a sequence of allosteric transitions, and function, in a self-consistent manner.This article is part of a discussion meeting issue 'Allostery and molecular machines'. © 2018 The Author(s).

  9. Whole-cell biocomputing

    NASA Technical Reports Server (NTRS)

    Simpson, M. L.; Sayler, G. S.; Fleming, J. T.; Applegate, B.

    2001-01-01

    The ability to manipulate systems on the molecular scale naturally leads to speculation about the rational design of molecular-scale machines. Cells might be the ultimate molecular-scale machines and our ability to engineer them is relatively advanced when compared with our ability to control the synthesis and direct the assembly of man-made materials. Indeed, engineered whole cells deployed in biosensors can be considered one of the practical successes of molecular-scale devices. However, these devices explore only a small portion of cellular functionality. Individual cells or self-organized groups of cells perform extremely complex functions that include sensing, communication, navigation, cooperation and even fabrication of synthetic nanoscopic materials. In natural systems, these capabilities are controlled by complex genetic regulatory circuits, which are only partially understood and not readily accessible for use in engineered systems. Here, we focus on efforts to mimic the functionality of man-made information-processing systems within whole cells.

  10. In vitro assembly of semi-artificial molecular machine and its use for detection of DNA damage.

    PubMed

    Minchew, Candace L; Didenko, Vladimir V

    2012-01-11

    Naturally occurring bio-molecular machines work in every living cell and display a variety of designs. Yet the development of artificial molecular machines centers on devices capable of directional motion, i.e. molecular motors, and on their scaled-down mechanical parts (wheels, axels, pendants etc). This imitates the macro-machines, even though the physical properties essential for these devices, such as inertia and momentum conservation, are not usable in the nanoworld environments. Alternative designs, which do not follow the mechanical macromachines schemes and use mechanisms developed in the evolution of biological molecules, can take advantage of the specific conditions of the nanoworld. Besides, adapting actual biological molecules for the purposes of nano-design reduces potential dangers the nanotechnology products may pose. Here we demonstrate the assembly and application of one such bio-enabled construct, a semi-artificial molecular device which combines a naturally-occurring molecular machine with artificial components. From the enzymology point of view, our construct is a designer fluorescent enzyme-substrate complex put together to perform a specific useful function. This assembly is by definition a molecular machine, as it contains one. Yet, its integration with the engineered part - fluorescent dual hairpin - re-directs it to a new task of labeling DNA damage. Our construct assembles out of a 32-mer DNA and an enzyme vaccinia topoisomerase I (VACC TOPO). The machine then uses its own material to fabricate two fluorescently labeled detector units (Figure 1). One of the units (green fluorescence) carries VACC TOPO covalently attached to its 3'end and another unit (red fluorescence) is a free hairpin with a terminal 3'OH. The units are short-lived and quickly reassemble back into the original construct, which subsequently recleaves. In the absence of DNA breaks these two units continuously separate and religate in a cyclic manner. In tissue sections with DNA damage, the topoisomerase-carrying detector unit selectively attaches to blunt-ended DNA breaks with 5'OH (DNase II-type breaks), fluorescently labeling them. The second, enzyme-free hairpin formed after oligonucleotide cleavage, will ligate to a 5'PO(4) blunt-ended break (DNase I-type breaks), if T4 DNA ligase is present in the solution. When T4 DNA ligase is added to a tissue section or a solution containing DNA with 5'PO(4) blunt-ended breaks, the ligase reacts with 5'PO(4) DNA ends, forming semi-stable enzyme-DNA complexes. The blunt ended hairpins will interact with these complexes releasing ligase and covalently linking hairpins to DNA, thus labeling 5'PO(4) blunt-ended DNA breaks. This development exemplifies a new practical approach to the design of molecular machines and provides a useful sensor for detection of apoptosis and DNA damage in fixed cells and tissues. Copyright © 2012 Journal of Visualized Experiments

  11. Machine Learning Estimates of Natural Product Conformational Energies

    PubMed Central

    Rupp, Matthias; Bauer, Matthias R.; Wilcken, Rainer; Lange, Andreas; Reutlinger, Michael; Boeckler, Frank M.; Schneider, Gisbert

    2014-01-01

    Machine learning has been used for estimation of potential energy surfaces to speed up molecular dynamics simulations of small systems. We demonstrate that this approach is feasible for significantly larger, structurally complex molecules, taking the natural product Archazolid A, a potent inhibitor of vacuolar-type ATPase, from the myxobacterium Archangium gephyra as an example. Our model estimates energies of new conformations by exploiting information from previous calculations via Gaussian process regression. Predictive variance is used to assess whether a conformation is in the interpolation region, allowing a controlled trade-off between prediction accuracy and computational speed-up. For energies of relaxed conformations at the density functional level of theory (implicit solvent, DFT/BLYP-disp3/def2-TZVP), mean absolute errors of less than 1 kcal/mol were achieved. The study demonstrates that predictive machine learning models can be developed for structurally complex, pharmaceutically relevant compounds, potentially enabling considerable speed-ups in simulations of larger molecular structures. PMID:24453952

  12. Fundamental Characteristics of AAA+ Protein Family Structure and Function.

    PubMed

    Miller, Justin M; Enemark, Eric J

    2016-01-01

    Many complex cellular events depend on multiprotein complexes known as molecular machines to efficiently couple the energy derived from adenosine triphosphate hydrolysis to the generation of mechanical force. Members of the AAA+ ATPase superfamily (ATPases Associated with various cellular Activities) are critical components of many molecular machines. AAA+ proteins are defined by conserved modules that precisely position the active site elements of two adjacent subunits to catalyze ATP hydrolysis. In many cases, AAA+ proteins form a ring structure that translocates a polymeric substrate through the central channel using specialized loops that project into the central channel. We discuss the major features of AAA+ protein structure and function with an emphasis on pivotal aspects elucidated with archaeal proteins.

  13. An artificial molecular machine that builds an asymmetric catalyst

    NASA Astrophysics Data System (ADS)

    De Bo, Guillaume; Gall, Malcolm A. Y.; Kuschel, Sonja; De Winter, Julien; Gerbaux, Pascal; Leigh, David A.

    2018-05-01

    Biomolecular machines perform types of complex molecular-level tasks that artificial molecular machines can aspire to. The ribosome, for example, translates information from the polymer track it traverses (messenger RNA) to the new polymer it constructs (a polypeptide)1. The sequence and number of codons read determines the sequence and number of building blocks incorporated into the biomachine-synthesized polymer. However, neither control of sequence2,3 nor the transfer of length information from one polymer to another (which to date has only been accomplished in man-made systems through template synthesis)4 is easily achieved in the synthesis of artificial macromolecules. Rotaxane-based molecular machines5-7 have been developed that successively add amino acids8-10 (including β-amino acids10) to a growing peptide chain by the action of a macrocycle moving along a mono-dispersed oligomeric track derivatized with amino-acid phenol esters. The threaded macrocycle picks up groups that block its path and links them through successive native chemical ligation reactions11 to form a peptide sequence corresponding to the order of the building blocks on the track. Here, we show that as an alternative to translating sequence information, a rotaxane molecular machine can transfer the narrow polydispersity of a leucine-ester-derivatized polystyrene chain synthesized by atom transfer radical polymerization12 to a molecular-machine-made homo-leucine oligomer. The resulting narrow-molecular-weight oligomer folds to an α-helical secondary structure13 that acts as an asymmetric catalyst for the Juliá-Colonna epoxidation14,15 of chalcones.

  14. Internal force corrections with machine learning for quantum mechanics/molecular mechanics simulations.

    PubMed

    Wu, Jingheng; Shen, Lin; Yang, Weitao

    2017-10-28

    Ab initio quantum mechanics/molecular mechanics (QM/MM) molecular dynamics simulation is a useful tool to calculate thermodynamic properties such as potential of mean force for chemical reactions but intensely time consuming. In this paper, we developed a new method using the internal force correction for low-level semiempirical QM/MM molecular dynamics samplings with a predefined reaction coordinate. As a correction term, the internal force was predicted with a machine learning scheme, which provides a sophisticated force field, and added to the atomic forces on the reaction coordinate related atoms at each integration step. We applied this method to two reactions in aqueous solution and reproduced potentials of mean force at the ab initio QM/MM level. The saving in computational cost is about 2 orders of magnitude. The present work reveals great potentials for machine learning in QM/MM simulations to study complex chemical processes.

  15. Just Working with the Cellular Machine: A High School Game for Teaching Molecular Biology

    ERIC Educational Resources Information Center

    Cardoso, Fernanda Serpa; Dumpel, Renata; Gomes da Silva, Luisa B.; Rodrigues, Carlos R.; Santos, Dilvani O.; Cabral, Lucio Mendes; Castro, Helena C.

    2008-01-01

    Molecular biology is a difficult comprehension subject due to its high complexity, thus requiring new teaching approaches. Herein, we developed an interdisciplinary board game involving the human immune system response against a bacterial infection for teaching molecular biology at high school. Initially, we created a database with several…

  16. Fundamental Characteristics of AAA+ Protein Family Structure and Function

    PubMed Central

    2016-01-01

    Many complex cellular events depend on multiprotein complexes known as molecular machines to efficiently couple the energy derived from adenosine triphosphate hydrolysis to the generation of mechanical force. Members of the AAA+ ATPase superfamily (ATPases Associated with various cellular Activities) are critical components of many molecular machines. AAA+ proteins are defined by conserved modules that precisely position the active site elements of two adjacent subunits to catalyze ATP hydrolysis. In many cases, AAA+ proteins form a ring structure that translocates a polymeric substrate through the central channel using specialized loops that project into the central channel. We discuss the major features of AAA+ protein structure and function with an emphasis on pivotal aspects elucidated with archaeal proteins. PMID:27703410

  17. Learning surface molecular structures via machine vision

    NASA Astrophysics Data System (ADS)

    Ziatdinov, Maxim; Maksov, Artem; Kalinin, Sergei V.

    2017-08-01

    Recent advances in high resolution scanning transmission electron and scanning probe microscopies have allowed researchers to perform measurements of materials structural parameters and functional properties in real space with a picometre precision. In many technologically relevant atomic and/or molecular systems, however, the information of interest is distributed spatially in a non-uniform manner and may have a complex multi-dimensional nature. One of the critical issues, therefore, lies in being able to accurately identify (`read out') all the individual building blocks in different atomic/molecular architectures, as well as more complex patterns that these blocks may form, on a scale of hundreds and thousands of individual atomic/molecular units. Here we employ machine vision to read and recognize complex molecular assemblies on surfaces. Specifically, we combine Markov random field model and convolutional neural networks to classify structural and rotational states of all individual building blocks in molecular assembly on the metallic surface visualized in high-resolution scanning tunneling microscopy measurements. We show how the obtained full decoding of the system allows us to directly construct a pair density function—a centerpiece in analysis of disorder-property relationship paradigm—as well as to analyze spatial correlations between multiple order parameters at the nanoscale, and elucidate reaction pathway involving molecular conformation changes. The method represents a significant shift in our way of analyzing atomic and/or molecular resolved microscopic images and can be applied to variety of other microscopic measurements of structural, electronic, and magnetic orders in different condensed matter systems.

  18. Well-tempered metadynamics as a tool for characterizing multi-component, crystalline molecular machines.

    PubMed

    Ilott, Andrew J; Palucha, Sebastian; Hodgkinson, Paul; Wilson, Mark R

    2013-10-10

    The well-tempered, smoothly converging form of the metadynamics algorithm has been implemented in classical molecular dynamics simulations and used to obtain an estimate of the free energy surface explored by the molecular rotations in the plastic crystal, octafluoronaphthalene. The biased simulations explore the full energy surface extremely efficiently, more than 4 orders of magnitude faster than unbiased molecular dynamics runs. The metadynamics collective variables used have also been expanded to include the simultaneous orientations of three neighboring octafluoronaphthalene molecules. Analysis of the resultant three-dimensional free energy surface, which is sampled to a very high degree despite its significant complexity, demonstrates that there are strong correlations between the molecular orientations. Although this correlated motion is of limited applicability in terms of exploiting dynamical motion in octafluoronaphthalene, the approach used is extremely well suited to the investigation of the function of crystalline molecular machines.

  19. Feature selection and classification of protein-protein complexes based on their binding affinities using machine learning approaches.

    PubMed

    Yugandhar, K; Gromiha, M Michael

    2014-09-01

    Protein-protein interactions are intrinsic to virtually every cellular process. Predicting the binding affinity of protein-protein complexes is one of the challenging problems in computational and molecular biology. In this work, we related sequence features of protein-protein complexes with their binding affinities using machine learning approaches. We set up a database of 185 protein-protein complexes for which the interacting pairs are heterodimers and their experimental binding affinities are available. On the other hand, we have developed a set of 610 features from the sequences of protein complexes and utilized Ranker search method, which is the combination of Attribute evaluator and Ranker method for selecting specific features. We have analyzed several machine learning algorithms to discriminate protein-protein complexes into high and low affinity groups based on their Kd values. Our results showed a 10-fold cross-validation accuracy of 76.1% with the combination of nine features using support vector machines. Further, we observed accuracy of 83.3% on an independent test set of 30 complexes. We suggest that our method would serve as an effective tool for identifying the interacting partners in protein-protein interaction networks and human-pathogen interactions based on the strength of interactions. © 2014 Wiley Periodicals, Inc.

  20. Bio-AIMS Collection of Chemoinformatics Web Tools based on Molecular Graph Information and Artificial Intelligence Models.

    PubMed

    Munteanu, Cristian R; Gonzalez-Diaz, Humberto; Garcia, Rafael; Loza, Mabel; Pazos, Alejandro

    2015-01-01

    The molecular information encoding into molecular descriptors is the first step into in silico Chemoinformatics methods in Drug Design. The Machine Learning methods are a complex solution to find prediction models for specific biological properties of molecules. These models connect the molecular structure information such as atom connectivity (molecular graphs) or physical-chemical properties of an atom/group of atoms to the molecular activity (Quantitative Structure - Activity Relationship, QSAR). Due to the complexity of the proteins, the prediction of their activity is a complicated task and the interpretation of the models is more difficult. The current review presents a series of 11 prediction models for proteins, implemented as free Web tools on an Artificial Intelligence Model Server in Biosciences, Bio-AIMS (http://bio-aims.udc.es/TargetPred.php). Six tools predict protein activity, two models evaluate drug - protein target interactions and the other three calculate protein - protein interactions. The input information is based on the protein 3D structure for nine models, 1D peptide amino acid sequence for three tools and drug SMILES formulas for two servers. The molecular graph descriptor-based Machine Learning models could be useful tools for in silico screening of new peptides/proteins as future drug targets for specific treatments.

  1. Learning surface molecular structures via machine vision

    DOE PAGES

    Ziatdinov, Maxim; Maksov, Artem; Kalinin, Sergei V.

    2017-08-10

    Recent advances in high resolution scanning transmission electron and scanning probe microscopies have allowed researchers to perform measurements of materials structural parameters and functional properties in real space with a picometre precision. In many technologically relevant atomic and/or molecular systems, however, the information of interest is distributed spatially in a non-uniform manner and may have a complex multi-dimensional nature. One of the critical issues, therefore, lies in being able to accurately identify (‘read out’) all the individual building blocks in different atomic/molecular architectures, as well as more complex patterns that these blocks may form, on a scale of hundreds andmore » thousands of individual atomic/molecular units. Here we employ machine vision to read and recognize complex molecular assemblies on surfaces. Specifically, we combine Markov random field model and convolutional neural networks to classify structural and rotational states of all individual building blocks in molecular assembly on the metallic surface visualized in high-resolution scanning tunneling microscopy measurements. We show how the obtained full decoding of the system allows us to directly construct a pair density function—a centerpiece in analysis of disorder-property relationship paradigm—as well as to analyze spatial correlations between multiple order parameters at the nanoscale, and elucidate reaction pathway involving molecular conformation changes. Here, the method represents a significant shift in our way of analyzing atomic and/or molecular resolved microscopic images and can be applied to variety of other microscopic measurements of structural, electronic, and magnetic orders in different condensed matter systems.« less

  2. Learning surface molecular structures via machine vision

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ziatdinov, Maxim; Maksov, Artem; Kalinin, Sergei V.

    Recent advances in high resolution scanning transmission electron and scanning probe microscopies have allowed researchers to perform measurements of materials structural parameters and functional properties in real space with a picometre precision. In many technologically relevant atomic and/or molecular systems, however, the information of interest is distributed spatially in a non-uniform manner and may have a complex multi-dimensional nature. One of the critical issues, therefore, lies in being able to accurately identify (‘read out’) all the individual building blocks in different atomic/molecular architectures, as well as more complex patterns that these blocks may form, on a scale of hundreds andmore » thousands of individual atomic/molecular units. Here we employ machine vision to read and recognize complex molecular assemblies on surfaces. Specifically, we combine Markov random field model and convolutional neural networks to classify structural and rotational states of all individual building blocks in molecular assembly on the metallic surface visualized in high-resolution scanning tunneling microscopy measurements. We show how the obtained full decoding of the system allows us to directly construct a pair density function—a centerpiece in analysis of disorder-property relationship paradigm—as well as to analyze spatial correlations between multiple order parameters at the nanoscale, and elucidate reaction pathway involving molecular conformation changes. Here, the method represents a significant shift in our way of analyzing atomic and/or molecular resolved microscopic images and can be applied to variety of other microscopic measurements of structural, electronic, and magnetic orders in different condensed matter systems.« less

  3. Classifying Structures in the ISM with Machine Learning Techniques

    NASA Astrophysics Data System (ADS)

    Beaumont, Christopher; Goodman, A. A.; Williams, J. P.

    2011-01-01

    The processes which govern molecular cloud evolution and star formation often sculpt structures in the ISM: filaments, pillars, shells, outflows, etc. Because of their morphological complexity, these objects are often identified manually. Manual classification has several disadvantages; the process is subjective, not easily reproducible, and does not scale well to handle increasingly large datasets. We have explored to what extent machine learning algorithms can be trained to autonomously identify specific morphological features in molecular cloud datasets. We show that the Support Vector Machine algorithm can successfully locate filaments and outflows blended with other emission structures. When the objects of interest are morphologically distinct from the surrounding emission, this autonomous classification achieves >90% accuracy. We have developed a set of IDL-based tools to apply this technique to other datasets.

  4. Biomolecular Dynamics: Order-Disorder Transitions and Energy Landscapes

    PubMed Central

    Whitford, Paul C.; Sanbonmatsu, Karissa Y.; Onuchic, José N.

    2013-01-01

    While the energy landscape theory of protein folding is now a widely accepted view for understanding how relatively-weak molecular interactions lead to rapid and cooperative protein folding, such a framework must be extended to describe the large-scale functional motions observed in molecular machines. In this review, we discuss 1) the development of the energy landscape theory of biomolecular folding, 2) recent advances towards establishing a consistent understanding of folding and function, and 3) emerging themes in the functional motions of enzymes, biomolecular motors, and other biomolecular machines. Recent theoretical, computational, and experimental lines of investigation are providing a very dynamic picture of biomolecular motion. In contrast to earlier ideas, where molecular machines were thought to function similarly to macroscopic machines, with rigid components that move along a few degrees of freedom in a deterministic fashion, biomolecular complexes are only marginally stable. Since the stabilizing contribution of each atomic interaction is on the order of the thermal fluctuations in solution, the rigid body description of molecular function must be revisited. An emerging theme is that functional motions encompass order-disorder transitions and structural flexibility provide significant contributions to the free-energy. In this review, we describe the biological importance of order-disorder transitions and discuss the statistical-mechanical foundation of theoretical approaches that can characterize such transitions. PMID:22790780

  5. Combining Machine Learning Systems and Multiple Docking Simulation Packages to Improve Docking Prediction Reliability for Network Pharmacology

    PubMed Central

    Hsin, Kun-Yi; Ghosh, Samik; Kitano, Hiroaki

    2013-01-01

    Increased availability of bioinformatics resources is creating opportunities for the application of network pharmacology to predict drug effects and toxicity resulting from multi-target interactions. Here we present a high-precision computational prediction approach that combines two elaborately built machine learning systems and multiple molecular docking tools to assess binding potentials of a test compound against proteins involved in a complex molecular network. One of the two machine learning systems is a re-scoring function to evaluate binding modes generated by docking tools. The second is a binding mode selection function to identify the most predictive binding mode. Results from a series of benchmark validations and a case study show that this approach surpasses the prediction reliability of other techniques and that it also identifies either primary or off-targets of kinase inhibitors. Integrating this approach with molecular network maps makes it possible to address drug safety issues by comprehensively investigating network-dependent effects of a drug or drug candidate. PMID:24391846

  6. The Yeast Nuclear Pore Complex

    PubMed Central

    Rout, Michael P.; Aitchison, John D.; Suprapto, Adisetyantari; Hjertaas, Kelly; Zhao, Yingming; Chait, Brian T.

    2000-01-01

    An understanding of how the nuclear pore complex (NPC) mediates nucleocytoplasmic exchange requires a comprehensive inventory of the molecular components of the NPC and a knowledge of how each component contributes to the overall structure of this large molecular translocation machine. Therefore, we have taken a comprehensive approach to classify all components of the yeast NPC (nucleoporins). This involved identifying all the proteins present in a highly enriched NPC fraction, determining which of these proteins were nucleoporins, and localizing each nucleoporin within the NPC. Using these data, we present a map of the molecular architecture of the yeast NPC and provide evidence for a Brownian affinity gating mechanism for nucleocytoplasmic transport. PMID:10684247

  7. Towards better modelling of drug-loading in solid lipid nanoparticles: Molecular dynamics, docking experiments and Gaussian Processes machine learning.

    PubMed

    Hathout, Rania M; Metwally, Abdelkader A

    2016-11-01

    This study represents one of the series applying computer-oriented processes and tools in digging for information, analysing data and finally extracting correlations and meaningful outcomes. In this context, binding energies could be used to model and predict the mass of loaded drugs in solid lipid nanoparticles after molecular docking of literature-gathered drugs using MOE® software package on molecularly simulated tripalmitin matrices using GROMACS®. Consequently, Gaussian processes as a supervised machine learning artificial intelligence technique were used to correlate the drugs' descriptors (e.g. M.W., xLogP, TPSA and fragment complexity) with their molecular docking binding energies. Lower percentage bias was obtained compared to previous studies which allows the accurate estimation of the loaded mass of any drug in the investigated solid lipid nanoparticles by just projecting its chemical structure to its main features (descriptors). Copyright © 2016 Elsevier B.V. All rights reserved.

  8. Understanding the role of dynamics in the iron sulfur cluster molecular machine.

    PubMed

    di Maio, Danilo; Chandramouli, Balasubramanian; Yan, Robert; Brancato, Giuseppe; Pastore, Annalisa

    2017-01-01

    The bacterial proteins IscS, IscU and CyaY, the bacterial orthologue of frataxin, play an essential role in the biological machine that assembles the prosthetic FeS cluster groups on proteins. They form functionally binary and ternary complexes both in vivo and in vitro. Yet, the mechanism by which they work remains unclear. We carried out extensive molecular dynamics simulations to understand the nature of their interactions and the role of dynamics starting from the crystal structure of a IscS-IscU complex and the experimentally-based model of a ternary IscS-IscU-CyaY complex and used nuclear magnetic resonance to experimentally test the interface. We show that, while being firmly anchored to IscS, IscU has a pivotal motion around the interface. Our results also describe how the catalytic loop of IscS can flip conformation to allow FeS cluster assembly. This motion is hampered in the ternary complex explaining its inhibitory properties in cluster formation. We conclude that the observed 'fluid' IscS-IscU interface provides the binary complex with a functional adaptability exploited in partner recognition and unravels the molecular determinants of the reported inhibitory action of CyaY in the IscS-IscU-CyaY complex explained in terms of the hampering effect on specific IscU-IscS movements. Our study provides the first mechanistic basis to explain how the IscS-IscU complex selects its binding partners and supports the inhibitory role of CyaY in the ternary complex. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.

  9. Biomimetic molecular design tools that learn, evolve, and adapt.

    PubMed

    Winkler, David A

    2017-01-01

    A dominant hallmark of living systems is their ability to adapt to changes in the environment by learning and evolving. Nature does this so superbly that intensive research efforts are now attempting to mimic biological processes. Initially this biomimicry involved developing synthetic methods to generate complex bioactive natural products. Recent work is attempting to understand how molecular machines operate so their principles can be copied, and learning how to employ biomimetic evolution and learning methods to solve complex problems in science, medicine and engineering. Automation, robotics, artificial intelligence, and evolutionary algorithms are now converging to generate what might broadly be called in silico-based adaptive evolution of materials. These methods are being applied to organic chemistry to systematize reactions, create synthesis robots to carry out unit operations, and to devise closed loop flow self-optimizing chemical synthesis systems. Most scientific innovations and technologies pass through the well-known "S curve", with slow beginning, an almost exponential growth in capability, and a stable applications period. Adaptive, evolving, machine learning-based molecular design and optimization methods are approaching the period of very rapid growth and their impact is already being described as potentially disruptive. This paper describes new developments in biomimetic adaptive, evolving, learning computational molecular design methods and their potential impacts in chemistry, engineering, and medicine.

  10. Biomimetic molecular design tools that learn, evolve, and adapt

    PubMed Central

    2017-01-01

    A dominant hallmark of living systems is their ability to adapt to changes in the environment by learning and evolving. Nature does this so superbly that intensive research efforts are now attempting to mimic biological processes. Initially this biomimicry involved developing synthetic methods to generate complex bioactive natural products. Recent work is attempting to understand how molecular machines operate so their principles can be copied, and learning how to employ biomimetic evolution and learning methods to solve complex problems in science, medicine and engineering. Automation, robotics, artificial intelligence, and evolutionary algorithms are now converging to generate what might broadly be called in silico-based adaptive evolution of materials. These methods are being applied to organic chemistry to systematize reactions, create synthesis robots to carry out unit operations, and to devise closed loop flow self-optimizing chemical synthesis systems. Most scientific innovations and technologies pass through the well-known “S curve”, with slow beginning, an almost exponential growth in capability, and a stable applications period. Adaptive, evolving, machine learning-based molecular design and optimization methods are approaching the period of very rapid growth and their impact is already being described as potentially disruptive. This paper describes new developments in biomimetic adaptive, evolving, learning computational molecular design methods and their potential impacts in chemistry, engineering, and medicine. PMID:28694872

  11. Imaging patterns predict patient survival and molecular subtype in glioblastoma via machine learning techniques

    PubMed Central

    Macyszyn, Luke; Akbari, Hamed; Pisapia, Jared M.; Da, Xiao; Attiah, Mark; Pigrish, Vadim; Bi, Yingtao; Pal, Sharmistha; Davuluri, Ramana V.; Roccograndi, Laura; Dahmane, Nadia; Martinez-Lage, Maria; Biros, George; Wolf, Ronald L.; Bilello, Michel; O'Rourke, Donald M.; Davatzikos, Christos

    2016-01-01

    Background MRI characteristics of brain gliomas have been used to predict clinical outcome and molecular tumor characteristics. However, previously reported imaging biomarkers have not been sufficiently accurate or reproducible to enter routine clinical practice and often rely on relatively simple MRI measures. The current study leverages advanced image analysis and machine learning algorithms to identify complex and reproducible imaging patterns predictive of overall survival and molecular subtype in glioblastoma (GB). Methods One hundred five patients with GB were first used to extract approximately 60 diverse features from preoperative multiparametric MRIs. These imaging features were used by a machine learning algorithm to derive imaging predictors of patient survival and molecular subtype. Cross-validation ensured generalizability of these predictors to new patients. Subsequently, the predictors were evaluated in a prospective cohort of 29 new patients. Results Survival curves yielded a hazard ratio of 10.64 for predicted long versus short survivors. The overall, 3-way (long/medium/short survival) accuracy in the prospective cohort approached 80%. Classification of patients into the 4 molecular subtypes of GB achieved 76% accuracy. Conclusions By employing machine learning techniques, we were able to demonstrate that imaging patterns are highly predictive of patient survival. Additionally, we found that GB subtypes have distinctive imaging phenotypes. These results reveal that when imaging markers related to infiltration, cell density, microvascularity, and blood–brain barrier compromise are integrated via advanced pattern analysis methods, they form very accurate predictive biomarkers. These predictive markers used solely preoperative images, hence they can significantly augment diagnosis and treatment of GB patients. PMID:26188015

  12. Discovering local order parameters in liquid water using machine learning

    NASA Astrophysics Data System (ADS)

    Soto, Adrian; Lu, Deyu; Yoo, Shinjae; Fernandez-Serra, Marivi

    The local arrangement of water molecules in liquid phase is still being discussed and questioned. The prevailing view is that water is composed of a mixture of two structurally different liquids. One of the main challenges has been to find order parameters that are able to discriminate the complex structures of these distinct molecular environments. Several local order parameters have been proposed and studied in all sorts of atomistic simulations of liquid water but, to date, none has been able to capture the predicted dual character. This presents an ideal problem to treat with methods capable of unveiling information from complex data. In this talk we will discuss how local order parameters can be constructed from molecular dynamics trajectories by using machine learning and other related techniques. Work was partially supported by DOE Award No. DE-FG02-09ER16052, by DOE Early Career Award No. DE-SC0003871, by BNL LDRD 16-039 project and BNL Contract No. DE-SC0012704.

  13. Rotary ATPases

    PubMed Central

    Stewart, Alastair G.; Sobti, Meghna; Harvey, Richard P.; Stock, Daniela

    2013-01-01

    Rotary ATPases are molecular rotary motors involved in biological energy conversion. They either synthesize or hydrolyze the universal biological energy carrier adenosine triphosphate. Recent work has elucidated the general architecture and subunit compositions of all three sub-types of rotary ATPases. Composite models of the intact F-, V- and A-type ATPases have been constructed by fitting high-resolution X-ray structures of individual subunits or sub-complexes into low-resolution electron densities of the intact enzymes derived from electron cryo-microscopy. Electron cryo-tomography has provided new insights into the supra-molecular arrangement of eukaryotic ATP synthases within mitochondria and mass-spectrometry has started to identify specifically bound lipids presumed to be essential for function. Taken together these molecular snapshots show that nano-scale rotary engines have much in common with basic design principles of man made machines from the function of individual “machine elements” to the requirement of the right “fuel” and “oil” for different types of motors. PMID:23369889

  14. A machine learning approach for ranking clusters of docked protein‐protein complexes by pairwise cluster comparison

    PubMed Central

    Pfeiffenberger, Erik; Chaleil, Raphael A.G.; Moal, Iain H.

    2017-01-01

    ABSTRACT Reliable identification of near‐native poses of docked protein–protein complexes is still an unsolved problem. The intrinsic heterogeneity of protein–protein interactions is challenging for traditional biophysical or knowledge based potentials and the identification of many false positive binding sites is not unusual. Often, ranking protocols are based on initial clustering of docked poses followed by the application of an energy function to rank each cluster according to its lowest energy member. Here, we present an approach of cluster ranking based not only on one molecular descriptor (e.g., an energy function) but also employing a large number of descriptors that are integrated in a machine learning model, whereby, an extremely randomized tree classifier based on 109 molecular descriptors is trained. The protocol is based on first locally enriching clusters with additional poses, the clusters are then characterized using features describing the distribution of molecular descriptors within the cluster, which are combined into a pairwise cluster comparison model to discriminate near‐native from incorrect clusters. The results show that our approach is able to identify clusters containing near‐native protein–protein complexes. In addition, we present an analysis of the descriptors with respect to their power to discriminate near native from incorrect clusters and how data transformations and recursive feature elimination can improve the ranking performance. Proteins 2017; 85:528–543. © 2016 Wiley Periodicals, Inc. PMID:27935158

  15. Light-operated machines based on threaded molecular structures.

    PubMed

    Credi, Alberto; Silvi, Serena; Venturi, Margherita

    2014-01-01

    Rotaxanes and related species represent the most common implementation of the concept of artificial molecular machines, because the supramolecular nature of the interactions between the components and their interlocked architecture allow a precise control on the position and movement of the molecular units. The use of light to power artificial molecular machines is particularly valuable because it can play the dual role of "writing" and "reading" the system. Moreover, light-driven machines can operate without accumulation of waste products, and photons are the ideal inputs to enable autonomous operation mechanisms. In appropriately designed molecular machines, light can be used to control not only the stability of the system, which affects the relative position of the molecular components but also the kinetics of the mechanical processes, thereby enabling control on the direction of the movements. This step forward is necessary in order to make a leap from molecular machines to molecular motors.

  16. Physics Meets Biology (LBNL Summer Lecture Series)

    ScienceCinema

    Chu, Steven

    2018-05-09

    Summer Lecture Series 2006: If scientists could take advantage of the awesomely complex and beautiful functioning of biology's natural molecular machines, their potential for application in many disciplines would be incalculable. Nobel Laureate and Director of the Lawrence Berkeley National Laboratory Steve Chu explores Possible solutions to global warming and its consequences.

  17. Molecular Thermodynamics for Cell Biology as Taught with Boxes

    ERIC Educational Resources Information Center

    Mayorga, Luis S.; Lopez, Maria Jose; Becker, Wayne M.

    2012-01-01

    Thermodynamic principles are basic to an understanding of the complex fluxes of energy and information required to keep cells alive. These microscopic machines are nonequilibrium systems at the micron scale that are maintained in pseudo-steady-state conditions by very sophisticated processes. Therefore, several nonstandard concepts need to be…

  18. Imaging patterns predict patient survival and molecular subtype in glioblastoma via machine learning techniques.

    PubMed

    Macyszyn, Luke; Akbari, Hamed; Pisapia, Jared M; Da, Xiao; Attiah, Mark; Pigrish, Vadim; Bi, Yingtao; Pal, Sharmistha; Davuluri, Ramana V; Roccograndi, Laura; Dahmane, Nadia; Martinez-Lage, Maria; Biros, George; Wolf, Ronald L; Bilello, Michel; O'Rourke, Donald M; Davatzikos, Christos

    2016-03-01

    MRI characteristics of brain gliomas have been used to predict clinical outcome and molecular tumor characteristics. However, previously reported imaging biomarkers have not been sufficiently accurate or reproducible to enter routine clinical practice and often rely on relatively simple MRI measures. The current study leverages advanced image analysis and machine learning algorithms to identify complex and reproducible imaging patterns predictive of overall survival and molecular subtype in glioblastoma (GB). One hundred five patients with GB were first used to extract approximately 60 diverse features from preoperative multiparametric MRIs. These imaging features were used by a machine learning algorithm to derive imaging predictors of patient survival and molecular subtype. Cross-validation ensured generalizability of these predictors to new patients. Subsequently, the predictors were evaluated in a prospective cohort of 29 new patients. Survival curves yielded a hazard ratio of 10.64 for predicted long versus short survivors. The overall, 3-way (long/medium/short survival) accuracy in the prospective cohort approached 80%. Classification of patients into the 4 molecular subtypes of GB achieved 76% accuracy. By employing machine learning techniques, we were able to demonstrate that imaging patterns are highly predictive of patient survival. Additionally, we found that GB subtypes have distinctive imaging phenotypes. These results reveal that when imaging markers related to infiltration, cell density, microvascularity, and blood-brain barrier compromise are integrated via advanced pattern analysis methods, they form very accurate predictive biomarkers. These predictive markers used solely preoperative images, hence they can significantly augment diagnosis and treatment of GB patients. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Neuro-Oncology. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  19. Molecular propulsion: chemical sensing and chemotaxis of DNA driven by RNA polymerase.

    PubMed

    Yu, Hua; Jo, Kyubong; Kounovsky, Kristy L; de Pablo, Juan J; Schwartz, David C

    2009-04-29

    Living cells sense extracellular signals and direct their movements in response to stimuli in environment. Such autonomous movement allows these machines to sample chemical change over a distance, leading to chemotaxis. Synthetic catalytic rods have been reported to chemotax toward hydrogen peroxide fuel. Nevertheless individualized autonomous control of movement of a population of biomolecules under physiological conditions has not been demonstrated. Here we show the first experimental evidence that a molecular complex consisting of a DNA template and associating RNA polymerases (RNAPs) displays chemokinetic motion driven by transcription substrates nucleoside triphosphates (NTPs). Furthermore this molecular complex exhibits a biased migration into a concentration gradient of NTPs, resembling chemotaxis. We describe this behavior as "Molecular Propulsion", in which RNAP transcriptional actions deform DNA template conformation engendering measurable enhancement of motility. Our results provide new opportunities for designing and directing nanomachines by imposing external triggers within an experimental system.

  20. Ab initio genotype–phenotype association reveals intrinsic modularity in genetic networks

    PubMed Central

    Slonim, Noam; Elemento, Olivier; Tavazoie, Saeed

    2006-01-01

    Microbial species express an astonishing diversity of phenotypic traits, behaviors, and metabolic capacities. However, our molecular understanding of these phenotypes is based almost entirely on studies in a handful of model organisms that together represent only a small fraction of this phenotypic diversity. Furthermore, many microbial species are not amenable to traditional laboratory analysis because of their exotic lifestyles and/or lack of suitable molecular genetic techniques. As an adjunct to experimental analysis, we have developed a computational information-theoretic framework that produces high-confidence gene–phenotype predictions using cross-species distributions of genes and phenotypes across 202 fully sequenced archaea and eubacteria. In addition to identifying the genetic basis of complex traits, our approach reveals the organization of these genes into generic preferentially co-inherited modules, many of which correspond directly to known enzymatic pathways, molecular complexes, signaling pathways, and molecular machines. PMID:16732191

  1. Operation of micro and molecular machines: a new concept with its origins in interface science.

    PubMed

    Ariga, Katsuhiko; Ishihara, Shinsuke; Izawa, Hironori; Xia, Hong; Hill, Jonathan P

    2011-03-21

    A landmark accomplishment of nanotechnology would be successful fabrication of ultrasmall machines that can work like tweezers, motors, or even computing devices. Now we must consider how operation of micro- and molecular machines might be implemented for a wide range of applications. If these machines function only under limited conditions and/or require specialized apparatus then they are useless for practical applications. Therefore, it is important to carefully consider the access of functionality of the molecular or nanoscale systems by conventional stimuli at the macroscopic level. In this perspective, we will outline the position of micro- and molecular machines in current science and technology. Most of these machines are operated by light irradiation, application of electrical or magnetic fields, chemical reactions, and thermal fluctuations, which cannot always be applied in remote machine operation. We also propose strategies for molecular machine operation using the most conventional of stimuli, that of macroscopic mechanical force, achieved through mechanical operation of molecular machines located at an air-water interface. The crucial roles of the characteristics of an interfacial environment, i.e. connection between macroscopic dimension and nanoscopic function, and contact of media with different dielectric natures, are also described.

  2. Single-Molecule Real-Time 3D Imaging of the Transcription Cycle by Modulation Interferometry.

    PubMed

    Wang, Guanshi; Hauver, Jesse; Thomas, Zachary; Darst, Seth A; Pertsinidis, Alexandros

    2016-12-15

    Many essential cellular processes, such as gene control, employ elaborate mechanisms involving the coordination of large, multi-component molecular assemblies. Few structural biology tools presently have the combined spatial-temporal resolution and molecular specificity required to capture the movement, conformational changes, and subunit association-dissociation kinetics, three fundamental elements of how such intricate molecular machines work. Here, we report a 3D single-molecule super-resolution imaging study using modulation interferometry and phase-sensitive detection that achieves <2 nm axial localization precision, well below the few-nanometer-sized individual protein components. To illustrate the capability of this technique in probing the dynamics of complex macromolecular machines, we visualize the movement of individual multi-subunit E. coli RNA polymerases through the complete transcription cycle, dissect the kinetics of the initiation-elongation transition, and determine the fate of σ 70 initiation factors during promoter escape. Modulation interferometry sets the stage for single-molecule studies of several hitherto difficult-to-investigate multi-molecular transactions that underlie genome regulation. Copyright © 2016 Elsevier Inc. All rights reserved.

  3. A review of supervised machine learning applied to ageing research.

    PubMed

    Fabris, Fabio; Magalhães, João Pedro de; Freitas, Alex A

    2017-04-01

    Broadly speaking, supervised machine learning is the computational task of learning correlations between variables in annotated data (the training set), and using this information to create a predictive model capable of inferring annotations for new data, whose annotations are not known. Ageing is a complex process that affects nearly all animal species. This process can be studied at several levels of abstraction, in different organisms and with different objectives in mind. Not surprisingly, the diversity of the supervised machine learning algorithms applied to answer biological questions reflects the complexities of the underlying ageing processes being studied. Many works using supervised machine learning to study the ageing process have been recently published, so it is timely to review these works, to discuss their main findings and weaknesses. In summary, the main findings of the reviewed papers are: the link between specific types of DNA repair and ageing; ageing-related proteins tend to be highly connected and seem to play a central role in molecular pathways; ageing/longevity is linked with autophagy and apoptosis, nutrient receptor genes, and copper and iron ion transport. Additionally, several biomarkers of ageing were found by machine learning. Despite some interesting machine learning results, we also identified a weakness of current works on this topic: only one of the reviewed papers has corroborated the computational results of machine learning algorithms through wet-lab experiments. In conclusion, supervised machine learning has contributed to advance our knowledge and has provided novel insights on ageing, yet future work should have a greater emphasis in validating the predictions.

  4. AAA+ Machines of Protein Destruction in Mycobacteria.

    PubMed

    Alhuwaider, Adnan Ali H; Dougan, David A

    2017-01-01

    The bacterial cytosol is a complex mixture of macromolecules (proteins, DNA, and RNA), which collectively are responsible for an enormous array of cellular tasks. Proteins are central to most, if not all, of these tasks and as such their maintenance (commonly referred to as protein homeostasis or proteostasis) is vital for cell survival during normal and stressful conditions. The two key aspects of protein homeostasis are, (i) the correct folding and assembly of proteins (coupled with their delivery to the correct cellular location) and (ii) the timely removal of unwanted or damaged proteins from the cell, which are performed by molecular chaperones and proteases, respectively. A major class of proteins that contribute to both of these tasks are the AAA+ (ATPases associated with a variety of cellular activities) protein superfamily. Although much is known about the structure of these machines and how they function in the model Gram-negative bacterium Escherichia coli , we are only just beginning to discover the molecular details of these machines and how they function in mycobacteria. Here we review the different AAA+ machines, that contribute to proteostasis in mycobacteria. Primarily we will focus on the recent advances in the structure and function of AAA+ proteases, the substrates they recognize and the cellular pathways they control. Finally, we will discuss the recent developments related to these machines as novel drug targets.

  5. Synthesis of single-molecule nanocars.

    PubMed

    Vives, Guillaume; Tour, James M

    2009-03-17

    The drive to miniaturize devices has led to a variety of molecular machines inspired by macroscopic counterparts such as molecular motors, switches, shuttles, turnstiles, barrows, elevators, and nanovehicles. Such nanomachines are designed for controlled mechanical motion and the transport of nanocargo. As researchers miniaturize devices, they can consider two complementary approaches: (1) the "top-down" approach, which reduces the size of macroscopic objects to reach an equivalent microscopic entity using photolithography and related techniques and (2) the "bottom-up" approach, which builds functional microscopic or nanoscopic entities from molecular building blocks. The top-down approach, extensively used by the semiconductor industry, is nearing its scaling limits. On the other hand, the bottom-up approach takes advantage of the self-assembly of smaller molecules into larger networks by exploiting typically weak molecular interactions. But self-assembly alone will not permit complex assembly. Using nanomachines, we hope to eventually consider complex, enzyme-like directed assembly. With that ultimate goal, we are currently exploring the control of nanomachines that would provide a basis for the future bottom-up construction of complex systems. This Account describes the synthesis of a class of molecular machines that resemble macroscopic vehicles. We designed these so-called nanocars for study at the single-molecule level by scanning probe microscopy (SPM). The vehicles have a chassis connected to wheel-terminated axles and convert energy inputs such as heat, electric fields, or light into controlled motion on a surface, ultimately leading to transport of nanocargo. At first, we used C(60) fullerenes as wheels, which allowed the demonstration of a directional rolling mechanism of a nanocar on a gold surface by STM. However, because of the low solubility of the fullerene nanocars and the incompatibility of fullerenes with photochemical processes, we developed new p-carborane- and ruthenium-based wheels with greater solubility in organic solvents. Although fullerene wheels must be attached in the final synthetic step, p-carborane- and ruthenium-based wheels do not inhibit organometallic coupling reactions, which allows a more convergent synthesis of molecular machines. We also prepared functional nanotrucks for the transport of atoms and molecules, as well as self-assembling nanocars and nanotrains. Although engineering challenges such as movement over long distance and non-atomically flat surfaces remain, the greatest current research challenge is imaging. The detailed study of nanocars requires complementary single molecule imaging techniques such as STM, AFM, TEM, or single-molecule fluorescence microscopy. Further developments in engineering and synthesis could lead to enzyme-like manipulation and assembly of atoms and small molecules in nonbiological environments.

  6. The Physics and Physical Chemistry of Molecular Machines.

    PubMed

    Astumian, R Dean; Mukherjee, Shayantani; Warshel, Arieh

    2016-06-17

    The concept of a "power stroke"-a free-energy releasing conformational change-appears in almost every textbook that deals with the molecular details of muscle, the flagellar rotor, and many other biomolecular machines. Here, it is shown by using the constraints of microscopic reversibility that the power stroke model is incorrect as an explanation of how chemical energy is used by a molecular machine to do mechanical work. Instead, chemically driven molecular machines operating under thermodynamic constraints imposed by the reactant and product concentrations in the bulk function as information ratchets in which the directionality and stopping torque or stopping force are controlled entirely by the gating of the chemical reaction that provides the fuel for the machine. The gating of the chemical free energy occurs through chemical state dependent conformational changes of the molecular machine that, in turn, are capable of generating directional mechanical motions. In strong contrast to this general conclusion for molecular machines driven by catalysis of a chemical reaction, a power stroke may be (and often is) an essential component for a molecular machine driven by external modulation of pH or redox potential or by light. This difference between optical and chemical driving properties arises from the fundamental symmetry difference between the physics of optical processes, governed by the Bose-Einstein relations, and the constraints of microscopic reversibility for thermally activated processes. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  7. Many-Body Descriptors for Predicting Molecular Properties with Machine Learning: Analysis of Pairwise and Three-Body Interactions in Molecules.

    PubMed

    Pronobis, Wiktor; Tkatchenko, Alexandre; Müller, Klaus-Robert

    2018-06-12

    Machine learning (ML) based prediction of molecular properties across chemical compound space is an important and alternative approach to efficiently estimate the solutions of highly complex many-electron problems in chemistry and physics. Statistical methods represent molecules as descriptors that should encode molecular symmetries and interactions between atoms. Many such descriptors have been proposed; all of them have advantages and limitations. Here, we propose a set of general two-body and three-body interaction descriptors which are invariant to translation, rotation, and atomic indexing. By adapting the successfully used kernel ridge regression methods of machine learning, we evaluate our descriptors on predicting several properties of small organic molecules calculated using density-functional theory. We use two data sets. The GDB-7 set contains 6868 molecules with up to 7 heavy atoms of type CNO. The GDB-9 set is composed of 131722 molecules with up to 9 heavy atoms containing CNO. When trained on 5000 random molecules, our best model achieves an accuracy of 0.8 kcal/mol (on the remaining 1868 molecules of GDB-7) and 1.5 kcal/mol (on the remaining 126722 molecules of GDB-9) respectively. Applying a linear regression model on our novel many-body descriptors performs almost equal to a nonlinear kernelized model. Linear models are readily interpretable: a feature importance ranking measure helps to obtain qualitative and quantitative insights on the importance of two- and three-body molecular interactions for predicting molecular properties computed with quantum-mechanical methods.

  8. Putting Humpty-Dumpty Together: Clustering the Functional Dynamics of Single Biomolecular Machines Such as the Spliceosome.

    PubMed

    Rohlman, C E; Blanco, M R; Walter, N G

    2016-01-01

    The spliceosome is a biomolecular machine that, in all eukaryotes, accomplishes site-specific splicing of introns from precursor messenger RNAs (pre-mRNAs) with high fidelity. Operating at the nanometer scale, where inertia and friction have lost the dominant role they play in the macroscopic realm, the spliceosome is highly dynamic and assembles its active site around each pre-mRNA anew. To understand the structural dynamics underlying the molecular motors, clocks, and ratchets that achieve functional accuracy in the yeast spliceosome (a long-standing model system), we have developed single-molecule fluorescence resonance energy transfer (smFRET) approaches that report changes in intra- and intermolecular interactions in real time. Building on our work using hidden Markov models (HMMs) to extract kinetic and conformational state information from smFRET time trajectories, we recognized that HMM analysis of individual state transitions as independent stochastic events is insufficient for a biomolecular machine as complex as the spliceosome. In this chapter, we elaborate on the recently developed smFRET-based Single-Molecule Cluster Analysis (SiMCAn) that dissects the intricate conformational dynamics of a pre-mRNA through the splicing cycle in a model-free fashion. By leveraging hierarchical clustering techniques developed for Bioinformatics, SiMCAn efficiently analyzes large datasets to first identify common molecular behaviors. Through a second level of clustering based on the abundance of dynamic behaviors exhibited by defined functional intermediates that have been stalled by biochemical or genetic tools, SiMCAn then efficiently assigns pre-mRNA FRET states and transitions to specific splicing complexes, with the potential to find heretofore undescribed conformations. SiMCAn thus arises as a general tool to analyze dynamic cellular machines more broadly. © 2016 Elsevier Inc. All rights reserved.

  9. Specificity of interactions among the DNA-packaging machine components of T4-related bacteriophages.

    PubMed

    Gao, Song; Rao, Venigalla B

    2011-02-04

    Tailed bacteriophages use powerful molecular motors to package the viral genome into a preformed capsid. Packaging at a rate of up to ∼2000 bp/s and generating a power density twice that of an automobile engine, the phage T4 motor is the fastest and most powerful reported to date. Central to DNA packaging are dynamic interactions among the packaging components, capsid (gp23), portal (gp20), motor (gp17, large "terminase"), and regulator (gp16, small terminase), leading to precise orchestration of the packaging process, but the mechanisms are poorly understood. Here we analyzed the interactions between small and large terminases of T4-related phages. Our results show that the gp17 packaging ATPase is maximally stimulated by homologous, but not heterologous, gp16. Multiple interaction sites are identified in both gp16 and gp17. The specificity determinants in gp16 are clustered in the diverged N- and C-terminal domains (regions I-III). Swapping of diverged region(s), such as replacing C-terminal RB49 region III with that of T4, switched ATPase stimulation specificity. Two specificity regions, amino acids 37-52 and 290-315, are identified in or near the gp17-ATPase "transmission" subdomain II. gp16 binding at these sites might cause a conformational change positioning the ATPase-coupling residues into the catalytic pocket, triggering ATP hydrolysis. These results lead to a model in which multiple weak interactions between motor and regulator allow dynamic assembly and disassembly of various packaging complexes, depending on the functional state of the packaging machine. This might be a general mechanism for regulation of the phage packaging machine and other complex molecular machines.

  10. Protein function in precision medicine: deep understanding with machine learning.

    PubMed

    Rost, Burkhard; Radivojac, Predrag; Bromberg, Yana

    2016-08-01

    Precision medicine and personalized health efforts propose leveraging complex molecular, medical and family history, along with other types of personal data toward better life. We argue that this ambitious objective will require advanced and specialized machine learning solutions. Simply skimming some low-hanging results off the data wealth might have limited potential. Instead, we need to better understand all parts of the system to define medically relevant causes and effects: how do particular sequence variants affect particular proteins and pathways? How do these effects, in turn, cause the health or disease-related phenotype? Toward this end, deeper understanding will not simply diffuse from deeper machine learning, but from more explicit focus on understanding protein function, context-specific protein interaction networks, and impact of variation on both. © 2016 Federation of European Biochemical Societies.

  11. Allocating dissipation across a molecular machine cycle to maximize flux

    PubMed Central

    Brown, Aidan I.; Sivak, David A.

    2017-01-01

    Biomolecular machines consume free energy to break symmetry and make directed progress. Nonequilibrium ATP concentrations are the typical free energy source, with one cycle of a molecular machine consuming a certain number of ATP, providing a fixed free energy budget. Since evolution is expected to favor rapid-turnover machines that operate efficiently, we investigate how this free energy budget can be allocated to maximize flux. Unconstrained optimization eliminates intermediate metastable states, indicating that flux is enhanced in molecular machines with fewer states. When maintaining a set number of states, we show that—in contrast to previous findings—the flux-maximizing allocation of dissipation is not even. This result is consistent with the coexistence of both “irreversible” and reversible transitions in molecular machine models that successfully describe experimental data, which suggests that, in evolved machines, different transitions differ significantly in their dissipation. PMID:29073016

  12. Real-time monitoring of enzyme-free strand displacement cascades by colorimetric assays

    NASA Astrophysics Data System (ADS)

    Duan, Ruixue; Wang, Boya; Hong, Fan; Zhang, Tianchi; Jia, Yongmei; Huang, Jiayu; Hakeem, Abdul; Liu, Nannan; Lou, Xiaoding; Xia, Fan

    2015-03-01

    The enzyme-free toehold-mediated strand displacement reaction has shown potential for building programmable DNA circuits, biosensors, molecular machines and chemical reaction networks. Here we report a simple colorimetric method using gold nanoparticles as signal generators for the real-time detection of the product of the strand displacement cascade. During the process the assembled gold nanoparticles can be separated, resulting in a color change of the solution. This assay can also be applied in complex mixtures, fetal bovine serum, and to detect single-base mismatches. These results suggest that this method could be of general utility to monitor more complex enzyme-free strand displacement reaction-based programmable systems or for further low-cost diagnostic applications.The enzyme-free toehold-mediated strand displacement reaction has shown potential for building programmable DNA circuits, biosensors, molecular machines and chemical reaction networks. Here we report a simple colorimetric method using gold nanoparticles as signal generators for the real-time detection of the product of the strand displacement cascade. During the process the assembled gold nanoparticles can be separated, resulting in a color change of the solution. This assay can also be applied in complex mixtures, fetal bovine serum, and to detect single-base mismatches. These results suggest that this method could be of general utility to monitor more complex enzyme-free strand displacement reaction-based programmable systems or for further low-cost diagnostic applications. Electronic supplementary information (ESI) available: Experimental procedures and analytical data are provided. See DOI: 10.1039/c5nr00697j

  13. In vitro molecular machine learning algorithm via symmetric internal loops of DNA.

    PubMed

    Lee, Ji-Hoon; Lee, Seung Hwan; Baek, Christina; Chun, Hyosun; Ryu, Je-Hwan; Kim, Jin-Woo; Deaton, Russell; Zhang, Byoung-Tak

    2017-08-01

    Programmable biomolecules, such as DNA strands, deoxyribozymes, and restriction enzymes, have been used to solve computational problems, construct large-scale logic circuits, and program simple molecular games. Although studies have shown the potential of molecular computing, the capability of computational learning with DNA molecules, i.e., molecular machine learning, has yet to be experimentally verified. Here, we present a novel molecular learning in vitro model in which symmetric internal loops of double-stranded DNA are exploited to measure the differences between training instances, thus enabling the molecules to learn from small errors. The model was evaluated on a data set of twenty dialogue sentences obtained from the television shows Friends and Prison Break. The wet DNA-computing experiments confirmed that the molecular learning machine was able to generalize the dialogue patterns of each show and successfully identify the show from which the sentences originated. The molecular machine learning model described here opens the way for solving machine learning problems in computer science and biology using in vitro molecular computing with the data encoded in DNA molecules. Copyright © 2017. Published by Elsevier B.V.

  14. Microcompartments and protein machines in prokaryotes.

    PubMed

    Saier, Milton H

    2013-01-01

    The prokaryotic cell was once thought of as a 'bag of enzymes' with little or no intracellular compartmentalization. In this view, most reactions essential for life occurred as a consequence of random molecular collisions involving substrates, cofactors and cytoplasmic enzymes. Our current conception of a prokaryote is far from this view. We now consider a bacterium or an archaeon as a highly structured, nonrandom collection of functional membrane-embedded and proteinaceous molecular machines, each of which serves a specialized function. In this article we shall present an overview of such microcompartments including (1) the bacterial cytoskeleton and the apparati allowing DNA segregation during cell division; (2) energy transduction apparati involving light-driven proton pumping and ion gradient-driven ATP synthesis; (3) prokaryotic motility and taxis machines that mediate cell movements in response to gradients of chemicals and physical forces; (4) machines of protein folding, secretion and degradation; (5) metabolosomes carrying out specific chemical reactions; (6) 24-hour clocks allowing bacteria to coordinate their metabolic activities with the daily solar cycle, and (7) proteinaceous membrane compartmentalized structures such as sulfur granules and gas vacuoles. Membrane-bound prokaryotic organelles were considered in a recent Journal of Molecular Microbiology and Biotechnology written symposium concerned with membranous compartmentalization in bacteria [J Mol Microbiol Biotechnol 2013;23:1-192]. By contrast, in this symposium, we focus on proteinaceous microcompartments. These two symposia, taken together, provide the interested reader with an objective view of the remarkable complexity of what was once thought of as a simple noncompartmentalized cell. Copyright © 2013 S. Karger AG, Basel.

  15. An autonomous chemically fuelled small-molecule motor

    NASA Astrophysics Data System (ADS)

    Wilson, Miriam R.; Solà, Jordi; Carlone, Armando; Goldup, Stephen M.; Lebrasseur, Nathalie; Leigh, David A.

    2016-06-01

    Molecular machines are among the most complex of all functional molecules and lie at the heart of nearly every biological process. A number of synthetic small-molecule machines have been developed, including molecular muscles, synthesizers, pumps, walkers, transporters and light-driven and electrically driven rotary motors. However, although biological molecular motors are powered by chemical gradients or the hydrolysis of adenosine triphosphate (ATP), so far there are no synthetic small-molecule motors that can operate autonomously using chemical energy (that is, the components move with net directionality as long as a chemical fuel is present). Here we describe a system in which a small molecular ring (macrocycle) is continuously transported directionally around a cyclic molecular track when powered by irreversible reactions of a chemical fuel, 9-fluorenylmethoxycarbonyl chloride. Key to the design is that the rate of reaction of this fuel with reactive sites on the cyclic track is faster when the macrocycle is far from the reactive site than when it is near to it. We find that a bulky pyridine-based catalyst promotes carbonate-forming reactions that ratchet the displacement of the macrocycle away from the reactive sites on the track. Under reaction conditions where both attachment and cleavage of the 9-fluorenylmethoxycarbonyl groups occur through different processes, and the cleavage reaction occurs at a rate independent of macrocycle location, net directional rotation of the molecular motor continues for as long as unreacted fuel remains. We anticipate that autonomous chemically fuelled molecular motors will find application as engines in molecular nanotechnology.

  16. Nanoscale swimmers: hydrodynamic interactions and propulsion of molecular machines

    NASA Astrophysics Data System (ADS)

    Sakaue, T.; Kapral, R.; Mikhailov, A. S.

    2010-06-01

    Molecular machines execute nearly regular cyclic conformational changes as a result of ligand binding and product release. This cyclic conformational dynamics is generally non-reciprocal so that under time reversal a different sequence of machine conformations is visited. Since such changes occur in a solvent, coupling to solvent hydrodynamic modes will generally result in self-propulsion of the molecular machine. These effects are investigated for a class of coarse grained models of protein machines consisting of a set of beads interacting through pair-wise additive potentials. Hydrodynamic effects are incorporated through a configuration-dependent mobility tensor, and expressions for the propulsion linear and angular velocities, as well as the stall force, are obtained. In the limit where conformational changes are small so that linear response theory is applicable, it is shown that propulsion is exponentially small; thus, propulsion is nonlinear phenomenon. The results are illustrated by computations on a simple model molecular machine.

  17. Towards a molecular logic machine

    NASA Astrophysics Data System (ADS)

    Remacle, F.; Levine, R. D.

    2001-06-01

    Finite state logic machines can be realized by pump-probe spectroscopic experiments on an isolated molecule. The most elaborate setup, a Turing machine, can be programmed to carry out a specific computation. We argue that a molecule can be similarly programmed, and provide examples using two photon spectroscopies. The states of the molecule serve as the possible states of the head of the Turing machine and the physics of the problem determines the possible instructions of the program. The tape is written in an alphabet that allows the listing of the different pump and probe signals that are applied in a given experiment. Different experiments using the same set of molecular levels correspond to different tapes that can be read and processed by the same head and program. The analogy to a Turing machine is not a mechanical one and is not completely molecular because the tape is not part of the molecular machine. We therefore also discuss molecular finite state machines, such as sequential devices, for which the tape is not part of the machine. Nonmolecular tapes allow for quite long input sequences with a rich alphabet (at the level of 7 bits) and laser pulse shaping experiments provide concrete examples. Single molecule spectroscopies show that a single molecule can be repeatedly cycled through a logical operation.

  18. A machine for the preliminary investigation of design features influencing the wear behaviour of knee prostheses.

    PubMed

    McGloughlin, T M; Murphy, D M; Kavanagh, A G

    2004-01-01

    Degradation of tibial inserts in vivo has been found to be multifactorial in nature, resulting in a complex interaction of many variables. A range of kinematic conditions occurs at the tibio-femoral interface, giving rise to various degrees of rolling and sliding at this interface. The movement of the tibio-femoral contact point may be an influential factor in the overall wear of ultra-high molecular weight polyethylene (UHMWPE) tibial components. As part of this study a three-station wear-test machine was designed and built to investigate the influence of rolling and sliding on the wear behaviour of specific design aspects of contemporary knee prostheses. Using the machine, it is possible to monitor the effect of various slide roll ratios on the performance of contemporary bearing designs from a geometrical and materials perspective.

  19. RNA Polymerase Structure, Function, Regulation, Dynamics, Fidelity, and Roles in GENE EXPRESSION | Center for Cancer Research

    Cancer.gov

    Multi-subunit RNA polymerases (RNAP) are ornate molecular machines that translocate on a DNA template as they generate a complementary RNA chain. RNAPs are highly conserved in evolution among eukarya, eubacteria, archaea, and some viruses. As such, multi-subunit RNAPs appear to be an irreplaceable advance in the evolution of complex life on earth. Because of their stepwise

  20. The RNA-induced silencing complex: a versatile gene-silencing machine.

    PubMed

    Pratt, Ashley J; MacRae, Ian J

    2009-07-03

    RNA interference is a powerful mechanism of gene silencing that underlies many aspects of eukaryotic biology. On the molecular level, RNA interference is mediated by a family of ribonucleoprotein complexes called RNA-induced silencing complexes (RISCs), which can be programmed to target virtually any nucleic acid sequence for silencing. The ability of RISC to locate target RNAs has been co-opted by evolution many times to generate a broad spectrum of gene-silencing pathways. Here, we review the fundamental biochemical and biophysical properties of RISC that facilitate gene targeting and describe the various mechanisms of gene silencing known to exploit RISC activity.

  1. Integrated machine learning, molecular docking and 3D-QSAR based approach for identification of potential inhibitors of trypanosomal N-myristoyltransferase.

    PubMed

    Singh, Nidhi; Shah, Priyanka; Dwivedi, Hemlata; Mishra, Shikha; Tripathi, Renu; Sahasrabuddhe, Amogh A; Siddiqi, Mohammad Imran

    2016-11-15

    N-Myristoyltransferase (NMT) catalyzes the transfer of myristate to the amino-terminal glycine of a subset of proteins, a co-translational modification involved in trafficking substrate proteins to membrane locations, stabilization and protein-protein interactions. It is a studied and validated pre-clinical drug target for fungal and parasitic infections. In the present study, a machine learning approach, docking studies and CoMFA analysis have been integrated with the objective of translation of knowledge into a pipelined workflow towards the identification of putative hits through the screening of large compound libraries. In the proposed pipeline, the reported parasitic NMT inhibitors have been used to develop predictive machine learning classification models. Simultaneously, a TbNMT complex model was generated to establish the relationship between the binding mode of the inhibitors for LmNMT and TbNMT through molecular dynamics simulation studies. A 3D-QSAR model was developed and used to predict the activity of the proposed hits in the subsequent step. The hits classified as active based on the machine learning model were assessed as the potential anti-trypanosomal NMT inhibitors through molecular docking studies, predicted activity using a QSAR model and visual inspection. In the final step, the proposed pipeline was validated through in vitro experiments. A total of seven hits have been proposed and tested in vitro for evaluation of dual inhibitory activity against Leishmania donovani and Trypanosoma brucei. Out of these five compounds showed significant inhibition against both of the organisms. The common topmost active compound SEW04173 belongs to a pyrazole carboxylate scaffold and is anticipated to enrich the chemical space with enhanced potency through optimization.

  2. Molecular machines open cell membranes

    NASA Astrophysics Data System (ADS)

    García-López, Víctor; Chen, Fang; Nilewski, Lizanne G.; Duret, Guillaume; Aliyan, Amir; Kolomeisky, Anatoly B.; Robinson, Jacob T.; Wang, Gufeng; Pal, Robert; Tour, James M.

    2017-08-01

    Beyond the more common chemical delivery strategies, several physical techniques are used to open the lipid bilayers of cellular membranes. These include using electric and magnetic fields, temperature, ultrasound or light to introduce compounds into cells, to release molecular species from cells or to selectively induce programmed cell death (apoptosis) or uncontrolled cell death (necrosis). More recently, molecular motors and switches that can change their conformation in a controlled manner in response to external stimuli have been used to produce mechanical actions on tissue for biomedical applications. Here we show that molecular machines can drill through cellular bilayers using their molecular-scale actuation, specifically nanomechanical action. Upon physical adsorption of the molecular motors onto lipid bilayers and subsequent activation of the motors using ultraviolet light, holes are drilled in the cell membranes. We designed molecular motors and complementary experimental protocols that use nanomechanical action to induce the diffusion of chemical species out of synthetic vesicles, to enhance the diffusion of traceable molecular machines into and within live cells, to induce necrosis and to introduce chemical species into live cells. We also show that, by using molecular machines that bear short peptide addends, nanomechanical action can selectively target specific cell-surface recognition sites. Beyond the in vitro applications demonstrated here, we expect that molecular machines could also be used in vivo, especially as their design progresses to allow two-photon, near-infrared and radio-frequency activation.

  3. Molecular machines open cell membranes.

    PubMed

    García-López, Víctor; Chen, Fang; Nilewski, Lizanne G; Duret, Guillaume; Aliyan, Amir; Kolomeisky, Anatoly B; Robinson, Jacob T; Wang, Gufeng; Pal, Robert; Tour, James M

    2017-08-30

    Beyond the more common chemical delivery strategies, several physical techniques are used to open the lipid bilayers of cellular membranes. These include using electric and magnetic fields, temperature, ultrasound or light to introduce compounds into cells, to release molecular species from cells or to selectively induce programmed cell death (apoptosis) or uncontrolled cell death (necrosis). More recently, molecular motors and switches that can change their conformation in a controlled manner in response to external stimuli have been used to produce mechanical actions on tissue for biomedical applications. Here we show that molecular machines can drill through cellular bilayers using their molecular-scale actuation, specifically nanomechanical action. Upon physical adsorption of the molecular motors onto lipid bilayers and subsequent activation of the motors using ultraviolet light, holes are drilled in the cell membranes. We designed molecular motors and complementary experimental protocols that use nanomechanical action to induce the diffusion of chemical species out of synthetic vesicles, to enhance the diffusion of traceable molecular machines into and within live cells, to induce necrosis and to introduce chemical species into live cells. We also show that, by using molecular machines that bear short peptide addends, nanomechanical action can selectively target specific cell-surface recognition sites. Beyond the in vitro applications demonstrated here, we expect that molecular machines could also be used in vivo, especially as their design progresses to allow two-photon, near-infrared and radio-frequency activation.

  4. Irrelevance of the Power Stroke for the Directionality, Stopping Force, and Optimal Efficiency of Chemically Driven Molecular Machines

    PubMed Central

    Astumian, R. Dean

    2015-01-01

    A simple model for a chemically driven molecular walker shows that the elastic energy stored by the molecule and released during the conformational change known as the power-stroke (i.e., the free-energy difference between the pre- and post-power-stroke states) is irrelevant for determining the directionality, stopping force, and efficiency of the motor. Further, the apportionment of the dependence on the externally applied force between the forward and reverse rate constants of the power-stroke (or indeed among all rate constants) is irrelevant for determining the directionality, stopping force, and efficiency of the motor. Arguments based on the principle of microscopic reversibility demonstrate that this result is general for all chemically driven molecular machines, and even more broadly that the relative energies of the states of the motor have no role in determining the directionality, stopping force, or optimal efficiency of the machine. Instead, the directionality, stopping force, and optimal efficiency are determined solely by the relative heights of the energy barriers between the states. Molecular recognition—the ability of a molecular machine to discriminate between substrate and product depending on the state of the machine—is far more important for determining the intrinsic directionality and thermodynamics of chemo-mechanical coupling than are the details of the internal mechanical conformational motions of the machine. In contrast to the conclusions for chemical driving, a power-stroke is very important for the directionality and efficiency of light-driven molecular machines and for molecular machines driven by external modulation of thermodynamic parameters. PMID:25606678

  5. Molecular switch-like regulation in motor proteins.

    PubMed

    Tafoya, Sara; Bustamante, Carlos

    2018-06-19

    Motor proteins are powered by nucleotide hydrolysis and exert mechanical work to carry out many fundamental biological tasks. To ensure their correct and efficient performance, the motors' activities are allosterically regulated by additional factors that enhance or suppress their NTPase activity. Here, we review two highly conserved mechanisms of ATP hydrolysis activation and repression operating in motor proteins-the glutamate switch and the arginine finger-and their associated regulatory factors. We examine the implications of these regulatory mechanisms in proteins that are formed by multiple ATPase subunits. We argue that the regulatory mechanisms employed by motor proteins display features similar to those described in small GTPases, which require external regulatory elements, such as dissociation inhibitors, exchange factors and activating proteins, to switch the protein's function 'on' and 'off'. Likewise, similar regulatory roles are taken on by the motor's substrate, additional binding factors, and even adjacent subunits in multimeric complexes. However, in motor proteins, more than one regulatory factor and the two mechanisms described here often underlie the machine's operation. Furthermore, ATPase regulation takes place throughout the motor's cycle, which enables a more complex function than the binary 'active' and 'inactive' states.This article is part of a discussion meeting issue 'Allostery and molecular machines'. © 2018 The Author(s).

  6. Mitochondrial AAA proteases--towards a molecular understanding of membrane-bound proteolytic machines.

    PubMed

    Gerdes, Florian; Tatsuta, Takashi; Langer, Thomas

    2012-01-01

    Mitochondrial AAA proteases play an important role in the maintenance of mitochondrial proteostasis. They regulate and promote biogenesis of mitochondrial proteins by acting as processing enzymes and ensuring the selective turnover of misfolded proteins. Impairment of AAA proteases causes pleiotropic defects in various organisms including neurodegeneration in humans. AAA proteases comprise ring-like hexameric complexes in the mitochondrial inner membrane and are functionally conserved from yeast to man, but variations are evident in the subunit composition of orthologous enzymes. Recent structural and biochemical studies revealed how AAA proteases degrade their substrates in an ATP dependent manner. Intersubunit coordination of the ATP hydrolysis leads to an ordered ATP hydrolysis within the AAA ring, which ensures efficient substrate dislocation from the membrane and translocation to the proteolytic chamber. In this review, we summarize recent findings on the molecular mechanisms underlying the versatile functions of mitochondrial AAA proteases and their relevance to those of the other AAA+ machines. Copyright © 2011 Elsevier B.V. All rights reserved.

  7. High performance computing in biology: multimillion atom simulations of nanoscale systems

    PubMed Central

    Sanbonmatsu, K. Y.; Tung, C.-S.

    2007-01-01

    Computational methods have been used in biology for sequence analysis (bioinformatics), all-atom simulation (molecular dynamics and quantum calculations), and more recently for modeling biological networks (systems biology). Of these three techniques, all-atom simulation is currently the most computationally demanding, in terms of compute load, communication speed, and memory load. Breakthroughs in electrostatic force calculation and dynamic load balancing have enabled molecular dynamics simulations of large biomolecular complexes. Here, we report simulation results for the ribosome, using approximately 2.64 million atoms, the largest all-atom biomolecular simulation published to date. Several other nanoscale systems with different numbers of atoms were studied to measure the performance of the NAMD molecular dynamics simulation program on the Los Alamos National Laboratory Q Machine. We demonstrate that multimillion atom systems represent a 'sweet spot' for the NAMD code on large supercomputers. NAMD displays an unprecedented 85% parallel scaling efficiency for the ribosome system on 1024 CPUs. We also review recent targeted molecular dynamics simulations of the ribosome that prove useful for studying conformational changes of this large biomolecular complex in atomic detail. PMID:17187988

  8. Engineering molecular machines

    NASA Astrophysics Data System (ADS)

    Erman, Burak

    2016-04-01

    Biological molecular motors use chemical energy, mostly in the form of ATP hydrolysis, and convert it to mechanical energy. Correlated thermal fluctuations are essential for the function of a molecular machine and it is the hydrolysis of ATP that modifies the correlated fluctuations of the system. Correlations are consequences of the molecular architecture of the protein. The idea that synthetic molecular machines may be constructed by designing the proper molecular architecture is challenging. In their paper, Sarkar et al (2016 New J. Phys. 18 043006) propose a synthetic molecular motor based on the coarse grained elastic network model of proteins and show by numerical simulations that motor function is realized, ranging from deterministic to thermal, depending on temperature. This work opens up a new range of possibilities of molecular architecture based engine design.

  9. Equation-free and variable free modeling for complex/multiscale systems. Coarse-grained computation in science and engineering using fine-grained models

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kevrekidis, Ioannis G.

    The work explored the linking of modern developing machine learning techniques (manifold learning and in particular diffusion maps) with traditional PDE modeling/discretization/scientific computation techniques via the equation-free methodology developed by the PI. The result (in addition to several PhD degrees, two of them by CSGF Fellows) was a sequence of strong developments - in part on the algorithmic side, linking data mining with scientific computing, and in part on applications, ranging from PDE discretizations to molecular dynamics and complex network dynamics.

  10. Purification of Arp2/3 complex from Saccharomyces cerevisiae

    PubMed Central

    Doolittle, Lynda K.; Rosen, Michael K.; Padrick, Shae B.

    2014-01-01

    Summary Much of cellular control over actin dynamics comes through regulation of actin filament initiation. At the molecular level, this is accomplished through a collection of cellular protein machines, called actin nucleation factors, which position actin monomers to initiate a new actin filament. The Arp2/3 complex is a principal actin nucleation factor used throughout the eukaryotic family tree. The budding yeast Saccharomyces cerevisiae has proven to be not only an excellent genetic platform for the study of the Arp2/3 complex, but also an excellent source for the purification of endogenous Arp2/3 complex. Here we describe a protocol for the preparation of endogenous Arp2/3 complex from wild type Saccharomyces cerevisiae. This protocol produces material suitable for biochemical study, and yields milligram quantities of purified Arp2/3 complex. PMID:23868593

  11. Microcompartments and Protein Machines in Prokaryotes

    PubMed Central

    Saier, Milton H.

    2013-01-01

    The prokaryotic cell was once thought of as a “bag of enzymes” with little or no intracellular compartmentalization. In this view, most reactions essential for life occurred as a consequence of random molecular collisions involving substrates, cofactors and cytoplasmic enzymes. Our current conception of a prokaryote is far from this view. We now consider a bacterium or an archaeon as a highly structured, non-random collection of functional membrane-embedded and proteinaceous molecular machines, each of which serves a specialized function. In this article we shall present an overview of such microcompartments including (i) the bacterial cytoskeleton and the apparati allowing DNA segregation during cells division, (ii) energy transduction apparati involving light-driven proton pumping and ion gradient-driven ATP synthesis, (iii) prokaryotic motility and taxis machines that mediate cell movements in response to gradients of chemicals and physical forces, (iv) machines of protein folding, secretion and degradation, (v) metabolasomes carrying out specific chemical reactions, (vi) 24 hour clocks allowing bacteria to coordinate their metabolic activities with the daily solar cycle and (vii) proteinaceous membrane compartmentalized structures such as sulfur granules and gas vacuoles. Membrane-bounded prokaryotic organelles were considered in a recent JMMB written symposium concerned with membraneous compartmentalization in bacteria [Saier and Bogdanov, 2013]. By contrast, in this symposium, we focus on proteinaceous microcompartments. These two symposia, taken together, provide the interested reader with an objective view of the remarkable complexity of what was once thought of as a simple non-compartmentalized cell. PMID:23920489

  12. Machine Learning Based Dimensionality Reduction Facilitates Ligand Diffusion Paths Assessment: A Case of Cytochrome P450cam.

    PubMed

    Rydzewski, J; Nowak, W

    2016-04-12

    In this work we propose an application of a nonlinear dimensionality reduction method to represent the high-dimensional configuration space of the ligand-protein dissociation process in a manner facilitating interpretation. Rugged ligand expulsion paths are mapped into 2-dimensional space. The mapping retains the main structural changes occurring during the dissociation. The topological similarity of the reduced paths may be easily studied using the Fréchet distances, and we show that this measure facilitates machine learning classification of the diffusion pathways. Further, low-dimensional configuration space allows for identification of residues active in transport during the ligand diffusion from a protein. The utility of this approach is illustrated by examination of the configuration space of cytochrome P450cam involved in expulsing camphor by means of enhanced all-atom molecular dynamics simulations. The expulsion trajectories are sampled and constructed on-the-fly during molecular dynamics simulations using the recently developed memetic algorithms [ Rydzewski, J.; Nowak, W. J. Chem. Phys. 2015 , 143 ( 12 ), 124101 ]. We show that the memetic algorithms are effective for enforcing the ligand diffusion and cavity exploration in the P450cam-camphor complex. Furthermore, we demonstrate that machine learning techniques are helpful in inspecting ligand diffusion landscapes and provide useful tools to examine structural changes accompanying rare events.

  13. Ribosomal Translocation: One Step Closer to the Molecular Mechanism

    PubMed Central

    Shoji, Shinichiro; Walker, Sarah E.; Fredrick, Kurt

    2010-01-01

    Protein synthesis occurs in ribosomes, the targets of numerous antibiotics. How these large and complex machines read and move along mRNA have proven to be challenging questions. In this Review, we focus on translocation, the last step of the elongation cycle in which movement of tRNA and mRNA is catalyzed by elongation factor G. Translocation entails large-scale movements of the tRNAs and conformational changes in the ribosome that require numerous tertiary contacts to be disrupted and reformed. We highlight recent progress toward elucidating the molecular basis of translocation and how various antibiotics influence tRNA–mRNA movement. PMID:19173642

  14. How molecular motors work – insights from the molecular machinist's toolbox: the Nobel prize in Chemistry 2016

    PubMed Central

    Astumian, R. D.

    2017-01-01

    The Nobel prize in Chemistry for 2016 was awarded to Jean Pierre Sauvage, Sir James Fraser Stoddart, and Bernard (Ben) Feringa for their contributions to the design and synthesis of molecular machines. While this field is still in its infancy, and at present there are no commercial applications, many observers have stressed the tremendous potential of molecular machines to revolutionize technology. However, perhaps the most important result so far accruing from the synthesis of molecular machines is the insight provided into the fundamental mechanisms by which molecular motors, including biological motors such as kinesin, myosin, FoF1 ATPase, and the flagellar motor, function. The ability to “tinker” with separate components of molecular motors allows asking, and answering, specific questions about mechanism, particularly with regard to light driven vs. chemistry driven molecular motors. PMID:28572896

  15. Single molecule detection, thermal fluctuation and life

    PubMed Central

    YANAGIDA, Toshio; ISHII, Yoshiharu

    2017-01-01

    Single molecule detection has contributed to our understanding of the unique mechanisms of life. Unlike artificial man-made machines, biological molecular machines integrate thermal noises rather than avoid them. For example, single molecule detection has demonstrated that myosin motors undergo biased Brownian motion for stepwise movement and that single protein molecules spontaneously change their conformation, for switching to interactions with other proteins, in response to thermal fluctuation. Thus, molecular machines have flexibility and efficiency not seen in artificial machines. PMID:28190869

  16. A machine learning approach to predicting protein-ligand binding affinity with applications to molecular docking.

    PubMed

    Ballester, Pedro J; Mitchell, John B O

    2010-05-01

    Accurately predicting the binding affinities of large sets of diverse protein-ligand complexes is an extremely challenging task. The scoring functions that attempt such computational prediction are essential for analysing the outputs of molecular docking, which in turn is an important technique for drug discovery, chemical biology and structural biology. Each scoring function assumes a predetermined theory-inspired functional form for the relationship between the variables that characterize the complex, which also include parameters fitted to experimental or simulation data and its predicted binding affinity. The inherent problem of this rigid approach is that it leads to poor predictivity for those complexes that do not conform to the modelling assumptions. Moreover, resampling strategies, such as cross-validation or bootstrapping, are still not systematically used to guard against the overfitting of calibration data in parameter estimation for scoring functions. We propose a novel scoring function (RF-Score) that circumvents the need for problematic modelling assumptions via non-parametric machine learning. In particular, Random Forest was used to implicitly capture binding effects that are hard to model explicitly. RF-Score is compared with the state of the art on the demanding PDBbind benchmark. Results show that RF-Score is a very competitive scoring function. Importantly, RF-Score's performance was shown to improve dramatically with training set size and hence the future availability of more high-quality structural and interaction data is expected to lead to improved versions of RF-Score. pedro.ballester@ebi.ac.uk; jbom@st-andrews.ac.uk Supplementary data are available at Bioinformatics online.

  17. Machine learning molecular dynamics for the simulation of infrared spectra.

    PubMed

    Gastegger, Michael; Behler, Jörg; Marquetand, Philipp

    2017-10-01

    Machine learning has emerged as an invaluable tool in many research areas. In the present work, we harness this power to predict highly accurate molecular infrared spectra with unprecedented computational efficiency. To account for vibrational anharmonic and dynamical effects - typically neglected by conventional quantum chemistry approaches - we base our machine learning strategy on ab initio molecular dynamics simulations. While these simulations are usually extremely time consuming even for small molecules, we overcome these limitations by leveraging the power of a variety of machine learning techniques, not only accelerating simulations by several orders of magnitude, but also greatly extending the size of systems that can be treated. To this end, we develop a molecular dipole moment model based on environment dependent neural network charges and combine it with the neural network potential approach of Behler and Parrinello. Contrary to the prevalent big data philosophy, we are able to obtain very accurate machine learning models for the prediction of infrared spectra based on only a few hundreds of electronic structure reference points. This is made possible through the use of molecular forces during neural network potential training and the introduction of a fully automated sampling scheme. We demonstrate the power of our machine learning approach by applying it to model the infrared spectra of a methanol molecule, n -alkanes containing up to 200 atoms and the protonated alanine tripeptide, which at the same time represents the first application of machine learning techniques to simulate the dynamics of a peptide. In all of these case studies we find an excellent agreement between the infrared spectra predicted via machine learning models and the respective theoretical and experimental spectra.

  18. Molecular Dynamics Modeling and Simulation of Diamond Cutting of Cerium.

    PubMed

    Zhang, Junjie; Zheng, Haibing; Shuai, Maobing; Li, Yao; Yang, Yang; Sun, Tao

    2017-12-01

    The coupling between structural phase transformations and dislocations induces challenges in understanding the deformation behavior of metallic cerium at the nanoscale. In the present work, we elucidate the underlying mechanism of cerium under ultra-precision diamond cutting by means of molecular dynamics modeling and simulations. The molecular dynamics model of diamond cutting of cerium is established by assigning empirical potentials to describe atomic interactions and evaluating properties of two face-centered cubic cerium phases. Subsequent molecular dynamics simulations reveal that dislocation slip dominates the plastic deformation of cerium under the cutting process. In addition, the analysis based on atomic radial distribution functions demonstrates that there are trivial phase transformations from the γ-Ce to the δ-Ce occurred in both machined surface and formed chip. Following investigations on machining parameter dependence reveal the optimal machining conditions for achieving high quality of machined surface of cerium.

  19. Molecular Dynamics Modeling and Simulation of Diamond Cutting of Cerium

    NASA Astrophysics Data System (ADS)

    Zhang, Junjie; Zheng, Haibing; Shuai, Maobing; Li, Yao; Yang, Yang; Sun, Tao

    2017-07-01

    The coupling between structural phase transformations and dislocations induces challenges in understanding the deformation behavior of metallic cerium at the nanoscale. In the present work, we elucidate the underlying mechanism of cerium under ultra-precision diamond cutting by means of molecular dynamics modeling and simulations. The molecular dynamics model of diamond cutting of cerium is established by assigning empirical potentials to describe atomic interactions and evaluating properties of two face-centered cubic cerium phases. Subsequent molecular dynamics simulations reveal that dislocation slip dominates the plastic deformation of cerium under the cutting process. In addition, the analysis based on atomic radial distribution functions demonstrates that there are trivial phase transformations from the γ-Ce to the δ-Ce occurred in both machined surface and formed chip. Following investigations on machining parameter dependence reveal the optimal machining conditions for achieving high quality of machined surface of cerium.

  20. Symmetry-Adapted Machine Learning for Tensorial Properties of Atomistic Systems.

    PubMed

    Grisafi, Andrea; Wilkins, David M; Csányi, Gábor; Ceriotti, Michele

    2018-01-19

    Statistical learning methods show great promise in providing an accurate prediction of materials and molecular properties, while minimizing the need for computationally demanding electronic structure calculations. The accuracy and transferability of these models are increased significantly by encoding into the learning procedure the fundamental symmetries of rotational and permutational invariance of scalar properties. However, the prediction of tensorial properties requires that the model respects the appropriate geometric transformations, rather than invariance, when the reference frame is rotated. We introduce a formalism that extends existing schemes and makes it possible to perform machine learning of tensorial properties of arbitrary rank, and for general molecular geometries. To demonstrate it, we derive a tensor kernel adapted to rotational symmetry, which is the natural generalization of the smooth overlap of atomic positions kernel commonly used for the prediction of scalar properties at the atomic scale. The performance and generality of the approach is demonstrated by learning the instantaneous response to an external electric field of water oligomers of increasing complexity, from the isolated molecule to the condensed phase.

  1. Symmetry-Adapted Machine Learning for Tensorial Properties of Atomistic Systems

    NASA Astrophysics Data System (ADS)

    Grisafi, Andrea; Wilkins, David M.; Csányi, Gábor; Ceriotti, Michele

    2018-01-01

    Statistical learning methods show great promise in providing an accurate prediction of materials and molecular properties, while minimizing the need for computationally demanding electronic structure calculations. The accuracy and transferability of these models are increased significantly by encoding into the learning procedure the fundamental symmetries of rotational and permutational invariance of scalar properties. However, the prediction of tensorial properties requires that the model respects the appropriate geometric transformations, rather than invariance, when the reference frame is rotated. We introduce a formalism that extends existing schemes and makes it possible to perform machine learning of tensorial properties of arbitrary rank, and for general molecular geometries. To demonstrate it, we derive a tensor kernel adapted to rotational symmetry, which is the natural generalization of the smooth overlap of atomic positions kernel commonly used for the prediction of scalar properties at the atomic scale. The performance and generality of the approach is demonstrated by learning the instantaneous response to an external electric field of water oligomers of increasing complexity, from the isolated molecule to the condensed phase.

  2. Crystalline molecular machines: Encoding supramolecular dynamics into molecular structure

    PubMed Central

    Garcia-Garibay, Miguel A.

    2005-01-01

    Crystalline molecular machines represent an exciting new branch of crystal engineering and materials science with important implications to nanotechnology. Crystalline molecular machines are crystals built with molecules that are structurally programmed to respond collectively to mechanic, electric, magnetic, or photonic stimuli to fulfill specific functions. One of the main challenges in their construction derives from the picometric precision required for their mechanic operation within the close-packed, self-assembled environment of crystalline solids. In this article, we outline some of the general guidelines for their design and apply them for the construction of molecular crystals with units intended to emulate macroscopic gyroscopes and compasses. Recent advances in the preparation, crystallization, and dynamic characterization of these interesting systems offer a foothold to the possibilities and help highlight some avenues for future experimentation. PMID:16046543

  3. Single-molecule studies of multi-protein machines

    NASA Astrophysics Data System (ADS)

    van Oijen, Antoine

    2010-03-01

    Advances in optical imaging and molecular manipulation techniques have made it possible to observe individual enzymes and record molecular movies that provide new insight into their dynamics and reaction mechanisms. In a biological context, most of these enzymes function in concert with other enzymes in multi-protein complexes, so an important future direction will be the utilization of single-molecule techniques to unravel the orchestration of large macromolecular assemblies. Our group is developing the single-molecule tools that will make it possible to study biochemical pathways of arbitrary complexity at the single-molecule level. I will discuss results of single-molecule experiments on the replisome, the molecular machinery that is responsible for replication of DNA. We stretch individual DNA molecules and use their elastic properties to obtain dynamic information on the proteins that unwind the double helix and copy its genetic information. Furthermore, we visualize fluorescently labeled components of the replisome and thus obtain information on stochiometry and exchange kinetics. This simultaneous observation of catalytic activity and composition allows us to gain deeper insight into the structure-function relationship of the replisome.

  4. Molecular-Sized DNA or RNA Sequencing Machine | NCI Technology Transfer Center | TTC

    Cancer.gov

    The National Cancer Institute's Gene Regulation and Chromosome Biology Laboratory is seeking statements of capability or interest from parties interested in collaborative research to co-develop a molecular-sized DNA or RNA sequencing machine.

  5. Advances in molecular dynamics simulation of ultra-precision machining of hard and brittle materials

    NASA Astrophysics Data System (ADS)

    Guo, Xiaoguang; Li, Qiang; Liu, Tao; Kang, Renke; Jin, Zhuji; Guo, Dongming

    2017-03-01

    Hard and brittle materials, such as silicon, SiC, and optical glasses, are widely used in aerospace, military, integrated circuit, and other fields because of their excellent physical and chemical properties. However, these materials display poor machinability because of their hard and brittle properties. Damages such as surface micro-crack and subsurface damage often occur during machining of hard and brittle materials. Ultra-precision machining is widely used in processing hard and brittle materials to obtain nanoscale machining quality. However, the theoretical mechanism underlying this method remains unclear. This paper provides a review of present research on the molecular dynamics simulation of ultra-precision machining of hard and brittle materials. The future trends in this field are also discussed.

  6. Determining protein complex connectivity using a probabilistic deletion network derived from quantitative proteomics.

    PubMed

    Sardiu, Mihaela E; Gilmore, Joshua M; Carrozza, Michael J; Li, Bing; Workman, Jerry L; Florens, Laurence; Washburn, Michael P

    2009-10-06

    Protein complexes are key molecular machines executing a variety of essential cellular processes. Despite the availability of genome-wide protein-protein interaction studies, determining the connectivity between proteins within a complex remains a major challenge. Here we demonstrate a method that is able to predict the relationship of proteins within a stable protein complex. We employed a combination of computational approaches and a systematic collection of quantitative proteomics data from wild-type and deletion strain purifications to build a quantitative deletion-interaction network map and subsequently convert the resulting data into an interdependency-interaction model of a complex. We applied this approach to a data set generated from components of the Saccharomyces cerevisiae Rpd3 histone deacetylase complexes, which consists of two distinct small and large complexes that are held together by a module consisting of Rpd3, Sin3 and Ume1. The resulting representation reveals new protein-protein interactions and new submodule relationships, providing novel information for mapping the functional organization of a complex.

  7. Dynamic combinatorial libraries: from exploring molecular recognition to systems chemistry.

    PubMed

    Li, Jianwei; Nowak, Piotr; Otto, Sijbren

    2013-06-26

    Dynamic combinatorial chemistry (DCC) is a subset of combinatorial chemistry where the library members interconvert continuously by exchanging building blocks with each other. Dynamic combinatorial libraries (DCLs) are powerful tools for discovering the unexpected and have given rise to many fascinating molecules, ranging from interlocked structures to self-replicators. Furthermore, dynamic combinatorial molecular networks can produce emergent properties at systems level, which provide exciting new opportunities in systems chemistry. In this perspective we will highlight some new methodologies in this field and analyze selected examples of DCLs that are under thermodynamic control, leading to synthetic receptors, catalytic systems, and complex self-assembled supramolecular architectures. Also reviewed are extensions of the principles of DCC to systems that are not at equilibrium and may therefore harbor richer functional behavior. Examples include self-replication and molecular machines.

  8. Mechanisms of nuclear pore complex assembly - two different ways of building one molecular machine.

    PubMed

    Otsuka, Shotaro; Ellenberg, Jan

    2018-02-01

    The nuclear pore complex (NPC) mediates all macromolecular transport across the nuclear envelope. In higher eukaryotes that have an open mitosis, NPCs assemble at two points in the cell cycle: during nuclear assembly in late mitosis and during nuclear growth in interphase. How the NPC, the largest nonpolymeric protein complex in eukaryotic cells, self-assembles inside cells remained unclear. Recent studies have started to uncover the assembly process, and evidence has been accumulating that postmitotic and interphase NPC assembly use fundamentally different mechanisms; the duration, structural intermediates, and regulation by molecular players are different and different types of membrane deformation are involved. In this Review, we summarize the current understanding of these two modes of NPC assembly and discuss the structural and regulatory steps that might drive the assembly processes. We furthermore integrate understanding of NPC assembly with the mechanisms for rapid nuclear growth in embryos and, finally, speculate on the evolutionary origin of the NPC implied by the presence of two distinct assembly mechanisms. © 2017 The Authors. FEBS Letters published by John Wiley & Sons Ltd on behalf of Federation of European Biochemical Societies.

  9. Harnessing Reversible Electronic Energy Transfer: From Molecular Dyads to Molecular Machines.

    PubMed

    Denisov, Sergey A; Yu, Shinlin; Pozzo, Jean-Luc; Jonusauskas, Gediminas; McClenaghan, Nathan D

    2016-06-17

    Reversible electronic energy transfer (REET) may be instilled in bi-/multichromophoric molecule-based systems, following photoexcitation, upon judicious structural integration of matched chromophores. This leads to a new set of photophysical properties for the ensemble, which can be fully characterized by steady-state and time-resolved spectroscopic methods. Herein, we take a comprehensive look at progress in the development of this type of supermolecule in the last five years, which has seen systems evolve from covalently tethered dyads to synthetic molecular machines, exemplified by two different pseudorotaxanes. Indeed, REET holds promise in the control of movement in molecular machines, their assembly/disassembly, as well as in charge separation. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  10. CNC Machining Of The Complex Copper Electrodes

    NASA Astrophysics Data System (ADS)

    Popan, Ioan Alexandru; Balc, Nicolae; Popan, Alina

    2015-07-01

    This paper presents the machining process of the complex copper electrodes. Machining of the complex shapes in copper is difficult because this material is soft and sticky. This research presents the main steps for processing those copper electrodes at a high dimensional accuracy and a good surface quality. Special tooling solutions are required for this machining process and optimal process parameters have been found for the accurate CNC equipment, using smart CAD/CAM software.

  11. Size-independent neural networks based first-principles method for accurate prediction of heat of formation of fuels

    NASA Astrophysics Data System (ADS)

    Yang, GuanYa; Wu, Jiang; Chen, ShuGuang; Zhou, WeiJun; Sun, Jian; Chen, GuanHua

    2018-06-01

    Neural network-based first-principles method for predicting heat of formation (HOF) was previously demonstrated to be able to achieve chemical accuracy in a broad spectrum of target molecules [L. H. Hu et al., J. Chem. Phys. 119, 11501 (2003)]. However, its accuracy deteriorates with the increase in molecular size. A closer inspection reveals a systematic correlation between the prediction error and the molecular size, which appears correctable by further statistical analysis, calling for a more sophisticated machine learning algorithm. Despite the apparent difference between simple and complex molecules, all the essential physical information is already present in a carefully selected set of small molecule representatives. A model that can capture the fundamental physics would be able to predict large and complex molecules from information extracted only from a small molecules database. To this end, a size-independent, multi-step multi-variable linear regression-neural network-B3LYP method is developed in this work, which successfully improves the overall prediction accuracy by training with smaller molecules only. And in particular, the calculation errors for larger molecules are drastically reduced to the same magnitudes as those of the smaller molecules. Specifically, the method is based on a 164-molecule database that consists of molecules made of hydrogen and carbon elements. 4 molecular descriptors were selected to encode molecule's characteristics, among which raw HOF calculated from B3LYP and the molecular size are also included. Upon the size-independent machine learning correction, the mean absolute deviation (MAD) of the B3LYP/6-311+G(3df,2p)-calculated HOF is reduced from 16.58 to 1.43 kcal/mol and from 17.33 to 1.69 kcal/mol for the training and testing sets (small molecules), respectively. Furthermore, the MAD of the testing set (large molecules) is reduced from 28.75 to 1.67 kcal/mol.

  12. Chromatin Computation

    PubMed Central

    Bryant, Barbara

    2012-01-01

    In living cells, DNA is packaged along with protein and RNA into chromatin. Chemical modifications to nucleotides and histone proteins are added, removed and recognized by multi-functional molecular complexes. Here I define a new computational model, in which chromatin modifications are information units that can be written onto a one-dimensional string of nucleosomes, analogous to the symbols written onto cells of a Turing machine tape, and chromatin-modifying complexes are modeled as read-write rules that operate on a finite set of adjacent nucleosomes. I illustrate the use of this “chromatin computer” to solve an instance of the Hamiltonian path problem. I prove that chromatin computers are computationally universal – and therefore more powerful than the logic circuits often used to model transcription factor control of gene expression. Features of biological chromatin provide a rich instruction set for efficient computation of nontrivial algorithms in biological time scales. Modeling chromatin as a computer shifts how we think about chromatin function, suggests new approaches to medical intervention, and lays the groundwork for the engineering of a new class of biological computing machines. PMID:22567109

  13. Protein complexes, big data, machine learning and integrative proteomics: lessons learned over a decade of systematic analysis of protein interaction networks.

    PubMed

    Havugimana, Pierre C; Hu, Pingzhao; Emili, Andrew

    2017-10-01

    Elucidation of the networks of physical (functional) interactions present in cells and tissues is fundamental for understanding the molecular organization of biological systems, the mechanistic basis of essential and disease-related processes, and for functional annotation of previously uncharacterized proteins (via guilt-by-association or -correlation). After a decade in the field, we felt it timely to document our own experiences in the systematic analysis of protein interaction networks. Areas covered: Researchers worldwide have contributed innovative experimental and computational approaches that have driven the rapidly evolving field of 'functional proteomics'. These include mass spectrometry-based methods to characterize macromolecular complexes on a global-scale and sophisticated data analysis tools - most notably machine learning - that allow for the generation of high-quality protein association maps. Expert commentary: Here, we recount some key lessons learned, with an emphasis on successful workflows, and challenges, arising from our own and other groups' ongoing efforts to generate, interpret and report proteome-scale interaction networks in increasingly diverse biological contexts.

  14. Chemical sensors from the cooperative actuation of multistep electrochemical molecular machines of polypyrrole: potentiostatic study. Trying to replicate muscle’s fatigue signals

    NASA Astrophysics Data System (ADS)

    Beaumont, Samuel; Otero, Toribio F.

    2018-07-01

    Polypyrrole film electrodes are constituted by multielectronic electrochemical molecular machines (every polymeric molecule) counterions and water, mimicking the intracellular matrix of muscular cells. The influence of the electrolyte concentration on the reversible oxidation/reduction of polypyrrole films was studied in NaCl aqueous solutions by consecutive square potential waves. The consumed redox charge and the consumed electrical energy change as a function of the concentration. That means that the extension (the consumed charge) of the reaction involving conformational, or allosteric, movements of the reacting polymeric chains (molecular machines) responds to (senses) the chemical energy of the reaction ambient. A theoretical description of the attained empirical results is presented getting the sensing equations and the concomitant sensitivities. Those results could indicate the origin and nature of the neural signals sent to the brain from biological haptic muscles working by cooperative actuation of the actin-myosin molecular machines driven by chemical reactions and sensing, simultaneously, the fatigue state of the muscle.

  15. Real-time monitoring of enzyme-free strand displacement cascades by colorimetric assays.

    PubMed

    Duan, Ruixue; Wang, Boya; Hong, Fan; Zhang, Tianchi; Jia, Yongmei; Huang, Jiayu; Hakeem, Abdul; Liu, Nannan; Lou, Xiaoding; Xia, Fan

    2015-03-19

    The enzyme-free toehold-mediated strand displacement reaction has shown potential for building programmable DNA circuits, biosensors, molecular machines and chemical reaction networks. Here we report a simple colorimetric method using gold nanoparticles as signal generators for the real-time detection of the product of the strand displacement cascade. During the process the assembled gold nanoparticles can be separated, resulting in a color change of the solution. This assay can also be applied in complex mixtures, fetal bovine serum, and to detect single-base mismatches. These results suggest that this method could be of general utility to monitor more complex enzyme-free strand displacement reaction-based programmable systems or for further low-cost diagnostic applications.

  16. Derivative Free Optimization of Complex Systems with the Use of Statistical Machine Learning Models

    DTIC Science & Technology

    2015-09-12

    AFRL-AFOSR-VA-TR-2015-0278 DERIVATIVE FREE OPTIMIZATION OF COMPLEX SYSTEMS WITH THE USE OF STATISTICAL MACHINE LEARNING MODELS Katya Scheinberg...COMPLEX SYSTEMS WITH THE USE OF STATISTICAL MACHINE LEARNING MODELS 5a.  CONTRACT NUMBER 5b.  GRANT NUMBER FA9550-11-1-0239 5c.  PROGRAM ELEMENT...developed, which has been the focus of our research. 15. SUBJECT TERMS optimization, Derivative-Free Optimization, Statistical Machine Learning 16. SECURITY

  17. Just working with the cellular machine: A high school game for teaching molecular biology.

    PubMed

    Cardoso, Fernanda Serpa; Dumpel, Renata; da Silva, Luisa B Gomes; Rodrigues, Carlos R; Santos, Dilvani O; Cabral, Lucio Mendes; Castro, Helena C

    2008-03-01

    Molecular biology is a difficult comprehension subject due to its high complexity, thus requiring new teaching approaches. Herein, we developed an interdisciplinary board game involving the human immune system response against a bacterial infection for teaching molecular biology at high school. Initially, we created a database with several questions and a game story that invites the students for helping the human immunological system to produce antibodies (IgG) and fight back a pathogenic bacterium second-time invasion. The game involves answering questions completing the game board in which the antibodies "are synthesized" through the molecular biology process. At the end, a problem-based learning approach is used, and a last question is raised about proteins. Biology teachers and high school students evaluated the game and considered it an easy and interesting tool for teaching the theme. An increase of about 5-30% in answering molecular biology questions revealed that the game improves learning and induced a more engaged and proactive learning profile in the high school students. Copyright © 2008 International Union of Biochemistry and Molecular Biology, Inc.

  18. Interlocking Mechanism between Molecular Gears Attached to Surfaces.

    PubMed

    Zhao, Rundong; Zhao, Yan-Ling; Qi, Fei; Hermann, Klaus E; Zhang, Rui-Qin; Van Hove, Michel A

    2018-03-27

    While molecular machines play an increasingly significant role in nanoscience research and applications, there remains a shortage of investigations and understanding of the molecular gear (cogwheel), which is an indispensable and fundamental component to drive a larger correlated molecular machine system. Employing ab initio calculations, we investigate model systems consisting of molecules adsorbed on metal or graphene surfaces, ranging from very simple triple-arm gears such as PF 3 and NH 3 to larger multiarm gears based on carbon rings. We explore in detail the transmission of slow rotational motion from one gear to the next by these relatively simple molecules, so as to isolate and reveal the mechanisms of the relevant intermolecular interactions. Several characteristics of molecular gears are discussed, in particular the flexibility of the arms and the slipping and skipping between interlocking arms of adjacent gears, which differ from familiar macroscopic rigid gears. The underlying theoretical concepts suggest strongly that other analogous structures may also exhibit similar behavior which may inspire future exploration in designing large correlated molecular machines.

  19. Predicting Protein-protein Association Rates using Coarse-grained Simulation and Machine Learning

    NASA Astrophysics Data System (ADS)

    Xie, Zhong-Ru; Chen, Jiawen; Wu, Yinghao

    2017-04-01

    Protein-protein interactions dominate all major biological processes in living cells. We have developed a new Monte Carlo-based simulation algorithm to study the kinetic process of protein association. We tested our method on a previously used large benchmark set of 49 protein complexes. The predicted rate was overestimated in the benchmark test compared to the experimental results for a group of protein complexes. We hypothesized that this resulted from molecular flexibility at the interface regions of the interacting proteins. After applying a machine learning algorithm with input variables that accounted for both the conformational flexibility and the energetic factor of binding, we successfully identified most of the protein complexes with overestimated association rates and improved our final prediction by using a cross-validation test. This method was then applied to a new independent test set and resulted in a similar prediction accuracy to that obtained using the training set. It has been thought that diffusion-limited protein association is dominated by long-range interactions. Our results provide strong evidence that the conformational flexibility also plays an important role in regulating protein association. Our studies provide new insights into the mechanism of protein association and offer a computationally efficient tool for predicting its rate.

  20. Predicting Protein–protein Association Rates using Coarse-grained Simulation and Machine Learning

    PubMed Central

    Xie, Zhong-Ru; Chen, Jiawen; Wu, Yinghao

    2017-01-01

    Protein–protein interactions dominate all major biological processes in living cells. We have developed a new Monte Carlo-based simulation algorithm to study the kinetic process of protein association. We tested our method on a previously used large benchmark set of 49 protein complexes. The predicted rate was overestimated in the benchmark test compared to the experimental results for a group of protein complexes. We hypothesized that this resulted from molecular flexibility at the interface regions of the interacting proteins. After applying a machine learning algorithm with input variables that accounted for both the conformational flexibility and the energetic factor of binding, we successfully identified most of the protein complexes with overestimated association rates and improved our final prediction by using a cross-validation test. This method was then applied to a new independent test set and resulted in a similar prediction accuracy to that obtained using the training set. It has been thought that diffusion-limited protein association is dominated by long-range interactions. Our results provide strong evidence that the conformational flexibility also plays an important role in regulating protein association. Our studies provide new insights into the mechanism of protein association and offer a computationally efficient tool for predicting its rate. PMID:28418043

  1. Predicting Protein-protein Association Rates using Coarse-grained Simulation and Machine Learning.

    PubMed

    Xie, Zhong-Ru; Chen, Jiawen; Wu, Yinghao

    2017-04-18

    Protein-protein interactions dominate all major biological processes in living cells. We have developed a new Monte Carlo-based simulation algorithm to study the kinetic process of protein association. We tested our method on a previously used large benchmark set of 49 protein complexes. The predicted rate was overestimated in the benchmark test compared to the experimental results for a group of protein complexes. We hypothesized that this resulted from molecular flexibility at the interface regions of the interacting proteins. After applying a machine learning algorithm with input variables that accounted for both the conformational flexibility and the energetic factor of binding, we successfully identified most of the protein complexes with overestimated association rates and improved our final prediction by using a cross-validation test. This method was then applied to a new independent test set and resulted in a similar prediction accuracy to that obtained using the training set. It has been thought that diffusion-limited protein association is dominated by long-range interactions. Our results provide strong evidence that the conformational flexibility also plays an important role in regulating protein association. Our studies provide new insights into the mechanism of protein association and offer a computationally efficient tool for predicting its rate.

  2. Representability of algebraic topology for biomolecules in machine learning based scoring and virtual screening

    PubMed Central

    Mu, Lin

    2018-01-01

    This work introduces a number of algebraic topology approaches, including multi-component persistent homology, multi-level persistent homology, and electrostatic persistence for the representation, characterization, and description of small molecules and biomolecular complexes. In contrast to the conventional persistent homology, multi-component persistent homology retains critical chemical and biological information during the topological simplification of biomolecular geometric complexity. Multi-level persistent homology enables a tailored topological description of inter- and/or intra-molecular interactions of interest. Electrostatic persistence incorporates partial charge information into topological invariants. These topological methods are paired with Wasserstein distance to characterize similarities between molecules and are further integrated with a variety of machine learning algorithms, including k-nearest neighbors, ensemble of trees, and deep convolutional neural networks, to manifest their descriptive and predictive powers for protein-ligand binding analysis and virtual screening of small molecules. Extensive numerical experiments involving 4,414 protein-ligand complexes from the PDBBind database and 128,374 ligand-target and decoy-target pairs in the DUD database are performed to test respectively the scoring power and the discriminatory power of the proposed topological learning strategies. It is demonstrated that the present topological learning outperforms other existing methods in protein-ligand binding affinity prediction and ligand-decoy discrimination. PMID:29309403

  3. 70% efficiency of bistate molecular machines explained by information theory, high dimensional geometry and evolutionary convergence.

    PubMed

    Schneider, Thomas D

    2010-10-01

    The relationship between information and energy is key to understanding biological systems. We can display the information in DNA sequences specifically bound by proteins by using sequence logos, and we can measure the corresponding binding energy. These can be compared by noting that one of the forms of the second law of thermodynamics defines the minimum energy dissipation required to gain one bit of information. Under the isothermal conditions that molecular machines function this is [Formula in text] joules per bit (kB is Boltzmann's constant and T is the absolute temperature). Then an efficiency of binding can be computed by dividing the information in a logo by the free energy of binding after it has been converted to bits. The isothermal efficiencies of not only genetic control systems, but also visual pigments are near 70%. From information and coding theory, the theoretical efficiency limit for bistate molecular machines is ln 2=0.6931. Evolutionary convergence to maximum efficiency is limited by the constraint that molecular states must be distinct from each other. The result indicates that natural molecular machines operate close to their information processing maximum (the channel capacity), and implies that nanotechnology can attain this goal.

  4. 70% efficiency of bistate molecular machines explained by information theory, high dimensional geometry and evolutionary convergence

    PubMed Central

    Schneider, Thomas D.

    2010-01-01

    The relationship between information and energy is key to understanding biological systems. We can display the information in DNA sequences specifically bound by proteins by using sequence logos, and we can measure the corresponding binding energy. These can be compared by noting that one of the forms of the second law of thermodynamics defines the minimum energy dissipation required to gain one bit of information. Under the isothermal conditions that molecular machines function this is joules per bit ( is Boltzmann's constant and T is the absolute temperature). Then an efficiency of binding can be computed by dividing the information in a logo by the free energy of binding after it has been converted to bits. The isothermal efficiencies of not only genetic control systems, but also visual pigments are near 70%. From information and coding theory, the theoretical efficiency limit for bistate molecular machines is ln 2 = 0.6931. Evolutionary convergence to maximum efficiency is limited by the constraint that molecular states must be distinct from each other. The result indicates that natural molecular machines operate close to their information processing maximum (the channel capacity), and implies that nanotechnology can attain this goal. PMID:20562221

  5. Mechanoregulation of molecular motors in flagella

    NASA Astrophysics Data System (ADS)

    Gadelha, Hermes

    2014-11-01

    Molecular motors are nano-biological machines responsible for exerting forces that drive movement in living organisms, from cargo transport to cell division and motility. Interestingly, despite the inherent complexity of many interacting motors, order and structure may arise naturally, as exemplified by the harmonic, self-organized undulatory motion of the flagellum. The real mechanisms behind this collective spontaneous oscillation are still unknown, and it is challenging task to measure experimentally the molecular motor dynamics within the flagellar structure in real time. In this talk we will explore different competing hypotheses that are capable of generating flagellar bending waves that ``resemble'' in-vitro observations, emphasizing the need for further mathematical analysis and model validation. It also highlight that this is a fertile and challenging area of inter-disciplinary research for applied mathematicians and demonstrates the importance of future observational and theoretical studies in understanding the underlying mechanics of these motile cell appendages.

  6. PyBioMed: a python library for various molecular representations of chemicals, proteins and DNAs and their interactions.

    PubMed

    Dong, Jie; Yao, Zhi-Jiang; Zhang, Lin; Luo, Feijun; Lin, Qinlu; Lu, Ai-Ping; Chen, Alex F; Cao, Dong-Sheng

    2018-03-20

    With the increasing development of biotechnology and informatics technology, publicly available data in chemistry and biology are undergoing explosive growth. Such wealthy information in these data needs to be extracted and transformed to useful knowledge by various data mining methods. Considering the amazing rate at which data are accumulated in chemistry and biology fields, new tools that process and interpret large and complex interaction data are increasingly important. So far, there are no suitable toolkits that can effectively link the chemical and biological space in view of molecular representation. To further explore these complex data, an integrated toolkit for various molecular representation is urgently needed which could be easily integrated with data mining algorithms to start a full data analysis pipeline. Herein, the python library PyBioMed is presented, which comprises functionalities for online download for various molecular objects by providing different IDs, the pretreatment of molecular structures, the computation of various molecular descriptors for chemicals, proteins, DNAs and their interactions. PyBioMed is a feature-rich and highly customized python library used for the characterization of various complex chemical and biological molecules and interaction samples. The current version of PyBioMed could calculate 775 chemical descriptors and 19 kinds of chemical fingerprints, 9920 protein descriptors based on protein sequences, more than 6000 DNA descriptors from nucleotide sequences, and interaction descriptors from pairwise samples using three different combining strategies. Several examples and five real-life applications were provided to clearly guide the users how to use PyBioMed as an integral part of data analysis projects. By using PyBioMed, users are able to start a full pipelining from getting molecular data, pretreating molecules, molecular representation to constructing machine learning models conveniently. PyBioMed provides various user-friendly and highly customized APIs to calculate various features of biological molecules and complex interaction samples conveniently, which aims at building integrated analysis pipelines from data acquisition, data checking, and descriptor calculation to modeling. PyBioMed is freely available at http://projects.scbdd.com/pybiomed.html .

  7. Diverse Effects, Complex Causes: Children Use Information about Machines' Functional Diversity to Infer Internal Complexity

    ERIC Educational Resources Information Center

    Ahl, Richard E.; Keil, Frank C.

    2017-01-01

    Four studies explored the abilities of 80 adults and 180 children (4-9 years), from predominantly middle-class families in the Northeastern United States, to use information about machines' observable functional capacities to infer their internal, "hidden" mechanistic complexity. Children as young as 4 and 5 years old used machines'…

  8. Metal-centred azaphosphatriptycene gear with a photo- and thermally driven mechanical switching function based on coordination isomerism.

    PubMed

    Ube, Hitoshi; Yasuda, Yoshihiro; Sato, Hiroyasu; Shionoya, Mitsuhiko

    2017-02-08

    Metal ions can serve as a centre of molecular motions due to their coordination geometry, reversible bonding nature and external stimuli responsiveness. Such essential features of metal ions have been utilized for metal-mediated molecular machines with the ability to motion switch via metallation/demetallation or coordination number variation at the metal centre; however, motion switching based on the change in coordination geometry remain largely unexplored. Herein, we report a Pt II -centred molecular gear that demonstrates control of rotor engagement and disengagement based on photo- and thermally driven cis-trans isomerization at the Pt II centre. This molecular rotary motion transmitter has been constructed from two coordinating azaphosphatriptycene rotators and one Pt II ion as a stator. Isomerization between an engaged cis-form and a disengaged trans-form is reversibly driven by ultraviolet irradiation and heating. Such a photo- and thermally triggered motional interconversion between engaged/disengaged states on a metal ion would provide a selector switch for more complex interlocking systems.

  9. Research on the EDM Technology for Micro-holes at Complex Spatial Locations

    NASA Astrophysics Data System (ADS)

    Y Liu, J.; Guo, J. M.; Sun, D. J.; Cai, Y. H.; Ding, L. T.; Jiang, H.

    2017-12-01

    For the demands on machining micro-holes at complex spatial location, several key technical problems are conquered such as micro-Electron Discharge Machining (micro-EDM) power supply system’s development, the host structure’s design and machining process technical. Through developing low-voltage power supply circuit, high-voltage circuit, micro and precision machining circuit and clearance detection system, the narrow pulse and high frequency six-axis EDM machining power supply system is developed to meet the demands on micro-hole discharging machining. With the method of combining the CAD structure design, CAE simulation analysis, modal test, ODS (Operational Deflection Shapes) test and theoretical analysis, the host construction and key axes of the machine tool are optimized to meet the position demands of the micro-holes. Through developing the special deionized water filtration system to make sure that the machining process is stable enough. To verify the machining equipment and processing technical developed in this paper through developing the micro-hole’s processing flow and test on the real machine tool. As shown in the final test results: the efficient micro-EDM machining pulse power supply system, machine tool host system, deionized filtration system and processing method developed in this paper meet the demands on machining micro-holes at complex spatial locations.

  10. The bacterial segrosome: a dynamic nucleoprotein machine for DNA trafficking and segregation.

    PubMed

    Hayes, Finbarr; Barillà, Daniela

    2006-02-01

    The genomes of unicellular and multicellular organisms must be partitioned equitably in coordination with cytokinesis to ensure faithful transmission of duplicated genetic material to daughter cells. Bacteria use sophisticated molecular mechanisms to guarantee accurate segregation of both plasmids and chromosomes at cell division. Plasmid segregation is most commonly mediated by a Walker-type ATPase and one of many DNA-binding proteins that assemble on a cis-acting centromere to form a nucleoprotein complex (the segrosome) that mediates intracellular plasmid transport. Bacterial chromosome segregation involves a multipartite strategy in which several discrete protein complexes potentially participate. Shedding light on the basis of genome segregation in bacteria could indicate new strategies aimed at combating pathogenic and antibiotic-resistant bacteria.

  11. Structural dynamics of the MecA-ClpC complex: a type II AAA+ protein unfolding machine.

    PubMed

    Liu, Jing; Mei, Ziqing; Li, Ningning; Qi, Yutao; Xu, Yanji; Shi, Yigong; Wang, Feng; Lei, Jianlin; Gao, Ning

    2013-06-14

    The MecA-ClpC complex is a bacterial type II AAA(+) molecular machine responsible for regulated unfolding of substrates, such as transcription factors ComK and ComS, and targeting them to ClpP for degradation. The six subunits of the MecA-ClpC complex form a closed barrel-like structure, featured with three stacked rings and a hollow passage, where substrates are threaded and translocated through successive pores. Although the general concepts of how polypeptides are unfolded and translocated by internal pore loops of AAA(+) proteins have long been conceived, the detailed mechanistic model remains elusive. With cryoelectron microscopy, we captured four different structures of the MecA-ClpC complexes. These complexes differ in the nucleotide binding states of the two AAA(+) rings and therefore might presumably reflect distinctive, representative snapshots from a dynamic unfolding cycle of this hexameric complex. Structural analysis reveals that nucleotide binding and hydrolysis modulate the hexameric complex in a number of ways, including the opening of the N-terminal ring, the axial and radial positions of pore loops, the compactness of the C-terminal ring, as well as the relative rotation between the two nucleotide-binding domain rings. More importantly, our structural and biochemical data indicate there is an active allosteric communication between the two AAA(+) rings and suggest that concerted actions of the two AAA(+) rings are required for the efficiency of the substrate unfolding and translocation. These findings provide important mechanistic insights into the dynamic cycle of the MecA-ClpC unfoldase and especially lay a foundation toward the complete understanding of the structural dynamics of the general type II AAA(+) hexamers.

  12. Hydrogen exchange mass spectrometry of bacteriorhodopsin reveals light-induced changes in the structural dynamics of a biomolecular machine.

    PubMed

    Pan, Yan; Brown, Leonid; Konermann, Lars

    2011-12-21

    Many proteins act as molecular machines that are fuelled by a nonthermal energy source. Examples include transmembrane pumps and stator-rotor complexes. These systems undergo cyclic motions (CMs) that are being driven along a well-defined conformational trajectory. Superimposed on these CMs are thermal fluctuations (TFs) that are coupled to stochastic motions of the solvent. Here we explore whether the TFs of a molecular machine are affected by the occurrence of CMs. Bacteriorhodopsin (BR) is a light-driven proton pump that serves as a model system in this study. The function of BR is based on a photocycle that involves trans/cis isomerization of a retinal chromophore, as well as motions of transmembrane helices. Hydrogen/deuterium exchange (HDX) mass spectrometry was used to monitor the TFs of BR, focusing on the monomeric form of the protein. Comparative HDX studies were conducted under illumination and in the dark. The HDX kinetics of BR are dramatically accelerated in the presence of light. The isotope exchange rates and the number of backbone amides involved in EX2 opening transitions increase roughly 2-fold upon illumination. In contrast, light/dark control experiments on retinal-free protein produced no discernible differences. It can be concluded that the extent of TFs in BR strongly depends on photon-driven CMs. The light-induced differences in HDX behavior are ascribed to protein destabilization. Specifically, the thermodynamic stability of the dark-adapted protein is estimated to be 5.5 kJ mol(-1) under the conditions of our work. This value represents the free energy difference between the folded state F and a significantly unfolded conformer U. Illumination reduces the stability of F by 2.2 kJ mol(-1). Mechanical agitation caused by isomerization of the chromophore is transferred to the surrounding protein scaffold, and subsequently, the energy dissipates into the solvent. Light-induced retinal motions therefore act analogously to an internal heat source that promotes the occurrence of TFs. Overall, our data highlight the potential of HDX methods for probing the structural dynamics of molecular machines under "engine on" and "engine off" conditions. © 2011 American Chemical Society

  13. Energy landscapes for machine learning

    NASA Astrophysics Data System (ADS)

    Ballard, Andrew J.; Das, Ritankar; Martiniani, Stefano; Mehta, Dhagash; Sagun, Levent; Stevenson, Jacob D.; Wales, David J.

    Machine learning techniques are being increasingly used as flexible non-linear fitting and prediction tools in the physical sciences. Fitting functions that exhibit multiple solutions as local minima can be analysed in terms of the corresponding machine learning landscape. Methods to explore and visualise molecular potential energy landscapes can be applied to these machine learning landscapes to gain new insight into the solution space involved in training and the nature of the corresponding predictions. In particular, we can define quantities analogous to molecular structure, thermodynamics, and kinetics, and relate these emergent properties to the structure of the underlying landscape. This Perspective aims to describe these analogies with examples from recent applications, and suggest avenues for new interdisciplinary research.

  14. Photonics walking up a human hair

    NASA Astrophysics Data System (ADS)

    Zeng, Hao; Parmeggiani, Camilla; Martella, Daniele; Wasylczyk, Piotr; Burresi, Matteo; Wiersma, Diederik S.

    2016-03-01

    While animals have access to sugars as energy source, this option is generally not available to artificial machines and robots. Energy delivery is thus the bottleneck for creating independent robots and machines, especially on micro- and nano- meter length scales. We have found a way to produce polymeric nano-structures with local control over the molecular alignment, which allowed us to solve the above issue. By using a combination of polymers, of which part is optically sensitive, we can create complex functional structures with nanometer accuracy, responsive to light. In particular, this allowed us to realize a structure that can move autonomously over surfaces (it can "walk") using the environmental light as its energy source. The robot is only 60 μm in total length, thereby smaller than any known terrestrial walking species, and it is capable of random, directional walking and rotating on different dry surfaces.

  15. Quantum-chemical insights from deep tensor neural networks

    PubMed Central

    Schütt, Kristof T.; Arbabzadah, Farhad; Chmiela, Stefan; Müller, Klaus R.; Tkatchenko, Alexandre

    2017-01-01

    Learning from data has led to paradigm shifts in a multitude of disciplines, including web, text and image search, speech recognition, as well as bioinformatics. Can machine learning enable similar breakthroughs in understanding quantum many-body systems? Here we develop an efficient deep learning approach that enables spatially and chemically resolved insights into quantum-mechanical observables of molecular systems. We unify concepts from many-body Hamiltonians with purpose-designed deep tensor neural networks, which leads to size-extensive and uniformly accurate (1 kcal mol−1) predictions in compositional and configurational chemical space for molecules of intermediate size. As an example of chemical relevance, the model reveals a classification of aromatic rings with respect to their stability. Further applications of our model for predicting atomic energies and local chemical potentials in molecules, reliable isomer energies, and molecules with peculiar electronic structure demonstrate the potential of machine learning for revealing insights into complex quantum-chemical systems. PMID:28067221

  16. Quantum-chemical insights from deep tensor neural networks.

    PubMed

    Schütt, Kristof T; Arbabzadah, Farhad; Chmiela, Stefan; Müller, Klaus R; Tkatchenko, Alexandre

    2017-01-09

    Learning from data has led to paradigm shifts in a multitude of disciplines, including web, text and image search, speech recognition, as well as bioinformatics. Can machine learning enable similar breakthroughs in understanding quantum many-body systems? Here we develop an efficient deep learning approach that enables spatially and chemically resolved insights into quantum-mechanical observables of molecular systems. We unify concepts from many-body Hamiltonians with purpose-designed deep tensor neural networks, which leads to size-extensive and uniformly accurate (1 kcal mol -1 ) predictions in compositional and configurational chemical space for molecules of intermediate size. As an example of chemical relevance, the model reveals a classification of aromatic rings with respect to their stability. Further applications of our model for predicting atomic energies and local chemical potentials in molecules, reliable isomer energies, and molecules with peculiar electronic structure demonstrate the potential of machine learning for revealing insights into complex quantum-chemical systems.

  17. Quantum-chemical insights from deep tensor neural networks

    NASA Astrophysics Data System (ADS)

    Schütt, Kristof T.; Arbabzadah, Farhad; Chmiela, Stefan; Müller, Klaus R.; Tkatchenko, Alexandre

    2017-01-01

    Learning from data has led to paradigm shifts in a multitude of disciplines, including web, text and image search, speech recognition, as well as bioinformatics. Can machine learning enable similar breakthroughs in understanding quantum many-body systems? Here we develop an efficient deep learning approach that enables spatially and chemically resolved insights into quantum-mechanical observables of molecular systems. We unify concepts from many-body Hamiltonians with purpose-designed deep tensor neural networks, which leads to size-extensive and uniformly accurate (1 kcal mol-1) predictions in compositional and configurational chemical space for molecules of intermediate size. As an example of chemical relevance, the model reveals a classification of aromatic rings with respect to their stability. Further applications of our model for predicting atomic energies and local chemical potentials in molecules, reliable isomer energies, and molecules with peculiar electronic structure demonstrate the potential of machine learning for revealing insights into complex quantum-chemical systems.

  18. Machine-learned analysis of quantitative sensory testing responses to noxious cold stimulation in healthy subjects.

    PubMed

    Weyer-Menkhoff, I; Thrun, M C; Lötsch, J

    2018-05-01

    Pain in response to noxious cold has a complex molecular background probably involving several types of sensors. A recent observation has been the multimodal distribution of human cold pain thresholds. This study aimed at analysing reproducibility and stability of this observation and further exploration of data patterns supporting a complex background. Pain thresholds to noxious cold stimuli (range 32-0 °C, tonic: temperature decrease -1 °C/s, phasic: temperature decrease -8 °C/s) were acquired in 148 healthy volunteers. The probability density distribution was analysed using machine-learning derived methods implemented as Gaussian mixture modeling (GMM), emergent self-organizing maps and self-organizing swarms of data agents. The probability density function of pain responses was trimodal (mean thresholds at 25.9, 18.4 and 8.0 °C for tonic and 24.5, 18.1 and 7.5 °C for phasic stimuli). Subjects' association with Gaussian modes was consistent between both types of stimuli (weighted Cohen's κ = 0.91). Patterns emerging in self-organizing neuronal maps and swarms could be associated with different trends towards decreasing cold pain sensitivity in different Gaussian modes. On self-organizing maps, the third Gaussian mode emerged as particularly distinct. Thresholds at, roughly, 25 and 18 °C agree with known working temperatures of TRPM8 and TRPA1 ion channels, respectively, and hint at relative local dominance of either channel in respective subjects. Data patterns suggest involvement of further distinct mechanisms in cold pain perception at lower temperatures. Findings support data science approaches to identify biologically plausible hints at complex molecular mechanisms underlying human pain phenotypes. Sensitivity to pain is heterogeneous. Data-driven computational research approaches allow the identification of subgroups of subjects with a distinct pattern of sensitivity to cold stimuli. The subgroups are reproducible with different types of noxious cold stimuli. Subgroups show pattern that hints at distinct and inter-individually different types of the underlying molecular background. © 2018 European Pain Federation - EFIC®.

  19. Controlled clockwise and anticlockwise rotational switching of a molecular motor.

    PubMed

    Perera, U G E; Ample, F; Kersell, H; Zhang, Y; Vives, G; Echeverria, J; Grisolia, M; Rapenne, G; Joachim, C; Hla, S-W

    2013-01-01

    The design of artificial molecular machines often takes inspiration from macroscopic machines. However, the parallels between the two systems are often only superficial, because most molecular machines are governed by quantum processes. Previously, rotary molecular motors powered by light and chemical energy have been developed. In electrically driven motors, tunnelling electrons from the tip of a scanning tunnelling microscope have been used to drive the rotation of a simple rotor in a single direction and to move a four-wheeled molecule across a surface. Here, we show that a stand-alone molecular motor adsorbed on a gold surface can be made to rotate in a clockwise or anticlockwise direction by selective inelastic electron tunnelling through different subunits of the motor. Our motor is composed of a tripodal stator for vertical positioning, a five-arm rotor for controlled rotations, and a ruthenium atomic ball bearing connecting the static and rotational parts. The directional rotation arises from sawtooth-like rotational potentials, which are solely determined by the internal molecular structure and are independent of the surface adsorption site.

  20. Prospects of application of additive technologies for increasing the efficiency of impeller machines

    NASA Astrophysics Data System (ADS)

    Belova, O. V.; Borisov, Yu. A.

    2017-08-01

    Impeller machine is a device in which the flow path carries out the supply (or retraction) of mechanical energy to the flow of a working fluid passing through the machine. To increase the efficiency of impeller machines, it is necessary to use design modern technologies, namely the use of numerical methods for conducting research in the field of gas dynamics, as well as additive manufacturing (AM) for the of both prototypes and production model. AM technologies are deservedly rightly called revolutionary because they give unique possibility for manufacturing products, creating perfect forms, both light and durable. The designers face the challenge of developing a new design methodology, since AM allows the use of the concept of "Complexity For Free". The "Complexity For Free" conception is based on: complexity of the form; hierarchical complexity; complexity of the material; functional complexity. The new technical items design method according to a functional principle is also investigated.

  1. The Effect of Two Receivers on Broadcast Molecular Communication Systems.

    PubMed

    Lu, Yi; Higgins, Matthew D; Noel, Adam; Leeson, Mark S; Chen, Yunfei

    2016-12-01

    Molecular communication is a paradigm that utilizes molecules to exchange information between nano-machines. When considering such systems where multiple receivers are present, prior work has assumed for simplicity that they do not interfere with each other. This paper aims to address this issue and shows to what extent an interfering receiver, [Formula: see text], will have an impact on the target receiver, [Formula: see text], with respect to Bit Error Rate (BER) and capacity. Furthermore, approximations of the Binomial distribution are applied to reduce the complexity of calculations. Results show the sensitivity in communication performance due to the relative location of the interfering receiver. Critically, placing [Formula: see text] between the transmitter [Formula: see text] and [Formula: see text] causes a significant increase in BER or decrease in capacity.

  2. Comparative Characterization of Crofelemer Samples Using Data Mining and Machine Learning Approaches With Analytical Stability Data Sets.

    PubMed

    Nariya, Maulik K; Kim, Jae Hyun; Xiong, Jian; Kleindl, Peter A; Hewarathna, Asha; Fisher, Adam C; Joshi, Sangeeta B; Schöneich, Christian; Forrest, M Laird; Middaugh, C Russell; Volkin, David B; Deeds, Eric J

    2017-11-01

    There is growing interest in generating physicochemical and biological analytical data sets to compare complex mixture drugs, for example, products from different manufacturers. In this work, we compare various crofelemer samples prepared from a single lot by filtration with varying molecular weight cutoffs combined with incubation for different times at different temperatures. The 2 preceding articles describe experimental data sets generated from analytical characterization of fractionated and degraded crofelemer samples. In this work, we use data mining techniques such as principal component analysis and mutual information scores to help visualize the data and determine discriminatory regions within these large data sets. The mutual information score identifies chemical signatures that differentiate crofelemer samples. These signatures, in many cases, would likely be missed by traditional data analysis tools. We also found that supervised learning classifiers robustly discriminate samples with around 99% classification accuracy, indicating that mathematical models of these physicochemical data sets are capable of identifying even subtle differences in crofelemer samples. Data mining and machine learning techniques can thus identify fingerprint-type attributes of complex mixture drugs that may be used for comparative characterization of products. Copyright © 2017 American Pharmacists Association®. All rights reserved.

  3. A comparison of machine learning and Bayesian modelling for molecular serotyping.

    PubMed

    Newton, Richard; Wernisch, Lorenz

    2017-08-11

    Streptococcus pneumoniae is a human pathogen that is a major cause of infant mortality. Identifying the pneumococcal serotype is an important step in monitoring the impact of vaccines used to protect against disease. Genomic microarrays provide an effective method for molecular serotyping. Previously we developed an empirical Bayesian model for the classification of serotypes from a molecular serotyping array. With only few samples available, a model driven approach was the only option. In the meanwhile, several thousand samples have been made available to us, providing an opportunity to investigate serotype classification by machine learning methods, which could complement the Bayesian model. We compare the performance of the original Bayesian model with two machine learning algorithms: Gradient Boosting Machines and Random Forests. We present our results as an example of a generic strategy whereby a preliminary probabilistic model is complemented or replaced by a machine learning classifier once enough data are available. Despite the availability of thousands of serotyping arrays, a problem encountered when applying machine learning methods is the lack of training data containing mixtures of serotypes; due to the large number of possible combinations. Most of the available training data comprises samples with only a single serotype. To overcome the lack of training data we implemented an iterative analysis, creating artificial training data of serotype mixtures by combining raw data from single serotype arrays. With the enhanced training set the machine learning algorithms out perform the original Bayesian model. However, for serotypes currently lacking sufficient training data the best performing implementation was a combination of the results of the Bayesian Model and the Gradient Boosting Machine. As well as being an effective method for classifying biological data, machine learning can also be used as an efficient method for revealing subtle biological insights, which we illustrate with an example.

  4. High-pressure microscopy for tracking dynamic properties of molecular machines.

    PubMed

    Nishiyama, Masayoshi

    2017-12-01

    High-pressure microscopy is one of the powerful techniques to visualize the effects of hydrostatic pressures on research targets. It could be used for monitoring the pressure-induced changes in the structure and function of molecular machines in vitro and in vivo. This review focuses on the dynamic properties of the assemblies and machines, analyzed by means of high-pressure microscopy measurement. We developed a high-pressure microscope that is optimized both for the best image formation and for the stability to hydrostatic pressure up to 150 MPa. Application of pressure could change polymerization and depolymerization processes of the microtubule cytoskeleton, suggesting a modulation of the intermolecular interaction between tubulin molecules. A novel motility assay demonstrated that high hydrostatic pressure induces counterclockwise (CCW) to clockwise (CW) reversals of the Escherichia coli flagellar motor. The present techniques could be extended to study how molecular machines in complicated systems respond to mechanical stimuli. Copyright © 2017 Elsevier B.V. All rights reserved.

  5. Molecular Rotors as Switches

    PubMed Central

    Xue, Mei; Wang, Kang L.

    2012-01-01

    The use of a functional molecular unit acting as a state variable provides an attractive alternative for the next generations of nanoscale electronics. It may help overcome the limits of conventional MOSFETd due to their potential scalability, low-cost, low variability, and highly integratable characteristics as well as the capability to exploit bottom-up self-assembly processes. This bottom-up construction and the operation of nanoscale machines/devices, in which the molecular motion can be controlled to perform functions, have been studied for their functionalities. Being triggered by external stimuli such as light, electricity or chemical reagents, these devices have shown various functions including those of diodes, rectifiers, memories, resonant tunnel junctions and single settable molecular switches that can be electronically configured for logic gates. Molecule-specific electronic switching has also been reported for several of these device structures, including nanopores containing oligo(phenylene ethynylene) monolayers, and planar junctions incorporating rotaxane and catenane monolayers for the construction and operation of complex molecular machines. A specific electrically driven surface mounted molecular rotor is described in detail in this review. The rotor is comprised of a monolayer of redox-active ligated copper compounds sandwiched between a gold electrode and a highly-doped P+ Si. This electrically driven sandwich-type monolayer molecular rotor device showed an on/off ratio of approximately 104, a read window of about 2.5 V, and a retention time of greater than 104 s. The rotation speed of this type of molecular rotor has been reported to be in the picosecond timescale, which provides a potential of high switching speed applications. Current-voltage spectroscopy (I-V) revealed a temperature-dependent negative differential resistance (NDR) associated with the device. The analysis of the device I–V characteristics suggests the source of the observed switching effects to be the result of the redox-induced ligand rotation around the copper metal center and this attribution of switching is consistent with the observed temperature dependence of the switching behavior as well as the proposed energy diagram of the device. The observed resistance switching shows the potential for future non-volatile memories and logic devices applications. This review will discuss the progress and provide a perspective of molecular motion for nanoelectronics and other applications.

  6. Molecular graph convolutions: moving beyond fingerprints

    NASA Astrophysics Data System (ADS)

    Kearnes, Steven; McCloskey, Kevin; Berndl, Marc; Pande, Vijay; Riley, Patrick

    2016-08-01

    Molecular "fingerprints" encoding structural information are the workhorse of cheminformatics and machine learning in drug discovery applications. However, fingerprint representations necessarily emphasize particular aspects of the molecular structure while ignoring others, rather than allowing the model to make data-driven decisions. We describe molecular graph convolutions, a machine learning architecture for learning from undirected graphs, specifically small molecules. Graph convolutions use a simple encoding of the molecular graph—atoms, bonds, distances, etc.—which allows the model to take greater advantage of information in the graph structure. Although graph convolutions do not outperform all fingerprint-based methods, they (along with other graph-based methods) represent a new paradigm in ligand-based virtual screening with exciting opportunities for future improvement.

  7. Molecular graph convolutions: moving beyond fingerprints.

    PubMed

    Kearnes, Steven; McCloskey, Kevin; Berndl, Marc; Pande, Vijay; Riley, Patrick

    2016-08-01

    Molecular "fingerprints" encoding structural information are the workhorse of cheminformatics and machine learning in drug discovery applications. However, fingerprint representations necessarily emphasize particular aspects of the molecular structure while ignoring others, rather than allowing the model to make data-driven decisions. We describe molecular graph convolutions, a machine learning architecture for learning from undirected graphs, specifically small molecules. Graph convolutions use a simple encoding of the molecular graph-atoms, bonds, distances, etc.-which allows the model to take greater advantage of information in the graph structure. Although graph convolutions do not outperform all fingerprint-based methods, they (along with other graph-based methods) represent a new paradigm in ligand-based virtual screening with exciting opportunities for future improvement.

  8. Condensins: universal organizers of chromosomes with diverse functions

    PubMed Central

    Hirano, Tatsuya

    2012-01-01

    Condensins are multisubunit protein complexes that play a fundamental role in the structural and functional organization of chromosomes in the three domains of life. Most eukaryotic species have two different types of condensin complexes, known as condensins I and II, that fulfill nonoverlapping functions and are subjected to differential regulation during mitosis and meiosis. Recent studies revealed that the two complexes contribute to a wide variety of interphase chromosome functions, such as gene regulation, recombination, and repair. Also emerging are their cell type- and tissue-specific functions and relevance to human disease. Biochemical and structural analyses of eukaryotic and bacterial condensins steadily uncover the mechanisms of action of this class of highly sophisticated molecular machines. Future studies on condensins will not only enhance our understanding of chromosome architecture and dynamics, but also help address a previously underappreciated yet profound set of questions in chromosome biology. PMID:22855829

  9. Tribology and energy efficiency: from molecules to lubricated contacts to complete machines.

    PubMed

    Taylor, Robert Ian

    2012-01-01

    The impact of lubricants on energy efficiency is considered. Molecular details of base oils used in lubricants can have a great impact on the lubricant's physical properties which will affect the energy efficiency performance of a lubricant. In addition, molecular details of lubricant additives can result in significant differences in measured friction coefficients for machine elements operating in the mixed/boundary lubrication regime. In single machine elements, these differences will result in lower friction losses, and for complete systems (such as cars, trucks, hydraulic circuits, industrial gearboxes etc.) lower fuel consumption or lower electricity consumption can result.

  10. Morphological diagnostics of star formation in molecular clouds

    NASA Astrophysics Data System (ADS)

    Beaumont, Christopher Norris

    Molecular clouds are the birth sites of all star formation in the present-day universe. They represent the initial conditions of star formation, and are the primary medium by which stars transfer energy and momentum back to parsec scales. Yet, the physical evolution of molecular clouds remains poorly understood. This is not due to a lack of observational data, nor is it due to an inability to simulate the conditions inside molecular clouds. Instead, the physics and structure of the interstellar medium are sufficiently complex that interpreting molecular cloud data is very difficult. This dissertation mitigates this problem, by developing more sophisticated ways to interpret morphological information in molecular cloud observations and simulations. In particular, I have focused on leveraging machine learning techniques to identify physically meaningful substructures in the interstellar medium, as well as techniques to inter-compare molecular cloud simulations to observations. These contributions make it easier to understand the interplay between molecular clouds and star formation. Specific contributions include: new insight about the sheet-like geometry of molecular clouds based on observations of stellar bubbles; a new algorithm to disambiguate overlapping yet morphologically distinct cloud structures; a new perspective on the relationship between molecular cloud column density distributions and the sizes of cloud substructures; a quantitative analysis of how projection effects affect measurements of cloud properties; and an automatically generated, statistically-calibrated catalog of bubbles identified from their infrared morphologies.

  11. Effectiveness of Podcasts as Laboratory Instructional Support: Learner Perceptions of Machine Shop and Welding Students

    ERIC Educational Resources Information Center

    Lauritzen, Louis Dee

    2014-01-01

    Machine shop students face the daunting task of learning the operation of complex three-dimensional machine tools, and welding students must develop specific motor skills in addition to understanding the complexity of material types and characteristics. The use of consumer technology by the Millennial generation of vocational students, the…

  12. Complex biomarker discovery in neuroimaging data: Finding a needle in a haystack☆

    PubMed Central

    Atluri, Gowtham; Padmanabhan, Kanchana; Fang, Gang; Steinbach, Michael; Petrella, Jeffrey R.; Lim, Kelvin; MacDonald, Angus; Samatova, Nagiza F.; Doraiswamy, P. Murali; Kumar, Vipin

    2013-01-01

    Neuropsychiatric disorders such as schizophrenia, bipolar disorder and Alzheimer's disease are major public health problems. However, despite decades of research, we currently have no validated prognostic or diagnostic tests that can be applied at an individual patient level. Many neuropsychiatric diseases are due to a combination of alterations that occur in a human brain rather than the result of localized lesions. While there is hope that newer imaging technologies such as functional and anatomic connectivity MRI or molecular imaging may offer breakthroughs, the single biomarkers that are discovered using these datasets are limited by their inability to capture the heterogeneity and complexity of most multifactorial brain disorders. Recently, complex biomarkers have been explored to address this limitation using neuroimaging data. In this manuscript we consider the nature of complex biomarkers being investigated in the recent literature and present techniques to find such biomarkers that have been developed in related areas of data mining, statistics, machine learning and bioinformatics. PMID:24179856

  13. X-ray free electron lasers motivate bioanalytical characterization of protein nanocrystals: serial femtosecond crystallography.

    PubMed

    Bogan, Michael J

    2013-04-02

    Atomic resolution structures of large biomacromolecular complexes can now be recorded at room temperature from crystals with submicrometer dimensions using intense femtosecond pulses delivered by the world's largest and most powerful X-ray machine, a laser called the Linac Coherent Light Source. Abundant opportunities exist for the bioanalytical sciences to help extend this revolutionary advance in structural biology to the ultimate goal of recording molecular-movies of noncrystalline biomacromolecules. This Feature will introduce the concept of serial femtosecond crystallography to the nonexpert, briefly review progress to date, and highlight some potential contributions from the analytical sciences.

  14. Binding Affinity prediction with Property Encoded Shape Distribution signatures

    PubMed Central

    Das, Sourav; Krein, Michael P.

    2010-01-01

    We report the use of the molecular signatures known as “Property-Encoded Shape Distributions” (PESD) together with standard Support Vector Machine (SVM) techniques to produce validated models that can predict the binding affinity of a large number of protein ligand complexes. This “PESD-SVM” method uses PESD signatures that encode molecular shapes and property distributions on protein and ligand surfaces as features to build SVM models that require no subjective feature selection. A simple protocol was employed for tuning the SVM models during their development, and the results were compared to SFCscore – a regression-based method that was previously shown to perform better than 14 other scoring functions. Although the PESD-SVM method is based on only two surface property maps, the overall results were comparable. For most complexes with a dominant enthalpic contribution to binding (ΔH/-TΔS > 3), a good correlation between true and predicted affinities was observed. Entropy and solvent were not considered in the present approach and further improvement in accuracy would require accounting for these components rigorously. PMID:20095526

  15. Dancing with Swarms: Utilizing Swarm Intelligence to Build, Investigate, and Control Complex Systems

    NASA Astrophysics Data System (ADS)

    Jacob, Christian

    We are surrounded by a natural world of massively parallel, decentralized biological "information processing" systems, a world that exhibits fascinating emergent properties in many ways. In fact, our very own bodies are the result of emergent patterns, as the development of any multi-cellular organism is determined by localized interactions among an enormous number of cells, carefully orchestrated by enzymes, signalling proteins and other molecular "agents". What is particularly striking about these highly distributed developmental processes is that a centralized control agency is completely absent. This is also the case for many other biological systems, such as termites which build their nests—without an architect that draws a plan, or brain cells evolving into a complex `mind machine'—without an explicit blueprint of a network layout.

  16. Computational Nanotechnology of Materials, Devices, and Machines: Carbon Nanotubes

    NASA Technical Reports Server (NTRS)

    Srivastava, Deepak; Kwak, Dolhan (Technical Monitor)

    2000-01-01

    The mechanics and chemistry of carbon nanotubes have relevance for their numerous electronic applications. Mechanical deformations such as bending and twisting affect the nanotube's conductive properties, and at the same time they possess high strength and elasticity. Two principal techniques were utilized including the analysis of large scale classical molecular dynamics on a shared memory architecture machine and a quantum molecular dynamics methodology. In carbon based electronics, nanotubes are used as molecular wires with topological defects which are mediated through various means. Nanotubes can be connected to form junctions.

  17. Micro-machined calorimetric biosensors

    DOEpatents

    Doktycz, Mitchel J.; Britton, Jr., Charles L.; Smith, Stephen F.; Oden, Patrick I.; Bryan, William L.; Moore, James A.; Thundat, Thomas G.; Warmack, Robert J.

    2002-01-01

    A method and apparatus are provided for detecting and monitoring micro-volumetric enthalpic changes caused by molecular reactions. Micro-machining techniques are used to create very small thermally isolated masses incorporating temperature-sensitive circuitry. The thermally isolated masses are provided with a molecular layer or coating, and the temperature-sensitive circuitry provides an indication when the molecules of the coating are involved in an enthalpic reaction. The thermally isolated masses may be provided singly or in arrays and, in the latter case, the molecular coatings may differ to provide qualitative and/or quantitative assays of a substance.

  18. In vitro labeling strategies for in cellulo fluorescence microscopy of single ribonucleoprotein machines.

    PubMed

    Custer, Thomas C; Walter, Nils G

    2017-07-01

    RNA plays a fundamental, ubiquitous role as either substrate or functional component of many large cellular complexes-"molecular machines"-used to maintain and control the readout of genetic information, a functional landscape that we are only beginning to understand. The cellular mechanisms for the spatiotemporal organization of the plethora of RNAs involved in gene expression are particularly poorly understood. Intracellular single-molecule fluorescence microscopy provides a powerful emerging tool for probing the pertinent mechanistic parameters that govern cellular RNA functions, including those of protein coding messenger RNAs (mRNAs). Progress has been hampered, however, by the scarcity of efficient high-yield methods to fluorescently label RNA molecules without the need to drastically increase their molecular weight through artificial appendages that may result in altered behavior. Herein, we employ T7 RNA polymerase to body label an RNA with a cyanine dye, as well as yeast poly(A) polymerase to strategically place multiple 2'-azido-modifications for subsequent fluorophore labeling either between the body and tail or randomly throughout the tail. Using a combination of biochemical and single-molecule fluorescence microscopy approaches, we demonstrate that both yeast poly(A) polymerase labeling strategies result in fully functional mRNA, whereas protein coding is severely diminished in the case of body labeling. © 2016 The Protein Society.

  19. Multiclass cancer diagnosis using tumor gene expression signatures

    DOE PAGES

    Ramaswamy, S.; Tamayo, P.; Rifkin, R.; ...

    2001-12-11

    The optimal treatment of patients with cancer depends on establishing accurate diagnoses by using a complex combination of clinical and histopathological data. In some instances, this task is difficult or impossible because of atypical clinical presentation or histopathology. To determine whether the diagnosis of multiple common adult malignancies could be achieved purely by molecular classification, we subjected 218 tumor samples, spanning 14 common tumor types, and 90 normal tissue samples to oligonucleotide microarray gene expression analysis. The expression levels of 16,063 genes and expressed sequence tags were used to evaluate the accuracy of a multiclass classifier based on a supportmore » vector machine algorithm. Overall classification accuracy was 78%, far exceeding the accuracy of random classification (9%). Poorly differentiated cancers resulted in low-confidence predictions and could not be accurately classified according to their tissue of origin, indicating that they are molecularly distinct entities with dramatically different gene expression patterns compared with their well differentiated counterparts. Taken together, these results demonstrate the feasibility of accurate, multiclass molecular cancer classification and suggest a strategy for future clinical implementation of molecular cancer diagnostics.« less

  20. ATOMIC RESOLUTION CRYO ELECTRON MICROSCOPY OF MACROMOLECULAR COMPLEXES

    PubMed Central

    ZHOU, Z. HONG

    2013-01-01

    Single-particle cryo electron microscopy (cryoEM) is a technique for determining three-dimensional (3D) structures from projection images of molecular complexes preserved in their “native,” noncrystalline state. Recently, atomic or near-atomic resolution structures of several viruses and protein assemblies have been determined by single-particle cryoEM, allowing ab initio atomic model building by following the amino acid side chains or nucleic acid bases identifiable in their cryoEM density maps. In particular, these cryoEM structures have revealed extended arms contributing to molecular interactions that are otherwise not resolved by the conventional structural method of X-ray crystallography at similar resolutions. High-resolution cryoEM requires careful consideration of a number of factors, including proper sample preparation to ensure structural homogeneity, optimal configuration of electron imaging conditions to record high-resolution cryoEM images, accurate determination of image parameters to correct image distortions, efficient refinement and computation to reconstruct a 3D density map, and finally appropriate choice of modeling tools to construct atomic models for functional interpretation. This progress illustrates the power of cryoEM and ushers it into the arsenal of structural biology, alongside conventional techniques of X-ray crystallography and NMR, as a major tool (and sometimes the preferred one) for the studies of molecular interactions in supramolecular assemblies or machines. PMID:21501817

  1. Mapping of the extinction in giant molecular clouds using optical star counts

    NASA Astrophysics Data System (ADS)

    Cambrésy, L.

    1999-05-01

    This paper presents large scale extinction maps of most nearby Giant Molecular Clouds of the Galaxy (Lupus, rho Ophiuchus, Scorpius, Coalsack, Taurus, Chamaeleon, Musca, Corona Australis, Serpens, IC 5146, Vela, Orion, Monoceros R1 and R2, Rosette, Carina) derived from a star count method using an adaptive grid and a wavelet decomposition applied to the optical data provided by the USNO-Precision Measuring Machine. The distribution of the extinction in the clouds leads to estimate their total individual masses M and their maximum of extinction. I show that the relation between the mass contained within an iso-extinction contour and the extinction is similar from cloud to cloud and allows the extrapolation of the maximum of extinction in the range 5.7 to 25.5 magnitudes. I found that about half of the mass is contained in regions where the visual extinction is smaller than 1 magnitude. The star count method used on large scale ( ~ 250 square degrees) is a powerful and relatively straightforward method to estimate the mass of molecular complexes. A systematic study of the all sky would lead to discover new clouds as I did in the Lupus complex for which I found a sixth cloud of about 10(4) M_⊙.

  2. Computer Aided Simulation Machining Programming In 5-Axis Nc Milling Of Impeller Leaf

    NASA Astrophysics Data System (ADS)

    Huran, Liu

    At present, cad/cam (computer-aided design and manufacture) have fine wider and wider application in mechanical industry. For the complex surfaces, the traditional machine tool can no longer satisfy the requirement of such complex task. Only by the help of cad/cam can fulfill the requirement. The machining of the vane surface of the impeller leaf has been considered as the hardest challenge. Because of their complex shape, the 5-axis cnc machine tool is needed for the machining of such parts. The material is hard to cut, the requirement for the surface finish and clearance is very high, so that the manufacture quality of impeller leaf represent the level of 5-axis machining. This paper opened a new field in machining the complicated surface, based on a relatively more rigid mathematical basis. The theory presented here is relatively more systematical. Since the lack of theoretical guidance, in the former research, people have to try in machining many times. Such case will be changed. The movement of the cutter determined by this method is definite, and the residual is the smallest while the times of travel is the fewest. The criterion is simple and the calculation is easy.

  3. Assessment and Validation of Machine Learning Methods for Predicting Molecular Atomization Energies.

    PubMed

    Hansen, Katja; Montavon, Grégoire; Biegler, Franziska; Fazli, Siamac; Rupp, Matthias; Scheffler, Matthias; von Lilienfeld, O Anatole; Tkatchenko, Alexandre; Müller, Klaus-Robert

    2013-08-13

    The accurate and reliable prediction of properties of molecules typically requires computationally intensive quantum-chemical calculations. Recently, machine learning techniques applied to ab initio calculations have been proposed as an efficient approach for describing the energies of molecules in their given ground-state structure throughout chemical compound space (Rupp et al. Phys. Rev. Lett. 2012, 108, 058301). In this paper we outline a number of established machine learning techniques and investigate the influence of the molecular representation on the methods performance. The best methods achieve prediction errors of 3 kcal/mol for the atomization energies of a wide variety of molecules. Rationales for this performance improvement are given together with pitfalls and challenges when applying machine learning approaches to the prediction of quantum-mechanical observables.

  4. The ribosome as an optimal decoder: a lesson in molecular recognition.

    PubMed

    Savir, Yonatan; Tlusty, Tsvi

    2013-04-11

    The ribosome is a complex molecular machine that, in order to synthesize proteins, has to decode mRNAs by pairing their codons with matching tRNAs. Decoding is a major determinant of fitness and requires accurate and fast selection of correct tRNAs among many similar competitors. However, it is unclear whether the modern ribosome, and in particular its large conformational changes during decoding, are the outcome of adaptation to its task as a decoder or the result of other constraints. Here, we derive the energy landscape that provides optimal discrimination between competing substrates and thereby optimal tRNA decoding. We show that the measured landscape of the prokaryotic ribosome is sculpted in this way. This model suggests that conformational changes of the ribosome and tRNA during decoding are means to obtain an optimal decoder. Our analysis puts forward a generic mechanism that may be utilized broadly by molecular recognition systems. Copyright © 2013 Elsevier Inc. All rights reserved.

  5. The Biomolecular Interaction Network Database and related tools 2005 update

    PubMed Central

    Alfarano, C.; Andrade, C. E.; Anthony, K.; Bahroos, N.; Bajec, M.; Bantoft, K.; Betel, D.; Bobechko, B.; Boutilier, K.; Burgess, E.; Buzadzija, K.; Cavero, R.; D'Abreo, C.; Donaldson, I.; Dorairajoo, D.; Dumontier, M. J.; Dumontier, M. R.; Earles, V.; Farrall, R.; Feldman, H.; Garderman, E.; Gong, Y.; Gonzaga, R.; Grytsan, V.; Gryz, E.; Gu, V.; Haldorsen, E.; Halupa, A.; Haw, R.; Hrvojic, A.; Hurrell, L.; Isserlin, R.; Jack, F.; Juma, F.; Khan, A.; Kon, T.; Konopinsky, S.; Le, V.; Lee, E.; Ling, S.; Magidin, M.; Moniakis, J.; Montojo, J.; Moore, S.; Muskat, B.; Ng, I.; Paraiso, J. P.; Parker, B.; Pintilie, G.; Pirone, R.; Salama, J. J.; Sgro, S.; Shan, T.; Shu, Y.; Siew, J.; Skinner, D.; Snyder, K.; Stasiuk, R.; Strumpf, D.; Tuekam, B.; Tao, S.; Wang, Z.; White, M.; Willis, R.; Wolting, C.; Wong, S.; Wrong, A.; Xin, C.; Yao, R.; Yates, B.; Zhang, S.; Zheng, K.; Pawson, T.; Ouellette, B. F. F.; Hogue, C. W. V.

    2005-01-01

    The Biomolecular Interaction Network Database (BIND) (http://bind.ca) archives biomolecular interaction, reaction, complex and pathway information. Our aim is to curate the details about molecular interactions that arise from published experimental research and to provide this information, as well as tools to enable data analysis, freely to researchers worldwide. BIND data are curated into a comprehensive machine-readable archive of computable information and provides users with methods to discover interactions and molecular mechanisms. BIND has worked to develop new methods for visualization that amplify the underlying annotation of genes and proteins to facilitate the study of molecular interaction networks. BIND has maintained an open database policy since its inception in 1999. Data growth has proceeded at a tremendous rate, approaching over 100 000 records. New services provided include a new BIND Query and Submission interface, a Standard Object Access Protocol service and the Small Molecule Interaction Database (http://smid.blueprint.org) that allows users to determine probable small molecule binding sites of new sequences and examine conserved binding residues. PMID:15608229

  6. A machine learning approach to computer-aided molecular design

    NASA Astrophysics Data System (ADS)

    Bolis, Giorgio; Di Pace, Luigi; Fabrocini, Filippo

    1991-12-01

    Preliminary results of a machine learning application concerning computer-aided molecular design applied to drug discovery are presented. The artificial intelligence techniques of machine learning use a sample of active and inactive compounds, which is viewed as a set of positive and negative examples, to allow the induction of a molecular model characterizing the interaction between the compounds and a target molecule. The algorithm is based on a twofold phase. In the first one — the specialization step — the program identifies a number of active/inactive pairs of compounds which appear to be the most useful in order to make the learning process as effective as possible and generates a dictionary of molecular fragments, deemed to be responsible for the activity of the compounds. In the second phase — the generalization step — the fragments thus generated are combined and generalized in order to select the most plausible hypothesis with respect to the sample of compounds. A knowledge base concerning physical and chemical properties is utilized during the inductive process.

  7. Neural Network and Nearest Neighbor Algorithms for Enhancing Sampling of Molecular Dynamics.

    PubMed

    Galvelis, Raimondas; Sugita, Yuji

    2017-06-13

    The free energy calculations of complex chemical and biological systems with molecular dynamics (MD) are inefficient due to multiple local minima separated by high-energy barriers. The minima can be escaped using an enhanced sampling method such as metadynamics, which apply bias (i.e., importance sampling) along a set of collective variables (CV), but the maximum number of CVs (or dimensions) is severely limited. We propose a high-dimensional bias potential method (NN2B) based on two machine learning algorithms: the nearest neighbor density estimator (NNDE) and the artificial neural network (ANN) for the bias potential approximation. The bias potential is constructed iteratively from short biased MD simulations accounting for correlation among CVs. Our method is capable of achieving ergodic sampling and calculating free energy of polypeptides with up to 8-dimensional bias potential.

  8. Molecular graph convolutions: moving beyond fingerprints

    PubMed Central

    Kearnes, Steven; McCloskey, Kevin; Berndl, Marc; Pande, Vijay; Riley, Patrick

    2016-01-01

    Molecular “fingerprints” encoding structural information are the workhorse of cheminformatics and machine learning in drug discovery applications. However, fingerprint representations necessarily emphasize particular aspects of the molecular structure while ignoring others, rather than allowing the model to make data-driven decisions. We describe molecular graph convolutions, a machine learning architecture for learning from undirected graphs, specifically small molecules. Graph convolutions use a simple encoding of the molecular graph—atoms, bonds, distances, etc.—which allows the model to take greater advantage of information in the graph structure. Although graph convolutions do not outperform all fingerprint-based methods, they (along with other graph-based methods) represent a new paradigm in ligand-based virtual screening with exciting opportunities for future improvement. PMID:27558503

  9. Machine-learning scoring functions for identifying native poses of ligands docked to known and novel proteins.

    PubMed

    Ashtawy, Hossam M; Mahapatra, Nihar R

    2015-01-01

    Molecular docking is a widely-employed method in structure-based drug design. An essential component of molecular docking programs is a scoring function (SF) that can be used to identify the most stable binding pose of a ligand, when bound to a receptor protein, from among a large set of candidate poses. Despite intense efforts in developing conventional SFs, which are either force-field based, knowledge-based, or empirical, their limited docking power (or ability to successfully identify the correct pose) has been a major impediment to cost-effective drug discovery. Therefore, in this work, we explore a range of novel SFs employing different machine-learning (ML) approaches in conjunction with physicochemical and geometrical features characterizing protein-ligand complexes to predict the native or near-native pose of a ligand docked to a receptor protein's binding site. We assess the docking accuracies of these new ML SFs as well as those of conventional SFs in the context of the 2007 PDBbind benchmark dataset on both diverse and homogeneous (protein-family-specific) test sets. Further, we perform a systematic analysis of the performance of the proposed SFs in identifying native poses of ligands that are docked to novel protein targets. We find that the best performing ML SF has a success rate of 80% in identifying poses that are within 1 Å root-mean-square deviation from the native poses of 65 different protein families. This is in comparison to a success rate of only 70% achieved by the best conventional SF, ASP, employed in the commercial docking software GOLD. In addition, the proposed ML SFs perform better on novel proteins that they were never trained on before. We also observed steady gains in the performance of these scoring functions as the training set size and number of features were increased by considering more protein-ligand complexes and/or more computationally-generated poses for each complex.

  10. Machine-learning scoring functions for identifying native poses of ligands docked to known and novel proteins

    PubMed Central

    2015-01-01

    Background Molecular docking is a widely-employed method in structure-based drug design. An essential component of molecular docking programs is a scoring function (SF) that can be used to identify the most stable binding pose of a ligand, when bound to a receptor protein, from among a large set of candidate poses. Despite intense efforts in developing conventional SFs, which are either force-field based, knowledge-based, or empirical, their limited docking power (or ability to successfully identify the correct pose) has been a major impediment to cost-effective drug discovery. Therefore, in this work, we explore a range of novel SFs employing different machine-learning (ML) approaches in conjunction with physicochemical and geometrical features characterizing protein-ligand complexes to predict the native or near-native pose of a ligand docked to a receptor protein's binding site. We assess the docking accuracies of these new ML SFs as well as those of conventional SFs in the context of the 2007 PDBbind benchmark dataset on both diverse and homogeneous (protein-family-specific) test sets. Further, we perform a systematic analysis of the performance of the proposed SFs in identifying native poses of ligands that are docked to novel protein targets. Results and conclusion We find that the best performing ML SF has a success rate of 80% in identifying poses that are within 1 Å root-mean-square deviation from the native poses of 65 different protein families. This is in comparison to a success rate of only 70% achieved by the best conventional SF, ASP, employed in the commercial docking software GOLD. In addition, the proposed ML SFs perform better on novel proteins that they were never trained on before. We also observed steady gains in the performance of these scoring functions as the training set size and number of features were increased by considering more protein-ligand complexes and/or more computationally-generated poses for each complex. PMID:25916860

  11. PSB27: A thylakoid protein enabling Arabidopsis to adapt to changing light intensity

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hou, Xin; Garcia, Veder J.; Buchanan, Bob B.

    Project Title: Immunophilins in the assembly and maintenance of photosynthetic electron transport chain in Arabidopsis Applicant: The Regents of the University of California PI: Sheng Luan, University of California at Berkeley Photosynthetic light energy conversion entails coordinated function of complex molecular machines that capture and convert light energy into chemical forms through photosynthetic electron transport chain. Each molecular machine, such as photosystem II (PSII), may consist of dozens of protein subunits and small molecule cofactors. Despite advanced understanding of the structure and function of these complexes, little is known about “How individual proteins and cofactors assemble into a functional machinemore » and how do these molecular machines maintain their structure and function under a highly hazardous lumenal environment.” Our studies on immunophilins have unexpectedly contributed to the understanding of this question. Originally defined as cellular receptors for immunosuppressants, immunophilins have been discovered in a wide range of organisms from bacteria, fungi, plants, to animals. Immunophilins function in protein folding processes as chaperones and foldases. Arabidopsis genome encodes ca. 50 immunophilins. The most striking finding is that 16 immunophilin members are targeted to chloroplast thylakoid lumen, by far the largest group in the lumenal proteome. What is the function of immunophilins in the thylakoid lumen? Our studies have demonstrated critical roles for several immunophilins in the biogenesis and maintenance of photosynthetic complexes such as PSII. These studies have made a critical link between immunophilins and the assembly of photosynthetic machines and thus opened up a new area of research in photosynthesis. Our goal is to dissect the roles of immunophilins and their partners in the assembly and maintenance of the photosynthetic electron transport chain. The specific objectives for this funding period will be: 1. To dissect the mechanism of action for CYP38. Our studies suggest an “autoinhibitory” model for CYP38 action. We plan to test this model and determine the mechanistic details for CYP38 function by biochemical and genetic procedures. 2. To determine the mechanism of action of FKBP20-2/FKBP16-3. Studies on FBKP16-3 and FKBP20-2 suggest that they may work together as a redox-dependent duo in PSII assembly. We will test this hypothesis using biochemical and genetic tools. 3. To determine the function of lumenal FKBPs by multi-gene mutagenesis approach. Using new genetic knockout (KO) procedures especially CRISPR/CAS9 system, we plan to generate multi-gene KO models that will likely provide vital information on those FKBPs with functional redundancy. 4. Functional analysis of thylakoid lumen network. We will focus our effort on objectives 1-3. Objective 4 represents a long-term goal to establish a more comprehensive network of proteins in the thylakoid lumen and their function in the assembly and maintenance of photosynthetic complexes. I hypothesize that the 16 immunophilin-type chaperones and foldases in the thylakoid lumen constitute an “assembly line” for the various components in the photosynthetic electron transport chain. Understanding this assembly line will enable further engineering of more efficient photosynthetic machines to capture more light energy and enhance plant productivity. Furthermore, it is envisioned that such molecular “assembly line” may be rebuilt to produce artificial photosynthesis in vitro or in non-photosynthetic organisms. These are highly relevant to the mission of “Photosynthetic Systems” and “Physical Biosciences” programs to “enhance our understanding of energy capture, conversion, and/or storage.” During the funding period, we have performed a number of experiments under the proposed objectives and obtained several new results. We have found in Objective 1 that CYP38 is associated to the thylokoid membrane with a larger complex that might be PSII supercomplex. Under objective 2, we have found that FKBP16-2 interacted with PSB27 that was further pursuited and published a research article in PNAS (attached). Under Objective 3, we have identified several mutants of other FKBPs in the thyalkoid lumen that should be further studied if future funding is available. Under Objective 4, we have started to build a network of lumenal proteins that play a number of roles in photosynthesis. For example, the CYP37 and CYP28 are linked to chloroplast signaling to nucleus, critical for controlling plant response to high light and adaptation to climate change. Unfortunately these studies have been terminated due to funding shortage.« less

  12. Molecular Machine Powered Surface Programmatic Chain Reaction for Highly Sensitive Electrochemical Detection of Protein.

    PubMed

    Zhu, Jing; Gan, Haiying; Wu, Jie; Ju, Huangxian

    2018-04-17

    A bipedal molecular machine powered surface programmatic chain reaction was designed for electrochemical signal amplification and highly sensitive electrochemical detection of protein. The bipedal molecular machine was built through aptamer-target specific recognition for the binding of one target protein with two DNA probes, which hybridized with surface-tethered hairpin DNA 1 (H1) via proximity effect to expose the prelocked toehold domain of H1 for the hybridization of ferrocene-labeled hairpin DNA 2 (H2-Fc). The toehold-mediated strand displacement reaction brought the electrochemical signal molecule Fc close to the electrode and meanwhile released the bipedal molecular machine to traverse the sensing surface by the surface programmatic chain reaction. Eventually, a large number of duplex structures of H1-H2 with ferrocene groups facing to the electrode were formed on the sensor surface to generate an amplified electrochemical signal. Using thrombin as a model target, this method showed a linear detection range from 2 pM to 20 nM with a detection limit of 0.76 pM. The proposed detection strategy was enzyme-free and allowed highly sensitive and selective detection of a variety of protein targets by using corresponding DNA-based affinity probes, showing potential application in bioanalysis.

  13. Bypassing the Kohn-Sham equations with machine learning.

    PubMed

    Brockherde, Felix; Vogt, Leslie; Li, Li; Tuckerman, Mark E; Burke, Kieron; Müller, Klaus-Robert

    2017-10-11

    Last year, at least 30,000 scientific papers used the Kohn-Sham scheme of density functional theory to solve electronic structure problems in a wide variety of scientific fields. Machine learning holds the promise of learning the energy functional via examples, bypassing the need to solve the Kohn-Sham equations. This should yield substantial savings in computer time, allowing larger systems and/or longer time-scales to be tackled, but attempts to machine-learn this functional have been limited by the need to find its derivative. The present work overcomes this difficulty by directly learning the density-potential and energy-density maps for test systems and various molecules. We perform the first molecular dynamics simulation with a machine-learned density functional on malonaldehyde and are able to capture the intramolecular proton transfer process. Learning density models now allows the construction of accurate density functionals for realistic molecular systems.Machine learning allows electronic structure calculations to access larger system sizes and, in dynamical simulations, longer time scales. Here, the authors perform such a simulation using a machine-learned density functional that avoids direct solution of the Kohn-Sham equations.

  14. Beta-propeller crystal structure of Psathyrella velutina lectin: an integrin-like fungal protein interacting with monosaccharides and calcium.

    PubMed

    Cioci, Gianluca; Mitchell, Edward P; Chazalet, Valerie; Debray, Henri; Oscarson, Stefan; Lahmann, Martina; Gautier, Catherine; Breton, Christelle; Perez, Serge; Imberty, Anne

    2006-04-14

    The lectin from the mushroom Psathyrella velutina recognises specifically N-acetylglucosamine and N-acetylneuraminic acid containing glycans. The crystal structure of the 401 amino acid residue lectin shows that it adopts a very regular seven-bladed beta-propeller fold with the N-terminal region tucked into the central cavity around the pseudo 7-fold axis. In the complex with N-acetylglucosamine, six monosaccharides are bound in pockets located between two consecutive propeller blades. Due to the repeats shown by the sequence the binding sites are very similar. Five hydrogen bonds between the protein and the sugar hydroxyl and N-acetyl groups stabilize the complex, together with the hydrophobic interactions with a conserved tyrosine and histidine. The complex with N-acetylneuraminic acid shows molecular mimicry with the same hydrogen bond network, but with different orientations of the carbohydrate ring in the binding site. The beta-hairpin loops connecting the two inner beta-strands of each blade are metal binding sites and two to three calcium ions were located in the structure. The multispecificity and high multivalency of this mushroom lectin, combined with its similarity to the extracellular domain of an important class of cell adhesion molecules, integrins, are another example of the outstanding success of beta-propeller structures as molecular binding machines in nature.

  15. Motor proteins and molecular motors: how to operate machines at the nanoscale.

    PubMed

    Kolomeisky, Anatoly B

    2013-11-20

    Several classes of biological molecules that transform chemical energy into mechanical work are known as motor proteins or molecular motors. These nanometer-sized machines operate in noisy stochastic isothermal environments, strongly supporting fundamental cellular processes such as the transfer of genetic information, transport, organization and functioning. In the past two decades motor proteins have become a subject of intense research efforts, aimed at uncovering the fundamental principles and mechanisms of molecular motor dynamics. In this review, we critically discuss recent progress in experimental and theoretical studies on motor proteins. Our focus is on analyzing fundamental concepts and ideas that have been utilized to explain the non-equilibrium nature and mechanisms of molecular motors.

  16. Synthesis of a pH-Sensitive Hetero[4]Rotaxane Molecular Machine that Combines [c2]Daisy and [2]Rotaxane Arrangements.

    PubMed

    Waelès, Philip; Riss-Yaw, Benjamin; Coutrot, Frédéric

    2016-05-10

    The synthesis of a novel pH-sensitive hetero[4]rotaxane molecular machine through a self-sorting strategy is reported. The original tetra-interlocked molecular architecture combines a [c2]daisy chain scaffold linked to two [2]rotaxane units. Actuation of the system through pH variation is possible thanks to the specific interactions of the dibenzo-24-crown-8 (DB24C8) macrocycles for ammonium, anilinium, and triazolium molecular stations. Selective deprotonation of the anilinium moieties triggers shuttling of the unsubstituted DB24C8 along the [2]rotaxane units. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  17. Study of Environmental Data Complexity using Extreme Learning Machine

    NASA Astrophysics Data System (ADS)

    Leuenberger, Michael; Kanevski, Mikhail

    2017-04-01

    The main goals of environmental data science using machine learning algorithm deal, in a broad sense, around the calibration, the prediction and the visualization of hidden relationship between input and output variables. In order to optimize the models and to understand the phenomenon under study, the characterization of the complexity (at different levels) should be taken into account. Therefore, the identification of the linear or non-linear behavior between input and output variables adds valuable information for the knowledge of the phenomenon complexity. The present research highlights and investigates the different issues that can occur when identifying the complexity (linear/non-linear) of environmental data using machine learning algorithm. In particular, the main attention is paid to the description of a self-consistent methodology for the use of Extreme Learning Machines (ELM, Huang et al., 2006), which recently gained a great popularity. By applying two ELM models (with linear and non-linear activation functions) and by comparing their efficiency, quantification of the linearity can be evaluated. The considered approach is accompanied by simulated and real high dimensional and multivariate data case studies. In conclusion, the current challenges and future development in complexity quantification using environmental data mining are discussed. References - Huang, G.-B., Zhu, Q.-Y., Siew, C.-K., 2006. Extreme learning machine: theory and applications. Neurocomputing 70 (1-3), 489-501. - Kanevski, M., Pozdnoukhov, A., Timonin, V., 2009. Machine Learning for Spatial Environmental Data. EPFL Press; Lausanne, Switzerland, p.392. - Leuenberger, M., Kanevski, M., 2015. Extreme Learning Machines for spatial environmental data. Computers and Geosciences 85, 64-73.

  18. Clustering the Orion B giant molecular cloud based on its molecular emission.

    PubMed

    Bron, Emeric; Daudon, Chloé; Pety, Jérôme; Levrier, François; Gerin, Maryvonne; Gratier, Pierre; Orkisz, Jan H; Guzman, Viviana; Bardeau, Sébastien; Goicoechea, Javier R; Liszt, Harvey; Öberg, Karin; Peretto, Nicolas; Sievers, Albrecht; Tremblin, Pascal

    2018-02-01

    Previous attempts at segmenting molecular line maps of molecular clouds have focused on using position-position-velocity data cubes of a single molecular line to separate the spatial components of the cloud. In contrast, wide field spectral imaging over a large spectral bandwidth in the (sub)mm domain now allows one to combine multiple molecular tracers to understand the different physical and chemical phases that constitute giant molecular clouds (GMCs). We aim at using multiple tracers (sensitive to different physical processes and conditions) to segment a molecular cloud into physically/chemically similar regions (rather than spatially connected components), thus disentangling the different physical/chemical phases present in the cloud. We use a machine learning clustering method, namely the Meanshift algorithm, to cluster pixels with similar molecular emission, ignoring spatial information. Clusters are defined around each maximum of the multidimensional Probability Density Function (PDF) of the line integrated intensities. Simple radiative transfer models were used to interpret the astrophysical information uncovered by the clustering analysis. A clustering analysis based only on the J = 1 - 0 lines of three isotopologues of CO proves suffcient to reveal distinct density/column density regimes ( n H ~ 100 cm -3 , ~ 500 cm -3 , and > 1000 cm -3 ), closely related to the usual definitions of diffuse, translucent and high-column-density regions. Adding two UV-sensitive tracers, the J = 1 - 0 line of HCO + and the N = 1 - 0 line of CN, allows us to distinguish two clearly distinct chemical regimes, characteristic of UV-illuminated and UV-shielded gas. The UV-illuminated regime shows overbright HCO + and CN emission, which we relate to a photochemical enrichment effect. We also find a tail of high CN/HCO + intensity ratio in UV-illuminated regions. Finer distinctions in density classes ( n H ~ 7 × 10 3 cm -3 ~ 4 × 10 4 cm -3 ) for the densest regions are also identified, likely related to the higher critical density of the CN and HCO + (1 - 0) lines. These distinctions are only possible because the high-density regions are spatially resolved. Molecules are versatile tracers of GMCs because their line intensities bear the signature of the physics and chemistry at play in the gas. The association of simultaneous multi-line, wide-field mapping and powerful machine learning methods such as the Meanshift clustering algorithm reveals how to decode the complex information available in these molecular tracers.

  19. Molecular, metabolic, and genetic control: An introduction

    NASA Astrophysics Data System (ADS)

    Tyson, John J.; Mackey, Michael C.

    2001-03-01

    The living cell is a miniature, self-reproducing, biochemical machine. Like all machines, it has a power supply, a set of working components that carry out its necessary tasks, and control systems that ensure the proper coordination of these tasks. In this Special Issue, we focus on the molecular regulatory systems that control cell metabolism, gene expression, environmental responses, development, and reproduction. As for the control systems in human-engineered machines, these regulatory networks can be described by nonlinear dynamical equations, for example, ordinary differential equations, reaction-diffusion equations, stochastic differential equations, or cellular automata. The articles collected here illustrate (i) a range of theoretical problems presented by modern concepts of cellular regulation, (ii) some strategies for converting molecular mechanisms into dynamical systems, (iii) some useful mathematical tools for analyzing and simulating these systems, and (iv) the sort of results that derive from serious interplay between theory and experiment.

  20. Cascades in Compartments: En Route to Machine-Assisted Biotechnology.

    PubMed

    Rabe, Kersten S; Müller, Joachim; Skoupi, Marc; Niemeyer, Christof M

    2017-10-23

    Biological compartmentalization is a fundamental principle of life that allows cells to metabolize, propagate, or communicate with their environment. Much research is devoted to understanding this basic principle and to harness biomimetic compartments and catalytic cascades as tools for technological processes. This Review summarizes the current state-of-the-art of these developments, with a special emphasis on length scales, mass transport phenomena, and molecular scaffolding approaches, ranging from small cross-linkers over proteins and nucleic acids to colloids and patterned surfaces. We conclude that the future exploration and exploitation of these complex systems will largely benefit from technical solutions for the integrated, machine-assisted development and maintenance of a next generation of biotechnological processes. These goals should be achievable by implementing microfluidics, robotics, and added manufacturing techniques supplemented by theoretical simulations as well as computer-aided process modeling based on big data obtained from multiscale experimental analyses. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  1. Massively parallel digital high resolution melt for rapid and absolutely quantitative sequence profiling

    NASA Astrophysics Data System (ADS)

    Velez, Daniel Ortiz; Mack, Hannah; Jupe, Julietta; Hawker, Sinead; Kulkarni, Ninad; Hedayatnia, Behnam; Zhang, Yang; Lawrence, Shelley; Fraley, Stephanie I.

    2017-02-01

    In clinical diagnostics and pathogen detection, profiling of complex samples for low-level genotypes represents a significant challenge. Advances in speed, sensitivity, and extent of multiplexing of molecular pathogen detection assays are needed to improve patient care. We report the development of an integrated platform enabling the identification of bacterial pathogen DNA sequences in complex samples in less than four hours. The system incorporates a microfluidic chip and instrumentation to accomplish universal PCR amplification, High Resolution Melting (HRM), and machine learning within 20,000 picoliter scale reactions, simultaneously. Clinically relevant concentrations of bacterial DNA molecules are separated by digitization across 20,000 reactions and amplified with universal primers targeting the bacterial 16S gene. Amplification is followed by HRM sequence fingerprinting in all reactions, simultaneously. The resulting bacteria-specific melt curves are identified by Support Vector Machine learning, and individual pathogen loads are quantified. The platform reduces reaction volumes by 99.995% and achieves a greater than 200-fold increase in dynamic range of detection compared to traditional PCR HRM approaches. Type I and II error rates are reduced by 99% and 100% respectively, compared to intercalating dye-based digital PCR (dPCR) methods. This technology could impact a number of quantitative profiling applications, especially infectious disease diagnostics.

  2. Energy Landscapes: From Protein Folding to Molecular Assembly

    Science.gov Websites

    been used, for example, in DNA origami, in which artificial structures and machines are built in a mechanical processes and eventually to reproduce these in artificial machines. This conference will provide

  3. Machine Learning Helps Identify CHRONO as a Circadian Clock Component

    PubMed Central

    Venkataraman, Anand; Ramanathan, Chidambaram; Kavakli, Ibrahim H.; Hughes, Michael E.; Baggs, Julie E.; Growe, Jacqueline; Liu, Andrew C.; Kim, Junhyong; Hogenesch, John B.

    2014-01-01

    Over the last decades, researchers have characterized a set of “clock genes” that drive daily rhythms in physiology and behavior. This arduous work has yielded results with far-reaching consequences in metabolic, psychiatric, and neoplastic disorders. Recent attempts to expand our understanding of circadian regulation have moved beyond the mutagenesis screens that identified the first clock components, employing higher throughput genomic and proteomic techniques. In order to further accelerate clock gene discovery, we utilized a computer-assisted approach to identify and prioritize candidate clock components. We used a simple form of probabilistic machine learning to integrate biologically relevant, genome-scale data and ranked genes on their similarity to known clock components. We then used a secondary experimental screen to characterize the top candidates. We found that several physically interact with known clock components in a mammalian two-hybrid screen and modulate in vitro cellular rhythms in an immortalized mouse fibroblast line (NIH 3T3). One candidate, Gene Model 129, interacts with BMAL1 and functionally represses the key driver of molecular rhythms, the BMAL1/CLOCK transcriptional complex. Given these results, we have renamed the gene CHRONO (computationally highlighted repressor of the network oscillator). Bi-molecular fluorescence complementation and co-immunoprecipitation demonstrate that CHRONO represses by abrogating the binding of BMAL1 to its transcriptional co-activator CBP. Most importantly, CHRONO knockout mice display a prolonged free-running circadian period similar to, or more drastic than, six other clock components. We conclude that CHRONO is a functional clock component providing a new layer of control on circadian molecular dynamics. PMID:24737000

  4. Mitochondria and the non-genetic origins of cell-to-cell variability: More is different.

    PubMed

    Guantes, Raúl; Díaz-Colunga, Juan; Iborra, Francisco J

    2016-01-01

    Gene expression activity is heterogeneous in a population of isogenic cells. Identifying the molecular basis of this variability will improve our understanding of phenomena like tumor resistance to drugs, virus infection, or cell fate choice. The complexity of the molecular steps and machines involved in transcription and translation could introduce sources of randomness at many levels, but a common constraint to most of these processes is its energy dependence. In eukaryotic cells, most of this energy is provided by mitochondria. A clonal population of cells may show a large variability in the number and functionality of mitochondria. Here, we discuss how differences in the mitochondrial content of each cell contribute to heterogeneity in gene products. Changes in the amount of mitochondria can also entail drastic alterations of a cell's gene expression program, which ultimately leads to phenotypic diversity. Also watch the Video Abstract. © 2015 WILEY Periodicals, Inc.

  5. Microtubules move the nucleus to quiescence.

    PubMed

    Laporte, Damien; Sagot, Isabelle

    2014-01-01

    The nucleus is a cellular compartment that hosts several macro-molecular machines displaying a highly complex spatial organization. This tight architectural orchestration determines not only DNA replication and repair but also regulates gene expression. In budding yeast microtubules play a key role in structuring the nucleus since they condition the Rabl arrangement in G1 and chromosome partitioning during mitosis through their attachment to centromeres via the kinetochore proteins. Recently, we have shown that upon quiescence entry, intranuclear microtubules emanating from the spindle pole body elongate to form a highly stable bundle that spans the entire nucleus. Here, we examine some molecular mechanisms that may underlie the formation of this structure. As the intranuclear microtubule bundle causes a profound re-organization of the yeast nucleus and is required for cell survival during quiescence, we discuss the possibility that the assembly of such a structure participates in quiescence establishment.

  6. The "Origin-of-Life Reactor" and Reduction of CO2 by H2 in Inorganic Precipitates.

    PubMed

    Jackson, J Baz

    2017-08-01

    It has been suggested that inorganic membranes were forerunners of organic membranes at the origin of life. Such membranes, interposed between alkaline fluid in submarine vents and the more acidic Hadean ocean, were thought to house inorganic molecular machines. H + flowed down the pH gradient (ΔpH) from ocean to vent through the molecular machines to drive metabolic reactions for early life. A set of experiments was performed by Herschy et al. (J Mol Evol 79:213-227, 2014) who followed earlier work to construct inorganic precipitate membranes which, they argued, would be transected by a ΔpH. They supposed that inorganic molecular machines might assemble by chance in the precipitate membranes, and be capable of using the ΔpH to drive unfavourable reduction of CO 2 by H 2 to formate and formaldehyde. Indeed, these workers detected both of these compounds in their origin-of-life reaction vessel and contend this was proof of principle for their hypothesis. However, it is shown here by a straightforward calculation that the formate produced was only that which reached on approach to equilibrium without any driving force from ΔpH. We conclude that the reaction was facilitated by isotropic catalysts in the precipitate membrane but not by an anisotropic ΔpH-driven molecular machine.

  7. Learning molecular energies using localized graph kernels.

    PubMed

    Ferré, Grégoire; Haut, Terry; Barros, Kipton

    2017-03-21

    Recent machine learning methods make it possible to model potential energy of atomic configurations with chemical-level accuracy (as calculated from ab initio calculations) and at speeds suitable for molecular dynamics simulation. Best performance is achieved when the known physical constraints are encoded in the machine learning models. For example, the atomic energy is invariant under global translations and rotations; it is also invariant to permutations of same-species atoms. Although simple to state, these symmetries are complicated to encode into machine learning algorithms. In this paper, we present a machine learning approach based on graph theory that naturally incorporates translation, rotation, and permutation symmetries. Specifically, we use a random walk graph kernel to measure the similarity of two adjacency matrices, each of which represents a local atomic environment. This Graph Approximated Energy (GRAPE) approach is flexible and admits many possible extensions. We benchmark a simple version of GRAPE by predicting atomization energies on a standard dataset of organic molecules.

  8. Learning molecular energies using localized graph kernels

    NASA Astrophysics Data System (ADS)

    Ferré, Grégoire; Haut, Terry; Barros, Kipton

    2017-03-01

    Recent machine learning methods make it possible to model potential energy of atomic configurations with chemical-level accuracy (as calculated from ab initio calculations) and at speeds suitable for molecular dynamics simulation. Best performance is achieved when the known physical constraints are encoded in the machine learning models. For example, the atomic energy is invariant under global translations and rotations; it is also invariant to permutations of same-species atoms. Although simple to state, these symmetries are complicated to encode into machine learning algorithms. In this paper, we present a machine learning approach based on graph theory that naturally incorporates translation, rotation, and permutation symmetries. Specifically, we use a random walk graph kernel to measure the similarity of two adjacency matrices, each of which represents a local atomic environment. This Graph Approximated Energy (GRAPE) approach is flexible and admits many possible extensions. We benchmark a simple version of GRAPE by predicting atomization energies on a standard dataset of organic molecules.

  9. Analytic Complexity and Challenges in Identifying Mixtures of Exposures Associated with Phenotypes in the Exposome Era.

    PubMed

    Patel, Chirag J

    2017-01-01

    Mixtures, or combinations and interactions between multiple environmental exposures, are hypothesized to be causally linked with disease and health-related phenotypes. Established and emerging molecular measurement technologies to assay the exposome , the comprehensive battery of exposures encountered from birth to death, promise a new way of identifying mixtures in disease in the epidemiological setting. In this opinion, we describe the analytic complexity and challenges in identifying mixtures associated with phenotype and disease. Existing and emerging machine-learning methods and data analytic approaches (e.g., "environment-wide association studies" [EWASs]), as well as large cohorts may enhance possibilities to identify mixtures of correlated exposures associated with phenotypes; however, the analytic complexity of identifying mixtures is immense. If the exposome concept is realized, new analytical methods and large sample sizes will be required to ascertain how mixtures are associated with disease. The author recommends documenting prevalent correlated exposures and replicated main effects prior to identifying mixtures.

  10. Tomography and generative training with quantum Boltzmann machines

    NASA Astrophysics Data System (ADS)

    Kieferová, Mária; Wiebe, Nathan

    2017-12-01

    The promise of quantum neural nets, which utilize quantum effects to model complex data sets, has made their development an aspirational goal for quantum machine learning and quantum computing in general. Here we provide methods of training quantum Boltzmann machines. Our work generalizes existing methods and provides additional approaches for training quantum neural networks that compare favorably to existing methods. We further demonstrate that quantum Boltzmann machines enable a form of partial quantum state tomography that further provides a generative model for the input quantum state. Classical Boltzmann machines are incapable of this. This verifies the long-conjectured connection between tomography and quantum machine learning. Finally, we prove that classical computers cannot simulate our training process in general unless BQP=BPP , provide lower bounds on the complexity of the training procedures and numerically investigate training for small nonstoquastic Hamiltonians.

  11. Graph Kernels for Molecular Similarity.

    PubMed

    Rupp, Matthias; Schneider, Gisbert

    2010-04-12

    Molecular similarity measures are important for many cheminformatics applications like ligand-based virtual screening and quantitative structure-property relationships. Graph kernels are formal similarity measures defined directly on graphs, such as the (annotated) molecular structure graph. Graph kernels are positive semi-definite functions, i.e., they correspond to inner products. This property makes them suitable for use with kernel-based machine learning algorithms such as support vector machines and Gaussian processes. We review the major types of kernels between graphs (based on random walks, subgraphs, and optimal assignments, respectively), and discuss their advantages, limitations, and successful applications in cheminformatics. Copyright © 2010 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  12. Reversibility in Quantum Models of Stochastic Processes

    NASA Astrophysics Data System (ADS)

    Gier, David; Crutchfield, James; Mahoney, John; James, Ryan

    Natural phenomena such as time series of neural firing, orientation of layers in crystal stacking and successive measurements in spin-systems are inherently probabilistic. The provably minimal classical models of such stochastic processes are ɛ-machines, which consist of internal states, transition probabilities between states and output values. The topological properties of the ɛ-machine for a given process characterize the structure, memory and patterns of that process. However ɛ-machines are often not ideal because their statistical complexity (Cμ) is demonstrably greater than the excess entropy (E) of the processes they represent. Quantum models (q-machines) of the same processes can do better in that their statistical complexity (Cq) obeys the relation Cμ >= Cq >= E. q-machines can be constructed to consider longer lengths of strings, resulting in greater compression. With code-words of sufficiently long length, the statistical complexity becomes time-symmetric - a feature apparently novel to this quantum representation. This result has ramifications for compression of classical information in quantum computing and quantum communication technology.

  13. A Genetic Algorithm Based Support Vector Machine Model for Blood-Brain Barrier Penetration Prediction

    PubMed Central

    Zhang, Daqing; Xiao, Jianfeng; Zhou, Nannan; Luo, Xiaomin; Jiang, Hualiang; Chen, Kaixian

    2015-01-01

    Blood-brain barrier (BBB) is a highly complex physical barrier determining what substances are allowed to enter the brain. Support vector machine (SVM) is a kernel-based machine learning method that is widely used in QSAR study. For a successful SVM model, the kernel parameters for SVM and feature subset selection are the most important factors affecting prediction accuracy. In most studies, they are treated as two independent problems, but it has been proven that they could affect each other. We designed and implemented genetic algorithm (GA) to optimize kernel parameters and feature subset selection for SVM regression and applied it to the BBB penetration prediction. The results show that our GA/SVM model is more accurate than other currently available log BB models. Therefore, to optimize both SVM parameters and feature subset simultaneously with genetic algorithm is a better approach than other methods that treat the two problems separately. Analysis of our log BB model suggests that carboxylic acid group, polar surface area (PSA)/hydrogen-bonding ability, lipophilicity, and molecular charge play important role in BBB penetration. Among those properties relevant to BBB penetration, lipophilicity could enhance the BBB penetration while all the others are negatively correlated with BBB penetration. PMID:26504797

  14. Fullrmc, a rigid body Reverse Monte Carlo modeling package enabled with machine learning and artificial intelligence.

    PubMed

    Aoun, Bachir

    2016-05-05

    A new Reverse Monte Carlo (RMC) package "fullrmc" for atomic or rigid body and molecular, amorphous, or crystalline materials is presented. fullrmc main purpose is to provide a fully modular, fast and flexible software, thoroughly documented, complex molecules enabled, written in a modern programming language (python, cython, C and C++ when performance is needed) and complying to modern programming practices. fullrmc approach in solving an atomic or molecular structure is different from existing RMC algorithms and software. In a nutshell, traditional RMC methods and software randomly adjust atom positions until the whole system has the greatest consistency with a set of experimental data. In contrast, fullrmc applies smart moves endorsed with reinforcement machine learning to groups of atoms. While fullrmc allows running traditional RMC modeling, the uniqueness of this approach resides in its ability to customize grouping atoms in any convenient way with no additional programming efforts and to apply smart and more physically meaningful moves to the defined groups of atoms. In addition, fullrmc provides a unique way with almost no additional computational cost to recur a group's selection, allowing the system to go out of local minimas by refining a group's position or exploring through and beyond not allowed positions and energy barriers the unrestricted three dimensional space around a group. © 2016 Wiley Periodicals, Inc.

  15. Fullrmc, a rigid body reverse monte carlo modeling package enabled with machine learning and artificial intelligence

    DOE PAGES

    Aoun, Bachir

    2016-01-22

    Here, a new Reverse Monte Carlo (RMC) package ‘fullrmc’ for atomic or rigid body and molecular, amorphous or crystalline materials is presented. fullrmc main purpose is to provide a fully modular, fast and flexible software, thoroughly documented, complex molecules enabled, written in a modern programming language (python, cython ,C and C++ when performance is needed) and complying to modern programming practices. fullrmc approach in solving an atomic or molecular structure is different from existing RMC algorithms and software. In a nutshell, traditional RMC methods and software randomly adjust atom positions until the whole system has the greatest consistency with amore » set of experimental data. In contrast, fullrmc applies smart moves endorsed with reinforcement machine learning to groups of atoms. While fullrmc allows running traditional RMC modelling, the uniqueness of this approach resides in its ability to customize grouping atoms in any convenient way with no additional programming efforts and to apply smart and more physically meaningful moves to the defined groups of atoms. Also fullrmc provides a unique way with almost no additional computational cost to recur a group’s selection, allowing the system to go out of local minimas by refining a group’s position or exploring through and beyond not allowed positions and energy barriers the unrestricted three dimensional space around a group.« less

  16. Fullrmc, a rigid body reverse monte carlo modeling package enabled with machine learning and artificial intelligence

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Aoun, Bachir

    Here, a new Reverse Monte Carlo (RMC) package ‘fullrmc’ for atomic or rigid body and molecular, amorphous or crystalline materials is presented. fullrmc main purpose is to provide a fully modular, fast and flexible software, thoroughly documented, complex molecules enabled, written in a modern programming language (python, cython ,C and C++ when performance is needed) and complying to modern programming practices. fullrmc approach in solving an atomic or molecular structure is different from existing RMC algorithms and software. In a nutshell, traditional RMC methods and software randomly adjust atom positions until the whole system has the greatest consistency with amore » set of experimental data. In contrast, fullrmc applies smart moves endorsed with reinforcement machine learning to groups of atoms. While fullrmc allows running traditional RMC modelling, the uniqueness of this approach resides in its ability to customize grouping atoms in any convenient way with no additional programming efforts and to apply smart and more physically meaningful moves to the defined groups of atoms. Also fullrmc provides a unique way with almost no additional computational cost to recur a group’s selection, allowing the system to go out of local minimas by refining a group’s position or exploring through and beyond not allowed positions and energy barriers the unrestricted three dimensional space around a group.« less

  17. A systematic approach to prioritize drug targets using machine learning, a molecular descriptor-based classification model, and high-throughput screening of plant derived molecules: a case study in oral cancer.

    PubMed

    Randhawa, Vinay; Kumar Singh, Anil; Acharya, Vishal

    2015-12-01

    Systems-biology inspired identification of drug targets and machine learning-based screening of small molecules which modulate their activity have the potential to revolutionize modern drug discovery by complementing conventional methods. To utilize the effectiveness of such pipelines, we first analyzed the dysregulated gene pairs between control and tumor samples and then implemented an ensemble-based feature selection approach to prioritize targets in oral squamous cell carcinoma (OSCC) for therapeutic exploration. Based on the structural information of known inhibitors of CXCR4-one of the best targets identified in this study-a feature selection was implemented for the identification of optimal structural features (molecular descriptor) based on which a classification model was generated. Furthermore, the CXCR4-centered descriptor-based classification model was finally utilized to screen a repository of plant derived small-molecules to obtain potential inhibitors. The application of our methodology may assist effective selection of the best targets which may have previously been overlooked, that in turn will lead to the development of new oral cancer medications. The small molecules identified in this study can be ideal candidates for trials as potential novel anti-oral cancer agents. Importantly, distinct steps of this whole study may provide reference for the analysis of other complex human diseases.

  18. Synthetic oligorotaxanes exert high forces when folding under mechanical load

    NASA Astrophysics Data System (ADS)

    Sluysmans, Damien; Hubert, Sandrine; Bruns, Carson J.; Zhu, Zhixue; Stoddart, J. Fraser; Duwez, Anne-Sophie

    2018-01-01

    Folding is a ubiquitous process that nature uses to control the conformations of its molecular machines, allowing them to perform chemical and mechanical tasks. Over the years, chemists have synthesized foldamers that adopt well-defined and stable folded architectures, mimicking the control expressed by natural systems1,2. Mechanically interlocked molecules, such as rotaxanes and catenanes, are prototypical molecular machines that enable the controlled movement and positioning of their component parts3-5. Recently, combining the exquisite complexity of these two classes of molecules, donor-acceptor oligorotaxane foldamers have been synthesized, in which interactions between the mechanically interlocked component parts dictate the single-molecule assembly into a folded secondary structure6-8. Here we report on the mechanochemical properties of these molecules. We use atomic force microscopy-based single-molecule force spectroscopy to mechanically unfold oligorotaxanes, made of oligomeric dumbbells incorporating 1,5-dioxynaphthalene units encircled by cyclobis(paraquat-p-phenylene) rings. Real-time capture of fluctuations between unfolded and folded states reveals that the molecules exert forces of up to 50 pN against a mechanical load of up to 150 pN, and displays transition times of less than 10 μs. While the folding is at least as fast as that observed in proteins, it is remarkably more robust, thanks to the mechanically interlocked structure. Our results show that synthetic oligorotaxanes have the potential to exceed the performance of natural folding proteins.

  19. Learning Kinetic Monte Carlo Models of Condensed Phase High Temperature Chemistry from Molecular Dynamics

    NASA Astrophysics Data System (ADS)

    Yang, Qian; Sing-Long, Carlos; Chen, Enze; Reed, Evan

    2017-06-01

    Complex chemical processes, such as the decomposition of energetic materials and the chemistry of planetary interiors, are typically studied using large-scale molecular dynamics simulations that run for weeks on high performance parallel machines. These computations may involve thousands of atoms forming hundreds of molecular species and undergoing thousands of reactions. It is natural to wonder whether this wealth of data can be utilized to build more efficient, interpretable, and predictive models. In this talk, we will use techniques from statistical learning to develop a framework for constructing Kinetic Monte Carlo (KMC) models from molecular dynamics data. We will show that our KMC models can not only extrapolate the behavior of the chemical system by as much as an order of magnitude in time, but can also be used to study the dynamics of entirely different chemical trajectories with a high degree of fidelity. Then, we will discuss three different methods for reducing our learned KMC models, including a new and efficient data-driven algorithm using L1-regularization. We demonstrate our framework throughout on a system of high-temperature high-pressure liquid methane, thought to be a major component of gas giant planetary interiors.

  20. Quantifying the assembly of multicomponent molecular machines by single-molecule total internal reflection fluorescence microscopy

    PubMed Central

    Boehm, Elizabeth M.; Subramanyam, Shyamal; Ghoneim, Mohamed; Washington, M. Todd; Spies, Maria

    2016-01-01

    Large, dynamic macromolecular complexes play essential roles in many cellular processes. Knowing how the components of these complexes associate with one another and undergo structural rearrangements is critical to understanding how they function. Single-molecule total internal reflection fluorescence (TIRF) microscopy is a powerful approach for addressing these fundamental issues. In this article, we first discuss single-molecule TIRF microscopes and strategies to immobilize and fluorescently label macromolecules. We then review the use of single-molecule TIRF microscopy to study the formation of binary macromolecular complexes using one-color imaging and inhibitors. We conclude with a discussion of the use of TIRF microscopy to examine the formation of higher-order (i.e., ternary, quaternary, etc.) complexes using multi-color setups. The focus throughout this article is on experimental design, controls, data acquisition, and data analysis. We hope that single-molecule TIRF microscopy, which has largely been the province of specialists, will soon become as common in the tool box of biophysicists and biochemists as structural approaches has become today. PMID:27793278

  1. Are Nonadiabatic Reaction Dynamics the Key to Novel Organosilicon Molecules? The Silicon (Si(3P))-Dimethylacetylene (C4H6(X1A1g)) System as a Case Study.

    PubMed

    Thomas, Aaron M; Dangi, Beni B; Yang, Tao; Kaiser, Ralf I; Lin, Lin; Chou, Tzu-Jung; Chang, Agnes H H

    2018-06-06

    The bimolecular gas phase reaction of ground-state silicon (Si; 3 P) with dimethylacetylene (C 4 H 6 ; X 1 A 1g ) was investigated under single collision conditions in a crossed molecular beams machine. Merged with electronic structure calculations, the data propose nonadiabatic reaction dynamics leading to the formation of singlet SiC 4 H 4 isomer(s) and molecular hydrogen (H 2 ) via indirect scattering dynamics along with intersystem crossing (ISC) from the triplet to the singlet surface. The reaction may lead to distinct energetically accessible singlet SiC 4 H 4 isomers ( 1 p8- 1 p24) in overall exoergic reaction(s) (-107 -20 +12 kJ mol -1 ). All feasible reaction products are either cyclic, carry carbene analogous silylene moieties, or carry C-Si-H or C-Si-C bonds that would require extensive isomerization from the initial collision complex(es) to the fragmenting singlet intermediate(s). The present study demonstrates the first successful crossed beams study of an exoergic reaction channel arising from bimolecular collisions of silicon, Si( 3 P), with a hydrocarbon molecule.

  2. X-ray structure determination using low-resolution electron microscopy maps for molecular replacement

    DOE PAGES

    Jackson, Ryan N.; McCoy, Airlie J.; Terwilliger, Thomas C.; ...

    2015-07-30

    Structures of multi-subunit macromolecular machines are primarily determined by either electron microscopy (EM) or X-ray crystallography. In many cases, a structure for a complex can be obtained at low resolution (at a coarse level of detail) with EM and at higher resolution (with finer detail) by X-ray crystallography. The integration of these two structural techniques is becoming increasingly important for generating atomic models of macromolecular complexes. A low-resolution EM image can be a powerful tool for obtaining the "phase" information that is missing from an X-ray crystallography experiment, however integration of EM and X-ray diffraction data has been technically challenging.more » Here we show a step-by-step protocol that explains how low-resolution EM maps can be placed in the crystallographic unit cell by molecular replacement, and how initial phases computed from the placed EM density are extended to high resolution by averaging maps over non-crystallographic symmetry. As the resolution gap between EM and Xray crystallography continues to narrow, the use of EM maps to help with X-ray crystal structure determination, as described in this protocol, will become increasingly effective.« less

  3. The Security of Machine Learning

    DTIC Science & Technology

    2008-04-24

    Machine learning has become a fundamental tool for computer security, since it can rapidly evolve to changing and complex situations. That...adaptability is also a vulnerability: attackers can exploit machine learning systems. We present a taxonomy identifying and analyzing attacks against machine ...We use our framework to survey and analyze the literature of attacks against machine learning systems. We also illustrate our taxonomy by showing

  4. A nanojet: propulsion of a molecular machine by an asymmetric distribution of reaction--products

    NASA Astrophysics Data System (ADS)

    Liverpool, Tanniemola; Golestanian, Ramin; Ajdari, Armand

    2006-03-01

    A simple model for the reaction-driven propulsion of a small device is proposed as a model for (part of) a molecular machine in aqueous media. Motion of the device is driven by an asymmetric distribution of reaction products. We calculate the propulsive velocity of the device as well as the scale of the velocity fluctuations. We also consider the effects of hydrodynamic flow as well as a number of different scenarios for the kinetics of the reaction.

  5. Propulsion of a Molecular Machine by Asymmetric Distribution of Reaction Products

    NASA Astrophysics Data System (ADS)

    Golestanian, Ramin; Liverpool, Tanniemola B.; Ajdari, Armand

    2005-06-01

    A simple model for the reaction-driven propulsion of a small device is proposed as a model for (part of) a molecular machine in aqueous media. The motion of the device is driven by an asymmetric distribution of reaction products. The propulsive velocity of the device is calculated as well as the scale of the velocity fluctuations. The effects of hydrodynamic flow as well as a number of different scenarios for the kinetics of the reaction are addressed.

  6. Propulsion of a molecular machine by asymmetric distribution of reaction products.

    PubMed

    Golestanian, Ramin; Liverpool, Tanniemola B; Ajdari, Armand

    2005-06-10

    A simple model for the reaction-driven propulsion of a small device is proposed as a model for (part of) a molecular machine in aqueous media. The motion of the device is driven by an asymmetric distribution of reaction products. The propulsive velocity of the device is calculated as well as the scale of the velocity fluctuations. The effects of hydrodynamic flow as well as a number of different scenarios for the kinetics of the reaction are addressed.

  7. On Machine Capacitance Dimensional and Surface Profile Measurement System

    NASA Technical Reports Server (NTRS)

    Resnick, Ralph

    1993-01-01

    A program was awarded under the Air Force Machine Tool Sensor Improvements Program Research and Development Announcement to develop and demonstrate the use of a Capacitance Sensor System including Capacitive Non-Contact Analog Probe and a Capacitive Array Dimensional Measurement System to check the dimensions of complex shapes and contours on a machine tool or in an automated inspection cell. The manufacturing of complex shapes and contours and the subsequent verification of those manufactured shapes is fundamental and widespread throughout industry. The critical profile of a gear tooth; the overall shape of a graphite EDM electrode; the contour of a turbine blade in a jet engine; and countless other components in varied applications possess complex shapes that require detailed and complex inspection procedures. Current inspection methods for complex shapes and contours are expensive, time-consuming, and labor intensive.

  8. Early experiences in developing and managing the neuroscience gateway.

    PubMed

    Sivagnanam, Subhashini; Majumdar, Amit; Yoshimoto, Kenneth; Astakhov, Vadim; Bandrowski, Anita; Martone, MaryAnn; Carnevale, Nicholas T

    2015-02-01

    The last few decades have seen the emergence of computational neuroscience as a mature field where researchers are interested in modeling complex and large neuronal systems and require access to high performance computing machines and associated cyber infrastructure to manage computational workflow and data. The neuronal simulation tools, used in this research field, are also implemented for parallel computers and suitable for high performance computing machines. But using these tools on complex high performance computing machines remains a challenge because of issues with acquiring computer time on these machines located at national supercomputer centers, dealing with complex user interface of these machines, dealing with data management and retrieval. The Neuroscience Gateway is being developed to alleviate and/or hide these barriers to entry for computational neuroscientists. It hides or eliminates, from the point of view of the users, all the administrative and technical barriers and makes parallel neuronal simulation tools easily available and accessible on complex high performance computing machines. It handles the running of jobs and data management and retrieval. This paper shares the early experiences in bringing up this gateway and describes the software architecture it is based on, how it is implemented, and how users can use this for computational neuroscience research using high performance computing at the back end. We also look at parallel scaling of some publicly available neuronal models and analyze the recent usage data of the neuroscience gateway.

  9. Early experiences in developing and managing the neuroscience gateway

    PubMed Central

    Sivagnanam, Subhashini; Majumdar, Amit; Yoshimoto, Kenneth; Astakhov, Vadim; Bandrowski, Anita; Martone, MaryAnn; Carnevale, Nicholas. T.

    2015-01-01

    SUMMARY The last few decades have seen the emergence of computational neuroscience as a mature field where researchers are interested in modeling complex and large neuronal systems and require access to high performance computing machines and associated cyber infrastructure to manage computational workflow and data. The neuronal simulation tools, used in this research field, are also implemented for parallel computers and suitable for high performance computing machines. But using these tools on complex high performance computing machines remains a challenge because of issues with acquiring computer time on these machines located at national supercomputer centers, dealing with complex user interface of these machines, dealing with data management and retrieval. The Neuroscience Gateway is being developed to alleviate and/or hide these barriers to entry for computational neuroscientists. It hides or eliminates, from the point of view of the users, all the administrative and technical barriers and makes parallel neuronal simulation tools easily available and accessible on complex high performance computing machines. It handles the running of jobs and data management and retrieval. This paper shares the early experiences in bringing up this gateway and describes the software architecture it is based on, how it is implemented, and how users can use this for computational neuroscience research using high performance computing at the back end. We also look at parallel scaling of some publicly available neuronal models and analyze the recent usage data of the neuroscience gateway. PMID:26523124

  10. Machine learning phases of matter

    NASA Astrophysics Data System (ADS)

    Carrasquilla, Juan; Melko, Roger G.

    2017-02-01

    Condensed-matter physics is the study of the collective behaviour of infinitely complex assemblies of electrons, nuclei, magnetic moments, atoms or qubits. This complexity is reflected in the size of the state space, which grows exponentially with the number of particles, reminiscent of the `curse of dimensionality' commonly encountered in machine learning. Despite this curse, the machine learning community has developed techniques with remarkable abilities to recognize, classify, and characterize complex sets of data. Here, we show that modern machine learning architectures, such as fully connected and convolutional neural networks, can identify phases and phase transitions in a variety of condensed-matter Hamiltonians. Readily programmable through modern software libraries, neural networks can be trained to detect multiple types of order parameter, as well as highly non-trivial states with no conventional order, directly from raw state configurations sampled with Monte Carlo.

  11. Machine learning of molecular properties: Locality and active learning

    NASA Astrophysics Data System (ADS)

    Gubaev, Konstantin; Podryabinkin, Evgeny V.; Shapeev, Alexander V.

    2018-06-01

    In recent years, the machine learning techniques have shown great potent1ial in various problems from a multitude of disciplines, including materials design and drug discovery. The high computational speed on the one hand and the accuracy comparable to that of density functional theory on another hand make machine learning algorithms efficient for high-throughput screening through chemical and configurational space. However, the machine learning algorithms available in the literature require large training datasets to reach the chemical accuracy and also show large errors for the so-called outliers—the out-of-sample molecules, not well-represented in the training set. In the present paper, we propose a new machine learning algorithm for predicting molecular properties that addresses these two issues: it is based on a local model of interatomic interactions providing high accuracy when trained on relatively small training sets and an active learning algorithm of optimally choosing the training set that significantly reduces the errors for the outliers. We compare our model to the other state-of-the-art algorithms from the literature on the widely used benchmark tests.

  12. Support vector machine prediction of enzyme function with conjoint triad feature and hierarchical context.

    PubMed

    Wang, Yong-Cui; Wang, Yong; Yang, Zhi-Xia; Deng, Nai-Yang

    2011-06-20

    Enzymes are known as the largest class of proteins and their functions are usually annotated by the Enzyme Commission (EC), which uses a hierarchy structure, i.e., four numbers separated by periods, to classify the function of enzymes. Automatically categorizing enzyme into the EC hierarchy is crucial to understand its specific molecular mechanism. In this paper, we introduce two key improvements in predicting enzyme function within the machine learning framework. One is to introduce the efficient sequence encoding methods for representing given proteins. The second one is to develop a structure-based prediction method with low computational complexity. In particular, we propose to use the conjoint triad feature (CTF) to represent the given protein sequences by considering not only the composition of amino acids but also the neighbor relationships in the sequence. Then we develop a support vector machine (SVM)-based method, named as SVMHL (SVM for hierarchy labels), to output enzyme function by fully considering the hierarchical structure of EC. The experimental results show that our SVMHL with the CTF outperforms SVMHL with the amino acid composition (AAC) feature both in predictive accuracy and Matthew's correlation coefficient (MCC). In addition, SVMHL with the CTF obtains the accuracy and MCC ranging from 81% to 98% and 0.82 to 0.98 when predicting the first three EC digits on a low-homologous enzyme dataset. We further demonstrate that our method outperforms the methods which do not take account of hierarchical relationship among enzyme categories and alternative methods which incorporate prior knowledge about inter-class relationships. Our structure-based prediction model, SVMHL with the CTF, reduces the computational complexity and outperforms the alternative approaches in enzyme function prediction. Therefore our new method will be a useful tool for enzyme function prediction community.

  13. Tools and procedures for visualization of proteins and other biomolecules.

    PubMed

    Pan, Lurong; Aller, Stephen G

    2015-04-01

    Protein, peptides, and nucleic acids are biomolecules that drive biological processes in living organisms. An enormous amount of structural data for a large number of these biomolecules has been described with atomic precision in the form of structural "snapshots" that are freely available in public repositories. These snapshots can help explain how the biomolecules function, the nature of interactions between multi-molecular complexes, and even how small-molecule drugs can modulate the biomolecules for clinical benefits. Furthermore, these structural snapshots serve as inputs for sophisticated computer simulations to turn the biomolecules into moving, "breathing" molecular machines for understanding their dynamic properties in real-time computer simulations. In order for the researcher to take advantage of such a wealth of structural data, it is necessary to gain competency in the use of computer molecular visualization tools for exploring the structures and visualizing three-dimensional spatial representations. Here, we present protocols for using two common visualization tools--the Web-based Jmol and the stand-alone PyMOL package--as well as a few examples of other popular tools. Copyright © 2015 John Wiley & Sons, Inc.

  14. Multitasking runtime systems for the Cedar Multiprocessor

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Guzzi, M.D.

    1986-07-01

    The programming of a MIMD machine is more complex than for SISD and SIMD machines. The multiple computational resources of the machine must be made available to the programming language compiler and to the programmer so that multitasking programs may be written. This thesis will explore the additional complexity of programming a MIMD machine, the Cedar Multiprocessor specifically, and the multitasking runtime system necessary to provide multitasking resources to the user. First, the problem will be well defined: the Cedar machine, its operating system, the programming language, and multitasking concepts will be described. Second, a solution to the problem, calledmore » macrotasking, will be proposed. This solution provides multitasking facilities to the programmer at a very coarse level with many visible machine dependencies. Third, an alternate solution, called microtasking, will be proposed. This solution provides multitasking facilities of a much finer grain. This solution does not depend so rigidly on the specific architecture of the machine. Finally, the two solutions will be compared for effectiveness. 12 refs., 16 figs.« less

  15. Metal-organic frameworks with dynamic interlocked components

    NASA Astrophysics Data System (ADS)

    Vukotic, V. Nicholas; Harris, Kristopher J.; Zhu, Kelong; Schurko, Robert W.; Loeb, Stephen J.

    2012-06-01

    The dynamics of mechanically interlocked molecules such as rotaxanes and catenanes have been studied in solution as examples of rudimentary molecular switches and machines, but in this medium, the molecules are randomly dispersed and their motion incoherent. As a strategy for achieving a higher level of molecular organization, we have constructed a metal-organic framework material using a [2]rotaxane as the organic linker and binuclear Cu(II) units as the nodes. Activation of the as-synthesized material creates a void space inside the rigid framework that allows the soft macrocyclic ring of the [2]rotaxane to rotate rapidly, unimpeded by neighbouring molecular components. Variable-temperature 13C and 2H solid-state NMR experiments are used to characterize the nature and rate of the dynamic processes occurring inside this unique material. These results provide a blueprint for the future creation of solid-state molecular switches and molecular machines based on mechanically interlocked molecules.

  16. Single molecule thermodynamics in biological motors.

    PubMed

    Taniguchi, Yuichi; Karagiannis, Peter; Nishiyama, Masayoshi; Ishii, Yoshiharu; Yanagida, Toshio

    2007-04-01

    Biological molecular machines use thermal activation energy to carry out various functions. The process of thermal activation has the stochastic nature of output events that can be described according to the laws of thermodynamics. Recently developed single molecule detection techniques have allowed each distinct enzymatic event of single biological machines to be characterized providing clues to the underlying thermodynamics. In this study, the thermodynamic properties in the stepping movement of a biological molecular motor have been examined. A single molecule detection technique was used to measure the stepping movements at various loads and temperatures and a range of thermodynamic parameters associated with the production of each forward and backward step including free energy, enthalpy, entropy and characteristic distance were obtained. The results show that an asymmetry in entropy is a primary factor that controls the direction in which the motor will step. The investigation on single molecule thermodynamics has the potential to reveal dynamic properties underlying the mechanisms of how biological molecular machines work.

  17. LOOP IIId of the HCV IRES is essential for the structural rearrangement of the 40S-HCV IRES complex

    PubMed Central

    Angulo, Jenniffer; Ulryck, Nathalie; Deforges, Jules; Chamond, Nathalie; Lopez-Lastra, Marcelo; Masquida, Benoît; Sargueil, Bruno

    2016-01-01

    As obligatory intracellular parasites, viruses rely on cellular machines to complete their life cycle, and most importantly they recruit the host ribosomes to translate their mRNA. The Hepatitis C viral mRNA initiates translation by directly binding the 40S ribosomal subunit in such a way that the initiation codon is correctly positioned in the P site of the ribosome. Such a property is likely to be central for many viruses, therefore the description of host-pathogen interaction at the molecular level is instrumental to provide new therapeutic targets. In this study, we monitored the 40S ribosomal subunit and the viral RNA structural rearrangement induced upon the formation of the binary complex. We further took advantage of an IRES viral mutant mRNA deficient for translation to identify the interactions necessary to promote translation. Using a combination of structure probing in solution and molecular modeling we establish a whole atom model which appears to be very similar to the one obtained recently by cryoEM. Our model brings new information on the complex, and most importantly reveals some structural rearrangement within the ribosome. This study suggests that the formation of a ‘kissing complex’ between the viral RNA and the 18S ribosomal RNA locks the 40S ribosomal subunit in a conformation proficient for translation. PMID:26626152

  18. Characteristics for electrochemical machining with nanoscale voltage pulses.

    PubMed

    Lee, E S; Back, S Y; Lee, J T

    2009-06-01

    Electrochemical machining has traditionally been used in highly specialized fields, such as those of the aerospace and defense industries. It is now increasingly being applied in other industries, where parts with difficult-to-cut material, complex geometry and tribology, and devices of nanoscale and microscale are required. Electric characteristic plays a principal function role in and chemical characteristic plays an assistant function role in electrochemical machining. Therefore, essential parameters in electrochemical machining can be described current density, machining time, inter-electrode gap size, electrolyte, electrode shape etc. Electrochemical machining provides an economical and effective method for machining high strength, high tension and heat-resistant materials into complex shapes such as turbine blades of titanium and aluminum alloys. The application of nanoscale voltage pulses between a tool electrode and a workpiece in an electrochemical environment allows the three-dimensional machining of conducting materials with sub-micrometer precision. In this study, micro probe are developed by electrochemical etching and micro holes are manufactured using these micro probe as tool electrodes. Micro holes and microgroove can be accurately achieved by using nanoscale voltages pulses.

  19. Trapping of the Enoyl-Acyl Carrier Protein Reductase–Acyl Carrier Protein Interaction

    PubMed Central

    Tallorin, Lorillee; Finzel, Kara; Nguyen, Quynh G.; Beld, Joris; La Clair, James J.; Burkart, Michael D.

    2016-01-01

    An ideal target for metabolic engineering, fatty acid biosynthesis remains poorly understood on a molecular level. These carrier protein-dependent pathways require fundamental protein–protein interactions to guide reactivity and processivity, and their control has become one of the major hurdles in successfully adapting these biological machines. Our laboratory has developed methods to prepare acyl carrier proteins (ACPs) loaded with substrate mimetics and cross-linkers to visualize and trap interactions with partner enzymes, and we continue to expand the tools for studying these pathways. We now describe application of the slow-onset, tight-binding inhibitor triclosan to explore the interactions between the type II fatty acid ACP from Escherichia coli, AcpP, and its corresponding enoyl-ACP reductase, FabI. We show that the AcpP–triclosan complex demonstrates nM binding, inhibits in vitro activity, and can be used to isolate FabI in complex proteomes. PMID:26938266

  20. A communication theoretical analysis of FRET-based mobile ad hoc molecular nanonetworks.

    PubMed

    Kuscu, Murat; Akan, Ozgur B

    2014-09-01

    Nanonetworks refer to a group of nanosized machines with very basic operational capabilities communicating to each other in order to accomplish more complex tasks such as in-body drug delivery, or chemical defense. Realizing reliable and high-rate communication between these nanomachines is a fundamental problem for the practicality of these nanonetworks. Recently, we have proposed a molecular communication method based on Förster Resonance Energy Transfer (FRET) which is a nonradiative excited state energy transfer phenomenon observed among fluorescent molecules, i.e., fluorophores. We have modeled the FRET-based communication channel considering the fluorophores as single-molecular immobile nanomachines, and shown its reliability at high rates, and practicality at the current stage of nanotechnology. In this study, for the first time in the literature, we investigate the network of mobile nanomachines communicating through FRET. We introduce two novel mobile molecular nanonetworks: FRET-based mobile molecular sensor/actor nanonetwork (FRET-MSAN) which is a distributed system of mobile fluorophores acting as sensor or actor node; and FRET-based mobile ad hoc molecular nanonetwork (FRET-MAMNET) which consists of fluorophore-based nanotransmitter, nanoreceivers and nanorelays. We model the single message propagation based on birth-death processes with continuous time Markov chains. We evaluate the performance of FRET-MSAN and FRET-MAMNET in terms of successful transmission probability and mean extinction time of the messages, system throughput, channel capacity and achievable communication rates.

  1. Repetitive Protein Unfolding by the trans Ring of the GroEL-GroES Chaperonin Complex Stimulates Folding*

    PubMed Central

    Lin, Zong; Puchalla, Jason; Shoup, Daniel; Rye, Hays S.

    2013-01-01

    A key constraint on the growth of most organisms is the slow and inefficient folding of many essential proteins. To deal with this problem, several diverse families of protein folding machines, known collectively as molecular chaperones, developed early in evolutionary history. The functional role and operational steps of these remarkably complex nanomachines remain subjects of active debate. Here we present evidence that, for the GroEL-GroES chaperonin system, the non-native substrate protein enters the folding cycle on the trans ring of the double-ring GroEL-ATP-GroES complex rather than the ADP-bound complex. The properties of this ATP complex are designed to ensure that non-native substrate protein binds first, followed by ATP and finally GroES. This binding order ensures efficient occupancy of the open GroEL ring and allows for disruption of misfolded structures through two phases of multiaxis unfolding. In this model, repeated cycles of partial unfolding, followed by confinement within the GroEL-GroES chamber, provide the most effective overall mechanism for facilitating the folding of the most stringently dependent GroEL substrate proteins. PMID:24022487

  2. Optimal quantum cloning based on the maximin principle by using a priori information

    NASA Astrophysics Data System (ADS)

    Kang, Peng; Dai, Hong-Yi; Wei, Jia-Hua; Zhang, Ming

    2016-10-01

    We propose an optimal 1 →2 quantum cloning method based on the maximin principle by making full use of a priori information of amplitude and phase about the general cloned qubit input set, which is a simply connected region enclosed by a "longitude-latitude grid" on the Bloch sphere. Theoretically, the fidelity of the optimal quantum cloning machine derived from this method is the largest in terms of the maximin principle compared with that of any other machine. The problem solving is an optimization process that involves six unknown complex variables, six vectors in an uncertain-dimensional complex vector space, and four equality constraints. Moreover, by restricting the structure of the quantum cloning machine, the optimization problem is simplified as a three-real-parameter suboptimization problem with only one equality constraint. We obtain the explicit formula for a suboptimal quantum cloning machine. Additionally, the fidelity of our suboptimal quantum cloning machine is higher than or at least equal to that of universal quantum cloning machines and phase-covariant quantum cloning machines. It is also underlined that the suboptimal cloning machine outperforms the "belt quantum cloning machine" for some cases.

  3. Improved Prediction of Blood-Brain Barrier Permeability Through Machine Learning with Combined Use of Molecular Property-Based Descriptors and Fingerprints.

    PubMed

    Yuan, Yaxia; Zheng, Fang; Zhan, Chang-Guo

    2018-03-21

    Blood-brain barrier (BBB) permeability of a compound determines whether the compound can effectively enter the brain. It is an essential property which must be accounted for in drug discovery with a target in the brain. Several computational methods have been used to predict the BBB permeability. In particular, support vector machine (SVM), which is a kernel-based machine learning method, has been used popularly in this field. For SVM training and prediction, the compounds are characterized by molecular descriptors. Some SVM models were based on the use of molecular property-based descriptors (including 1D, 2D, and 3D descriptors) or fragment-based descriptors (known as the fingerprints of a molecule). The selection of descriptors is critical for the performance of a SVM model. In this study, we aimed to develop a generally applicable new SVM model by combining all of the features of the molecular property-based descriptors and fingerprints to improve the accuracy for the BBB permeability prediction. The results indicate that our SVM model has improved accuracy compared to the currently available models of the BBB permeability prediction.

  4. Comparison of Deep Learning With Multiple Machine Learning Methods and Metrics Using Diverse Drug Discovery Data Sets.

    PubMed

    Korotcov, Alexandru; Tkachenko, Valery; Russo, Daniel P; Ekins, Sean

    2017-12-04

    Machine learning methods have been applied to many data sets in pharmaceutical research for several decades. The relative ease and availability of fingerprint type molecular descriptors paired with Bayesian methods resulted in the widespread use of this approach for a diverse array of end points relevant to drug discovery. Deep learning is the latest machine learning algorithm attracting attention for many of pharmaceutical applications from docking to virtual screening. Deep learning is based on an artificial neural network with multiple hidden layers and has found considerable traction for many artificial intelligence applications. We have previously suggested the need for a comparison of different machine learning methods with deep learning across an array of varying data sets that is applicable to pharmaceutical research. End points relevant to pharmaceutical research include absorption, distribution, metabolism, excretion, and toxicity (ADME/Tox) properties, as well as activity against pathogens and drug discovery data sets. In this study, we have used data sets for solubility, probe-likeness, hERG, KCNQ1, bubonic plague, Chagas, tuberculosis, and malaria to compare different machine learning methods using FCFP6 fingerprints. These data sets represent whole cell screens, individual proteins, physicochemical properties as well as a data set with a complex end point. Our aim was to assess whether deep learning offered any improvement in testing when assessed using an array of metrics including AUC, F1 score, Cohen's kappa, Matthews correlation coefficient and others. Based on ranked normalized scores for the metrics or data sets Deep Neural Networks (DNN) ranked higher than SVM, which in turn was ranked higher than all the other machine learning methods. Visualizing these properties for training and test sets using radar type plots indicates when models are inferior or perhaps over trained. These results also suggest the need for assessing deep learning further using multiple metrics with much larger scale comparisons, prospective testing as well as assessment of different fingerprints and DNN architectures beyond those used.

  5. Twin target self-amplification-based DNA machine for highly sensitive detection of cancer-related gene.

    PubMed

    Xu, Huo; Jiang, Yifan; Liu, Dengyou; Liu, Kai; Zhang, Yafeng; Yu, Suhong; Shen, Zhifa; Wu, Zai-Sheng

    2018-06-29

    The sensitive detection of cancer-related genes is of great significance for early diagnosis and treatment of human cancers, and previous isothermal amplification sensing systems were often based on the reuse of target DNA, the amplification of enzymatic products and the accumulation of reporting probes. However, no reporting probes are able to be transformed into target species and in turn initiate the signal of other probes. Herein we reported a simple, isothermal and highly sensitive homogeneous assay system for tumor suppressor p53 gene detection based on a new autonomous DNA machine, where the signaling probe, molecular beacon (MB), was able to execute the function similar to target DNA besides providing the common signal. In the presence of target p53 gene, the operation of DNA machine can be initiated, and cyclical nucleic acid strand-displacement polymerization (CNDP) and nicking/polymerization cyclical amplification (NPCA) occur, during which the MB was opened by target species and cleaved by restriction endonuclease. In turn, the cleaved fragments could activate the next signaling process as target DNA did. According to the functional similarity, the cleaved fragment was called twin target, and the corresponding fashion to amplify the signal was named twin target self-amplification. Utilizing this newly-proposed DNA machine, the target DNA could be detected down to 0.1 pM with a wide dynamic range (6 orders of magnitude) and single-base mismatched targets were discriminated, indicating a very high assay sensitivity and good specificity. In addition, the DNA machine was not only used to screen the p53 gene in complex biological matrix but also was capable of practically detecting genomic DNA p53 extracted from A549 cell line. This indicates that the proposed DNA machine holds the potential application in biomedical research and early clinical diagnosis. Copyright © 2018 Elsevier B.V. All rights reserved.

  6. Clustering the Orion B giant molecular cloud based on its molecular emission

    PubMed Central

    Bron, Emeric; Daudon, Chloé; Pety, Jérôme; Levrier, François; Gerin, Maryvonne; Gratier, Pierre; Orkisz, Jan H.; Guzman, Viviana; Bardeau, Sébastien; Goicoechea, Javier R.; Liszt, Harvey; Öberg, Karin; Peretto, Nicolas; Sievers, Albrecht; Tremblin, Pascal

    2017-01-01

    Context Previous attempts at segmenting molecular line maps of molecular clouds have focused on using position-position-velocity data cubes of a single molecular line to separate the spatial components of the cloud. In contrast, wide field spectral imaging over a large spectral bandwidth in the (sub)mm domain now allows one to combine multiple molecular tracers to understand the different physical and chemical phases that constitute giant molecular clouds (GMCs). Aims We aim at using multiple tracers (sensitive to different physical processes and conditions) to segment a molecular cloud into physically/chemically similar regions (rather than spatially connected components), thus disentangling the different physical/chemical phases present in the cloud. Methods We use a machine learning clustering method, namely the Meanshift algorithm, to cluster pixels with similar molecular emission, ignoring spatial information. Clusters are defined around each maximum of the multidimensional Probability Density Function (PDF) of the line integrated intensities. Simple radiative transfer models were used to interpret the astrophysical information uncovered by the clustering analysis. Results A clustering analysis based only on the J = 1 – 0 lines of three isotopologues of CO proves suffcient to reveal distinct density/column density regimes (nH ~ 100 cm−3, ~ 500 cm−3, and > 1000 cm−3), closely related to the usual definitions of diffuse, translucent and high-column-density regions. Adding two UV-sensitive tracers, the J = 1 − 0 line of HCO+ and the N = 1 − 0 line of CN, allows us to distinguish two clearly distinct chemical regimes, characteristic of UV-illuminated and UV-shielded gas. The UV-illuminated regime shows overbright HCO+ and CN emission, which we relate to a photochemical enrichment effect. We also find a tail of high CN/HCO+ intensity ratio in UV-illuminated regions. Finer distinctions in density classes (nH ~ 7 × 103 cm−3 ~ 4 × 104 cm−3) for the densest regions are also identified, likely related to the higher critical density of the CN and HCO+ (1 – 0) lines. These distinctions are only possible because the high-density regions are spatially resolved. Conclusions Molecules are versatile tracers of GMCs because their line intensities bear the signature of the physics and chemistry at play in the gas. The association of simultaneous multi-line, wide-field mapping and powerful machine learning methods such as the Meanshift clustering algorithm reveals how to decode the complex information available in these molecular tracers. PMID:29456256

  7. Review of Statistical Learning Methods in Integrated Omics Studies (An Integrated Information Science).

    PubMed

    Zeng, Irene Sui Lan; Lumley, Thomas

    2018-01-01

    Integrated omics is becoming a new channel for investigating the complex molecular system in modern biological science and sets a foundation for systematic learning for precision medicine. The statistical/machine learning methods that have emerged in the past decade for integrated omics are not only innovative but also multidisciplinary with integrated knowledge in biology, medicine, statistics, machine learning, and artificial intelligence. Here, we review the nontrivial classes of learning methods from the statistical aspects and streamline these learning methods within the statistical learning framework. The intriguing findings from the review are that the methods used are generalizable to other disciplines with complex systematic structure, and the integrated omics is part of an integrated information science which has collated and integrated different types of information for inferences and decision making. We review the statistical learning methods of exploratory and supervised learning from 42 publications. We also discuss the strengths and limitations of the extended principal component analysis, cluster analysis, network analysis, and regression methods. Statistical techniques such as penalization for sparsity induction when there are fewer observations than the number of features and using Bayesian approach when there are prior knowledge to be integrated are also included in the commentary. For the completeness of the review, a table of currently available software and packages from 23 publications for omics are summarized in the appendix.

  8. Kernel-based whole-genome prediction of complex traits: a review.

    PubMed

    Morota, Gota; Gianola, Daniel

    2014-01-01

    Prediction of genetic values has been a focus of applied quantitative genetics since the beginning of the 20th century, with renewed interest following the advent of the era of whole genome-enabled prediction. Opportunities offered by the emergence of high-dimensional genomic data fueled by post-Sanger sequencing technologies, especially molecular markers, have driven researchers to extend Ronald Fisher and Sewall Wright's models to confront new challenges. In particular, kernel methods are gaining consideration as a regression method of choice for genome-enabled prediction. Complex traits are presumably influenced by many genomic regions working in concert with others (clearly so when considering pathways), thus generating interactions. Motivated by this view, a growing number of statistical approaches based on kernels attempt to capture non-additive effects, either parametrically or non-parametrically. This review centers on whole-genome regression using kernel methods applied to a wide range of quantitative traits of agricultural importance in animals and plants. We discuss various kernel-based approaches tailored to capturing total genetic variation, with the aim of arriving at an enhanced predictive performance in the light of available genome annotation information. Connections between prediction machines born in animal breeding, statistics, and machine learning are revisited, and their empirical prediction performance is discussed. Overall, while some encouraging results have been obtained with non-parametric kernels, recovering non-additive genetic variation in a validation dataset remains a challenge in quantitative genetics.

  9. Learning molecular energies using localized graph kernels

    DOE PAGES

    Ferré, Grégoire; Haut, Terry Scot; Barros, Kipton Marcos

    2017-03-21

    We report that recent machine learning methods make it possible to model potential energy of atomic configurations with chemical-level accuracy (as calculated from ab initio calculations) and at speeds suitable for molecular dynamics simulation. Best performance is achieved when the known physical constraints are encoded in the machine learning models. For example, the atomic energy is invariant under global translations and rotations; it is also invariant to permutations of same-species atoms. Although simple to state, these symmetries are complicated to encode into machine learning algorithms. In this paper, we present a machine learning approach based on graph theory that naturallymore » incorporates translation, rotation, and permutation symmetries. Specifically, we use a random walk graph kernel to measure the similarity of two adjacency matrices, each of which represents a local atomic environment. This Graph Approximated Energy (GRAPE) approach is flexible and admits many possible extensions. Finally, we benchmark a simple version of GRAPE by predicting atomization energies on a standard dataset of organic molecules.« less

  10. Trajectories of the ribosome as a Brownian nanomachine

    DOE PAGES

    Dashti, Ali; Schwander, Peter; Langlois, Robert; ...

    2014-11-24

    In a Brownian machine, there is a tiny device buffeted by the random motions of molecules in the environment, is capable of exploiting these thermal motions for many of the conformational changes in its work cycle. Such machines are now thought to be ubiquitous, with the ribosome, a molecular machine responsible for protein synthesis, increasingly regarded as prototypical. We present a new analytical approach capable of determining the free-energy landscape and the continuous trajectories of molecular machines from a large number of snapshots obtained by cryogenic electron microscopy. We demonstrate this approach in the context of experimental cryogenic electron microscopemore » images of a large ensemble of nontranslating ribosomes purified from yeast cells. The free-energy landscape is seen to contain a closed path of low energy, along which the ribosome exhibits conformational changes known to be associated with the elongation cycle. This approach allows model-free quantitative analysis of the degrees of freedom and the energy landscape underlying continuous conformational changes in nanomachines, including those important for biological function.« less

  11. Learning molecular energies using localized graph kernels

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ferré, Grégoire; Haut, Terry Scot; Barros, Kipton Marcos

    We report that recent machine learning methods make it possible to model potential energy of atomic configurations with chemical-level accuracy (as calculated from ab initio calculations) and at speeds suitable for molecular dynamics simulation. Best performance is achieved when the known physical constraints are encoded in the machine learning models. For example, the atomic energy is invariant under global translations and rotations; it is also invariant to permutations of same-species atoms. Although simple to state, these symmetries are complicated to encode into machine learning algorithms. In this paper, we present a machine learning approach based on graph theory that naturallymore » incorporates translation, rotation, and permutation symmetries. Specifically, we use a random walk graph kernel to measure the similarity of two adjacency matrices, each of which represents a local atomic environment. This Graph Approximated Energy (GRAPE) approach is flexible and admits many possible extensions. Finally, we benchmark a simple version of GRAPE by predicting atomization energies on a standard dataset of organic molecules.« less

  12. All tangled up: how cells direct, manage and exploit topoisomerase function

    PubMed Central

    Vos, Seychelle M.; Tretter, Elsa M.; Schmidt, Bryan H.; Berger, James M.

    2015-01-01

    Preface Topoisomerases are complex molecular machines that modulate DNA topology to maintain chromosome superstructure and integrity. Although capable of stand-alone activity in vitro, topoisomerases frequently are linked to larger pathways and systems that resolve specific DNA superstructures and intermediates arising from cellular processes such as DNA repair, transcription, replication, and chromosome compaction. Topoisomerase activity is indispensible to cells, but requires the transient breakage of DNA strands. This property has been exploited, often for significant clinical benefit, by various exogenous agents that interfere with cell proliferation. Despite decades of study, surprising findings involving topoisomerases continue to emerge with respect to their cellular function, regulation, and utility as therapeutic targets. PMID:22108601

  13. Computational Nanotechnology of Materials, Electronics and Machines: Carbon Nanotubes

    NASA Technical Reports Server (NTRS)

    Srivastava, Deepak

    2001-01-01

    This report presents the goals and research of the Integrated Product Team (IPT) on Devices and Nanotechnology. NASA's needs for this technology are discussed and then related to the research focus of the team. The two areas of focus for technique development are: 1) large scale classical molecular dynamics on a shared memory architecture machine; and 2) quantum molecular dynamics methodology. The areas of focus for research are: 1) nanomechanics/materials; 2) carbon based electronics; 3) BxCyNz composite nanotubes and junctions; 4) nano mechano-electronics; and 5) nano mechano-chemistry.

  14. Remote Photoregulated Ring Gliding in a [2]Rotaxane via a Molecular Effector.

    PubMed

    Tron, Arnaud; Pianet, Isabelle; Martinez-Cuezva, Alberto; Tucker, James H R; Pisciottani, Luca; Alajarin, Mateo; Berna, Jose; McClenaghan, Nathan D

    2017-01-06

    A molecular barbiturate messenger, which is reversibly released/captured by a photoswitchable artificial molecular receptor, is shown to act as an effector to control ring gliding on a distant hydrogen-bonding [2]rotaxane. Thus, light-driven chemical communication governing the operation of a remote molecular machine is demonstrated using an information-rich neutral molecule.

  15. Cactus

    ERIC Educational Resources Information Center

    Hyde, Hartley

    2007-01-01

    The transfer of data from one part of a computer to another has always been a complex task in which speed is traded against accuracy and the time required for error correction. Much more complex therefore is the transfer of information from one machine to another of a different type. Difficulties arise when machines are updated, when file formats…

  16. Normal contour error measurement on-machine and compensation method for polishing complex surface by MRF

    NASA Astrophysics Data System (ADS)

    Chen, Hua; Chen, Jihong; Wang, Baorui; Zheng, Yongcheng

    2016-10-01

    The Magnetorheological finishing (MRF) process, based on the dwell time method with the constant normal spacing for flexible polishing, would bring out the normal contour error in the fine polishing complex surface such as aspheric surface. The normal contour error would change the ribbon's shape and removal characteristics of consistency for MRF. Based on continuously scanning the normal spacing between the workpiece and the finder by the laser range finder, the novel method was put forward to measure the normal contour errors while polishing complex surface on the machining track. The normal contour errors was measured dynamically, by which the workpiece's clamping precision, multi-axis machining NC program and the dynamic performance of the MRF machine were achieved for the verification and security check of the MRF process. The unit for measuring the normal contour errors of complex surface on-machine was designed. Based on the measurement unit's results as feedback to adjust the parameters of the feed forward control and the multi-axis machining, the optimized servo control method was presented to compensate the normal contour errors. The experiment for polishing 180mm × 180mm aspherical workpiece of fused silica by MRF was set up to validate the method. The results show that the normal contour error was controlled in less than 10um. And the PV value of the polished surface accuracy was improved from 0.95λ to 0.09λ under the conditions of the same process parameters. The technology in the paper has been being applied in the PKC600-Q1 MRF machine developed by the China Academe of Engineering Physics for engineering application since 2014. It is being used in the national huge optical engineering for processing the ultra-precision optical parts.

  17. New Single Piece Blast Hardware design

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ulrich, Andri; Steinzig, Michael Louis; Aragon, Daniel Adrian

    W, Q and PF engineers and machinists designed and fabricated, on the new Mazak i300, the first Single Piece Blast Hardware (unclassified design shown) reducing fabrication and inspection time by over 50%. The first DU Single Piece is completed and will be used for Hydro Test 3680. Past hydro tests used a twopiece assembly due to a lack of equipment capable of machining the complex saddle shape in a single piece. The i300 provides turning and milling 5-axis machining on one machine. The milling head on the i300 can machine past 90 relative to the spindle axis. This makes itmore » possible to machine the complex saddle surface on a single piece. Going to a single piece eliminates tolerance problems, such as tilting and eccentricity, that typically occurred when assembling the two pieces together« less

  18. The Machine Intelligence Hex Project

    ERIC Educational Resources Information Center

    Chalup, Stephan K.; Mellor, Drew; Rosamond, Fran

    2005-01-01

    Hex is a challenging strategy board game for two players. To enhance students' progress in acquiring understanding and practical experience with complex machine intelligence and programming concepts we developed the Machine Intelligence Hex (MIHex) project. The associated undergraduate student assignment is about designing and implementing Hex…

  19. Visualization and characterization of individual type III protein secretion machines in live bacteria

    PubMed Central

    Lara-Tejero, María; Bewersdorf, Jörg; Galán, Jorge E.

    2017-01-01

    Type III protein secretion machines have evolved to deliver bacterially encoded effector proteins into eukaryotic cells. Although electron microscopy has provided a detailed view of these machines in isolation or fixed samples, little is known about their organization in live bacteria. Here we report the visualization and characterization of the Salmonella type III secretion machine in live bacteria by 2D and 3D single-molecule switching superresolution microscopy. This approach provided access to transient components of this machine, which previously could not be analyzed. We determined the subcellular distribution of individual machines, the stoichiometry of the different components of this machine in situ, and the spatial distribution of the substrates of this machine before secretion. Furthermore, by visualizing this machine in Salmonella mutants we obtained major insights into the machine’s assembly. This study bridges a major resolution gap in the visualization of this nanomachine and may serve as a paradigm for the examination of other bacterially encoded molecular machines. PMID:28533372

  20. Towards a framework of human factors certification of complex human-machine systems

    NASA Technical Reports Server (NTRS)

    Bukasa, Birgit

    1994-01-01

    As far as total automation is not realized, the combination of technical and social components in man-machine systems demands not only contributions from engineers but at least to an equal extent from behavioral scientists. This has been neglected far too long. The psychological, social and cultural aspects of technological innovations were almost totally overlooked. Yet, along with expected safety improvements the institutionalization of human factors is on the way. The introduction of human factors certification of complex man-machine systems will be a milestone in this process.

  1. Computational Nanotechnology of Molecular Materials, Electronics, and Actuators with Carbon Nanotubes and Fullerenes

    NASA Technical Reports Server (NTRS)

    Srivastava, Deepak; Menon, Madhu; Cho, Kyeongjae; Biegel, Bryan (Technical Monitor)

    2001-01-01

    The role of computational nanotechnology in developing next generation of multifunctional materials, molecular scale electronic and computing devices, sensors, actuators, and machines is described through a brief review of enabling computational techniques and few recent examples derived from computer simulations of carbon nanotube based molecular nanotechnology.

  2. Optimizing the way kinematical feed chains with great distance between slides are chosen for CNC machine tools

    NASA Astrophysics Data System (ADS)

    Lucian, P.; Gheorghe, S.

    2017-08-01

    This paper presents a new method, based on FRISCO formula, for optimizing the choice of the best control system for kinematical feed chains with great distance between slides used in computer numerical controlled machine tools. Such machines are usually, but not limited to, used for machining large and complex parts (mostly in the aviation industry) or complex casting molds. For such machine tools the kinematic feed chains are arranged in a dual-parallel drive structure that allows the mobile element to be moved by the two kinematical branches and their related control systems. Such an arrangement allows for high speed and high rigidity (a critical requirement for precision machining) during the machining process. A significant issue for such an arrangement it’s the ability of the two parallel control systems to follow the same trajectory accurately in order to address this issue it is necessary to achieve synchronous motion control for the two kinematical branches ensuring that the correct perpendicular position it’s kept by the mobile element during its motion on the two slides.

  3. A simplified concept for controlling oxygen mixtures in the anaesthetic machine--better, cheaper and more user-friendly?

    PubMed

    Berge, J A; Gramstad, L; Grimnes, S

    1995-05-01

    Modern anaesthetic machines are equipped with several safety components to prevent delivery of hypoxic mixtures. However, such a technical development has increased the complexity of the equipment. We report a reconstructed anaesthetic machine in which a paramagnetic oxygen analyzer has provided the means to simplify the apparatus. The new machine is devoid of several components conventionally included to prevent hypoxic mixtures: oxygen failure protection device, reservoir O2 alarm, N2O/air selector, and proportioning system for oxygen/nitrous oxide delivery. These devices have been replaced by a simple safety system using a paramagnetic oxygen analyzer at the common gas outlet, which in a feed-back system cuts off the supply of nitrous oxide whenever the oxygen concentration falls below 25%. The simplified construction of the anaesthetic machine has important consequences for safety, cost and user-friendliness. Reducing the complexity of the construction also simplifies the pre-use checkout procedure, and an efficient 5-point check list is presented for the new machine.

  4. Assessing the potential of surface-immobilized molecular logic machines for integration with solid state technology.

    PubMed

    Dunn, Katherine E; Trefzer, Martin A; Johnson, Steven; Tyrrell, Andy M

    2016-08-01

    Molecular computation with DNA has great potential for low power, highly parallel information processing in a biological or biochemical context. However, significant challenges remain for the field of DNA computation. New technology is needed to allow multiplexed label-free readout and to enable regulation of molecular state without addition of new DNA strands. These capabilities could be provided by hybrid bioelectronic systems in which biomolecular computing is integrated with conventional electronics through immobilization of DNA machines on the surface of electronic circuitry. Here we present a quantitative experimental analysis of a surface-immobilized OR gate made from DNA and driven by strand displacement. The purpose of our work is to examine the performance of a simple representative surface-immobilized DNA logic machine, to provide valuable information for future work on hybrid bioelectronic systems involving DNA devices. We used a quartz crystal microbalance to examine a DNA monolayer containing approximately 5×10(11)gatescm(-2), with an inter-gate separation of approximately 14nm, and we found that the ensemble of gates took approximately 6min to switch. The gates could be switched repeatedly, but the switching efficiency was significantly degraded on the second and subsequent cycles when the binding site for the input was near to the surface. Otherwise, the switching efficiency could be 80% or better, and the power dissipated by the ensemble of gates during switching was approximately 0.1nWcm(-2), which is orders of magnitude less than the power dissipated during switching of an equivalent array of transistors. We propose an architecture for hybrid DNA-electronic systems in which information can be stored and processed, either in series or in parallel, by a combination of molecular machines and conventional electronics. In this architecture, information can flow freely and in both directions between the solution-phase and the underlying electronics via surface-immobilized DNA machines that provide the interface between the molecular and electronic domains. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  5. Biochemistry and neuroscience: the twain need to meet.

    PubMed

    Kennedy, Mary B

    2017-04-01

    Neuroscience has come to mean the study of electrophysiology of neurons and synapses, micro and macro-scale neuroanatomy, and the functional organization of brain areas. The molecular axis of the field, as reflected in textbooks, often includes only descriptions of the structure and function of individual channels and receptor proteins, and the extracellular signals that guide development and repair. Studies of cytosolic 'molecular machines', large assemblies of proteins that orchestrate regulation of neuronal functions, have been neglected. However, a complete understanding of brain function that will enable new strategies for treatment of the most intractable neural disorders will require that in vitro biochemical studies of molecular machines be reintegrated into the field of neuroscience. Copyright © 2017 Elsevier Ltd. All rights reserved.

  6. BamA POTRA Domain Interacts with a Native Lipid Membrane Surface.

    PubMed

    Fleming, Patrick J; Patel, Dhilon S; Wu, Emilia L; Qi, Yifei; Yeom, Min Sun; Sousa, Marcelo Carlos; Fleming, Karen G; Im, Wonpil

    2016-06-21

    The outer membrane of Gram-negative bacteria is an asymmetric membrane with lipopolysaccharides on the external leaflet and phospholipids on the periplasmic leaflet. This outer membrane contains mainly β-barrel transmembrane proteins and lipidated periplasmic proteins (lipoproteins). The multisubunit protein β-barrel assembly machine (BAM) catalyzes the insertion and folding of the β-barrel proteins into this membrane. In Escherichia coli, the BAM complex consists of five subunits, a core transmembrane β-barrel with a long periplasmic domain (BamA) and four lipoproteins (BamB/C/D/E). The BamA periplasmic domain is composed of five globular subdomains in tandem called POTRA motifs that are key to BAM complex formation and interaction with the substrate β-barrel proteins. The BAM complex is believed to undergo conformational cycling while facilitating insertion of client proteins into the outer membrane. Reports describing variable conformations and dynamics of the periplasmic POTRA domain have been published. Therefore, elucidation of the conformational dynamics of the POTRA domain in full-length BamA is important to understand the function of this molecular complex. Using molecular dynamics simulations, we present evidence that the conformational flexibility of the POTRA domain is modulated by binding to the periplasmic surface of a native lipid membrane. Furthermore, membrane binding of the POTRA domain is compatible with both BamB and BamD binding, suggesting that conformational selection of different POTRA domain conformations may be involved in the mechanism of BAM-facilitated insertion of outer membrane β-barrel proteins. Copyright © 2016 Biophysical Society. Published by Elsevier Inc. All rights reserved.

  7. Architectonics: Design of Molecular Architecture for Functional Applications.

    PubMed

    Avinash, M B; Govindaraju, Thimmaiah

    2018-02-20

    The term architectonics has its roots in the architectural and philosophical (as early as 1600s) literature that refers to "the theory of structure" and "the structure of theory", respectively. The concept of architectonics has been adapted to advance the field of molecular self-assembly and termed as molecular architectonics. In essence, the methodology of organizing molecular units in the required and controlled configurations to develop advanced functional systems for materials and biological applications comprises the field of molecular architectonics. This concept of designing noncovalent systems enables to focus on different functional aspects of designer molecules for biological and nonbiological applications and also strengthens our efforts toward the mastery over the art of controlled molecular self-assemblies. Programming complex molecular interactions and assemblies for specific functions has been one of the most challenging tasks in the modern era. Meticulously ordered molecular assemblies can impart remarkable developments in several areas spanning energy, health, and environment. For example, the well-defined nano-, micro-, and macroarchitectures of functional molecules with specific molecular ordering possess potential applications in flexible electronics, photovoltaics, photonic crystals, microreactors, sensors, drug delivery, biomedicine, and superhydrophobic coatings, among others. The functional molecular architectures having unparalleled properties are widely evident in various designs of Nature. By drawing inspirations from Nature, intended molecular architectures can be designed and developed to harvest various functions, as there is an inexhaustible resource and scope. In this Account, we present exquisite designer molecules developed by our group and others with an objective to master the art of molecular recognition and self-assembly for functional applications. We demonstrate the tailor-ability of molecular self-assemblies by employing biomolecules like amino acids and nucleobases as auxiliaries. Naphthalenediimide (NDI), perylenediimide (PDI), and few other molecular systems serve as functional modules. The effects of stereochemistry and minute structural modifications in the molecular designs on the supramolecular interactions, and construction of self-assembled zero-dimensional (OD), one-dimensional (1D), and two-dimensional (2D) nano- and microarchitectures like particles, spheres, cups, bowls, fibers, belts, helical belts, supercoiled helices, sheets, fractals, and honeycomb-like arrays are discussed in extensive detail. Additionally, we present molecular systems that showcase the elegant designs of coassembly, templated assembly, hierarchical assembly, transient self-assembly, chiral denaturation, retentive helical memory, self-replication, supramolecular regulation, supramolecular speciation, supernon linearity, dynamic pathway complexity, supramolecular heterojunction, living supramolecular polymerization, and molecular machines. Finally, we describe the molecular engineering principles learnt over the years that have led to several applications, namely, organic electronics, self-cleaning, high-mechanical strength, and tissue engineering.

  8. Protein interactions and complexes in human microRNA biogenesis and function

    PubMed Central

    Perron, Marjorie P.; Provost, Patrick

    2010-01-01

    Encoded in the genome of most eukaryotes, microRNAs (miRNAs) have been proposed to regulate specifically up to 90% of human genes through a process known as miRNA-guided RNA silencing. The aim of this review is to present this process as the integration of a succession of specialized molecular machines exerting well defined functions. The nuclear microprocessor complex initially recognizes and processes its primary miRNA substrate into a miRNA precursor (pre-miRNA). This structure is then exported to the cytoplasm by the Exportin-5 complex where it is presented to the pre-miRNA processing complex. Following pre-miRNA conversion into a miRNA:miRNA* duplex, this complex is assembled into a miRNA-containing ribonucleoprotein (miRNP) complex, after which the miRNA strand is selected. The degree of complementarity of the miRNA for its messenger RNA (mRNA) target guides the recruitment of the miRNP complex. Initially repressing its translation, the miRNP-silenced mRNA is directed to the P-bodies, where the mRNA is either released from its inhibition upon a cellular signal and/or actively degraded. The potency and specificity of miRNA biogenesis and function rely on the distinct protein·protein, protein·RNA and RNA:RNA interactions found in different complexes, each of which fulfill a specific function in a well orchestrated process. PMID:17981733

  9. Machine learning of single molecule free energy surfaces and the impact of chemistry and environment upon structure and dynamics

    NASA Astrophysics Data System (ADS)

    Mansbach, Rachael A.; Ferguson, Andrew L.

    2015-03-01

    The conformational states explored by polymers and proteins can be controlled by environmental conditions (e.g., temperature, pressure, and solvent) and molecular chemistry (e.g., molecular weight and side chain identity). We introduce an approach employing the diffusion map nonlinear machine learning technique to recover single molecule free energy landscapes from molecular simulations, quantify changes to the landscape as a function of external conditions and molecular chemistry, and relate these changes to modifications of molecular structure and dynamics. In an application to an n-eicosane chain, we quantify the thermally accessible chain configurations as a function of temperature and solvent conditions. In an application to a family of polyglutamate-derivative homopeptides, we quantify helical stability as a function of side chain length, resolve the critical side chain length for the helix-coil transition, and expose the molecular mechanisms underpinning side chain-mediated helix stability. By quantifying single molecule responses through perturbations to the underlying free energy surface, our approach provides a quantitative bridge between experimentally controllable variables and microscopic molecular behavior, guiding and informing rational engineering of desirable molecular structure and function.

  10. Synthetic Ion Channels and DNA Logic Gates as Components of Molecular Robots.

    PubMed

    Kawano, Ryuji

    2018-02-19

    A molecular robot is a next-generation biochemical machine that imitates the actions of microorganisms. It is made of biomaterials such as DNA, proteins, and lipids. Three prerequisites have been proposed for the construction of such a robot: sensors, intelligence, and actuators. This Minireview focuses on recent research on synthetic ion channels and DNA computing technologies, which are viewed as potential candidate components of molecular robots. Synthetic ion channels, which are embedded in artificial cell membranes (lipid bilayers), sense ambient ions or chemicals and import them. These artificial sensors are useful components for molecular robots with bodies consisting of a lipid bilayer because they enable the interface between the inside and outside of the molecular robot to function as gates. After the signal molecules arrive inside the molecular robot, they can operate DNA logic gates, which perform computations. These functions will be integrated into the intelligence and sensor sections of molecular robots. Soon, these molecular machines will be able to be assembled to operate as a mass microrobot and play an active role in environmental monitoring and in vivo diagnosis or therapy. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  11. Machine learning of single molecule free energy surfaces and the impact of chemistry and environment upon structure and dynamics.

    PubMed

    Mansbach, Rachael A; Ferguson, Andrew L

    2015-03-14

    The conformational states explored by polymers and proteins can be controlled by environmental conditions (e.g., temperature, pressure, and solvent) and molecular chemistry (e.g., molecular weight and side chain identity). We introduce an approach employing the diffusion map nonlinear machine learning technique to recover single molecule free energy landscapes from molecular simulations, quantify changes to the landscape as a function of external conditions and molecular chemistry, and relate these changes to modifications of molecular structure and dynamics. In an application to an n-eicosane chain, we quantify the thermally accessible chain configurations as a function of temperature and solvent conditions. In an application to a family of polyglutamate-derivative homopeptides, we quantify helical stability as a function of side chain length, resolve the critical side chain length for the helix-coil transition, and expose the molecular mechanisms underpinning side chain-mediated helix stability. By quantifying single molecule responses through perturbations to the underlying free energy surface, our approach provides a quantitative bridge between experimentally controllable variables and microscopic molecular behavior, guiding and informing rational engineering of desirable molecular structure and function.

  12. Study of a variable mass Atwood's machine using a smartphone

    NASA Astrophysics Data System (ADS)

    Lopez, Dany; Caprile, Isidora; Corvacho, Fernando; Reyes, Orfa

    2018-03-01

    The Atwood machine was invented in 1784 by George Atwood and this system has been widely studied both theoretically and experimentally over the years. Nowadays, it is commonplace that many experimental physics courses include both Atwood's machine and variable mass to introduce more complex concepts in physics. To study the dynamics of the masses that compose the variable Atwood's machine, laboratories typically use a smart pulley. Now, the first work that introduced a smartphone as data acquisition equipment to study the acceleration in the Atwood's machine was the one by M. Monteiro et al. Since then, there has been no further information available on the usage of smartphones in variable mass systems. This prompted us to do a study of this kind of system by means of data obtained with a smartphone and to show the practicality of using smartphones in complex experimental situations.

  13. Synthetic Molecular Machines for Active Self-Assembly: Prototype Algorithms, Designs, and Experimental Study

    NASA Astrophysics Data System (ADS)

    Dabby, Nadine L.

    Computer science and electrical engineering have been the great success story of the twentieth century. The neat modularity and mapping of a language onto circuits has led to robots on Mars, desktop computers and smartphones. But these devices are not yet able to do some of the things that life takes for granted: repair a scratch, reproduce, regenerate, or grow exponentially fast--all while remaining functional. This thesis explores and develops algorithms, molecular implementations, and theoretical proofs in the context of "active self-assembly" of molecular systems. The long-term vision of active self-assembly is the theoretical and physical implementation of materials that are composed of reconfigurable units with the programmability and adaptability of biology's numerous molecular machines. En route to this goal, we must first find a way to overcome the memory limitations of molecular systems, and to discover the limits of complexity that can be achieved with individual molecules. One of the main thrusts in molecular programming is to use computer science as a tool for figuring out what can be achieved. While molecular systems that are Turing-complete have been demonstrated [Winfree, 1996], these systems still cannot achieve some of the feats biology has achieved. One might think that because a system is Turing-complete, capable of computing "anything," that it can do any arbitrary task. But while it can simulate any digital computational problem, there are many behaviors that are not "computations" in a classical sense, and cannot be directly implemented. Examples include exponential growth and molecular motion relative to a surface. Passive self-assembly systems cannot implement these behaviors because (a) molecular motion relative to a surface requires a source of fuel that is external to the system, and (b) passive systems are too slow to assemble exponentially-fast-growing structures. We call these behaviors "energetically incomplete" programmable behaviors. This class of behaviors includes any behavior where a passive physical system simply does not have enough physical energy to perform the specified tasks in the requisite amount of time. As we will demonstrate and prove, a sufficiently expressive implementation of an "active" molecular self-assembly approach can achieve these behaviors. Using an external source of fuel solves part of the problem, so the system is not "energetically incomplete." But the programmable system also needs to have sufficient expressive power to achieve the specified behaviors. Perhaps surprisingly, some of these systems do not even require Turing completeness to be sufficiently expressive. Building on a large variety of work by other scientists in the fields of DNA nanotechnology, chemistry and reconfigurable robotics, this thesis introduces several research contributions in the context of active self-assembly. We show that simple primitives such as insertion and deletion are able to generate complex and interesting results such as the growth of a linear polymer in logarithmic time and the ability of a linear polymer to treadmill. To this end we developed a formal model for active-self assembly that is directly implementable with DNA molecules. We show that this model is computationally equivalent to a machine capable of producing strings that are stronger than regular languages and, at most, as strong as context-free grammars. This is a great advance in the theory of active self-assembly as prior models were either entirely theoretical or only implementable in the context of macro-scale robotics. We developed a chain reaction method for the autonomous exponential growth of a linear DNA polymer. Our method is based on the insertion of molecules into the assembly, which generates two new insertion sites for every initial one employed. The building of a line in logarithmic time is a first step toward building a shape in logarithmic time. We demonstrate the first construction of a synthetic linear polymer that grows exponentially fast via insertion. We show that monomer molecules are converted into the polymer in logarithmic time via spectrofluorimetry and gel electrophoresis experiments. We also demonstrate the division of these polymers via the addition of a single DNA complex that competes with the insertion mechanism. This shows the growth of a population of polymers in logarithmic time. We characterize the DNA insertion mechanism that we utilize in Chapter 4. We experimentally demonstrate that we can control the kinetics of this reaction over at least seven orders of magnitude, by programming the sequences of DNA that initiate the reaction. In addition, we review co-authored work on programming molecular robots using prescriptive landscapes of DNA origami; this was the first microscopic demonstration of programming a molecular robot to walk on a 2-dimensional surface. We developed a snapshot method for imaging these random walking molecular robots and a CAPTCHA-like analysis method for difficult-to-interpret imaging data.

  14. Metabolite identification through multiple kernel learning on fragmentation trees.

    PubMed

    Shen, Huibin; Dührkop, Kai; Böcker, Sebastian; Rousu, Juho

    2014-06-15

    Metabolite identification from tandem mass spectrometric data is a key task in metabolomics. Various computational methods have been proposed for the identification of metabolites from tandem mass spectra. Fragmentation tree methods explore the space of possible ways in which the metabolite can fragment, and base the metabolite identification on scoring of these fragmentation trees. Machine learning methods have been used to map mass spectra to molecular fingerprints; predicted fingerprints, in turn, can be used to score candidate molecular structures. Here, we combine fragmentation tree computations with kernel-based machine learning to predict molecular fingerprints and identify molecular structures. We introduce a family of kernels capturing the similarity of fragmentation trees, and combine these kernels using recently proposed multiple kernel learning approaches. Experiments on two large reference datasets show that the new methods significantly improve molecular fingerprint prediction accuracy. These improvements result in better metabolite identification, doubling the number of metabolites ranked at the top position of the candidates list. © The Author 2014. Published by Oxford University Press.

  15. Molecular machines operating on the nanoscale: from classical to quantum

    PubMed Central

    2016-01-01

    Summary The main physical features and operating principles of isothermal nanomachines in the microworld, common to both classical and quantum machines, are reviewed. Special attention is paid to the dual, constructive role of dissipation and thermal fluctuations, the fluctuation–dissipation theorem, heat losses and free energy transduction, thermodynamic efficiency, and thermodynamic efficiency at maximum power. Several basic models are considered and discussed to highlight generic physical features. This work examines some common fallacies that continue to plague the literature. In particular, the erroneous beliefs that one should minimize friction and lower the temperature for high performance of Brownian machines, and that the thermodynamic efficiency at maximum power cannot exceed one-half are discussed. The emerging topic of anomalous molecular motors operating subdiffusively but very efficiently in the viscoelastic environment of living cells is also discussed. PMID:27335728

  16. [Big Data Revolution or Data Hubris? : On the Data Positivism of Molecular Biology].

    PubMed

    Gramelsberger, Gabriele

    2017-12-01

    Genome data, the core of the 2008 proclaimed big data revolution in biology, are automatically generated and analyzed. The transition from the manual laboratory practice of electrophoresis sequencing to automated DNA-sequencing machines and software-based analysis programs was completed between 1982 and 1992. This transition facilitated the first data deluge, which was considerably increased by the second and third generation of DNA-sequencers during the 2000s. However, the strategies for evaluating sequence data were also transformed along with this transition. The paper explores both the computational strategies of automation, as well as the data evaluation culture connected with it, in order to provide a complete picture of the complexity of today's data generation and its intrinsic data positivism. This paper is thereby guided by the question, whether this data positivism is the basis of the big data revolution of molecular biology announced today, or it marks the beginning of its data hubris.

  17. Conceptual Study of Permanent Magnet Machine Ship Propulsion Systems

    DTIC Science & Technology

    1977-12-01

    cycloconverter subsystem is designed using advanced thyristors and can be either water or air cooled. The machine-cycloconverter, many-phase or parallel...Turnb, Phase, Poles, Air Gap ................................. 3-9 3-5 Machine Characteristics Versus Number of Poles (large machine, 40 000 hp). Poles...cylindrical permanent magnet generator forces the power conditioner to provide for both frequency change and voltage control. The complexity of this dual

  18. Fourier series of atomic radial distribution functions: A molecular fingerprint for machine learning models of quantum chemical properties

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    von Lilienfeld, O. Anatole; Ramakrishnan, Raghunathan; Rupp, Matthias

    We introduce a fingerprint representation of molecules based on a Fourier series of atomic radial distribution functions. This fingerprint is unique (except for chirality), continuous, and differentiable with respect to atomic coordinates and nuclear charges. It is invariant with respect to translation, rotation, and nuclear permutation, and requires no preconceived knowledge about chemical bonding, topology, or electronic orbitals. As such, it meets many important criteria for a good molecular representation, suggesting its usefulness for machine learning models of molecular properties trained across chemical compound space. To assess the performance of this new descriptor, we have trained machine learning models ofmore » molecular enthalpies of atomization for training sets with up to 10 k organic molecules, drawn at random from a published set of 134 k organic molecules with an average atomization enthalpy of over 1770 kcal/mol. We validate the descriptor on all remaining molecules of the 134 k set. For a training set of 10 k molecules, the fingerprint descriptor achieves a mean absolute error of 8.0 kcal/mol. This is slightly worse than the performance attained using the Coulomb matrix, another popular alternative, reaching 6.2 kcal/mol for the same training and test sets. (c) 2015 Wiley Periodicals, Inc.« less

  19. Stress Genes and Proteins in the Archaea

    PubMed Central

    Macario, Alberto J. L.; Lange, Marianne; Ahring, Birgitte K.; De Macario, Everly Conway

    1999-01-01

    The field covered in this review is new; the first sequence of a gene encoding the molecular chaperone Hsp70 and the first description of a chaperonin in the archaea were reported in 1991. These findings boosted research in other areas beyond the archaea that were directly relevant to bacteria and eukaryotes, for example, stress gene regulation, the structure-function relationship of the chaperonin complex, protein-based molecular phylogeny of organisms and eukaryotic-cell organelles, molecular biology and biochemistry of life in extreme environments, and stress tolerance at the cellular and molecular levels. In the last 8 years, archaeal stress genes and proteins belonging to the families Hsp70, Hsp60 (chaperonins), Hsp40(DnaJ), and small heat-shock proteins (sHsp) have been studied. The hsp70(dnaK), hsp40(dnaJ), and grpE genes (the chaperone machine) have been sequenced in seven, four, and two species, respectively, but their expression has been examined in detail only in the mesophilic methanogen Methanosarcina mazei S-6. The proteins possess markers typical of bacterial homologs but none of the signatures distinctive of eukaryotes. In contrast, gene expression and transcription initiation signals and factors are of the eucaryal type, which suggests a hybrid archaeal-bacterial complexion for the Hsp70 system. Another remarkable feature is that several archaeal species in different phylogenetic branches do not have the gene hsp70(dnaK), an evolutionary puzzle that raises the important question of what replaces the product of this gene, Hsp70(DnaK), in protein biogenesis and refolding and for stress resistance. Although archaea are prokaryotes like bacteria, their Hsp60 (chaperonin) family is of type (group) II, similar to that of the eukaryotic cytosol; however, unlike the latter, which has several different members, the archaeal chaperonin system usually includes only two (in some species one and in others possibly three) related subunits of ∼60 kDa. These form, in various combinations depending on the species, a large structure or chaperonin complex sometimes called the thermosome. This multimolecular assembly is similar to the bacterial chaperonin complex GroEL/S, but it is made of only the large, double-ring oligomers each with eight (or nine) subunits instead of seven as in the bacterial complex. Like Hsp70(DnaK), the archaeal chaperonin subunits are remarkable for their evolution, but for a different reason. Ubiquitous among archaea, the chaperonins show a pattern of recurrent gene duplication—hetero-oligomeric chaperonin complexes appear to have evolved several times independently. The stress response and stress tolerance in the archaea involve chaperones, chaperonins, other heat shock (stress) proteins including sHsp, thermoprotectants, the proteasome, as yet incompletely understood thermoresistant features of many molecules, and formation of multicellular structures. The latter structures include single- and mixed-species (bacterial-archaeal) types. Many questions remain unanswered, and the field offers extraordinary opportunities owing to the diversity, genetic makeup, and phylogenetic position of archaea and the variety of ecosystems they inhabit. Specific aspects that deserve investigation are elucidation of the mechanism of action of the chaperonin complex at different temperatures, identification of the partners and substitutes for the Hsp70 chaperone machine, analysis of protein folding and refolding in hyperthermophiles, and determination of the molecular mechanisms involved in stress gene regulation in archaeal species that thrive under widely different conditions (temperature, pH, osmolarity, and barometric pressure). These studies are now possible with uni- and multicellular archaeal models and are relevant to various areas of basic and applied research, including exploration and conquest of ecosystems inhospitable to humans and many mammals and plants. PMID:10585970

  20. A Mechanical Switch Couples T Cell Receptor Triggering to the Cytoplasmic Juxtamembrane Regions of CD3ζζ.

    PubMed

    Lee, Mark S; Glassman, Caleb R; Deshpande, Neha R; Badgandi, Hemant B; Parrish, Heather L; Uttamapinant, Chayasith; Stawski, Philipp S; Ting, Alice Y; Kuhns, Michael S

    2015-08-18

    The eight-subunit T cell receptor (TCR)-CD3 complex is the primary determinant for T cell fate decisions. Yet how it relays ligand-specific information across the cell membrane for conversion to chemical signals remains unresolved. We hypothesized that TCR engagement triggers a change in the spatial relationship between the associated CD3ζζ subunits at the junction where they emerge from the membrane into the cytoplasm. Using three in situ proximity assays based on ID-PRIME, FRET, and EPOR activity, we determined that the cytosolic juxtamembrane regions of the CD3ζζ subunits are spread apart upon assembly into the TCR-CD3 complex. TCR engagement then triggered their apposition. This mechanical switch resides upstream of the CD3ζζ intracellular motifs that initiate chemical signaling, as well as the polybasic stretches that regulate signal potentiation. These findings provide a framework from which to examine triggering events for activating immune receptors and other complex molecular machines. Copyright © 2015 Elsevier Inc. All rights reserved.

  1. Driving and controlling molecular surface rotors with a terahertz electric field.

    PubMed

    Neumann, Jan; Gottschalk, Kay E; Astumian, R Dean

    2012-06-26

    Great progress has been made in the design and synthesis of molecular motors and rotors. Loosely inspired by biomolecular machines such as kinesin and the FoF1 ATPsynthase, these molecules are hoped to provide elements for construction of more elaborate structures that can carry out tasks at the nanoscale corresponding to the tasks accomplished by elementary machines in the macroscopic world. Most of the molecular motors synthesized to date suffer from the drawback that they operate relatively slowly (less than kHz). Here we show by molecular dynamics studies of a diethyl sulfide rotor on a gold(111) surface that a high-frequency oscillating electric field normal to the surface can drive directed rotation at GHz frequencies. The maximum directed rotation rate is 10(10) rotations per second, significantly faster than the rotation of previously reported directional molecular rotors. Understanding the fundamental basis of directed motion of surface rotors is essential for the further development of efficient externally driven artificial rotors. Our results represent a step toward the design of a surface-bound molecular rotary motor with a tunable rotation frequency and direction.

  2. Procedure and computer program to calculate machine contribution to sawmill recovery

    Treesearch

    Philip H. Steele; Hiram Hallock; Stanford Lunstrum

    1981-01-01

    The importance of considering individual machine contribution to total mill efficiency is discussed. A method for accurately calculating machine contribution is introduced, and an example is given using this method. A FORTRAN computer program to make the necessary complex calculations automatically is also presented with user instructions.

  3. Comprehensive Reactive Receiver Modeling for Diffusive Molecular Communication Systems: Reversible Binding, Molecule Degradation, and Finite Number of Receptors.

    PubMed

    Ahmadzadeh, Arman; Arjmandi, Hamidreza; Burkovski, Andreas; Schober, Robert

    2016-10-01

    This paper studies the problem of receiver modeling in molecular communication systems. We consider the diffusive molecular communication channel between a transmitter nano-machine and a receiver nano-machine in a fluid environment. The information molecules released by the transmitter nano-machine into the environment can degrade in the channel via a first-order degradation reaction and those that reach the receiver nano-machine can participate in a reversible bimolecular reaction with receiver receptor proteins. Thereby, we distinguish between two scenarios. In the first scenario, we assume that the entire surface of the receiver is covered by receptor molecules. We derive a closed-form analytical expression for the expected received signal at the receiver, i.e., the expected number of activated receptors on the surface of the receiver. Then, in the second scenario, we consider the case where the number of receptor molecules is finite and the uniformly distributed receptor molecules cover the receiver surface only partially. We show that the expected received signal for this scenario can be accurately approximated by the expected received signal for the first scenario after appropriately modifying the forward reaction rate constant. The accuracy of the derived analytical results is verified by Brownian motion particle-based simulations of the considered environment, where we also show the impact of the effect of receptor occupancy on the derived analytical results.

  4. Algorithm for detection the QRS complexes based on support vector machine

    NASA Astrophysics Data System (ADS)

    Van, G. V.; Podmasteryev, K. V.

    2017-11-01

    The efficiency of computer ECG analysis depends on the accurate detection of QRS-complexes. This paper presents an algorithm for QRS complex detection based of support vector machine (SVM). The proposed algorithm is evaluated on annotated standard databases such as MIT-BIH Arrhythmia database. The QRS detector obtained a sensitivity Se = 98.32% and specificity Sp = 95.46% for MIT-BIH Arrhythmia database. This algorithm can be used as the basis for the software to diagnose electrical activity of the heart.

  5. Design of robotic cells based on relative handling modules with use of SolidWorks system

    NASA Astrophysics Data System (ADS)

    Gaponenko, E. V.; Anciferov, S. I.

    2018-05-01

    The article presents a diagramed engineering solution for a robotic cell with six degrees of freedom for machining of complex details, consisting of the base with a tool installation module and a detail machining module made as parallel structure mechanisms. The output links of the detail machining module and the tool installation module can move along X-Y-Z coordinate axes each. A 3D-model of the complex is designed in the SolidWorks system. It will be used further for carrying out engineering calculations and mathematical analysis and obtaining all required documentation.

  6. Molecular Symmetry in Ab Initio Calculations

    NASA Astrophysics Data System (ADS)

    Madhavan, P. V.; Written, J. L.

    1987-05-01

    A scheme is presented for the construction of the Fock matrix in LCAO-SCF calculations and for the transformation of basis integrals to LCAO-MO integrals that can utilize several symmetry unique lists of integrals corresponding to different symmetry groups. The algorithm is fully compatible with vector processing machines and is especially suited for parallel processing machines.

  7. Crystal Orientation Effect on the Subsurface Deformation of Monocrystalline Germanium in Nanometric Cutting.

    PubMed

    Lai, Min; Zhang, Xiaodong; Fang, Fengzhou

    2017-12-01

    Molecular dynamics simulations of nanometric cutting on monocrystalline germanium are conducted to investigate the subsurface deformation during and after nanometric cutting. The continuous random network model of amorphous germanium is established by molecular dynamics simulation, and its characteristic parameters are extracted to compare with those of the machined deformed layer. The coordination number distribution and radial distribution function (RDF) show that the machined surface presents the similar amorphous state. The anisotropic subsurface deformation is studied by nanometric cutting on the (010), (101), and (111) crystal planes of germanium, respectively. The deformed structures are prone to extend along the 110 slip system, which leads to the difference in the shape and thickness of the deformed layer on various directions and crystal planes. On machined surface, the greater thickness of subsurface deformed layer induces the greater surface recovery height. In order to get the critical thickness limit of deformed layer on machined surface of germanium, the optimized cutting direction on each crystal plane is suggested according to the relevance of the nanometric cutting to the nanoindentation.

  8. Machine learning in genetics and genomics

    PubMed Central

    Libbrecht, Maxwell W.; Noble, William Stafford

    2016-01-01

    The field of machine learning promises to enable computers to assist humans in making sense of large, complex data sets. In this review, we outline some of the main applications of machine learning to genetic and genomic data. In the process, we identify some recurrent challenges associated with this type of analysis and provide general guidelines to assist in the practical application of machine learning to real genetic and genomic data. PMID:25948244

  9. Analysing and Rationalising Molecular and Materials Databases Using Machine-Learning

    NASA Astrophysics Data System (ADS)

    de, Sandip; Ceriotti, Michele

    Computational materials design promises to greatly accelerate the process of discovering new or more performant materials. Several collaborative efforts are contributing to this goal by building databases of structures, containing between thousands and millions of distinct hypothetical compounds, whose properties are computed by high-throughput electronic-structure calculations. The complexity and sheer amount of information has made manual exploration, interpretation and maintenance of these databases a formidable challenge, making it necessary to resort to automatic analysis tools. Here we will demonstrate how, starting from a measure of (dis)similarity between database items built from a combination of local environment descriptors, it is possible to apply hierarchical clustering algorithms, as well as dimensionality reduction methods such as sketchmap, to analyse, classify and interpret trends in molecular and materials databases, as well as to detect inconsistencies and errors. Thanks to the agnostic and flexible nature of the underlying metric, we will show how our framework can be applied transparently to different kinds of systems ranging from organic molecules and oligopeptides to inorganic crystal structures as well as molecular crystals. Funded by National Center for Computational Design and Discovery of Novel Materials (MARVEL) and Swiss National Science Foundation.

  10. Structures of EccB 1 and EccD 1 from the core complex of the mycobacterial ESX-1 type VII secretion system

    DOE PAGES

    Wagner, Jonathan M.; Chan, Sum; Evans, Timothy J.; ...

    2016-02-27

    The ESX-1 type VII secretion system is an important determinant of virulence in pathogenic mycobacteria, including Mycobacterium tuberculosis. This complicated molecular machine secretes folded proteins through the mycobacterial cell envelope to subvert the host immune response. Despite its important role in disease very little is known about the molecular architecture of the ESX-1 secretion system. This study characterizes the structures of the soluble domains of two conserved core ESX-1 components – EccB 1 and EccD 1. The periplasmic domain of EccB 1 consists of 4 repeat domains and a central domain, which together form a quasi 2-fold symmetrical structure. Themore » repeat domains of EccB 1 are structurally similar to a known peptidoglycan binding protein suggesting a role in anchoring the ESX-1 system within the periplasmic space. The cytoplasmic domain of EccD 1 has a ubiquitin-like fold and forms a dimer with a negatively charged groove. In conclusion, these structures represent a major step towards resolving the molecular architecture of the entire ESX-1 assembly and may contribute to ESX-1 targeted tuberculosis intervention strategies.« less

  11. Structures of EccB 1 and EccD 1 from the core complex of the mycobacterial ESX-1 type VII secretion system

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wagner, Jonathan M.; Chan, Sum; Evans, Timothy J.

    The ESX-1 type VII secretion system is an important determinant of virulence in pathogenic mycobacteria, including Mycobacterium tuberculosis. This complicated molecular machine secretes folded proteins through the mycobacterial cell envelope to subvert the host immune response. Despite its important role in disease very little is known about the molecular architecture of the ESX-1 secretion system. This study characterizes the structures of the soluble domains of two conserved core ESX-1 components – EccB 1 and EccD 1. The periplasmic domain of EccB 1 consists of 4 repeat domains and a central domain, which together form a quasi 2-fold symmetrical structure. Themore » repeat domains of EccB 1 are structurally similar to a known peptidoglycan binding protein suggesting a role in anchoring the ESX-1 system within the periplasmic space. The cytoplasmic domain of EccD 1 has a ubiquitin-like fold and forms a dimer with a negatively charged groove. In conclusion, these structures represent a major step towards resolving the molecular architecture of the entire ESX-1 assembly and may contribute to ESX-1 targeted tuberculosis intervention strategies.« less

  12. Design of efficient molecular organic light-emitting diodes by a high-throughput virtual screening and experimental approach.

    PubMed

    Gómez-Bombarelli, Rafael; Aguilera-Iparraguirre, Jorge; Hirzel, Timothy D; Duvenaud, David; Maclaurin, Dougal; Blood-Forsythe, Martin A; Chae, Hyun Sik; Einzinger, Markus; Ha, Dong-Gwang; Wu, Tony; Markopoulos, Georgios; Jeon, Soonok; Kang, Hosuk; Miyazaki, Hiroshi; Numata, Masaki; Kim, Sunghan; Huang, Wenliang; Hong, Seong Ik; Baldo, Marc; Adams, Ryan P; Aspuru-Guzik, Alán

    2016-10-01

    Virtual screening is becoming a ground-breaking tool for molecular discovery due to the exponential growth of available computer time and constant improvement of simulation and machine learning techniques. We report an integrated organic functional material design process that incorporates theoretical insight, quantum chemistry, cheminformatics, machine learning, industrial expertise, organic synthesis, molecular characterization, device fabrication and optoelectronic testing. After exploring a search space of 1.6 million molecules and screening over 400,000 of them using time-dependent density functional theory, we identified thousands of promising novel organic light-emitting diode molecules across the visible spectrum. Our team collaboratively selected the best candidates from this set. The experimentally determined external quantum efficiencies for these synthesized candidates were as large as 22%.

  13. Design of efficient molecular organic light-emitting diodes by a high-throughput virtual screening and experimental approach

    NASA Astrophysics Data System (ADS)

    Gómez-Bombarelli, Rafael; Aguilera-Iparraguirre, Jorge; Hirzel, Timothy D.; Duvenaud, David; MacLaurin, Dougal; Blood-Forsythe, Martin A.; Chae, Hyun Sik; Einzinger, Markus; Ha, Dong-Gwang; Wu, Tony; Markopoulos, Georgios; Jeon, Soonok; Kang, Hosuk; Miyazaki, Hiroshi; Numata, Masaki; Kim, Sunghan; Huang, Wenliang; Hong, Seong Ik; Baldo, Marc; Adams, Ryan P.; Aspuru-Guzik, Alán

    2016-10-01

    Virtual screening is becoming a ground-breaking tool for molecular discovery due to the exponential growth of available computer time and constant improvement of simulation and machine learning techniques. We report an integrated organic functional material design process that incorporates theoretical insight, quantum chemistry, cheminformatics, machine learning, industrial expertise, organic synthesis, molecular characterization, device fabrication and optoelectronic testing. After exploring a search space of 1.6 million molecules and screening over 400,000 of them using time-dependent density functional theory, we identified thousands of promising novel organic light-emitting diode molecules across the visible spectrum. Our team collaboratively selected the best candidates from this set. The experimentally determined external quantum efficiencies for these synthesized candidates were as large as 22%.

  14. Highly ordered molecular rotor matrix on a nanopatterned template: titanyl phthalocyanine molecules on FeO/Pt(111).

    PubMed

    Lu, Shuangzan; Huang, Min; Qin, Zhihui; Yu, Yinghui; Guo, Qinmin; Cao, Gengyu

    2018-08-03

    Molecular rotors, motors and gears play important roles in artificial molecular machines, in which rotor and motor matrices are highly desirable for large-scale bottom-up fabrication of molecular machines. Here we demonstrate the fabrication of a highly ordered molecular rotor matrix by depositing nonplanar dipolar titanyl phthalocyanine (TiOPc, C 32 H 16 N 8 OTi) molecules on a Moiré patterned dipolar FeO/Pt(111) substrate. TiOPc molecules with O atoms pointing outwards from the substrate (upward) or towards the substrate (downward) are alternatively adsorbed on the fcc sites by strong lateral confinement. The adsorbed molecules, i.e. two kinds of molecular rotors, show different scanning tunneling microscopy images, thermal stabilities and rotational characteristics. Density functional theory calculations clarify that TiOPc molecules anchoring upwards with high adsorption energies correspond to low-rotational-rate rotors, while those anchoring downwards with low adsorption energies correspond to high-rotational-rate rotors. A robust rotor matrix fully occupied by low-rate rotors is fabricated by depositing molecules on the substrate at elevated temperature. Such a paradigm opens up a promising route to fabricate functional molecular rotor matrices, driven motor matrices and even gear groups on solid substrates.

  15. Temperature and oxygenation during organ preservation: friends or foes?

    PubMed

    Gilbo, Nicholas; Monbaliu, Diethard

    2017-06-01

    The liberalization of donor selection criteria in organ transplantation, with the increased use of suboptimal grafts, has stimulated interest in ischemia-reperfusion injury prevention and graft reconditioning. Organ preservation technologies are changing considerably, mostly through the reintroduction of dynamic machine preservation. Here, we review the current evidence on the role of temperature and oxygenation during dynamic machine preservation. A large but complex body of evidence exists and comparative studies are few. Oxygenation seems to support an advantageous effect in hypothermic machine preservation and is mandatory in normothermic machine preservation, although in the latter, supraphysiological oxygen tensions should be avoided. High-risk grafts, such as suboptimal organs, may optimally benefit from oxygenated perfusion conditions that support metabolism and activate mechanisms of repair such as subnormothermic machine preservation, controlled oxygenated rewarming, and normothermic machine preservation. For lower risk grafts, oxygenation during hypothermic machine preservation may sufficiently reduce injuries and recharge the cellular energy to secure functional recovery after transplantation. The relationship between temperature and oxygenation in organ preservation is more complex than physiological laws would suggest. Rather than one default perfusion temperature/oxygenation standard, perfusion protocols should be tailored for specific needs of grafts of different quality.

  16. Synaptic organic transistors with a vacuum-deposited charge-trapping nanosheet

    NASA Astrophysics Data System (ADS)

    Kim, Chang-Hyun; Sung, Sujin; Yoon, Myung-Han

    2016-09-01

    Organic neuromorphic devices hold great promise for unconventional signal processing and efficient human-machine interfaces. Herein, we propose novel synaptic organic transistors devised to overcome the traditional trade-off between channel conductance and memory performance. A vacuum-processed, nanoscale metallic interlayer provides an ultra-flat surface for a high-mobility molecular film as well as a desirable degree of charge trapping, allowing for low-temperature fabrication of uniform device arrays on plastic. The device architecture is implemented by widely available electronic materials in combination with conventional deposition methods. Therefore, our results are expected to generate broader interests in incorporation of organic electronics into large-area neuromorphic systems, with potential in gate-addressable complex logic circuits and transparent multifunctional interfaces receiving direct optical and cellular stimulation.

  17. Deep learning in pharmacogenomics: from gene regulation to patient stratification.

    PubMed

    Kalinin, Alexandr A; Higgins, Gerald A; Reamaroon, Narathip; Soroushmehr, Sayedmohammadreza; Allyn-Feuer, Ari; Dinov, Ivo D; Najarian, Kayvan; Athey, Brian D

    2018-05-01

    This Perspective provides examples of current and future applications of deep learning in pharmacogenomics, including: identification of novel regulatory variants located in noncoding domains of the genome and their function as applied to pharmacoepigenomics; patient stratification from medical records; and the mechanistic prediction of drug response, targets and their interactions. Deep learning encapsulates a family of machine learning algorithms that has transformed many important subfields of artificial intelligence over the last decade, and has demonstrated breakthrough performance improvements on a wide range of tasks in biomedicine. We anticipate that in the future, deep learning will be widely used to predict personalized drug response and optimize medication selection and dosing, using knowledge extracted from large and complex molecular, epidemiological, clinical and demographic datasets.

  18. Three dimensional electron microscopy and in silico tools for macromolecular structure determination

    PubMed Central

    Borkotoky, Subhomoi; Meena, Chetan Kumar; Khan, Mohammad Wahab; Murali, Ayaluru

    2013-01-01

    Recently, structural biology witnessed a major tool - electron microscopy - in solving the structures of macromolecules in addition to the conventional techniques, X-ray crystallography and nuclear magnetic resonance (NMR). Three dimensional transmission electron microscopy (3DTEM) is one of the most sophisticated techniques for structure determination of molecular machines. Known to give the 3-dimensional structures in its native form with literally no upper limit on size of the macromolecule, this tool does not need the crystallization of the protein. Combining the 3DTEM data with in silico tools, one can have better refined structure of a desired complex. In this review we are discussing about the recent advancements in three dimensional electron microscopy and tools associated with it. PMID:27092033

  19. Silicon sample holder for molecular beam epitaxy on pre-fabricated integrated circuits

    NASA Technical Reports Server (NTRS)

    Hoenk, Michael E. (Inventor); Grunthaner, Paula J. (Inventor); Grunthaner, Frank J. (Inventor)

    1994-01-01

    The sample holder of the invention is formed of the same semiconductor crystal as the integrated circuit on which the molecular beam expitaxial process is to be performed. In the preferred embodiment, the sample holder comprises three stacked micro-machined silicon wafers: a silicon base wafer having a square micro-machined center opening corresponding in size and shape to the active area of a CCD imager chip, a silicon center wafer micro-machined as an annulus having radially inwardly pointing fingers whose ends abut the edges of and center the CCD imager chip within the annulus, and a silicon top wafer micro-machined as an annulus having cantilevered membranes which extend over the top of the CCD imager chip. The micro-machined silicon wafers are stacked in the order given above with the CCD imager chip centered in the center wafer and sandwiched between the base and top wafers. The thickness of the center wafer is about 20% less than the thickness of the CCD imager chip. Preferably, four titanium wires, each grasping the edges of the top and base wafers, compress all three wafers together, flexing the cantilever fingers of the top wafer to accommodate the thickness of the CCD imager chip, acting as a spring holding the CCD imager chip in place.

  20. An active role for machine learning in drug development

    PubMed Central

    Murphy, Robert F.

    2014-01-01

    Due to the complexity of biological systems, cutting-edge machine-learning methods will be critical for future drug development. In particular, machine-vision methods to extract detailed information from imaging assays and active-learning methods to guide experimentation will be required to overcome the dimensionality problem in drug development. PMID:21587249

  1. The Machines Are Coming: Future Directions in Instructional Communication Research. Forum: The Future of Instructional Communication

    ERIC Educational Resources Information Center

    Edwards, Autumn; Edwards, Chad

    2017-01-01

    Educational encounters of the future (and increasingly, of the present) will involve a complex collaboration of human and machine intelligences and agents, partnering to enhance learning and growth. Increasingly, "students and instructors are not only talking 'through' machines, but also [talking] 'to them', and 'within them'" (Edwards…

  2. Nanotube Heterojunctions and Endo-Fullerenes for Nanoelectronics

    NASA Technical Reports Server (NTRS)

    Srivastava, Deepak; Menon, M.; Andriotis, Antonis; Cho, K.; Park, Jun; Biegel, Bryan A. (Technical Monitor)

    2002-01-01

    Topics discussed include: (1) Light-Weight Multi-Functional Materials: Nanomechanics; Nanotubes and Composites; Thermal/Chemical/Electrical Characterization; (2) Biomimetic/Revolutionary Concepts: Evolutionary Computing and Sensing; Self-Heating Materials; (3) Central Computing System: Molecular Electronics; Materials for Quantum Bits; and (4) Molecular Machines.

  3. Nanomedicine: Tiny Particles and Machines Give Huge Gains

    PubMed Central

    Tong, Sheng; Fine, Eli J.; Lin, Yanni; Cradick, Thomas J.; Bao, Gang

    2014-01-01

    Nanomedicine is an emerging field that integrates nanotechnology, biomolecular engineering, life sciences and medicine; it is expected to produce major breakthroughs in medical diagnostics and therapeutics. Nano-scale structures and devices are compatible in size with proteins and nucleic acids in living cells. Therefore, the design, characterization and application of nano-scale probes, carriers and machines may provide unprecedented opportunities for achieving a better control of biological processes, and drastic improvements in disease detection, therapy, and prevention. Recent advances in nanomedicine include the development of nanoparticle-based probes for molecular imaging, nano-carriers for drug/gene delivery, multi-functional nanoparticles for theranostics, and molecular machines for biological and medical studies. This article provides an overview of the nanomedicine field, with an emphasis on nanoparticles for imaging and therapy, as well as engineered nucleases for genome editing. The challenges in translating nanomedicine approaches to clinical applications are discussed. PMID:24297494

  4. The ligand binding mechanism to purine nucleoside phosphorylase elucidated via molecular dynamics and machine learning.

    PubMed

    Decherchi, Sergio; Berteotti, Anna; Bottegoni, Giovanni; Rocchia, Walter; Cavalli, Andrea

    2015-01-27

    The study of biomolecular interactions between a drug and its biological target is of paramount importance for the design of novel bioactive compounds. In this paper, we report on the use of molecular dynamics (MD) simulations and machine learning to study the binding mechanism of a transition state analogue (DADMe-immucillin-H) to the purine nucleoside phosphorylase (PNP) enzyme. Microsecond-long MD simulations allow us to observe several binding events, following different dynamical routes and reaching diverse binding configurations. These simulations are used to estimate kinetic and thermodynamic quantities, such as kon and binding free energy, obtaining a good agreement with available experimental data. In addition, we advance a hypothesis for the slow-onset inhibition mechanism of DADMe-immucillin-H against PNP. Combining extensive MD simulations with machine learning algorithms could therefore be a fruitful approach for capturing key aspects of drug-target recognition and binding.

  5. METHODS OF TREATMENT OF COMPLEX SURFACES ON METAL CUTTING MACHINES (CHAPTERS 1 AND 12),

    DTIC Science & Technology

    FORGING, MOLDINGS, MANDRELS, MARINE PROPELLERS, AERIAL PROPELLERS, TURBINE BLADES, ABRASIVES, IMPELLERS, AIRCRAFT PANELS, METAL PLATES, CAMS, ELECTROEROSIVE MACHINING, CHEMICAL MILLING, MAGNETOSTRICTIVE ELEMENTS, USSR.

  6. Direct interaction of the bacteriophage SPP1 packaging ATPase with the portal protein.

    PubMed

    Oliveira, Leonor; Cuervo, Ana; Tavares, Paulo

    2010-03-05

    DNA packaging in tailed bacteriophages and other viruses requires assembly of a complex molecular machine at a specific vertex of the procapsid. This machine is composed of the portal protein that provides a tunnel for DNA entry, an ATPase that fuels DNA translocation (large terminase subunit), and most frequently, a small terminase subunit. Here we characterized the interaction between the terminase ATPase subunit of bacteriophage SPP1 (gp2) and the procapsid portal vertex. We found, by affinity pulldown assays with purified proteins, that gp2 interacts with the portal protein, gp6, independently of the terminase small subunit gp1, DNA, or ATP. The gp2-procapsid interaction via the portal protein depends on gp2 concentration and requires the presence of divalent cations. Competition experiments showed that isolated gp6 can only inhibit gp2-procapsid interactions and DNA packaging at gp6:procapsid molar ratios above 10-fold. Assays with gp6 carrying mutations in distinct regions of its structure that affect the portal-induced stimulation of ATPase and DNA packaging revealed that none of these mutations impedes gp2-gp6 binding. Our results demonstrate that the SPP1 packaging ATPase binds directly to the portal and that the interaction is stronger with the portal embedded in procapsids. Identification of mutations in gp6 that allow for assembly of the ATPase-portal complex but impair DNA packaging support an intricate cross-talk between the two proteins for activity of the DNA translocation motor.

  7. Covering Cavities by Electrodeposition

    NASA Technical Reports Server (NTRS)

    Schmeets, M.; Duesberg, J.

    1986-01-01

    Reworking technique allows complex surfaces to be reshaped. Contours of large machined parts reworked quickly and inexpensively by electrodeposition and machining, with little risk of damage. Reworking method employs simple, reliable, well-known procedures.

  8. Clustering the Orion B giant molecular cloud based on its molecular emission

    NASA Astrophysics Data System (ADS)

    Bron, Emeric; Daudon, Chloé; Pety, Jérôme; Levrier, François; Gerin, Maryvonne; Gratier, Pierre; Orkisz, Jan H.; Guzman, Viviana; Bardeau, Sébastien; Goicoechea, Javier R.; Liszt, Harvey; Öberg, Karin; Peretto, Nicolas; Sievers, Albrecht; Tremblin, Pascal

    2018-02-01

    Context. Previous attempts at segmenting molecular line maps of molecular clouds have focused on using position-position-velocity data cubes of a single molecular line to separate the spatial components of the cloud. In contrast, wide field spectral imaging over a large spectral bandwidth in the (sub)mm domain now allows one to combine multiple molecular tracers to understand the different physical and chemical phases that constitute giant molecular clouds (GMCs). Aims: We aim at using multiple tracers (sensitive to different physical processes and conditions) to segment a molecular cloud into physically/chemically similar regions (rather than spatially connected components), thus disentangling the different physical/chemical phases present in the cloud. Methods: We use a machine learning clustering method, namely the Meanshift algorithm, to cluster pixels with similar molecular emission, ignoring spatial information. Clusters are defined around each maximum of the multidimensional probability density function (PDF) of the line integrated intensities. Simple radiative transfer models were used to interpret the astrophysical information uncovered by the clustering analysis. Results: A clustering analysis based only on the J = 1-0 lines of three isotopologues of CO proves sufficient to reveal distinct density/column density regimes (nH 100 cm-3, 500 cm-3, and >1000 cm-3), closely related to the usual definitions of diffuse, translucent and high-column-density regions. Adding two UV-sensitive tracers, the J = 1-0 line of HCO+ and the N = 1-0 line of CN, allows us to distinguish two clearly distinct chemical regimes, characteristic of UV-illuminated and UV-shielded gas. The UV-illuminated regime shows overbright HCO+ and CN emission, which we relate to a photochemical enrichment effect. We also find a tail of high CN/HCO+ intensity ratio in UV-illuminated regions. Finer distinctions in density classes (nH 7 × 103 cm-3, 4 × 104 cm-3) for the densest regions are also identified, likely related to the higher critical density of the CN and HCO+ (1-0) lines. These distinctions are only possible because the high-density regions are spatially resolved. Conclusions: Molecules are versatile tracers of GMCs because their line intensities bear the signature of the physics and chemistry at play in the gas. The association of simultaneous multi-line, wide-field mapping and powerful machine learning methods such as the Meanshift clustering algorithm reveals how to decode the complex information available in these molecular tracers. Data products associated with this paper are available at the CDS via anonymous ftp to http://cdsarc.u-strasbg.fr (http://130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/610/A12 and at http://www.iram.fr/ pety/ORION-B

  9. Molecular transformers in the cell: lessons learned from the DegP protease-chaperone.

    PubMed

    Sawa, Justyna; Heuck, Alexander; Ehrmann, Michael; Clausen, Tim

    2010-04-01

    Structure-function analysis of DegP revealed a novel mechanism for protease and chaperone regulation. Binding of unfolded proteins induces the oligomer reassembly from the resting hexamer (DegP6) into the functional protease-chaperone DegP12/24. The newly formed cage exhibits the characteristics of a proteolytic folding chamber, shredding those proteins that are severely misfolded while stabilizing and protecting proteins present in their native state. Isolation of native DegP complexes with folded outer membrane proteins (OMPs) highlights the importance of DegP in OMP biogenesis. The encapsulated OMP beta-barrel is significantly stabilized in the hydrophobic chamber of DegP12/24 and thus DegP seems to employ a reciprocal mechanism to those chaperones assisting the folding of water soluble proteins via polar interactions. In addition, we discuss in this review similarities to other complex proteolytic machines that, like DegP, are under control of a substrate-induced or stress-induced oligomer conversion.

  10. Perturbation Theory/Machine Learning Model of ChEMBL Data for Dopamine Targets: Docking, Synthesis, and Assay of New l-Prolyl-l-leucyl-glycinamide Peptidomimetics.

    PubMed

    Ferreira da Costa, Joana; Silva, David; Caamaño, Olga; Brea, José M; Loza, Maria Isabel; Munteanu, Cristian R; Pazos, Alejandro; García-Mera, Xerardo; González-Díaz, Humbert

    2018-06-25

    Predicting drug-protein interactions (DPIs) for target proteins involved in dopamine pathways is a very important goal in medicinal chemistry. We can tackle this problem using Molecular Docking or Machine Learning (ML) models for one specific protein. Unfortunately, these models fail to account for large and complex big data sets of preclinical assays reported in public databases. This includes multiple conditions of assays, such as different experimental parameters, biological assays, target proteins, cell lines, organism of the target, or organism of assay. On the other hand, perturbation theory (PT) models allow us to predict the properties of a query compound or molecular system in experimental assays with multiple boundary conditions based on a previously known case of reference. In this work, we report the first PTML (PT + ML) study of a large ChEMBL data set of preclinical assays of compounds targeting dopamine pathway proteins. The best PTML model found predicts 50000 cases with accuracy of 70-91% in training and external validation series. We also compared the linear PTML model with alternative PTML models trained with multiple nonlinear methods (artificial neural network (ANN), Random Forest, Deep Learning, etc.). Some of the nonlinear methods outperform the linear model but at the cost of a notable increment of the complexity of the model. We illustrated the practical use of the new model with a proof-of-concept theoretical-experimental study. We reported for the first time the organic synthesis, chemical characterization, and pharmacological assay of a new series of l-prolyl-l-leucyl-glycinamide (PLG) peptidomimetic compounds. In addition, we performed a molecular docking study for some of these compounds with the software Vina AutoDock. The work ends with a PTML model predictive study of the outcomes of the new compounds in a large number of assays. Therefore, this study offers a new computational methodology for predicting the outcome for any compound in new assays. This PTML method focuses on the prediction with a simple linear model of multiple pharmacological parameters (IC 50 , EC 50 , K i , etc.) for compounds in assays involving different cell lines used, organisms of the protein target, or organism of assay for proteins in the dopamine pathway.

  11. Ryan King | NREL

    Science.gov Websites

    research focuses on optimization and machine learning applied to complex energy systems and turbulent flows techniques to improve wind plant design and controls and developed a new data-driven machine learning closure

  12. A new machine classification method applied to human peripheral blood leukocytes

    NASA Technical Reports Server (NTRS)

    Rorvig, Mark E.; Fitzpatrick, Steven J.; Vitthal, Sanjay; Ladoulis, Charles T.

    1994-01-01

    Human beings judge images by complex mental processes, whereas computing machines extract features. By reducing scaled human judgments and machine extracted features to a common metric space and fitting them by regression, the judgments of human experts rendered on a sample of images may be imposed on an image population to provide automatic classification.

  13. Study of a Variable Mass Atwood's Machine Using a Smartphone

    ERIC Educational Resources Information Center

    Lopez, Dany; Caprile, Isidora; Corvacho, Fernando; Reyes, Orfa

    2018-01-01

    The Atwood machine was invented in 1784 by George Atwood and this system has been widely studied both theoretically and experimentally over the years. Nowadays, it is commonplace that many experimental physics courses include both Atwood's machine and variable mass to introduce more complex concepts in physics. To study the dynamics of the masses…

  14. Switchable host-guest systems on surfaces.

    PubMed

    Yang, Ying-Wei; Sun, Yu-Long; Song, Nan

    2014-07-15

    CONSPECTUS: For device miniaturization, nanotechnology follows either the "top-down" approach scaling down existing larger-scale devices or the "bottom-up' approach assembling the smallest possible building blocks to functional nanoscale entities. For synthetic nanodevices, self-assembly on surfaces is a superb method to achieve useful functions and enable their interactions with the surrounding world. Consequently, adaptability and responsiveness to external stimuli are other prerequisites for their successful operation. Mechanically interlocked molecules such as rotaxanes and catenanes, and their precursors, that is, molecular switches and supramolecular switches including pseudorotaxanes, are molecular machines or prototypes of machines capable of mechanical motion induced by chemical signals, biological inputs, light or redox processes as the external stimuli. Switching of these functional host-guest systems on surfaces becomes a fundamental requirement for artificial molecular machines to work, mimicking the molecular machines in nature, such as proteins and their assemblies operating at dynamic interfaces such as the surfaces of cell membranes. Current research endeavors in material science and technology are focused on developing either a new class of materials or materials with novel/multiple functionalities by shifting host-guest chemistry from solution phase to surfaces. In this Account, we present our most recent attempts of building monolayers of rotaxanes/pseudorotaxanes on surfaces, providing stimuli-induced macroscopic effects and further understanding on the switchable host-guest systems at interfaces. Biocompatible versions of molecular machines based on synthetic macrocycles, such as cucurbiturils, pillararenes, calixarenes, and cyclodextrins, have been employed to form self-assembled monolayers of gates on the surfaces of mesoporous silica nanoparticles to regulate the controlled release of cargo/drug molecules under a range of external stimuli, such as light, pH variations, competitive binding, and enzyme. Rotaxanes have also been assembled onto the surfaces of gold nanodisks and microcantilevers to realize active molecular plasmonics and synthetic molecular actuators for device fabrication and function. Pillararenes have been successfully used to control and aid the synthesis of gold nanoparticles, semiconducting quantum dots, and magnetic nanoparticles. The resulting organic-inorganic hydrid nanomaterials have been successfully used for controlled self-assembly, herbicide sensing and detection, pesticide removal, and so forth, taking advantage of the selective binding of pillarenes toward target molecules. Cyclodextrins have also been successfully functionalized onto the surface of gold nanoparticles to serve as recycling extractors for C60. Many interesting prototypes of nanodevices based on synthetic macrocycles and their host-guest chemistry have been constructed and served for different potential applications. This Account will be a summary of the efforts made mainly by us, and others, on the host-guest chemistry of synthetic macrocyclic compounds on the surfaces of different solid supports.

  15. Certification of highly complex safety-related systems.

    PubMed

    Reinert, D; Schaefer, M

    1999-01-01

    The BIA has now 15 years of experience with the certification of complex electronic systems for safety-related applications in the machinery sector. Using the example of machining centres this presentation will show the systematic procedure for verifying and validating control systems using Application Specific Integrated Circuits (ASICs) and microcomputers for safety functions. One section will describe the control structure of machining centres with control systems using "integrated safety." A diverse redundant architecture combined with crossmonitoring and forced dynamization is explained. In the main section the steps of the systematic certification procedure are explained showing some results of the certification of drilling machines. Specification reviews, design reviews with test case specification, statistical analysis, and walk-throughs are the analytical measures in the testing process. Systematic tests based on the test case specification, Electro Magnetic Interference (EMI), and environmental testing, and site acceptance tests on the machines are the testing measures for validation. A complex software driven system is always undergoing modification. Most of the changes are not safety-relevant but this has to be proven. A systematic procedure for certifying software modifications is presented in the last section of the paper.

  16. Distinctive Roles for Periplasmic Proteases in the Maintenance of Essential Outer Membrane Protein Assembly.

    PubMed

    Soltes, Garner R; Martin, Nicholas R; Park, Eunhae; Sutterlin, Holly A; Silhavy, Thomas J

    2017-10-15

    Outer membrane protein (OMP) biogenesis in Escherichia coli is a robust process essential to the life of the organism. It is catalyzed by the β-barrel assembly machine (Bam) complex, and a number of quality control factors, including periplasmic chaperones and proteases, maintain the integrity of this trafficking pathway. Little is known, however, about how periplasmic proteases recognize and degrade OMP substrates when assembly is compromised or whether different proteases recognize the same substrate at distinct points in the assembly pathway. In this work, we use well-defined assembly-defective mutants of LptD, the essential lipopolysaccharide assembly translocon, to show that the periplasmic protease DegP degrades substrates with assembly defects that prevent or impair initial contact with Bam, causing the mutant protein to accumulate in the periplasm. In contrast, another periplasmic protease, BepA, degrades a LptD mutant substrate that has engaged the Bam complex and formed a nearly complete barrel. Furthermore, we describe the role of the outer membrane lipoprotein YcaL, a protease of heretofore unknown function, in the degradation of a LptD substrate that has engaged the Bam complex but is stalled at an earlier step in the assembly process that is not accessible to BepA. Our results demonstrate that multiple periplasmic proteases monitor OMPs at distinct points in the assembly process. IMPORTANCE OMP assembly is catalyzed by the essential Bam complex and occurs in a cellular environment devoid of energy sources. Assembly intermediates that misfold can compromise this essential molecular machine. Here we demonstrate distinctive roles for three different periplasmic proteases that can clear OMP substrates with folding defects that compromise assembly at three different stages. These quality control factors help ensure the integrity of the permeability barrier that contributes to the intrinsic resistance of Gram-negative organisms to many antibiotics. Copyright © 2017 American Society for Microbiology.

  17. Three-point compound sine plate offers cost and weight savings

    NASA Technical Reports Server (NTRS)

    Barras, A. P.

    1972-01-01

    Work piece adjustment fixture reduces size, weight and set-up complexity of alignment platforms used in metal blank machining. Design benefits designers and manufacturers of machine tools and measuring equipment.

  18. ReactionPredictor: prediction of complex chemical reactions at the mechanistic level using machine learning.

    PubMed

    Kayala, Matthew A; Baldi, Pierre

    2012-10-22

    Proposing reasonable mechanisms and predicting the course of chemical reactions is important to the practice of organic chemistry. Approaches to reaction prediction have historically used obfuscating representations and manually encoded patterns or rules. Here we present ReactionPredictor, a machine learning approach to reaction prediction that models elementary, mechanistic reactions as interactions between approximate molecular orbitals (MOs). A training data set of productive reactions known to occur at reasonable rates and yields and verified by inclusion in the literature or textbooks is derived from an existing rule-based system and expanded upon with manual curation from graduate level textbooks. Using this training data set of complex polar, hypervalent, radical, and pericyclic reactions, a two-stage machine learning prediction framework is trained and validated. In the first stage, filtering models trained at the level of individual MOs are used to reduce the space of possible reactions to consider. In the second stage, ranking models over the filtered space of possible reactions are used to order the reactions such that the productive reactions are the top ranked. The resulting model, ReactionPredictor, perfectly ranks polar reactions 78.1% of the time and recovers all productive reactions 95.7% of the time when allowing for small numbers of errors. Pericyclic and radical reactions are perfectly ranked 85.8% and 77.0% of the time, respectively, rising to >93% recovery for both reaction types with a small number of allowed errors. Decisions about which of the polar, pericyclic, or radical reaction type ranking models to use can be made with >99% accuracy. Finally, for multistep reaction pathways, we implement the first mechanistic pathway predictor using constrained tree-search to discover a set of reasonable mechanistic steps from given reactants to given products. Webserver implementations of both the single step and pathway versions of ReactionPredictor are available via the chemoinformatics portal http://cdb.ics.uci.edu/.

  19. Protein-protein interaction networks: unraveling the wiring of molecular machines within the cell.

    PubMed

    De Las Rivas, Javier; Fontanillo, Celia

    2012-11-01

    Mapping and understanding of the protein interaction networks with their key modules and hubs can provide deeper insights into the molecular machinery underlying complex phenotypes. In this article, we present the basic characteristics and definitions of protein networks, starting with a distinction of the different types of associations between proteins. We focus the review on protein-protein interactions (PPIs), a subset of associations defined as physical contacts between proteins that occur by selective molecular docking in a particular biological context. We present such definition as opposed to other types of protein associations derived from regulatory, genetic, structural or functional relations. To determine PPIs, a variety of binary and co-complex methods exist; however, not all the technologies provide the same information and data quality. A way of increasing confidence in a given protein interaction is to integrate orthogonal experimental evidences. The use of several complementary methods testing each single interaction assesses the accuracy of PPI data and tries to minimize the occurrence of false interactions. Following this approach there have been important efforts to unify primary databases of experimentally proven PPIs into integrated databases. These meta-databases provide a measure of the confidence of interactions based on the number of experimental proofs that report them. As a conclusion, we can state that integrated information allows the building of more reliable interaction networks. Identification of communities, cliques, modules and hubs by analysing the topological parameters and graph properties of the protein networks allows the discovery of central/critical nodes, which are candidates to regulate cellular flux and dynamics.

  20. A Second-Generation Device for Automated Training and Quantitative Behavior Analyses of Molecularly-Tractable Model Organisms

    PubMed Central

    Blackiston, Douglas; Shomrat, Tal; Nicolas, Cindy L.; Granata, Christopher; Levin, Michael

    2010-01-01

    A deep understanding of cognitive processes requires functional, quantitative analyses of the steps leading from genetics and the development of nervous system structure to behavior. Molecularly-tractable model systems such as Xenopus laevis and planaria offer an unprecedented opportunity to dissect the mechanisms determining the complex structure of the brain and CNS. A standardized platform that facilitated quantitative analysis of behavior would make a significant impact on evolutionary ethology, neuropharmacology, and cognitive science. While some animal tracking systems exist, the available systems do not allow automated training (feedback to individual subjects in real time, which is necessary for operant conditioning assays). The lack of standardization in the field, and the numerous technical challenges that face the development of a versatile system with the necessary capabilities, comprise a significant barrier keeping molecular developmental biology labs from integrating behavior analysis endpoints into their pharmacological and genetic perturbations. Here we report the development of a second-generation system that is a highly flexible, powerful machine vision and environmental control platform. In order to enable multidisciplinary studies aimed at understanding the roles of genes in brain function and behavior, and aid other laboratories that do not have the facilities to undergo complex engineering development, we describe the device and the problems that it overcomes. We also present sample data using frog tadpoles and flatworms to illustrate its use. Having solved significant engineering challenges in its construction, the resulting design is a relatively inexpensive instrument of wide relevance for several fields, and will accelerate interdisciplinary discovery in pharmacology, neurobiology, regenerative medicine, and cognitive science. PMID:21179424

  1. Machine-Building for Fuel and Energy Complex: Perspective Forms of Interaction

    NASA Astrophysics Data System (ADS)

    Nikitenko, S. M.; Goosen, E. V.; Pakhomova, E. A.; Rozhkova, O. V.; Mesyats, M. A.

    2017-10-01

    The article is devoted to the study of the existing forms of cooperation between the authorities, business and science in the fuel and energy complex and the machine-building industry at the regional level. The possibilities of applying the concept of the “triple helix” and its multi-helix modifications for the implementation of the import substitution program for high- tech products have been considered.

  2. Computational Nanotechnology Molecular Electronics, Materials and Machines

    NASA Technical Reports Server (NTRS)

    Srivastava, Deepak; Biegel, Bryan A. (Technical Monitor)

    2002-01-01

    This presentation covers research being performed on computational nanotechnology, carbon nanotubes and fullerenes at the NASA Ames Research Center. Topics cover include: nanomechanics of nanomaterials, nanotubes and composite materials, molecular electronics with nanotube junctions, kinky chemistry, and nanotechnology for solid-state quantum computers using fullerenes.

  3. The Molecular Industrial Revolution: Automated Synthesis of Small Molecules.

    PubMed

    Trobe, Melanie; Burke, Martin D

    2018-04-09

    Today we are poised for a transition from the highly customized crafting of specific molecular targets by hand to the increasingly general and automated assembly of different types of molecules with the push of a button. Creating machines that are capable of making many different types of small molecules on demand, akin to that which has been achieved on the macroscale with 3D printers, is challenging. Yet important progress is being made toward this objective with two complementary approaches: 1) Automation of customized synthesis routes to different targets by machines that enable the use of many reactions and starting materials, and 2) automation of generalized platforms that make many different targets using common coupling chemistry and building blocks. Continued progress in these directions has the potential to shift the bottleneck in molecular innovation from synthesis to imagination, and thereby help drive a new industrial revolution on the molecular scale. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  4. Machine learning of accurate energy-conserving molecular force fields.

    PubMed

    Chmiela, Stefan; Tkatchenko, Alexandre; Sauceda, Huziel E; Poltavsky, Igor; Schütt, Kristof T; Müller, Klaus-Robert

    2017-05-01

    Using conservation of energy-a fundamental property of closed classical and quantum mechanical systems-we develop an efficient gradient-domain machine learning (GDML) approach to construct accurate molecular force fields using a restricted number of samples from ab initio molecular dynamics (AIMD) trajectories. The GDML implementation is able to reproduce global potential energy surfaces of intermediate-sized molecules with an accuracy of 0.3 kcal mol -1 for energies and 1 kcal mol -1 Å̊ -1 for atomic forces using only 1000 conformational geometries for training. We demonstrate this accuracy for AIMD trajectories of molecules, including benzene, toluene, naphthalene, ethanol, uracil, and aspirin. The challenge of constructing conservative force fields is accomplished in our work by learning in a Hilbert space of vector-valued functions that obey the law of energy conservation. The GDML approach enables quantitative molecular dynamics simulations for molecules at a fraction of cost of explicit AIMD calculations, thereby allowing the construction of efficient force fields with the accuracy and transferability of high-level ab initio methods.

  5. Communication: Understanding molecular representations in machine learning: The role of uniqueness and target similarity

    NASA Astrophysics Data System (ADS)

    Huang, Bing; von Lilienfeld, O. Anatole

    2016-10-01

    The predictive accuracy of Machine Learning (ML) models of molecular properties depends on the choice of the molecular representation. Inspired by the postulates of quantum mechanics, we introduce a hierarchy of representations which meet uniqueness and target similarity criteria. To systematically control target similarity, we simply rely on interatomic many body expansions, as implemented in universal force-fields, including Bonding, Angular (BA), and higher order terms. Addition of higher order contributions systematically increases similarity to the true potential energy and predictive accuracy of the resulting ML models. We report numerical evidence for the performance of BAML models trained on molecular properties pre-calculated at electron-correlated and density functional theory level of theory for thousands of small organic molecules. Properties studied include enthalpies and free energies of atomization, heat capacity, zero-point vibrational energies, dipole-moment, polarizability, HOMO/LUMO energies and gap, ionization potential, electron affinity, and electronic excitations. After training, BAML predicts energies or electronic properties of out-of-sample molecules with unprecedented accuracy and speed.

  6. Machine learning of accurate energy-conserving molecular force fields

    PubMed Central

    Chmiela, Stefan; Tkatchenko, Alexandre; Sauceda, Huziel E.; Poltavsky, Igor; Schütt, Kristof T.; Müller, Klaus-Robert

    2017-01-01

    Using conservation of energy—a fundamental property of closed classical and quantum mechanical systems—we develop an efficient gradient-domain machine learning (GDML) approach to construct accurate molecular force fields using a restricted number of samples from ab initio molecular dynamics (AIMD) trajectories. The GDML implementation is able to reproduce global potential energy surfaces of intermediate-sized molecules with an accuracy of 0.3 kcal mol−1 for energies and 1 kcal mol−1 Å̊−1 for atomic forces using only 1000 conformational geometries for training. We demonstrate this accuracy for AIMD trajectories of molecules, including benzene, toluene, naphthalene, ethanol, uracil, and aspirin. The challenge of constructing conservative force fields is accomplished in our work by learning in a Hilbert space of vector-valued functions that obey the law of energy conservation. The GDML approach enables quantitative molecular dynamics simulations for molecules at a fraction of cost of explicit AIMD calculations, thereby allowing the construction of efficient force fields with the accuracy and transferability of high-level ab initio methods. PMID:28508076

  7. Role of the α clamp in the protein translocation mechanism of anthrax toxin

    PubMed Central

    Brown, Michael J.; Thoren, Katie L.; Krantz, Bryan A.

    2015-01-01

    Membrane-embedded molecular machines are utilized to move water-soluble proteins across these barriers. Anthrax toxin forms one such machine through the self-assembly of its three component proteins—protective antigen (PA), lethal factor (LF), and edema factor (EF). Upon endocytosis into host cells, acidification of the endosome induces PA to form a membrane-inserted channel, which unfolds LF and EF and translocates them into the host cytosol. Translocation is driven by the proton motive force, comprised of the chemical potential, the proton-gradient (ΔpH), and the membrane potential (ΔΨ). A crystal structure of the lethal toxin core complex revealed an “α clamp” structure that binds to substrate helices nonspecifically. Here we test the hypothesis that through the recognition of unfolding helical structure the α clamp can accelerate the rate of translocation. We produced a synthetic PA mutant in which an α helix was crosslinked into the α clamp to block its function. This synthetic construct impairs translocation by raising a yet uncharacterized translocation barrier shown to be much less force dependent than the known unfolding barrier. We also report that the α clamp more stably binds substrates that can form helices than those, such as polyproline, that cannot. Hence the α clamp recognizes substrates by a general shape-complementarity mechanism. Substrates that are incapable of forming compact secondary structure (due to the introduction of a polyproline track) are severely deficient for translocation. Therefore, the α clamp and its recognition of helical structure in the translocating substrate play key roles in the molecular mechanism of protein translocation. PMID:26344833

  8. Machine Learning Prediction of the Energy Gap of Graphene Nanoflakes Using Topological Autocorrelation Vectors.

    PubMed

    Fernandez, Michael; Abreu, Jose I; Shi, Hongqing; Barnard, Amanda S

    2016-11-14

    The possibility of band gap engineering in graphene opens countless new opportunities for application in nanoelectronics. In this work, the energy gaps of 622 computationally optimized graphene nanoflakes were mapped to topological autocorrelation vectors using machine learning techniques. Machine learning modeling revealed that the most relevant correlations appear at topological distances in the range of 1 to 42 with prediction accuracy higher than 80%. The data-driven model can statistically discriminate between graphene nanoflakes with different energy gaps on the basis of their molecular topology.

  9. Hybrid organic-inorganic rotaxanes and molecular shuttles.

    PubMed

    Lee, Chin-Fa; Leigh, David A; Pritchard, Robin G; Schultz, David; Teat, Simon J; Timco, Grigore A; Winpenny, Richard E P

    2009-03-19

    The tetravalency of carbon and its ability to form covalent bonds with itself and other elements enables large organic molecules with complex structures, functions and dynamics to be constructed. The varied electronic configurations and bonding patterns of inorganic elements, on the other hand, can impart diverse electronic, magnetic, catalytic and other useful properties to molecular-level structures. Some hybrid organic-inorganic materials that combine features of both chemistries have been developed, most notably metal-organic frameworks, dense and extended organic-inorganic frameworks and coordination polymers. Metal ions have also been incorporated into molecules that contain interlocked subunits, such as rotaxanes and catenanes, and structures in which many inorganic clusters encircle polymer chains have been described. Here we report the synthesis of a series of discrete rotaxane molecules in which inorganic and organic structural units are linked together mechanically at the molecular level. Structural units (dialkyammonium groups) in dumb-bell-shaped organic molecules template the assembly of essentially inorganic 'rings' about 'axles' to form rotaxanes consisting of various numbers of rings and axles. One of the rotaxanes behaves as a 'molecular shuttle': the ring moves between two binding sites on the axle in a large-amplitude motion typical of some synthetic molecular machine systems. The architecture of the rotaxanes ensures that the electronic, magnetic and paramagnetic characteristics of the inorganic rings-properties that could make them suitable as qubits for quantum computers-can influence, and potentially be influenced by, the organic portion of the molecule.

  10. Redox control of molecular motion in switchable artificial nanoscale devices.

    PubMed

    Credi, Alberto; Semeraro, Monica; Silvi, Serena; Venturi, Margherita

    2011-03-15

    The design, synthesis, and operation of molecular-scale systems that exhibit controllable motions of their component parts is a topic of great interest in nanoscience and a fascinating challenge of nanotechnology. The development of this kind of species constitutes the premise to the construction of molecular machines and motors, which in a not-too-distant future could find applications in fields such as materials science, information technology, energy conversion, diagnostics, and medicine. In the past 25 years the development of supramolecular chemistry has enabled the construction of an interesting variety of artificial molecular machines. These devices operate via electronic and molecular rearrangements and, like the macroscopic counterparts, they need energy to work as well as signals to communicate with the operator. Here we outline the design principles at the basis of redox switching of molecular motion in artificial nanodevices. Redox processes, chemically, electrically, or photochemically induced, can indeed supply the energy to bring about molecular motions. Moreover, in the case of electrically and photochemically induced processes, electrochemical and photochemical techniques can be used to read the state of the system, and thus to control and monitor the operation of the device. Some selected examples are also reported to describe the most representative achievements in this research area.

  11. NanoDesign: Concepts and Software for a Nanotechnology Based on Functionalized Fullerenes

    NASA Technical Reports Server (NTRS)

    Globus, Al; Jaffe, Richard; Chancellor, Marisa K. (Technical Monitor)

    1996-01-01

    Eric Drexler has proposed a hypothetical nanotechnology based on diamond and investigated the properties of such molecular systems. While attractive, diamonoid nanotechnology is not physically accessible with straightforward extensions of current laboratory techniques. We propose a nanotechnology based on functionalized fullerenes and investigate carbon nanotube based gears with teeth added via a benzyne reaction known to occur with C60. The gears are single-walled carbon nanotubes with appended coenzyme groups for teeth. Fullerenes are in widespread laboratory use and can be functionalized in many ways. Companion papers computationally demonstrate the properties of these gears (they appear to work) and the accessibility of the benzyne/nanotube reaction. This paper describes the molecular design techniques and rationale as well as the software that implements these design techniques. The software is a set of persistent C++ objects controlled by TCL command scripts. The c++/tcl interface is automatically generated by a software system called tcl_c++ developed by the author and described here. The objects keep track of different portions of the molecular machinery to allow different simulation techniques and boundary conditions to be applied as appropriate. This capability has been required to demonstrate (computationally) our gear's feasibility. A new distributed software architecture featuring a WWW universal client, CORBA distributed objects, and agent software is under consideration. The software architecture is intended to eventually enable a widely disbursed group to develop complex simulated molecular machines.

  12. Single Nucleobase Identification Using Biophysical Signatures from Nanoelectronic Quantum Tunneling.

    PubMed

    Korshoj, Lee E; Afsari, Sepideh; Khan, Sajida; Chatterjee, Anushree; Nagpal, Prashant

    2017-03-01

    Nanoelectronic DNA sequencing can provide an important alternative to sequencing-by-synthesis by reducing sample preparation time, cost, and complexity as a high-throughput next-generation technique with accurate single-molecule identification. However, sample noise and signature overlap continue to prevent high-resolution and accurate sequencing results. Probing the molecular orbitals of chemically distinct DNA nucleobases offers a path for facile sequence identification, but molecular entropy (from nucleotide conformations) makes such identification difficult when relying only on the energies of lowest-unoccupied and highest-occupied molecular orbitals (LUMO and HOMO). Here, nine biophysical parameters are developed to better characterize molecular orbitals of individual nucleobases, intended for single-molecule DNA sequencing using quantum tunneling of charges. For this analysis, theoretical models for quantum tunneling are combined with transition voltage spectroscopy to obtain measurable parameters unique to the molecule within an electronic junction. Scanning tunneling spectroscopy is then used to measure these nine biophysical parameters for DNA nucleotides, and a modified machine learning algorithm identified nucleobases. The new parameters significantly improve base calling over merely using LUMO and HOMO frontier orbital energies. Furthermore, high accuracies for identifying DNA nucleobases were observed at different pH conditions. These results have significant implications for developing a robust and accurate high-throughput nanoelectronic DNA sequencing technique. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  13. Development of a machine learning potential for graphene

    NASA Astrophysics Data System (ADS)

    Rowe, Patrick; Csányi, Gábor; Alfè, Dario; Michaelides, Angelos

    2018-02-01

    We present an accurate interatomic potential for graphene, constructed using the Gaussian approximation potential (GAP) machine learning methodology. This GAP model obtains a faithful representation of a density functional theory (DFT) potential energy surface, facilitating highly accurate (approaching the accuracy of ab initio methods) molecular dynamics simulations. This is achieved at a computational cost which is orders of magnitude lower than that of comparable calculations which directly invoke electronic structure methods. We evaluate the accuracy of our machine learning model alongside that of a number of popular empirical and bond-order potentials, using both experimental and ab initio data as references. We find that whilst significant discrepancies exist between the empirical interatomic potentials and the reference data—and amongst the empirical potentials themselves—the machine learning model introduced here provides exemplary performance in all of the tested areas. The calculated properties include: graphene phonon dispersion curves at 0 K (which we predict with sub-meV accuracy), phonon spectra at finite temperature, in-plane thermal expansion up to 2500 K as compared to NPT ab initio molecular dynamics simulations and a comparison of the thermally induced dispersion of graphene Raman bands to experimental observations. We have made our potential freely available online at [http://www.libatoms.org].

  14. Method for forming precision clockplate with pivot pins

    DOEpatents

    Wild, Ronald L [Albuquerque, NM

    2010-06-01

    Methods are disclosed for producing a precision clockplate with rotational bearing surfaces (e.g. pivot pins). The methods comprise providing an electrically conductive blank, conventionally machining oversize features comprising bearing surfaces into the blank, optionally machining of a relief on non-bearing surfaces, providing wire accesses adjacent to bearing surfaces, threading the wire of an electrical discharge machine through the accesses and finishing the bearing surfaces by wire electrical discharge machining. The methods have been shown to produce bearing surfaces of comparable dimension and tolerances as those produced by micro-machining methods such as LIGA, at reduced cost and complexity.

  15. Multiprocessor Z-Buffer Architecture for High-Speed, High Complexity Computer Image Generation.

    DTIC Science & Technology

    1983-12-01

    Oversampling 50 17. "Poking Through" Effects 51 18. Sampling Paths 52 19. Triangle Variables 54 20. Intelligent Tiling Algorithm 61 21. Tiler Functional Blocks...64 * 22. HSD Interface 65 23. Tiling Machine Setup 67 24. Tiling Machine 68 25. Tile Accumulate 69 26. A lx$ Sorting Machine 77 27. A 2x8 Sorting...Delay 227 87. Effect of Triangle Size on Tiler Throughput Rates 229 88. Tiling Machine Setup Stage Performance for Oversample Mode 234 89. Tiling

  16. Information, knowledge and the future of machines.

    PubMed

    MacFarlane, Alistair G J

    2003-08-15

    This wide-ranging survey considers the future of machines in terms of information, complexity and the growth of knowledge shared amongst agents. Mechanical and human agents are compared and contrasted, and it is argued that, for the foreseeable future, their roles will be complementary. The future development of machines is examined in terms of unions of human and machine agency evolving as part of economic activity. Limits to, and threats posed by, the continuing evolution of such a society of agency are considered.

  17. Clinical quality needs complex adaptive systems and machine learning.

    PubMed

    Marsland, Stephen; Buchan, Iain

    2004-01-01

    The vast increase in clinical data has the potential to bring about large improvements in clinical quality and other aspects of healthcare delivery. However, such benefits do not come without cost. The analysis of such large datasets, particularly where the data may have to be merged from several sources and may be noisy and incomplete, is a challenging task. Furthermore, the introduction of clinical changes is a cyclical task, meaning that the processes under examination operate in an environment that is not static. We suggest that traditional methods of analysis are unsuitable for the task, and identify complexity theory and machine learning as areas that have the potential to facilitate the examination of clinical quality. By its nature the field of complex adaptive systems deals with environments that change because of the interactions that have occurred in the past. We draw parallels between health informatics and bioinformatics, which has already started to successfully use machine learning methods.

  18. Programming and machining of complex parts based on CATIA solid modeling

    NASA Astrophysics Data System (ADS)

    Zhu, Xiurong

    2017-09-01

    The complex parts of the use of CATIA solid modeling programming and simulation processing design, elaborated in the field of CNC machining, programming and the importance of processing technology. In parts of the design process, first make a deep analysis on the principle, and then the size of the design, the size of each chain, connected to each other. After the use of backstepping and a variety of methods to calculate the final size of the parts. In the selection of parts materials, careful study, repeated testing, the final choice of 6061 aluminum alloy. According to the actual situation of the processing site, it is necessary to make a comprehensive consideration of various factors in the machining process. The simulation process should be based on the actual processing, not only pay attention to shape. It can be used as reference for machining.

  19. Machine learning applications in proteomics research: how the past can boost the future.

    PubMed

    Kelchtermans, Pieter; Bittremieux, Wout; De Grave, Kurt; Degroeve, Sven; Ramon, Jan; Laukens, Kris; Valkenborg, Dirk; Barsnes, Harald; Martens, Lennart

    2014-03-01

    Machine learning is a subdiscipline within artificial intelligence that focuses on algorithms that allow computers to learn solving a (complex) problem from existing data. This ability can be used to generate a solution to a particularly intractable problem, given that enough data are available to train and subsequently evaluate an algorithm on. Since MS-based proteomics has no shortage of complex problems, and since publicly available data are becoming available in ever growing amounts, machine learning is fast becoming a very popular tool in the field. We here therefore present an overview of the different applications of machine learning in proteomics that together cover nearly the entire wet- and dry-lab workflow, and that address key bottlenecks in experiment planning and design, as well as in data processing and analysis. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  20. Telepresence and telerobotics

    NASA Technical Reports Server (NTRS)

    Garin, John; Matteo, Joseph; Jennings, Von Ayre

    1988-01-01

    The capability for a single operator to simultaneously control complex remote multi degree of freedom robotic arms and associated dextrous end effectors is being developed. An optimal solution within the realm of current technology, can be achieved by recognizing that: (1) machines/computer systems are more effective than humans when the task is routine and specified, and (2) humans process complex data sets and deal with the unpredictable better than machines. These observations lead naturally to a philosophy in which the human's role becomes a higher level function associated with planning, teaching, initiating, monitoring, and intervening when the machine gets into trouble, while the machine performs the codifiable tasks with deliberate efficiency. This concept forms the basis for the integration of man and telerobotics, i.e., robotics with the operator in the control loop. The concept of integration of the human in the loop and maximizing the feed-forward and feed-back data flow is referred to as telepresence.

  1. Residual Error Based Anomaly Detection Using Auto-Encoder in SMD Machine Sound.

    PubMed

    Oh, Dong Yul; Yun, Il Dong

    2018-04-24

    Detecting an anomaly or an abnormal situation from given noise is highly useful in an environment where constantly verifying and monitoring a machine is required. As deep learning algorithms are further developed, current studies have focused on this problem. However, there are too many variables to define anomalies, and the human annotation for a large collection of abnormal data labeled at the class-level is very labor-intensive. In this paper, we propose to detect abnormal operation sounds or outliers in a very complex machine along with reducing the data-driven annotation cost. The architecture of the proposed model is based on an auto-encoder, and it uses the residual error, which stands for its reconstruction quality, to identify the anomaly. We assess our model using Surface-Mounted Device (SMD) machine sound, which is very complex, as experimental data, and state-of-the-art performance is successfully achieved for anomaly detection.

  2. Ultra-Compact Transputer-Based Controller for High-Level, Multi-Axis Coordination

    NASA Technical Reports Server (NTRS)

    Zenowich, Brian; Crowell, Adam; Townsend, William T.

    2013-01-01

    The design of machines that rely on arrays of servomotors such as robotic arms, orbital platforms, and combinations of both, imposes a heavy computational burden to coordinate their actions to perform coherent tasks. For example, the robotic equivalent of a person tracing a straight line in space requires enormously complex kinematics calculations, and complexity increases with the number of servo nodes. A new high-level architecture for coordinated servo-machine control enables a practical, distributed transputer alternative to conventional central processor electronics. The solution is inherently scalable, dramatically reduces bulkiness and number of conductor runs throughout the machine, requires only a fraction of the power, and is designed for cooling in a vacuum.

  3. The Mathematical Modeling and Computer Simulation of Electrochemical Micromachining Using Ultrashort Pulses

    NASA Astrophysics Data System (ADS)

    Kozak, J.; Gulbinowicz, D.; Gulbinowicz, Z.

    2009-05-01

    The need for complex and accurate three dimensional (3-D) microcomponents is increasing rapidly for many industrial and consumer products. Electrochemical machining process (ECM) has the potential of generating desired crack-free and stress-free surfaces of microcomponents. This paper reports a study of pulse electrochemical micromachining (PECMM) using ultrashort (nanoseconds) pulses for generating complex 3-D microstructures of high accuracy. A mathematical model of the microshaping process with taking into consideration unsteady phenomena in electrical double layer has been developed. The software for computer simulation of PECM has been developed and the effects of machining parameters on anodic localization and final shape of machined surface are presented.

  4. Time-course, negative-stain electron microscopy-based analysis for investigating protein-protein interactions at the single-molecule level.

    PubMed

    Nogal, Bartek; Bowman, Charles A; Ward, Andrew B

    2017-11-24

    Several biophysical approaches are available to study protein-protein interactions. Most approaches are conducted in bulk solution, and are therefore limited to an average measurement of the ensemble of molecular interactions. Here, we show how single-particle EM can enrich our understanding of protein-protein interactions at the single-molecule level and potentially capture states that are unobservable with ensemble methods because they are below the limit of detection or not conducted on an appropriate time scale. Using the HIV-1 envelope glycoprotein (Env) and its interaction with receptor CD4-binding site neutralizing antibodies as a model system, we both corroborate ensemble kinetics-derived parameters and demonstrate how time-course EM can further dissect stoichiometric states of complexes that are not readily observable with other methods. Visualization of the kinetics and stoichiometry of Env-antibody complexes demonstrated the applicability of our approach to qualitatively and semi-quantitatively differentiate two highly similar neutralizing antibodies. Furthermore, implementation of machine-learning techniques for sorting class averages of these complexes into discrete subclasses of particles helped reduce human bias. Our data provide proof of concept that single-particle EM can be used to generate a "visual" kinetic profile that should be amenable to studying many other protein-protein interactions, is relatively simple and complementary to well-established biophysical approaches. Moreover, our method provides critical insights into broadly neutralizing antibody recognition of Env, which may inform vaccine immunogen design and immunotherapeutic development. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  5. Synaptic organic transistors with a vacuum-deposited charge-trapping nanosheet

    PubMed Central

    Kim, Chang-Hyun; Sung, Sujin; Yoon, Myung-Han

    2016-01-01

    Organic neuromorphic devices hold great promise for unconventional signal processing and efficient human-machine interfaces. Herein, we propose novel synaptic organic transistors devised to overcome the traditional trade-off between channel conductance and memory performance. A vacuum-processed, nanoscale metallic interlayer provides an ultra-flat surface for a high-mobility molecular film as well as a desirable degree of charge trapping, allowing for low-temperature fabrication of uniform device arrays on plastic. The device architecture is implemented by widely available electronic materials in combination with conventional deposition methods. Therefore, our results are expected to generate broader interests in incorporation of organic electronics into large-area neuromorphic systems, with potential in gate-addressable complex logic circuits and transparent multifunctional interfaces receiving direct optical and cellular stimulation. PMID:27645425

  6. The evolutionary time machine: forecasting how populations can adapt to changing environments using dormant propagules

    PubMed Central

    Orsini, Luisa; Schwenk, Klaus; De Meester, Luc; Colbourne, John K.; Pfrender, Michael E.; Weider, Lawrence J.

    2013-01-01

    Evolutionary changes are determined by a complex assortment of ecological, demographic and adaptive histories. Predicting how evolution will shape the genetic structures of populations coping with current (and future) environmental challenges has principally relied on investigations through space, in lieu of time, because long-term phenotypic and molecular data are scarce. Yet, dormant propagules in sediments, soils and permafrost are convenient natural archives of population-histories from which to trace adaptive trajectories along extended time periods. DNA sequence data obtained from these natural archives, combined with pioneering methods for analyzing both ecological and population genomic time-series data, are likely to provide predictive models to forecast evolutionary responses of natural populations to environmental changes resulting from natural and anthropogenic stressors, including climate change. PMID:23395434

  7. Exploiting CRISPR/Cas: Interference Mechanisms and Applications

    PubMed Central

    Richter, Hagen; Randau, Lennart; Plagens, André

    2013-01-01

    The discovery of biological concepts can often provide a framework for the development of novel molecular tools, which can help us to further understand and manipulate life. One recent example is the elucidation of the prokaryotic adaptive immune system, clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas) that protects bacteria and archaea against viruses or conjugative plasmids. The immunity is based on small RNA molecules that are incorporated into versatile multi-domain proteins or protein complexes and specifically target viral nucleic acids via base complementarity. CRISPR/Cas interference machines are utilized to develop novel genome editing tools for different organisms. Here, we will review the latest progress in the elucidation and application of prokaryotic CRISPR/Cas systems and discuss possible future approaches to exploit the potential of these interference machineries. PMID:23857052

  8. Exploiting CRISPR/Cas: interference mechanisms and applications.

    PubMed

    Richter, Hagen; Randau, Lennart; Plagens, André

    2013-07-12

    The discovery of biological concepts can often provide a framework for the development of novel molecular tools, which can help us to further understand and manipulate life. One recent example is the elucidation of the prokaryotic adaptive immune system, clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas) that protects bacteria and archaea against viruses or conjugative plasmids. The immunity is based on small RNA molecules that are incorporated into versatile multi-domain proteins or protein complexes and specifically target viral nucleic acids via base complementarity. CRISPR/Cas interference machines are utilized to develop novel genome editing tools for different organisms. Here, we will review the latest progress in the elucidation and application of prokaryotic CRISPR/Cas systems and discuss possible future approaches to exploit the potential of these interference machineries.

  9. Mathematical concepts for modeling human behavior in complex man-machine systems

    NASA Technical Reports Server (NTRS)

    Johannsen, G.; Rouse, W. B.

    1979-01-01

    Many human behavior (e.g., manual control) models have been found to be inadequate for describing processes in certain real complex man-machine systems. An attempt is made to find a way to overcome this problem by examining the range of applicability of existing mathematical models with respect to the hierarchy of human activities in real complex tasks. Automobile driving is chosen as a baseline scenario, and a hierarchy of human activities is derived by analyzing this task in general terms. A structural description leads to a block diagram and a time-sharing computer analogy.

  10. [Design of Complex Cavity Structure in Air Route System of Automated Peritoneal Dialysis Machine].

    PubMed

    Quan, Xiaoliang

    2017-07-30

    This paper introduced problems about Automated Peritoneal Dialysis machine(APD) that the lack of technical issues such as the structural design of the complex cavities. To study the flow characteristics of this special structure, the application of ANSYS CFX software is used with k-ε turbulence model as the theoretical basis of fluid mechanics. The numerical simulation of flow field simulation result in the internal model can be gotten after the complex structure model is imported into ANSYS CFX module. Then, it will present the distribution of complex cavities inside the flow field and the flow characteristics parameter, which will provide an important reference design for APD design.

  11. The Impact of Protein Structure and Sequence Similarity on the Accuracy of Machine-Learning Scoring Functions for Binding Affinity Prediction

    PubMed Central

    Peng, Jiangjun; Leung, Yee; Leung, Kwong-Sak; Wong, Man-Hon; Lu, Gang; Ballester, Pedro J.

    2018-01-01

    It has recently been claimed that the outstanding performance of machine-learning scoring functions (SFs) is exclusively due to the presence of training complexes with highly similar proteins to those in the test set. Here, we revisit this question using 24 similarity-based training sets, a widely used test set, and four SFs. Three of these SFs employ machine learning instead of the classical linear regression approach of the fourth SF (X-Score which has the best test set performance out of 16 classical SFs). We have found that random forest (RF)-based RF-Score-v3 outperforms X-Score even when 68% of the most similar proteins are removed from the training set. In addition, unlike X-Score, RF-Score-v3 is able to keep learning with an increasing training set size, becoming substantially more predictive than X-Score when the full 1105 complexes are used for training. These results show that machine-learning SFs owe a substantial part of their performance to training on complexes with dissimilar proteins to those in the test set, against what has been previously concluded using the same data. Given that a growing amount of structural and interaction data will be available from academic and industrial sources, this performance gap between machine-learning SFs and classical SFs is expected to enlarge in the future. PMID:29538331

  12. The Impact of Protein Structure and Sequence Similarity on the Accuracy of Machine-Learning Scoring Functions for Binding Affinity Prediction.

    PubMed

    Li, Hongjian; Peng, Jiangjun; Leung, Yee; Leung, Kwong-Sak; Wong, Man-Hon; Lu, Gang; Ballester, Pedro J

    2018-03-14

    It has recently been claimed that the outstanding performance of machine-learning scoring functions (SFs) is exclusively due to the presence of training complexes with highly similar proteins to those in the test set. Here, we revisit this question using 24 similarity-based training sets, a widely used test set, and four SFs. Three of these SFs employ machine learning instead of the classical linear regression approach of the fourth SF (X-Score which has the best test set performance out of 16 classical SFs). We have found that random forest (RF)-based RF-Score-v3 outperforms X-Score even when 68% of the most similar proteins are removed from the training set. In addition, unlike X-Score, RF-Score-v3 is able to keep learning with an increasing training set size, becoming substantially more predictive than X-Score when the full 1105 complexes are used for training. These results show that machine-learning SFs owe a substantial part of their performance to training on complexes with dissimilar proteins to those in the test set, against what has been previously concluded using the same data. Given that a growing amount of structural and interaction data will be available from academic and industrial sources, this performance gap between machine-learning SFs and classical SFs is expected to enlarge in the future.

  13. Study of Effect of Impacting Direction on Abrasive Nanometric Cutting Process with Molecular Dynamics

    NASA Astrophysics Data System (ADS)

    Li, Junye; Meng, Wenqing; Dong, Kun; Zhang, Xinming; Zhao, Weihong

    2018-01-01

    Abrasive flow polishing plays an important part in modern ultra-precision machining. Ultrafine particles suspended in the medium of abrasive flow removes the material in nanoscale. In this paper, three-dimensional molecular dynamics (MD) simulations are performed to investigate the effect of impacting direction on abrasive cutting process during abrasive flow polishing. The molecular dynamics simulation software Lammps was used to simulate the cutting of single crystal copper with SiC abrasive grains at different cutting angles (0o-45o). At a constant friction coefficient, we found a direct relation between cutting angle and cutting force, which ultimately increases the number of dislocation during abrasive flow machining. Our theoretical study reveal that a small cutting angle is beneficial for improving surface quality and reducing internal defects in the workpiece. However, there is no obvious relationship between cutting angle and friction coefficient.

  14. Study of Effect of Impacting Direction on Abrasive Nanometric Cutting Process with Molecular Dynamics.

    PubMed

    Li, Junye; Meng, Wenqing; Dong, Kun; Zhang, Xinming; Zhao, Weihong

    2018-01-11

    Abrasive flow polishing plays an important part in modern ultra-precision machining. Ultrafine particles suspended in the medium of abrasive flow removes the material in nanoscale. In this paper, three-dimensional molecular dynamics (MD) simulations are performed to investigate the effect of impacting direction on abrasive cutting process during abrasive flow polishing. The molecular dynamics simulation software Lammps was used to simulate the cutting of single crystal copper with SiC abrasive grains at different cutting angles (0 o -45 o ). At a constant friction coefficient, we found a direct relation between cutting angle and cutting force, which ultimately increases the number of dislocation during abrasive flow machining. Our theoretical study reveal that a small cutting angle is beneficial for improving surface quality and reducing internal defects in the workpiece. However, there is no obvious relationship between cutting angle and friction coefficient.

  15. State of the APC/C: Organization, function, and structure

    PubMed Central

    McLean, Janel R.; Chaix, Denis; Ohi, Melanie D.; Gould, Kathleen L.

    2016-01-01

    The ubiquitin-proteasome protein degradation system is involved in many essential cellular processes including cell cycle regulation, cell differentiation, and the unfolded protein response.The anaphase-promoting complex/cyclosome (APC/C), an evolutionary conserved E3 ubiquitin ligase, was discovered 15 years ago because of its pivotal role in cyclin degradation and mitotic progression. Since then, we have learned that the APC/C is a very large, complex E3 ligase composed of 13 subunits, yielding a molecular machine of approximately 1 MDa. The intricate regulation of the APC/C is mediated by the Cdc20 family of activators, pseudosubstrate inhibitors, protein kinases and phosphatases and the spindle assembly checkpoint. The large size, complexity, and dynamic nature of the APC/C represent significant obstacles toward high-resolution structural techniques; however, over the last decade, there have been a number of lower resolution APC/C structures determined using single particle electron microscopy. These structures, when combined with data generated from numerous genetic and biochemical studies, have begun to shed light on how APC/C activity is regulated. Here, we discuss the most recent developments in the APC/C field concerning structure, substrate recognition, and catalysis. PMID:21261459

  16. Molecular dynamic simulation for nanometric cutting of single-crystal face-centered cubic metals.

    PubMed

    Huang, Yanhua; Zong, Wenjun

    2014-01-01

    In this work, molecular dynamics simulations are performed to investigate the influence of material properties on the nanometric cutting of single crystal copper and aluminum with a diamond cutting tool. The atomic interactions in the two metallic materials are modeled by two sets of embedded atom method (EAM) potential parameters. Simulation results show that although the plastic deformation of the two materials is achieved by dislocation activities, the deformation behavior and related physical phenomena, such as the machining forces, machined surface quality, and chip morphology, are significantly different for different materials. Furthermore, the influence of material properties on the nanometric cutting has a strong dependence on the operating temperature.

  17. Amazing structure of respirasome: unveiling the secrets of cell respiration.

    PubMed

    Guo, Runyu; Gu, Jinke; Wu, Meng; Yang, Maojun

    2016-12-01

    Respirasome, a huge molecular machine that carries out cellular respiration, has gained growing attention since its discovery, because respiration is the most indispensable biological process in almost all living creatures. The concept of respirasome has renewed our understanding of the respiratory chain organization, and most recently, the structure of respirasome solved by Yang's group from Tsinghua University (Gu et al. Nature 237(7622):639-643, 2016) firstly presented the detailed interactions within this huge molecular machine, and provided important information for drug design and screening. However, the study of cellular respiration went through a long history. Here, we briefly showed the detoured history of respiratory chain investigation, and then described the amazing structure of respirasome.

  18. Plugging a Bipyridinium Axle into Multichromophoric Calix[6]arene Wheels Bearing Naphthyl Units at Different Rims.

    PubMed

    Orlandini, Guido; Ragazzon, Giulio; Zanichelli, Valeria; Degli Esposti, Lorenzo; Baroncini, Massimo; Silvi, Serena; Venturi, Margherita; Credi, Alberto; Secchi, Andrea; Arduini, Arturo

    2017-02-01

    Tris-( N -phenylureido)-calix[6]arene derivatives are heteroditopic non-symmetric molecular hosts that can form pseudorotaxane complexes with 4,4'-bipyridinium-type guests. Owing to the unique structural features and recognition properties of the calix[6]arene wheel, these systems are of interest for the design and synthesis of novel molecular devices and machines. We envisaged that the incorporation of photoactive units in the calixarene skeleton could lead to the development of systems the working modes of which can be governed and monitored by means of light-activated processes. Here, we report on the synthesis, structural characterization, and spectroscopic, photophysical, and electrochemical investigation of two calix[6]arene wheels decorated with three naphthyl groups anchored to either the upper or lower rim of the phenylureido calixarene platform. We found that the naphthyl units interact mutually and with the calixarene skeleton in a different fashion in the two compounds, which thus exhibit a markedly distinct photophysical behavior. For both hosts, the inclusion of a 4,4 ' -bipyridinium guest activates energy- and/or electron-transfer processes that lead to non-trivial luminescence changes.

  19. Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning Approach.

    PubMed

    Ramakrishnan, Raghunathan; Dral, Pavlo O; Rupp, Matthias; von Lilienfeld, O Anatole

    2015-05-12

    Chemically accurate and comprehensive studies of the virtual space of all possible molecules are severely limited by the computational cost of quantum chemistry. We introduce a composite strategy that adds machine learning corrections to computationally inexpensive approximate legacy quantum methods. After training, highly accurate predictions of enthalpies, free energies, entropies, and electron correlation energies are possible, for significantly larger molecular sets than used for training. For thermochemical properties of up to 16k isomers of C7H10O2 we present numerical evidence that chemical accuracy can be reached. We also predict electron correlation energy in post Hartree-Fock methods, at the computational cost of Hartree-Fock, and we establish a qualitative relationship between molecular entropy and electron correlation. The transferability of our approach is demonstrated, using semiempirical quantum chemistry and machine learning models trained on 1 and 10% of 134k organic molecules, to reproduce enthalpies of all remaining molecules at density functional theory level of accuracy.

  20. Machine learning of molecular electronic properties in chemical compound space

    NASA Astrophysics Data System (ADS)

    Montavon, Grégoire; Rupp, Matthias; Gobre, Vivekanand; Vazquez-Mayagoitia, Alvaro; Hansen, Katja; Tkatchenko, Alexandre; Müller, Klaus-Robert; Anatole von Lilienfeld, O.

    2013-09-01

    The combination of modern scientific computing with electronic structure theory can lead to an unprecedented amount of data amenable to intelligent data analysis for the identification of meaningful, novel and predictive structure-property relationships. Such relationships enable high-throughput screening for relevant properties in an exponentially growing pool of virtual compounds that are synthetically accessible. Here, we present a machine learning model, trained on a database of ab initio calculation results for thousands of organic molecules, that simultaneously predicts multiple electronic ground- and excited-state properties. The properties include atomization energy, polarizability, frontier orbital eigenvalues, ionization potential, electron affinity and excitation energies. The machine learning model is based on a deep multi-task artificial neural network, exploiting the underlying correlations between various molecular properties. The input is identical to ab initio methods, i.e. nuclear charges and Cartesian coordinates of all atoms. For small organic molecules, the accuracy of such a ‘quantum machine’ is similar, and sometimes superior, to modern quantum-chemical methods—at negligible computational cost.

  1. Virtual C Machine and Integrated Development Environment for ATMS Controllers.

    DOT National Transportation Integrated Search

    2000-04-01

    The overall objective of this project is to develop a prototype virtual machine that fits on current Advanced Traffic Management Systems (ATMS) controllers and provides functionality for complex traffic operations.;Prepared in cooperation with Utah S...

  2. Hybrid polylingual object model: an efficient and seamless integration of Java and native components on the Dalvik virtual machine.

    PubMed

    Huang, Yukun; Chen, Rong; Wei, Jingbo; Pei, Xilong; Cao, Jing; Prakash Jayaraman, Prem; Ranjan, Rajiv

    2014-01-01

    JNI in the Android platform is often observed with low efficiency and high coding complexity. Although many researchers have investigated the JNI mechanism, few of them solve the efficiency and the complexity problems of JNI in the Android platform simultaneously. In this paper, a hybrid polylingual object (HPO) model is proposed to allow a CAR object being accessed as a Java object and as vice in the Dalvik virtual machine. It is an acceptable substitute for JNI to reuse the CAR-compliant components in Android applications in a seamless and efficient way. The metadata injection mechanism is designed to support the automatic mapping and reflection between CAR objects and Java objects. A prototype virtual machine, called HPO-Dalvik, is implemented by extending the Dalvik virtual machine to support the HPO model. Lifespan management, garbage collection, and data type transformation of HPO objects are also handled in the HPO-Dalvik virtual machine automatically. The experimental result shows that the HPO model outweighs the standard JNI in lower overhead on native side, better executing performance with no JNI bridging code being demanded.

  3. [Fragments Woven into a Whole: A New Approach to the Diagnosis of Cancer through Mass Spectrometry and Machine-Learning].

    PubMed

    Takeda, Sen; Yoshimura, Kentaro; Tanabe, Kunio

    2015-09-01

    Conventionally, a definitive diagnosis of cancer is derived from histopathological diagnostics based on morphological criteria that are difficult to standardize on a quantifiable basis. On the other hand, while molecular tumor markers and blood biochemical profiles give quantitative values evaluated by objective criteria, these parameters are usually generated by deductive methods such as peak extraction. Therefore, some of the data that may contain useful information on specimens are discarded. To overcome the disadvantages of these methods, we have developed a new approach by employing both mass spectrometry and machine-learning for cancer diagnosis. Probe electrospray ionization (PESI) is a derivative of electrospray ionization that uses a fine acupuncture needle as a sample picker as well as an ion emitter for mass spectrometry. This method enables us to ionize very small tissue samples up to a few pico liters in the presence of physiological concentrations of inorganic salts, without the need for any sample pretreatment. Moreover, as this technique makes it possible to ionize all components with minimal suppression effects, we can retrieve much more molecular information from specimens. To make the most of data enriched with lipid compounds and substances with lower molecular weights such as carbohydrates, we employed machine-learning named the dual penalized logistic regression machine (dPLRM). This method is completely different from pattern-matching in that it discriminates categories by projecting the spectral data into a mathematical space with very high dimensions, where final judgment is made. We are now approaching the final clinical trial to validate the usefulness of our system.

  4. Statistical complex fatigue data for SAE 4340 steel and its use in design by reliability

    NASA Technical Reports Server (NTRS)

    Kececioglu, D.; Smith, J. L.

    1970-01-01

    A brief description of the complex fatigue machines used in the test program is presented. The data generated from these machines are given and discussed. Two methods of obtaining strength distributions from the data are also discussed. Then follows a discussion of the construction of statistical fatigue diagrams and their use in designing by reliability. Finally, some of the problems encountered in the test equipment and a corrective modification are presented.

  5. A novel integrated framework and improved methodology of computer-aided drug design.

    PubMed

    Chen, Calvin Yu-Chian

    2013-01-01

    Computer-aided drug design (CADD) is a critical initiating step of drug development, but a single model capable of covering all designing aspects remains to be elucidated. Hence, we developed a drug design modeling framework that integrates multiple approaches, including machine learning based quantitative structure-activity relationship (QSAR) analysis, 3D-QSAR, Bayesian network, pharmacophore modeling, and structure-based docking algorithm. Restrictions for each model were defined for improved individual and overall accuracy. An integration method was applied to join the results from each model to minimize bias and errors. In addition, the integrated model adopts both static and dynamic analysis to validate the intermolecular stabilities of the receptor-ligand conformation. The proposed protocol was applied to identifying HER2 inhibitors from traditional Chinese medicine (TCM) as an example for validating our new protocol. Eight potent leads were identified from six TCM sources. A joint validation system comprised of comparative molecular field analysis, comparative molecular similarity indices analysis, and molecular dynamics simulation further characterized the candidates into three potential binding conformations and validated the binding stability of each protein-ligand complex. The ligand pathway was also performed to predict the ligand "in" and "exit" from the binding site. In summary, we propose a novel systematic CADD methodology for the identification, analysis, and characterization of drug-like candidates.

  6. Unveiling the mechanism of the promising two-dimensional photoswitch - Hemithioindigo

    NASA Astrophysics Data System (ADS)

    Li, Donglin; Yang, Yonggang; Li, Chaozheng; Liu, Yufang

    2018-07-01

    The control of internal molecular motions by outside stimuli is a decisive task in the construction of functional molecules and molecular machines. Light-induced intramolecular rotations of photoswitches have attracted increasing research interests because of the high stability and high reversibility of photoswitches. Recently, Henry et al. reported an unprecedented two-dimensional controlled photoswitch, the hemithioindigo (HTI) derivative Z1, whose single bond rotation in dimethyl sulphoxide (DMSO) solvent and double bond rotation in cyclohexane solvent can be induced by visible light (J. Am. Chem. Soc. 2016, 138, 12,219). Here we investigate the intramolecular rotations of the HTI and Z1 in different polar solvents by time-dependent density functional theory (TDDFT) and Nonadiabatic dynamic simulations. Due to the steric hindrance between methyl and thioindigo fragment, the rotations of Z1 in the excited state are obstructed. Interestingly, the HTI exhibits two distinct rotation paths in DMSO and cyclohexane solvents at about 50 fs. The intermolecular hydrogen bonds between HTI and DMSO play an important role in the rotation of HTI in DMSO solvent. Therefore, the HTI is a more promising two-dimensional photoswitch compared with the Z1. Our finding is thus of fundamental importance to understand the mechanisms of this class of photoswitches and design complex molecular behavior.

  7. Quantum dynamics of light-driven chiral molecular motors.

    PubMed

    Yamaki, Masahiro; Nakayama, Shin-ichiro; Hoki, Kunihito; Kono, Hirohiko; Fujimura, Yuichi

    2009-03-21

    The results of theoretical studies on quantum dynamics of light-driven molecular motors with internal rotation are presented. Characteristic features of chiral motors driven by a non-helical, linearly polarized electric field of light are explained on the basis of symmetry argument. The rotational potential of the chiral motor is characterized by a ratchet form. The asymmetric potential determines the directional motion: the rotational direction is toward the gentle slope of the asymmetric potential. This direction is called the intuitive direction. To confirm the unidirectional rotational motion, results of quantum dynamical calculations of randomly-oriented molecular motors are presented. A theoretical design of the smallest light-driven molecular machine is presented. The smallest chiral molecular machine has an optically driven engine and a running propeller on its body. The mechanisms of transmission of driving forces from the engine to the propeller are elucidated by using a quantum dynamical treatment. The results provide a principle for control of optically-driven molecular bevel gears. Temperature effects are discussed using the density operator formalism. An effective method for ultrafast control of rotational motions in any desired direction is presented with the help of a quantum control theory. In this method, visible or UV light pulses are applied to drive the motor via an electronic excited state. A method for driving a large molecular motor consisting of an aromatic hydrocarbon is presented. The molecular motor is operated by interactions between the induced dipole of the molecular motor and the electric field of light pulses.

  8. Complex Networks Analysis of Manual and Machine Translations

    NASA Astrophysics Data System (ADS)

    Amancio, Diego R.; Antiqueira, Lucas; Pardo, Thiago A. S.; da F. Costa, Luciano; Oliveira, Osvaldo N.; Nunes, Maria G. V.

    Complex networks have been increasingly used in text analysis, including in connection with natural language processing tools, as important text features appear to be captured by the topology and dynamics of the networks. Following previous works that apply complex networks concepts to text quality measurement, summary evaluation, and author characterization, we now focus on machine translation (MT). In this paper we assess the possible representation of texts as complex networks to evaluate cross-linguistic issues inherent in manual and machine translation. We show that different quality translations generated by MT tools can be distinguished from their manual counterparts by means of metrics such as in- (ID) and out-degrees (OD), clustering coefficient (CC), and shortest paths (SP). For instance, we demonstrate that the average OD in networks of automatic translations consistently exceeds the values obtained for manual ones, and that the CC values of source texts are not preserved for manual translations, but are for good automatic translations. This probably reflects the text rearrangements humans perform during manual translation. We envisage that such findings could lead to better MT tools and automatic evaluation metrics.

  9. Next-Generation Machine Learning for Biological Networks.

    PubMed

    Camacho, Diogo M; Collins, Katherine M; Powers, Rani K; Costello, James C; Collins, James J

    2018-06-14

    Machine learning, a collection of data-analytical techniques aimed at building predictive models from multi-dimensional datasets, is becoming integral to modern biological research. By enabling one to generate models that learn from large datasets and make predictions on likely outcomes, machine learning can be used to study complex cellular systems such as biological networks. Here, we provide a primer on machine learning for life scientists, including an introduction to deep learning. We discuss opportunities and challenges at the intersection of machine learning and network biology, which could impact disease biology, drug discovery, microbiome research, and synthetic biology. Copyright © 2018 Elsevier Inc. All rights reserved.

  10. Cataloging the biomedical world of pain through semi-automated curation of molecular interactions

    PubMed Central

    Jamieson, Daniel G.; Roberts, Phoebe M.; Robertson, David L.; Sidders, Ben; Nenadic, Goran

    2013-01-01

    The vast collection of biomedical literature and its continued expansion has presented a number of challenges to researchers who require structured findings to stay abreast of and analyze molecular mechanisms relevant to their domain of interest. By structuring literature content into topic-specific machine-readable databases, the aggregate data from multiple articles can be used to infer trends that can be compared and contrasted with similar findings from topic-independent resources. Our study presents a generalized procedure for semi-automatically creating a custom topic-specific molecular interaction database through the use of text mining to assist manual curation. We apply the procedure to capture molecular events that underlie ‘pain’, a complex phenomenon with a large societal burden and unmet medical need. We describe how existing text mining solutions are used to build a pain-specific corpus, extract molecular events from it, add context to the extracted events and assess their relevance. The pain-specific corpus contains 765 692 documents from Medline and PubMed Central, from which we extracted 356 499 unique normalized molecular events, with 261 438 single protein events and 93 271 molecular interactions supplied by BioContext. Event chains are annotated with negation, speculation, anatomy, Gene Ontology terms, mutations, pain and disease relevance, which collectively provide detailed insight into how that event chain is associated with pain. The extracted relations are visualized in a wiki platform (wiki-pain.org) that enables efficient manual curation and exploration of the molecular mechanisms that underlie pain. Curation of 1500 grouped event chains ranked by pain relevance revealed 613 accurately extracted unique molecular interactions that in the future can be used to study the underlying mechanisms involved in pain. Our approach demonstrates that combining existing text mining tools with domain-specific terms and wiki-based visualization can facilitate rapid curation of molecular interactions to create a custom database. Database URL: ••• PMID:23707966

  11. Cataloging the biomedical world of pain through semi-automated curation of molecular interactions.

    PubMed

    Jamieson, Daniel G; Roberts, Phoebe M; Robertson, David L; Sidders, Ben; Nenadic, Goran

    2013-01-01

    The vast collection of biomedical literature and its continued expansion has presented a number of challenges to researchers who require structured findings to stay abreast of and analyze molecular mechanisms relevant to their domain of interest. By structuring literature content into topic-specific machine-readable databases, the aggregate data from multiple articles can be used to infer trends that can be compared and contrasted with similar findings from topic-independent resources. Our study presents a generalized procedure for semi-automatically creating a custom topic-specific molecular interaction database through the use of text mining to assist manual curation. We apply the procedure to capture molecular events that underlie 'pain', a complex phenomenon with a large societal burden and unmet medical need. We describe how existing text mining solutions are used to build a pain-specific corpus, extract molecular events from it, add context to the extracted events and assess their relevance. The pain-specific corpus contains 765 692 documents from Medline and PubMed Central, from which we extracted 356 499 unique normalized molecular events, with 261 438 single protein events and 93 271 molecular interactions supplied by BioContext. Event chains are annotated with negation, speculation, anatomy, Gene Ontology terms, mutations, pain and disease relevance, which collectively provide detailed insight into how that event chain is associated with pain. The extracted relations are visualized in a wiki platform (wiki-pain.org) that enables efficient manual curation and exploration of the molecular mechanisms that underlie pain. Curation of 1500 grouped event chains ranked by pain relevance revealed 613 accurately extracted unique molecular interactions that in the future can be used to study the underlying mechanisms involved in pain. Our approach demonstrates that combining existing text mining tools with domain-specific terms and wiki-based visualization can facilitate rapid curation of molecular interactions to create a custom database. Database URL: •••

  12. Molecular Nanotechnology and Designs of Future

    NASA Technical Reports Server (NTRS)

    Srivastava, Deepak; Chancellor, Marisa K. (Technical Monitor)

    1997-01-01

    Reviewing the status of current approaches and future projections, as already published in the scientific journals and books, the talk will summarize the direction in which computational and experimental molecular nanotechnologies are progressing. Examples of nanotechnological approach to the concepts of design and simulation of atomically precise materials in a variety of interdisciplinary areas will be presented. The concepts of hypothetical molecular machines and assemblers as explained in Drexler's and Merckle's already published work and Han et. al's WWW distributed molecular gears will be explained.

  13. DNA Bipedal Motor Achieves a Large Number of Steps Due to Operation Using Microfluidics-Based Interface.

    PubMed

    Tomov, Toma E; Tsukanov, Roman; Glick, Yair; Berger, Yaron; Liber, Miran; Avrahami, Dorit; Gerber, Doron; Nir, Eyal

    2017-04-25

    Realization of bioinspired molecular machines that can perform many and diverse operations in response to external chemical commands is a major goal in nanotechnology, but current molecular machines respond to only a few sequential commands. Lack of effective methods for introduction and removal of command compounds and low efficiencies of the reactions involved are major reasons for the limited performance. We introduce here a user interface based on a microfluidics device and single-molecule fluorescence spectroscopy that allows efficient introduction and removal of chemical commands and enables detailed study of the reaction mechanisms involved in the operation of synthetic molecular machines. The microfluidics provided 64 consecutive DNA strand commands to a DNA-based motor system immobilized inside the microfluidics, driving a bipedal walker to perform 32 steps on a DNA origami track. The microfluidics enabled removal of redundant strands, resulting in a 6-fold increase in processivity relative to an identical motor operated without strand removal and significantly more operations than previously reported for user-controlled DNA nanomachines. In the motor operated without strand removal, redundant strands interfere with motor operation and reduce its performance. The microfluidics also enabled computer control of motor direction and speed. Furthermore, analysis of the reaction kinetics and motor performance in the absence of redundant strands, made possible by the microfluidics, enabled accurate modeling of the walker processivity. This enabled identification of dynamic boundaries and provided an explanation, based on the "trap state" mechanism, for why the motor did not perform an even larger number of steps. This understanding is very important for the development of future motors with significantly improved performance. Our universal interface enables two-way communication between user and molecular machine and, relying on concepts similar to that of solid-phase synthesis, removes limitations on the number of external stimuli. This interface, therefore, is an important step toward realization of reliable, processive, reproducible, and useful externally controlled DNA nanomachines.

  14. Machine learning for medical images analysis.

    PubMed

    Criminisi, A

    2016-10-01

    This article discusses the application of machine learning for the analysis of medical images. Specifically: (i) We show how a special type of learning models can be thought of as automatically optimized, hierarchically-structured, rule-based algorithms, and (ii) We discuss how the issue of collecting large labelled datasets applies to both conventional algorithms as well as machine learning techniques. The size of the training database is a function of model complexity rather than a characteristic of machine learning methods. Crown Copyright © 2016. Published by Elsevier B.V. All rights reserved.

  15. Implementing Machine Learning in Radiology Practice and Research.

    PubMed

    Kohli, Marc; Prevedello, Luciano M; Filice, Ross W; Geis, J Raymond

    2017-04-01

    The purposes of this article are to describe concepts that radiologists should understand to evaluate machine learning projects, including common algorithms, supervised as opposed to unsupervised techniques, statistical pitfalls, and data considerations for training and evaluation, and to briefly describe ethical dilemmas and legal risk. Machine learning includes a broad class of computer programs that improve with experience. The complexity of creating, training, and monitoring machine learning indicates that the success of the algorithms will require radiologist involvement for years to come, leading to engagement rather than replacement.

  16. Nature versus design: synthetic biology or how to build a biological non-machine.

    PubMed

    Porcar, M; Peretó, J

    2016-04-18

    The engineering ideal of synthetic biology presupposes that organisms are composed of standard, interchangeable parts with a predictive behaviour. In one word, organisms are literally recognized as machines. Yet living objects are the result of evolutionary processes without any purposiveness, not of a design by external agents. Biological components show massive overlapping and functional degeneracy, standard-free complexity, intrinsic variation and context dependent performances. However, although organisms are not full-fledged machines, synthetic biologists may still be eager for machine-like behaviours from artificially modified biosystems.

  17. Quantum Computing: Solving Complex Problems

    ScienceCinema

    DiVincenzo, David

    2018-05-22

    One of the motivating ideas of quantum computation was that there could be a new kind of machine that would solve hard problems in quantum mechanics. There has been significant progress towards the experimental realization of these machines (which I will review), but there are still many questions about how such a machine could solve computational problems of interest in quantum physics. New categorizations of the complexity of computational problems have now been invented to describe quantum simulation. The bad news is that some of these problems are believed to be intractable even on a quantum computer, falling into a quantum analog of the NP class. The good news is that there are many other new classifications of tractability that may apply to several situations of physical interest.

  18. Machine learning in computational docking.

    PubMed

    Khamis, Mohamed A; Gomaa, Walid; Ahmed, Walaa F

    2015-03-01

    The objective of this paper is to highlight the state-of-the-art machine learning (ML) techniques in computational docking. The use of smart computational methods in the life cycle of drug design is relatively a recent development that has gained much popularity and interest over the last few years. Central to this methodology is the notion of computational docking which is the process of predicting the best pose (orientation + conformation) of a small molecule (drug candidate) when bound to a target larger receptor molecule (protein) in order to form a stable complex molecule. In computational docking, a large number of binding poses are evaluated and ranked using a scoring function. The scoring function is a mathematical predictive model that produces a score that represents the binding free energy, and hence the stability, of the resulting complex molecule. Generally, such a function should produce a set of plausible ligands ranked according to their binding stability along with their binding poses. In more practical terms, an effective scoring function should produce promising drug candidates which can then be synthesized and physically screened using high throughput screening process. Therefore, the key to computer-aided drug design is the design of an efficient highly accurate scoring function (using ML techniques). The methods presented in this paper are specifically based on ML techniques. Despite many traditional techniques have been proposed, the performance was generally poor. Only in the last few years started the application of the ML technology in the design of scoring functions; and the results have been very promising. The ML-based techniques are based on various molecular features extracted from the abundance of protein-ligand information in the public molecular databases, e.g., protein data bank bind (PDBbind). In this paper, we present this paradigm shift elaborating on the main constituent elements of the ML approach to molecular docking along with the state-of-the-art research in this area. For instance, the best random forest (RF)-based scoring function on PDBbind v2007 achieves a Pearson correlation coefficient between the predicted and experimentally determined binding affinities of 0.803 while the best conventional scoring function achieves 0.644. The best RF-based ranking power ranks the ligands correctly based on their experimentally determined binding affinities with accuracy 62.5% and identifies the top binding ligand with accuracy 78.1%. We conclude with open questions and potential future research directions that can be pursued in smart computational docking; using molecular features of different nature (geometrical, energy terms, pharmacophore), advanced ML techniques (e.g., deep learning), combining more than one ML models. Copyright © 2015 Elsevier B.V. All rights reserved.

  19. The Harvard Clean Energy Project: High-throughput screening of organic photovoltaic materials using cheminformatics, machine learning, and pattern recognition

    NASA Astrophysics Data System (ADS)

    Olivares-Amaya, Roberto; Hachmann, Johannes; Amador-Bedolla, Carlos; Daly, Aidan; Jinich, Adrian; Atahan-Evrenk, Sule; Boixo, Sergio; Aspuru-Guzik, Alán

    2012-02-01

    Organic photovoltaic devices have emerged as competitors to silicon-based solar cells, currently reaching efficiencies of over 9% and offering desirable properties for manufacturing and installation. We study conjugated donor polymers for high-efficiency bulk-heterojunction photovoltaic devices with a molecular library motivated by experimental feasibility. We use quantum mechanics and a distributed computing approach to explore this vast molecular space. We will detail the screening approach starting from the generation of the molecular library, which can be easily extended to other kinds of molecular systems. We will describe the screening method for these materials which ranges from descriptor models, ubiquitous in the drug discovery community, to eventually reaching first principles quantum chemistry methods. We will present results on the statistical analysis, based principally on machine learning, specifically partial least squares and Gaussian processes. Alongside, clustering methods and the use of the hypergeometric distribution reveal moieties important for the donor materials and allow us to quantify structure-property relationships. These efforts enable us to accelerate materials discovery in organic photovoltaics through our collaboration with experimental groups.

  20. Legionellosis Outbreak Associated with Asphalt Paving Machine, Spain, 2009

    PubMed Central

    Fenollar, José; Escribano, Isabel; González-Candelas, Fernando

    2010-01-01

    From 1999 through 2005 in Alcoi, Spain, incidence of legionellosis was continually high. Over the next 4 years, incidence was lower, but an increase in July 2009 led health authorities to declare an epidemic outbreak. A molecular epidemiology investigation showed that the allelic profiles for all Legionella pneumophila samples from the 2009 outbreak patients were the same, thus pointing to a common genetic origin for their infections, and that they were identical to that of the organism that had caused the previous outbreaks. Spatial-temporal and sequence-based typing analyses indicated a milling machine used in street asphalt repaving and its water tank as the most likely sources. As opposed to other machines used for street cleaning, the responsible milling machine used water from a natural spring. When the operation of this machine was prohibited and cleaning measures were adopted, infections ceased. PMID:20735921

  1. Powering the programmed nanostructure and function of gold nanoparticles with catenated DNA machines

    NASA Astrophysics Data System (ADS)

    Elbaz, Johann; Cecconello, Alessandro; Fan, Zhiyuan; Govorov, Alexander O.; Willner, Itamar

    2013-06-01

    DNA nanotechnology is a rapidly developing research area in nanoscience. It includes the development of DNA machines, tailoring of DNA nanostructures, application of DNA nanostructures for computing, and more. Different DNA machines were reported in the past and DNA-guided assembly of nanoparticles represents an active research effort in DNA nanotechnology. Several DNA-dictated nanoparticle structures were reported, including a tetrahedron, a triangle or linear nanoengineered nanoparticle structures; however, the programmed, dynamic reversible switching of nanoparticle structures and, particularly, the dictated switchable functions emerging from the nanostructures, are missing elements in DNA nanotechnology. Here we introduce DNA catenane systems (interlocked DNA rings) as molecular DNA machines for the programmed, reversible and switchable arrangement of different-sized gold nanoparticles. We further demonstrate that the machine-powered gold nanoparticle structures reveal unique emerging switchable spectroscopic features, such as plasmonic coupling or surface-enhanced fluorescence.

  2. Comparison of subsurface damages on mono-crystalline silicon between traditional nanoscale machining and laser-assisted nanoscale machining via molecular dynamics simulation

    NASA Astrophysics Data System (ADS)

    Dai, Houfu; Li, Shaobo; Chen, Genyu

    2018-01-01

    Molecular dynamics is employed to compare nanoscale traditional machining (TM) with laser-assisted machining (LAM). LAM is that the workpiece is locally heated by an intense laser beam prior to material removal. We have a comprehensive comparison between LAM and TM in terms of atomic trajectories, phase transformation, radial distribution function, chips, temperature distribution, number of atoms in different temperature, grinding temperature, grinding force, friction coefficient and atomic potential energy. It can be found that there is a decrease of atoms with five and six nearest neighbors, and LAM generates more chips than that in the TM. It indicates that LAM reduces the subsurface damage of workpiece, gets a better-qualified ground surface and improves the material removal rate. Moreover, laser energy makes the materials fully softened before being removed, the number of atoms with temperature above 500 K is increased, and the average temperature of workpiece higher and faster to reach the equilibrium in LAM. It means that LAM has an absolute advantage in machining materials and greatly reduces the material resistance. Not only the tangential force (Fx) and the normal force (Fy) but also friction coefficients become smaller as laser heating reduces the strength and hardness of the material in LAM. These results show that LAM is a promising technique since it can get a better-qualified workpiece surface with larger material removal rates, less grinding force and lower friction coefficient.

  3. EUCANEXT: an integrated database for the exploration of genomic and transcriptomic data from Eucalyptus species

    PubMed Central

    Nascimento, Leandro Costa; Salazar, Marcela Mendes; Lepikson-Neto, Jorge; Camargo, Eduardo Leal Oliveira; Parreiras, Lucas Salera; Carazzolle, Marcelo Falsarella

    2017-01-01

    Abstract Tree species of the genus Eucalyptus are the most valuable and widely planted hardwoods in the world. Given the economic importance of Eucalyptus trees, much effort has been made towards the generation of specimens with superior forestry properties that can deliver high-quality feedstocks, customized to the industrýs needs for both cellulosic (paper) and lignocellulosic biomass production. In line with these efforts, large sets of molecular data have been generated by several scientific groups, providing invaluable information that can be applied in the development of improved specimens. In order to fully explore the potential of available datasets, the development of a public database that provides integrated access to genomic and transcriptomic data from Eucalyptus is needed. EUCANEXT is a database that analyses and integrates publicly available Eucalyptus molecular data, such as the E. grandis genome assembly and predicted genes, ESTs from several species and digital gene expression from 26 RNA-Seq libraries. The database has been implemented in a Fedora Linux machine running MySQL and Apache, while Perl CGI was used for the web interfaces. EUCANEXT provides a user-friendly web interface for easy access and analysis of publicly available molecular data from Eucalyptus species. This integrated database allows for complex searches by gene name, keyword or sequence similarity and is publicly accessible at http://www.lge.ibi.unicamp.br/eucalyptusdb. Through EUCANEXT, users can perform complex analysis to identify genes related traits of interest using RNA-Seq libraries and tools for differential expression analysis. Moreover, all the bioinformatics pipeline here described, including the database schema and PERL scripts, are readily available and can be applied to any genomic and transcriptomic project, regardless of the organism. Database URL: http://www.lge.ibi.unicamp.br/eucalyptusdb PMID:29220468

  4. Electron cryomicroscopy structure of a membrane-anchored mitochondrial AAA protease.

    PubMed

    Lee, Sukyeong; Augustin, Steffen; Tatsuta, Takashi; Gerdes, Florian; Langer, Thomas; Tsai, Francis T F

    2011-02-11

    FtsH-related AAA proteases are conserved membrane-anchored, ATP-dependent molecular machines, which mediate the processing and turnover of soluble and membrane-embedded proteins in eubacteria, mitochondria, and chloroplasts. Homo- and hetero-oligomeric proteolytic complexes exist, which are composed of homologous subunits harboring an ATPase domain of the AAA family and an H41 metallopeptidase domain. Mutations in subunits of mitochondrial m-AAA proteases have been associated with different neurodegenerative disorders in human, raising questions on the functional differences between homo- and hetero-oligomeric AAA proteases. Here, we have analyzed the hetero-oligomeric yeast m-AAA protease composed of homologous Yta10 and Yta12 subunits. We combined genetic and structural approaches to define the molecular determinants for oligomer assembly and to assess functional similarities between Yta10 and Yta12. We demonstrate that replacement of only two amino acid residues within the metallopeptidase domain of Yta12 allows its assembly into homo-oligomeric complexes. To provide a molecular explanation, we determined the 12 Å resolution structure of the intact yeast m-AAA protease with its transmembrane domains by electron cryomicroscopy (cryo-EM) and atomic structure fitting. The full-length m-AAA protease has a bipartite structure and is a hexamer in solution. We found that residues in Yta12, which facilitate homo-oligomerization when mutated, are located at the interface between neighboring protomers in the hexamer ring. Notably, the transmembrane and intermembrane space domains are separated from the main body, creating a passage on the matrix side, which is wide enough to accommodate unfolded but not folded polypeptides. These results suggest a mechanism regarding how proteins are recognized and degraded by m-AAA proteases.

  5. Air Bearings Machined On Ultra Precision, Hydrostatic CNC-Lathe

    NASA Astrophysics Data System (ADS)

    Knol, Pierre H.; Szepesi, Denis; Deurwaarder, Jan M.

    1987-01-01

    Micromachining of precision elements requires an adequate machine concept to meet the high demand of surface finish, dimensional and shape accuracy. The Hembrug ultra precision lathes have been exclusively designed with hydrostatic principles for main spindle and guideways. This concept is to be explained with some major advantages of hydrostatics compared with aerostatics at universal micromachining applications. Hembrug has originally developed the conventional Mikroturn ultra precision facing lathes, for diamond turning of computer memory discs. This first generation of machines was followed by the advanced computer numerically controlled types for machining of complex precision workpieces. One of these parts, an aerostatic bearing component has been succesfully machined on the Super-Mikroturn CNC. A case study of airbearing machining confirms the statement that a good result of the micromachining does not depend on machine performance alone, but also on the technology applied.

  6. Exploring cluster Monte Carlo updates with Boltzmann machines

    NASA Astrophysics Data System (ADS)

    Wang, Lei

    2017-11-01

    Boltzmann machines are physics informed generative models with broad applications in machine learning. They model the probability distribution of an input data set with latent variables and generate new samples accordingly. Applying the Boltzmann machines back to physics, they are ideal recommender systems to accelerate the Monte Carlo simulation of physical systems due to their flexibility and effectiveness. More intriguingly, we show that the generative sampling of the Boltzmann machines can even give different cluster Monte Carlo algorithms. The latent representation of the Boltzmann machines can be designed to mediate complex interactions and identify clusters of the physical system. We demonstrate these findings with concrete examples of the classical Ising model with and without four-spin plaquette interactions. In the future, automatic searches in the algorithm space parametrized by Boltzmann machines may discover more innovative Monte Carlo updates.

  7. Machine learning in cardiovascular medicine: are we there yet?

    PubMed

    Shameer, Khader; Johnson, Kipp W; Glicksberg, Benjamin S; Dudley, Joel T; Sengupta, Partho P

    2018-01-19

    Artificial intelligence (AI) broadly refers to analytical algorithms that iteratively learn from data, allowing computers to find hidden insights without being explicitly programmed where to look. These include a family of operations encompassing several terms like machine learning, cognitive learning, deep learning and reinforcement learning-based methods that can be used to integrate and interpret complex biomedical and healthcare data in scenarios where traditional statistical methods may not be able to perform. In this review article, we discuss the basics of machine learning algorithms and what potential data sources exist; evaluate the need for machine learning; and examine the potential limitations and challenges of implementing machine in the context of cardiovascular medicine. The most promising avenues for AI in medicine are the development of automated risk prediction algorithms which can be used to guide clinical care; use of unsupervised learning techniques to more precisely phenotype complex disease; and the implementation of reinforcement learning algorithms to intelligently augment healthcare providers. The utility of a machine learning-based predictive model will depend on factors including data heterogeneity, data depth, data breadth, nature of modelling task, choice of machine learning and feature selection algorithms, and orthogonal evidence. A critical understanding of the strength and limitations of various methods and tasks amenable to machine learning is vital. By leveraging the growing corpus of big data in medicine, we detail pathways by which machine learning may facilitate optimal development of patient-specific models for improving diagnoses, intervention and outcome in cardiovascular medicine. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  8. Coupling Matched Molecular Pairs with Machine Learning for Virtual Compound Optimization.

    PubMed

    Turk, Samo; Merget, Benjamin; Rippmann, Friedrich; Fulle, Simone

    2017-12-26

    Matched molecular pair (MMP) analyses are widely used in compound optimization projects to gain insights into structure-activity relationships (SAR). The analysis is traditionally done via statistical methods but can also be employed together with machine learning (ML) approaches to extrapolate to novel compounds. The here introduced MMP/ML method combines a fragment-based MMP implementation with different machine learning methods to obtain automated SAR decomposition and prediction. To test the prediction capabilities and model transferability, two different compound optimization scenarios were designed: (1) "new fragments" which occurs when exploring new fragments for a defined compound series and (2) "new static core and transformations" which resembles for instance the identification of a new compound series. Very good results were achieved by all employed machine learning methods especially for the new fragments case, but overall deep neural network models performed best, allowing reliable predictions also for the new static core and transformations scenario, where comprehensive SAR knowledge of the compound series is missing. Furthermore, we show that models trained on all available data have a higher generalizability compared to models trained on focused series and can extend beyond chemical space covered in the training data. Thus, coupling MMP with deep neural networks provides a promising approach to make high quality predictions on various data sets and in different compound optimization scenarios.

  9. DFT modeling of chemistry on the Z machine

    NASA Astrophysics Data System (ADS)

    Mattsson, Thomas

    2013-06-01

    Density Functional Theory (DFT) has proven remarkably accurate in predicting properties of matter under shock compression for a wide-range of elements and compounds: from hydrogen to xenon via water. Materials where chemistry plays a role are of particular interest for many applications. For example the deep interiors of Neptune, Uranus, and hundreds of similar exoplanets are composed of molecular ices of carbon, hydrogen, oxygen, and nitrogen at pressures of several hundred GPa and temperatures of many thousand Kelvin. High-quality thermophysical experimental data and high-fidelity simulations including chemical reaction are necessary to constrain planetary models over a large range of conditions. As examples of where chemical reactions are important, and demonstration of the high fidelity possible for these both structurally and chemically complex systems, we will discuss shock- and re-shock of liquid carbon dioxide (CO2) in the range 100 to 800 GPa, shock compression of the hydrocarbon polymers polyethylene (PE) and poly(4-methyl-1-pentene) (PMP), and finally simulations of shock compression of glow discharge polymer (GDP) including the effects of doping with germanium. Experimental results from Sandia's Z machine have time and again validated the DFT simulations at extreme conditions and the combination of experiment and DFT provide reliable data for evaluating existing and constructing future wide-range equations of state models for molecular compounds like CO2 and polymers like PE, PMP, and GDP. Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Company, for the U.S. Department of Energy's National Nuclear Security Administration under contract DE-AC04-94AL85000.

  10. Artificial Dipolar Molecular Rotors

    NASA Astrophysics Data System (ADS)

    Horansky, R. D.; Magnera, T. F.; Price, J. C.; Michl, J.

    Rotors are present in almost every macroscopic machine, converting rotational motion into energy of other forms, or converting other forms of energy into rotation. Rotation may be transmitted via belts or gears, converted into linear motion by various linkages, or used to drive propellers to produce fluid motion. Examples of macroscopic rotors include engines which couple to combustible energy sources, windmills which couple to air flows, and most generators of electricity. A key feature of these objects is the presence of a part with rotational freedom relative to a stationary frame. In this chapter we discuss the miniaturization of rotary machines all the way to the molecular scale, where chemical groups form the rotary and stationary parts. For a recent review of molecules with rotary and stationary parts see [1].

  11. 50 GFlops molecular dynamics on the Connection Machine 5

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lomdahl, P.S.; Tamayo, P.; Groenbech-Jensen, N.

    1993-12-31

    The authors present timings and performance numbers for a new short range three dimensional (3D) molecular dynamics (MD) code, SPaSM, on the Connection Machine-5 (CM-5). They demonstrate that runs with more than 10{sup 8} particles are now possible on massively parallel MIMD computers. To the best of their knowledge this is at least an order of magnitude more particles than what has previously been reported. Typical production runs show sustained performance (including communication) in the range of 47--50 GFlops on a 1024 node CM-5 with vector units (VUs). The speed of the code scales linearly with the number of processorsmore » and with the number of particles and shows 95% parallel efficiency in the speedup.« less

  12. Implementation of the force decomposition machine for molecular dynamics simulations.

    PubMed

    Borštnik, Urban; Miller, Benjamin T; Brooks, Bernard R; Janežič, Dušanka

    2012-09-01

    We present the design and implementation of the force decomposition machine (FDM), a cluster of personal computers (PCs) that is tailored to running molecular dynamics (MD) simulations using the distributed diagonal force decomposition (DDFD) parallelization method. The cluster interconnect architecture is optimized for the communication pattern of the DDFD method. Our implementation of the FDM relies on standard commodity components even for networking. Although the cluster is meant for DDFD MD simulations, it remains general enough for other parallel computations. An analysis of several MD simulation runs on both the FDM and a standard PC cluster demonstrates that the FDM's interconnect architecture provides a greater performance compared to a more general cluster interconnect. Copyright © 2012 Elsevier Inc. All rights reserved.

  13. Applications of machine learning in cancer prediction and prognosis.

    PubMed

    Cruz, Joseph A; Wishart, David S

    2007-02-11

    Machine learning is a branch of artificial intelligence that employs a variety of statistical, probabilistic and optimization techniques that allows computers to "learn" from past examples and to detect hard-to-discern patterns from large, noisy or complex data sets. This capability is particularly well-suited to medical applications, especially those that depend on complex proteomic and genomic measurements. As a result, machine learning is frequently used in cancer diagnosis and detection. More recently machine learning has been applied to cancer prognosis and prediction. This latter approach is particularly interesting as it is part of a growing trend towards personalized, predictive medicine. In assembling this review we conducted a broad survey of the different types of machine learning methods being used, the types of data being integrated and the performance of these methods in cancer prediction and prognosis. A number of trends are noted, including a growing dependence on protein biomarkers and microarray data, a strong bias towards applications in prostate and breast cancer, and a heavy reliance on "older" technologies such artificial neural networks (ANNs) instead of more recently developed or more easily interpretable machine learning methods. A number of published studies also appear to lack an appropriate level of validation or testing. Among the better designed and validated studies it is clear that machine learning methods can be used to substantially (15-25%) improve the accuracy of predicting cancer susceptibility, recurrence and mortality. At a more fundamental level, it is also evident that machine learning is also helping to improve our basic understanding of cancer development and progression.

  14. Applications of Machine Learning in Cancer Prediction and Prognosis

    PubMed Central

    Cruz, Joseph A.; Wishart, David S.

    2006-01-01

    Machine learning is a branch of artificial intelligence that employs a variety of statistical, probabilistic and optimization techniques that allows computers to “learn” from past examples and to detect hard-to-discern patterns from large, noisy or complex data sets. This capability is particularly well-suited to medical applications, especially those that depend on complex proteomic and genomic measurements. As a result, machine learning is frequently used in cancer diagnosis and detection. More recently machine learning has been applied to cancer prognosis and prediction. This latter approach is particularly interesting as it is part of a growing trend towards personalized, predictive medicine. In assembling this review we conducted a broad survey of the different types of machine learning methods being used, the types of data being integrated and the performance of these methods in cancer prediction and prognosis. A number of trends are noted, including a growing dependence on protein biomarkers and microarray data, a strong bias towards applications in prostate and breast cancer, and a heavy reliance on “older” technologies such artificial neural networks (ANNs) instead of more recently developed or more easily interpretable machine learning methods. A number of published studies also appear to lack an appropriate level of validation or testing. Among the better designed and validated studies it is clear that machine learning methods can be used to substantially (15–25%) improve the accuracy of predicting cancer susceptibility, recurrence and mortality. At a more fundamental level, it is also evident that machine learning is also helping to improve our basic understanding of cancer development and progression. PMID:19458758

  15. Machine Learning for Detecting Gene-Gene Interactions

    PubMed Central

    McKinney, Brett A.; Reif, David M.; Ritchie, Marylyn D.; Moore, Jason H.

    2011-01-01

    Complex interactions among genes and environmental factors are known to play a role in common human disease aetiology. There is a growing body of evidence to suggest that complex interactions are ‘the norm’ and, rather than amounting to a small perturbation to classical Mendelian genetics, interactions may be the predominant effect. Traditional statistical methods are not well suited for detecting such interactions, especially when the data are high dimensional (many attributes or independent variables) or when interactions occur between more than two polymorphisms. In this review, we discuss machine-learning models and algorithms for identifying and characterising susceptibility genes in common, complex, multifactorial human diseases. We focus on the following machine-learning methods that have been used to detect gene-gene interactions: neural networks, cellular automata, random forests, and multifactor dimensionality reduction. We conclude with some ideas about how these methods and others can be integrated into a comprehensive and flexible framework for data mining and knowledge discovery in human genetics. PMID:16722772

  16. A review on prognostic techniques for non-stationary and non-linear rotating systems

    NASA Astrophysics Data System (ADS)

    Kan, Man Shan; Tan, Andy C. C.; Mathew, Joseph

    2015-10-01

    The field of prognostics has attracted significant interest from the research community in recent times. Prognostics enables the prediction of failures in machines resulting in benefits to plant operators such as shorter downtimes, higher operation reliability, reduced operations and maintenance cost, and more effective maintenance and logistics planning. Prognostic systems have been successfully deployed for the monitoring of relatively simple rotating machines. However, machines and associated systems today are increasingly complex. As such, there is an urgent need to develop prognostic techniques for such complex systems operating in the real world. This review paper focuses on prognostic techniques that can be applied to rotating machinery operating under non-linear and non-stationary conditions. The general concept of these techniques, the pros and cons of applying these methods, as well as their applications in the research field are discussed. Finally, the opportunities and challenges in implementing prognostic systems and developing effective techniques for monitoring machines operating under non-stationary and non-linear conditions are also discussed.

  17. Conversion of Mass Storage Hierarchy in an IBM Computer Network

    DTIC Science & Technology

    1989-03-01

    storage devices GUIDE IBM users’ group for DOS operating systems IBM International Business Machines IBM 370/145 CPU introduced in 1970 IBM 370/168 CPU...February 12, 1985, Information Systems Group, International Business Machines Corporation. "IBM 3090 Processor Complex" and 񓼪 Mass Storage System...34 Mainframe Journal, pp. 15-26, 64-65, Dallas, Texas, September-October 1987. 3. International Business Machines Corporation, Introduction to IBM 3S80 Storage

  18. Machine learning applications in genetics and genomics.

    PubMed

    Libbrecht, Maxwell W; Noble, William Stafford

    2015-06-01

    The field of machine learning, which aims to develop computer algorithms that improve with experience, holds promise to enable computers to assist humans in the analysis of large, complex data sets. Here, we provide an overview of machine learning applications for the analysis of genome sequencing data sets, including the annotation of sequence elements and epigenetic, proteomic or metabolomic data. We present considerations and recurrent challenges in the application of supervised, semi-supervised and unsupervised machine learning methods, as well as of generative and discriminative modelling approaches. We provide general guidelines to assist in the selection of these machine learning methods and their practical application for the analysis of genetic and genomic data sets.

  19. Quantum Machine Learning over Infinite Dimensions

    DOE PAGES

    Lau, Hoi-Kwan; Pooser, Raphael; Siopsis, George; ...

    2017-02-21

    Machine learning is a fascinating and exciting eld within computer science. Recently, this ex- citement has been transferred to the quantum information realm. Currently, all proposals for the quantum version of machine learning utilize the nite-dimensional substrate of discrete variables. Here we generalize quantum machine learning to the more complex, but still remarkably practi- cal, in nite-dimensional systems. We present the critical subroutines of quantum machine learning algorithms for an all-photonic continuous-variable quantum computer that achieve an exponential speedup compared to their equivalent classical counterparts. Finally, we also map out an experi- mental implementation which can be used as amore » blueprint for future photonic demonstrations.« less

  20. Machine learning in heart failure: ready for prime time.

    PubMed

    Awan, Saqib Ejaz; Sohel, Ferdous; Sanfilippo, Frank Mario; Bennamoun, Mohammed; Dwivedi, Girish

    2018-03-01

    The aim of this review is to present an up-to-date overview of the application of machine learning methods in heart failure including diagnosis, classification, readmissions and medication adherence. Recent studies have shown that the application of machine learning techniques may have the potential to improve heart failure outcomes and management, including cost savings by improving existing diagnostic and treatment support systems. Recently developed deep learning methods are expected to yield even better performance than traditional machine learning techniques in performing complex tasks by learning the intricate patterns hidden in big medical data. The review summarizes the recent developments in the application of machine and deep learning methods in heart failure management.

  1. Quantum Machine Learning over Infinite Dimensions

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lau, Hoi-Kwan; Pooser, Raphael; Siopsis, George

    Machine learning is a fascinating and exciting eld within computer science. Recently, this ex- citement has been transferred to the quantum information realm. Currently, all proposals for the quantum version of machine learning utilize the nite-dimensional substrate of discrete variables. Here we generalize quantum machine learning to the more complex, but still remarkably practi- cal, in nite-dimensional systems. We present the critical subroutines of quantum machine learning algorithms for an all-photonic continuous-variable quantum computer that achieve an exponential speedup compared to their equivalent classical counterparts. Finally, we also map out an experi- mental implementation which can be used as amore » blueprint for future photonic demonstrations.« less

  2. [Molecular imaging; current status and future prospects in USA].

    PubMed

    Kobayashi, Hisataka

    2007-02-01

    The goal of this review is to introduce the definition, current status, and future prospects of the molecular imaging, which has recently been a hot topic in medicine and the biological science in USA. In vivo imaging methods to visualize the molecular events and functions in organs or animals/humans are overviewed and discussed especially in combinations of imaging modalities (machines) and contrast agents(chemicals) used in the molecular imaging. Next, the close relationship between the molecular imaging and the nanotechnology, an important part of nanomedicine, is stressed from the aspect of united multidisciplinary sciences such as physics, chemistry, biology, and medicine.

  3. Machine Phase Fullerene Nanotechnology: 1996

    NASA Technical Reports Server (NTRS)

    Globus, Al; Chancellor, Marisa K. (Technical Monitor)

    1997-01-01

    NASA has used exotic materials for spacecraft and experimental aircraft to good effect for many decades. In spite of many advances, transportation to space still costs about $10,000 per pound. Drexler has proposed a hypothetical nanotechnology based on diamond and investigated the properties of such molecular systems. These studies and others suggest enormous potential for aerospace systems. Unfortunately, methods to realize diamonoid nanotechnology are at best highly speculative. Recent computational efforts at NASA Ames Research Center and computation and experiment elsewhere suggest that a nanotechnology of machine phase functionalized fullerenes may be synthetically relatively accessible and of great aerospace interest. Machine phase materials are (hypothetical) materials consisting entirely or in large part of microscopic machines. In a sense, most living matter fits this definition. To begin investigation of fullerene nanotechnology, we used molecular dynamics to study the properties of carbon nanotube based gears and gear/shaft configurations. Experiments on C60 and quantum calculations suggest that benzyne may react with carbon nanotubes to form gear teeth. Han has computationally demonstrated that molecular gears fashioned from (14,0) single-walled carbon nanotubes and benzyne teeth should operate well at 50-100 gigahertz. Results suggest that rotation can be converted to rotating or linear motion, and linear motion may be converted into rotation. Preliminary results suggest that these mechanical systems can be cooled by a helium atmosphere. Furthermore, Deepak has successfully simulated using helical electric fields generated by a laser to power fullerene gears once a positive and negative charge have been added to form a dipole. Even with mechanical motion, cooling, and power; creating a viable nanotechnology requires support structures, computer control, a system architecture, a variety of components, and some approach to manufacture. Additional information is contained within the original extended abstract.

  4. The mismeasure of machine: Synthetic biology and the trouble with engineering metaphors.

    PubMed

    Boudry, Maarten; Pigliucci, Massimo

    2013-12-01

    The scientific study of living organisms is permeated by machine and design metaphors. Genes are thought of as the "blueprint" of an organism, organisms are "reverse engineered" to discover their functionality, and living cells are compared to biochemical factories, complete with assembly lines, transport systems, messenger circuits, etc. Although the notion of design is indispensable to think about adaptations, and engineering analogies have considerable heuristic value (e.g., optimality assumptions), we argue they are limited in several important respects. In particular, the analogy with human-made machines falters when we move down to the level of molecular biology and genetics. Living organisms are far more messy and less transparent than human-made machines. Notoriously, evolution is an opportunistic tinkerer, blindly stumbling on "designs" that no sensible engineer would come up with. Despite impressive technological innovation, the prospect of artificially designing new life forms from scratch has proven more difficult than the superficial analogy with "programming" the right "software" would suggest. The idea of applying straightforward engineering approaches to living systems and their genomes-isolating functional components, designing new parts from scratch, recombining and assembling them into novel life forms-pushes the analogy with human artifacts beyond its limits. In the absence of a one-to-one correspondence between genotype and phenotype, there is no straightforward way to implement novel biological functions and design new life forms. Both the developmental complexity of gene expression and the multifarious interactions of genes and environments are serious obstacles for "engineering" a particular phenotype. The problem of reverse-engineering a desired phenotype to its genetic "instructions" is probably intractable for any but the most simple phenotypes. Recent developments in the field of bio-engineering and synthetic biology reflect these limitations. Instead of genetically engineering a desired trait from scratch, as the machine/engineering metaphor promises, researchers are making greater strides by co-opting natural selection to "search" for a suitable genotype, or by borrowing and recombining genetic material from extant life forms. Copyright © 2013 Elsevier Ltd. All rights reserved.

  5. Hybrid PolyLingual Object Model: An Efficient and Seamless Integration of Java and Native Components on the Dalvik Virtual Machine

    PubMed Central

    Huang, Yukun; Chen, Rong; Wei, Jingbo; Pei, Xilong; Cao, Jing; Prakash Jayaraman, Prem; Ranjan, Rajiv

    2014-01-01

    JNI in the Android platform is often observed with low efficiency and high coding complexity. Although many researchers have investigated the JNI mechanism, few of them solve the efficiency and the complexity problems of JNI in the Android platform simultaneously. In this paper, a hybrid polylingual object (HPO) model is proposed to allow a CAR object being accessed as a Java object and as vice in the Dalvik virtual machine. It is an acceptable substitute for JNI to reuse the CAR-compliant components in Android applications in a seamless and efficient way. The metadata injection mechanism is designed to support the automatic mapping and reflection between CAR objects and Java objects. A prototype virtual machine, called HPO-Dalvik, is implemented by extending the Dalvik virtual machine to support the HPO model. Lifespan management, garbage collection, and data type transformation of HPO objects are also handled in the HPO-Dalvik virtual machine automatically. The experimental result shows that the HPO model outweighs the standard JNI in lower overhead on native side, better executing performance with no JNI bridging code being demanded. PMID:25110745

  6. Informatics and machine learning to define the phenotype.

    PubMed

    Basile, Anna Okula; Ritchie, Marylyn DeRiggi

    2018-03-01

    For the past decade, the focus of complex disease research has been the genotype. From technological advancements to the development of analysis methods, great progress has been made. However, advances in our definition of the phenotype have remained stagnant. Phenotype characterization has recently emerged as an exciting area of informatics and machine learning. The copious amounts of diverse biomedical data that have been collected may be leveraged with data-driven approaches to elucidate trait-related features and patterns. Areas covered: In this review, the authors discuss the phenotype in traditional genetic associations and the challenges this has imposed.Approaches for phenotype refinement that can aid in more accurate characterization of traits are also discussed. Further, the authors highlight promising machine learning approaches for establishing a phenotype and the challenges of electronic health record (EHR)-derived data. Expert commentary: The authors hypothesize that through unsupervised machine learning, data-driven approaches can be used to define phenotypes rather than relying on expert clinician knowledge. Through the use of machine learning and an unbiased set of features extracted from clinical repositories, researchers will have the potential to further understand complex traits and identify patient subgroups. This knowledge may lead to more preventative and precise clinical care.

  7. Generation of a Multicomponent Library of Disulfide Donor-Acceptor Architectures Using Dynamic Combinatorial Chemistry

    PubMed Central

    Drożdż, Wojciech; Kołodziejski, Michał; Markiewicz, Grzegorz; Jenczak, Anna; Stefankiewicz, Artur R.

    2015-01-01

    We describe here the generation of new donor-acceptor disulfide architectures obtained in aqueous solution at physiological pH. The application of a dynamic combinatorial chemistry approach allowed us to generate a large number of new disulfide macrocyclic architectures together with a new type of [2]catenanes consisting of four distinct components. Up to fifteen types of structurally-distinct dynamic architectures have been generated through one-pot disulfide exchange reactions between four thiol-functionalized aqueous components. The distribution of disulfide products formed was found to be strongly dependent on the structural features of the thiol components employed. This work not only constitutes a success in the synthesis of topologically- and morphologically-complex targets, but it may also open new horizons for the use of this methodology in the construction of molecular machines. PMID:26193265

  8. Generation of a Multicomponent Library of Disulfide Donor-Acceptor Architectures Using Dynamic Combinatorial Chemistry.

    PubMed

    Drożdż, Wojciech; Kołodziejski, Michał; Markiewicz, Grzegorz; Jenczak, Anna; Stefankiewicz, Artur R

    2015-07-17

    We describe here the generation of new donor-acceptor disulfide architectures obtained in aqueous solution at physiological pH. The application of a dynamic combinatorial chemistry approach allowed us to generate a large number of new disulfide macrocyclic architectures together with a new type of [2]catenanes consisting of four distinct components. Up to fifteen types of structurally-distinct dynamic architectures have been generated through one-pot disulfide exchange reactions between four thiol-functionalized aqueous components. The distribution of disulfide products formed was found to be strongly dependent on the structural features of the thiol components employed. This work not only constitutes a success in the synthesis of topologically- and morphologically-complex targets, but it may also open new horizons for the use of this methodology in the construction of molecular machines.

  9. ClearTK 2.0: Design Patterns for Machine Learning in UIMA

    PubMed Central

    Bethard, Steven; Ogren, Philip; Becker, Lee

    2014-01-01

    ClearTK adds machine learning functionality to the UIMA framework, providing wrappers to popular machine learning libraries, a rich feature extraction library that works across different classifiers, and utilities for applying and evaluating machine learning models. Since its inception in 2008, ClearTK has evolved in response to feedback from developers and the community. This evolution has followed a number of important design principles including: conceptually simple annotator interfaces, readable pipeline descriptions, minimal collection readers, type system agnostic code, modules organized for ease of import, and assisting user comprehension of the complex UIMA framework. PMID:29104966

  10. ClearTK 2.0: Design Patterns for Machine Learning in UIMA.

    PubMed

    Bethard, Steven; Ogren, Philip; Becker, Lee

    2014-05-01

    ClearTK adds machine learning functionality to the UIMA framework, providing wrappers to popular machine learning libraries, a rich feature extraction library that works across different classifiers, and utilities for applying and evaluating machine learning models. Since its inception in 2008, ClearTK has evolved in response to feedback from developers and the community. This evolution has followed a number of important design principles including: conceptually simple annotator interfaces, readable pipeline descriptions, minimal collection readers, type system agnostic code, modules organized for ease of import, and assisting user comprehension of the complex UIMA framework.

  11. Studying depression using imaging and machine learning methods.

    PubMed

    Patel, Meenal J; Khalaf, Alexander; Aizenstein, Howard J

    2016-01-01

    Depression is a complex clinical entity that can pose challenges for clinicians regarding both accurate diagnosis and effective timely treatment. These challenges have prompted the development of multiple machine learning methods to help improve the management of this disease. These methods utilize anatomical and physiological data acquired from neuroimaging to create models that can identify depressed patients vs. non-depressed patients and predict treatment outcomes. This article (1) presents a background on depression, imaging, and machine learning methodologies; (2) reviews methodologies of past studies that have used imaging and machine learning to study depression; and (3) suggests directions for future depression-related studies.

  12. Energy-free machine learning force field for aluminum.

    PubMed

    Kruglov, Ivan; Sergeev, Oleg; Yanilkin, Alexey; Oganov, Artem R

    2017-08-17

    We used the machine learning technique of Li et al. (PRL 114, 2015) for molecular dynamics simulations. Atomic configurations were described by feature matrix based on internal vectors, and linear regression was used as a learning technique. We implemented this approach in the LAMMPS code. The method was applied to crystalline and liquid aluminum and uranium at different temperatures and densities, and showed the highest accuracy among different published potentials. Phonon density of states, entropy and melting temperature of aluminum were calculated using this machine learning potential. The results are in excellent agreement with experimental data and results of full ab initio calculations.

  13. Means and method of balancing multi-cylinder reciprocating machines

    DOEpatents

    Corey, John A.; Walsh, Michael M.

    1985-01-01

    A virtual balancing axis arrangement is described for multi-cylinder reciprocating piston machines for effectively balancing out imbalanced forces and minimizing residual imbalance moments acting on the crankshaft of such machines without requiring the use of additional parallel-arrayed balancing shafts or complex and expensive gear arrangements. The novel virtual balancing axis arrangement is capable of being designed into multi-cylinder reciprocating piston and crankshaft machines for substantially reducing vibrations induced during operation of such machines with only minimal number of additional component parts. Some of the required component parts may be available from parts already required for operation of auxiliary equipment, such as oil and water pumps used in certain types of reciprocating piston and crankshaft machine so that by appropriate location and dimensioning in accordance with the teachings of the invention, the virtual balancing axis arrangement can be built into the machine at little or no additional cost.

  14. Computational Nanotechnology of Molecular Materials, Electronics and Machines

    NASA Technical Reports Server (NTRS)

    Srivastava, D.; Biegel, Bryan A. (Technical Monitor)

    2002-01-01

    This viewgraph presentation covers carbon nanotubes, their characteristics, and their potential future applications. The presentation include predictions on the development of nanostructures and their applications, the thermal characteristics of carbon nanotubes, mechano-chemical effects upon carbon nanotubes, molecular electronics, and models for possible future nanostructure devices. The presentation also proposes a neural model for signal processing.

  15. Management Strategies for Complex Adaptive Systems: Sensemaking, Learning, and Improvisation

    ERIC Educational Resources Information Center

    McDaniel, Reuben R., Jr.

    2007-01-01

    Misspecification of the nature of organizations may be a major reason for difficulty in achieving performance improvement. Organizations are often viewed as machine-like, but complexity science suggests that organizations should be viewed as complex adaptive systems. I identify the characteristics of complex adaptive systems and give examples of…

  16. Training Knowledge Bots for Physics-Based Simulations Using Artificial Neural Networks

    NASA Technical Reports Server (NTRS)

    Samareh, Jamshid A.; Wong, Jay Ming

    2014-01-01

    Millions of complex physics-based simulations are required for design of an aerospace vehicle. These simulations are usually performed by highly trained and skilled analysts, who execute, monitor, and steer each simulation. Analysts rely heavily on their broad experience that may have taken 20-30 years to accumulate. In addition, the simulation software is complex in nature, requiring significant computational resources. Simulations of system of systems become even more complex and are beyond human capacity to effectively learn their behavior. IBM has developed machines that can learn and compete successfully with a chess grandmaster and most successful jeopardy contestants. These machines are capable of learning some complex problems much faster than humans can learn. In this paper, we propose using artificial neural network to train knowledge bots to identify the idiosyncrasies of simulation software and recognize patterns that can lead to successful simulations. We examine the use of knowledge bots for applications of computational fluid dynamics (CFD), trajectory analysis, commercial finite-element analysis software, and slosh propellant dynamics. We will show that machine learning algorithms can be used to learn the idiosyncrasies of computational simulations and identify regions of instability without including any additional information about their mathematical form or applied discretization approaches.

  17. Predicting domain-domain interaction based on domain profiles with feature selection and support vector machines

    PubMed Central

    2010-01-01

    Background Protein-protein interaction (PPI) plays essential roles in cellular functions. The cost, time and other limitations associated with the current experimental methods have motivated the development of computational methods for predicting PPIs. As protein interactions generally occur via domains instead of the whole molecules, predicting domain-domain interaction (DDI) is an important step toward PPI prediction. Computational methods developed so far have utilized information from various sources at different levels, from primary sequences, to molecular structures, to evolutionary profiles. Results In this paper, we propose a computational method to predict DDI using support vector machines (SVMs), based on domains represented as interaction profile hidden Markov models (ipHMM) where interacting residues in domains are explicitly modeled according to the three dimensional structural information available at the Protein Data Bank (PDB). Features about the domains are extracted first as the Fisher scores derived from the ipHMM and then selected using singular value decomposition (SVD). Domain pairs are represented by concatenating their selected feature vectors, and classified by a support vector machine trained on these feature vectors. The method is tested by leave-one-out cross validation experiments with a set of interacting protein pairs adopted from the 3DID database. The prediction accuracy has shown significant improvement as compared to InterPreTS (Interaction Prediction through Tertiary Structure), an existing method for PPI prediction that also uses the sequences and complexes of known 3D structure. Conclusions We show that domain-domain interaction prediction can be significantly enhanced by exploiting information inherent in the domain profiles via feature selection based on Fisher scores, singular value decomposition and supervised learning based on support vector machines. Datasets and source code are freely available on the web at http://liao.cis.udel.edu/pub/svdsvm. Implemented in Matlab and supported on Linux and MS Windows. PMID:21034480

  18. Solid surface vs. liquid surface: nanoarchitectonics, molecular machines, and DNA origami.

    PubMed

    Ariga, Katsuhiko; Mori, Taizo; Nakanishi, Waka; Hill, Jonathan P

    2017-09-13

    The investigation of molecules and materials at interfaces is critical for the accumulation of new scientific insights and technological advances in the chemical and physical sciences. Immobilization on solid surfaces permits the investigation of different properties of functional molecules or materials with high sensitivity and high spatial resolution. Liquid surfaces also present important media for physicochemical innovation and insight based on their great flexibility and dynamicity, rapid diffusion of molecular components for mixing and rearrangements, as well as drastic spatial variation in the prevailing dielectric environment. Therefore, a comparative discussion of the relative merits of the properties of materials when positioned at solid or liquid surfaces would be informative regarding present-to-future developments of surface-based technologies. In this perspective article, recent research examples of nanoarchitectonics, molecular machines, DNA nanotechnology, and DNA origami are compared with respect to the type of surface used, i.e. solid surfaces vs. liquid surfaces, for future perspectives of interfacial physics and chemistry.

  19. An experimental study of the putative mechanism of a synthetic autonomous rotary DNA nanomotor

    NASA Astrophysics Data System (ADS)

    Dunn, K. E.; Leake, M. C.; Wollman, A. J. M.; Trefzer, M. A.; Johnson, S.; Tyrrell, A. M.

    2017-03-01

    DNA has been used to construct a wide variety of nanoscale molecular devices. Inspiration for such synthetic molecular machines is frequently drawn from protein motors, which are naturally occurring and ubiquitous. However, despite the fact that rotary motors such as ATP synthase and the bacterial flagellar motor play extremely important roles in nature, very few rotary devices have been constructed using DNA. This paper describes an experimental study of the putative mechanism of a rotary DNA nanomotor, which is based on strand displacement, the phenomenon that powers many synthetic linear DNA motors. Unlike other examples of rotary DNA machines, the device described here is designed to be capable of autonomous operation after it is triggered. The experimental results are consistent with operation of the motor as expected, and future work on an enhanced motor design may allow rotation to be observed at the single-molecule level. The rotary motor concept presented here has potential applications in molecular processing, DNA computing, biosensing and photonics.

  20. Mesoscale imaging with cryo-light and X-rays: Larger than molecular machines, smaller than a cell: Mesoscale imaging with cryo-light and X-rays

    DOE PAGES

    Ekman, Axel A.; Chen, Jian-Hua; Guo, Jessica; ...

    2016-11-14

    In the context of cell biology, the term mesoscale describes length scales ranging from that of an individual cell, down to the size of the molecular machines. In this spatial regime, small building blocks self-organise to form large, functional structures. A comprehensive set of rules governing mesoscale self-organisation has not been established, making the prediction of many cell behaviours difficult, if not impossible. Our knowledge of mesoscale biology comes from experimental data, in particular, imaging. Here, we explore the application of soft X-ray tomography (SXT) to imaging the mesoscale, and describe the structural insights this technology can generate. We alsomore » discuss how SXT imaging is complemented by the addition of correlative fluorescence data measured from the same cell. This combination of two discrete imaging modalities produces a 3D view of the cell that blends high-resolution structural information with precise molecular localisation data.« less

  1. Implementing Molecular Dynamics for Hybrid High Performance Computers - 1. Short Range Forces

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Brown, W Michael; Wang, Peng; Plimpton, Steven J

    The use of accelerators such as general-purpose graphics processing units (GPGPUs) have become popular in scientific computing applications due to their low cost, impressive floating-point capabilities, high memory bandwidth, and low electrical power requirements. Hybrid high performance computers, machines with more than one type of floating-point processor, are now becoming more prevalent due to these advantages. In this work, we discuss several important issues in porting a large molecular dynamics code for use on parallel hybrid machines - 1) choosing a hybrid parallel decomposition that works on central processing units (CPUs) with distributed memory and accelerator cores with shared memory,more » 2) minimizing the amount of code that must be ported for efficient acceleration, 3) utilizing the available processing power from both many-core CPUs and accelerators, and 4) choosing a programming model for acceleration. We present our solution to each of these issues for short-range force calculation in the molecular dynamics package LAMMPS. We describe algorithms for efficient short range force calculation on hybrid high performance machines. We describe a new approach for dynamic load balancing of work between CPU and accelerator cores. We describe the Geryon library that allows a single code to compile with both CUDA and OpenCL for use on a variety of accelerators. Finally, we present results on a parallel test cluster containing 32 Fermi GPGPUs and 180 CPU cores.« less

  2. Alchemical Free Energy Calculations for Nucleotide Mutations in Protein-DNA Complexes.

    PubMed

    Gapsys, Vytautas; de Groot, Bert L

    2017-12-12

    Nucleotide-sequence-dependent interactions between proteins and DNA are responsible for a wide range of gene regulatory functions. Accurate and generalizable methods to evaluate the strength of protein-DNA binding have long been sought. While numerous computational approaches have been developed, most of them require fitting parameters to experimental data to a certain degree, e.g., machine learning algorithms or knowledge-based statistical potentials. Molecular-dynamics-based free energy calculations offer a robust, system-independent, first-principles-based method to calculate free energy differences upon nucleotide mutation. We present an automated procedure to set up alchemical MD-based calculations to evaluate free energy changes occurring as the result of a nucleotide mutation in DNA. We used these methods to perform a large-scale mutation scan comprising 397 nucleotide mutation cases in 16 protein-DNA complexes. The obtained prediction accuracy reaches 5.6 kJ/mol average unsigned deviation from experiment with a correlation coefficient of 0.57 with respect to the experimentally measured free energies. Overall, the first-principles-based approach performed on par with the molecular modeling approaches Rosetta and FoldX. Subsequently, we utilized the MD-based free energy calculations to construct protein-DNA binding profiles for the zinc finger protein Zif268. The calculation results compare remarkably well with the experimentally determined binding profiles. The software automating the structure and topology setup for alchemical calculations is a part of the pmx package; the utilities have also been made available online at http://pmx.mpibpc.mpg.de/dna_webserver.html .

  3. Nano-scale characterization of the dynamics of the chloroplast Toc translocon.

    PubMed

    Reddick, L Evan; Chotewutmontri, Prakitchai; Crenshaw, Will; Dave, Ashita; Vaughn, Michael; Bruce, Barry D

    2008-01-01

    Translocons are macromolecular nano-scale machines that facilitate the selective translocation of proteins across membranes. Although common in function, different translocons have evolved diverse molecular mechanisms for protein translocation. Subcellular organelles of endosymbiotic origin such as the chloroplast and mitochondria had to evolve/acquire translocons capable of importing proteins whose genes were transferred to the host genome. These gene products are expressed on cytosolic ribosomes as precursor proteins and targeted back to the organelle by an N-terminal extension called the transit peptide or presequence. In chloroplasts the transit peptide is specifically recognized by the Translocon of the Outer Chloroplast membrane (Toc) which is composed of receptor GTPases that potentially function as gate-like switches, where GTP binding and hydrolysis somehow facilitate preprotein binding and translocation. Compared to other translocons, the dynamics of the Toc translocon are probably more complex and certainly less understood. We have developed biochemical/biophysical, imaging, and computational techniques to probe the dynamics of the Toc translocon at the nanoscale. In this chapter we provide detailed protocols for kinetic and binding analysis of precursor interactions in organeller, measurement of the activity and nucleotide binding of the Toc GTPases, native electrophoretic analysis of the assembly/organization of the Toc complex, visualization of the distribution and mobility of Toc apparatus on the surface of chloroplasts, and conclude with the identification and molecular modeling Toc75 POTRA domains. With these new methodologies we discuss future directions of the field.

  4. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Li, Ang; Song, Shuaiwen; Brugel, Eric

    To continuously comply with Moore’s Law, modern parallel machines become increasingly complex. Effectively tuning application performance for these machines therefore becomes a daunting task. Moreover, identifying performance bottlenecks at application and architecture level, as well as evaluating various optimization strategies, are becoming extremely difficult when the entanglement of numerous correlated factors is being presented. To tackle these challenges, we present a visual analytical model named “X”. It is intuitive and sufficiently flexible to track all the typical features of a parallel machine.

  5. Design Guidelines and Criteria for User/Operator Transactions with Battlefield Automated Systems. Volume 5. Background Literature

    DTIC Science & Technology

    1981-02-01

    the machine . ARI’s efforts in this area focus on human perfor- mance problems related to interactions with command and control centers, and on issues...improvement of the user- machine interface. Lacking consistent design principles, current practice results in a fragmented and unsystematic approach to system...complexity in the user- machine interface of BAS, ARI supported this effort for develop- me:nt of an online language for Army tactical intelligence

  6. Grindability and mechanical property of ceramics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Guo, Changsheng; Chand, R.H.

    1996-12-31

    For cost-effective ceramic machining, material-specific machining methodology is needed. This requires characterizing ceramics from machining view point. In this paper, a preliminary study of the correlation between grindability and mechanical properties is reported. Results indicate that there exists complex correlations between grindability and mechanical properties such as hardness, fracture toughness and elasticity. Some ceramics of similar mechanical properties have different grindabilities, which implies that it is possible to develop ceramics of both superior mechanical properties and good grindability.

  7. Sparse dynamical Boltzmann machine for reconstructing complex networks with binary dynamics

    NASA Astrophysics Data System (ADS)

    Chen, Yu-Zhong; Lai, Ying-Cheng

    2018-03-01

    Revealing the structure and dynamics of complex networked systems from observed data is a problem of current interest. Is it possible to develop a completely data-driven framework to decipher the network structure and different types of dynamical processes on complex networks? We develop a model named sparse dynamical Boltzmann machine (SDBM) as a structural estimator for complex networks that host binary dynamical processes. The SDBM attains its topology according to that of the original system and is capable of simulating the original binary dynamical process. We develop a fully automated method based on compressive sensing and a clustering algorithm to construct the SDBM. We demonstrate, for a variety of representative dynamical processes on model and real world complex networks, that the equivalent SDBM can recover the network structure of the original system and simulates its dynamical behavior with high precision.

  8. Sparse dynamical Boltzmann machine for reconstructing complex networks with binary dynamics.

    PubMed

    Chen, Yu-Zhong; Lai, Ying-Cheng

    2018-03-01

    Revealing the structure and dynamics of complex networked systems from observed data is a problem of current interest. Is it possible to develop a completely data-driven framework to decipher the network structure and different types of dynamical processes on complex networks? We develop a model named sparse dynamical Boltzmann machine (SDBM) as a structural estimator for complex networks that host binary dynamical processes. The SDBM attains its topology according to that of the original system and is capable of simulating the original binary dynamical process. We develop a fully automated method based on compressive sensing and a clustering algorithm to construct the SDBM. We demonstrate, for a variety of representative dynamical processes on model and real world complex networks, that the equivalent SDBM can recover the network structure of the original system and simulates its dynamical behavior with high precision.

  9. Intelligent Techniques Using Molecular Data Analysis in Leukaemia: An Opportunity for Personalized Medicine Support System

    PubMed Central

    Adelson, David; Brown, Fred; Chaudhri, Naeem

    2017-01-01

    The use of intelligent techniques in medicine has brought a ray of hope in terms of treating leukaemia patients. Personalized treatment uses patient's genetic profile to select a mode of treatment. This process makes use of molecular technology and machine learning, to determine the most suitable approach to treating a leukaemia patient. Until now, no reviews have been published from a computational perspective concerning the development of personalized medicine intelligent techniques for leukaemia patients using molecular data analysis. This review studies the published empirical research on personalized medicine in leukaemia and synthesizes findings across studies related to intelligence techniques in leukaemia, with specific attention to particular categories of these studies to help identify opportunities for further research into personalized medicine support systems in chronic myeloid leukaemia. A systematic search was carried out to identify studies using intelligence techniques in leukaemia and to categorize these studies based on leukaemia type and also the task, data source, and purpose of the studies. Most studies used molecular data analysis for personalized medicine, but future advancement for leukaemia patients requires molecular models that use advanced machine-learning methods to automate decision-making in treatment management to deliver supportive medical information to the patient in clinical practice. PMID:28812013

  10. Intelligent Techniques Using Molecular Data Analysis in Leukaemia: An Opportunity for Personalized Medicine Support System.

    PubMed

    Banjar, Haneen; Adelson, David; Brown, Fred; Chaudhri, Naeem

    2017-01-01

    The use of intelligent techniques in medicine has brought a ray of hope in terms of treating leukaemia patients. Personalized treatment uses patient's genetic profile to select a mode of treatment. This process makes use of molecular technology and machine learning, to determine the most suitable approach to treating a leukaemia patient. Until now, no reviews have been published from a computational perspective concerning the development of personalized medicine intelligent techniques for leukaemia patients using molecular data analysis. This review studies the published empirical research on personalized medicine in leukaemia and synthesizes findings across studies related to intelligence techniques in leukaemia, with specific attention to particular categories of these studies to help identify opportunities for further research into personalized medicine support systems in chronic myeloid leukaemia. A systematic search was carried out to identify studies using intelligence techniques in leukaemia and to categorize these studies based on leukaemia type and also the task, data source, and purpose of the studies. Most studies used molecular data analysis for personalized medicine, but future advancement for leukaemia patients requires molecular models that use advanced machine-learning methods to automate decision-making in treatment management to deliver supportive medical information to the patient in clinical practice.

  11. Teaching Machines and Programmed Instruction.

    ERIC Educational Resources Information Center

    Kay, Harry; And Others

    The various devices used in programed instruction range from the simple linear programed book to branching and skip branching programs, adaptive teaching machines, and even complex computer based systems. In order to provide a background for the would-be programer, the essential principles of each of these devices is outlined. Different ideas of…

  12. IQ Tests Are Not for Machines, Yet

    ERIC Educational Resources Information Center

    Dowe, David L.; Hernandez-Orallo, Jose

    2012-01-01

    Complex, but specific, tasks--such as chess or "Jeopardy!"--are popularly seen as milestones for artificial intelligence (AI). However, they are not appropriate for evaluating the intelligence of machines or measuring the progress in AI. Aware of this delusion, Detterman has recently raised a challenge prompting AI researchers to evaluate their…

  13. Atwood's Machine as a Tool to Introduce Variable Mass Systems

    ERIC Educational Resources Information Center

    de Sousa, Celia A.

    2012-01-01

    This article discusses an instructional strategy which explores eventual similarities and/or analogies between familiar problems and more sophisticated systems. In this context, the Atwood's machine problem is used to introduce students to more complex problems involving ropes and chains. The methodology proposed helps students to develop the…

  14. Molecular complexes in close and far away

    PubMed Central

    Klemperer, William; Vaida, Veronica

    2006-01-01

    In this review, gas-phase chemistry of interstellar media and some planetary atmospheres is extended to include molecular complexes. Although the composition, density, and temperature of the environments discussed are very different, molecular complexes have recently been considered as potential contributors to chemistry. The complexes reviewed include strongly bound aggregates of molecules with ions, intermediate-strength hydrogen bonded complexes (primarily hydrates), and weakly bonded van der Waals molecules. In low-density, low-temperature environments characteristic of giant molecular clouds, molecular synthesis, known to involve gas-phase ion-molecule reactions and chemistry at the surface of dust and ice grains is extended here to involve molecular ionic clusters. At the high density and high temperatures found on planetary atmospheres, molecular complexes contribute to both atmospheric chemistry and climate. Using the observational, laboratory, and theoretical database, the role of molecular complexes in close and far away is discussed. PMID:16740667

  15. Machine Learning Force Field Parameters from Ab Initio Data

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Li, Ying; Li, Hui; Pickard, Frank C.

    Machine learning (ML) techniques with the genetic algorithm (GA) have been applied to determine a polarizable force field parameters using only ab initio data from quantum mechanics (QM) calculations of molecular clusters at the MP2/6-31G(d,p), DFMP2(fc)/jul-cc-pVDZ, and DFMP2(fc)/jul-cc-pVTZ levels to predict experimental condensed phase properties (i.e., density and heat of vaporization). The performance of this ML/GA approach is demonstrated on 4943 dimer electrostatic potentials and 1250 cluster interaction energies for methanol. Excellent agreement between the training data set from QM calculations and the optimized force field model was achieved. The results were further improved by introducing an offset factor duringmore » the machine learning process to compensate for the discrepancy between the QM calculated energy and the energy reproduced by optimized force field, while maintaining the local “shape” of the QM energy surface. Throughout the machine learning process, experimental observables were not involved in the objective function, but were only used for model validation. The best model, optimized from the QM data at the DFMP2(fc)/jul-cc-pVTZ level, appears to perform even better than the original AMOEBA force field (amoeba09.prm), which was optimized empirically to match liquid properties. The present effort shows the possibility of using machine learning techniques to develop descriptive polarizable force field using only QM data. The ML/GA strategy to optimize force fields parameters described here could easily be extended to other molecular systems.« less

  16. Laser cutting: influence on morphological and physicochemical properties of polyhydroxybutyrate.

    PubMed

    Lootz, D; Behrend, D; Kramer, S; Freier, T; Haubold, A; Benkiesser, G; Schmitz, K P; Becher, B

    2001-09-01

    Polyhydroxybutyrate (PHB) is a biocompatible and resorbable implant material. For these reasons, it has been used for the fabrication of temporary stents, bone plates, nails and screws (Peng et al. Biomaterials 1996;17:685). In some cases, the brittle mechanical properties of PHB homopolymer limit its application. A typical plasticizer, triethylcitrate (TEC), was used to overcome such limitations by making the material more pliable. In the past few years, CO2-laser cutting of PHB was used in the manufacturing of small medical devices such as stents. Embrittlement of plasticized PHB tubes has been observed, after laser machining. Consequently, the physicochemical and morphological properties of laser-processed surfaces and cut edges of plasticized polymer samples were examined to determine the extent of changes in polymer properties as a result of laser machining. These studies included determination of the depth of the laser-induced heat affected zone by polariscopy of thin polymer sections. Molecular weight changes and changes in the TEC content as a function of distance from the laser-cut edge were determined. In a preliminary test, the cellular response to the processed material was investigated by cell culture study of L929 mouse fibroblasts on laser-machined surfaces. The heat-affected zone was readily classified into four different regions with a total depth of about 60 to 100 microm (Klamp, Master Thesis, University of Rostock, 1998). These results correspond well with the chemical analysis and molecular weight measurements. Furthermore, it was found that cells grew preferentially on the laser-machined area. These findings have significant implications for the manufacture of medical implants from PHB by laser machining.

  17. Investigation of approximate models of experimental temperature characteristics of machines

    NASA Astrophysics Data System (ADS)

    Parfenov, I. V.; Polyakov, A. N.

    2018-05-01

    This work is devoted to the investigation of various approaches to the approximation of experimental data and the creation of simulation mathematical models of thermal processes in machines with the aim of finding ways to reduce the time of their field tests and reducing the temperature error of the treatments. The main methods of research which the authors used in this work are: the full-scale thermal testing of machines; realization of various approaches at approximation of experimental temperature characteristics of machine tools by polynomial models; analysis and evaluation of modelling results (model quality) of the temperature characteristics of machines and their derivatives up to the third order in time. As a result of the performed researches, rational methods, type, parameters and complexity of simulation mathematical models of thermal processes in machine tools are proposed.

  18. Research on intelligent machine self-perception method based on LSTM

    NASA Astrophysics Data System (ADS)

    Wang, Qiang; Cheng, Tao

    2018-05-01

    In this paper, we use the advantages of LSTM in feature extraction and processing high-dimensional and complex nonlinear data, and apply it to the autonomous perception of intelligent machines. Compared with the traditional multi-layer neural network, this model has memory, can handle time series information of any length. Since the multi-physical domain signals of processing machines have a certain timing relationship, and there is a contextual relationship between states and states, using this deep learning method to realize the self-perception of intelligent processing machines has strong versatility and adaptability. The experiment results show that the method proposed in this paper can obviously improve the sensing accuracy under various working conditions of the intelligent machine, and also shows that the algorithm can well support the intelligent processing machine to realize self-perception.

  19. The methodic of calculation for the need of basic construction machines on construction site when developing organizational and technological documentation

    NASA Astrophysics Data System (ADS)

    Zhadanovsky, Boris; Sinenko, Sergey

    2018-03-01

    Economic indicators of construction work, particularly in high-rise construction, are directly related to the choice of optimal number of machines. The shortage of machinery makes it impossible to complete the construction & installation work on scheduled time. Rates of performance of construction & installation works and labor productivity during high-rise construction largely depend on the degree of provision of construction project with machines (level of work mechanization). During calculation of the need for machines in construction projects, it is necessary to ensure that work is completed on scheduled time, increased level of complex mechanization, increased productivity and reduction of manual work, and improved usage and maintenance of machine fleet. The selection of machines and determination of their numbers should be carried out by using formulas presented in this work.

  20. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hill, Mary Ann; Dombrowski, David E.; Clarke, Kester Diederik

    U-10 wt. % Mo (U-10Mo) alloys are being developed as low enrichment monolithic fuel for the CONVERT program. Optimization of processing for the monolithic fuel is being pursued with the use of electrical discharge machining (EDM) under CONVERT HPRR WBS 1.2.4.5 Optimization of Coupon Preparation. The process is applicable to manufacturing experimental fuel plate specimens for the Mini-Plate-1 (MP-1) irradiation campaign. The benefits of EDM are reduced machining costs, ability to achieve higher tolerances, stress-free, burr-free surfaces eliminating the need for milling, and the ability to machine complex shapes. Kerf losses are much smaller with EDM (tenths of mm) comparedmore » to conventional machining (mm). Reliable repeatability is achievable with EDM due to its computer-generated machining programs.« less

  1. State machine analysis of sensor data from dynamic processes

    DOEpatents

    Cook, William R.; Brabson, John M.; Deland, Sharon M.

    2003-12-23

    A state machine model analyzes sensor data from dynamic processes at a facility to identify the actual processes that were performed at the facility during a period of interest for the purpose of remote facility inspection. An inspector can further input the expected operations into the state machine model and compare the expected, or declared, processes to the actual processes to identify undeclared processes at the facility. The state machine analysis enables the generation of knowledge about the state of the facility at all levels, from location of physical objects to complex operational concepts. Therefore, the state machine method and apparatus may benefit any agency or business with sensored facilities that stores or manipulates expensive, dangerous, or controlled materials or information.

  2. Phenotyping: Using Machine Learning for Improved Pairwise Genotype Classification Based on Root Traits

    PubMed Central

    Zhao, Jiangsan; Bodner, Gernot; Rewald, Boris

    2016-01-01

    Phenotyping local crop cultivars is becoming more and more important, as they are an important genetic source for breeding – especially in regard to inherent root system architectures. Machine learning algorithms are promising tools to assist in the analysis of complex data sets; novel approaches are need to apply them on root phenotyping data of mature plants. A greenhouse experiment was conducted in large, sand-filled columns to differentiate 16 European Pisum sativum cultivars based on 36 manually derived root traits. Through combining random forest and support vector machine models, machine learning algorithms were successfully used for unbiased identification of most distinguishing root traits and subsequent pairwise cultivar differentiation. Up to 86% of pea cultivar pairs could be distinguished based on top five important root traits (Timp5) – Timp5 differed widely between cultivar pairs. Selecting top important root traits (Timp) provided a significant improved classification compared to using all available traits or randomly selected trait sets. The most frequent Timp of mature pea cultivars was total surface area of lateral roots originating from tap root segments at 0–5 cm depth. The high classification rate implies that culturing did not lead to a major loss of variability in root system architecture in the studied pea cultivars. Our results illustrate the potential of machine learning approaches for unbiased (root) trait selection and cultivar classification based on rather small, complex phenotypic data sets derived from pot experiments. Powerful statistical approaches are essential to make use of the increasing amount of (root) phenotyping information, integrating the complex trait sets describing crop cultivars. PMID:27999587

  3. Nondimensional parameter for conformal grinding: combining machine and process parameters

    NASA Astrophysics Data System (ADS)

    Funkenbusch, Paul D.; Takahashi, Toshio; Gracewski, Sheryl M.; Ruckman, Jeffrey L.

    1999-11-01

    Conformal grinding of optical materials with CNC (Computer Numerical Control) machining equipment can be used to achieve precise control over complex part configurations. However complications can arise due to the need to fabricate complex geometrical shapes at reasonable production rates. For example high machine stiffness is essential, but the need to grind 'inside' small or highly concave surfaces may require use of tooling with less than ideal stiffness characteristics. If grinding generates loads sufficient for significant tool deflection, the programmed removal depth will not be achieved. Moreover since grinding load is a function of the volumetric removal rate the amount of load deflection can vary with location on the part, potentially producing complex figure errors. In addition to machine/tool stiffness and removal rate, load generation is a function of the process parameters. For example by reducing the feed rate of the tool into the part, both the load and resultant deflection/removal error can be decreased. However this must be balanced against the need for part through put. In this paper a simple model which permits combination of machine stiffness and process parameters into a single non-dimensional parameter is adapted for a conformal grinding geometry. Errors in removal can be minimized by maintaining this parameter above a critical value. Moreover, since the value of this parameter depends on the local part geometry, it can be used to optimize process settings during grinding. For example it may be used to guide adjustment of the feed rate as a function of location on the part to eliminate figure errors while minimizing the total grinding time required.

  4. Hospital-acquired listeriosis linked to a persistently contaminated milkshake machine.

    PubMed

    Mazengia, E; Kawakami, V; Rietberg, K; Kay, M; Wyman, P; Skilton, C; Aberra, A; Boonyaratanakornkit, J; Limaye, A P; Pergam, S A; Whimbey, E; Olsen-Scribner, R J; Duchin, J S

    2017-04-01

    One case of hospital-acquired listeriosis was linked to milkshakes produced in a commercial-grade shake freezer machine. This machine was found to be contaminated with a strain of Listeria monocytogenes epidemiologically and molecularly linked to a contaminated pasteurized, dairy-based ice cream product at the same hospital a year earlier, despite repeated cleaning and sanitizing. Healthcare facilities should be aware of the potential for prolonged Listeria contamination of food service equipment. In addition, healthcare providers should consider counselling persons who have an increased risk for Listeria infections regarding foods that have caused Listeria infections. The prevalence of persistent Listeria contamination of commercial-grade milkshake machines in healthcare facilities and the risk associated with serving dairy-based ice cream products to hospitalized patients at increased risk for invasive L. monocytogenes infections should be further evaluated.

  5. Time-course human urine proteomics in space-flight simulation experiments.

    PubMed

    Binder, Hans; Wirth, Henry; Arakelyan, Arsen; Lembcke, Kathrin; Tiys, Evgeny S; Ivanisenko, Vladimir A; Kolchanov, Nikolay A; Kononikhin, Alexey; Popov, Igor; Nikolaev, Evgeny N; Pastushkova, Lyudmila; Larina, Irina M

    2014-01-01

    Long-term space travel simulation experiments enabled to discover different aspects of human metabolism such as the complexity of NaCl salt balance. Detailed proteomics data were collected during the Mars105 isolation experiment enabling a deeper insight into the molecular processes involved. We studied the abundance of about two thousand proteins extracted from urine samples of six volunteers collected weekly during a 105-day isolation experiment under controlled dietary conditions including progressive reduction of salt consumption. Machine learning using Self Organizing maps (SOM) in combination with different analysis tools was applied to describe the time trajectories of protein abundance in urine. The method enables a personalized and intuitive view on the physiological state of the volunteers. The abundance of more than one half of the proteins measured clearly changes in the course of the experiment. The trajectory splits roughly into three time ranges, an early (week 1-6), an intermediate (week 7-11) and a late one (week 12-15). Regulatory modes associated with distinct biological processes were identified using previous knowledge by applying enrichment and pathway flow analysis. Early protein activation modes can be related to immune response and inflammatory processes, activation at intermediate times to developmental and proliferative processes and late activations to stress and responses to chemicals. The protein abundance profiles support previous results about alternative mechanisms of salt storage in an osmotically inactive form. We hypothesize that reduced NaCl consumption of about 6 g/day presumably will reduce or even prevent the activation of inflammatory processes observed in the early time range of isolation. SOM machine learning in combination with analysis methods of class discovery and functional annotation enable the straightforward analysis of complex proteomics data sets generated by means of mass spectrometry.

  6. Big data need big theory too

    PubMed Central

    Dougherty, Edward R.; Highfield, Roger R.

    2016-01-01

    The current interest in big data, machine learning and data analytics has generated the widespread impression that such methods are capable of solving most problems without the need for conventional scientific methods of inquiry. Interest in these methods is intensifying, accelerated by the ease with which digitized data can be acquired in virtually all fields of endeavour, from science, healthcare and cybersecurity to economics, social sciences and the humanities. In multiscale modelling, machine learning appears to provide a shortcut to reveal correlations of arbitrary complexity between processes at the atomic, molecular, meso- and macroscales. Here, we point out the weaknesses of pure big data approaches with particular focus on biology and medicine, which fail to provide conceptual accounts for the processes to which they are applied. No matter their ‘depth’ and the sophistication of data-driven methods, such as artificial neural nets, in the end they merely fit curves to existing data. Not only do these methods invariably require far larger quantities of data than anticipated by big data aficionados in order to produce statistically reliable results, but they can also fail in circumstances beyond the range of the data used to train them because they are not designed to model the structural characteristics of the underlying system. We argue that it is vital to use theory as a guide to experimental design for maximal efficiency of data collection and to produce reliable predictive models and conceptual knowledge. Rather than continuing to fund, pursue and promote ‘blind’ big data projects with massive budgets, we call for more funding to be allocated to the elucidation of the multiscale and stochastic processes controlling the behaviour of complex systems, including those of life, medicine and healthcare. This article is part of the themed issue ‘Multiscale modelling at the physics–chemistry–biology interface’. PMID:27698035

  7. Big data need big theory too.

    PubMed

    Coveney, Peter V; Dougherty, Edward R; Highfield, Roger R

    2016-11-13

    The current interest in big data, machine learning and data analytics has generated the widespread impression that such methods are capable of solving most problems without the need for conventional scientific methods of inquiry. Interest in these methods is intensifying, accelerated by the ease with which digitized data can be acquired in virtually all fields of endeavour, from science, healthcare and cybersecurity to economics, social sciences and the humanities. In multiscale modelling, machine learning appears to provide a shortcut to reveal correlations of arbitrary complexity between processes at the atomic, molecular, meso- and macroscales. Here, we point out the weaknesses of pure big data approaches with particular focus on biology and medicine, which fail to provide conceptual accounts for the processes to which they are applied. No matter their 'depth' and the sophistication of data-driven methods, such as artificial neural nets, in the end they merely fit curves to existing data. Not only do these methods invariably require far larger quantities of data than anticipated by big data aficionados in order to produce statistically reliable results, but they can also fail in circumstances beyond the range of the data used to train them because they are not designed to model the structural characteristics of the underlying system. We argue that it is vital to use theory as a guide to experimental design for maximal efficiency of data collection and to produce reliable predictive models and conceptual knowledge. Rather than continuing to fund, pursue and promote 'blind' big data projects with massive budgets, we call for more funding to be allocated to the elucidation of the multiscale and stochastic processes controlling the behaviour of complex systems, including those of life, medicine and healthcare.This article is part of the themed issue 'Multiscale modelling at the physics-chemistry-biology interface'. © 2015 The Authors.

  8. Machine learning predictions of molecular properties: Accurate many-body potentials and nonlocality in chemical space

    DOE PAGES

    Hansen, Katja; Biegler, Franziska; Ramakrishnan, Raghunathan; ...

    2015-06-04

    Simultaneously accurate and efficient prediction of molecular properties throughout chemical compound space is a critical ingredient toward rational compound design in chemical and pharmaceutical industries. Aiming toward this goal, we develop and apply a systematic hierarchy of efficient empirical methods to estimate atomization and total energies of molecules. These methods range from a simple sum over atoms, to addition of bond energies, to pairwise interatomic force fields, reaching to the more sophisticated machine learning approaches that are capable of describing collective interactions between many atoms or bonds. In the case of equilibrium molecular geometries, even simple pairwise force fields demonstratemore » prediction accuracy comparable to benchmark energies calculated using density functional theory with hybrid exchange-correlation functionals; however, accounting for the collective many-body interactions proves to be essential for approaching the “holy grail” of chemical accuracy of 1 kcal/mol for both equilibrium and out-of-equilibrium geometries. This remarkable accuracy is achieved by a vectorized representation of molecules (so-called Bag of Bonds model) that exhibits strong nonlocality in chemical space. The same representation allows us to predict accurate electronic properties of molecules, such as their polarizability and molecular frontier orbital energies.« less

  9. Machine Learning Predictions of Molecular Properties: Accurate Many-Body Potentials and Nonlocality in Chemical Space

    PubMed Central

    2015-01-01

    Simultaneously accurate and efficient prediction of molecular properties throughout chemical compound space is a critical ingredient toward rational compound design in chemical and pharmaceutical industries. Aiming toward this goal, we develop and apply a systematic hierarchy of efficient empirical methods to estimate atomization and total energies of molecules. These methods range from a simple sum over atoms, to addition of bond energies, to pairwise interatomic force fields, reaching to the more sophisticated machine learning approaches that are capable of describing collective interactions between many atoms or bonds. In the case of equilibrium molecular geometries, even simple pairwise force fields demonstrate prediction accuracy comparable to benchmark energies calculated using density functional theory with hybrid exchange-correlation functionals; however, accounting for the collective many-body interactions proves to be essential for approaching the “holy grail” of chemical accuracy of 1 kcal/mol for both equilibrium and out-of-equilibrium geometries. This remarkable accuracy is achieved by a vectorized representation of molecules (so-called Bag of Bonds model) that exhibits strong nonlocality in chemical space. In addition, the same representation allows us to predict accurate electronic properties of molecules, such as their polarizability and molecular frontier orbital energies. PMID:26113956

  10. Omics approaches to individual variation: modeling networks and the virtual patient.

    PubMed

    Lehrach, Hans

    2016-09-01

    Every human is unique. We differ in our genomes, environment, behavior, disease history, and past and current medical treatment-a complex catalog of differences that often leads to variations in the way each of us responds to a particular therapy. We argue here that true personalization of drug therapies will rely on "virtual patient" models based on a detailed characterization of the individual patient by molecular, imaging, and sensor techniques. The models will be based, wherever possible, on the molecular mechanisms of disease processes and drug action but can also expand to hybrid models including statistics/machine learning/artificial intelligence-based elements trained on available data to address therapeutic areas or therapies for which insufficient information on mechanisms is available. Depending on the disease, its mechanisms, and the therapy, virtual patient models can be implemented at a fairly high level of abstraction, with molecular models representing cells, cell types, or organs relevant to the clinical question, interacting not only with each other but also the environment. In the future, "virtual patient/in-silico self" models may not only become a central element of our health care system, reducing otherwise unavoidable mistakes and unnecessary costs, but also act as "guardian angels" accompanying us through life to protect us against dangers and to help us to deal intelligently with our own health and wellness.

  11. Omics approaches to individual variation: modeling networks and the virtual patient

    PubMed Central

    Lehrach, Hans

    2016-01-01

    Every human is unique. We differ in our genomes, environment, behavior, disease history, and past and current medical treatment—a complex catalog of differences that often leads to variations in the way each of us responds to a particular therapy. We argue here that true personalization of drug therapies will rely on “virtual patient” models based on a detailed characterization of the individual patient by molecular, imaging, and sensor techniques. The models will be based, wherever possible, on the molecular mechanisms of disease processes and drug action but can also expand to hybrid models including statistics/machine learning/artificial intelligence-based elements trained on available data to address therapeutic areas or therapies for which insufficient information on mechanisms is available. Depending on the disease, its mechanisms, and the therapy, virtual patient models can be implemented at a fairly high level of abstraction, with molecular models representing cells, cell types, or organs relevant to the clinical question, interacting not only with each other but also the environment. In the future, “virtual patient/in-silico self” models may not only become a central element of our health care system, reducing otherwise unavoidable mistakes and unnecessary costs, but also act as “guardian angels” accompanying us through life to protect us against dangers and to help us to deal intelligently with our own health and wellness. PMID:27757060

  12. History in the gene: negotiations between molecular and organismal anthropology.

    PubMed

    Sommer, Marianne

    2008-01-01

    In the advertising discourse of human genetic database projects, of genetic ancestry tracing companies, and in popular books on anthropological genetics, what I refer to as the anthropological gene and genome appear as documents of human history, by far surpassing the written record and oral history in scope and accuracy as archives of our past. How did macromolecules become "documents of human evolutionary history"? Historically, molecular anthropology, a term introduced by Emile Zuckerkandl in 1962 to characterize the study of primate phylogeny and human evolution on the molecular level, asserted its claim to the privilege of interpretation regarding hominoid, hominid, and human phylogeny and evolution vis-à-vis other historical sciences such as evolutionary biology, physical anthropology, and paleoanthropology. This process will be discussed on the basis of three key conferences on primate classification and evolution that brought together exponents of the respective fields and that were held in approximately ten-years intervals between the early 1960s and the 1980s. I show how the anthropological gene and genome gained their status as the most fundamental, clean, and direct records of historical information, and how the prioritizing of these epistemic objects was part of a complex involving the objectivity of numbers, logic, and mathematics, the objectivity of machines and instruments, and the objectivity seen to reside in the epistemic objects themselves.

  13. KSC-2012-3081

    NASA Image and Video Library

    2012-05-25

    CAPE CANAVERAL, Fla. – A team of competitors works with its machine during NASA's Lunabotics Mining Competition at the Kennedy Space Center Visitor Complex in Florida. The competition challenges university students to build machines that can collect soil such as the material found on the moon. Working inside the Caterpillar LunArena, the robotic craft dig soil that simulates lunar material. The event is judged by a machine's abilities to collect the soil, its design and operation, size, dust tolerance and its level of autonomy. Photo credit: NASA/Glenn Benson

  14. KSC-2012-3076

    NASA Image and Video Library

    2012-05-25

    CAPE CANAVERAL, Fla. – A team of competitors works with its machine during NASA's Lunabotics Mining Competition at the Kennedy Space Center Visitor Complex in Florida. The competition challenges university students to build machines that can collect soil such as the material found on the moon. Working inside the Caterpillar LunArena, the robotic craft dig soil that simulates lunar material. The event is judged by a machine's abilities to collect the soil, its design and operation, size, dust tolerance and its level of autonomy. Photo credit: NASA/Glenn Benson

  15. KSC-2012-3080

    NASA Image and Video Library

    2012-05-25

    CAPE CANAVERAL, Fla. – A team of competitors works with its machine during NASA's Lunabotics Mining Competition at the Kennedy Space Center Visitor Complex in Florida. The competition challenges university students to build machines that can collect soil such as the material found on the moon. Working inside the Caterpillar LunArena, the robotic craft dig soil that simulates lunar material. The event is judged by a machine's abilities to collect the soil, its design and operation, size, dust tolerance and its level of autonomy. Photo credit: NASA/Glenn Benson

  16. KSC-2012-3079

    NASA Image and Video Library

    2012-05-25

    CAPE CANAVERAL, Fla. – A team of competitors works with its machine during NASA's Lunabotics Mining Competition at the Kennedy Space Center Visitor Complex in Florida. The competition challenges university students to build machines that can collect soil such as the material found on the moon. Working inside the Caterpillar LunArena, the robotic craft dig soil that simulates lunar material. The event is judged by a machine's abilities to collect the soil, its design and operation, size, dust tolerance and its level of autonomy. Photo credit: NASA/Glenn Benson

  17. Cognitive learning: a machine learning approach for automatic process characterization from design

    NASA Astrophysics Data System (ADS)

    Foucher, J.; Baderot, J.; Martinez, S.; Dervilllé, A.; Bernard, G.

    2018-03-01

    Cutting edge innovation requires accurate and fast process-control to obtain fast learning rate and industry adoption. Current tools available for such task are mainly manual and user dependent. We present in this paper cognitive learning, which is a new machine learning based technique to facilitate and to speed up complex characterization by using the design as input, providing fast training and detection time. We will focus on the machine learning framework that allows object detection, defect traceability and automatic measurement tools.

  18. Complex extreme learning machine applications in terahertz pulsed signals feature sets.

    PubMed

    Yin, X-X; Hadjiloucas, S; Zhang, Y

    2014-11-01

    This paper presents a novel approach to the automatic classification of very large data sets composed of terahertz pulse transient signals, highlighting their potential use in biochemical, biomedical, pharmaceutical and security applications. Two different types of THz spectra are considered in the classification process. Firstly a binary classification study of poly-A and poly-C ribonucleic acid samples is performed. This is then contrasted with a difficult multi-class classification problem of spectra from six different powder samples that although have fairly indistinguishable features in the optical spectrum, they also possess a few discernable spectral features in the terahertz part of the spectrum. Classification is performed using a complex-valued extreme learning machine algorithm that takes into account features in both the amplitude as well as the phase of the recorded spectra. Classification speed and accuracy are contrasted with that achieved using a support vector machine classifier. The study systematically compares the classifier performance achieved after adopting different Gaussian kernels when separating amplitude and phase signatures. The two signatures are presented as feature vectors for both training and testing purposes. The study confirms the utility of complex-valued extreme learning machine algorithms for classification of the very large data sets generated with current terahertz imaging spectrometers. The classifier can take into consideration heterogeneous layers within an object as would be required within a tomographic setting and is sufficiently robust to detect patterns hidden inside noisy terahertz data sets. The proposed study opens up the opportunity for the establishment of complex-valued extreme learning machine algorithms as new chemometric tools that will assist the wider proliferation of terahertz sensing technology for chemical sensing, quality control, security screening and clinic diagnosis. Furthermore, the proposed algorithm should also be very useful in other applications requiring the classification of very large datasets. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  19. Data Intensive Analysis of Biomolecular Simulations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Straatsma, TP; Soares, Thereza A.

    2007-12-01

    The advances in biomolecular modeling and simulation made possible by the availability of increasingly powerful high performance computing resources is extending molecular simulations to biological more relevant system size and time scales. At the same time, advances in simulation methodologies are allowing more complex processes to be described more accurately. These developments make a systems approach to computational structural biology feasible, but this will require a focused emphasis on the comparative analysis of the increasing number of molecular simulations that are being carried out for biomolecular systems with more realistic models, multi-component environments, and for longer simulation times. Just asmore » in the case of the analysis of the large data sources created by the new high-throughput experimental technologies, biomolecular computer simulations contribute to the progress in biology through comparative analysis. The continuing increase in available protein structures allows the comparative analysis of the role of structure and conformational flexibility in protein function, and is the foundation of the discipline of structural bioinformatics. This creates the opportunity to derive general findings from the comparative analysis of molecular dynamics simulations of a wide range of proteins, protein-protein complexes and other complex biological systems. Because of the importance of protein conformational dynamics for protein function, it is essential that the analysis of molecular trajectories is carried out using a novel, more integrative and systematic approach. We are developing a much needed rigorous computer science based framework for the efficient analysis of the increasingly large data sets resulting from molecular simulations. Such a suite of capabilities will also provide the required tools for access and analysis of a distributed library of generated trajectories. Our research is focusing on the following areas: (1) the development of an efficient analysis framework for very large scale trajectories on massively parallel architectures, (2) the development of novel methodologies that allow automated detection of events in these very large data sets, and (3) the efficient comparative analysis of multiple trajectories. The goal of the presented work is the development of new algorithms that will allow biomolecular simulation studies to become an integral tool to address the challenges of post-genomic biological research. The strategy to deliver the required data intensive computing applications that can effectively deal with the volume of simulation data that will become available is based on taking advantage of the capabilities offered by the use of large globally addressable memory architectures. The first requirement is the design of a flexible underlying data structure for single large trajectories that will form an adaptable framework for a wide range of analysis capabilities. The typical approach to trajectory analysis is to sequentially process trajectories time frame by time frame. This is the implementation found in molecular simulation codes such as NWChem, and has been designed in this way to be able to run on workstation computers and other architectures with an aggregate amount of memory that would not allow entire trajectories to be held in core. The consequence of this approach is an I/O dominated solution that scales very poorly on parallel machines. We are currently using an approach of developing tools specifically intended for use on large scale machines with sufficient main memory that entire trajectories can be held in core. This greatly reduces the cost of I/O as trajectories are read only once during the analysis. In our current Data Intensive Analysis (DIANA) implementation, each processor determines and skips to the entry within the trajectory that typically will be available in multiple files and independently from all other processors read the appropriate frames.« less

  20. Data Intensive Analysis of Biomolecular Simulations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Straatsma, TP

    2008-03-01

    The advances in biomolecular modeling and simulation made possible by the availability of increasingly powerful high performance computing resources is extending molecular simulations to biological more relevant system size and time scales. At the same time, advances in simulation methodologies are allowing more complex processes to be described more accurately. These developments make a systems approach to computational structural biology feasible, but this will require a focused emphasis on the comparative analysis of the increasing number of molecular simulations that are being carried out for biomolecular systems with more realistic models, multi-component environments, and for longer simulation times. Just asmore » in the case of the analysis of the large data sources created by the new high-throughput experimental technologies, biomolecular computer simulations contribute to the progress in biology through comparative analysis. The continuing increase in available protein structures allows the comparative analysis of the role of structure and conformational flexibility in protein function, and is the foundation of the discipline of structural bioinformatics. This creates the opportunity to derive general findings from the comparative analysis of molecular dynamics simulations of a wide range of proteins, protein-protein complexes and other complex biological systems. Because of the importance of protein conformational dynamics for protein function, it is essential that the analysis of molecular trajectories is carried out using a novel, more integrative and systematic approach. We are developing a much needed rigorous computer science based framework for the efficient analysis of the increasingly large data sets resulting from molecular simulations. Such a suite of capabilities will also provide the required tools for access and analysis of a distributed library of generated trajectories. Our research is focusing on the following areas: (1) the development of an efficient analysis framework for very large scale trajectories on massively parallel architectures, (2) the development of novel methodologies that allow automated detection of events in these very large data sets, and (3) the efficient comparative analysis of multiple trajectories. The goal of the presented work is the development of new algorithms that will allow biomolecular simulation studies to become an integral tool to address the challenges of post-genomic biological research. The strategy to deliver the required data intensive computing applications that can effectively deal with the volume of simulation data that will become available is based on taking advantage of the capabilities offered by the use of large globally addressable memory architectures. The first requirement is the design of a flexible underlying data structure for single large trajectories that will form an adaptable framework for a wide range of analysis capabilities. The typical approach to trajectory analysis is to sequentially process trajectories time frame by time frame. This is the implementation found in molecular simulation codes such as NWChem, and has been designed in this way to be able to run on workstation computers and other architectures with an aggregate amount of memory that would not allow entire trajectories to be held in core. The consequence of this approach is an I/O dominated solution that scales very poorly on parallel machines. We are currently using an approach of developing tools specifically intended for use on large scale machines with sufficient main memory that entire trajectories can be held in core. This greatly reduces the cost of I/O as trajectories are read only once during the analysis. In our current Data Intensive Analysis (DIANA) implementation, each processor determines and skips to the entry within the trajectory that typically will be available in multiple files and independently from all other processors read the appropriate frames.« less

  1. Endoglucanase Peripheral Loops Facilitate Complexation of Glucan Chains on Cellulose via Adaptive Coupling to the Emergent Substrate Structures

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lin, Yuchun; Beckham, Gregg T.; Himmel, Michael E.

    We examine how the catalytic domain of a glycoside hydrolase family 7 endoglucanase catalytic domain (Cel7B CD) facilitates complexation of cellulose chains from a crystal surface. With direct relevance to the science of biofuel production, this problem also represents a model system of biopolymer processing by proteins in Nature. Interactions of Cel7B CD with a cellulose microfibril along different paths of complexation are characterized by mapping the atomistic fluctuations recorded in free-energy simulations onto the parameters of a coarse-grain model. The resulting patterns of protein-biopolymer couplings also uncover the sequence signatures of the enzyme in peeling off glucan chains frommore » the microfibril substrate. We show that the semiopen active site of Cel7B CD exhibits similar barriers and free energies of complexation over two distinct routes; namely, scooping of a chain into the active-site cleft and threading from the chain end into the channel. On the other hand, the complexation energetics strongly depends on the surface packing of the targeted chain and the resulting interaction sites with the enzyme. A revealed principle is that Cel7B CD facilitates cellulose deconstruction via adaptive coupling to the emergent substrate. The flexible, peripheral segments of the protein outside of the active-site cleft are able to accommodate the varying features of cellulose along the simulated paths of complexation. The general strategy of linking physics-based molecular interactions to protein sequence could also be helpful in elucidating how other protein machines process biopolymers.« less

  2. Molecular Thermodynamics for Cell Biology as Taught with Boxes

    PubMed Central

    Mayorga, Luis S.; López, María José; Becker, Wayne M.

    2012-01-01

    Thermodynamic principles are basic to an understanding of the complex fluxes of energy and information required to keep cells alive. These microscopic machines are nonequilibrium systems at the micron scale that are maintained in pseudo-steady-state conditions by very sophisticated processes. Therefore, several nonstandard concepts need to be taught to rationalize why these very ordered systems proliferate actively all over our planet in seeming contradiction to the second law of thermodynamics. We propose a model consisting of boxes with different shapes that contain small balls that are in constant motion due to a stream of air blowing from below. This is a simple macroscopic system that can be easily visualized by students and that can be understood as mimicking the behavior of a set of molecules exchanging energy. With such boxes, the basic concepts of entropy, enthalpy, and free energy can be taught while reinforcing a molecular understanding of the concepts and stressing the stochastic nature of the thermodynamic laws. In addition, time-related concepts, such as reaction rates and activation energy, can be readily visualized. Moreover, the boxes provide an intuitive way to introduce the role in cellular organization of “information” and Maxwell's demons operating under nonequilibrium conditions. PMID:22383615

  3. Molecular thermodynamics for cell biology as taught with boxes.

    PubMed

    Mayorga, Luis S; López, María José; Becker, Wayne M

    2012-01-01

    Thermodynamic principles are basic to an understanding of the complex fluxes of energy and information required to keep cells alive. These microscopic machines are nonequilibrium systems at the micron scale that are maintained in pseudo-steady-state conditions by very sophisticated processes. Therefore, several nonstandard concepts need to be taught to rationalize why these very ordered systems proliferate actively all over our planet in seeming contradiction to the second law of thermodynamics. We propose a model consisting of boxes with different shapes that contain small balls that are in constant motion due to a stream of air blowing from below. This is a simple macroscopic system that can be easily visualized by students and that can be understood as mimicking the behavior of a set of molecules exchanging energy. With such boxes, the basic concepts of entropy, enthalpy, and free energy can be taught while reinforcing a molecular understanding of the concepts and stressing the stochastic nature of the thermodynamic laws. In addition, time-related concepts, such as reaction rates and activation energy, can be readily visualized. Moreover, the boxes provide an intuitive way to introduce the role in cellular organization of "information" and Maxwell's demons operating under nonequilibrium conditions.

  4. Plugging a Bipyridinium Axle into Multichromophoric Calix[6]arene Wheels Bearing Naphthyl Units at Different Rims

    PubMed Central

    Orlandini, Guido; Ragazzon, Giulio; Zanichelli, Valeria; Degli Esposti, Lorenzo; Baroncini, Massimo; Silvi, Serena; Venturi, Margherita; Arduini, Arturo

    2017-01-01

    Abstract Tris‐(N‐phenylureido)‐calix[6]arene derivatives are heteroditopic non‐symmetric molecular hosts that can form pseudorotaxane complexes with 4,4′‐bipyridinium‐type guests. Owing to the unique structural features and recognition properties of the calix[6]arene wheel, these systems are of interest for the design and synthesis of novel molecular devices and machines. We envisaged that the incorporation of photoactive units in the calixarene skeleton could lead to the development of systems the working modes of which can be governed and monitored by means of light‐activated processes. Here, we report on the synthesis, structural characterization, and spectroscopic, photophysical, and electrochemical investigation of two calix[6]arene wheels decorated with three naphthyl groups anchored to either the upper or lower rim of the phenylureido calixarene platform. We found that the naphthyl units interact mutually and with the calixarene skeleton in a different fashion in the two compounds, which thus exhibit a markedly distinct photophysical behavior. For both hosts, the inclusion of a 4,4′‐bipyridinium guest activates energy‐ and/or electron‐transfer processes that lead to non‐trivial luminescence changes. PMID:28168152

  5. Molecular systems under shock compression into the dense plasma regime: carbon dioxide and hydrocarbon polymers

    NASA Astrophysics Data System (ADS)

    Mattsson, Thomas R.; Cochrane, Kyle R.; Root, Seth; Carpenter, John H.

    2013-10-01

    Density Functional Theory (DFT) has proven remarkably accurate in predicting properties of matter under shock compression into the dense plasma regime. Materials where chemistry plays a role are of interest for many applications, including planetary science and inertial confinement fusion (ICF). As examples of systems where chemical reactions are important, and demonstration of the high fidelity possible for these both structurally and chemically complex systems, we will discuss shock- and re-shock of liquid carbon dioxide (CO2) in the range 100 to 800 GPa and shock compression of hydrocarbon polymers, including GDP (glow discharge polymer) which is used as an ablator in laser ICF experiments. Experimental results from Sandia's Z machine validate the DFT simulations at extreme conditions and the combination of experiment and DFT provide reliable data for evaluating existing and constructing future wide-range equations of state models for molecular compounds. Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Company, for the U.S. Department of Energy's National Nuclear Security Administration under contract DE-AC04-94AL85000.

  6. Selection of Wire Electrical Discharge Machining Process Parameters on Stainless Steel AISI Grade-304 using Design of Experiments Approach

    NASA Astrophysics Data System (ADS)

    Lingadurai, K.; Nagasivamuni, B.; Muthu Kamatchi, M.; Palavesam, J.

    2012-06-01

    Wire electrical discharge machining (WEDM) is a specialized thermal machining process capable of accurately machining parts of hard materials with complex shapes. Parts having sharp edges that pose difficulties to be machined by the main stream machining processes can be easily machined by WEDM process. Design of Experiments approach (DOE) has been reported in this work for stainless steel AISI grade-304 which is used in cryogenic vessels, evaporators, hospital surgical equipment, marine equipment, fasteners, nuclear vessels, feed water tubing, valves, refrigeration equipment, etc., is machined by WEDM with brass wire electrode. The DOE method is used to formulate the experimental layout, to analyze the effect of each parameter on the machining characteristics, and to predict the optimal choice for each WEDM parameter such as voltage, pulse ON, pulse OFF and wire feed. It is found that these parameters have a significant influence on machining characteristic such as metal removal rate (MRR), kerf width and surface roughness (SR). The analysis of the DOE reveals that, in general the pulse ON time significantly affects the kerf width and the wire feed rate affects SR, while, the input voltage mainly affects the MRR.

  7. National Synchrotron Light Source annual report 1991

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hulbert, S.L.; Lazarz, N.M.

    1992-04-01

    This report discusses the following research conducted at NSLS: atomic and molecular science; energy dispersive diffraction; lithography, microscopy and tomography; nuclear physics; UV photoemission and surface science; x-ray absorption spectroscopy; x-ray scattering and crystallography; x-ray topography; workshop on surface structure; workshop on electronic and chemical phenomena at surfaces; workshop on imaging; UV FEL machine reviews; VUV machine operations; VUV beamline operations; VUV storage ring parameters; x-ray machine operations; x-ray beamline operations; x-ray storage ring parameters; superconducting x-ray lithography source; SXLS storage ring parameters; the accelerator test facility; proposed UV-FEL user facility at the NSLS; global orbit feedback systems; and NSLSmore » computer system.« less

  8. Revisiting the Central Dogma One Molecule at a Time

    PubMed Central

    Bustamante, Carlos; Cheng, Wei; Meija, Yara

    2011-01-01

    The faithful relay and timely expression of genetic information depend on specialized molecular machines, many of which function as nucleic acid translocases. The emergence over the last decade of single-molecule fluorescence detection and manipulation techniques with nm and Å resolution, and their application to the study of nucleic acid translocases are painting an increasingly sharp picture of the inner workings of these machines, the dynamics and coordination of their moving parts, their thermodynamic efficiency, and the nature of their transient intermediates. Here we present an overview of the main results arrived at by the application of single-molecule methods to the study of the main machines of the central dogma. PMID:21335233

  9. POLYSHIFT Communications Software for the Connection Machine System CM-200

    DOE PAGES

    George, William; Brickner, Ralph G.; Johnsson, S. Lennart

    1994-01-01

    We describe the use and implementation of a polyshift function PSHIFT for circular shifts and end-offs shifts. Polyshift is useful in many scientific codes using regular grids, such as finite difference codes in several dimensions, and multigrid codes, molecular dynamics computations, and in lattice gauge physics computations, such as quantum chromodynamics (QCD) calculations. Our implementation of the PSHIFT function on the Connection Machine systems CM-2 and CM-200 offers a speedup of up to a factor of 3–4 compared with CSHIFT when the local data motion within a node is small. The PSHIFT routine is included in the Connection Machine Scientificmore » Software Library (CMSSL).« less

  10. The Molecular Industrial Revolution: Automated Synthesis of Small Molecules

    PubMed Central

    Trobe, Melanie; Burke, Martin D.

    2018-01-01

    The eighteenth and nineteenth centuries marked a sweeping transition from manual to automated manufacturing on the macroscopic scale. This enabled an unmatched period of human innovation that helped drive the Industrial Revolution. The impact on society was transformative, ultimately yielding substantial improvements in living conditions and lifespan in many parts of the world. During the same time period, the first manual syntheses of organic molecules was achieved. Now, two centuries later, we are poised for an analogous transition from highly customized crafting of specific molecular targets by hand to the increasingly general and automated assembly of many different types of molecules with the push of a button. Automation of customized small molecule synthesis pathways is already enabling safer, more reproducible, and readily scalable production of specific targets, and general machines now exist for the synthesis of a wide range of different peptides, oligonucleotides, and oligosaccharides. Creating general machines that are similarly capable of making many different types of small molecules on-demand, akin to that which has been achieved on the macroscopic scale with 3D printers, has proven to be substantially more challenging. Yet important progress is being made toward this potentially transformative objective with two complementary approaches: (1) automation of customized synthesis routes to different targets via machines that enable use of many different reactions and starting materials, and (2) automation of generalized platforms that make many different targets using common coupling chemistry and building blocks. Continued progress in these exciting directions has the potential to shift the bottleneck in molecular innovation from synthesis to imagination, and thereby help drive a new industrial revolution on the molecular scale. PMID:29513400

  11. Trust and Influence

    DTIC Science & Technology

    2012-03-05

    DISTRIBUTION A: Approved for public release; distribution is unlimited. Program Trends •Trust in Autonomous Systems • Cross - cultural Trust...Trust & trustworthiness are independent (Mayer et al, 1995) •Trust is relational •Humans in cross - cultural interactions •Complex human-machine...Interpersonal Trustworthiness •Ability •Benevolence •Integrity Trust Metrics Cross - Cultural Trust Issues Human-Machine Interactions Autonomous

  12. Diverse applications of advanced man-telerobot interfaces

    NASA Technical Reports Server (NTRS)

    Mcaffee, Douglas A.

    1991-01-01

    Advancements in man-machine interfaces and control technologies used in space telerobotics and teleoperators have potential application wherever human operators need to manipulate multi-dimensional spatial relationships. Bilateral six degree-of-freedom position and force cues exchanged between the user and a complex system can broaden and improve the effectiveness of several diverse man-machine interfaces.

  13. Open Architecture Data System for NASA Langley Combined Loads Test System

    NASA Technical Reports Server (NTRS)

    Lightfoot, Michael C.; Ambur, Damodar R.

    1998-01-01

    The Combined Loads Test System (COLTS) is a new structures test complex that is being developed at NASA Langley Research Center (LaRC) to test large curved panels and cylindrical shell structures. These structural components are representative of aircraft fuselage sections of subsonic and supersonic transport aircraft and cryogenic tank structures of reusable launch vehicles. Test structures are subjected to combined loading conditions that simulate realistic flight load conditions. The facility consists of two pressure-box test machines and one combined loads test machine. Each test machine possesses a unique set of requirements or research data acquisition and real-time data display. Given the complex nature of the mechanical and thermal loads to be applied to the various research test articles, each data system has been designed with connectivity attributes that support both data acquisition and data management functions. This paper addresses the research driven data acquisition requirements for each test machine and demonstrates how an open architecture data system design not only meets those needs but provides robust data sharing between data systems including the various control systems which apply spectra of mechanical and thermal loading profiles.

  14. A 16-bit Coherent Ising Machine for One-Dimensional Ring and Cubic Graph Problems

    NASA Astrophysics Data System (ADS)

    Takata, Kenta; Marandi, Alireza; Hamerly, Ryan; Haribara, Yoshitaka; Maruo, Daiki; Tamate, Shuhei; Sakaguchi, Hiromasa; Utsunomiya, Shoko; Yamamoto, Yoshihisa

    2016-09-01

    Many tasks in our modern life, such as planning an efficient travel, image processing and optimizing integrated circuit design, are modeled as complex combinatorial optimization problems with binary variables. Such problems can be mapped to finding a ground state of the Ising Hamiltonian, thus various physical systems have been studied to emulate and solve this Ising problem. Recently, networks of mutually injected optical oscillators, called coherent Ising machines, have been developed as promising solvers for the problem, benefiting from programmability, scalability and room temperature operation. Here, we report a 16-bit coherent Ising machine based on a network of time-division-multiplexed femtosecond degenerate optical parametric oscillators. The system experimentally gives more than 99.6% of success rates for one-dimensional Ising ring and nondeterministic polynomial-time (NP) hard instances. The experimental and numerical results indicate that gradual pumping of the network combined with multiple spectral and temporal modes of the femtosecond pulses can improve the computational performance of the Ising machine, offering a new path for tackling larger and more complex instances.

  15. Ultra-precision turning of complex spiral optical delay line

    NASA Astrophysics Data System (ADS)

    Zhang, Xiaodong; Li, Po; Fang, Fengzhou; Wang, Qichang

    2011-11-01

    Optical delay line (ODL) implements the vertical or depth scanning of optical coherence tomography, which is the most important factor affecting the scanning resolution and speed. The spinning spiral mirror is found as an excellent optical delay device because of the high-speed and high-repetition-rate. However, it is one difficult task to machine the mirror due to the special shape and precision requirement. In this paper, the spiral mirror with titled parabolic generatrix is proposed, and the ultra-precision turning method is studied for its machining using the spiral mathematic model. Another type of ODL with the segmental shape is also introduced and machined to make rotation balance for the mass equalization when scanning. The efficiency improvement is considered in details, including the rough cutting with the 5- axis milling machine, the machining coordinates unification, and the selection of layer direction in turning. The onmachine measuring method based on stylus gauge is designed to analyze the shape deviation. The air bearing is used as the measuring staff and the laser interferometer sensor as the position sensor, whose repeatability accuracy is proved up to 10nm and the stable feature keeps well. With this method developed, the complex mirror with nanometric finish of 10.7nm in Ra and the form error within 1um are achieved.

  16. A novel diamond micro-/nano-machining process for the generation of hierarchical micro-/nano-structures

    NASA Astrophysics Data System (ADS)

    Zhu, Zhiwei; To, Suet; Ehmann, Kornel F.; Xiao, Gaobo; Zhu, Wule

    2016-03-01

    A new mechanical micro-/nano-machining process that combines rotary spatial vibrations (RSV) of a diamond tool and the servo motions of the workpiece is proposed and applied for the generation of multi-tier hierarchical micro-/nano-structures. In the proposed micro-/nano-machining system, the servo motion, as the primary cutting motion generated by a slow-tool-servo, is adopted for the fine generation of the primary surfaces with complex shapes. The RSV, as the tertiary cutting operation, is superimposed on the secondary fundamental rotary cutting motion to construct secondary nano-structures on the primary surface. Since the RSV system generally works at much higher frequencies and motion resolution than the primary and secondary motions, it leads to an inherent hierarchical cutting architecture. To investigate the machining performance, complex micro-/nano-structures were generated and explored by both numerical simulations and actual cutting tests. Rotary vibrations of the diamond tool at a constant rotational distance offer an inherent constant cutting velocity, leading to the ability for the generation of homogeneous micro-/nano-structures with fixed amplitudes and frequencies of the vibrations, even over large-scale surfaces. Furthermore, by deliberately combining the non-resonant three-axial vibrations and the servo motion, the generation of a variety of micro-/nano-structures with complex shapes and with flexibly tunable feature sizes can be achieved.

  17. Dialogue-Based Research in Man-Machine Communication

    DTIC Science & Technology

    1975-11-01

    This paper first surveys current knowledge of human communication from a point of view which seeks to find or develop knowledge that will be useful...complexity is explored. Building a useful knowledge of human communication is an extremely complex task. Controlling this complexity and its effects, without

  18. RAFCON: A Graphical Tool for Engineering Complex, Robotic Tasks

    DTIC Science & Technology

    2016-10-09

    Robotic tasks are becoming increasingly complex, and with this also the robotic systems. This requires new tools to manage this complexity and to...execution of robotic tasks, called RAFCON. These tasks are described in hierarchical state machines supporting concurrency. A formal notation of this concept

  19. Development Of Knowledge Systems For Trouble Shooting Complex Production Machinery

    NASA Astrophysics Data System (ADS)

    Sanford, Richard L.; Novak, Thomas; Meigs, James R.

    1987-05-01

    This paper discusses the use of knowledge base system software for microcomputers to aid repairmen in diagnosing electrical failures in complex mining machinery. The knowledge base is constructed to allow the user to input initial symptoms of the failed machine, and the most probable cause of failure is traced through the knowledge base, with the software requesting additional information such as voltage or resistance measurements as needed. Although the case study presented is for an underground mining machine, results have application to any industry using complex machinery. Two commercial expert-system development tools (M1 TM and Insight 2+TM) and an Al language (Turbo PrologTM) are discussed with emphasis on ease of application and suitability for this study.

  20. Mirnacle: machine learning with SMOTE and random forest for improving selectivity in pre-miRNA ab initio prediction.

    PubMed

    Marques, Yuri Bento; de Paiva Oliveira, Alcione; Ribeiro Vasconcelos, Ana Tereza; Cerqueira, Fabio Ribeiro

    2016-12-15

    MicroRNAs (miRNAs) are key gene expression regulators in plants and animals. Therefore, miRNAs are involved in several biological processes, making the study of these molecules one of the most relevant topics of molecular biology nowadays. However, characterizing miRNAs in vivo is still a complex task. As a consequence, in silico methods have been developed to predict miRNA loci. A common ab initio strategy to find miRNAs in genomic data is to search for sequences that can fold into the typical hairpin structure of miRNA precursors (pre-miRNAs). The current ab initio approaches, however, have selectivity issues, i.e., a high number of false positives is reported, which can lead to laborious and costly attempts to provide biological validation. This study presents an extension of the ab initio method miRNAFold, with the aim of improving selectivity through machine learning techniques, namely, random forest combined with the SMOTE procedure that copes with imbalance datasets. By comparing our method, termed Mirnacle, with other important approaches in the literature, we demonstrate that Mirnacle substantially improves selectivity without compromising sensitivity. For the three datasets used in our experiments, our method achieved at least 97% of sensitivity and could deliver a two-fold, 20-fold, and 6-fold increase in selectivity, respectively, compared with the best results of current computational tools. The extension of miRNAFold by the introduction of machine learning techniques, significantly increases selectivity in pre-miRNA ab initio prediction, which optimally contributes to advanced studies on miRNAs, as the need of biological validations is diminished. Hopefully, new research, such as studies of severe diseases caused by miRNA malfunction, will benefit from the proposed computational tool.

  1. Machine Learning and Inverse Problem in Geodynamics

    NASA Astrophysics Data System (ADS)

    Shahnas, M. H.; Yuen, D. A.; Pysklywec, R.

    2017-12-01

    During the past few decades numerical modeling and traditional HPC have been widely deployed in many diverse fields for problem solutions. However, in recent years the rapid emergence of machine learning (ML), a subfield of the artificial intelligence (AI), in many fields of sciences, engineering, and finance seems to mark a turning point in the replacement of traditional modeling procedures with artificial intelligence-based techniques. The study of the circulation in the interior of Earth relies on the study of high pressure mineral physics, geochemistry, and petrology where the number of the mantle parameters is large and the thermoelastic parameters are highly pressure- and temperature-dependent. More complexity arises from the fact that many of these parameters that are incorporated in the numerical models as input parameters are not yet well established. In such complex systems the application of machine learning algorithms can play a valuable role. Our focus in this study is the application of supervised machine learning (SML) algorithms in predicting mantle properties with the emphasis on SML techniques in solving the inverse problem. As a sample problem we focus on the spin transition in ferropericlase and perovskite that may cause slab and plume stagnation at mid-mantle depths. The degree of the stagnation depends on the degree of negative density anomaly at the spin transition zone. The training and testing samples for the machine learning models are produced by the numerical convection models with known magnitudes of density anomaly (as the class labels of the samples). The volume fractions of the stagnated slabs and plumes which can be considered as measures for the degree of stagnation are assigned as sample features. The machine learning models can determine the magnitude of the spin transition-induced density anomalies that can cause flow stagnation at mid-mantle depths. Employing support vector machine (SVM) algorithms we show that SML techniques can successfully predict the magnitude of the mantle density anomalies and can also be used in characterizing mantle flow patterns. The technique can be extended to more complex problems in mantle dynamics by employing deep learning algorithms for estimation of mantle properties such as viscosity, elastic parameters, and thermal and chemical anomalies.

  2. Who Should Decide How Machines Make Morally Laden Decisions?

    PubMed

    Martin, Dominic

    2017-08-01

    Who should decide how a machine will decide what to do when it is driving a car, performing a medical procedure, or, more generally, when it is facing any kind of morally laden decision? More and more, machines are making complex decisions with a considerable level of autonomy. We should be much more preoccupied by this problem than we currently are. After a series of preliminary remarks, this paper will go over four possible answers to the question raised above. First, we may claim that it is the maker of a machine that gets to decide how it will behave in morally laden scenarios. Second, we may claim that the users of a machine should decide. Third, that decision may have to be made collectively or, fourth, by other machines built for this special purpose. The paper argues that each of these approaches suffers from its own shortcomings, and it concludes by showing, among other things, which approaches should be emphasized for different types of machines, situations, and/or morally laden decisions.

  3. Application of Fuzzy TOPSIS for evaluating machining techniques using sustainability metrics

    NASA Astrophysics Data System (ADS)

    Digalwar, Abhijeet K.

    2018-04-01

    Sustainable processes and techniques are getting increased attention over the last few decades due to rising concerns over the environment, improved focus on productivity and stringency in environmental as well as occupational health and safety norms. The present work analyzes the research on sustainable machining techniques and identifies techniques and parameters on which sustainability of a process is evaluated. Based on the analysis these parameters are then adopted as criteria’s to evaluate different sustainable machining techniques such as Cryogenic Machining, Dry Machining, Minimum Quantity Lubrication (MQL) and High Pressure Jet Assisted Machining (HPJAM) using a fuzzy TOPSIS framework. In order to facilitate easy arithmetic, the linguistic variables represented by fuzzy numbers are transformed into crisp numbers based on graded mean representation. Cryogenic machining was found to be the best alternative sustainable technique as per the fuzzy TOPSIS framework adopted. The paper provides a method to deal with multi criteria decision making problems in a complex and linguistic environment.

  4. A deviation based assessment methodology for multiple machine health patterns classification and fault detection

    NASA Astrophysics Data System (ADS)

    Jia, Xiaodong; Jin, Chao; Buzza, Matt; Di, Yuan; Siegel, David; Lee, Jay

    2018-01-01

    Successful applications of Diffusion Map (DM) in machine failure detection and diagnosis have been reported in several recent studies. DM provides an efficient way to visualize the high-dimensional, complex and nonlinear machine data, and thus suggests more knowledge about the machine under monitoring. In this paper, a DM based methodology named as DM-EVD is proposed for machine degradation assessment, abnormality detection and diagnosis in an online fashion. Several limitations and challenges of using DM for machine health monitoring have been analyzed and addressed. Based on the proposed DM-EVD, a deviation based methodology is then proposed to include more dimension reduction methods. In this work, the incorporation of Laplacian Eigen-map and Principal Component Analysis (PCA) are explored, and the latter algorithm is named as PCA-Dev and is validated in the case study. To show the successful application of the proposed methodology, case studies from diverse fields are presented and investigated in this work. Improved results are reported by benchmarking with other machine learning algorithms.

  5. Ontological modelling of knowledge management for human-machine integrated design of ultra-precision grinding machine

    NASA Astrophysics Data System (ADS)

    Hong, Haibo; Yin, Yuehong; Chen, Xing

    2016-11-01

    Despite the rapid development of computer science and information technology, an efficient human-machine integrated enterprise information system for designing complex mechatronic products is still not fully accomplished, partly because of the inharmonious communication among collaborators. Therefore, one challenge in human-machine integration is how to establish an appropriate knowledge management (KM) model to support integration and sharing of heterogeneous product knowledge. Aiming at the diversity of design knowledge, this article proposes an ontology-based model to reach an unambiguous and normative representation of knowledge. First, an ontology-based human-machine integrated design framework is described, then corresponding ontologies and sub-ontologies are established according to different purposes and scopes. Second, a similarity calculation-based ontology integration method composed of ontology mapping and ontology merging is introduced. The ontology searching-based knowledge sharing method is then developed. Finally, a case of human-machine integrated design of a large ultra-precision grinding machine is used to demonstrate the effectiveness of the method.

  6. Cobalt-60 Machines and Medical Linear Accelerators: Competing Technologies for External Beam Radiotherapy.

    PubMed

    Healy, B J; van der Merwe, D; Christaki, K E; Meghzifene, A

    2017-02-01

    Medical linear accelerators (linacs) and cobalt-60 machines are both mature technologies for external beam radiotherapy. A comparison is made between these two technologies in terms of infrastructure and maintenance, dosimetry, shielding requirements, staffing, costs, security, patient throughput and clinical use. Infrastructure and maintenance are more demanding for linacs due to the complex electric componentry. In dosimetry, a higher beam energy, modulated dose rate and smaller focal spot size mean that it is easier to create an optimised treatment with a linac for conformal dose coverage of the tumour while sparing healthy organs at risk. In shielding, the requirements for a concrete bunker are similar for cobalt-60 machines and linacs but extra shielding and protection from neutrons are required for linacs. Staffing levels can be higher for linacs and more staff training is required for linacs. Life cycle costs are higher for linacs, especially multi-energy linacs. Security is more complex for cobalt-60 machines because of the high activity radioactive source. Patient throughput can be affected by source decay for cobalt-60 machines but poor maintenance and breakdowns can severely affect patient throughput for linacs. In clinical use, more complex treatment techniques are easier to achieve with linacs, and the availability of electron beams on high-energy linacs can be useful for certain treatments. In summary, there is no simple answer to the question of the choice of either cobalt-60 machines or linacs for radiotherapy in low- and middle-income countries. In fact a radiotherapy department with a combination of technologies, including orthovoltage X-ray units, may be an option. Local needs, conditions and resources will have to be factored into any decision on technology taking into account the characteristics of both forms of teletherapy, with the primary goal being the sustainability of the radiotherapy service over the useful lifetime of the equipment. Copyright © 2016 The Royal College of Radiologists. Published by Elsevier Ltd. All rights reserved.

  7. Design and implementation of a system for laser assisted milling of advanced materials

    NASA Astrophysics Data System (ADS)

    Wu, Xuefeng; Feng, Gaocheng; Liu, Xianli

    2016-09-01

    Laser assisted machining is an effective method to machine advanced materials with the added benefits of longer tool life and increased material removal rates. While extensive studies have investigated the machining properties for laser assisted milling(LAML), few attempts have been made to extend LAML to machining parts with complex geometric features. A methodology for continuous path machining for LAML is developed by integration of a rotary and movable table into an ordinary milling machine with a laser beam system. The machining strategy and processing path are investigated to determine alignment of the machining path with the laser spot. In order to keep the material removal temperatures above the softening temperature of silicon nitride, the transformation is coordinated and the temperature interpolated, establishing a transient thermal model. The temperatures of the laser center and cutting zone are also carefully controlled to achieve optimal machining results and avoid thermal damage. These experiments indicate that the system results in no surface damage as well as good surface roughness, validating the application of this machining strategy and thermal model in the development of a new LAML system for continuous path processing of silicon nitride. The proposed approach can be easily applied in LAML system to achieve continuous processing and improve efficiency in laser assisted machining.

  8. Freeform diamond machining of complex monolithic metal optics for integral field systems

    NASA Astrophysics Data System (ADS)

    Dubbeldam, Cornelis M.; Robertson, David J.; Preuss, Werner

    2004-09-01

    Implementation of the optical designs of image slicing Integral Field Systems requires accurate alignment of a large number of small (and therefore difficult to manipulate) optical components. In order to facilitate the integration of these complex systems, the Astronomical Instrumentation Group (AIG) of the University of Durham, in collaboration with the Labor für Mikrozerspanung (Laboratory for Precision Machining - LFM) of the University of Bremen, have developed a technique for fabricating monolithic multi-faceted mirror arrays using freeform diamond machining. Using this technique, the inherent accuracy of the diamond machining equipment is exploited to achieve the required relative alignment accuracy of the facets, as well as an excellent optical surface quality for each individual facet. Monolithic arrays manufactured using this freeform diamond machining technique were successfully applied in the Integral Field Unit for the GEMINI Near-InfraRed Spectrograph (GNIRS IFU), which was recently installed at GEMINI South. Details of their fabrication process and optical performance are presented in this paper. In addition, the direction of current development work, conducted under the auspices of the Durham Instrumentation R&D Program supported by the UK Particle Physics and Astronomy Research Council (PPARC), will be discussed. The main emphasis of this research is to improve further the optical performance of diamond machined components, as well as to streamline the production and quality control processes with a view to making this technique suitable for multi-IFU instruments such as KMOS etc., which require series production of large quantities of optical components.

  9. RNA structures as mediators of neurological diseases and as drug targets

    PubMed Central

    Bernat, Viachaslau; Disney, Matthew D.

    2015-01-01

    RNAs adopt diverse folded structures that are essential for function and thus play critical roles in cellular biology. A striking example of this is the ribosome, a complex, three-dimensionally folded macromolecular machine that orchestrates protein synthesis. Advances in RNA biochemistry, structural and molecular biology, and bioinformatics have revealed other non-coding RNAs whose functions are dictated by their structure. It is not surprising that aberrantly folded RNA structures contribute to disease. In this review, we provide a brief introduction into RNA structural biology and then describe how RNA structures function in cells and cause or contribute to neurological disease. Finally, we highlight successful applications of rational design principles to provide chemical probes and lead compounds targeting structured RNAs. Based on several examples of well-characterized RNA-driven neurological disorders, we demonstrate how designed small molecules can facilitate study of RNA dysfunction, elucidating previously unknown roles for RNA in disease, and provide lead therapeutics. PMID:26139368

  10. Foundations and Emerging Paradigms for Computing in Living Cells.

    PubMed

    Ma, Kevin C; Perli, Samuel D; Lu, Timothy K

    2016-02-27

    Genetic circuits, composed of complex networks of interacting molecular machines, enable living systems to sense their dynamic environments, perform computation on the inputs, and formulate appropriate outputs. By rewiring and expanding these circuits with novel parts and modules, synthetic biologists have adapted living systems into vibrant substrates for engineering. Diverse paradigms have emerged for designing, modeling, constructing, and characterizing such artificial genetic systems. In this paper, we first provide an overview of recent advances in the development of genetic parts and highlight key engineering approaches. We then review the assembly of these parts into synthetic circuits from the perspectives of digital and analog logic, systems biology, and metabolic engineering, three areas of particular theoretical and practical interest. Finally, we discuss notable challenges that the field of synthetic biology still faces in achieving reliable and predictable forward-engineering of artificial biological circuits. Copyright © 2016. Published by Elsevier Ltd.

  11. Highly selective inhibition of myosin motors provides the basis of potential therapeutic application

    PubMed Central

    Sirigu, Serena; Hartman, James J.; Ropars, Virginie; Clancy, Sheila; Wang, Xi; Chuang, Grace; Qian, Xiangping; Lu, Pu-Ping; Barrett, Edward; Rudolph, Karin; Royer, Christopher; Morgan, Bradley P.; Stura, Enrico A.; Malik, Fady I.; Houdusse, Anne M.

    2016-01-01

    Direct inhibition of smooth muscle myosin (SMM) is a potential means to treat hypercontractile smooth muscle diseases. The selective inhibitor CK-2018571 prevents strong binding to actin and promotes muscle relaxation in vitro and in vivo. The crystal structure of the SMM/drug complex reveals that CK-2018571 binds to a novel allosteric pocket that opens up during the “recovery stroke” transition necessary to reprime the motor. Trapped in an intermediate of this fast transition, SMM is inhibited with high selectivity compared with skeletal muscle myosin (IC50 = 9 nM and 11,300 nM, respectively), although all of the binding site residues are identical in these motors. This structure provides a starting point from which to design highly specific myosin modulators to treat several human diseases. It further illustrates the potential of targeting transition intermediates of molecular machines to develop exquisitely selective pharmacological agents. PMID:27815532

  12. Reliable Real-time Calculation of Heart-rate Complexity in Critically Ill Patients Using Multiple Noisy Waveform Sources

    DTIC Science & Technology

    2014-01-01

    systems Machine learning Automatic data processing 1 Introduction Heart-rate complexity (HRC) is a method of quantifying the amount of complex...5. Batchinsky AI, Skinner J, Necsoiu C, et al. New measures of heart-rate complexity: effect of chest trauma and hemorrhage. J Trauma. 2010;68:1178–85

  13. Challenges in the Verification of Reinforcement Learning Algorithms

    NASA Technical Reports Server (NTRS)

    Van Wesel, Perry; Goodloe, Alwyn E.

    2017-01-01

    Machine learning (ML) is increasingly being applied to a wide array of domains from search engines to autonomous vehicles. These algorithms, however, are notoriously complex and hard to verify. This work looks at the assumptions underlying machine learning algorithms as well as some of the challenges in trying to verify ML algorithms. Furthermore, we focus on the specific challenges of verifying reinforcement learning algorithms. These are highlighted using a specific example. Ultimately, we do not offer a solution to the complex problem of ML verification, but point out possible approaches for verification and interesting research opportunities.

  14. 4. Overall view of complex. Foundry (MN99B) at center. Main ...

    Library of Congress Historic Buildings Survey, Historic Engineering Record, Historic Landscapes Survey

    4. Overall view of complex. Foundry (MN-99-B) at center. Main section of roundhouse (MN-99-A) at left. Machine shop section of roundhouse in center behind foundry. East end of air brake shop section of roundhouse to right of machine shop. Top of sand tower (MN-99-E) just visible above main section of roundhouse at far left. Photograph taken from second floor of office (MN-99-D). View to south. - Duluth & Iron Range Rail Road Company Shops, Southwest of downtown Two Harbors, northwest of Agate Bay, Two Harbors, Lake County, MN

  15. Improving the Prediction of Mortality and the Need for Life-Saving Interventions in Trauma Patients Using Standard Vital Signs With Heart-Rate Variability and Complexity

    DTIC Science & Technology

    2015-06-01

    Trauma 69:S10YS13, 2010. 2. Liu NT, Holcomb JB, Wade CE, Darrah MI, Salinas J: Utility of vital signs, heart-rate variability and complexity, and machine ... learning for identifying the need for life-saving interventions in trauma patients. Shock 42:108Y114, 2014. 3. Pickering TG, Shimbo D, Hass D...Ann Emerg Med 45:68Y76, 2005. 8. Liu NT, Holcomb JB, Wade CE, Batchinsky AI, Cancio LC, Darrah MI, Salinas J: Development and validation of a machine

  16. A tool for modeling concurrent real-time computation

    NASA Technical Reports Server (NTRS)

    Sharma, D. D.; Huang, Shie-Rei; Bhatt, Rahul; Sridharan, N. S.

    1990-01-01

    Real-time computation is a significant area of research in general, and in AI in particular. The complexity of practical real-time problems demands use of knowledge-based problem solving techniques while satisfying real-time performance constraints. Since the demands of a complex real-time problem cannot be predicted (owing to the dynamic nature of the environment) powerful dynamic resource control techniques are needed to monitor and control the performance. A real-time computation model for a real-time tool, an implementation of the QP-Net simulator on a Symbolics machine, and an implementation on a Butterfly multiprocessor machine are briefly described.

  17. A heuristic method for simulating open-data of arbitrary complexity that can be used to compare and evaluate machine learning methods.

    PubMed

    Moore, Jason H; Shestov, Maksim; Schmitt, Peter; Olson, Randal S

    2018-01-01

    A central challenge of developing and evaluating artificial intelligence and machine learning methods for regression and classification is access to data that illuminates the strengths and weaknesses of different methods. Open data plays an important role in this process by making it easy for computational researchers to easily access real data for this purpose. Genomics has in some examples taken a leading role in the open data effort starting with DNA microarrays. While real data from experimental and observational studies is necessary for developing computational methods it is not sufficient. This is because it is not possible to know what the ground truth is in real data. This must be accompanied by simulated data where that balance between signal and noise is known and can be directly evaluated. Unfortunately, there is a lack of methods and software for simulating data with the kind of complexity found in real biological and biomedical systems. We present here the Heuristic Identification of Biological Architectures for simulating Complex Hierarchical Interactions (HIBACHI) method and prototype software for simulating complex biological and biomedical data. Further, we introduce new methods for developing simulation models that generate data that specifically allows discrimination between different machine learning methods.

  18. Detection of segments with fetal QRS complex from abdominal maternal ECG recordings using support vector machine

    NASA Astrophysics Data System (ADS)

    Delgado, Juan A.; Altuve, Miguel; Nabhan Homsi, Masun

    2015-12-01

    This paper introduces a robust method based on the Support Vector Machine (SVM) algorithm to detect the presence of Fetal QRS (fQRS) complexes in electrocardiogram (ECG) recordings provided by the PhysioNet/CinC challenge 2013. ECG signals are first segmented into contiguous frames of 250 ms duration and then labeled in six classes. Fetal segments are tagged according to the position of fQRS complex within each one. Next, segment features extraction and dimensionality reduction are obtained by applying principal component analysis on Haar-wavelet transform. After that, two sub-datasets are generated to separate representative segments from atypical ones. Imbalanced class problem is dealt by applying sampling without replacement on each sub-dataset. Finally, two SVMs are trained and cross-validated using the two balanced sub-datasets separately. Experimental results show that the proposed approach achieves high performance rates in fetal heartbeats detection that reach up to 90.95% of accuracy, 92.16% of sensitivity, 88.51% of specificity, 94.13% of positive predictive value and 84.96% of negative predictive value. A comparative study is also carried out to show the performance of other two machine learning algorithms for fQRS complex estimation, which are K-nearest neighborhood and Bayesian network.

  19. Machine Learning and Radiology

    PubMed Central

    Wang, Shijun; Summers, Ronald M.

    2012-01-01

    In this paper, we give a short introduction to machine learning and survey its applications in radiology. We focused on six categories of applications in radiology: medical image segmentation, registration, computer aided detection and diagnosis, brain function or activity analysis and neurological disease diagnosis from fMR images, content-based image retrieval systems for CT or MRI images, and text analysis of radiology reports using natural language processing (NLP) and natural language understanding (NLU). This survey shows that machine learning plays a key role in many radiology applications. Machine learning identifies complex patterns automatically and helps radiologists make intelligent decisions on radiology data such as conventional radiographs, CT, MRI, and PET images and radiology reports. In many applications, the performance of machine learning-based automatic detection and diagnosis systems has shown to be comparable to that of a well-trained and experienced radiologist. Technology development in machine learning and radiology will benefit from each other in the long run. Key contributions and common characteristics of machine learning techniques in radiology are discussed. We also discuss the problem of translating machine learning applications to the radiology clinical setting, including advantages and potential barriers. PMID:22465077

  20. Army Research Laboratory S&T Campaign Plans 2015-2035

    DTIC Science & Technology

    2014-09-01

    addressing propagation effects, impact of wind noise on detection range and angular accuracy. Use of vector sensors is one approach for potential advances...for the future warfighter, such as new protective and responsive materials, sensors , and munitions. Polymer Chemistry explores the molecular-level...ultimately enable the design and development of novel materials, molecular sensors , and nanoscale machines that exploit the exceptional capabilities of

  1. The tail sheath structure of bacteriophage T4: a molecular machine for infecting bacteria

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Aksyuk, Anastasia A.; Leiman, Petr G.; Kurochkina, Lidia P.

    2009-07-22

    The contractile tail of bacteriophage T4 is a molecular machine that facilitates very high viral infection efficiency. Its major component is a tail sheath, which contracts during infection to less than half of its initial length. The sheath consists of 138 copies of the tail sheath protein, gene product (gp) 18, which surrounds the central non-contractile tail tube. The contraction of the sheath drives the tail tube through the outer membrane, creating a channel for the viral genome delivery. A crystal structure of about three quarters of gp18 has been determined and was fitted into cryo-electron microscopy reconstructions of themore » tail sheath before and after contraction. It was shown that during contraction, gp18 subunits slide over each other with no apparent change in their structure.« less

  2. Efficient diagonalization of the sparse matrices produced within the framework of the UK R-matrix molecular codes

    NASA Astrophysics Data System (ADS)

    Galiatsatos, P. G.; Tennyson, J.

    2012-11-01

    The most time consuming step within the framework of the UK R-matrix molecular codes is that of the diagonalization of the inner region Hamiltonian matrix (IRHM). Here we present the method that we follow to speed up this step. We use shared memory machines (SMM), distributed memory machines (DMM), the OpenMP directive based parallel language, the MPI function based parallel language, the sparse matrix diagonalizers ARPACK and PARPACK, a variation for real symmetric matrices of the official coordinate sparse matrix format and finally a parallel sparse matrix-vector product (PSMV). The efficient application of the previous techniques rely on two important facts: the sparsity of the matrix is large enough (more than 98%) and in order to get back converged results we need a small only part of the matrix spectrum.

  3. Combining Structural Modeling with Ensemble Machine Learning to Accurately Predict Protein Fold Stability and Binding Affinity Effects upon Mutation

    PubMed Central

    Garcia Lopez, Sebastian; Kim, Philip M.

    2014-01-01

    Advances in sequencing have led to a rapid accumulation of mutations, some of which are associated with diseases. However, to draw mechanistic conclusions, a biochemical understanding of these mutations is necessary. For coding mutations, accurate prediction of significant changes in either the stability of proteins or their affinity to their binding partners is required. Traditional methods have used semi-empirical force fields, while newer methods employ machine learning of sequence and structural features. Here, we show how combining both of these approaches leads to a marked boost in accuracy. We introduce ELASPIC, a novel ensemble machine learning approach that is able to predict stability effects upon mutation in both, domain cores and domain-domain interfaces. We combine semi-empirical energy terms, sequence conservation, and a wide variety of molecular details with a Stochastic Gradient Boosting of Decision Trees (SGB-DT) algorithm. The accuracy of our predictions surpasses existing methods by a considerable margin, achieving correlation coefficients of 0.77 for stability, and 0.75 for affinity predictions. Notably, we integrated homology modeling to enable proteome-wide prediction and show that accurate prediction on modeled structures is possible. Lastly, ELASPIC showed significant differences between various types of disease-associated mutations, as well as between disease and common neutral mutations. Unlike pure sequence-based prediction methods that try to predict phenotypic effects of mutations, our predictions unravel the molecular details governing the protein instability, and help us better understand the molecular causes of diseases. PMID:25243403

  4. A mechanical Turing machine: blueprint for a biomolecular computer

    PubMed Central

    Shapiro, Ehud

    2012-01-01

    We describe a working mechanical device that embodies the theoretical computing machine of Alan Turing, and as such is a universal programmable computer. The device operates on three-dimensional building blocks by applying mechanical analogues of polymer elongation, cleavage and ligation, movement along a polymer, and control by molecular recognition unleashing allosteric conformational changes. Logically, the device is not more complicated than biomolecular machines of the living cell, and all its operations are part of the standard repertoire of these machines; hence, a biomolecular embodiment of the device is not infeasible. If implemented, such a biomolecular device may operate in vivo, interacting with its biochemical environment in a program-controlled manner. In particular, it may ‘compute’ synthetic biopolymers and release them into its environment in response to input from the environment, a capability that may have broad pharmaceutical and biological applications. PMID:22649583

  5. Domain activities of PapC usher reveal the mechanism of action of an Escherichia coli molecular machine.

    PubMed

    Volkan, Ender; Ford, Bradley A; Pinkner, Jerome S; Dodson, Karen W; Henderson, Nadine S; Thanassi, David G; Waksman, Gabriel; Hultgren, Scott J

    2012-06-12

    P pili are prototypical chaperone-usher pathway-assembled pili used by Gram-negative bacteria to adhere to host tissues. The PapC usher contains five functional domains: a transmembrane β-barrel, a β-sandwich Plug, an N-terminal (periplasmic) domain (NTD), and two C-terminal (periplasmic) domains, CTD1 and CTD2. Here, we delineated usher domain interactions between themselves and with chaperone-subunit complexes and showed that overexpression of individual usher domains inhibits pilus assembly. Prior work revealed that the Plug domain occludes the pore of the transmembrane domain of a solitary usher, but the chaperone-adhesin-bound usher has its Plug displaced from the pore, adjacent to the NTD. We demonstrate an interaction between the NTD and Plug domains that suggests a biophysical basis for usher gating. Furthermore, we found that the NTD exhibits high-affinity binding to the chaperone-adhesin (PapDG) complex and low-affinity binding to the major tip subunit PapE (PapDE). We also demonstrate that CTD2 binds with lower affinity to all tested chaperone-subunit complexes except for the chaperone-terminator subunit (PapDH) and has a catalytic role in dissociating the NTD-PapDG complex, suggesting an interplay between recruitment to the NTD and transfer to CTD2 during pilus initiation. The Plug domain and the NTD-Plug complex bound all of the chaperone-subunit complexes tested including PapDH, suggesting that the Plug actively recruits chaperone-subunit complexes to the usher and is the sole recruiter of PapDH. Overall, our studies reveal the cooperative, active roles played by periplasmic domains of the usher to initiate, grow, and terminate a prototypical chaperone-usher pathway pilus.

  6. The journey from proton to gamma knife.

    PubMed

    Ganz, Jeremy C

    2014-01-01

    It was generally accepted by the early 1960s that proton beam radiosurgery was too complex and impractical. The need was seen for a new machine. The beam design had to be as good as a proton beam. It was also decided that a static design was preferable even if the evolution of that notion is no longer clear. Complex collimators were designed that using sources of cobalt-60 could produce beams with characteristics adequately close to those of proton beams. The geometry of the machine was determined including the distance of the sources from the patient the optimal distance between the sources. The first gamma unit was built with private money with no contribution from the Swedish state, which nonetheless required detailed design information in order to ensure radiation safety. This original machine was built with rectangular collimators to produce lesions for thalamotomy for functional work. However, with the introduction of dopamine analogs, this indication virtually disappeared overnight.

  7. The dynamics of discrete-time computation, with application to recurrent neural networks and finite state machine extraction.

    PubMed

    Casey, M

    1996-08-15

    Recurrent neural networks (RNNs) can learn to perform finite state computations. It is shown that an RNN performing a finite state computation must organize its state space to mimic the states in the minimal deterministic finite state machine that can perform that computation, and a precise description of the attractor structure of such systems is given. This knowledge effectively predicts activation space dynamics, which allows one to understand RNN computation dynamics in spite of complexity in activation dynamics. This theory provides a theoretical framework for understanding finite state machine (FSM) extraction techniques and can be used to improve training methods for RNNs performing FSM computations. This provides an example of a successful approach to understanding a general class of complex systems that has not been explicitly designed, e.g., systems that have evolved or learned their internal structure.

  8. Anisotropy of machine building materials

    NASA Technical Reports Server (NTRS)

    Ashkenazi, Y. K.

    1981-01-01

    The results of experimental studies of the anisotropy of elastic and strength characteristics of various structural materials, including pressure worked metals and alloys, laminated fiberglass plastics, and laminated wood plastics, are correlated and classified. Strength criteria under simple and complex stresses are considered as applied to anisotropic materials. Practical application to determining the strength of machine parts and structural materials is discussed.

  9. Quantum Computing

    DTIC Science & Technology

    1998-04-01

    information representation and processing technology, although faster than the wheels and gears of the Charles Babbage computation machine, is still in...the same computational complexity class as the Babbage machine, with bits of information represented by entities which obey classical (non-quantum...nuclear double resonances Charles M Bowden and Jonathan P. Dowling Weapons Sciences Directorate, AMSMI-RD-WS-ST Missile Research, Development, and

  10. Web-Based Machine Translation as a Tool for Promoting Electronic Literacy and Language Awareness

    ERIC Educational Resources Information Center

    Williams, Lawrence

    2006-01-01

    This article addresses a pervasive problem of concern to teachers of many foreign languages: the use of Web-Based Machine Translation (WBMT) by students who do not understand the complexities of this relatively new tool. Although networked technologies have greatly increased access to many language and communication tools, WBMT is still…

  11. Machine Learning

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chikkagoudar, Satish; Chatterjee, Samrat; Thomas, Dennis G.

    The absence of a robust and unified theory of cyber dynamics presents challenges and opportunities for using machine learning based data-driven approaches to further the understanding of the behavior of such complex systems. Analysts can also use machine learning approaches to gain operational insights. In order to be operationally beneficial, cybersecurity machine learning based models need to have the ability to: (1) represent a real-world system, (2) infer system properties, and (3) learn and adapt based on expert knowledge and observations. Probabilistic models and Probabilistic graphical models provide these necessary properties and are further explored in this chapter. Bayesian Networksmore » and Hidden Markov Models are introduced as an example of a widely used data driven classification/modeling strategy.« less

  12. Characterizing informative sequence descriptors and predicting binding affinities of heterodimeric protein complexes.

    PubMed

    Srinivasulu, Yerukala Sathipati; Wang, Jyun-Rong; Hsu, Kai-Ti; Tsai, Ming-Ju; Charoenkwan, Phasit; Huang, Wen-Lin; Huang, Hui-Ling; Ho, Shinn-Ying

    2015-01-01

    Protein-protein interactions (PPIs) are involved in various biological processes, and underlying mechanism of the interactions plays a crucial role in therapeutics and protein engineering. Most machine learning approaches have been developed for predicting the binding affinity of protein-protein complexes based on structure and functional information. This work aims to predict the binding affinity of heterodimeric protein complexes from sequences only. This work proposes a support vector machine (SVM) based binding affinity classifier, called SVM-BAC, to classify heterodimeric protein complexes based on the prediction of their binding affinity. SVM-BAC identified 14 of 580 sequence descriptors (physicochemical, energetic and conformational properties of the 20 amino acids) to classify 216 heterodimeric protein complexes into low and high binding affinity. SVM-BAC yielded the training accuracy, sensitivity, specificity, AUC and test accuracy of 85.80%, 0.89, 0.83, 0.86 and 83.33%, respectively, better than existing machine learning algorithms. The 14 features and support vector regression were further used to estimate the binding affinities (Pkd) of 200 heterodimeric protein complexes. Prediction performance of a Jackknife test was the correlation coefficient of 0.34 and mean absolute error of 1.4. We further analyze three informative physicochemical properties according to their contribution to prediction performance. Results reveal that the following properties are effective in predicting the binding affinity of heterodimeric protein complexes: apparent partition energy based on buried molar fractions, relations between chemical structure and biological activity in principal component analysis IV, and normalized frequency of beta turn. The proposed sequence-based prediction method SVM-BAC uses an optimal feature selection method to identify 14 informative features to classify and predict binding affinity of heterodimeric protein complexes. The characterization analysis revealed that the average numbers of beta turns and hydrogen bonds at protein-protein interfaces in high binding affinity complexes are more than those in low binding affinity complexes.

  13. Characterizing informative sequence descriptors and predicting binding affinities of heterodimeric protein complexes

    PubMed Central

    2015-01-01

    Background Protein-protein interactions (PPIs) are involved in various biological processes, and underlying mechanism of the interactions plays a crucial role in therapeutics and protein engineering. Most machine learning approaches have been developed for predicting the binding affinity of protein-protein complexes based on structure and functional information. This work aims to predict the binding affinity of heterodimeric protein complexes from sequences only. Results This work proposes a support vector machine (SVM) based binding affinity classifier, called SVM-BAC, to classify heterodimeric protein complexes based on the prediction of their binding affinity. SVM-BAC identified 14 of 580 sequence descriptors (physicochemical, energetic and conformational properties of the 20 amino acids) to classify 216 heterodimeric protein complexes into low and high binding affinity. SVM-BAC yielded the training accuracy, sensitivity, specificity, AUC and test accuracy of 85.80%, 0.89, 0.83, 0.86 and 83.33%, respectively, better than existing machine learning algorithms. The 14 features and support vector regression were further used to estimate the binding affinities (Pkd) of 200 heterodimeric protein complexes. Prediction performance of a Jackknife test was the correlation coefficient of 0.34 and mean absolute error of 1.4. We further analyze three informative physicochemical properties according to their contribution to prediction performance. Results reveal that the following properties are effective in predicting the binding affinity of heterodimeric protein complexes: apparent partition energy based on buried molar fractions, relations between chemical structure and biological activity in principal component analysis IV, and normalized frequency of beta turn. Conclusions The proposed sequence-based prediction method SVM-BAC uses an optimal feature selection method to identify 14 informative features to classify and predict binding affinity of heterodimeric protein complexes. The characterization analysis revealed that the average numbers of beta turns and hydrogen bonds at protein-protein interfaces in high binding affinity complexes are more than those in low binding affinity complexes. PMID:26681483

  14. Hidden Markov models and other machine learning approaches in computational molecular biology

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Baldi, P.

    1995-12-31

    This tutorial was one of eight tutorials selected to be presented at the Third International Conference on Intelligent Systems for Molecular Biology which was held in the United Kingdom from July 16 to 19, 1995. Computational tools are increasingly needed to process the massive amounts of data, to organize and classify sequences, to detect weak similarities, to separate coding from non-coding regions, and reconstruct the underlying evolutionary history. The fundamental problem in machine learning is the same as in scientific reasoning in general, as well as statistical modeling: to come up with a good model for the data. In thismore » tutorial four classes of models are reviewed. They are: Hidden Markov models; artificial Neural Networks; Belief Networks; and Stochastic Grammars. When dealing with DNA and protein primary sequences, Hidden Markov models are one of the most flexible and powerful alignments and data base searches. In this tutorial, attention is focused on the theory of Hidden Markov Models, and how to apply them to problems in molecular biology.« less

  15. Machine Learning of Accurate Energy-Conserving Molecular Force Fields

    NASA Astrophysics Data System (ADS)

    Chmiela, Stefan; Tkatchenko, Alexandre; Sauceda, Huziel; Poltavsky, Igor; Schütt, Kristof; Müller, Klaus-Robert; GDML Collaboration

    Efficient and accurate access to the Born-Oppenheimer potential energy surface (PES) is essential for long time scale molecular dynamics (MD) simulations. Using conservation of energy - a fundamental property of closed classical and quantum mechanical systems - we develop an efficient gradient-domain machine learning (GDML) approach to construct accurate molecular force fields using a restricted number of samples from ab initio MD trajectories (AIMD). The GDML implementation is able to reproduce global potential-energy surfaces of intermediate-size molecules with an accuracy of 0.3 kcal/mol for energies and 1 kcal/mol/Å for atomic forces using only 1000 conformational geometries for training. We demonstrate this accuracy for AIMD trajectories of molecules, including benzene, toluene, naphthalene, malonaldehyde, ethanol, uracil, and aspirin. The challenge of constructing conservative force fields is accomplished in our work by learning in a Hilbert space of vector-valued functions that obey the law of energy conservation. The GDML approach enables quantitative MD simulations for molecules at a fraction of cost of explicit AIMD calculations, thereby allowing the construction of efficient force fields with the accuracy and transferability of high-level ab initio methods.

  16. Application of machine learning methods in bioinformatics

    NASA Astrophysics Data System (ADS)

    Yang, Haoyu; An, Zheng; Zhou, Haotian; Hou, Yawen

    2018-05-01

    Faced with the development of bioinformatics, high-throughput genomic technology have enabled biology to enter the era of big data. [1] Bioinformatics is an interdisciplinary, including the acquisition, management, analysis, interpretation and application of biological information, etc. It derives from the Human Genome Project. The field of machine learning, which aims to develop computer algorithms that improve with experience, holds promise to enable computers to assist humans in the analysis of large, complex data sets.[2]. This paper analyzes and compares various algorithms of machine learning and their applications in bioinformatics.

  17. Approximation algorithms for scheduling unrelated parallel machines with release dates

    NASA Astrophysics Data System (ADS)

    Avdeenko, T. V.; Mesentsev, Y. A.; Estraykh, I. V.

    2017-01-01

    In this paper we propose approaches to optimal scheduling of unrelated parallel machines with release dates. One approach is based on the scheme of dynamic programming modified with adaptive narrowing of search domain ensuring its computational effectiveness. We discussed complexity of the exact schedules synthesis and compared it with approximate, close to optimal, solutions. Also we explain how the algorithm works for the example of two unrelated parallel machines and five jobs with release dates. Performance results that show the efficiency of the proposed approach have been given.

  18. Quantum Support Vector Machine for Big Data Classification

    NASA Astrophysics Data System (ADS)

    Rebentrost, Patrick; Mohseni, Masoud; Lloyd, Seth

    2014-09-01

    Supervised machine learning is the classification of new data based on already classified training examples. In this work, we show that the support vector machine, an optimized binary classifier, can be implemented on a quantum computer, with complexity logarithmic in the size of the vectors and the number of training examples. In cases where classical sampling algorithms require polynomial time, an exponential speedup is obtained. At the core of this quantum big data algorithm is a nonsparse matrix exponentiation technique for efficiently performing a matrix inversion of the training data inner-product (kernel) matrix.

  19. Open multi-agent control architecture to support virtual-reality-based man-machine interfaces

    NASA Astrophysics Data System (ADS)

    Freund, Eckhard; Rossmann, Juergen; Brasch, Marcel

    2001-10-01

    Projective Virtual Reality is a new and promising approach to intuitively operable man machine interfaces for the commanding and supervision of complex automation systems. The user interface part of Projective Virtual Reality heavily builds on latest Virtual Reality techniques, a task deduction component and automatic action planning capabilities. In order to realize man machine interfaces for complex applications, not only the Virtual Reality part has to be considered but also the capabilities of the underlying robot and automation controller are of great importance. This paper presents a control architecture that has proved to be an ideal basis for the realization of complex robotic and automation systems that are controlled by Virtual Reality based man machine interfaces. The architecture does not just provide a well suited framework for the real-time control of a multi robot system but also supports Virtual Reality metaphors and augmentations which facilitate the user's job to command and supervise a complex system. The developed control architecture has already been used for a number of applications. Its capability to integrate sensor information from sensors of different levels of abstraction in real-time helps to make the realized automation system very responsive to real world changes. In this paper, the architecture will be described comprehensively, its main building blocks will be discussed and one realization that is built based on an open source real-time operating system will be presented. The software design and the features of the architecture which make it generally applicable to the distributed control of automation agents in real world applications will be explained. Furthermore its application to the commanding and control of experiments in the Columbus space laboratory, the European contribution to the International Space Station (ISS), is only one example which will be described.

  20. Fabrication of micro-lens array on convex surface by meaning of micro-milling

    NASA Astrophysics Data System (ADS)

    Zhang, Peng; Du, Yunlong; Wang, Bo; Shan, Debin

    2014-08-01

    In order to develop the application of the micro-milling technology, and to fabricate ultra-precision optical surface with complex microstructure, in this paper, the primary experimental research on micro-milling complex microstructure array is carried out. A complex microstructure array surface with vary parameters is designed, and the mathematic model of the surface is set up and simulated. For the fabrication of the designed microstructure array surface, a micro three-axis ultra-precision milling machine tool is developed, aerostatic guideway drove directly by linear motor is adopted in order to guarantee the enough stiffness of the machine, and novel numerical control strategy with linear encoders of 5nm resolution used as the feedback of the control system is employed to ensure the extremely high motion control accuracy. With the help of CAD/CAM technology, convex micro lens array on convex spherical surface with different scales on material of polyvinyl chloride (PVC) and pure copper is fabricated using micro tungsten carbide ball end milling tool based on the ultra-precision micro-milling machine. Excellent nanometer-level micro-movement performance of the axis is proved by motion control experiment. The fabrication is nearly as the same as the design, the characteristic scale of the microstructure is less than 200μm and the accuracy is better than 1μm. It prove that ultra-precision micro-milling technology based on micro ultra-precision machine tool is a suitable and optional method for micro manufacture of microstructure array surface on different kinds of materials, and with the development of micro milling cutter, ultraprecision micro-milling complex microstructure surface will be achieved in future.

  1. Machine Learning Methods for Analysis of Metabolic Data and Metabolic Pathway Modeling

    PubMed Central

    Cuperlovic-Culf, Miroslava

    2018-01-01

    Machine learning uses experimental data to optimize clustering or classification of samples or features, or to develop, augment or verify models that can be used to predict behavior or properties of systems. It is expected that machine learning will help provide actionable knowledge from a variety of big data including metabolomics data, as well as results of metabolism models. A variety of machine learning methods has been applied in bioinformatics and metabolism analyses including self-organizing maps, support vector machines, the kernel machine, Bayesian networks or fuzzy logic. To a lesser extent, machine learning has also been utilized to take advantage of the increasing availability of genomics and metabolomics data for the optimization of metabolic network models and their analysis. In this context, machine learning has aided the development of metabolic networks, the calculation of parameters for stoichiometric and kinetic models, as well as the analysis of major features in the model for the optimal application of bioreactors. Examples of this very interesting, albeit highly complex, application of machine learning for metabolism modeling will be the primary focus of this review presenting several different types of applications for model optimization, parameter determination or system analysis using models, as well as the utilization of several different types of machine learning technologies. PMID:29324649

  2. Machine Learning Methods for Analysis of Metabolic Data and Metabolic Pathway Modeling.

    PubMed

    Cuperlovic-Culf, Miroslava

    2018-01-11

    Machine learning uses experimental data to optimize clustering or classification of samples or features, or to develop, augment or verify models that can be used to predict behavior or properties of systems. It is expected that machine learning will help provide actionable knowledge from a variety of big data including metabolomics data, as well as results of metabolism models. A variety of machine learning methods has been applied in bioinformatics and metabolism analyses including self-organizing maps, support vector machines, the kernel machine, Bayesian networks or fuzzy logic. To a lesser extent, machine learning has also been utilized to take advantage of the increasing availability of genomics and metabolomics data for the optimization of metabolic network models and their analysis. In this context, machine learning has aided the development of metabolic networks, the calculation of parameters for stoichiometric and kinetic models, as well as the analysis of major features in the model for the optimal application of bioreactors. Examples of this very interesting, albeit highly complex, application of machine learning for metabolism modeling will be the primary focus of this review presenting several different types of applications for model optimization, parameter determination or system analysis using models, as well as the utilization of several different types of machine learning technologies.

  3. National Synchrotron Light Source annual report 1991. Volume 1, October 1, 1990--September 30, 1991

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hulbert, S.L.; Lazarz, N.M.

    1992-04-01

    This report discusses the following research conducted at NSLS: atomic and molecular science; energy dispersive diffraction; lithography, microscopy and tomography; nuclear physics; UV photoemission and surface science; x-ray absorption spectroscopy; x-ray scattering and crystallography; x-ray topography; workshop on surface structure; workshop on electronic and chemical phenomena at surfaces; workshop on imaging; UV FEL machine reviews; VUV machine operations; VUV beamline operations; VUV storage ring parameters; x-ray machine operations; x-ray beamline operations; x-ray storage ring parameters; superconducting x-ray lithography source; SXLS storage ring parameters; the accelerator test facility; proposed UV-FEL user facility at the NSLS; global orbit feedback systems; and NSLSmore » computer system.« less

  4. "Paper Machine" for Molecular Diagnostics.

    PubMed

    Connelly, John T; Rolland, Jason P; Whitesides, George M

    2015-08-04

    Clinical tests based on primer-initiated amplification of specific nucleic acid sequences achieve high levels of sensitivity and specificity. Despite these desirable characteristics, these tests have not reached their full potential because their complexity and expense limit their usefulness to centralized laboratories. This paper describes a device that integrates sample preparation and loop-mediated isothermal amplification (LAMP) with end point detection using a hand-held UV source and camera phone. The prototype device integrates paper microfluidics (to enable fluid handling) and a multilayer structure, or a "paper machine", that allows a central patterned paper strip to slide in and out of fluidic path and thus allows introduction of sample, wash buffers, amplification master mix, and detection reagents with minimal pipetting, in a hand-held, disposable device intended for point-of-care use in resource-limited environments. This device creates a dynamic seal that prevents evaporation during incubation at 65 °C for 1 h. This interval is sufficient to allow a LAMP reaction for the Escherichia coli malB gene to proceed with an analytical sensitivity of 1 double-stranded DNA target copy. Starting with human plasma spiked with whole, live E. coli cells, this paper demonstrates full integration of sample preparation with LAMP amplification and end point detection with a limit of detection of 5 cells. Further, it shows that the method used to prepare sample enables concentration of DNA from sample volumes commonly available from fingerstick blood draw.

  5. A Python Analytical Pipeline to Identify Prohormone Precursors and Predict Prohormone Cleavage Sites

    PubMed Central

    Southey, Bruce R.; Sweedler, Jonathan V.; Rodriguez-Zas, Sandra L.

    2008-01-01

    Neuropeptides and hormones are signaling molecules that support cell–cell communication in the central nervous system. Experimentally characterizing neuropeptides requires significant efforts because of the complex and variable processing of prohormone precursor proteins into neuropeptides and hormones. We demonstrate the power and flexibility of the Python language to develop components of an bioinformatic analytical pipeline to identify precursors from genomic data and to predict cleavage as these precursors are en route to the final bioactive peptides. We identified 75 precursors in the rhesus genome, predicted cleavage sites using support vector machines and compared the rhesus predictions to putative assignments based on homology to human sequences. The correct classification rate of cleavage using the support vector machines was over 97% for both human and rhesus data sets. The functionality of Python has been important to develop and maintain NeuroPred (http://neuroproteomics.scs.uiuc.edu/neuropred.html), a user-centered web application for the neuroscience community that provides cleavage site prediction from a wide range of models, precision and accuracy statistics, post-translational modifications, and the molecular mass of potential peptides. The combined results illustrate the suitability of the Python language to implement an all-inclusive bioinformatics approach to predict neuropeptides that encompasses a large number of interdependent steps, from scanning genomes for precursor genes to identification of potential bioactive neuropeptides. PMID:19169350

  6. Theoretical study of cut area of reduction of large surfaces of rotation parts on machines with rotary cutters “Extra”

    NASA Astrophysics Data System (ADS)

    Bondarenko, J. A.; Fedorenko, M. A.; Pogonin, A. A.

    2018-03-01

    Large parts can be treated without disassembling machines using “Extra”, having technological and design challenges, which differ from the challenges in the processing of these components on the stationary machine. Extension machines are used to restore large parts up to the condition allowing one to use them in a production environment. To achieve the desired accuracy and surface roughness parameters, the surface after rotary grinding becomes recoverable, which greatly increases complexity. In order to improve production efficiency and productivity of the process, the qualitative rotary processing of the machined surface is applied. The rotary cutting process includes a continuous change of the cutting edge surfaces. The kinematic parameters of a rotary cutting define its main features and patterns, the cutting operation of the rotary cutting instrument.

  7. Comparative analysis of machine learning methods in ligand-based virtual screening of large compound libraries.

    PubMed

    Ma, Xiao H; Jia, Jia; Zhu, Feng; Xue, Ying; Li, Ze R; Chen, Yu Z

    2009-05-01

    Machine learning methods have been explored as ligand-based virtual screening tools for facilitating drug lead discovery. These methods predict compounds of specific pharmacodynamic, pharmacokinetic or toxicological properties based on their structure-derived structural and physicochemical properties. Increasing attention has been directed at these methods because of their capability in predicting compounds of diverse structures and complex structure-activity relationships without requiring the knowledge of target 3D structure. This article reviews current progresses in using machine learning methods for virtual screening of pharmacodynamically active compounds from large compound libraries, and analyzes and compares the reported performances of machine learning tools with those of structure-based and other ligand-based (such as pharmacophore and clustering) virtual screening methods. The feasibility to improve the performance of machine learning methods in screening large libraries is discussed.

  8. AceCloud: Molecular Dynamics Simulations in the Cloud.

    PubMed

    Harvey, M J; De Fabritiis, G

    2015-05-26

    We present AceCloud, an on-demand service for molecular dynamics simulations. AceCloud is designed to facilitate the secure execution of large ensembles of simulations on an external cloud computing service (currently Amazon Web Services). The AceCloud client, integrated into the ACEMD molecular dynamics package, provides an easy-to-use interface that abstracts all aspects of interaction with the cloud services. This gives the user the experience that all simulations are running on their local machine, minimizing the learning curve typically associated with the transition to using high performance computing services.

  9. Systematic methods for defining coarse-grained maps in large biomolecules.

    PubMed

    Zhang, Zhiyong

    2015-01-01

    Large biomolecules are involved in many important biological processes. It would be difficult to use large-scale atomistic molecular dynamics (MD) simulations to study the functional motions of these systems because of the computational expense. Therefore various coarse-grained (CG) approaches have attracted rapidly growing interest, which enable simulations of large biomolecules over longer effective timescales than all-atom MD simulations. The first issue in CG modeling is to construct CG maps from atomic structures. In this chapter, we review the recent development of a novel and systematic method for constructing CG representations of arbitrarily complex biomolecules, in order to preserve large-scale and functionally relevant essential dynamics (ED) at the CG level. In this ED-CG scheme, the essential dynamics can be characterized by principal component analysis (PCA) on a structural ensemble, or elastic network model (ENM) of a single atomic structure. Validation and applications of the method cover various biological systems, such as multi-domain proteins, protein complexes, and even biomolecular machines. The results demonstrate that the ED-CG method may serve as a very useful tool for identifying functional dynamics of large biomolecules at the CG level.

  10. Crystal structure of a transfer-ribonucleoprotein particle that promotes asparagine formation

    PubMed Central

    Blaise, Mickaël; Bailly, Marc; Frechin, Mathieu; Behrens, Manja Annette; Fischer, Frédéric; Oliveira, Cristiano L P; Becker, Hubert Dominique; Pedersen, Jan Skov; Thirup, Søren; Kern, Daniel

    2010-01-01

    Four out of the 22 aminoacyl-tRNAs (aa-tRNAs) are systematically or alternatively synthesized by an indirect, two-step route requiring an initial mischarging of the tRNA followed by tRNA-dependent conversion of the non-cognate amino acid. During tRNA-dependent asparagine formation, tRNAAsn promotes assembly of a ribonucleoprotein particle called transamidosome that allows channelling of the aa-tRNA from non-discriminating aspartyl-tRNA synthetase active site to the GatCAB amidotransferase site. The crystal structure of the Thermus thermophilus transamidosome determined at 3 Å resolution reveals a particle formed by two GatCABs, two dimeric ND-AspRSs and four tRNAsAsn molecules. In the complex, only two tRNAs are bound in a functional state, whereas the two other ones act as an RNA scaffold enabling release of the asparaginyl-tRNAAsn without dissociation of the complex. We propose that the crystal structure represents a transient state of the transamidation reaction. The transamidosome constitutes a transfer-ribonucleoprotein particle in which tRNAs serve the function of both substrate and structural foundation for a large molecular machine. PMID:20717102

  11. Ancestral and derived protein import pathways in the mitochondrion of Reclinomonas americana.

    PubMed

    Tong, Janette; Dolezal, Pavel; Selkrig, Joel; Crawford, Simon; Simpson, Alastair G B; Noinaj, Nicholas; Buchanan, Susan K; Gabriel, Kipros; Lithgow, Trevor

    2011-05-01

    The evolution of mitochondria from ancestral bacteria required that new protein transport machinery be established. Recent controversy over the evolution of these new molecular machines hinges on the degree to which ancestral bacterial transporters contributed during the establishment of the new protein import pathway. Reclinomonas americana is a unicellular eukaryote with the most gene-rich mitochondrial genome known, and the large collection of membrane proteins encoded on the mitochondrial genome of R. americana includes a bacterial-type SecY protein transporter. Analysis of expressed sequence tags shows R. americana also has components of a mitochondrial protein translocase or "translocase in the inner mitochondrial membrane complex." Along with several other membrane proteins encoded on the mitochondrial genome Cox11, an assembly factor for cytochrome c oxidase retains sequence features suggesting that it is assembled by the SecY complex in R. americana. Despite this, protein import studies show that the RaCox11 protein is suited for import into mitochondria and functional complementation if the gene is transferred into the nucleus of yeast. Reclinomonas americana provides direct evidence that bacterial protein transport pathways were retained, alongside the evolving mitochondrial protein import machinery, shedding new light on the process of mitochondrial evolution.

  12. An Efficient Semi-supervised Learning Approach to Predict SH2 Domain Mediated Interactions.

    PubMed

    Kundu, Kousik; Backofen, Rolf

    2017-01-01

    Src homology 2 (SH2) domain is an important subclass of modular protein domains that plays an indispensable role in several biological processes in eukaryotes. SH2 domains specifically bind to the phosphotyrosine residue of their binding peptides to facilitate various molecular functions. For determining the subtle binding specificities of SH2 domains, it is very important to understand the intriguing mechanisms by which these domains recognize their target peptides in a complex cellular environment. There are several attempts have been made to predict SH2-peptide interactions using high-throughput data. However, these high-throughput data are often affected by a low signal to noise ratio. Furthermore, the prediction methods have several additional shortcomings, such as linearity problem, high computational complexity, etc. Thus, computational identification of SH2-peptide interactions using high-throughput data remains challenging. Here, we propose a machine learning approach based on an efficient semi-supervised learning technique for the prediction of 51 SH2 domain mediated interactions in the human proteome. In our study, we have successfully employed several strategies to tackle the major problems in computational identification of SH2-peptide interactions.

  13. How does a scanning ribosomal particle move along the 5'-untranslated region of eukaryotic mRNA? Brownian Ratchet model.

    PubMed

    Spirin, Alexander S

    2009-11-17

    A model of the ATP-dependent unidirectional movement of the 43S ribosomal initiation complex (=40S ribosomal subunit + eIF1 + eIF1A + eIF2.GTP.Met-tRNA(i) + eIF3) during scanning of the 5'-untranslated region of eukaryotic mRNA is proposed. The model is based on the principles of molecular Brownian ratchet machines and explains several enigmatic data concerning the scanning complex. In this model, the one-dimensional diffusion of the ribosomal initiation complex along the mRNA chain is rectified into the net-unidirectional 5'-to-3' movement by the Feynman ratchet-and-pawl mechanism. The proposed mechanism is organized by the heterotrimeric protein eIF4F (=eIF4A + eIF4E + eIF4G), attached to the scanning ribosomal particle via eIF3, and the RNA-binding protein eIF4B that is postulated to play the role of the pawl. The energy for the useful work of the ratchet-and-pawl mechanism is supplied from ATP hydrolysis induced by the eIF4A subunit: ATP binding and its hydrolysis alternately change the affinities of eIF4A for eIF4B and for mRNA, resulting in the restriction of backward diffusional sliding of the 43S ribosomal complex along the mRNA chain, while stochastic movements ahead are allowed.

  14. Kinesin Motor Enzymology: Chemistry, Structure, and Physics of Nanoscale Molecular Machines.

    PubMed

    Cochran, J C

    2015-09-01

    Molecular motors are enzymes that convert chemical potential energy into controlled kinetic energy for mechanical work inside cells. Understanding the biophysics of these motors is essential for appreciating life as well as apprehending diseases that arise from motor malfunction. This review focuses on kinesin motor enzymology with special emphasis on the literature that reports the chemistry, structure and physics of several different kinesin superfamily members.

  15. MultiDK: A Multiple Descriptor Multiple Kernel Approach for Molecular Discovery and Its Application to Organic Flow Battery Electrolytes.

    PubMed

    Kim, Sungjin; Jinich, Adrián; Aspuru-Guzik, Alán

    2017-04-24

    We propose a multiple descriptor multiple kernel (MultiDK) method for efficient molecular discovery using machine learning. We show that the MultiDK method improves both the speed and accuracy of molecular property prediction. We apply the method to the discovery of electrolyte molecules for aqueous redox flow batteries. Using multiple-type-as opposed to single-type-descriptors, we obtain more relevant features for machine learning. Following the principle of "wisdom of the crowds", the combination of multiple-type descriptors significantly boosts prediction performance. Moreover, by employing multiple kernels-more than one kernel function for a set of the input descriptors-MultiDK exploits nonlinear relations between molecular structure and properties better than a linear regression approach. The multiple kernels consist of a Tanimoto similarity kernel and a linear kernel for a set of binary descriptors and a set of nonbinary descriptors, respectively. Using MultiDK, we achieve an average performance of r 2 = 0.92 with a test set of molecules for solubility prediction. We also extend MultiDK to predict pH-dependent solubility and apply it to a set of quinone molecules with different ionizable functional groups to assess their performance as flow battery electrolytes.

  16. State Machine Modeling of the Space Launch System Solid Rocket Boosters

    NASA Technical Reports Server (NTRS)

    Harris, Joshua A.; Patterson-Hine, Ann

    2013-01-01

    The Space Launch System is a Shuttle-derived heavy-lift vehicle currently in development to serve as NASA's premiere launch vehicle for space exploration. The Space Launch System is a multistage rocket with two Solid Rocket Boosters and multiple payloads, including the Multi-Purpose Crew Vehicle. Planned Space Launch System destinations include near-Earth asteroids, the Moon, Mars, and Lagrange points. The Space Launch System is a complex system with many subsystems, requiring considerable systems engineering and integration. To this end, state machine analysis offers a method to support engineering and operational e orts, identify and avert undesirable or potentially hazardous system states, and evaluate system requirements. Finite State Machines model a system as a finite number of states, with transitions between states controlled by state-based and event-based logic. State machines are a useful tool for understanding complex system behaviors and evaluating "what-if" scenarios. This work contributes to a state machine model of the Space Launch System developed at NASA Ames Research Center. The Space Launch System Solid Rocket Booster avionics and ignition subsystems are modeled using MATLAB/Stateflow software. This model is integrated into a larger model of Space Launch System avionics used for verification and validation of Space Launch System operating procedures and design requirements. This includes testing both nominal and o -nominal system states and command sequences.

  17. The value of prior knowledge in machine learning of complex network systems.

    PubMed

    Ferranti, Dana; Krane, David; Craft, David

    2017-11-15

    Our overall goal is to develop machine-learning approaches based on genomics and other relevant accessible information for use in predicting how a patient will respond to a given proposed drug or treatment. Given the complexity of this problem, we begin by developing, testing and analyzing learning methods using data from simulated systems, which allows us access to a known ground truth. We examine the benefits of using prior system knowledge and investigate how learning accuracy depends on various system parameters as well as the amount of training data available. The simulations are based on Boolean networks-directed graphs with 0/1 node states and logical node update rules-which are the simplest computational systems that can mimic the dynamic behavior of cellular systems. Boolean networks can be generated and simulated at scale, have complex yet cyclical dynamics and as such provide a useful framework for developing machine-learning algorithms for modular and hierarchical networks such as biological systems in general and cancer in particular. We demonstrate that utilizing prior knowledge (in the form of network connectivity information), without detailed state equations, greatly increases the power of machine-learning algorithms to predict network steady-state node values ('phenotypes') and perturbation responses ('drug effects'). Links to codes and datasets here: https://gray.mgh.harvard.edu/people-directory/71-david-craft-phd. dcraft@broadinstitute.org. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  18. Light-powered autonomous and directional molecular motion of a dissipative self-assembling system

    NASA Astrophysics Data System (ADS)

    Ragazzon, Giulio; Baroncini, Massimo; Silvi, Serena; Venturi, Margherita; Credi, Alberto

    2015-01-01

    Biomolecular motors convert energy into directed motion and operate away from thermal equilibrium. The development of dynamic chemical systems that exploit dissipative (non-equilibrium) processes is a challenge in supramolecular chemistry and a premise for the realization of artificial nanoscale motors. Here, we report the relative unidirectional transit of a non-symmetric molecular axle through a macrocycle powered solely by light. The molecular machine rectifies Brownian fluctuations by energy and information ratchet mechanisms and can repeat its working cycle under photostationary conditions. The system epitomizes the conceptual and practical elements forming the basis of autonomous light-powered directed motion with a minimalist molecular design.

  19. Machine learning for predicting soil classes in three semi-arid landscapes

    USGS Publications Warehouse

    Brungard, Colby W.; Boettinger, Janis L.; Duniway, Michael C.; Wills, Skye A.; Edwards, Thomas C.

    2015-01-01

    Mapping the spatial distribution of soil taxonomic classes is important for informing soil use and management decisions. Digital soil mapping (DSM) can quantitatively predict the spatial distribution of soil taxonomic classes. Key components of DSM are the method and the set of environmental covariates used to predict soil classes. Machine learning is a general term for a broad set of statistical modeling techniques. Many different machine learning models have been applied in the literature and there are different approaches for selecting covariates for DSM. However, there is little guidance as to which, if any, machine learning model and covariate set might be optimal for predicting soil classes across different landscapes. Our objective was to compare multiple machine learning models and covariate sets for predicting soil taxonomic classes at three geographically distinct areas in the semi-arid western United States of America (southern New Mexico, southwestern Utah, and northeastern Wyoming). All three areas were the focus of digital soil mapping studies. Sampling sites at each study area were selected using conditioned Latin hypercube sampling (cLHS). We compared models that had been used in other DSM studies, including clustering algorithms, discriminant analysis, multinomial logistic regression, neural networks, tree based methods, and support vector machine classifiers. Tested machine learning models were divided into three groups based on model complexity: simple, moderate, and complex. We also compared environmental covariates derived from digital elevation models and Landsat imagery that were divided into three different sets: 1) covariates selected a priori by soil scientists familiar with each area and used as input into cLHS, 2) the covariates in set 1 plus 113 additional covariates, and 3) covariates selected using recursive feature elimination. Overall, complex models were consistently more accurate than simple or moderately complex models. Random forests (RF) using covariates selected via recursive feature elimination was consistently the most accurate, or was among the most accurate, classifiers between study areas and between covariate sets within each study area. We recommend that for soil taxonomic class prediction, complex models and covariates selected by recursive feature elimination be used. Overall classification accuracy in each study area was largely dependent upon the number of soil taxonomic classes and the frequency distribution of pedon observations between taxonomic classes. Individual subgroup class accuracy was generally dependent upon the number of soil pedon observations in each taxonomic class. The number of soil classes is related to the inherent variability of a given area. The imbalance of soil pedon observations between classes is likely related to cLHS. Imbalanced frequency distributions of soil pedon observations between classes must be addressed to improve model accuracy. Solutions include increasing the number of soil pedon observations in classes with few observations or decreasing the number of classes. Spatial predictions using the most accurate models generally agree with expected soil–landscape relationships. Spatial prediction uncertainty was lowest in areas of relatively low relief for each study area.

  20. A Flexible Approach to Quantifying Various Dimensions of Environmental Complexity

    DTIC Science & Technology

    2004-08-01

    dissertation, Cambridge University, Cambridge, England, 1989. [15] C. J. C. H. Watkins and P. Dayan, “Q-learning,” Machine Learning , vol. 8, pp. 279–292, 1992...16] I. Szita, B. Takács, and A. Lörincz, “²-MDPs: Learning in varying environments,” Journal of Machine Learning Research, vol. 3, pp. 145–174, 2002

Top