QPSO-Based Adaptive DNA Computing Algorithm
Karakose, Mehmet; Cigdem, Ugur
2013-01-01
DNA (deoxyribonucleic acid) computing that is a new computation model based on DNA molecules for information storage has been increasingly used for optimization and data analysis in recent years. However, DNA computing algorithm has some limitations in terms of convergence speed, adaptability, and effectiveness. In this paper, a new approach for improvement of DNA computing is proposed. This new approach aims to perform DNA computing algorithm with adaptive parameters towards the desired goal using quantum-behaved particle swarm optimization (QPSO). Some contributions provided by the proposed QPSO based on adaptive DNA computing algorithm are as follows: (1) parameters of population size, crossover rate, maximum number of operations, enzyme and virus mutation rate, and fitness function of DNA computing algorithm are simultaneously tuned for adaptive process, (2) adaptive algorithm is performed using QPSO algorithm for goal-driven progress, faster operation, and flexibility in data, and (3) numerical realization of DNA computing algorithm with proposed approach is implemented in system identification. Two experiments with different systems were carried out to evaluate the performance of the proposed approach with comparative results. Experimental results obtained with Matlab and FPGA demonstrate ability to provide effective optimization, considerable convergence speed, and high accuracy according to DNA computing algorithm. PMID:23935409
Song, Tianqi; Garg, Sudhanshu; Mokhtar, Reem; Bui, Hieu; Reif, John
2018-01-19
A main goal in DNA computing is to build DNA circuits to compute designated functions using a minimal number of DNA strands. Here, we propose a novel architecture to build compact DNA strand displacement circuits to compute a broad scope of functions in an analog fashion. A circuit by this architecture is composed of three autocatalytic amplifiers, and the amplifiers interact to perform computation. We show DNA circuits to compute functions sqrt(x), ln(x) and exp(x) for x in tunable ranges with simulation results. A key innovation in our architecture, inspired by Napier's use of logarithm transforms to compute square roots on a slide rule, is to make use of autocatalytic amplifiers to do logarithmic and exponential transforms in concentration and time. In particular, we convert from the input that is encoded by the initial concentration of the input DNA strand, to time, and then back again to the output encoded by the concentration of the output DNA strand at equilibrium. This combined use of strand-concentration and time encoding of computational values may have impact on other forms of molecular computation.
Fan, Daoqing; Zhu, Xiaoqing; Dong, Shaojun; Wang, Erkang
2017-07-05
DNA is believed to be a promising candidate for molecular logic computation, and the fluorogenic/colorimetric substrates of G-quadruplex DNAzyme (G4zyme) are broadly used as label-free output reporters of DNA logic circuits. Herein, for the first time, tyramine-HCl (a fluorogenic substrate of G4zyme) is applied to DNA logic computation and a series of label-free DNA-input logic gates, including elementary AND, OR, and INHIBIT logic gates, as well as a two to one encoder, are constructed. Furthermore, a DNA caliper that can measure the base number of target DNA as low as three bases is also fabricated. This DNA caliper can also perform concatenated AND-AND logic computation to fulfil the requirements of sophisticated logic computing. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Vandersall, Jennifer A.; Gardner, Shea N.; Clague, David S.
2010-05-04
A computational method and computer-based system of modeling DNA synthesis for the design and interpretation of PCR amplification, parallel DNA synthesis, and microarray chip analysis. The method and system include modules that address the bioinformatics, kinetics, and thermodynamics of DNA amplification and synthesis. Specifically, the steps of DNA selection, as well as the kinetics and thermodynamics of DNA hybridization and extensions, are addressed, which enable the optimization of the processing and the prediction of the products as a function of DNA sequence, mixing protocol, time, temperature and concentration of species.
Roberts, Victoria A.; Pique, Michael E.; Hsu, Simon; Li, Sheng; Slupphaug, Geir; Rambo, Robert P.; Jamison, Jonathan W.; Liu, Tong; Lee, Jun H.; Tainer, John A.; Ten Eyck, Lynn F.; Woods, Virgil L.
2012-01-01
X-ray crystallography provides excellent structural data on protein–DNA interfaces, but crystallographic complexes typically contain only small fragments of large DNA molecules. We present a new approach that can use longer DNA substrates and reveal new protein–DNA interactions even in extensively studied systems. Our approach combines rigid-body computational docking with hydrogen/deuterium exchange mass spectrometry (DXMS). DXMS identifies solvent-exposed protein surfaces; docking is used to create a 3-dimensional model of the protein–DNA interaction. We investigated the enzyme uracil-DNA glycosylase (UNG), which detects and cleaves uracil from DNA. UNG was incubated with a 30 bp DNA fragment containing a single uracil, giving the complex with the abasic DNA product. Compared with free UNG, the UNG–DNA complex showed increased solvent protection at the UNG active site and at two regions outside the active site: residues 210–220 and 251–264. Computational docking also identified these two DNA-binding surfaces, but neither shows DNA contact in UNG–DNA crystallographic structures. Our results can be explained by separation of the two DNA strands on one side of the active site. These non-sequence-specific DNA-binding surfaces may aid local uracil search, contribute to binding the abasic DNA product and help present the DNA product to APE-1, the next enzyme on the DNA-repair pathway. PMID:22492624
Analog Computation by DNA Strand Displacement Circuits.
Song, Tianqi; Garg, Sudhanshu; Mokhtar, Reem; Bui, Hieu; Reif, John
2016-08-19
DNA circuits have been widely used to develop biological computing devices because of their high programmability and versatility. Here, we propose an architecture for the systematic construction of DNA circuits for analog computation based on DNA strand displacement. The elementary gates in our architecture include addition, subtraction, and multiplication gates. The input and output of these gates are analog, which means that they are directly represented by the concentrations of the input and output DNA strands, respectively, without requiring a threshold for converting to Boolean signals. We provide detailed domain designs and kinetic simulations of the gates to demonstrate their expected performance. On the basis of these gates, we describe how DNA circuits to compute polynomial functions of inputs can be built. Using Taylor Series and Newton Iteration methods, functions beyond the scope of polynomials can also be computed by DNA circuits built upon our architecture.
Computational Approaches to Nucleic Acid Origami.
Jabbari, Hosna; Aminpour, Maral; Montemagno, Carlo
2015-10-12
Recent advances in experimental DNA origami have dramatically expanded the horizon of DNA nanotechnology. Complex 3D suprastructures have been designed and developed using DNA origami with applications in biomaterial science, nanomedicine, nanorobotics, and molecular computation. Ribonucleic acid (RNA) origami has recently been realized as a new approach. Similar to DNA, RNA molecules can be designed to form complex 3D structures through complementary base pairings. RNA origami structures are, however, more compact and more thermodynamically stable due to RNA's non-canonical base pairing and tertiary interactions. With all these advantages, the development of RNA origami lags behind DNA origami by a large gap. Furthermore, although computational methods have proven to be effective in designing DNA and RNA origami structures and in their evaluation, advances in computational nucleic acid origami is even more limited. In this paper, we review major milestones in experimental and computational DNA and RNA origami and present current challenges in these fields. We believe collaboration between experimental nanotechnologists and computer scientists are critical for advancing these new research paradigms.
Constructing Smart Protocells with Built-In DNA Computational Core to Eliminate Exogenous Challenge.
Lyu, Yifan; Wu, Cuichen; Heinke, Charles; Han, Da; Cai, Ren; Teng, I-Ting; Liu, Yuan; Liu, Hui; Zhang, Xiaobing; Liu, Qiaoling; Tan, Weihong
2018-06-06
A DNA reaction network is like a biological algorithm that can respond to "molecular input signals", such as biological molecules, while the artificial cell is like a microrobot whose function is powered by the encapsulated DNA reaction network. In this work, we describe the feasibility of using a DNA reaction network as the computational core of a protocell, which will perform an artificial immune response in a concise way to eliminate a mimicked pathogenic challenge. Such a DNA reaction network (RN)-powered protocell can realize the connection of logical computation and biological recognition due to the natural programmability and biological properties of DNA. Thus, the biological input molecules can be easily involved in the molecular computation and the computation process can be spatially isolated and protected by artificial bilayer membrane. We believe the strategy proposed in the current paper, i.e., using DNA RN to power artificial cells, will lay the groundwork for understanding the basic design principles of DNA algorithm-based nanodevices which will, in turn, inspire the construction of artificial cells, or protocells, that will find a place in future biomedical research.
Reversible Data Hiding Based on DNA Computing
Xie, Yingjie
2017-01-01
Biocomputing, especially DNA, computing has got great development. It is widely used in information security. In this paper, a novel algorithm of reversible data hiding based on DNA computing is proposed. Inspired by the algorithm of histogram modification, which is a classical algorithm for reversible data hiding, we combine it with DNA computing to realize this algorithm based on biological technology. Compared with previous results, our experimental results have significantly improved the ER (Embedding Rate). Furthermore, some PSNR (peak signal-to-noise ratios) of test images are also improved. Experimental results show that it is suitable for protecting the copyright of cover image in DNA-based information security. PMID:28280504
High performance transcription factor-DNA docking with GPU computing
2012-01-01
Background Protein-DNA docking is a very challenging problem in structural bioinformatics and has important implications in a number of applications, such as structure-based prediction of transcription factor binding sites and rational drug design. Protein-DNA docking is very computational demanding due to the high cost of energy calculation and the statistical nature of conformational sampling algorithms. More importantly, experiments show that the docking quality depends on the coverage of the conformational sampling space. It is therefore desirable to accelerate the computation of the docking algorithm, not only to reduce computing time, but also to improve docking quality. Methods In an attempt to accelerate the sampling process and to improve the docking performance, we developed a graphics processing unit (GPU)-based protein-DNA docking algorithm. The algorithm employs a potential-based energy function to describe the binding affinity of a protein-DNA pair, and integrates Monte-Carlo simulation and a simulated annealing method to search through the conformational space. Algorithmic techniques were developed to improve the computation efficiency and scalability on GPU-based high performance computing systems. Results The effectiveness of our approach is tested on a non-redundant set of 75 TF-DNA complexes and a newly developed TF-DNA docking benchmark. We demonstrated that the GPU-based docking algorithm can significantly accelerate the simulation process and thereby improving the chance of finding near-native TF-DNA complex structures. This study also suggests that further improvement in protein-DNA docking research would require efforts from two integral aspects: improvement in computation efficiency and energy function design. Conclusions We present a high performance computing approach for improving the prediction accuracy of protein-DNA docking. The GPU-based docking algorithm accelerates the search of the conformational space and thus increases the chance of finding more near-native structures. To the best of our knowledge, this is the first ad hoc effort of applying GPU or GPU clusters to the protein-DNA docking problem. PMID:22759575
Logical NAND and NOR Operations Using Algorithmic Self-assembly of DNA Molecules
NASA Astrophysics Data System (ADS)
Wang, Yanfeng; Cui, Guangzhao; Zhang, Xuncai; Zheng, Yan
DNA self-assembly is the most advanced and versatile system that has been experimentally demonstrated for programmable construction of patterned systems on the molecular scale. It has been demonstrated that the simple binary arithmetic and logical operations can be computed by the process of self assembly of DNA tiles. Here we report a one-dimensional algorithmic self-assembly of DNA triple-crossover molecules that can be used to execute five steps of a logical NAND and NOR operations on a string of binary bits. To achieve this, abstract tiles were translated into DNA tiles based on triple-crossover motifs. Serving as input for the computation, long single stranded DNA molecules were used to nucleate growth of tiles into algorithmic crystals. Our method shows that engineered DNA self-assembly can be treated as a bottom-up design techniques, and can be capable of designing DNA computer organization and architecture.
A DNA sequence analysis package for the IBM personal computer.
Lagrimini, L M; Brentano, S T; Donelson, J E
1984-01-01
We present here a collection of DNA sequence analysis programs, called "PC Sequence" (PCS), which are designed to run on the IBM Personal Computer (PC). These programs are written in IBM PC compiled BASIC and take full advantage of the IBM PC's speed, error handling, and graphics capabilities. For a modest initial expense in hardware any laboratory can use these programs to quickly perform computer analysis on DNA sequences. They are written with the novice user in mind and require very little training or previous experience with computers. Also provided are a text editing program for creating and modifying DNA sequence files and a communications program which enables the PC to communicate with and collect information from mainframe computers and DNA sequence databases. PMID:6546433
Investigation of a Sybr-Green-Based Method to Validate DNA Sequences for DNA Computing
2005-05-01
OF A SYBR-GREEN-BASED METHOD TO VALIDATE DNA SEQUENCES FOR DNA COMPUTING 6. AUTHOR(S) Wendy Pogozelski, Salvatore Priore, Matthew Bernard ...simulated annealing. Biochemistry, 35, 14077-14089. 15 Pogozelski, W.K., Bernard , M.P. and Macula, A. (2004) DNA code validation using...and Clark, B.F.C. (eds) In RNA Biochemistry and Biotechnology, NATO ASI Series, Kluwer Academic Publishers. Zucker, M. and Stiegler , P. (1981
Superimposed Code Theoretic Analysis of DNA Codes and DNA Computing
2008-01-01
complements of one another and the DNA duplex formed is a Watson - Crick (WC) duplex. However, there are many instances when the formation of non-WC...that the user’s requirements for probe selection are met based on the Watson - Crick probe locality within a target. The second type, called...AFRL-RI-RS-TR-2007-288 Final Technical Report January 2008 SUPERIMPOSED CODE THEORETIC ANALYSIS OF DNA CODES AND DNA COMPUTING
AlQuraishi, Mohammed; Tang, Shengdong; Xia, Xide
2015-11-19
Molecular interactions between proteins and DNA molecules underlie many cellular processes, including transcriptional regulation, chromosome replication, and nucleosome positioning. Computational analyses of protein-DNA interactions rely on experimental data characterizing known protein-DNA interactions structurally and biochemically. While many databases exist that contain either structural or biochemical data, few integrate these two data sources in a unified fashion. Such integration is becoming increasingly critical with the rapid growth of structural and biochemical data, and the emergence of algorithms that rely on the synthesis of multiple data types to derive computational models of molecular interactions. We have developed an integrated affinity-structure database in which the experimental and quantitative DNA binding affinities of helix-turn-helix proteins are mapped onto the crystal structures of the corresponding protein-DNA complexes. This database provides access to: (i) protein-DNA structures, (ii) quantitative summaries of protein-DNA binding affinities using position weight matrices, and (iii) raw experimental data of protein-DNA binding instances. Critically, this database establishes a correspondence between experimental structural data and quantitative binding affinity data at the single basepair level. Furthermore, we present a novel alignment algorithm that structurally aligns the protein-DNA complexes in the database and creates a unified residue-level coordinate system for comparing the physico-chemical environments at the interface between complexes. Using this unified coordinate system, we compute the statistics of atomic interactions at the protein-DNA interface of helix-turn-helix proteins. We provide an interactive website for visualization, querying, and analyzing this database, and a downloadable version to facilitate programmatic analysis. This database will facilitate the analysis of protein-DNA interactions and the development of programmatic computational methods that capitalize on integration of structural and biochemical datasets. The database can be accessed at http://ProteinDNA.hms.harvard.edu.
A strand graph semantics for DNA-based computation
Petersen, Rasmus L.; Lakin, Matthew R.; Phillips, Andrew
2015-01-01
DNA nanotechnology is a promising approach for engineering computation at the nanoscale, with potential applications in biofabrication and intelligent nanomedicine. DNA strand displacement is a general strategy for implementing a broad range of nanoscale computations, including any computation that can be expressed as a chemical reaction network. Modelling and analysis of DNA strand displacement systems is an important part of the design process, prior to experimental realisation. As experimental techniques improve, it is important for modelling languages to keep pace with the complexity of structures that can be realised experimentally. In this paper we present a process calculus for modelling DNA strand displacement computations involving rich secondary structures, including DNA branches and loops. We prove that our calculus is also sufficiently expressive to model previous work on non-branching structures, and propose a mapping from our calculus to a canonical strand graph representation, in which vertices represent DNA strands, ordered sites represent domains, and edges between sites represent bonds between domains. We define interactions between strands by means of strand graph rewriting, and prove the correspondence between the process calculus and strand graph behaviours. Finally, we propose a mapping from strand graphs to an efficient implementation, which we use to perform modelling and simulation of DNA strand displacement systems with rich secondary structure. PMID:27293306
Computational Design of DNA-Binding Proteins.
Thyme, Summer; Song, Yifan
2016-01-01
Predicting the outcome of engineered and naturally occurring sequence perturbations to protein-DNA interfaces requires accurate computational modeling technologies. It has been well established that computational design to accommodate small numbers of DNA target site substitutions is possible. This chapter details the basic method of design used in the Rosetta macromolecular modeling program that has been successfully used to modulate the specificity of DNA-binding proteins. More recently, combining computational design and directed evolution has become a common approach for increasing the success rate of protein engineering projects. The power of such high-throughput screening depends on computational methods producing multiple potential solutions. Therefore, this chapter describes several protocols for increasing the diversity of designed output. Lastly, we describe an approach for building comparative models of protein-DNA complexes in order to utilize information from homologous sequences. These models can be used to explore how nature modulates specificity of protein-DNA interfaces and potentially can even be used as starting templates for further engineering.
Research on Image Encryption Based on DNA Sequence and Chaos Theory
NASA Astrophysics Data System (ADS)
Tian Zhang, Tian; Yan, Shan Jun; Gu, Cheng Yan; Ren, Ran; Liao, Kai Xin
2018-04-01
Nowadays encryption is a common technique to protect image data from unauthorized access. In recent years, many scientists have proposed various encryption algorithms based on DNA sequence to provide a new idea for the design of image encryption algorithm. Therefore, a new method of image encryption based on DNA computing technology is proposed in this paper, whose original image is encrypted by DNA coding and 1-D logistic chaotic mapping. First, the algorithm uses two modules as the encryption key. The first module uses the real DNA sequence, and the second module is made by one-dimensional logistic chaos mapping. Secondly, the algorithm uses DNA complementary rules to encode original image, and uses the key and DNA computing technology to compute each pixel value of the original image, so as to realize the encryption of the whole image. Simulation results show that the algorithm has good encryption effect and security.
Computational design of co-assembling protein-DNA nanowires
NASA Astrophysics Data System (ADS)
Mou, Yun; Yu, Jiun-Yann; Wannier, Timothy M.; Guo, Chin-Lin; Mayo, Stephen L.
2015-09-01
Biomolecular self-assemblies are of great interest to nanotechnologists because of their functional versatility and their biocompatibility. Over the past decade, sophisticated single-component nanostructures composed exclusively of nucleic acids, peptides and proteins have been reported, and these nanostructures have been used in a wide range of applications, from drug delivery to molecular computing. Despite these successes, the development of hybrid co-assemblies of nucleic acids and proteins has remained elusive. Here we use computational protein design to create a protein-DNA co-assembling nanomaterial whose assembly is driven via non-covalent interactions. To achieve this, a homodimerization interface is engineered onto the Drosophila Engrailed homeodomain (ENH), allowing the dimerized protein complex to bind to two double-stranded DNA (dsDNA) molecules. By varying the arrangement of protein-binding sites on the dsDNA, an irregular bulk nanoparticle or a nanowire with single-molecule width can be spontaneously formed by mixing the protein and dsDNA building blocks. We characterize the protein-DNA nanowire using fluorescence microscopy, atomic force microscopy and X-ray crystallography, confirming that the nanowire is formed via the proposed mechanism. This work lays the foundation for the development of new classes of protein-DNA hybrid materials. Further applications can be explored by incorporating DNA origami, DNA aptamers and/or peptide epitopes into the protein-DNA framework presented here.
Intrinsically bent DNA in replication origins and gene promoters.
Gimenes, F; Takeda, K I; Fiorini, A; Gouveia, F S; Fernandez, M A
2008-06-24
Intrinsically bent DNA is an alternative conformation of the DNA molecule caused by the presence of dA/dT tracts, 2 to 6 bp long, in a helical turn phase DNA or with multiple intervals of 10 to 11 bp. Other than flexibility, intrinsic bending sites induce DNA curvature in particular chromosome regions such as replication origins and promoters. Intrinsically bent DNA sites are important in initiating DNA replication, and are sometimes found near to regions associated with the nuclear matrix. Many methods have been developed to localize bent sites, for example, circular permutation, computational analysis, and atomic force microscopy. This review discusses intrinsically bent DNA sites associated with replication origins and gene promoter regions in prokaryote and eukaryote cells. We also describe methods for identifying bent DNA sites for circular permutation and computational analysis.
In vitro molecular machine learning algorithm via symmetric internal loops of DNA.
Lee, Ji-Hoon; Lee, Seung Hwan; Baek, Christina; Chun, Hyosun; Ryu, Je-Hwan; Kim, Jin-Woo; Deaton, Russell; Zhang, Byoung-Tak
2017-08-01
Programmable biomolecules, such as DNA strands, deoxyribozymes, and restriction enzymes, have been used to solve computational problems, construct large-scale logic circuits, and program simple molecular games. Although studies have shown the potential of molecular computing, the capability of computational learning with DNA molecules, i.e., molecular machine learning, has yet to be experimentally verified. Here, we present a novel molecular learning in vitro model in which symmetric internal loops of double-stranded DNA are exploited to measure the differences between training instances, thus enabling the molecules to learn from small errors. The model was evaluated on a data set of twenty dialogue sentences obtained from the television shows Friends and Prison Break. The wet DNA-computing experiments confirmed that the molecular learning machine was able to generalize the dialogue patterns of each show and successfully identify the show from which the sentences originated. The molecular machine learning model described here opens the way for solving machine learning problems in computer science and biology using in vitro molecular computing with the data encoded in DNA molecules. Copyright © 2017. Published by Elsevier B.V.
Solving traveling salesman problems with DNA molecules encoding numerical values.
Lee, Ji Youn; Shin, Soo-Yong; Park, Tai Hyun; Zhang, Byoung-Tak
2004-12-01
We introduce a DNA encoding method to represent numerical values and a biased molecular algorithm based on the thermodynamic properties of DNA. DNA strands are designed to encode real values by variation of their melting temperatures. The thermodynamic properties of DNA are used for effective local search of optimal solutions using biochemical techniques, such as denaturation temperature gradient polymerase chain reaction and temperature gradient gel electrophoresis. The proposed method was successfully applied to the traveling salesman problem, an instance of optimization problems on weighted graphs. This work extends the capability of DNA computing to solving numerical optimization problems, which is contrasted with other DNA computing methods focusing on logical problem solving.
The role of structural parameters in DNA cyclization
Alexandrov, Ludmil B.; Bishop, Alan R.; Rasmussen, Kim O.; ...
2016-02-04
The intrinsic bendability of DNA plays an important role with relevance for myriad of essential cellular mechanisms. The flexibility of a DNA fragment can be experimentally and computationally examined by its propensity for cyclization, quantified by the Jacobson-Stockmayer J factor. In this paper, we use a well-established coarse-grained three-dimensional model of DNA and seven distinct sets of experimentally and computationally derived conformational parameters of the double helix to evaluate the role of structural parameters in calculating DNA cyclization.
A programming language for composable DNA circuits
Phillips, Andrew; Cardelli, Luca
2009-01-01
Recently, a range of information-processing circuits have been implemented in DNA by using strand displacement as their main computational mechanism. Examples include digital logic circuits and catalytic signal amplification circuits that function as efficient molecular detectors. As new paradigms for DNA computation emerge, the development of corresponding languages and tools for these paradigms will help to facilitate the design of DNA circuits and their automatic compilation to nucleotide sequences. We present a programming language for designing and simulating DNA circuits in which strand displacement is the main computational mechanism. The language includes basic elements of sequence domains, toeholds and branch migration, and assumes that strands do not possess any secondary structure. The language is used to model and simulate a variety of circuits, including an entropy-driven catalytic gate, a simple gate motif for synthesizing large-scale circuits and a scheme for implementing an arbitrary system of chemical reactions. The language is a first step towards the design of modelling and simulation tools for DNA strand displacement, which complements the emergence of novel implementation strategies for DNA computing. PMID:19535415
A programming language for composable DNA circuits.
Phillips, Andrew; Cardelli, Luca
2009-08-06
Recently, a range of information-processing circuits have been implemented in DNA by using strand displacement as their main computational mechanism. Examples include digital logic circuits and catalytic signal amplification circuits that function as efficient molecular detectors. As new paradigms for DNA computation emerge, the development of corresponding languages and tools for these paradigms will help to facilitate the design of DNA circuits and their automatic compilation to nucleotide sequences. We present a programming language for designing and simulating DNA circuits in which strand displacement is the main computational mechanism. The language includes basic elements of sequence domains, toeholds and branch migration, and assumes that strands do not possess any secondary structure. The language is used to model and simulate a variety of circuits, including an entropy-driven catalytic gate, a simple gate motif for synthesizing large-scale circuits and a scheme for implementing an arbitrary system of chemical reactions. The language is a first step towards the design of modelling and simulation tools for DNA strand displacement, which complements the emergence of novel implementation strategies for DNA computing.
Molecular Sticker Model Stimulation on Silicon for a Maximum Clique Problem
Ning, Jianguo; Li, Yanmei; Yu, Wen
2015-01-01
Molecular computers (also called DNA computers), as an alternative to traditional electronic computers, are smaller in size but more energy efficient, and have massive parallel processing capacity. However, DNA computers may not outperform electronic computers owing to their higher error rates and some limitations of the biological laboratory. The stickers model, as a typical DNA-based computer, is computationally complete and universal, and can be viewed as a bit-vertically operating machine. This makes it attractive for silicon implementation. Inspired by the information processing method on the stickers computer, we propose a novel parallel computing model called DEM (DNA Electronic Computing Model) on System-on-a-Programmable-Chip (SOPC) architecture. Except for the significant difference in the computing medium—transistor chips rather than bio-molecules—the DEM works similarly to DNA computers in immense parallel information processing. Additionally, a plasma display panel (PDP) is used to show the change of solutions, and helps us directly see the distribution of assignments. The feasibility of the DEM is tested by applying it to compute a maximum clique problem (MCP) with eight vertices. Owing to the limited computing sources on SOPC architecture, the DEM could solve moderate-size problems in polynomial time. PMID:26075867
Genomic signal processing methods for computation of alignment-free distances from DNA sequences.
Borrayo, Ernesto; Mendizabal-Ruiz, E Gerardo; Vélez-Pérez, Hugo; Romo-Vázquez, Rebeca; Mendizabal, Adriana P; Morales, J Alejandro
2014-01-01
Genomic signal processing (GSP) refers to the use of digital signal processing (DSP) tools for analyzing genomic data such as DNA sequences. A possible application of GSP that has not been fully explored is the computation of the distance between a pair of sequences. In this work we present GAFD, a novel GSP alignment-free distance computation method. We introduce a DNA sequence-to-signal mapping function based on the employment of doublet values, which increases the number of possible amplitude values for the generated signal. Additionally, we explore the use of three DSP distance metrics as descriptors for categorizing DNA signal fragments. Our results indicate the feasibility of employing GAFD for computing sequence distances and the use of descriptors for characterizing DNA fragments.
Genomic Signal Processing Methods for Computation of Alignment-Free Distances from DNA Sequences
Borrayo, Ernesto; Mendizabal-Ruiz, E. Gerardo; Vélez-Pérez, Hugo; Romo-Vázquez, Rebeca; Mendizabal, Adriana P.; Morales, J. Alejandro
2014-01-01
Genomic signal processing (GSP) refers to the use of digital signal processing (DSP) tools for analyzing genomic data such as DNA sequences. A possible application of GSP that has not been fully explored is the computation of the distance between a pair of sequences. In this work we present GAFD, a novel GSP alignment-free distance computation method. We introduce a DNA sequence-to-signal mapping function based on the employment of doublet values, which increases the number of possible amplitude values for the generated signal. Additionally, we explore the use of three DSP distance metrics as descriptors for categorizing DNA signal fragments. Our results indicate the feasibility of employing GAFD for computing sequence distances and the use of descriptors for characterizing DNA fragments. PMID:25393409
Magro, Massimiliano; Martinello, Tiziana; Bonaiuto, Emanuela; Gomiero, Chiara; Baratella, Davide; Zoppellaro, Giorgio; Cozza, Giorgio; Patruno, Marco; Zboril, Radek; Vianello, Fabio
2017-11-01
Conversely to common coated iron oxide nanoparticles, novel naked surface active maghemite nanoparticles (SAMNs) can covalently bind DNA. Plasmid (pDNA) harboring the coding gene for GFP was directly chemisorbed onto SAMNs, leading to a novel DNA nanovector (SAMN@pDNA). The spontaneous internalization of SAMN@pDNA into cells was compared with an extensively studied fluorescent SAMN derivative (SAMN@RITC). Moreover, the transfection efficiency of SAMN@pDNA was evaluated and explained by computational model. SAMN@pDNA was prepared and characterized by spectroscopic and computational methods, and molecular dynamic simulation. The size and hydrodynamic properties of SAMN@pDNA and SAMN@RITC were studied by electron transmission microscopy, light scattering and zeta-potential. The two nanomaterials were tested by confocal scanning microscopy on equine peripheral blood-derived mesenchymal stem cells (ePB-MSCs) and GFP expression by SAMN@pDNA was determined. Nanomaterials characterized by similar hydrodynamic properties were successfully internalized and stored into mesenchymal stem cells. Transfection by SAMN@pDNA occurred and GFP expression was higher than lipofectamine procedure, even in the absence of an external magnetic field. A computational model clarified that transfection efficiency can be ascribed to DNA availability inside cells. Direct covalent binding of DNA on naked magnetic nanoparticles led to an extremely robust gene delivery tool. Hydrodynamic and chemical-physical properties of SAMN@pDNA were responsible of the successful uptake by cells and of the efficiency of GFP gene transfection. SAMNs are characterized by colloidal stability, excellent cell uptake, persistence in the host cells, low toxicity and are proposed as novel intelligent DNA nanovectors for efficient cell transfection. Copyright © 2017 Elsevier B.V. All rights reserved.
21st International Conference on DNA Computing and Molecular Programming: 8.1 Biochemistry
include information storage and biological applications of DNA systems, biomolecular chemical reaction networks, applications of self -assembled DNA...nanostructures, tile self -assembly and computation, principles and models of self -assembly, and strand displacement and biomolecular circuits. The fund
Shahabadi, Nahid; Pourfoulad, Mehdi; Moghadam, Neda Hosseinpour
2017-01-02
DNA-binding properties of an antiviral drug, valganciclovir (valcyte) was studied by using emission, absorption, circular dichroism, viscosity, differential pulse voltammetry, fluorescence techniques, and computational studies. The drug bound to calf thymus DNA (ct-DNA) in a groove-binding mode. The calculated binding constant of UV-vis, K a , is comparable to groove-binding drugs. Competitive fluorimetric studies with Hoechst 33258 showed that valcyte could displace the DNA-bound Hoechst 33258. The drug could not displace intercalated methylene blue from DNA double helix. Furthermore, the induced detectable changes in the CD spectrum of ct-DNA as well as changes in its viscosity confirm the groove-binding mode. In addition, an integrated molecular docking was employed to further investigate the binding interactions between valcyte and calf thymus DNA.
DENA: A Configurable Microarchitecture and Design Flow for Biomedical DNA-Based Logic Design.
Beiki, Zohre; Jahanian, Ali
2017-10-01
DNA is known as the building block for storing the life codes and transferring the genetic features through the generations. However, it is found that DNA strands can be used for a new type of computation that opens fascinating horizons in computational medicine. Significant contributions are addressed on design of DNA-based logic gates for medical and computational applications but there are serious challenges for designing the medium and large-scale DNA circuits. In this paper, a new microarchitecture and corresponding design flow is proposed to facilitate the design of multistage large-scale DNA logic systems. Feasibility and efficiency of the proposed microarchitecture are evaluated by implementing a full adder and, then, its cascadability is determined by implementing a multistage 8-bit adder. Simulation results show the highlight features of the proposed design style and microarchitecture in terms of the scalability, implementation cost, and signal integrity of the DNA-based logic system compared to the traditional approaches.
An affinity-structure database of helix-turn-helix: DNA complexes with a universal coordinate system
DOE Office of Scientific and Technical Information (OSTI.GOV)
AlQuraishi, Mohammed; Tang, Shengdong; Xia, Xide
Molecular interactions between proteins and DNA molecules underlie many cellular processes, including transcriptional regulation, chromosome replication, and nucleosome positioning. Computational analyses of protein-DNA interactions rely on experimental data characterizing known protein-DNA interactions structurally and biochemically. While many databases exist that contain either structural or biochemical data, few integrate these two data sources in a unified fashion. Such integration is becoming increasingly critical with the rapid growth of structural and biochemical data, and the emergence of algorithms that rely on the synthesis of multiple data types to derive computational models of molecular interactions. We have developed an integrated affinity-structure database inmore » which the experimental and quantitative DNA binding affinities of helix-turn-helix proteins are mapped onto the crystal structures of the corresponding protein-DNA complexes. This database provides access to: (i) protein-DNA structures, (ii) quantitative summaries of protein-DNA binding affinities using position weight matrices, and (iii) raw experimental data of protein-DNA binding instances. Critically, this database establishes a correspondence between experimental structural data and quantitative binding affinity data at the single basepair level. Furthermore, we present a novel alignment algorithm that structurally aligns the protein-DNA complexes in the database and creates a unified residue-level coordinate system for comparing the physico-chemical environments at the interface between complexes. Using this unified coordinate system, we compute the statistics of atomic interactions at the protein-DNA interface of helix-turn-helix proteins. We provide an interactive website for visualization, querying, and analyzing this database, and a downloadable version to facilitate programmatic analysis. Lastly, this database will facilitate the analysis of protein-DNA interactions and the development of programmatic computational methods that capitalize on integration of structural and biochemical datasets. The database can be accessed at http://ProteinDNA.hms.harvard.edu.« less
An affinity-structure database of helix-turn-helix: DNA complexes with a universal coordinate system
AlQuraishi, Mohammed; Tang, Shengdong; Xia, Xide
2015-11-19
Molecular interactions between proteins and DNA molecules underlie many cellular processes, including transcriptional regulation, chromosome replication, and nucleosome positioning. Computational analyses of protein-DNA interactions rely on experimental data characterizing known protein-DNA interactions structurally and biochemically. While many databases exist that contain either structural or biochemical data, few integrate these two data sources in a unified fashion. Such integration is becoming increasingly critical with the rapid growth of structural and biochemical data, and the emergence of algorithms that rely on the synthesis of multiple data types to derive computational models of molecular interactions. We have developed an integrated affinity-structure database inmore » which the experimental and quantitative DNA binding affinities of helix-turn-helix proteins are mapped onto the crystal structures of the corresponding protein-DNA complexes. This database provides access to: (i) protein-DNA structures, (ii) quantitative summaries of protein-DNA binding affinities using position weight matrices, and (iii) raw experimental data of protein-DNA binding instances. Critically, this database establishes a correspondence between experimental structural data and quantitative binding affinity data at the single basepair level. Furthermore, we present a novel alignment algorithm that structurally aligns the protein-DNA complexes in the database and creates a unified residue-level coordinate system for comparing the physico-chemical environments at the interface between complexes. Using this unified coordinate system, we compute the statistics of atomic interactions at the protein-DNA interface of helix-turn-helix proteins. We provide an interactive website for visualization, querying, and analyzing this database, and a downloadable version to facilitate programmatic analysis. Lastly, this database will facilitate the analysis of protein-DNA interactions and the development of programmatic computational methods that capitalize on integration of structural and biochemical datasets. The database can be accessed at http://ProteinDNA.hms.harvard.edu.« less
A novel image encryption algorithm based on the chaotic system and DNA computing
NASA Astrophysics Data System (ADS)
Chai, Xiuli; Gan, Zhihua; Lu, Yang; Chen, Yiran; Han, Daojun
A novel image encryption algorithm using the chaotic system and deoxyribonucleic acid (DNA) computing is presented. Different from the traditional encryption methods, the permutation and diffusion of our method are manipulated on the 3D DNA matrix. Firstly, a 3D DNA matrix is obtained through bit plane splitting, bit plane recombination, DNA encoding of the plain image. Secondly, 3D DNA level permutation based on position sequence group (3DDNALPBPSG) is introduced, and chaotic sequences generated from the chaotic system are employed to permutate the positions of the elements of the 3D DNA matrix. Thirdly, 3D DNA level diffusion (3DDNALD) is given, the confused 3D DNA matrix is split into sub-blocks, and XOR operation by block is manipulated to the sub-DNA matrix and the key DNA matrix from the chaotic system. At last, by decoding the diffused DNA matrix, we get the cipher image. SHA 256 hash of the plain image is employed to calculate the initial values of the chaotic system to avoid chosen plaintext attack. Experimental results and security analyses show that our scheme is secure against several known attacks, and it can effectively protect the security of the images.
A Hybrid Computer Simulation to Generate the DNA Distribution of a Cell Population.
ERIC Educational Resources Information Center
Griebling, John L.; Adams, William S.
1981-01-01
Described is a method of simulating the formation of a DNA distribution, on which statistical results and experimentally measured parameters from DNA distribution and percent-labeled mitosis studies are combined. An EAI-680 and DECSystem-10 Hybrid Computer configuration are used. (Author/CS)
Evaluating the role of coherent delocalized phonon-like modes in DNA cyclization
Alexandrov, Ludmil B.; Rasmussen, Kim Ã.; Bishop, Alan R.; ...
2017-08-29
The innate flexibility of a DNA sequence is quantified by the Jacobson-Stockmayer’s J-factor, which measures the propensity for DNA loop formation. Recent studies of ultra-short DNA sequences revealed a discrepancy of up to six orders of magnitude between experimentally measured and theoretically predicted J-factors. These large differences suggest that, in addition to the elastic moduli of the double helix, other factors contribute to loop formation. We develop a new theoretical model that explores how coherent delocalized phonon-like modes in DNA provide single-stranded ”flexible hinges” to assist in loop formation. We also combine the Czapla-Swigon-Olson structural model of DNA with ourmore » extended Peyrard-Bishop-Dauxois model and, without changing any of the parameters of the two models, apply this new computational framework to 86 experimentally characterized DNA sequences. Our results demonstrate that the new computational framework can predict J-factors within an order of magnitude of experimental measurements for most ultra-short DNA sequences, while continuing to accurately describe the J-factors of longer sequences. Furthermore, we demonstrate that our computational framework can be used to describe the cyclization of DNA sequences that contain a base pair mismatch. Overall, our results support the conclusion that coherent delocalized phonon-like modes play an important role in DNA cyclization.« less
Evaluating the role of coherent delocalized phonon-like modes in DNA cyclization
DOE Office of Scientific and Technical Information (OSTI.GOV)
Alexandrov, Ludmil B.; Rasmussen, Kim Ã.; Bishop, Alan R.
The innate flexibility of a DNA sequence is quantified by the Jacobson-Stockmayer’s J-factor, which measures the propensity for DNA loop formation. Recent studies of ultra-short DNA sequences revealed a discrepancy of up to six orders of magnitude between experimentally measured and theoretically predicted J-factors. These large differences suggest that, in addition to the elastic moduli of the double helix, other factors contribute to loop formation. We develop a new theoretical model that explores how coherent delocalized phonon-like modes in DNA provide single-stranded ”flexible hinges” to assist in loop formation. We also combine the Czapla-Swigon-Olson structural model of DNA with ourmore » extended Peyrard-Bishop-Dauxois model and, without changing any of the parameters of the two models, apply this new computational framework to 86 experimentally characterized DNA sequences. Our results demonstrate that the new computational framework can predict J-factors within an order of magnitude of experimental measurements for most ultra-short DNA sequences, while continuing to accurately describe the J-factors of longer sequences. Furthermore, we demonstrate that our computational framework can be used to describe the cyclization of DNA sequences that contain a base pair mismatch. Overall, our results support the conclusion that coherent delocalized phonon-like modes play an important role in DNA cyclization.« less
Kilina, Svetlana; Yarotski, Dzmitry A.; Talin, A. Alec; ...
2011-01-01
We present a combined approach that relies on computational simulations and scanning tunneling microscopy (STM) measurements to reveal morphological properties and stability criteria of carbon nanotube-DNA (CNT-DNA) constructs. Application of STM allows direct observation of very stable CNT-DNA hybrid structures with the well-defined DNA wrapping angle of 63.4 ° and a coiling period of 3.3 nm. Using force field simulations, we determine how the DNA-CNT binding energy depends on the sequence and binding geometry of a single strand DNA. This dependence allows us to quantitatively characterize the stability of a hybrid structure with an optimal π-stacking between DNA nucleotides and themore » tube surface and better interpret STM data. Our simulations clearly demonstrate the existence of a very stable DNA binding geometry for (6,5) CNT as evidenced by the presence of a well-defined minimum in the binding energy as a function of an angle between DNA strand and the nanotube chiral vector. This novel approach demonstrates the feasibility of CNT-DNA geometry studies with subnanometer resolution and paves the way towards complete characterization of the structural and electronic properties of drug-delivering systems based on DNA-CNT hybrids as a function of DNA sequence and a nanotube chirality.« less
Hu, Yue-Qing; Fung, Wing K
2003-08-01
The effect of a structured population on the likelihood ratio of a DNA mixture has been studied by the current authors and others. In practice, contributors of a DNA mixture may belong to different ethnic/racial origins, a situation especially common in multi-racial countries such as the USA and Singapore. We have developed a computer software which is available on the web for evaluating DNA mixtures in multi-structured populations. The software can deal with various DNA mixture problems that cannot be handled by the methods given in a recent article of Fung and Hu.
Johnston, Iain G; Burgstaller, Joerg P; Havlicek, Vitezslav; Kolbe, Thomas; Rülicke, Thomas; Brem, Gottfried; Poulton, Jo; Jones, Nick S
2015-01-01
Dangerous damage to mitochondrial DNA (mtDNA) can be ameliorated during mammalian development through a highly debated mechanism called the mtDNA bottleneck. Uncertainty surrounding this process limits our ability to address inherited mtDNA diseases. We produce a new, physically motivated, generalisable theoretical model for mtDNA populations during development, allowing the first statistical comparison of proposed bottleneck mechanisms. Using approximate Bayesian computation and mouse data, we find most statistical support for a combination of binomial partitioning of mtDNAs at cell divisions and random mtDNA turnover, meaning that the debated exact magnitude of mtDNA copy number depletion is flexible. New experimental measurements from a wild-derived mtDNA pairing in mice confirm the theoretical predictions of this model. We analytically solve a mathematical description of this mechanism, computing probabilities of mtDNA disease onset, efficacy of clinical sampling strategies, and effects of potential dynamic interventions, thus developing a quantitative and experimentally-supported stochastic theory of the bottleneck. DOI: http://dx.doi.org/10.7554/eLife.07464.001 PMID:26035426
Simultaneous G-Quadruplex DNA Logic.
Bader, Antoine; Cockroft, Scott L
2018-04-03
A fundamental principle of digital computer operation is Boolean logic, where inputs and outputs are described by binary integer voltages. Similarly, inputs and outputs may be processed on the molecular level as exemplified by synthetic circuits that exploit the programmability of DNA base-pairing. Unlike modern computers, which execute large numbers of logic gates in parallel, most implementations of molecular logic have been limited to single computing tasks, or sensing applications. This work reports three G-quadruplex-based logic gates that operate simultaneously in a single reaction vessel. The gates respond to unique Boolean DNA inputs by undergoing topological conversion from duplex to G-quadruplex states that were resolved using a thioflavin T dye and gel electrophoresis. The modular, addressable, and label-free approach could be incorporated into DNA-based sensors, or used for resolving and debugging parallel processes in DNA computing applications. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Programmable DNA-Mediated Multitasking Processor.
Shu, Jian-Jun; Wang, Qi-Wen; Yong, Kian-Yan; Shao, Fangwei; Lee, Kee Jin
2015-04-30
Because of DNA appealing features as perfect material, including minuscule size, defined structural repeat and rigidity, programmable DNA-mediated processing is a promising computing paradigm, which employs DNAs as information storing and processing substrates to tackle the computational problems. The massive parallelism of DNA hybridization exhibits transcendent potential to improve multitasking capabilities and yield a tremendous speed-up over the conventional electronic processors with stepwise signal cascade. As an example of multitasking capability, we present an in vitro programmable DNA-mediated optimal route planning processor as a functional unit embedded in contemporary navigation systems. The novel programmable DNA-mediated processor has several advantages over the existing silicon-mediated methods, such as conducting massive data storage and simultaneous processing via much fewer materials than conventional silicon devices.
Markov chains: computing limit existence and approximations with DNA.
Cardona, M; Colomer, M A; Conde, J; Miret, J M; Miró, J; Zaragoza, A
2005-09-01
We present two algorithms to perform computations over Markov chains. The first one determines whether the sequence of powers of the transition matrix of a Markov chain converges or not to a limit matrix. If it does converge, the second algorithm enables us to estimate this limit. The combination of these algorithms allows the computation of a limit using DNA computing. In this sense, we have encoded the states and the transition probabilities using strands of DNA for generating paths of the Markov chain.
Solving satisfiability problems using a novel microarray-based DNA computer.
Lin, Che-Hsin; Cheng, Hsiao-Ping; Yang, Chang-Biau; Yang, Chia-Ning
2007-01-01
An algorithm based on a modified sticker model accompanied with an advanced MEMS-based microarray technology is demonstrated to solve SAT problem, which has long served as a benchmark in DNA computing. Unlike conventional DNA computing algorithms needing an initial data pool to cover correct and incorrect answers and further executing a series of separation procedures to destroy the unwanted ones, we built solutions in parts to satisfy one clause in one step, and eventually solve the entire Boolean formula through steps. No time-consuming sample preparation procedures and delicate sample applying equipment were required for the computing process. Moreover, experimental results show the bound DNA sequences can sustain the chemical solutions during computing processes such that the proposed method shall be useful in dealing with large-scale problems.
Myers, E W; Mount, D W
1986-01-01
We describe a program which may be used to find approximate matches to a short predefined DNA sequence in a larger target DNA sequence. The program predicts the usefulness of specific DNA probes and sequencing primers and finds nearly identical sequences that might represent the same regulatory signal. The program is written in the C programming language and will run on virtually any computer system with a C compiler, such as the IBM/PC and other computers running under the MS/DOS and UNIX operating systems. The program has been integrated into an existing software package for the IBM personal computer (see article by Mount and Conrad, this volume). Some examples of its use are given. PMID:3753785
A detailed experimental study of a DNA computer with two endonucleases.
Sakowski, Sebastian; Krasiński, Tadeusz; Sarnik, Joanna; Blasiak, Janusz; Waldmajer, Jacek; Poplawski, Tomasz
2017-07-14
Great advances in biotechnology have allowed the construction of a computer from DNA. One of the proposed solutions is a biomolecular finite automaton, a simple two-state DNA computer without memory, which was presented by Ehud Shapiro's group at the Weizmann Institute of Science. The main problem with this computer, in which biomolecules carry out logical operations, is its complexity - increasing the number of states of biomolecular automata. In this study, we constructed (in laboratory conditions) a six-state DNA computer that uses two endonucleases (e.g. AcuI and BbvI) and a ligase. We have presented a detailed experimental verification of its feasibility. We described the effect of the number of states, the length of input data, and the nondeterminism on the computing process. We also tested different automata (with three, four, and six states) running on various accepted input words of different lengths such as ab, aab, aaab, ababa, and of an unaccepted word ba. Moreover, this article presents the reaction optimization and the methods of eliminating certain biochemical problems occurring in the implementation of a biomolecular DNA automaton based on two endonucleases.
The Ins and Outs of DNA Fingerprinting the Infectious Fungi
Soll, David R.
2000-01-01
DNA fingerprinting methods have evolved as major tools in fungal epidemiology. However, no single method has emerged as the method of choice, and some methods perform better than others at different levels of resolution. In this review, requirements for an effective DNA fingerprinting method are proposed and procedures are described for testing the efficacy of a method. In light of the proposed requirements, the most common methods now being used to DNA fingerprint the infectious fungi are described and assessed. These methods include restriction fragment length polymorphisms (RFLP), RFLP with hybridization probes, randomly amplified polymorphic DNA and other PCR-based methods, electrophoretic karyotyping, and sequencing-based methods. Procedures for computing similarity coefficients, generating phylogenetic trees, and testing the stability of clusters are then described. To facilitate the analysis of DNA fingerprinting data, computer-assisted methods are described. Finally, the problems inherent in the collection of test and control isolates are considered, and DNA fingerprinting studies of strain maintenance during persistent or recurrent infections, microevolution in infecting strains, and the origin of nosocomial infections are assessed in light of the preceding discussion of the ins and outs of DNA fingerprinting. The intent of this review is to generate an awareness of the need to verify the efficacy of each DNA fingerprinting method for the level of genetic relatedness necessary to answer the epidemiological question posed, to use quantitative methods to analyze DNA fingerprint data, to use computer-assisted DNA fingerprint analysis systems to analyze data, and to file data in a form that can be used in the future for retrospective and comparative studies. PMID:10756003
Testing the Use of Implicit Solvent in the Molecular Dynamics Modelling of DNA Flexibility
NASA Astrophysics Data System (ADS)
Mitchell, J.; Harris, S.
DNA flexibility controls packaging, looping and in some cases sequence specific protein binding. Molecular dynamics simulations carried out with a computationally efficient implicit solvent model are potentially a powerful tool for studying larger DNA molecules than can be currently simulated when water and counterions are represented explicitly. In this work we compare DNA flexibility at the base pair step level modelled using an implicit solvent model to that previously determined from explicit solvent simulations and database analysis. Although much of the sequence dependent behaviour is preserved in implicit solvent, the DNA is considerably more flexible when the approximate model is used. In addition we test the ability of the implicit solvent to model stress induced DNA disruptions by simulating a series of DNA minicircle topoisomers which vary in size and superhelical density. When compared with previously run explicit solvent simulations, we find that while the levels of DNA denaturation are similar using both computational methodologies, the specific structural form of the disruptions is different.
DNA context represents transcription regulation of the gene in mouse embryonic stem cells
NASA Astrophysics Data System (ADS)
Ha, Misook; Hong, Soondo
2016-04-01
Understanding gene regulatory information in DNA remains a significant challenge in biomedical research. This study presents a computational approach to infer gene regulatory programs from primary DNA sequences. Using DNA around transcription start sites as attributes, our model predicts gene regulation in the gene. We find that H3K27ac around TSS is an informative descriptor of the transcription program in mouse embryonic stem cells. We build a computational model inferring the cell-type-specific H3K27ac signatures in the DNA around TSS. A comparison of embryonic stem cell and liver cell-specific H3K27ac signatures in DNA shows that the H3K27ac signatures in DNA around TSS efficiently distinguish the cell-type specific H3K27ac peaks and the gene regulation. The arrangement of the H3K27ac signatures inferred from the DNA represents the transcription regulation of the gene in mESC. We show that the DNA around transcription start sites is associated with the gene regulatory program by specific interaction with H3K27ac.
DNA context represents transcription regulation of the gene in mouse embryonic stem cells.
Ha, Misook; Hong, Soondo
2016-04-14
Understanding gene regulatory information in DNA remains a significant challenge in biomedical research. This study presents a computational approach to infer gene regulatory programs from primary DNA sequences. Using DNA around transcription start sites as attributes, our model predicts gene regulation in the gene. We find that H3K27ac around TSS is an informative descriptor of the transcription program in mouse embryonic stem cells. We build a computational model inferring the cell-type-specific H3K27ac signatures in the DNA around TSS. A comparison of embryonic stem cell and liver cell-specific H3K27ac signatures in DNA shows that the H3K27ac signatures in DNA around TSS efficiently distinguish the cell-type specific H3K27ac peaks and the gene regulation. The arrangement of the H3K27ac signatures inferred from the DNA represents the transcription regulation of the gene in mESC. We show that the DNA around transcription start sites is associated with the gene regulatory program by specific interaction with H3K27ac.
New Trends of Digital Data Storage in DNA
2016-01-01
With the exponential growth in the capacity of information generated and the emerging need for data to be stored for prolonged period of time, there emerges a need for a storage medium with high capacity, high storage density, and possibility to withstand extreme environmental conditions. DNA emerges as the prospective medium for data storage with its striking features. Diverse encoding models for reading and writing data onto DNA, codes for encrypting data which addresses issues of error generation, and approaches for developing codons and storage styles have been developed over the recent past. DNA has been identified as a potential medium for secret writing, which achieves the way towards DNA cryptography and stenography. DNA utilized as an organic memory device along with big data storage and analytics in DNA has paved the way towards DNA computing for solving computational problems. This paper critically analyzes the various methods used for encoding and encrypting data onto DNA while identifying the advantages and capability of every scheme to overcome the drawbacks identified priorly. Cryptography and stenography techniques have been analyzed in a critical approach while identifying the limitations of each method. This paper also identifies the advantages and limitations of DNA as a memory device and memory applications. PMID:27689089
New Trends of Digital Data Storage in DNA.
De Silva, Pavani Yashodha; Ganegoda, Gamage Upeksha
With the exponential growth in the capacity of information generated and the emerging need for data to be stored for prolonged period of time, there emerges a need for a storage medium with high capacity, high storage density, and possibility to withstand extreme environmental conditions. DNA emerges as the prospective medium for data storage with its striking features. Diverse encoding models for reading and writing data onto DNA, codes for encrypting data which addresses issues of error generation, and approaches for developing codons and storage styles have been developed over the recent past. DNA has been identified as a potential medium for secret writing, which achieves the way towards DNA cryptography and stenography. DNA utilized as an organic memory device along with big data storage and analytics in DNA has paved the way towards DNA computing for solving computational problems. This paper critically analyzes the various methods used for encoding and encrypting data onto DNA while identifying the advantages and capability of every scheme to overcome the drawbacks identified priorly. Cryptography and stenography techniques have been analyzed in a critical approach while identifying the limitations of each method. This paper also identifies the advantages and limitations of DNA as a memory device and memory applications.
Wormlike Chain Theory and Bending of Short DNA
NASA Astrophysics Data System (ADS)
Mazur, Alexey K.
2007-05-01
The probability distributions for bending angles in double helical DNA obtained in all-atom molecular dynamics simulations are compared with theoretical predictions. The computed distributions remarkably agree with the wormlike chain theory and qualitatively differ from predictions of the subelastic chain model. The computed data exhibit only small anomalies in the apparent flexibility of short DNA and cannot account for the recently reported AFM data. It is possible that the current atomistic DNA models miss some essential mechanisms of DNA bending on intermediate length scales. Analysis of bent DNA structures reveal, however, that the bending motion is structurally heterogeneous and directionally anisotropic on the length scales where the experimental anomalies were detected. These effects are essential for interpretation of the experimental data and they also can be responsible for the apparent discrepancy.
Wang, Zhaocai; Ji, Zuwen; Wang, Xiaoming; Wu, Tunhua; Huang, Wei
2017-12-01
As a promising approach to solve the computationally intractable problem, the method based on DNA computing is an emerging research area including mathematics, computer science and molecular biology. The task scheduling problem, as a well-known NP-complete problem, arranges n jobs to m individuals and finds the minimum execution time of last finished individual. In this paper, we use a biologically inspired computational model and describe a new parallel algorithm to solve the task scheduling problem by basic DNA molecular operations. In turn, we skillfully design flexible length DNA strands to represent elements of the allocation matrix, take appropriate biological experiment operations and get solutions of the task scheduling problem in proper length range with less than O(n 2 ) time complexity. Copyright © 2017. Published by Elsevier B.V.
Comprehensive restriction enzyme lists to update any DNA sequence computer program.
Raschke, E
1993-04-01
Restriction enzyme lists are presented for the practical working geneticist to update any DNA computer program. These lists combine formerly scattered information and contain all presently known restriction enzymes with a unique recognition sequence, a cut site, or methylation (in)sensitivity. The lists are in the shortest possible form to also be functional with small DNA computer programs, and will produce clear restriction maps without any redundancy or loss of information. The lists discern between commercial and noncommercial enzymes, and prototype enzymes and different isoschizomers are cross-referenced. Differences in general methylation sensitivities and (in)sensitivities against Dam and Dcm methylases of Escherichia coli are indicated. Commercial methylases and intron-encoded endonucleases are included. An address list is presented to contact commercial suppliers. The lists are constantly updated and available in electronic form as pure US ASCII files, and in formats for the DNA computer programs DNA-Strider for Apple Macintosh, and DNAsis for IBM personal computers or compatibles via e-mail from the internet address: NETSERV@EMBL-HEIDELBERG.DE by sending only the message HELP RELIBRARY.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tamrin, Mohd Izzuddin Mohd; Turaev, Sherzod; Sembok, Tengku Mohd Tengku
There are tremendous works in biotechnology especially in area of DNA molecules. The computer society is attempting to develop smaller computing devices through computational models which are based on the operations performed on the DNA molecules. A Watson-Crick automaton, a theoretical model for DNA based computation, has two reading heads, and works on double-stranded sequences of the input related by a complementarity relation similar with the Watson-Crick complementarity of DNA nucleotides. Over the time, several variants of Watson-Crick automata have been introduced and investigated. However, they cannot be used as suitable DNA based computational models for molecular stochastic processes andmore » fuzzy processes that are related to important practical problems such as molecular parsing, gene disease detection, and food authentication. In this paper we define new variants of Watson-Crick automata, called weighted Watson-Crick automata, developing theoretical models for molecular stochastic and fuzzy processes. We define weighted Watson-Crick automata adapting weight restriction mechanisms associated with formal grammars and automata. We also study the generative capacities of weighted Watson-Crick automata, including probabilistic and fuzzy variants. We show that weighted variants of Watson-Crick automata increase their generative power.« less
Weighted Watson-Crick automata
NASA Astrophysics Data System (ADS)
Tamrin, Mohd Izzuddin Mohd; Turaev, Sherzod; Sembok, Tengku Mohd Tengku
2014-07-01
There are tremendous works in biotechnology especially in area of DNA molecules. The computer society is attempting to develop smaller computing devices through computational models which are based on the operations performed on the DNA molecules. A Watson-Crick automaton, a theoretical model for DNA based computation, has two reading heads, and works on double-stranded sequences of the input related by a complementarity relation similar with the Watson-Crick complementarity of DNA nucleotides. Over the time, several variants of Watson-Crick automata have been introduced and investigated. However, they cannot be used as suitable DNA based computational models for molecular stochastic processes and fuzzy processes that are related to important practical problems such as molecular parsing, gene disease detection, and food authentication. In this paper we define new variants of Watson-Crick automata, called weighted Watson-Crick automata, developing theoretical models for molecular stochastic and fuzzy processes. We define weighted Watson-Crick automata adapting weight restriction mechanisms associated with formal grammars and automata. We also study the generative capacities of weighted Watson-Crick automata, including probabilistic and fuzzy variants. We show that weighted variants of Watson-Crick automata increase their generative power.
Standard atomic volumes in double-stranded DNA and packing in protein–DNA interfaces
Nadassy, Katalin; Tomás-Oliveira, Isabel; Alberts, Ian; Janin, Joël; Wodak, Shoshana J.
2001-01-01
Standard volumes for atoms in double-stranded B-DNA are derived using high resolution crystal structures from the Nucleic Acid Database (NDB) and compared with corresponding values derived from crystal structures of small organic compounds in the Cambridge Structural Database (CSD). Two different methods are used to compute these volumes: the classical Voronoi method, which does not depend on the size of atoms, and the related Radical Planes method which does. Results show that atomic groups buried in the interior of double-stranded DNA are, on average, more tightly packed than in related small molecules in the CSD. The packing efficiency of DNA atoms at the interfaces of 25 high resolution protein–DNA complexes is determined by computing the ratios between the volumes of interfacial DNA atoms and the corresponding standard volumes. These ratios are found to be close to unity, indicating that the DNA atoms at protein–DNA interfaces are as closely packed as in crystals of B-DNA. Analogous volume ratios, computed for buried protein atoms, are also near unity, confirming our earlier conclusions that the packing efficiency of these atoms is similar to that in the protein interior. In addition, we examine the number, volume and solvent occupation of cavities located at the protein–DNA interfaces and compared them with those in the protein interior. Cavities are found to be ubiquitous in the interfaces as well as inside the protein moieties. The frequency of solvent occupation of cavities is however higher in the interfaces, indicating that those are more hydrated than protein interiors. Lastly, we compare our results with those obtained using two different measures of shape complementarity of the analysed interfaces, and find that the correlation between our volume ratios and these measures, as well as between the measures themselves, is weak. Our results indicate that a tightly packed environment made up of DNA, protein and solvent atoms plays a significant role in protein–DNA recognition. PMID:11504874
NASA Astrophysics Data System (ADS)
Al-Otaibi, Jamelah S.; Teesdale Spittle, Paul; El Gogary, Tarek M.
2017-01-01
Anthraquinones form the basis of several anticancer drugs. Anthraquinones anticancer drugs carry out their cytotoxic activities through their interaction with DNA, and inhibition of topoisomerase II activity. Anthraquinones (AQ4 and AQ4H) were synthesized and studied along with 1,4-DAAQ by computational and experimental tools. The purpose of this study is to shade more light on mechanism of interaction between anthraquinone DNA affinic agents and different types of DNA. This study will lead to gain of information useful for drug design and development. Molecular structures were optimized using DFT B3LYP/6-31 + G(d). Depending on intramolecular hydrogen bonding interactions two conformers of AQ4 were detected and computed as 25.667 kcal/mol apart. Molecular reactivity of the anthraquinone compounds was explored using global and condensed descriptors (electrophilicity and Fukui functions). Molecular docking studies for the inhibition of CDK2 and DNA binding were carried out to explore the anti cancer potency of these drugs. NMR and UV-VIS electronic absorption spectra of anthraquinones/DNA were investigated at the physiological pH. The interaction of the three anthraquinones (AQ4, AQ4H and 1,4-DAAQ) were studied with three DNA (calf thymus DNA, (Poly[dA].Poly[dT]) and (Poly[dG].Poly[dC]). NMR study shows a qualitative pattern of drug/DNA interaction in terms of band shift and broadening. UV-VIS electronic absorption spectra were employed to measure the affinity constants of drug/DNA binding using Scatchard analysis.
Powering the programmed nanostructure and function of gold nanoparticles with catenated DNA machines
NASA Astrophysics Data System (ADS)
Elbaz, Johann; Cecconello, Alessandro; Fan, Zhiyuan; Govorov, Alexander O.; Willner, Itamar
2013-06-01
DNA nanotechnology is a rapidly developing research area in nanoscience. It includes the development of DNA machines, tailoring of DNA nanostructures, application of DNA nanostructures for computing, and more. Different DNA machines were reported in the past and DNA-guided assembly of nanoparticles represents an active research effort in DNA nanotechnology. Several DNA-dictated nanoparticle structures were reported, including a tetrahedron, a triangle or linear nanoengineered nanoparticle structures; however, the programmed, dynamic reversible switching of nanoparticle structures and, particularly, the dictated switchable functions emerging from the nanostructures, are missing elements in DNA nanotechnology. Here we introduce DNA catenane systems (interlocked DNA rings) as molecular DNA machines for the programmed, reversible and switchable arrangement of different-sized gold nanoparticles. We further demonstrate that the machine-powered gold nanoparticle structures reveal unique emerging switchable spectroscopic features, such as plasmonic coupling or surface-enhanced fluorescence.
Petri-net-based 2D design of DNA walker circuits.
Gilbert, David; Heiner, Monika; Rohr, Christian
2018-01-01
We consider localised DNA computation, where a DNA strand walks along a binary decision graph to compute a binary function. One of the challenges for the design of reliable walker circuits consists in leakage transitions, which occur when a walker jumps into another branch of the decision graph. We automatically identify leakage transitions, which allows for a detailed qualitative and quantitative assessment of circuit designs, design comparison, and design optimisation. The ability to identify leakage transitions is an important step in the process of optimising DNA circuit layouts where the aim is to minimise the computational error inherent in a circuit while minimising the area of the circuit. Our 2D modelling approach of DNA walker circuits relies on coloured stochastic Petri nets which enable functionality, topology and dimensionality all to be integrated in one two-dimensional model. Our modelling and analysis approach can be easily extended to 3-dimensional walker systems.
MICA: desktop software for comprehensive searching of DNA databases
Stokes, William A; Glick, Benjamin S
2006-01-01
Background Molecular biologists work with DNA databases that often include entire genomes. A common requirement is to search a DNA database to find exact matches for a nondegenerate or partially degenerate query. The software programs available for such purposes are normally designed to run on remote servers, but an appealing alternative is to work with DNA databases stored on local computers. We describe a desktop software program termed MICA (K-Mer Indexing with Compact Arrays) that allows large DNA databases to be searched efficiently using very little memory. Results MICA rapidly indexes a DNA database. On a Macintosh G5 computer, the complete human genome could be indexed in about 5 minutes. The indexing algorithm recognizes all 15 characters of the DNA alphabet and fully captures the information in any DNA sequence, yet for a typical sequence of length L, the index occupies only about 2L bytes. The index can be searched to return a complete list of exact matches for a nondegenerate or partially degenerate query of any length. A typical search of a long DNA sequence involves reading only a small fraction of the index into memory. As a result, searches are fast even when the available RAM is limited. Conclusion MICA is suitable as a search engine for desktop DNA analysis software. PMID:17018144
Liu, Bin; Wang, Shanyi; Dong, Qiwen; Li, Shumin; Liu, Xuan
2016-04-20
DNA-binding proteins play a pivotal role in various intra- and extra-cellular activities ranging from DNA replication to gene expression control. With the rapid development of next generation of sequencing technique, the number of protein sequences is unprecedentedly increasing. Thus it is necessary to develop computational methods to identify the DNA-binding proteins only based on the protein sequence information. In this study, a novel method called iDNA-KACC is presented, which combines the Support Vector Machine (SVM) and the auto-cross covariance transformation. The protein sequences are first converted into profile-based protein representation, and then converted into a series of fixed-length vectors by the auto-cross covariance transformation with Kmer composition. The sequence order effect can be effectively captured by this scheme. These vectors are then fed into Support Vector Machine (SVM) to discriminate the DNA-binding proteins from the non DNA-binding ones. iDNA-KACC achieves an overall accuracy of 75.16% and Matthew correlation coefficient of 0.5 by a rigorous jackknife test. Its performance is further improved by employing an ensemble learning approach, and the improved predictor is called iDNA-KACC-EL. Experimental results on an independent dataset shows that iDNA-KACC-EL outperforms all the other state-of-the-art predictors, indicating that it would be a useful computational tool for DNA binding protein identification. .
Christen, Matthias; Del Medico, Luca; Christen, Heinz; Christen, Beat
2017-01-01
Recent advances in lower-cost DNA synthesis techniques have enabled new innovations in the field of synthetic biology. Still, efficient design and higher-order assembly of genome-scale DNA constructs remains a labor-intensive process. Given the complexity, computer assisted design tools that fragment large DNA sequences into fabricable DNA blocks are needed to pave the way towards streamlined assembly of biological systems. Here, we present the Genome Partitioner software implemented as a web-based interface that permits multi-level partitioning of genome-scale DNA designs. Without the need for specialized computing skills, biologists can submit their DNA designs to a fully automated pipeline that generates the optimal retrosynthetic route for higher-order DNA assembly. To test the algorithm, we partitioned a 783 kb Caulobacter crescentus genome design. We validated the partitioning strategy by assembling a 20 kb test segment encompassing a difficult to synthesize DNA sequence. Successful assembly from 1 kb subblocks into the 20 kb segment highlights the effectiveness of the Genome Partitioner for reducing synthesis costs and timelines for higher-order DNA assembly. The Genome Partitioner is broadly applicable to translate DNA designs into ready to order sequences that can be assembled with standardized protocols, thus offering new opportunities to harness the diversity of microbial genomes for synthetic biology applications. The Genome Partitioner web tool can be accessed at https://christenlab.ethz.ch/GenomePartitioner.
DNA Compass: a secure, client-side site for navigating personal genetic information
Curnin, Charles; Gordon, Assaf; Erlich, Yaniv
2017-01-01
Abstract Motivation: Millions of individuals have access to raw genomic data using direct-to-consumer companies. The advent of large-scale sequencing projects, such as the Precision Medicine Initiative, will further increase the number of individuals with access to their own genomic information. However, querying genomic data requires a computer terminal and computational skill to analyze the data—an impediment for the general public. Results: DNA Compass is a website designed to empower the public by enabling simple navigation of personal genomic data. Users can query the status of their genomic variants for over 1658 markers or tens of millions of documented single nucleotide polymorphisms (SNPs). DNA Compass presents the relevant genotypes of the user side-by-side with explanatory scientific resources. The genotype data never leaves the user’s computer, a feature that provides improved security and performance. More than 12 000 unique users, mainly from the general genetic genealogy community, have already used DNA Compass, demonstrating its utility. Availability and Implementation: DNA Compass is freely available on https://compass.dna.land. Contact: yaniv@cs.columbia.edu PMID:28334237
Ron, Gil; Globerson, Yuval; Moran, Dror; Kaplan, Tommy
2017-12-21
Proximity-ligation methods such as Hi-C allow us to map physical DNA-DNA interactions along the genome, and reveal its organization into topologically associating domains (TADs). As the Hi-C data accumulate, computational methods were developed for identifying domain borders in multiple cell types and organisms. Here, we present PSYCHIC, a computational approach for analyzing Hi-C data and identifying promoter-enhancer interactions. We use a unified probabilistic model to segment the genome into domains, which we then merge hierarchically and fit using a local background model, allowing us to identify over-represented DNA-DNA interactions across the genome. By analyzing the published Hi-C data sets in human and mouse, we identify hundreds of thousands of putative enhancers and their target genes, and compile an extensive genome-wide catalog of gene regulation in human and mouse. As we show, our predictions are highly enriched for ChIP-seq and DNA accessibility data, evolutionary conservation, eQTLs and other DNA-DNA interaction data.
Abstractions for DNA circuit design.
Lakin, Matthew R; Youssef, Simon; Cardelli, Luca; Phillips, Andrew
2012-03-07
DNA strand displacement techniques have been used to implement a broad range of information processing devices, from logic gates, to chemical reaction networks, to architectures for universal computation. Strand displacement techniques enable computational devices to be implemented in DNA without the need for additional components, allowing computation to be programmed solely in terms of nucleotide sequences. A major challenge in the design of strand displacement devices has been to enable rapid analysis of high-level designs while also supporting detailed simulations that include known forms of interference. Another challenge has been to design devices capable of sustaining precise reaction kinetics over long periods, without relying on complex experimental equipment to continually replenish depleted species over time. In this paper, we present a programming language for designing DNA strand displacement devices, which supports progressively increasing levels of molecular detail. The language allows device designs to be programmed using a common syntax and then analysed at varying levels of detail, with or without interference, without needing to modify the program. This allows a trade-off to be made between the level of molecular detail and the computational cost of analysis. We use the language to design a buffered architecture for DNA devices, capable of maintaining precise reaction kinetics for a potentially unbounded period. We test the effectiveness of buffered gates to support long-running computation by designing a DNA strand displacement system capable of sustained oscillations.
Wang, Zhaocai; Pu, Jun; Cao, Liling; Tan, Jian
2015-10-23
The unbalanced assignment problem (UAP) is to optimally resolve the problem of assigning n jobs to m individuals (m < n), such that minimum cost or maximum profit obtained. It is a vitally important Non-deterministic Polynomial (NP) complete problem in operation management and applied mathematics, having numerous real life applications. In this paper, we present a new parallel DNA algorithm for solving the unbalanced assignment problem using DNA molecular operations. We reasonably design flexible-length DNA strands representing different jobs and individuals, take appropriate steps, and get the solutions of the UAP in the proper length range and O(mn) time. We extend the application of DNA molecular operations and simultaneity to simplify the complexity of the computation.
Computational and experimental analysis of DNA shuffling
Maheshri, Narendra; Schaffer, David V.
2003-01-01
We describe a computational model of DNA shuffling based on the thermodynamics and kinetics of this process. The model independently tracks a representative ensemble of DNA molecules and records their states at every stage of a shuffling reaction. These data can subsequently be analyzed to yield information on any relevant metric, including reassembly efficiency, crossover number, type and distribution, and DNA sequence length distributions. The predictive ability of the model was validated by comparison to three independent sets of experimental data, and analysis of the simulation results led to several unique insights into the DNA shuffling process. We examine a tradeoff between crossover frequency and reassembly efficiency and illustrate the effects of experimental parameters on this relationship. Furthermore, we discuss conditions that promote the formation of useless “junk” DNA sequences or multimeric sequences containing multiple copies of the reassembled product. This model will therefore aid in the design of optimal shuffling reaction conditions. PMID:12626764
Programmable energy landscapes for kinetic control of DNA strand displacement.
Machinek, Robert R F; Ouldridge, Thomas E; Haley, Natalie E C; Bath, Jonathan; Turberfield, Andrew J
2014-11-10
DNA is used to construct synthetic systems that sense, actuate, move and compute. The operation of many dynamic DNA devices depends on toehold-mediated strand displacement, by which one DNA strand displaces another from a duplex. Kinetic control of strand displacement is particularly important in autonomous molecular machinery and molecular computation, in which non-equilibrium systems are controlled through rates of competing processes. Here, we introduce a new method based on the creation of mismatched base pairs as kinetic barriers to strand displacement. Reaction rate constants can be tuned across three orders of magnitude by altering the position of such a defect without significantly changing the stabilities of reactants or products. By modelling reaction free-energy landscapes, we explore the mechanistic basis of this control mechanism. We also demonstrate that oxDNA, a coarse-grained model of DNA, is capable of accurately predicting and explaining the impact of mismatches on displacement kinetics.
Biomolecular computers with multiple restriction enzymes.
Sakowski, Sebastian; Krasinski, Tadeusz; Waldmajer, Jacek; Sarnik, Joanna; Blasiak, Janusz; Poplawski, Tomasz
2017-01-01
The development of conventional, silicon-based computers has several limitations, including some related to the Heisenberg uncertainty principle and the von Neumann "bottleneck". Biomolecular computers based on DNA and proteins are largely free of these disadvantages and, along with quantum computers, are reasonable alternatives to their conventional counterparts in some applications. The idea of a DNA computer proposed by Ehud Shapiro's group at the Weizmann Institute of Science was developed using one restriction enzyme as hardware and DNA fragments (the transition molecules) as software and input/output signals. This computer represented a two-state two-symbol finite automaton that was subsequently extended by using two restriction enzymes. In this paper, we propose the idea of a multistate biomolecular computer with multiple commercially available restriction enzymes as hardware. Additionally, an algorithmic method for the construction of transition molecules in the DNA computer based on the use of multiple restriction enzymes is presented. We use this method to construct multistate, biomolecular, nondeterministic finite automata with four commercially available restriction enzymes as hardware. We also describe an experimental applicaton of this theoretical model to a biomolecular finite automaton made of four endonucleases.
Redesigning the specificity of protein-DNA interactions with Rosetta.
Thyme, Summer; Baker, David
2014-01-01
Building protein tools that can selectively bind or cleave specific DNA sequences requires efficient technologies for modifying protein-DNA interactions. Computational design is one method for accomplishing this goal. In this chapter, we present the current state of protein-DNA interface design with the Rosetta macromolecular modeling program. The LAGLIDADG endonuclease family of DNA-cleaving enzymes, under study as potential gene therapy reagents, has been the main testing ground for these in silico protocols. At this time, the computational methods are most useful for designing endonuclease variants that can accommodate small numbers of target site substitutions. Attempts to engineer for more extensive interface changes will likely benefit from an approach that uses the computational design results in conjunction with a high-throughput directed evolution or screening procedure. The family of enzymes presents an engineering challenge because their interfaces are highly integrated and there is significant coordination between the binding and catalysis events. Future developments in the computational algorithms depend on experimental feedback to improve understanding and modeling of these complex enzymatic features. This chapter presents both the basic method of design that has been successfully used to modulate specificity and more advanced procedures that incorporate DNA flexibility and other properties that are likely necessary for reliable modeling of more extensive target site changes.
Solving probability reasoning based on DNA strand displacement and probability modules.
Zhang, Qiang; Wang, Xiaobiao; Wang, Xiaojun; Zhou, Changjun
2017-12-01
In computation biology, DNA strand displacement technology is used to simulate the computation process and has shown strong computing ability. Most researchers use it to solve logic problems, but it is only rarely used in probabilistic reasoning. To process probabilistic reasoning, a conditional probability derivation model and total probability model based on DNA strand displacement were established in this paper. The models were assessed through the game "read your mind." It has been shown to enable the application of probabilistic reasoning in genetic diagnosis. Copyright © 2017 Elsevier Ltd. All rights reserved.
DNA Origami-Graphene Hybrid Nanopore for DNA Detection.
Barati Farimani, Amir; Dibaeinia, Payam; Aluru, Narayana R
2017-01-11
DNA origami nanostructures can be used to functionalize solid-state nanopores for single molecule studies. In this study, we characterized a nanopore in a DNA origami-graphene heterostructure for DNA detection. The DNA origami nanopore is functionalized with a specific nucleotide type at the edge of the pore. Using extensive molecular dynamics (MD) simulations, we computed and analyzed the ionic conductivity of nanopores in heterostructures carpeted with one or two layers of DNA origami on graphene. We demonstrate that a nanopore in DNA origami-graphene gives rise to distinguishable dwell times for the four DNA base types, whereas for a nanopore in bare graphene, the dwell time is almost the same for all types of bases. The specific interactions (hydrogen bonds) between DNA origami and the translocating DNA strand yield different residence times and ionic currents. We also conclude that the speed of DNA translocation decreases due to the friction between the dangling bases at the pore mouth and the sequencing DNA strands.
Computational Nanoelectronics: Applications to DNA, Carbon Nanotubes and Nanotransistors
NASA Technical Reports Server (NTRS)
Anantram, M. P.; Svizhenko, Alexei; Govindan, T. R.; Govindan, T. R.; Walch, S.; Mehrez, H.
2003-01-01
The topics covered by the panels of this viewgraph presentation include phonon scattering, layered structures, DNA as a device, the influence of twist and rise in the DNA molecule, counter-ions, conductance versus length, and intrinsic resonant tunneling.
An improved model for whole genome phylogenetic analysis by Fourier transform.
Yin, Changchuan; Yau, Stephen S-T
2015-10-07
DNA sequence similarity comparison is one of the major steps in computational phylogenetic studies. The sequence comparison of closely related DNA sequences and genomes is usually performed by multiple sequence alignments (MSA). While the MSA method is accurate for some types of sequences, it may produce incorrect results when DNA sequences undergone rearrangements as in many bacterial and viral genomes. It is also limited by its computational complexity for comparing large volumes of data. Previously, we proposed an alignment-free method that exploits the full information contents of DNA sequences by Discrete Fourier Transform (DFT), but still with some limitations. Here, we present a significantly improved method for the similarity comparison of DNA sequences by DFT. In this method, we map DNA sequences into 2-dimensional (2D) numerical sequences and then apply DFT to transform the 2D numerical sequences into frequency domain. In the 2D mapping, the nucleotide composition of a DNA sequence is a determinant factor and the 2D mapping reduces the nucleotide composition bias in distance measure, and thus improving the similarity measure of DNA sequences. To compare the DFT power spectra of DNA sequences with different lengths, we propose an improved even scaling algorithm to extend shorter DFT power spectra to the longest length of the underlying sequences. After the DFT power spectra are evenly scaled, the spectra are in the same dimensionality of the Fourier frequency space, then the Euclidean distances of full Fourier power spectra of the DNA sequences are used as the dissimilarity metrics. The improved DFT method, with increased computational performance by 2D numerical representation, can be applicable to any DNA sequences of different length ranges. We assess the accuracy of the improved DFT similarity measure in hierarchical clustering of different DNA sequences including simulated and real datasets. The method yields accurate and reliable phylogenetic trees and demonstrates that the improved DFT dissimilarity measure is an efficient and effective similarity measure of DNA sequences. Due to its high efficiency and accuracy, the proposed DFT similarity measure is successfully applied on phylogenetic analysis for individual genes and large whole bacterial genomes. Copyright © 2015 Elsevier Ltd. All rights reserved.
Del Medico, Luca; Christen, Heinz; Christen, Beat
2017-01-01
Recent advances in lower-cost DNA synthesis techniques have enabled new innovations in the field of synthetic biology. Still, efficient design and higher-order assembly of genome-scale DNA constructs remains a labor-intensive process. Given the complexity, computer assisted design tools that fragment large DNA sequences into fabricable DNA blocks are needed to pave the way towards streamlined assembly of biological systems. Here, we present the Genome Partitioner software implemented as a web-based interface that permits multi-level partitioning of genome-scale DNA designs. Without the need for specialized computing skills, biologists can submit their DNA designs to a fully automated pipeline that generates the optimal retrosynthetic route for higher-order DNA assembly. To test the algorithm, we partitioned a 783 kb Caulobacter crescentus genome design. We validated the partitioning strategy by assembling a 20 kb test segment encompassing a difficult to synthesize DNA sequence. Successful assembly from 1 kb subblocks into the 20 kb segment highlights the effectiveness of the Genome Partitioner for reducing synthesis costs and timelines for higher-order DNA assembly. The Genome Partitioner is broadly applicable to translate DNA designs into ready to order sequences that can be assembled with standardized protocols, thus offering new opportunities to harness the diversity of microbial genomes for synthetic biology applications. The Genome Partitioner web tool can be accessed at https://christenlab.ethz.ch/GenomePartitioner. PMID:28531174
Shuffle Optimizer: A Program to Optimize DNA Shuffling for Protein Engineering.
Milligan, John N; Garry, Daniel J
2017-01-01
DNA shuffling is a powerful tool to develop libraries of variants for protein engineering. Here, we present a protocol to use our freely available and easy-to-use computer program, Shuffle Optimizer. Shuffle Optimizer is written in the Python computer language and increases the nucleotide homology between two pieces of DNA desired to be shuffled together without changing the amino acid sequence. In addition we also include sections on optimal primer design for DNA shuffling and library construction, a small-volume ultrasonicator method to create sheared DNA, and finally a method to reassemble the sheared fragments and recover and clone the library. The Shuffle Optimizer program and these protocols will be useful to anyone desiring to perform any of the nucleotide homology-dependent shuffling methods.
The number of reduced alignments between two DNA sequences
2014-01-01
Background In this study we consider DNA sequences as mathematical strings. Total and reduced alignments between two DNA sequences have been considered in the literature to measure their similarity. Results for explicit representations of some alignments have been already obtained. Results We present exact, explicit and computable formulas for the number of different possible alignments between two DNA sequences and a new formula for a class of reduced alignments. Conclusions A unified approach for a wide class of alignments between two DNA sequences has been provided. The formula is computable and, if complemented by software development, will provide a deeper insight into the theory of sequence alignment and give rise to new comparison methods. AMS Subject Classification Primary 92B05, 33C20, secondary 39A14, 65Q30 PMID:24684679
Rutty, Guy N; Barber, Jade; Amoroso, Jasmin; Morgan, Bruno; Graham, Eleanor A M
2013-12-01
Post-mortem computed tomography angiography (PMCTA) involves the injection of contrast agents. This could have both a dilution effect on biological fluid samples and could affect subsequent post-contrast analytical laboratory processes. We undertook a small sample study of 10 targeted and 10 whole body PMCTA cases to consider whether or not these two methods of PMCTA could affect post-PMCTA cadaver blood based DNA identification. We used standard methodology to examine DNA from blood samples obtained before and after the PMCTA procedure. We illustrate that neither of these PMCTA methods had an effect on the alleles called following short tandem repeat based DNA profiling, and therefore the ability to undertake post-PMCTA blood based DNA identification.
Wang, Zhaocai; Pu, Jun; Cao, Liling; Tan, Jian
2015-01-01
The unbalanced assignment problem (UAP) is to optimally resolve the problem of assigning n jobs to m individuals (m < n), such that minimum cost or maximum profit obtained. It is a vitally important Non-deterministic Polynomial (NP) complete problem in operation management and applied mathematics, having numerous real life applications. In this paper, we present a new parallel DNA algorithm for solving the unbalanced assignment problem using DNA molecular operations. We reasonably design flexible-length DNA strands representing different jobs and individuals, take appropriate steps, and get the solutions of the UAP in the proper length range and O(mn) time. We extend the application of DNA molecular operations and simultaneity to simplify the complexity of the computation. PMID:26512650
Computer-Aided Drug Discovery: Molecular Docking of Diminazene Ligands to DNA Minor Groove
ERIC Educational Resources Information Center
Kholod, Yana; Hoag, Erin; Muratore, Katlynn; Kosenkov, Dmytro
2018-01-01
The reported project-based laboratory unit introduces upper-division undergraduate students to the basics of computer-aided drug discovery as a part of a computational chemistry laboratory course. The students learn to perform model binding of organic molecules (ligands) to the DNA minor groove with computer-aided drug discovery (CADD) tools. The…
Mapping the Space of Genomic Signatures
Kari, Lila; Hill, Kathleen A.; Sayem, Abu S.; Karamichalis, Rallis; Bryans, Nathaniel; Davis, Katelyn; Dattani, Nikesh S.
2015-01-01
We propose a computational method to measure and visualize interrelationships among any number of DNA sequences allowing, for example, the examination of hundreds or thousands of complete mitochondrial genomes. An "image distance" is computed for each pair of graphical representations of DNA sequences, and the distances are visualized as a Molecular Distance Map: Each point on the map represents a DNA sequence, and the spatial proximity between any two points reflects the degree of structural similarity between the corresponding sequences. The graphical representation of DNA sequences utilized, Chaos Game Representation (CGR), is genome- and species-specific and can thus act as a genomic signature. Consequently, Molecular Distance Maps could inform species identification, taxonomic classifications and, to a certain extent, evolutionary history. The image distance employed, Structural Dissimilarity Index (DSSIM), implicitly compares the occurrences of oligomers of length up to k (herein k = 9) in DNA sequences. We computed DSSIM distances for more than 5 million pairs of complete mitochondrial genomes, and used Multi-Dimensional Scaling (MDS) to obtain Molecular Distance Maps that visually display the sequence relatedness in various subsets, at different taxonomic levels. This general-purpose method does not require DNA sequence alignment and can thus be used to compare similar or vastly different DNA sequences, genomic or computer-generated, of the same or different lengths. We illustrate potential uses of this approach by applying it to several taxonomic subsets: phylum Vertebrata, (super)kingdom Protista, classes Amphibia-Insecta-Mammalia, class Amphibia, and order Primates. This analysis of an extensive dataset confirms that the oligomer composition of full mtDNA sequences can be a source of taxonomic information. This method also correctly finds the mtDNA sequences most closely related to that of the anatomically modern human (the Neanderthal, the Denisovan, and the chimp), and that the sequence most different from it in this dataset belongs to a cucumber. PMID:26000734
Hexagonally packed DNA within bacteriophage T7 stabilized by curvature stress.
Odijk, T
1998-01-01
A continuum computation is proposed for the bending stress stabilizing DNA that is hexagonally packed within bacteriophage T7. Because the inner radius of the DNA spool is rather small, the stress of the curved DNA genome is strong enough to balance its electrostatic self-repulsion so as to form a stable hexagonal phase. The theory is in accord with the microscopically determined structure of bacteriophage T7 filled with DNA within the experimental margin of error. PMID:9726924
Toehold-Mediated Displacement of an Adenosine-Binding Aptamer from a DNA Duplex by its Ligand.
Monserud, Jon H; Macri, Katherine M; Schwartz, Daniel K
2016-10-24
DNA is increasingly used to engineer dynamic nanoscale circuits, structures, and motors, many of which rely on DNA strand-displacement reactions. The use of functional DNA sequences (e.g., aptamers, which bind to a wide range of ligands) in these reactions would potentially confer responsiveness on such devices, and integrate DNA computation with highly varied molecular stimuli. By using high-throughput single-molecule FRET methods, we compared the kinetics of a putative aptamer-ligand and aptamer-complement strand-displacement reaction. We found that the ligands actively disrupted the DNA duplex in the presence of a DNA toehold in a similar manner to complementary DNA, with kinetic details specific to the aptamer structure, thus suggesting that the DNA strand-displacement concept can be extended to functional DNA-ligand systems. © 2016 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Manipulation of oligonucleotides immobilized on solid supports - DNA computations on surfaces
NASA Astrophysics Data System (ADS)
Liu, Qinghua
The manipulation of DNA oligonucleotides immobilized on various solid supports has been studied intensively, especially in the area of surface hybridization. Recently, surface-based biotechnology has been applied to the area of molecular computing. These surface-based methods have advantages with regard to ease of handling, facile purification, and less interference when compared to solution methodologies. This dissertation describes the investigation of molecular approaches to DNA computing. The feasibility of encoding a bit (0 or 1) of information for DNA-based computations at the single nucleotide level was studied, particularly with regard to the efficiency and specificity of hybridization discrimination. Both gold and glass surfaces, with addressed arrays of 32 oligonucleotides, were employed with similar hybridization results. Although single-base discrimination may be achieved in the system, it is at the cost of a severe decrease in the efficiency of hybridization to perfectly matched sequences. This compromises the utility of single nucleotide encoding for DNA computing applications in the absence of some additional mechanism for increasing specificity. Several methods are suggested including a multiple-base encoding strategy. The multiple-base encoding strategy was employed to develop a prototype DNA computer. The approach was demonstrated by solving a small example of the Satisfiability (SAT) problem, an NP-complete problem in Boolean logic. 16 distinct DNA oligonucleotides, encoding all candidate solutions to the 4-variable-4-clause-3-SAT problem, were immobilized on a gold surface in the non-addressed format. Four cycles of MARK (hybridization), DESTROY (enzymatic destruction) and UNMARK (denaturation) were performed, which identified and eliminated members of the set which were not solutions to the problem. Determination of the answer was accomplished in the READOUT (sequence identification) operation by PCR amplification of the remaining molecules and hybridization to an addressed array. Four answers were determined and the S/N ratio between correct and incorrect solutions ranged from 10 to 777, making discrimination between correct and incorrect solutions to the problem straightforward. Additionally, studies of enzymatic manipulations of DNA molecules on surfaces suggested the use of E. coli Exonuclease I (Exo I) and perhaps EarI in the DESTROY operation.
Biomolecular computers with multiple restriction enzymes
Sakowski, Sebastian; Krasinski, Tadeusz; Waldmajer, Jacek; Sarnik, Joanna; Blasiak, Janusz; Poplawski, Tomasz
2017-01-01
Abstract The development of conventional, silicon-based computers has several limitations, including some related to the Heisenberg uncertainty principle and the von Neumann “bottleneck”. Biomolecular computers based on DNA and proteins are largely free of these disadvantages and, along with quantum computers, are reasonable alternatives to their conventional counterparts in some applications. The idea of a DNA computer proposed by Ehud Shapiro’s group at the Weizmann Institute of Science was developed using one restriction enzyme as hardware and DNA fragments (the transition molecules) as software and input/output signals. This computer represented a two-state two-symbol finite automaton that was subsequently extended by using two restriction enzymes. In this paper, we propose the idea of a multistate biomolecular computer with multiple commercially available restriction enzymes as hardware. Additionally, an algorithmic method for the construction of transition molecules in the DNA computer based on the use of multiple restriction enzymes is presented. We use this method to construct multistate, biomolecular, nondeterministic finite automata with four commercially available restriction enzymes as hardware. We also describe an experimental applicaton of this theoretical model to a biomolecular finite automaton made of four endonucleases. PMID:29064510
A Novel Computational Method to Reduce Leaky Reaction in DNA Strand Displacement.
Li, Xin; Wang, Xun; Song, Tao; Lu, Wei; Chen, Zhihua; Shi, Xiaolong
2015-01-01
DNA strand displacement technique is widely used in DNA programming, DNA biosensors, and gene analysis. In DNA strand displacement, leaky reactions can cause DNA signals decay and detecting DNA signals fails. The mostly used method to avoid leakage is cleaning up after upstream leaky reactions, and it remains a challenge to develop reliable DNA strand displacement technique with low leakage. In this work, we address the challenge by experimentally evaluating the basic factors, including reaction time, ratio of reactants, and ion concentration to the leakage in DNA strand displacement. Specifically, fluorescent probes and a hairpin structure reporting DNA strand are designed to detect the output of DNA strand displacement, and thus can evaluate the leakage of DNA strand displacement reactions with different reaction time, ratios of reactants, and ion concentrations. From the obtained data, mathematical models for evaluating leakage are achieved by curve derivation. As a result, it is obtained that long time incubation, high concentration of fuel strand, and inappropriate amount of ion concentration can weaken leaky reactions. This contributes to a method to set proper reaction conditions to reduce leakage in DNA strand displacement.
Wells, David B; Bhattacharya, Swati; Carr, Rogan; Maffeo, Christopher; Ho, Anthony; Comer, Jeffrey; Aksimentiev, Aleksei
2012-01-01
Molecular dynamics (MD) simulations have become a standard method for the rational design and interpretation of experimental studies of DNA translocation through nanopores. The MD method, however, offers a multitude of algorithms, parameters, and other protocol choices that can affect the accuracy of the resulting data as well as computational efficiency. In this chapter, we examine the most popular choices offered by the MD method, seeking an optimal set of parameters that enable the most computationally efficient and accurate simulations of DNA and ion transport through biological nanopores. In particular, we examine the influence of short-range cutoff, integration timestep and force field parameters on the temperature and concentration dependence of bulk ion conductivity, ion pairing, ion solvation energy, DNA structure, DNA-ion interactions, and the ionic current through a nanopore.
Shimizu, Masahiro; Noguchi, Yasunori; Sakiyama, Yukari; Kawakami, Hironori; Katayama, Tsutomu; Takada, Shoji
2016-12-13
Upon DNA replication initiation in Escherichia coli, the initiator protein DnaA forms higher-order complexes with the chromosomal origin oriC and a DNA-bending protein IHF. Although tertiary structures of DnaA and IHF have previously been elucidated, dynamic structures of oriC-DnaA-IHF complexes remain unknown. Here, combining computer simulations with biochemical assays, we obtained models at almost-atomic resolution for the central part of the oriC-DnaA-IHF complex. This complex can be divided into three subcomplexes; the left and right subcomplexes include pentameric DnaA bound in a head-to-tail manner and the middle subcomplex contains only a single DnaA. In the left and right subcomplexes, DnaA ATPases associated with various cellular activities (AAA+) domain III formed helices with specific structural differences in interdomain orientations, provoking a bend in the bound DNA. In the left subcomplex a continuous DnaA chain exists, including insertion of IHF into the DNA looping, consistent with the DNA unwinding function of the complex. The intervening spaces in those subcomplexes are crucial for DNA unwinding and loading of DnaB helicases. Taken together, this model provides a reasonable near-atomic level structural solution of the initiation complex, including the dynamic conformations and spatial arrangements of DnaA subcomplexes.
Iacovelli, Federico; Falconi, Mattia
2015-09-01
DNA and RNA are large and flexible polymers selected by nature to transmit information. The most common DNA three-dimensional structure is represented by the double helix, but this biopolymer is extremely flexible and polymorphic, and can easily change its conformation to adapt to different interactions and purposes. DNA can also adopt singular topologies, giving rise, for instance, to supercoils, formed because of the limited free rotation of the DNA domain flanking a replication or transcription complex. Our understanding of the importance of these unusual or transient structures is growing, as recent studies of DNA topology, supercoiling, knotting and linking have shown that the geometric changes can drive, or strongly influence, the interactions between protein and DNA, so altering its own metabolism. On the other hand, the unique self-recognition properties of DNA, determined by the strict Watson-Crick rules of base pairing, make this material ideal for the creation of self-assembling, predesigned nanostructures. The construction of such structures is one of the main focuses of the thriving area of DNA nanotechnology, where several assembly strategies have been employed to build increasingly complex DNA nanostructures. DNA nanodevices can have direct applications in biomedicine, but also in the materials science field, requiring the immersion of DNA in an environment far from the physiological one. Crucial help in the understanding and planning of natural and artificial nanostructures is given by modern computer simulation techniques, which are able to provide a reliable structural and dynamic description of nucleic acids. © 2015 FEBS.
Mechanism for priming DNA synthesis by yeast DNA Polymerase α
Perera, Rajika L; Torella, Rubben; Klinge, Sebastian; Kilkenny, Mairi L; Maman, Joseph D; Pellegrini, Luca
2013-01-01
The DNA Polymerase α (Pol α)/primase complex initiates DNA synthesis in eukaryotic replication. In the complex, Pol α and primase cooperate in the production of RNA-DNA oligonucleotides that prime synthesis of new DNA. Here we report crystal structures of the catalytic core of yeast Pol α in unliganded form, bound to an RNA primer/DNA template and extending an RNA primer with deoxynucleotides. We combine the structural analysis with biochemical and computational data to demonstrate that Pol α specifically recognizes the A-form RNA/DNA helix and that the ensuing synthesis of B-form DNA terminates primer synthesis. The spontaneous release of the completed RNA-DNA primer by the Pol α/primase complex simplifies current models of primer transfer to leading- and lagging strand polymerases. The proposed mechanism of nucleotide polymerization by Pol α might contribute to genomic stability by limiting the amount of inaccurate DNA to be corrected at the start of each Okazaki fragment. DOI: http://dx.doi.org/10.7554/eLife.00482.001 PMID:23599895
An evolution based biosensor receptor DNA sequence generation algorithm.
Kim, Eungyeong; Lee, Malrey; Gatton, Thomas M; Lee, Jaewan; Zang, Yupeng
2010-01-01
A biosensor is composed of a bioreceptor, an associated recognition molecule, and a signal transducer that can selectively detect target substances for analysis. DNA based biosensors utilize receptor molecules that allow hybridization with the target analyte. However, most DNA biosensor research uses oligonucleotides as the target analytes and does not address the potential problems of real samples. The identification of recognition molecules suitable for real target analyte samples is an important step towards further development of DNA biosensors. This study examines the characteristics of DNA used as bioreceptors and proposes a hybrid evolution-based DNA sequence generating algorithm, based on DNA computing, to identify suitable DNA bioreceptor recognition molecules for stable hybridization with real target substances. The Traveling Salesman Problem (TSP) approach is applied in the proposed algorithm to evaluate the safety and fitness of the generated DNA sequences. This approach improves efficiency and stability for enhanced and variable-length DNA sequence generation and allows extension to generation of variable-length DNA sequences with diverse receptor recognition requirements.
Approaching mathematical model of the immune network based DNA Strand Displacement system.
Mardian, Rizki; Sekiyama, Kosuke; Fukuda, Toshio
2013-12-01
One biggest obstacle in molecular programming is that there is still no direct method to compile any existed mathematical model into biochemical reaction in order to solve a computational problem. In this paper, the implementation of DNA Strand Displacement system based on nature-inspired computation is observed. By using the Immune Network Theory and Chemical Reaction Network, the compilation of DNA-based operation is defined and the formulation of its mathematical model is derived. Furthermore, the implementation on this system is compared with the conventional implementation by using silicon-based programming. From the obtained results, we can see a positive correlation between both. One possible application from this DNA-based model is for a decision making scheme of intelligent computer or molecular robot. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions
Gardner, Shea N; Mariella, Jr., Raymond P; Christian, Allen T; Young, Jennifer A; Clague, David S
2013-06-25
A method of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths.
Introduction to the Natural Anticipator and the Artificial Anticipator
NASA Astrophysics Data System (ADS)
Dubois, Daniel M.
2010-11-01
This short communication deals with the introduction of the concept of anticipator, which is one who anticipates, in the framework of computing anticipatory systems. The definition of anticipation deals with the concept of program. Indeed, the word program, comes from "pro-gram" meaning "to write before" by anticipation, and means a plan for the programming of a mechanism, or a sequence of coded instructions that can be inserted into a mechanism, or a sequence of coded instructions, as genes or behavioural responses, that is part of an organism. Any natural or artificial programs are thus related to anticipatory rewriting systems, as shown in this paper. All the cells in the body, and the neurons in the brain, are programmed by the anticipatory genetic code, DNA, in a low-level language with four signs. The programs in computers are also computing anticipatory systems. It will be shown, at one hand, that the genetic code DNA is a natural anticipator. As demonstrated by Nobel laureate McClintock [8], genomes are programmed. The fundamental program deals with the DNA genetic code. The properties of the DNA consist in self-replication and self-modification. The self-replicating process leads to reproduction of the species, while the self-modifying process leads to new species or evolution and adaptation in existing ones. The genetic code DNA keeps its instructions in memory in the DNA coding molecule. The genetic code DNA is a rewriting system, from DNA coding to DNA template molecule. The DNA template molecule is a rewriting system to the Messenger RNA molecule. The information is not destroyed during the execution of the rewriting program. On the other hand, it will be demonstrated that Turing machine is an artificial anticipator. The Turing machine is a rewriting system. The head reads and writes, modifying the content of the tape. The information is destroyed during the execution of the program. This is an irreversible process. The input data are lost.
Programmable and autonomous computing machine made of biomolecules
Benenson, Yaakov; Paz-Elizur, Tamar; Adar, Rivka; Keinan, Ehud; Livneh, Zvi; Shapiro, Ehud
2013-01-01
Devices that convert information from one form into another according to a definite procedure are known as automata. One such hypothetical device is the universal Turing machine1, which stimulated work leading to the development of modern computers. The Turing machine and its special cases2, including finite automata3, operate by scanning a data tape, whose striking analogy to information-encoding biopolymers inspired several designs for molecular DNA computers4–8. Laboratory-scale computing using DNA and human-assisted protocols has been demonstrated9–15, but the realization of computing devices operating autonomously on the molecular scale remains rare16–20. Here we describe a programmable finite automaton comprising DNA and DNA-manipulating enzymes that solves computational problems autonomously. The automaton’s hardware consists of a restriction nuclease and ligase, the software and input are encoded by double-stranded DNA, and programming amounts to choosing appropriate software molecules. Upon mixing solutions containing these components, the automaton processes the input molecule via a cascade of restriction, hybridization and ligation cycles, producing a detectable output molecule that encodes the automaton’s final state, and thus the computational result. In our implementation 1012 automata sharing the same software run independently and in parallel on inputs (which could, in principle, be distinct) in 120 μl solution at room temperature at a combined rate of 109 transitions per second with a transition fidelity greater than 99.8%, consuming less than 10−10 W. PMID:11719800
A simple method for the computation of first neighbour frequencies of DNAs from CD spectra
Marck, Christian; Guschlbauer, Wilhelm
1978-01-01
A procedure for the computation of the first neighbour frequencies of DNA's is presented. This procedure is based on the first neighbour approximation of Gray and Tinoco. We show that the knowledge of all the ten elementary CD signals attached to the ten double stranded first neighbour configurations is not necessary. One can obtain the ten frequencies of an unknown DNA with the use of eight elementary CD signals corresponding to eight linearly independent polymer sequences. These signals can be extracted very simply from any eight or more CD spectra of double stranded DNA's of known frequencies. The ten frequencies of a DNA are obtained by least square fit of its CD spectrum with these elementary signals. One advantage of this procedure is that it does not necessitate linear programming, it can be used with CD data digitalized using a large number of wavelengths, thus permitting an accurate resolution of the CD spectra. Under favorable case, the ten frequencies of a DNA (not used as input data) can be determined with an average absolute error < 2%. We have also observed that certain satellite DNA's, those of Drosophila virilis and Callinectes sapidus have CD spectra compatible with those of DNA's of quasi random sequence; these satellite DNA's should adopt also the B-form in solution. PMID:673843
Logic Gate Operation by DNA Translocation through Biological Nanopores.
Yasuga, Hiroki; Kawano, Ryuji; Takinoue, Masahiro; Tsuji, Yutaro; Osaki, Toshihisa; Kamiya, Koki; Miki, Norihisa; Takeuchi, Shoji
2016-01-01
Logical operations using biological molecules, such as DNA computing or programmable diagnosis using DNA, have recently received attention. Challenges remain with respect to the development of such systems, including label-free output detection and the rapidity of operation. Here, we propose integration of biological nanopores with DNA molecules for development of a logical operating system. We configured outputs "1" and "0" as single-stranded DNA (ssDNA) that is or is not translocated through a nanopore; unlabeled DNA was detected electrically. A negative-AND (NAND) operation was successfully conducted within approximately 10 min, which is rapid compared with previous studies using unlabeled DNA. In addition, this operation was executed in a four-droplet network. DNA molecules and associated information were transferred among droplets via biological nanopores. This system would facilitate linking of molecules and electronic interfaces. Thus, it could be applied to molecular robotics, genetic engineering, and even medical diagnosis and treatment.
Logic Gate Operation by DNA Translocation through Biological Nanopores
Takinoue, Masahiro; Tsuji, Yutaro; Osaki, Toshihisa; Kamiya, Koki; Miki, Norihisa; Takeuchi, Shoji
2016-01-01
Logical operations using biological molecules, such as DNA computing or programmable diagnosis using DNA, have recently received attention. Challenges remain with respect to the development of such systems, including label-free output detection and the rapidity of operation. Here, we propose integration of biological nanopores with DNA molecules for development of a logical operating system. We configured outputs “1” and “0” as single-stranded DNA (ssDNA) that is or is not translocated through a nanopore; unlabeled DNA was detected electrically. A negative-AND (NAND) operation was successfully conducted within approximately 10 min, which is rapid compared with previous studies using unlabeled DNA. In addition, this operation was executed in a four-droplet network. DNA molecules and associated information were transferred among droplets via biological nanopores. This system would facilitate linking of molecules and electronic interfaces. Thus, it could be applied to molecular robotics, genetic engineering, and even medical diagnosis and treatment. PMID:26890568
Barcode extension for analysis and reconstruction of structures
NASA Astrophysics Data System (ADS)
Myhrvold, Cameron; Baym, Michael; Hanikel, Nikita; Ong, Luvena L.; Gootenberg, Jonathan S.; Yin, Peng
2017-03-01
Collections of DNA sequences can be rationally designed to self-assemble into predictable three-dimensional structures. The geometric and functional diversity of DNA nanostructures created to date has been enhanced by improvements in DNA synthesis and computational design. However, existing methods for structure characterization typically image the final product or laboriously determine the presence of individual, labelled strands using gel electrophoresis. Here we introduce a new method of structure characterization that uses barcode extension and next-generation DNA sequencing to quantitatively measure the incorporation of every strand into a DNA nanostructure. By quantifying the relative abundances of distinct DNA species in product and monomer bands, we can study the influence of geometry and sequence on assembly. We have tested our method using 2D and 3D DNA brick and DNA origami structures. Our method is general and should be extensible to a wide variety of DNA nanostructures.
Barcode extension for analysis and reconstruction of structures.
Myhrvold, Cameron; Baym, Michael; Hanikel, Nikita; Ong, Luvena L; Gootenberg, Jonathan S; Yin, Peng
2017-03-13
Collections of DNA sequences can be rationally designed to self-assemble into predictable three-dimensional structures. The geometric and functional diversity of DNA nanostructures created to date has been enhanced by improvements in DNA synthesis and computational design. However, existing methods for structure characterization typically image the final product or laboriously determine the presence of individual, labelled strands using gel electrophoresis. Here we introduce a new method of structure characterization that uses barcode extension and next-generation DNA sequencing to quantitatively measure the incorporation of every strand into a DNA nanostructure. By quantifying the relative abundances of distinct DNA species in product and monomer bands, we can study the influence of geometry and sequence on assembly. We have tested our method using 2D and 3D DNA brick and DNA origami structures. Our method is general and should be extensible to a wide variety of DNA nanostructures.
Barcode extension for analysis and reconstruction of structures
Myhrvold, Cameron; Baym, Michael; Hanikel, Nikita; Ong, Luvena L; Gootenberg, Jonathan S; Yin, Peng
2017-01-01
Collections of DNA sequences can be rationally designed to self-assemble into predictable three-dimensional structures. The geometric and functional diversity of DNA nanostructures created to date has been enhanced by improvements in DNA synthesis and computational design. However, existing methods for structure characterization typically image the final product or laboriously determine the presence of individual, labelled strands using gel electrophoresis. Here we introduce a new method of structure characterization that uses barcode extension and next-generation DNA sequencing to quantitatively measure the incorporation of every strand into a DNA nanostructure. By quantifying the relative abundances of distinct DNA species in product and monomer bands, we can study the influence of geometry and sequence on assembly. We have tested our method using 2D and 3D DNA brick and DNA origami structures. Our method is general and should be extensible to a wide variety of DNA nanostructures. PMID:28287117
Exploring the Feasibility of a DNA Computer: Design of an ALU Using Sticker-Based DNA Model.
Sarkar, Mayukh; Ghosal, Prasun; Mohanty, Saraju P
2017-09-01
Since its inception, DNA computing has advanced to offer an extremely powerful, energy-efficient emerging technology for solving hard computational problems with its inherent massive parallelism and extremely high data density. This would be much more powerful and general purpose when combined with other existing well-known algorithmic solutions that exist for conventional computing architectures using a suitable ALU. Thus, a specifically designed DNA Arithmetic and Logic Unit (ALU) that can address operations suitable for both domains can mitigate the gap between these two. An ALU must be able to perform all possible logic operations, including NOT, OR, AND, XOR, NOR, NAND, and XNOR; compare, shift etc., integer and floating point arithmetic operations (addition, subtraction, multiplication, and division). In this paper, design of an ALU has been proposed using sticker-based DNA model with experimental feasibility analysis. Novelties of this paper may be in manifold. First, the integer arithmetic operations performed here are 2s complement arithmetic, and the floating point operations follow the IEEE 754 floating point format, resembling closely to a conventional ALU. Also, the output of each operation can be reused for any next operation. So any algorithm or program logic that users can think of can be implemented directly on the DNA computer without any modification. Second, once the basic operations of sticker model can be automated, the implementations proposed in this paper become highly suitable to design a fully automated ALU. Third, proposed approaches are easy to implement. Finally, these approaches can work on sufficiently large binary numbers.
DNA-Based Dynamic Reaction Networks.
Fu, Ting; Lyu, Yifan; Liu, Hui; Peng, Ruizi; Zhang, Xiaobing; Ye, Mao; Tan, Weihong
2018-05-21
Deriving from logical and mechanical interactions between DNA strands and complexes, DNA-based artificial reaction networks (RNs) are attractive for their high programmability, as well as cascading and fan-out ability, which are similar to the basic principles of electronic logic gates. Arising from the dream of creating novel computing mechanisms, researchers have placed high hopes on the development of DNA-based dynamic RNs and have strived to establish the basic theories and operative strategies of these networks. This review starts by looking back on the evolution of DNA dynamic RNs; in particular' the most significant applications in biochemistry occurring in recent years. Finally, we discuss the perspectives of DNA dynamic RNs and give a possible direction for the development of DNA circuits. Copyright © 2018. Published by Elsevier Ltd.
Structural DNA Nanotechnology: State of the Art and Future Perspective
2015-01-01
Over the past three decades DNA has emerged as an exceptional molecular building block for nanoconstruction due to its predictable conformation and programmable intra- and intermolecular Watson–Crick base-pairing interactions. A variety of convenient design rules and reliable assembly methods have been developed to engineer DNA nanostructures of increasing complexity. The ability to create designer DNA architectures with accurate spatial control has allowed researchers to explore novel applications in many directions, such as directed material assembly, structural biology, biocatalysis, DNA computing, nanorobotics, disease diagnosis, and drug delivery. This Perspective discusses the state of the art in the field of structural DNA nanotechnology and presents some of the challenges and opportunities that exist in DNA-based molecular design and programming. PMID:25029570
GUI to Facilitate Research on Biological Damage from Radiation
NASA Technical Reports Server (NTRS)
Cucinotta, Frances A.; Ponomarev, Artem Lvovich
2010-01-01
A graphical-user-interface (GUI) computer program has been developed to facilitate research on the damage caused by highly energetic particles and photons impinging on living organisms. The program brings together, into one computational workspace, computer codes that have been developed over the years, plus codes that will be developed during the foreseeable future, to address diverse aspects of radiation damage. These include codes that implement radiation-track models, codes for biophysical models of breakage of deoxyribonucleic acid (DNA) by radiation, pattern-recognition programs for extracting quantitative information from biological assays, and image-processing programs that aid visualization of DNA breaks. The radiation-track models are based on transport models of interactions of radiation with matter and solution of the Boltzmann transport equation by use of both theoretical and numerical models. The biophysical models of breakage of DNA by radiation include biopolymer coarse-grained and atomistic models of DNA, stochastic- process models of deposition of energy, and Markov-based probabilistic models of placement of double-strand breaks in DNA. The program is designed for use in the NT, 95, 98, 2000, ME, and XP variants of the Windows operating system.
Wang, Zhaocai; Huang, Dongmei; Meng, Huajun; Tang, Chengpei
2013-10-01
The minimum spanning tree (MST) problem is to find minimum edge connected subsets containing all the vertex of a given undirected graph. It is a vitally important NP-complete problem in graph theory and applied mathematics, having numerous real life applications. Moreover in previous studies, DNA molecular operations usually were used to solve NP-complete head-to-tail path search problems, rarely for NP-hard problems with multi-lateral path solutions result, such as the minimum spanning tree problem. In this paper, we present a new fast DNA algorithm for solving the MST problem using DNA molecular operations. For an undirected graph with n vertex and m edges, we reasonably design flexible length DNA strands representing the vertex and edges, take appropriate steps and get the solutions of the MST problem in proper length range and O(3m+n) time complexity. We extend the application of DNA molecular operations and simultaneity simplify the complexity of the computation. Results of computer simulative experiments show that the proposed method updates some of the best known values with very short time and that the proposed method provides a better performance with solution accuracy over existing algorithms. Copyright © 2013 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.
Suárez, Martha Y.; Villagrán; Miller, John H.
2015-01-01
We report on a new technique, computational DNA hole spectroscopy, which creates spectra of electron hole probabilities vs. nucleotide position. A hole is a site of positive charge created when an electron is removed. Peaks in the hole spectrum depict sites where holes tend to localize and potentially trigger a base pair mismatch during replication. Our studies of mitochondrial DNA reveal a correlation between L-strand hole spectrum peaks and spikes in the human mutation spectrum. Importantly, we also find that hole peak positions that do not coincide with large variant frequencies often coincide with disease-implicated mutations and/or (for coding DNA) encoded conserved amino acids. This enables combining hole spectra with variant data to identify critical base pairs and potential disease ‘driver’ mutations. Such integration of DNA hole and variance spectra could ultimately prove invaluable for pinpointing critical regions of the vast non-protein-coding genome. An observed asymmetry in correlations, between the spectrum of human mtDNA variations and the L- and H-strand hole spectra, is attributed to asymmetric DNA replication processes that occur for the leading and lagging strands. PMID:26310834
Villagrán, Martha Y Suárez; Miller, John H
2015-08-27
We report on a new technique, computational DNA hole spectroscopy, which creates spectra of electron hole probabilities vs. nucleotide position. A hole is a site of positive charge created when an electron is removed. Peaks in the hole spectrum depict sites where holes tend to localize and potentially trigger a base pair mismatch during replication. Our studies of mitochondrial DNA reveal a correlation between L-strand hole spectrum peaks and spikes in the human mutation spectrum. Importantly, we also find that hole peak positions that do not coincide with large variant frequencies often coincide with disease-implicated mutations and/or (for coding DNA) encoded conserved amino acids. This enables combining hole spectra with variant data to identify critical base pairs and potential disease 'driver' mutations. Such integration of DNA hole and variance spectra could ultimately prove invaluable for pinpointing critical regions of the vast non-protein-coding genome. An observed asymmetry in correlations, between the spectrum of human mtDNA variations and the L- and H-strand hole spectra, is attributed to asymmetric DNA replication processes that occur for the leading and lagging strands.
Model Checking Temporal Logic Formulas Using Sticker Automata
Feng, Changwei; Wu, Huanmei
2017-01-01
As an important complex problem, the temporal logic model checking problem is still far from being fully resolved under the circumstance of DNA computing, especially Computation Tree Logic (CTL), Interval Temporal Logic (ITL), and Projection Temporal Logic (PTL), because there is still a lack of approaches for DNA model checking. To address this challenge, a model checking method is proposed for checking the basic formulas in the above three temporal logic types with DNA molecules. First, one-type single-stranded DNA molecules are employed to encode the Finite State Automaton (FSA) model of the given basic formula so that a sticker automaton is obtained. On the other hand, other single-stranded DNA molecules are employed to encode the given system model so that the input strings of the sticker automaton are obtained. Next, a series of biochemical reactions are conducted between the above two types of single-stranded DNA molecules. It can then be decided whether the system satisfies the formula or not. As a result, we have developed a DNA-based approach for checking all the basic formulas of CTL, ITL, and PTL. The simulated results demonstrate the effectiveness of the new method. PMID:29119114
Supercoil Formation During DNA Melting
NASA Astrophysics Data System (ADS)
Sayar, Mehmet; Avsaroglu, Baris; Kabakcioglu, Alkan
2009-03-01
Supercoil formation plays a key role in determining the structure-function relationship in DNA. Biological and technological processes, such as protein synthesis, polymerase chain reaction, and microarrays relys on separation of the two strands in DNA, which is coupled to the unwinding of the supercoiled structure. This problem has been studied theoretically via Peyrard-Bishop and Poland-Scheraga type models, which include a simple representation of the DNA structural properties. In recent years, computational models, which provide a more realtistic representaion of DNA molecule, have been used to study the melting behavior of short DNA chains. Here, we will present a new coarse-grained model of DNA which is capable of simulating sufficiently long DNA chains for studying the supercoil formation during melting, without sacrificing the local structural properties. Our coarse-grained model successfully reproduces the local geometry of the DNA molecule, such as the 3'-5' directionality, major-minor groove structure, and the helical pitch. We will present our initial results on the dynamics of supercoiling during DNA melting.
Ni-DNA-based nanowires and nanodevices
NASA Astrophysics Data System (ADS)
Chang, Chia-Ching; Yuan, Chiun-Jye; Jian, Wen-Bin; Chen, Yu-Chang; di Ventra, Massimiliano
DNA is a highly versatile biopolymer that has been a recent focus in the field of nanomachines and nanoelectronics. DNA exhibits high stability, adjustable conductance, self-organizing capability, programmability and vast information storage. It is an ideal material in the applications of nanodevices, nanoelectronics, and molecular computing. Low conductance of native DNA renders applications difficult. However, doping with nickel ions tunes the DNA into a conducting polymer. Further studies showed that nickel ions containing DNA (Ni-DNA) nanowires exhibit characteristics of memristor and memcapacitor making them a potential mass information storage system. In summary, Ni-DNA has promising applications in a variety of fields, including nanoelectronics, biosensors and memcomputing. This study was supported in part by the Ministry of Science and Technology (MOST), Taiwan (ROC) MOST 103-2112-M-009-011 -MY3, and MOST 105-2627-M-009-006.
Akberova, N I; Zhmurov, A A; Nevzorova, T A; Litvinov, R I
2016-01-01
Antibodies to DNA play an important role in the pathogenesis of autoimmune diseases. The elucidation of structural mechanisms of both the antigen recognition and the interaction of anti-DNA antibodies with DNA will help to understand the role of DNA-containing immune complexes in various pathologies and can provide a basis for new treatment modalities. Moreover, the DNA-antibody complex is an analog of specific intracellular DNA-protein interactions. In this work, we used in silico molecular dynamic simulations of bimolecular complexes of the dsDNA segment containing the Fab fragment of an anti-DNA antibody to obtain the detailed thermodynamic and structural characteristics of dynamic intermolecular interactions. Using computationally modified crystal structure of the Fab-DNA complex (PDB ID: 3VW3), we studied the equilibrium molecular dynamics of the 64M-5 antibody Fab fragment associated with the dsDNA fragment containing the thymine dimer, the product of DNA photodamage. Amino acid residues that constitute paratopes and the complementary nucleotide epitopes for the Fab-DNA construct were identified. Stacking and electrostatic interactions were found to play the main role in mediating the most specific antibody-dsDNA contacts, while hydrogen bonds were less significant. These findings may shed light on the formation and properties of pathogenic anti-DNA antibodies in autoimmune diseases, such as systemic lupus erythematosus associated with skin photosensitivity and DNA photodamage.
Flavin Charge Transfer Transitions Assist DNA Photolyase Electron Transfer
NASA Astrophysics Data System (ADS)
Skourtis, Spiros S.; Prytkova, Tatiana; Beratan, David N.
2007-12-01
This contribution describes molecular dynamics, semi-empirical and ab-initio studies of the primary photo-induced electron transfer reaction in DNA photolyase. DNA photolyases are FADH--containing proteins that repair UV-damaged DNA by photo-induced electron transfer. A DNA photolyase recognizes and binds to cyclobutatne pyrimidine dimer lesions of DNA. The protein repairs a bound lesion by transferring an electron to the lesion from FADH-, upon photo-excitation of FADH- with 350-450 nm light. We compute the lowest singlet excited states of FADH- in DNA photolyase using INDO/S configuration interaction, time-dependent density-functional, and time-dependent Hartree-Fock methods. The calculations identify the lowest singlet excited state of FADH- that is populated after photo-excitation and that acts as the electron donor. For this donor state we compute conformationally-averaged tunneling matrix elements to empty electron-acceptor states of a thymine dimer bound to photolyase. The conformational averaging involves different FADH--thymine dimer confromations obtained from molecular dynamics simulations of the solvated protein with a thymine dimer docked in its active site. The tunneling matrix element computations use INDO/S-level Green's function, energy splitting, and Generalized Mulliken-Hush methods. These calculations indicate that photo-excitation of FADH- causes a π→π* charge-transfer transition that shifts electron density to the side of the flavin isoalloxazine ring that is adjacent to the docked thymine dimer. This shift in electron density enhances the FADH--to-dimer electronic coupling, thus inducing rapid electron transfer.
A Feature Selection Algorithm to Compute Gene Centric Methylation from Probe Level Methylation Data.
Baur, Brittany; Bozdag, Serdar
2016-01-01
DNA methylation is an important epigenetic event that effects gene expression during development and various diseases such as cancer. Understanding the mechanism of action of DNA methylation is important for downstream analysis. In the Illumina Infinium HumanMethylation 450K array, there are tens of probes associated with each gene. Given methylation intensities of all these probes, it is necessary to compute which of these probes are most representative of the gene centric methylation level. In this study, we developed a feature selection algorithm based on sequential forward selection that utilized different classification methods to compute gene centric DNA methylation using probe level DNA methylation data. We compared our algorithm to other feature selection algorithms such as support vector machines with recursive feature elimination, genetic algorithms and ReliefF. We evaluated all methods based on the predictive power of selected probes on their mRNA expression levels and found that a K-Nearest Neighbors classification using the sequential forward selection algorithm performed better than other algorithms based on all metrics. We also observed that transcriptional activities of certain genes were more sensitive to DNA methylation changes than transcriptional activities of other genes. Our algorithm was able to predict the expression of those genes with high accuracy using only DNA methylation data. Our results also showed that those DNA methylation-sensitive genes were enriched in Gene Ontology terms related to the regulation of various biological processes.
A spatially localized architecture for fast and modular DNA computing
NASA Astrophysics Data System (ADS)
Chatterjee, Gourab; Dalchau, Neil; Muscat, Richard A.; Phillips, Andrew; Seelig, Georg
2017-09-01
Cells use spatial constraints to control and accelerate the flow of information in enzyme cascades and signalling networks. Synthetic silicon-based circuitry similarly relies on spatial constraints to process information. Here, we show that spatial organization can be a similarly powerful design principle for overcoming limitations of speed and modularity in engineered molecular circuits. We create logic gates and signal transmission lines by spatially arranging reactive DNA hairpins on a DNA origami. Signal propagation is demonstrated across transmission lines of different lengths and orientations and logic gates are modularly combined into circuits that establish the universality of our approach. Because reactions preferentially occur between neighbours, identical DNA hairpins can be reused across circuits. Co-localization of circuit elements decreases computation time from hours to minutes compared to circuits with diffusible components. Detailed computational models enable predictive circuit design. We anticipate our approach will motivate using spatial constraints for future molecular control circuit designs.
Modelling and Holographic Visualization of Space Radiation-Induced DNA Damage
NASA Technical Reports Server (NTRS)
Plante, Ianik
2017-01-01
Space radiation is composed by a mixture of ions of different energies. Among these, heavy inos are of particular importance because their health effects are poorly understood. In. the recent years, a software named RITRACKS (Relativistic Ion Tracks) was developed to simulate the detailed radiation track structure, several DNA models and DNA damage. As the DNA structure is complex due to packing, it is difficult to the damage using a regular computer screen.
A Novel Computational Method to Reduce Leaky Reaction in DNA Strand Displacement
Li, Xin; Wang, Xun; Song, Tao; Lu, Wei; Chen, Zhihua; Shi, Xiaolong
2015-01-01
DNA strand displacement technique is widely used in DNA programming, DNA biosensors, and gene analysis. In DNA strand displacement, leaky reactions can cause DNA signals decay and detecting DNA signals fails. The mostly used method to avoid leakage is cleaning up after upstream leaky reactions, and it remains a challenge to develop reliable DNA strand displacement technique with low leakage. In this work, we address the challenge by experimentally evaluating the basic factors, including reaction time, ratio of reactants, and ion concentration to the leakage in DNA strand displacement. Specifically, fluorescent probes and a hairpin structure reporting DNA strand are designed to detect the output of DNA strand displacement, and thus can evaluate the leakage of DNA strand displacement reactions with different reaction time, ratios of reactants, and ion concentrations. From the obtained data, mathematical models for evaluating leakage are achieved by curve derivation. As a result, it is obtained that long time incubation, high concentration of fuel strand, and inappropriate amount of ion concentration can weaken leaky reactions. This contributes to a method to set proper reaction conditions to reduce leakage in DNA strand displacement. PMID:26491602
NASA Astrophysics Data System (ADS)
Kingsland, Addie
DNA is an amazing molecule which is the basic template for all genetics. It is the primary molecule for storing biological information, and has many applications in nanotechnology. Double-stranded DNA may contain mismatched base pairs beyond the Watson-Crick pairs guanine-cytosine and adenine-thymine. To date, no one has found a physical property of base pair mismatches which describes the behavior of naturally occurring mismatch repair enzymes. Many materials properties of DNA are also unknown, for instance, when pulling DNA in different configurations, different energy differences are observed with no obvious reason why. DNA mismatches also affect their local environment, for instance changing the quantum yield of nearby azobenzene moieties. We utilize molecular dynamics computer simulations to study the structure and dynamics for both matched and mismatched base pairs, within both biological and materials contexts, and in both equilibrium and biased dynamics. We show that mismatched pairs shift further in the plane normal to the DNA strand and are more likely to exhibit non-canonical structures, including the e-motif. Base pair mismatches alter their local environment, affecting the trans- to cis- photoisomerization quantum yield of azobenzene, as well as increasing the likelihood of observing the e-motif. We also show that by using simulated data, we can give new insights on theoretical models to calculate the energetics of pulling DNA strands apart. These results, all relatively inexpensive on modern computer hardware, can help guide the design of DNA-based nanotechnologies, as well as give new insights into the functioning of mismatch repair systems in cancer prevention.
WE-DE-202-01: Connecting Nanoscale Physics to Initial DNA Damage Through Track Structure Simulations
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schuemann, J.
Radiation therapy for the treatment of cancer has been established as a highly precise and effective way to eradicate a localized region of diseased tissue. To achieve further significant gains in the therapeutic ratio, we need to move towards biologically optimized treatment planning. To achieve this goal, we need to understand how the radiation-type dependent patterns of induced energy depositions within the cell (physics) connect via molecular, cellular and tissue reactions to treatment outcome such as tumor control and undesirable effects on normal tissue. Several computational biology approaches have been developed connecting physics to biology. Monte Carlo simulations are themore » most accurate method to calculate physical dose distributions at the nanometer scale, however simulations at the DNA scale are slow and repair processes are generally not simulated. Alternative models that rely on the random formation of individual DNA lesions within one or two turns of the DNA have been shown to reproduce the clusters of DNA lesions, including single strand breaks (SSBs), double strand breaks (DSBs) without the need for detailed track structure simulations. Efficient computational simulations of initial DNA damage induction facilitate computational modeling of DNA repair and other molecular and cellular processes. Mechanistic, multiscale models provide a useful conceptual framework to test biological hypotheses and help connect fundamental information about track structure and dosimetry at the sub-cellular level to dose-response effects on larger scales. In this symposium we will learn about the current state of the art of computational approaches estimating radiation damage at the cellular and sub-cellular scale. How can understanding the physics interactions at the DNA level be used to predict biological outcome? We will discuss if and how such calculations are relevant to advance our understanding of radiation damage and its repair, or, if the underlying biological processes are too complex for a mechanistic approach. Can computer simulations be used to guide future biological research? We will debate the feasibility of explaining biology from a physicists’ perspective. Learning Objectives: Understand the potential applications and limitations of computational methods for dose-response modeling at the molecular, cellular and tissue levels Learn about mechanism of action underlying the induction, repair and biological processing of damage to DNA and other constituents Understand how effects and processes at one biological scale impact on biological processes and outcomes on other scales J. Schuemann, NCI/NIH grantsS. McMahon, Funding: European Commission FP7 (grant EC FP7 MC-IOF-623630)« less
Marck, C
1988-01-01
DNA Strider is a new integrated DNA and Protein sequence analysis program written with the C language for the Macintosh Plus, SE and II computers. It has been designed as an easy to learn and use program as well as a fast and efficient tool for the day-to-day sequence analysis work. The program consists of a multi-window sequence editor and of various DNA and Protein analysis functions. The editor may use 4 different types of sequences (DNA, degenerate DNA, RNA and one-letter coded protein) and can handle simultaneously 6 sequences of any type up to 32.5 kB each. Negative numbering of the bases is allowed for DNA sequences. All classical restriction and translation analysis functions are present and can be performed in any order on any open sequence or part of a sequence. The main feature of the program is that the same analysis function can be repeated several times on different sequences, thus generating multiple windows on the screen. Many graphic capabilities have been incorporated such as graphic restriction map, hydrophobicity profile and the CAI plot- codon adaptation index according to Sharp and Li. The restriction sites search uses a newly designed fast hexamer look-ahead algorithm. Typical runtime for the search of all sites with a library of 130 restriction endonucleases is 1 second per 10,000 bases. The circular graphic restriction map of the pBR322 plasmid can be therefore computed from its sequence and displayed on the Macintosh Plus screen within 2 seconds and its multiline restriction map obtained in a scrolling window within 5 seconds. PMID:2832831
Computational Micromodel for Epigenetic Mechanisms
Raghavan, Karthika; Ruskin, Heather J.; Perrin, Dimitri; Goasmat, Francois; Burns, John
2010-01-01
Characterization of the epigenetic profile of humans since the initial breakthrough on the human genome project has strongly established the key role of histone modifications and DNA methylation. These dynamic elements interact to determine the normal level of expression or methylation status of the constituent genes in the genome. Recently, considerable evidence has been put forward to demonstrate that environmental stress implicitly alters epigenetic patterns causing imbalance that can lead to cancer initiation. This chain of consequences has motivated attempts to computationally model the influence of histone modification and DNA methylation in gene expression and investigate their intrinsic interdependency. In this paper, we explore the relation between DNA methylation and transcription and characterize in detail the histone modifications for specific DNA methylation levels using a stochastic approach. PMID:21152421
Teixeira, Erico S; Uppulury, Karthik; Privett, Austin J; Stopera, Christopher; McLaurin, Patrick M; Morales, Jorge A
2018-05-06
Proton cancer therapy (PCT) utilizes high-energy proton projectiles to obliterate cancerous tumors with low damage to healthy tissues and without the side effects of X-ray therapy. The healing action of the protons results from their damage on cancerous cell DNA. Despite established clinical use, the chemical mechanisms of PCT reactions at the molecular level remain elusive. This situation prevents a rational design of PCT that can maximize its therapeutic power and minimize its side effects. The incomplete characterization of PCT reactions is partially due to the health risks associated with experimental/clinical techniques applied to human subjects. To overcome this situation, we are conducting time-dependent and non-adiabatic computer simulations of PCT reactions with the electron nuclear dynamics (END) method. Herein, we present a review of our previous and new END research on three fundamental types of PCT reactions: water radiolysis reactions, proton-induced DNA damage and electron-induced DNA damage. These studies are performed on the computational prototypes: proton + H₂O clusters, proton + DNA/RNA bases and + cytosine nucleotide, and electron + cytosine nucleotide + H₂O. These simulations provide chemical mechanisms and dynamical properties of the selected PCT reactions in comparison with available experimental and alternative computational results.
Sequence-based prediction of protein-binding sites in DNA: comparative study of two SVM models.
Park, Byungkyu; Im, Jinyong; Tuvshinjargal, Narankhuu; Lee, Wook; Han, Kyungsook
2014-11-01
As many structures of protein-DNA complexes have been known in the past years, several computational methods have been developed to predict DNA-binding sites in proteins. However, its inverse problem (i.e., predicting protein-binding sites in DNA) has received much less attention. One of the reasons is that the differences between the interaction propensities of nucleotides are much smaller than those between amino acids. Another reason is that DNA exhibits less diverse sequence patterns than protein. Therefore, predicting protein-binding DNA nucleotides is much harder than predicting DNA-binding amino acids. We computed the interaction propensity (IP) of nucleotide triplets with amino acids using an extensive dataset of protein-DNA complexes, and developed two support vector machine (SVM) models that predict protein-binding nucleotides from sequence data alone. One SVM model predicts protein-binding nucleotides using DNA sequence data alone, and the other SVM model predicts protein-binding nucleotides using both DNA and protein sequences. In a 10-fold cross-validation with 1519 DNA sequences, the SVM model that uses DNA sequence data only predicted protein-binding nucleotides with an accuracy of 67.0%, an F-measure of 67.1%, and a Matthews correlation coefficient (MCC) of 0.340. With an independent dataset of 181 DNAs that were not used in training, it achieved an accuracy of 66.2%, an F-measure 66.3% and a MCC of 0.324. Another SVM model that uses both DNA and protein sequences achieved an accuracy of 69.6%, an F-measure of 69.6%, and a MCC of 0.383 in a 10-fold cross-validation with 1519 DNA sequences and 859 protein sequences. With an independent dataset of 181 DNAs and 143 proteins, it showed an accuracy of 67.3%, an F-measure of 66.5% and a MCC of 0.329. Both in cross-validation and independent testing, the second SVM model that used both DNA and protein sequence data showed better performance than the first model that used DNA sequence data. To the best of our knowledge, this is the first attempt to predict protein-binding nucleotides in a given DNA sequence from the sequence data alone. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Wills, Peter R
2016-03-13
This article reviews contributions to this theme issue covering the topic 'DNA as information' in relation to the structure of DNA, the measure of its information content, the role and meaning of information in biology and the origin of genetic coding as a transition from uninformed to meaningful computational processes in physical systems. © 2016 The Author(s).
Simulation of the charge migration in DNA under irradiation with heavy ions.
Belov, Oleg V; Boyda, Denis L; Plante, Ianik; Shirmovsky, Sergey Eh
2015-01-01
A computer model to simulate the processes of charge injection and migration through DNA after irradiation by a heavy charged particle was developed. The most probable sites of charge injection were obtained by merging spatial models of short DNA sequence and a single 1 GeV/u iron particle track simulated by the code RITRACKS (Relativistic Ion Tracks). Charge migration was simulated by using a quantum-classical nonlinear model of the DNA-charge system. It was found that charge migration depends on the environmental conditions. The oxidative damage in DNA occurring during hole migration was simulated concurrently, which allowed the determination of probable locations of radiation-induced DNA lesions.
Moghadam, Neda Hosseinpour; Salehzadeh, Sadegh; Shahabadi, Nahid
2017-09-02
The interaction of calf thymus DNA with nevirapine at physiological pH was studied by using absorption, circular dichroism, viscosity, differential pulse voltammetry, fluorescence techniques, salt effect studies and computational methods. The drug binds to ct-DNA in a groove binding mode, as shown by slight variation in the viscosity of ct-DNA. Furthermore, competitive fluorimetric studies with Hoechst 33258 indicate that nevirapine binds to DNA via groove binding. Moreover, the structure of nevirapine was optimized by DFT calculations and was used for the molecular docking calculations. The molecular docking results suggested that nevirapine prefers to bind on the minor groove of ct-DNA.
In Silico Design and Characterization of DNA Nanomaterials
NASA Astrophysics Data System (ADS)
Nash, Jessica A.
Deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) function biologically as carriers of genetic information. However, due to their ability to self-assemble via base pairing, nucleic acid molecules have become widely used in nanotechnology. In this dissertation, in silico techniques are used to probe the structure-property relationships of nucleic acid based nanomaterials. In Part 1, computational methods are employed to formulate nanoparticle design rules for applications in nucleic acid packaging and delivery. Nanoparticles (NPs) play increasingly important roles in nanomedicine, where the surface chemistry allows for control over interactions with biomolecules. Understanding how DNA and RNA compaction occurs is relevant to biological systems and systems in nanotechnology, and critical for the development of more efficient and effective nanoparticle carriers. Computational modeling allows for the description of bio-nano systems and processes with unprecedented detail, and can provide insights and guidelines for the creation of new nanomaterials. Using all-atom molecular dynamics simulations, the effect of nanoparticle surface chemistry, size, and solvent ionic strength on interactions with DNA and RNA are reported. In Chapter 2, a systematic study of the effect of nanoparticle charge on ability to bend and wrap short sequences of DNA and RNA is presented. To cause bending of DNA, a nanoparticle charge of at least +30 is required. Higher nanoparticle charges cause a greater degree of compaction. For RNA, however, charged ligand end-groups bind internally and prevent RNA bending. Nanoparticles were designed to test the influence of NP ligand shell shape and length on RNA binding using these results. In Chapter 3, all-atom simulation of NPs with long double stranded RNA are reported. Simulations show that by shortening NP ligand length, double stranded RNA can be wrapped. In Chapter 4, we consider compaction of long DNA by nanoparticles. NPs with +120 charge can fully compact DNA, but the wrapping is unordered on the surface. Chapter 5 reports the influence of NPs on the structure of single stranded DNA and RNA, showing that NPs have a greater influence on poly-pyrimidine strands than poly-purine strands, and can interrupt hydrogen bonds and pi-pi stacking. In Part II of this dissertation, computational techniques are applied to study DNA tiles and origami. Due to base-pairing DNA can be used to place objects with nanoscale precision, with applications in nanoscience and nanomedicine. Chapter 6 presents the development of anticoagulants using DNA weave tiles and aptamers. More effective anticoagulants can be created by varying the DNA aptamer used, and increasing local concentration by attaching aptamers to a DNA tile. Molecular dynamics simulations show that increasing the number of helices on a DNA weave tile increases tile flexibility. Chapter 7 introduces a tool developed for visualization of DNA origami design. We develop circle map visualizations for DNA origami and maps of the base composition, allowing for visualizations of DNA origami that were not previously available. This tool is currently available online via nanohub (open source) for users around the world. The results reported here provide a fundamental understanding of the behavior of DNA systems in nanotechnology. Results are expected to aid in the development of more effective NP compaction agents, DNA delivery vehicles, and DNA origami design.
The generative power of weighted one-sided and regular sticker systems
NASA Astrophysics Data System (ADS)
Siang, Gan Yee; Heng, Fong Wan; Sarmin, Nor Haniza; Turaev, Sherzod
2014-06-01
Sticker systems were introduced in 1998 as one of the DNA computing models by using the recombination behavior of DNA molecules. The Watson-Crick complementary principle of DNA molecules is abstractly used in the sticker systems to perform the computation of sticker systems. In this paper, the generative power of weighted one-sided sticker systems and weighted regular sticker systems are investigated. Moreover, the relationship of the families of languages generated by these two variants of sticker systems to the Chomsky hierarchy is also presented.
Biomedical Requirements for High Productivity Computing Systems
2005-04-01
server at http://www.ncbi.nlm.nih.gov/BLAST/. There are many variants of BLAST, including: 1. BLASTN - Compares a DNA query to a DNA database. Searches ...database (3 reading frames from each strand of the DNA) searching . 13 4. TBLASTN - Compares a protein query to a DNA database, in the 6 possible...the molecular during this phase. After eliminating molecules that could not match the query , an atom-by-atom search for the molecules in conducted
DNA Cryptography and Deep Learning using Genetic Algorithm with NW algorithm for Key Generation.
Kalsi, Shruti; Kaur, Harleen; Chang, Victor
2017-12-05
Cryptography is not only a science of applying complex mathematics and logic to design strong methods to hide data called as encryption, but also to retrieve the original data back, called decryption. The purpose of cryptography is to transmit a message between a sender and receiver such that an eavesdropper is unable to comprehend it. To accomplish this, not only we need a strong algorithm, but a strong key and a strong concept for encryption and decryption process. We have introduced a concept of DNA Deep Learning Cryptography which is defined as a technique of concealing data in terms of DNA sequence and deep learning. In the cryptographic technique, each alphabet of a letter is converted into a different combination of the four bases, namely; Adenine (A), Cytosine (C), Guanine (G) and Thymine (T), which make up the human deoxyribonucleic acid (DNA). Actual implementations with the DNA don't exceed laboratory level and are expensive. To bring DNA computing on a digital level, easy and effective algorithms are proposed in this paper. In proposed work we have introduced firstly, a method and its implementation for key generation based on the theory of natural selection using Genetic Algorithm with Needleman-Wunsch (NW) algorithm and Secondly, a method for implementation of encryption and decryption based on DNA computing using biological operations Transcription, Translation, DNA Sequencing and Deep Learning.
The Crystal Structure of TAL Effector PthXo1 Bound to Its DNA Target
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mak, Amanda Nga-Sze; Bradley, Philip; Cernadas, Raul A.
2012-02-10
DNA recognition by TAL effectors is mediated by tandem repeats, each 33 to 35 residues in length, that specify nucleotides via unique repeat-variable diresidues (RVDs). The crystal structure of PthXo1 bound to its DNA target was determined by high-throughput computational structure prediction and validated by heavy-atom derivatization. Each repeat forms a left-handed, two-helix bundle that presents an RVD-containing loop to the DNA. The repeats self-associate to form a right-handed superhelix wrapped around the DNA major groove. The first RVD residue forms a stabilizing contact with the protein backbone, while the second makes a base-specific contact to the DNA sense strand.more » Two degenerate amino-terminal repeats also interact with the DNA. Containing several RVDs and noncanonical associations, the structure illustrates the basis of TAL effector-DNA recognition.« less
An anti-DNA antibody prefers damaged dsDNA over native.
Akberova, N I; Zhmurov, A A; Nevzorova, T A; Litvinov, R I
2017-01-01
DNA-protein interactions, including DNA-antibody complexes, have both fundamental and practical significance. In particular, antibodies against double-stranded DNA play an important role in the pathogenesis of autoimmune diseases. Elucidation of structural mechanisms of an antigen recognition and interaction of anti-DNA antibodies provides a basis for understanding the role of DNA-containing immune complexes in human pathologies and for new treatments. Here we used Molecular Dynamic simulations of bimolecular complexes of a segment of dsDNA with a monoclonal anti-DNA antibody's Fab-fragment to obtain detailed structural and physical characteristics of the dynamic intermolecular interactions. Using a computationally modified crystal structure of a Fab-DNA complex (PDB: 3VW3), we studied in silico equilibrium Molecular Dynamics of the Fab-fragment associated with two homologous dsDNA fragments, containing or not containing dimerized thymine, a product of DNA photodamage. The Fab-fragment interactions with the thymine dimer-containing DNA was thermodynamically more stable than with the native DNA. The amino acid residues constituting a paratope and the complementary nucleotide epitopes for both Fab-DNA constructs were identified. Stacking and electrostatic interactions were shown to play the main role in the antibody-dsDNA contacts, while hydrogen bonds were less significant. The aggregate of data show that the chemically modified dsDNA (containing a covalent thymine dimer) has a higher affinity toward the antibody and forms a stronger immune complex. These findings provide a mechanistic insight into formation and properties of the pathogenic anti-DNA antibodies in autoimmune diseases, such as systemic lupus erythematosus, associated with skin photosensibilization and DNA photodamage.
DNA nanotechnology: a future perspective
2013-01-01
In addition to its genetic function, DNA is one of the most distinct and smart self-assembling nanomaterials. DNA nanotechnology exploits the predictable self-assembly of DNA oligonucleotides to design and assemble innovative and highly discrete nanostructures. Highly ordered DNA motifs are capable of providing an ultra-fine framework for the next generation of nanofabrications. The majority of these applications are based upon the complementarity of DNA base pairing: adenine with thymine, and guanine with cytosine. DNA provides an intelligent route for the creation of nanoarchitectures with programmable and predictable patterns. DNA strands twist along one helix for a number of bases before switching to the other helix by passing through a crossover junction. The association of two crossovers keeps the helices parallel and holds them tightly together, allowing the assembly of bigger structures. Because of the DNA molecule's unique and novel characteristics, it can easily be applied in a vast variety of multidisciplinary research areas like biomedicine, computer science, nano/optoelectronics, and bionanotechnology. PMID:23497147
Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions
Gardner, Shea N [San Leandro, CA; Mariella, Jr., Raymond P.; Christian, Allen T [Tracy, CA; Young, Jennifer A [Berkeley, CA; Clague, David S [Livermore, CA
2011-01-18
A method of fabricating a DNA molecule of user-defined sequence. The method comprises the steps of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an even or odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths. In one embodiment starting sequence fragments are of different lengths, n, n+1, n+2, etc.
Effects of sequence on DNA wrapping around histones
NASA Astrophysics Data System (ADS)
Ortiz, Vanessa
2011-03-01
A central question in biophysics is whether the sequence of a DNA strand affects its mechanical properties. In epigenetics, these are thought to influence nucleosome positioning and gene expression. Theoretical and experimental attempts to answer this question have been hindered by an inability to directly resolve DNA structure and dynamics at the base-pair level. In our previous studies we used a detailed model of DNA to measure the effects of sequence on the stability of naked DNA under bending. Sequence was shown to influence DNA's ability to form kinks, which arise when certain motifs slide past others to form non-native contacts. Here, we have now included histone-DNA interactions to see if the results obtained for naked DNA are transferable to the problem of nucleosome positioning. Different DNA sequences interacting with the histone protein complex are studied, and their equilibrium and mechanical properties are compared among themselves and with the naked case. NLM training grant to the Computation and Informatics in Biology and Medicine Training Program (NLM T15LM007359).
NASA Astrophysics Data System (ADS)
Yang, Bin; Zhang, Xiao-Bing; Kang, Li-Ping; Huang, Zhi-Mei; Shen, Guo-Li; Yu, Ru-Qin; Tan, Weihong
2014-07-01
DNA strand displacement cascades have been engineered to construct various fascinating DNA circuits. However, biological applications are limited by the insufficient cellular internalization of naked DNA structures, as well as the separated multicomponent feature. In this work, these problems are addressed by the development of a novel DNA nanodevice, termed intelligent layered nanoflare, which integrates DNA computing at the nanoscale, via the self-assembly of DNA flares on a single gold nanoparticle. As a ``lab-on-a-nanoparticle'', the intelligent layered nanoflare could be engineered to perform a variety of Boolean logic gate operations, including three basic logic gates, one three-input AND gate, and two complex logic operations, in a digital non-leaky way. In addition, the layered nanoflare can serve as a programmable strategy to sequentially tune the size of nanoparticles, as well as a new fingerprint spectrum technique for intelligent multiplex biosensing. More importantly, the nanoflare developed here can also act as a single entity for intracellular DNA logic gate delivery, without the need of commercial transfection agents or other auxiliary carriers. By incorporating DNA circuits on nanoparticles, the presented layered nanoflare will broaden the applications of DNA circuits in biological systems, and facilitate the development of DNA nanotechnology.DNA strand displacement cascades have been engineered to construct various fascinating DNA circuits. However, biological applications are limited by the insufficient cellular internalization of naked DNA structures, as well as the separated multicomponent feature. In this work, these problems are addressed by the development of a novel DNA nanodevice, termed intelligent layered nanoflare, which integrates DNA computing at the nanoscale, via the self-assembly of DNA flares on a single gold nanoparticle. As a ``lab-on-a-nanoparticle'', the intelligent layered nanoflare could be engineered to perform a variety of Boolean logic gate operations, including three basic logic gates, one three-input AND gate, and two complex logic operations, in a digital non-leaky way. In addition, the layered nanoflare can serve as a programmable strategy to sequentially tune the size of nanoparticles, as well as a new fingerprint spectrum technique for intelligent multiplex biosensing. More importantly, the nanoflare developed here can also act as a single entity for intracellular DNA logic gate delivery, without the need of commercial transfection agents or other auxiliary carriers. By incorporating DNA circuits on nanoparticles, the presented layered nanoflare will broaden the applications of DNA circuits in biological systems, and facilitate the development of DNA nanotechnology. Electronic supplementary information (ESI) available: Additional figures (Table S1, Fig. S1-S5). See DOI: 10.1039/c4nr01676a
WE-DE-202-00: Connecting Radiation Physics with Computational Biology
DOE Office of Scientific and Technical Information (OSTI.GOV)
NONE
Radiation therapy for the treatment of cancer has been established as a highly precise and effective way to eradicate a localized region of diseased tissue. To achieve further significant gains in the therapeutic ratio, we need to move towards biologically optimized treatment planning. To achieve this goal, we need to understand how the radiation-type dependent patterns of induced energy depositions within the cell (physics) connect via molecular, cellular and tissue reactions to treatment outcome such as tumor control and undesirable effects on normal tissue. Several computational biology approaches have been developed connecting physics to biology. Monte Carlo simulations are themore » most accurate method to calculate physical dose distributions at the nanometer scale, however simulations at the DNA scale are slow and repair processes are generally not simulated. Alternative models that rely on the random formation of individual DNA lesions within one or two turns of the DNA have been shown to reproduce the clusters of DNA lesions, including single strand breaks (SSBs), double strand breaks (DSBs) without the need for detailed track structure simulations. Efficient computational simulations of initial DNA damage induction facilitate computational modeling of DNA repair and other molecular and cellular processes. Mechanistic, multiscale models provide a useful conceptual framework to test biological hypotheses and help connect fundamental information about track structure and dosimetry at the sub-cellular level to dose-response effects on larger scales. In this symposium we will learn about the current state of the art of computational approaches estimating radiation damage at the cellular and sub-cellular scale. How can understanding the physics interactions at the DNA level be used to predict biological outcome? We will discuss if and how such calculations are relevant to advance our understanding of radiation damage and its repair, or, if the underlying biological processes are too complex for a mechanistic approach. Can computer simulations be used to guide future biological research? We will debate the feasibility of explaining biology from a physicists’ perspective. Learning Objectives: Understand the potential applications and limitations of computational methods for dose-response modeling at the molecular, cellular and tissue levels Learn about mechanism of action underlying the induction, repair and biological processing of damage to DNA and other constituents Understand how effects and processes at one biological scale impact on biological processes and outcomes on other scales J. Schuemann, NCI/NIH grantsS. McMahon, Funding: European Commission FP7 (grant EC FP7 MC-IOF-623630)« less
Dunn, Katherine E; Trefzer, Martin A; Johnson, Steven; Tyrrell, Andy M
2016-08-01
Molecular computation with DNA has great potential for low power, highly parallel information processing in a biological or biochemical context. However, significant challenges remain for the field of DNA computation. New technology is needed to allow multiplexed label-free readout and to enable regulation of molecular state without addition of new DNA strands. These capabilities could be provided by hybrid bioelectronic systems in which biomolecular computing is integrated with conventional electronics through immobilization of DNA machines on the surface of electronic circuitry. Here we present a quantitative experimental analysis of a surface-immobilized OR gate made from DNA and driven by strand displacement. The purpose of our work is to examine the performance of a simple representative surface-immobilized DNA logic machine, to provide valuable information for future work on hybrid bioelectronic systems involving DNA devices. We used a quartz crystal microbalance to examine a DNA monolayer containing approximately 5×10(11)gatescm(-2), with an inter-gate separation of approximately 14nm, and we found that the ensemble of gates took approximately 6min to switch. The gates could be switched repeatedly, but the switching efficiency was significantly degraded on the second and subsequent cycles when the binding site for the input was near to the surface. Otherwise, the switching efficiency could be 80% or better, and the power dissipated by the ensemble of gates during switching was approximately 0.1nWcm(-2), which is orders of magnitude less than the power dissipated during switching of an equivalent array of transistors. We propose an architecture for hybrid DNA-electronic systems in which information can be stored and processed, either in series or in parallel, by a combination of molecular machines and conventional electronics. In this architecture, information can flow freely and in both directions between the solution-phase and the underlying electronics via surface-immobilized DNA machines that provide the interface between the molecular and electronic domains. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Subacute Low Dose Nerve Agent Exposure Causes DNA Fragmentation in Guinea Pig Leukocytes
2005-10-01
1 SUBACUTE LOW DOSE NERVE AGENT EXPOSURE CAUSES DNA FRAGMENTATION IN GUINEA PIG LEUKOCYTES. Jitendra R. Dave1, John R. Moffett1, Sally M...DNA fragmentation in blood leukocytes from guinea pigs by ‘Comet’ assay after exposure to soman at doses ranging from 0.1LD50 to 0.4 LD50, once per...computer. Data obtained for exposure to soman demonstrated significant increases in DNA fragmentation in circulating leukocytes in CWNA treated guinea pigs as
Synthetic Biology: Knowledge Accessed by Everyone (Open Sources)
ERIC Educational Resources Information Center
Sánchez Reyes, Patricia Margarita
2016-01-01
Using the principles of biology, along with engineering and with the help of computer, scientists manage to copy. DNA sequences from nature and use them to create new organisms. DNA is created through engineering and computer science managing to create life inside a laboratory. We cannot dismiss the role that synthetic biology could lead in…
Zhang, Li; Wang, Zhong-Xia; Liang, Ru-Ping; Qiu, Jian-Ding
2013-07-16
Utilizing the principles of metal-ion-mediated base pairs (C-Ag-C and T-Hg-T), the pH-sensitive conformational transition of C-rich DNA strand, and the ligand-exchange process triggered by DL-dithiothreitol (DTT), a system of colorimetric logic gates (YES, AND, INHIBIT, and XOR) can be rationally constructed based on the aggregation of the DNA-modified Au NPs. The proposed logic operation system is simple, which consists of only T-/C-rich DNA-modified Au NPs, and it is unnecessary to exquisitely design and alter the DNA sequence for different multiple molecular logic operations. The nonnatural base pairing combined with unique optical properties of Au NPs promises great potential in multiplexed ion sensing, molecular-scale computers, and other computational logic devices.
Muhire, Brejnev Muhizi; Golden, Michael; Murrell, Ben; Lefeuvre, Pierre; Lett, Jean-Michel; Gray, Alistair; Poon, Art Y F; Ngandu, Nobubelo Kwanele; Semegni, Yves; Tanov, Emil Pavlov; Monjane, Adérito Luis; Harkins, Gordon William; Varsani, Arvind; Shepherd, Dionne Natalie; Martin, Darren Patrick
2014-02-01
Single-stranded DNA (ssDNA) viruses have genomes that are potentially capable of forming complex secondary structures through Watson-Crick base pairing between their constituent nucleotides. A few of the structural elements formed by such base pairings are, in fact, known to have important functions during the replication of many ssDNA viruses. Unknown, however, are (i) whether numerous additional ssDNA virus genomic structural elements predicted to exist by computational DNA folding methods actually exist and (ii) whether those structures that do exist have any biological relevance. We therefore computationally inferred lists of the most evolutionarily conserved structures within a diverse selection of animal- and plant-infecting ssDNA viruses drawn from the families Circoviridae, Anelloviridae, Parvoviridae, Nanoviridae, and Geminiviridae and analyzed these for evidence of natural selection favoring the maintenance of these structures. While we find evidence that is consistent with purifying selection being stronger at nucleotide sites that are predicted to be base paired than at sites predicted to be unpaired, we also find strong associations between sites that are predicted to pair with one another and site pairs that are apparently coevolving in a complementary fashion. Collectively, these results indicate that natural selection actively preserves much of the pervasive secondary structure that is evident within eukaryote-infecting ssDNA virus genomes and, therefore, that much of this structure is biologically functional. Lastly, we provide examples of various highly conserved but completely uncharacterized structural elements that likely have important functions within some of the ssDNA virus genomes analyzed here.
Muhire, Brejnev Muhizi; Golden, Michael; Murrell, Ben; Lefeuvre, Pierre; Lett, Jean-Michel; Gray, Alistair; Poon, Art Y. F.; Ngandu, Nobubelo Kwanele; Semegni, Yves; Tanov, Emil Pavlov; Monjane, Adérito Luis; Harkins, Gordon William; Varsani, Arvind; Shepherd, Dionne Natalie
2014-01-01
Single-stranded DNA (ssDNA) viruses have genomes that are potentially capable of forming complex secondary structures through Watson-Crick base pairing between their constituent nucleotides. A few of the structural elements formed by such base pairings are, in fact, known to have important functions during the replication of many ssDNA viruses. Unknown, however, are (i) whether numerous additional ssDNA virus genomic structural elements predicted to exist by computational DNA folding methods actually exist and (ii) whether those structures that do exist have any biological relevance. We therefore computationally inferred lists of the most evolutionarily conserved structures within a diverse selection of animal- and plant-infecting ssDNA viruses drawn from the families Circoviridae, Anelloviridae, Parvoviridae, Nanoviridae, and Geminiviridae and analyzed these for evidence of natural selection favoring the maintenance of these structures. While we find evidence that is consistent with purifying selection being stronger at nucleotide sites that are predicted to be base paired than at sites predicted to be unpaired, we also find strong associations between sites that are predicted to pair with one another and site pairs that are apparently coevolving in a complementary fashion. Collectively, these results indicate that natural selection actively preserves much of the pervasive secondary structure that is evident within eukaryote-infecting ssDNA virus genomes and, therefore, that much of this structure is biologically functional. Lastly, we provide examples of various highly conserved but completely uncharacterized structural elements that likely have important functions within some of the ssDNA virus genomes analyzed here. PMID:24284329
Self-assembly programming of DNA polyominoes.
Ong, Hui San; Syafiq-Rahim, Mohd; Kasim, Noor Hayaty Abu; Firdaus-Raih, Mohd; Ramlan, Effirul Ikhwan
2016-10-20
Fabrication of functional DNA nanostructures operating at a cellular level has been accomplished through molecular programming techniques such as DNA origami and single-stranded tiles (SST). During implementation, restrictive and constraint dependent designs are enforced to ensure conformity is attainable. We propose a concept of DNA polyominoes that promotes flexibility in molecular programming. The fabrication of complex structures is achieved through self-assembly of distinct heterogeneous shapes (i.e., self-organised optimisation among competing DNA basic shapes) with total flexibility during the design and assembly phases. In this study, the plausibility of the approach is validated using the formation of multiple 3×4 DNA network fabricated from five basic DNA shapes with distinct configurations (monomino, tromino and tetrominoes). Computational tools to aid the design of compatible DNA shapes and the structure assembly assessment are presented. The formations of the desired structures were validated using Atomic Force Microscopy (AFM) imagery. Five 3×4 DNA networks were successfully constructed using combinatorics of these five distinct DNA heterogeneous shapes. Our findings revealed that the construction of DNA supra-structures could be achieved using a more natural-like orchestration as compared to the rigid and restrictive conventional approaches adopted previously. Copyright © 2016 Elsevier B.V. All rights reserved.
HTSFinder: Powerful Pipeline of DNA Signature Discovery by Parallel and Distributed Computing
Karimi, Ramin; Hajdu, Andras
2016-01-01
Comprehensive effort for low-cost sequencing in the past few years has led to the growth of complete genome databases. In parallel with this effort, a strong need, fast and cost-effective methods and applications have been developed to accelerate sequence analysis. Identification is the very first step of this task. Due to the difficulties, high costs, and computational challenges of alignment-based approaches, an alternative universal identification method is highly required. Like an alignment-free approach, DNA signatures have provided new opportunities for the rapid identification of species. In this paper, we present an effective pipeline HTSFinder (high-throughput signature finder) with a corresponding k-mer generator GkmerG (genome k-mers generator). Using this pipeline, we determine the frequency of k-mers from the available complete genome databases for the detection of extensive DNA signatures in a reasonably short time. Our application can detect both unique and common signatures in the arbitrarily selected target and nontarget databases. Hadoop and MapReduce as parallel and distributed computing tools with commodity hardware are used in this pipeline. This approach brings the power of high-performance computing into the ordinary desktop personal computers for discovering DNA signatures in large databases such as bacterial genome. A considerable number of detected unique and common DNA signatures of the target database bring the opportunities to improve the identification process not only for polymerase chain reaction and microarray assays but also for more complex scenarios such as metagenomics and next-generation sequencing analysis. PMID:26884678
HTSFinder: Powerful Pipeline of DNA Signature Discovery by Parallel and Distributed Computing.
Karimi, Ramin; Hajdu, Andras
2016-01-01
Comprehensive effort for low-cost sequencing in the past few years has led to the growth of complete genome databases. In parallel with this effort, a strong need, fast and cost-effective methods and applications have been developed to accelerate sequence analysis. Identification is the very first step of this task. Due to the difficulties, high costs, and computational challenges of alignment-based approaches, an alternative universal identification method is highly required. Like an alignment-free approach, DNA signatures have provided new opportunities for the rapid identification of species. In this paper, we present an effective pipeline HTSFinder (high-throughput signature finder) with a corresponding k-mer generator GkmerG (genome k-mers generator). Using this pipeline, we determine the frequency of k-mers from the available complete genome databases for the detection of extensive DNA signatures in a reasonably short time. Our application can detect both unique and common signatures in the arbitrarily selected target and nontarget databases. Hadoop and MapReduce as parallel and distributed computing tools with commodity hardware are used in this pipeline. This approach brings the power of high-performance computing into the ordinary desktop personal computers for discovering DNA signatures in large databases such as bacterial genome. A considerable number of detected unique and common DNA signatures of the target database bring the opportunities to improve the identification process not only for polymerase chain reaction and microarray assays but also for more complex scenarios such as metagenomics and next-generation sequencing analysis.
Programmable chemical controllers made from DNA.
Chen, Yuan-Jyue; Dalchau, Neil; Srinivas, Niranjan; Phillips, Andrew; Cardelli, Luca; Soloveichik, David; Seelig, Georg
2013-10-01
Biological organisms use complex molecular networks to navigate their environment and regulate their internal state. The development of synthetic systems with similar capabilities could lead to applications such as smart therapeutics or fabrication methods based on self-organization. To achieve this, molecular control circuits need to be engineered to perform integrated sensing, computation and actuation. Here we report a DNA-based technology for implementing the computational core of such controllers. We use the formalism of chemical reaction networks as a 'programming language' and our DNA architecture can, in principle, implement any behaviour that can be mathematically expressed as such. Unlike logic circuits, our formulation naturally allows complex signal processing of intrinsically analogue biological and chemical inputs. Controller components can be derived from biologically synthesized (plasmid) DNA, which reduces errors associated with chemically synthesized DNA. We implement several building-block reaction types and then combine them into a network that realizes, at the molecular level, an algorithm used in distributed control systems for achieving consensus between multiple agents.
Holes influence the mutation spectrum of human mitochondrial DNA
NASA Astrophysics Data System (ADS)
Villagran, Martha; Miller, John
Mutations drive evolution and disease, showing highly non-random patterns of variant frequency vs. nucleotide position. We use computational DNA hole spectroscopy [M.Y. Suarez-Villagran & J.H. Miller, Sci. Rep. 5, 13571 (2015)] to reveal sites of enhanced hole probability in selected regions of human mitochondrial DNA. A hole is a mobile site of positive charge created when an electron is removed, for example by radiation or contact with a mutagenic agent. The hole spectra are quantum mechanically computed using a two-stranded tight binding model of DNA. We observe significant correlation between spectra of hole probabilities and of genetic variation frequencies from the MITOMAP database. These results suggest that hole-enhanced mutation mechanisms exert a substantial, perhaps dominant, influence on mutation patterns in DNA. One example is where a trapped hole induces a hydrogen bond shift, known as tautomerization, which then triggers a base-pair mismatch during replication. Our results deepen overall understanding of sequence specific mutation rates, encompassing both hotspots and cold spots, which drive molecular evolution.
Programmable chemical controllers made from DNA
NASA Astrophysics Data System (ADS)
Chen, Yuan-Jyue; Dalchau, Neil; Srinivas, Niranjan; Phillips, Andrew; Cardelli, Luca; Soloveichik, David; Seelig, Georg
2013-10-01
Biological organisms use complex molecular networks to navigate their environment and regulate their internal state. The development of synthetic systems with similar capabilities could lead to applications such as smart therapeutics or fabrication methods based on self-organization. To achieve this, molecular control circuits need to be engineered to perform integrated sensing, computation and actuation. Here we report a DNA-based technology for implementing the computational core of such controllers. We use the formalism of chemical reaction networks as a 'programming language' and our DNA architecture can, in principle, implement any behaviour that can be mathematically expressed as such. Unlike logic circuits, our formulation naturally allows complex signal processing of intrinsically analogue biological and chemical inputs. Controller components can be derived from biologically synthesized (plasmid) DNA, which reduces errors associated with chemically synthesized DNA. We implement several building-block reaction types and then combine them into a network that realizes, at the molecular level, an algorithm used in distributed control systems for achieving consensus between multiple agents.
Programmable chemical controllers made from DNA
Chen, Yuan-Jyue; Dalchau, Neil; Srinivas, Niranjan; Phillips, Andrew; Cardelli, Luca; Soloveichik, David; Seelig, Georg
2014-01-01
Biological organisms use complex molecular networks to navigate their environment and regulate their internal state. The development of synthetic systems with similar capabilities could lead to applications such as smart therapeutics or fabrication methods based on self-organization. To achieve this, molecular control circuits need to be engineered to perform integrated sensing, computation and actuation. Here we report a DNA-based technology for implementing the computational core of such controllers. We use the formalism of chemical reaction networks as a 'programming language', and our DNA architecture can, in principle, implement any behaviour that can be mathematically expressed as such. Unlike logic circuits, our formulation naturally allows complex signal processing of intrinsically analogue biological and chemical inputs. Controller components can be derived from biologically synthesized (plasmid) DNA, which reduces errors associated with chemically synthesized DNA. We implement several building-block reaction types and then combine them into a network that realizes, at the molecular level, an algorithm used in distributed control systems for achieving consensus between multiple agents. PMID:24077029
A stochastic reaction-diffusion model for protein aggregation on DNA
NASA Astrophysics Data System (ADS)
Voulgarakis, Nikolaos K.
Vital functions of DNA, such as transcription and packaging, depend on the proper clustering of proteins on the double strand. The present study investigates how the interplay between DNA allostery and electrostatic interactions affects protein clustering. The statistical analysis of a simple but transparent computational model reveals two major consequences of this interplay. First, depending on the protein and salt concentration, protein filaments exhibit a bimodal DNA stiffening and softening behavior. Second, within a certain domain of the control parameters, electrostatic interactions can cause energetic frustration that forces proteins to assemble in rigid spiral configurations. Such spiral filaments might trigger both positive and negative supercoiling, which can ultimately promote gene compaction and regulate the promoter. It has been experimentally shown that bacterial histone-like proteins assemble in similar spiral patterns and/or exhibit the same bimodal behavior. The proposed model can, thus, provide computational insights into the physical mechanisms used by proteins to control the mechanical properties of the DNA.
DNA profiles, computer searches, and the Fourth Amendment.
Kimel, Catherine W
2013-01-01
Pursuant to federal statutes and to laws in all fifty states, the United States government has assembled a database containing the DNA profiles of over eleven million citizens. Without judicial authorization, the government searches each of these profiles one-hundred thousand times every day, seeking to link database subjects to crimes they are not suspected of committing. Yet, courts and scholars that have addressed DNA databasing have focused their attention almost exclusively on the constitutionality of the government's seizure of the biological samples from which the profiles are generated. This Note fills a gap in the scholarship by examining the Fourth Amendment problems that arise when the government searches its vast DNA database. This Note argues that each attempt to match two DNA profiles constitutes a Fourth Amendment search because each attempted match infringes upon database subjects' expectations of privacy in their biological relationships and physical movements. The Note further argues that database searches are unreasonable as they are currently conducted, and it suggests an adaptation of computer-search procedures to remedy the constitutional deficiency.
Paternity testing that involves a DNA mixture.
Mortera, Julia; Vecchiotti, Carla; Zoppis, Silvia; Merigioli, Sara
2016-07-01
Here we analyse a complex disputed paternity case, where the DNA of the putative father was extracted from his corpse that had been inhumed for over 20 years. This DNA was contaminated and appears to be a mixture of at least two individuals. Furthermore, the mother's DNA was not available. The DNA mixture was analysed so as to predict the most probable genotypes of each contributor. The major contributor's profile was then used to compute the likelihood ratio for paternity. We also show how to take into account a dropout allele and the possibility of mutation in paternity testing. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Diao, Y.; Hinson, K.; Sun, Y.; Arsuaga, J.
2015-10-01
Kinetoplast DNA (kDNA) is the mitochondrial of DNA of disease causing organisms such as Trypanosoma Brucei (T. Brucei) and Trypanosoma Cruzi (T. Cruzi). In most organisms, KDNA is made of thousands of small circular DNA molecules that are highly condensed and topologically linked forming a gigantic planar network. In our previous work we have developed mathematical and computational models to test the confinement hypothesis, that is that the formation of kDNA minicircle networks is a product of the high DNA condensation achieved in the mitochondrion of these organisms. In these studies we studied three parameters that characterize the growth of the network topology upon confinement: the critical percolation density, the mean saturation density and the mean valence (i.e. the number of mini circles topologically linked to any chosen minicircle). Experimental results on insect-infecting organisms showed that the mean valence is equal to three, forming a structure similar to those found in medieval chain-mails. These same studies hypothesized that this value of the mean valence was driven by the DNA excluded volume. Here we extend our previous work on kDNA by characterizing the effects of DNA excluded volume on the three descriptive parameters. Using computer simulations of polymer swelling we found that (1) in agreement with previous studies the linking probability of two minicircles does not decrease linearly with the distance between the two minicircles, (2) the mean valence grows linearly with the density of minicircles and decreases with the thickness of the excluded volume, (3) the critical percolation and mean saturation densities grow linearly with the thickness of the excluded volume. Our results therefore suggest that the swelling of the DNA molecule, due to electrostatic interactions, has relatively mild implications on the overall topology of the network. Our results also validate our topological descriptors since they appear to reflect the changes in the physical properties of the polymeric chains and at the same time remain faithful to their description of kDNA.
Probabilistic simple sticker systems
NASA Astrophysics Data System (ADS)
Selvarajoo, Mathuri; Heng, Fong Wan; Sarmin, Nor Haniza; Turaev, Sherzod
2017-04-01
A model for DNA computing using the recombination behavior of DNA molecules, known as a sticker system, was introduced by by L. Kari, G. Paun, G. Rozenberg, A. Salomaa, and S. Yu in the paper entitled DNA computing, sticker systems and universality from the journal of Acta Informatica vol. 35, pp. 401-420 in the year 1998. A sticker system uses the Watson-Crick complementary feature of DNA molecules: starting from the incomplete double stranded sequences, and iteratively using sticking operations until a complete double stranded sequence is obtained. It is known that sticker systems with finite sets of axioms and sticker rules generate only regular languages. Hence, different types of restrictions have been considered to increase the computational power of sticker systems. Recently, a variant of restricted sticker systems, called probabilistic sticker systems, has been introduced [4]. In this variant, the probabilities are initially associated with the axioms, and the probability of a generated string is computed by multiplying the probabilities of all occurrences of the initial strings in the computation of the string. Strings for the language are selected according to some probabilistic requirements. In this paper, we study fundamental properties of probabilistic simple sticker systems. We prove that the probabilistic enhancement increases the computational power of simple sticker systems.
Modelling of DNA-protein recognition
NASA Technical Reports Server (NTRS)
Rein, R.; Garduno, R.; Colombano, S.; Nir, S.; Haydock, K.; Macelroy, R. D.
1980-01-01
Computer model-building procedures using stereochemical principles together with theoretical energy calculations appear to be, at this stage, the most promising route toward the elucidation of DNA-protein binding schemes and recognition principles. A review of models and bonding principles is conducted and approaches to modeling are considered, taking into account possible di-hydrogen-bonding schemes between a peptide and a base (or a base pair) of a double-stranded nucleic acid in the major groove, aspects of computer graphic modeling, and a search for isogeometric helices. The energetics of recognition complexes is discussed and several models for peptide DNA recognition are presented.
Cowell, Robert G
2018-05-04
Current models for single source and mixture samples, and probabilistic genotyping software based on them used for analysing STR electropherogram data, assume simple probability distributions, such as the gamma distribution, to model the allelic peak height variability given the initial amount of DNA prior to PCR amplification. Here we illustrate how amplicon number distributions, for a model of the process of sample DNA collection and PCR amplification, may be efficiently computed by evaluating probability generating functions using discrete Fourier transforms. Copyright © 2018 Elsevier B.V. All rights reserved.
Charge Structure and Counterion Distribution in Hexagonal DNA Liquid Crystal
Dai, Liang; Mu, Yuguang; Nordenskiöld, Lars; Lapp, Alain; van der Maarel, Johan R. C.
2007-01-01
A hexagonal liquid crystal of DNA fragments (double-stranded, 150 basepairs) with tetramethylammonium (TMA) counterions was investigated with small angle neutron scattering (SANS). We obtained the structure factors pertaining to the DNA and counterion density correlations with contrast matching in the water. Molecular dynamics (MD) computer simulation of a hexagonal assembly of nine DNA molecules showed that the inter-DNA distance fluctuates with a correlation time around 2 ns and a standard deviation of 8.5% of the interaxial spacing. The MD simulation also showed a minimal effect of the fluctuations in inter-DNA distance on the radial counterion density profile and significant penetration of the grooves by TMA. The radial density profile of the counterions was also obtained from a Monte Carlo (MC) computer simulation of a hexagonal array of charged rods with fixed interaxial spacing. Strong ordering of the counterions between the DNA molecules and the absence of charge fluctuations at longer wavelengths was shown by the SANS number and charge structure factors. The DNA-counterion and counterion structure factors are interpreted with the correlation functions derived from the Poisson-Boltzmann equation, MD, and MC simulation. Best agreement is observed between the experimental structure factors and the prediction based on the Poisson-Boltzmann equation and/or MC simulation. The SANS results show that TMA is too large to penetrate the grooves to a significant extent, in contrast to what is shown by MD simulation. PMID:17098791
Towards computational improvement of DNA database indexing and short DNA query searching.
Stojanov, Done; Koceski, Sašo; Mileva, Aleksandra; Koceska, Nataša; Bande, Cveta Martinovska
2014-09-03
In order to facilitate and speed up the search of massive DNA databases, the database is indexed at the beginning, employing a mapping function. By searching through the indexed data structure, exact query hits can be identified. If the database is searched against an annotated DNA query, such as a known promoter consensus sequence, then the starting locations and the number of potential genes can be determined. This is particularly relevant if unannotated DNA sequences have to be functionally annotated. However, indexing a massive DNA database and searching an indexed data structure with millions of entries is a time-demanding process. In this paper, we propose a fast DNA database indexing and searching approach, identifying all query hits in the database, without having to examine all entries in the indexed data structure, limiting the maximum length of a query that can be searched against the database. By applying the proposed indexing equation, the whole human genome could be indexed in 10 hours on a personal computer, under the assumption that there is enough RAM to store the indexed data structure. Analysing the methodology proposed by Reneker, we observed that hits at starting positions [Formula: see text] are not reported, if the database is searched against a query shorter than [Formula: see text] nucleotides, such that [Formula: see text] is the length of the DNA database words being mapped and [Formula: see text] is the length of the query. A solution of this drawback is also presented.
Second-generation DNA-templated macrocycle libraries for the discovery of bioactive small molecules.
Usanov, Dmitry L; Chan, Alix I; Maianti, Juan Pablo; Liu, David R
2018-07-01
DNA-encoded libraries have emerged as a widely used resource for the discovery of bioactive small molecules, and offer substantial advantages compared with conventional small-molecule libraries. Here, we have developed and streamlined multiple fundamental aspects of DNA-encoded and DNA-templated library synthesis methodology, including computational identification and experimental validation of a 20 × 20 × 20 × 80 set of orthogonal codons, chemical and computational tools for enhancing the structural diversity and drug-likeness of library members, a highly efficient polymerase-mediated template library assembly strategy, and library isolation and purification methods. We have integrated these improved methods to produce a second-generation DNA-templated library of 256,000 small-molecule macrocycles with improved drug-like physical properties. In vitro selection of this library for insulin-degrading enzyme affinity resulted in novel insulin-degrading enzyme inhibitors, including one of unusual potency and novel macrocycle stereochemistry (IC 50 = 40 nM). Collectively, these developments enable DNA-templated small-molecule libraries to serve as more powerful, accessible, streamlined and cost-effective tools for bioactive small-molecule discovery.
Computing exponentially faster: implementing a non-deterministic universal Turing machine using DNA
Currin, Andrew; Korovin, Konstantin; Ababi, Maria; Roper, Katherine; Kell, Douglas B.; Day, Philip J.
2017-01-01
The theory of computer science is based around universal Turing machines (UTMs): abstract machines able to execute all possible algorithms. Modern digital computers are physical embodiments of classical UTMs. For the most important class of problem in computer science, non-deterministic polynomial complete problems, non-deterministic UTMs (NUTMs) are theoretically exponentially faster than both classical UTMs and quantum mechanical UTMs (QUTMs). However, no attempt has previously been made to build an NUTM, and their construction has been regarded as impossible. Here, we demonstrate the first physical design of an NUTM. This design is based on Thue string rewriting systems, and thereby avoids the limitations of most previous DNA computing schemes: all the computation is local (simple edits to strings) so there is no need for communication, and there is no need to order operations. The design exploits DNA's ability to replicate to execute an exponential number of computational paths in P time. Each Thue rewriting step is embodied in a DNA edit implemented using a novel combination of polymerase chain reactions and site-directed mutagenesis. We demonstrate that the design works using both computational modelling and in vitro molecular biology experimentation: the design is thermodynamically favourable, microprogramming can be used to encode arbitrary Thue rules, all classes of Thue rule can be implemented, and non-deterministic rule implementation. In an NUTM, the resource limitation is space, which contrasts with classical UTMs and QUTMs where it is time. This fundamental difference enables an NUTM to trade space for time, which is significant for both theoretical computer science and physics. It is also of practical importance, for to quote Richard Feynman ‘there's plenty of room at the bottom’. This means that a desktop DNA NUTM could potentially utilize more processors than all the electronic computers in the world combined, and thereby outperform the world's current fastest supercomputer, while consuming a tiny fraction of its energy. PMID:28250099
USDA-ARS?s Scientific Manuscript database
A computer algorithm was created to inspect scanned images from DNA microarray slides developed to rapidly detect and genotype E. Coli O157 virulent strains. The algorithm computes centroid locations for signal and background pixels in RGB space and defines a plane perpendicular to the line connect...
Making Ordered DNA and Protein Structures from Computer-Printed Transparency Film Cut-Outs
ERIC Educational Resources Information Center
Jittivadhna, Karnyupha; Ruenwongsa, Pintip; Panijpan, Bhinyo
2009-01-01
Instructions are given for building physical scale models of ordered structures of B-form DNA, protein [alpha]-helix, and parallel and antiparallel protein [beta]-pleated sheets made from colored computer printouts designed for transparency film sheets. Cut-outs from these sheets are easily assembled. Conventional color coding for atoms are used…
XLS (c9orf142) is a new component of mammalian DNA double-stranded break repair.
Craxton, A; Somers, J; Munnur, D; Jukes-Jones, R; Cain, K; Malewicz, M
2015-06-01
Repair of double-stranded DNA breaks (DSBs) in mammalian cells primarily occurs by the non-homologous end-joining (NHEJ) pathway, which requires seven core proteins (Ku70/Ku86, DNA-PKcs (DNA-dependent protein kinase catalytic subunit), Artemis, XRCC4-like factor (XLF), XRCC4 and DNA ligase IV). Here we show using combined affinity purification and mass spectrometry that DNA-PKcs co-purifies with all known core NHEJ factors. Furthermore, we have identified a novel evolutionary conserved protein associated with DNA-PKcs-c9orf142. Computer-based modelling of c9orf142 predicted a structure very similar to XRCC4, hence we have named c9orf142-XLS (XRCC4-like small protein). Depletion of c9orf142/XLS in cells impaired DSB repair consistent with a defect in NHEJ. Furthermore, c9orf142/XLS interacted with other core NHEJ factors. These results demonstrate the existence of a new component of the NHEJ DNA repair pathway in mammalian cells.
Computer-aided design of DNA origami structures.
Selnihhin, Denis; Andersen, Ebbe Sloth
2015-01-01
The DNA origami method enables the creation of complex nanoscale objects that can be used to organize molecular components and to function as reconfigurable mechanical devices. Of relevance to synthetic biology, DNA origami structures can be delivered to cells where they can perform complicated sense-and-act tasks, and can be used as scaffolds to organize enzymes for enhanced synthesis. The design of DNA origami structures is a complicated matter and is most efficiently done using dedicated software packages. This chapter describes a procedure for designing DNA origami structures using a combination of state-of-the-art software tools. First, we introduce the basic method for calculating crossover positions between DNA helices and the standard crossover patterns for flat, square, and honeycomb DNA origami lattices. Second, we provide a step-by-step tutorial for the design of a simple DNA origami biosensor device, from schematic idea to blueprint creation and to 3D modeling and animation, and explain how careful modeling can facilitate later experimentation in the laboratory.
Context influences on TALE–DNA binding revealed by quantitative profiling
Rogers, Julia M.; Barrera, Luis A.; Reyon, Deepak; Sander, Jeffry D.; Kellis, Manolis; Joung, J Keith; Bulyk, Martha L.
2015-01-01
Transcription activator-like effector (TALE) proteins recognize DNA using a seemingly simple DNA-binding code, which makes them attractive for use in genome engineering technologies that require precise targeting. Although this code is used successfully to design TALEs to target specific sequences, off-target binding has been observed and is difficult to predict. Here we explore TALE–DNA interactions comprehensively by quantitatively assaying the DNA-binding specificities of 21 representative TALEs to ∼5,000–20,000 unique DNA sequences per protein using custom-designed protein-binding microarrays (PBMs). We find that protein context features exert significant influences on binding. Thus, the canonical recognition code does not fully capture the complexity of TALE–DNA binding. We used the PBM data to develop a computational model, Specificity Inference For TAL-Effector Design (SIFTED), to predict the DNA-binding specificity of any TALE. We provide SIFTED as a publicly available web tool that predicts potential genomic off-target sites for improved TALE design. PMID:26067805
Context influences on TALE-DNA binding revealed by quantitative profiling.
Rogers, Julia M; Barrera, Luis A; Reyon, Deepak; Sander, Jeffry D; Kellis, Manolis; Joung, J Keith; Bulyk, Martha L
2015-06-11
Transcription activator-like effector (TALE) proteins recognize DNA using a seemingly simple DNA-binding code, which makes them attractive for use in genome engineering technologies that require precise targeting. Although this code is used successfully to design TALEs to target specific sequences, off-target binding has been observed and is difficult to predict. Here we explore TALE-DNA interactions comprehensively by quantitatively assaying the DNA-binding specificities of 21 representative TALEs to ∼5,000-20,000 unique DNA sequences per protein using custom-designed protein-binding microarrays (PBMs). We find that protein context features exert significant influences on binding. Thus, the canonical recognition code does not fully capture the complexity of TALE-DNA binding. We used the PBM data to develop a computational model, Specificity Inference For TAL-Effector Design (SIFTED), to predict the DNA-binding specificity of any TALE. We provide SIFTED as a publicly available web tool that predicts potential genomic off-target sites for improved TALE design.
2010-01-01
Background The robust storage, updating and utilization of information are necessary for the maintenance and perpetuation of dynamic systems. These systems can exist as constructs of metal-oxide semiconductors and silicon, as in a digital computer, or in the "wetware" of organic compounds, proteins and nucleic acids that make up biological organisms. We propose that there are essential functional properties of centralized information-processing systems; for digital computers these properties reside in the computer's hard drive, and for eukaryotic cells they are manifest in the DNA and associated structures. Methods Presented herein is a descriptive framework that compares DNA and its associated proteins and sub-nuclear structure with the structure and function of the computer hard drive. We identify four essential properties of information for a centralized storage and processing system: (1) orthogonal uniqueness, (2) low level formatting, (3) high level formatting and (4) translation of stored to usable form. The corresponding aspects of the DNA complex and a computer hard drive are categorized using this classification. This is intended to demonstrate a functional equivalence between the components of the two systems, and thus the systems themselves. Results Both the DNA complex and the computer hard drive contain components that fulfill the essential properties of a centralized information storage and processing system. The functional equivalence of these components provides insight into both the design process of engineered systems and the evolved solutions addressing similar system requirements. However, there are points where the comparison breaks down, particularly when there are externally imposed information-organizing structures on the computer hard drive. A specific example of this is the imposition of the File Allocation Table (FAT) during high level formatting of the computer hard drive and the subsequent loading of an operating system (OS). Biological systems do not have an external source for a map of their stored information or for an operational instruction set; rather, they must contain an organizational template conserved within their intra-nuclear architecture that "manipulates" the laws of chemistry and physics into a highly robust instruction set. We propose that the epigenetic structure of the intra-nuclear environment and the non-coding RNA may play the roles of a Biological File Allocation Table (BFAT) and biological operating system (Bio-OS) in eukaryotic cells. Conclusions The comparison of functional and structural characteristics of the DNA complex and the computer hard drive leads to a new descriptive paradigm that identifies the DNA as a dynamic storage system of biological information. This system is embodied in an autonomous operating system that inductively follows organizational structures, data hierarchy and executable operations that are well understood in the computer science industry. Characterizing the "DNA hard drive" in this fashion can lead to insights arising from discrepancies in the descriptive framework, particularly with respect to positing the role of epigenetic processes in an information-processing context. Further expansions arising from this comparison include the view of cells as parallel computing machines and a new approach towards characterizing cellular control systems. PMID:20092652
D'Onofrio, David J; An, Gary
2010-01-21
The robust storage, updating and utilization of information are necessary for the maintenance and perpetuation of dynamic systems. These systems can exist as constructs of metal-oxide semiconductors and silicon, as in a digital computer, or in the "wetware" of organic compounds, proteins and nucleic acids that make up biological organisms. We propose that there are essential functional properties of centralized information-processing systems; for digital computers these properties reside in the computer's hard drive, and for eukaryotic cells they are manifest in the DNA and associated structures. Presented herein is a descriptive framework that compares DNA and its associated proteins and sub-nuclear structure with the structure and function of the computer hard drive. We identify four essential properties of information for a centralized storage and processing system: (1) orthogonal uniqueness, (2) low level formatting, (3) high level formatting and (4) translation of stored to usable form. The corresponding aspects of the DNA complex and a computer hard drive are categorized using this classification. This is intended to demonstrate a functional equivalence between the components of the two systems, and thus the systems themselves. Both the DNA complex and the computer hard drive contain components that fulfill the essential properties of a centralized information storage and processing system. The functional equivalence of these components provides insight into both the design process of engineered systems and the evolved solutions addressing similar system requirements. However, there are points where the comparison breaks down, particularly when there are externally imposed information-organizing structures on the computer hard drive. A specific example of this is the imposition of the File Allocation Table (FAT) during high level formatting of the computer hard drive and the subsequent loading of an operating system (OS). Biological systems do not have an external source for a map of their stored information or for an operational instruction set; rather, they must contain an organizational template conserved within their intra-nuclear architecture that "manipulates" the laws of chemistry and physics into a highly robust instruction set. We propose that the epigenetic structure of the intra-nuclear environment and the non-coding RNA may play the roles of a Biological File Allocation Table (BFAT) and biological operating system (Bio-OS) in eukaryotic cells. The comparison of functional and structural characteristics of the DNA complex and the computer hard drive leads to a new descriptive paradigm that identifies the DNA as a dynamic storage system of biological information. This system is embodied in an autonomous operating system that inductively follows organizational structures, data hierarchy and executable operations that are well understood in the computer science industry. Characterizing the "DNA hard drive" in this fashion can lead to insights arising from discrepancies in the descriptive framework, particularly with respect to positing the role of epigenetic processes in an information-processing context. Further expansions arising from this comparison include the view of cells as parallel computing machines and a new approach towards characterizing cellular control systems.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stewart, R.
Radiation therapy for the treatment of cancer has been established as a highly precise and effective way to eradicate a localized region of diseased tissue. To achieve further significant gains in the therapeutic ratio, we need to move towards biologically optimized treatment planning. To achieve this goal, we need to understand how the radiation-type dependent patterns of induced energy depositions within the cell (physics) connect via molecular, cellular and tissue reactions to treatment outcome such as tumor control and undesirable effects on normal tissue. Several computational biology approaches have been developed connecting physics to biology. Monte Carlo simulations are themore » most accurate method to calculate physical dose distributions at the nanometer scale, however simulations at the DNA scale are slow and repair processes are generally not simulated. Alternative models that rely on the random formation of individual DNA lesions within one or two turns of the DNA have been shown to reproduce the clusters of DNA lesions, including single strand breaks (SSBs), double strand breaks (DSBs) without the need for detailed track structure simulations. Efficient computational simulations of initial DNA damage induction facilitate computational modeling of DNA repair and other molecular and cellular processes. Mechanistic, multiscale models provide a useful conceptual framework to test biological hypotheses and help connect fundamental information about track structure and dosimetry at the sub-cellular level to dose-response effects on larger scales. In this symposium we will learn about the current state of the art of computational approaches estimating radiation damage at the cellular and sub-cellular scale. How can understanding the physics interactions at the DNA level be used to predict biological outcome? We will discuss if and how such calculations are relevant to advance our understanding of radiation damage and its repair, or, if the underlying biological processes are too complex for a mechanistic approach. Can computer simulations be used to guide future biological research? We will debate the feasibility of explaining biology from a physicists’ perspective. Learning Objectives: Understand the potential applications and limitations of computational methods for dose-response modeling at the molecular, cellular and tissue levels Learn about mechanism of action underlying the induction, repair and biological processing of damage to DNA and other constituents Understand how effects and processes at one biological scale impact on biological processes and outcomes on other scales J. Schuemann, NCI/NIH grantsS. McMahon, Funding: European Commission FP7 (grant EC FP7 MC-IOF-623630)« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
McMahon, S.
Radiation therapy for the treatment of cancer has been established as a highly precise and effective way to eradicate a localized region of diseased tissue. To achieve further significant gains in the therapeutic ratio, we need to move towards biologically optimized treatment planning. To achieve this goal, we need to understand how the radiation-type dependent patterns of induced energy depositions within the cell (physics) connect via molecular, cellular and tissue reactions to treatment outcome such as tumor control and undesirable effects on normal tissue. Several computational biology approaches have been developed connecting physics to biology. Monte Carlo simulations are themore » most accurate method to calculate physical dose distributions at the nanometer scale, however simulations at the DNA scale are slow and repair processes are generally not simulated. Alternative models that rely on the random formation of individual DNA lesions within one or two turns of the DNA have been shown to reproduce the clusters of DNA lesions, including single strand breaks (SSBs), double strand breaks (DSBs) without the need for detailed track structure simulations. Efficient computational simulations of initial DNA damage induction facilitate computational modeling of DNA repair and other molecular and cellular processes. Mechanistic, multiscale models provide a useful conceptual framework to test biological hypotheses and help connect fundamental information about track structure and dosimetry at the sub-cellular level to dose-response effects on larger scales. In this symposium we will learn about the current state of the art of computational approaches estimating radiation damage at the cellular and sub-cellular scale. How can understanding the physics interactions at the DNA level be used to predict biological outcome? We will discuss if and how such calculations are relevant to advance our understanding of radiation damage and its repair, or, if the underlying biological processes are too complex for a mechanistic approach. Can computer simulations be used to guide future biological research? We will debate the feasibility of explaining biology from a physicists’ perspective. Learning Objectives: Understand the potential applications and limitations of computational methods for dose-response modeling at the molecular, cellular and tissue levels Learn about mechanism of action underlying the induction, repair and biological processing of damage to DNA and other constituents Understand how effects and processes at one biological scale impact on biological processes and outcomes on other scales J. Schuemann, NCI/NIH grantsS. McMahon, Funding: European Commission FP7 (grant EC FP7 MC-IOF-623630)« less
Saito, Samuel; Silva, Givaldo; Santos, Regineide Xavier; Gosmann, Grace; Pungartnik, Cristina; Brendel, Martin
2012-01-01
Reverse phase-solid phase extraction from Cassia alata leaves (CaRP) was used to obtain a refined extract. Higher than wild-type sensitivity to CaRP was exhibited by 16 haploid Saccharomyces cerevisiae mutants with defects in DNA repair and membrane transport. CaRP had a strong DPPH free radical scavenging activity with an IC50 value of 2.27 μg mL−1 and showed no pro-oxidant activity in yeast. CaRP compounds were separated by HPLC and the three major components were shown to bind to DNA in vitro. The major HPLC peak was identified as kampferol-3-O-β-d-glucoside (astragalin), which showed high affinity to DNA as seen by HPLC-UV measurement after using centrifugal ultrafiltration of astragalin-DNA mixtures. Astragalin-DNA interaction was further studied by spectroscopic methods and its interaction with DNA was evaluated using solid-state FTIR. These and computational (in silico) docking studies revealed that astragalin-DNA binding occurs through interaction with G-C base pairs, possibly by intercalation stabilized by H-bond formation. PMID:22489129
Saito, Samuel; Silva, Givaldo; Santos, Regineide Xavier; Gosmann, Grace; Pungartnik, Cristina; Brendel, Martin
2012-01-01
Reverse phase-solid phase extraction from Cassia alata leaves (CaRP) was used to obtain a refined extract. Higher than wild-type sensitivity to CaRP was exhibited by 16 haploid Saccharomyces cerevisiae mutants with defects in DNA repair and membrane transport. CaRP had a strong DPPH free radical scavenging activity with an IC(50) value of 2.27 μg mL(-1) and showed no pro-oxidant activity in yeast. CaRP compounds were separated by HPLC and the three major components were shown to bind to DNA in vitro. The major HPLC peak was identified as kampferol-3-O-β-d-glucoside (astragalin), which showed high affinity to DNA as seen by HPLC-UV measurement after using centrifugal ultrafiltration of astragalin-DNA mixtures. Astragalin-DNA interaction was further studied by spectroscopic methods and its interaction with DNA was evaluated using solid-state FTIR. These and computational (in silico) docking studies revealed that astragalin-DNA binding occurs through interaction with G-C base pairs, possibly by intercalation stabilized by H-bond formation.
Carvalho, Alexandra T P; Gouveia, Leonor; Kanna, Charan Raju; Wärmländer, Sebastian K T S; Platts, Jamie A; Kamerlin, Shina Caroline Lynn
2014-01-01
We report a series of molecular dynamics (MD) simulations of up to a microsecond combined simulation time designed to probe epigenetically modified DNA sequences. More specifically, by monitoring the effects of methylation and hydroxymethylation of cytosine in different DNA sequences, we show, for the first time, that DNA epigenetic modifications change the molecule's dynamical landscape, increasing the propensity of DNA toward different values of twist and/or roll/tilt angles (in relation to the unmodified DNA) at the modification sites. Moreover, both the extent and position of different modifications have significant effects on the amount of structural variation observed. We propose that these conformational differences, which are dependent on the sequence environment, can provide specificity for protein binding. PMID:25625845
4P: fast computing of population genetics statistics from large DNA polymorphism panels
Benazzo, Andrea; Panziera, Alex; Bertorelle, Giorgio
2015-01-01
Massive DNA sequencing has significantly increased the amount of data available for population genetics and molecular ecology studies. However, the parallel computation of simple statistics within and between populations from large panels of polymorphic sites is not yet available, making the exploratory analyses of a set or subset of data a very laborious task. Here, we present 4P (parallel processing of polymorphism panels), a stand-alone software program for the rapid computation of genetic variation statistics (including the joint frequency spectrum) from millions of DNA variants in multiple individuals and multiple populations. It handles a standard input file format commonly used to store DNA variation from empirical or simulation experiments. The computational performance of 4P was evaluated using large SNP (single nucleotide polymorphism) datasets from human genomes or obtained by simulations. 4P was faster or much faster than other comparable programs, and the impact of parallel computing using multicore computers or servers was evident. 4P is a useful tool for biologists who need a simple and rapid computer program to run exploratory population genetics analyses in large panels of genomic data. It is also particularly suitable to analyze multiple data sets produced in simulation studies. Unix, Windows, and MacOs versions are provided, as well as the source code for easier pipeline implementations. PMID:25628874
Discrete Ramanujan transform for distinguishing the protein coding regions from other regions.
Hua, Wei; Wang, Jiasong; Zhao, Jian
2014-01-01
Based on the study of Ramanujan sum and Ramanujan coefficient, this paper suggests the concepts of discrete Ramanujan transform and spectrum. Using Voss numerical representation, one maps a symbolic DNA strand as a numerical DNA sequence, and deduces the discrete Ramanujan spectrum of the numerical DNA sequence. It is well known that of discrete Fourier power spectrum of protein coding sequence has an important feature of 3-base periodicity, which is widely used for DNA sequence analysis by the technique of discrete Fourier transform. It is performed by testing the signal-to-noise ratio at frequency N/3 as a criterion for the analysis, where N is the length of the sequence. The results presented in this paper show that the property of 3-base periodicity can be only identified as a prominent spike of the discrete Ramanujan spectrum at period 3 for the protein coding regions. The signal-to-noise ratio for discrete Ramanujan spectrum is defined for numerical measurement. Therefore, the discrete Ramanujan spectrum and the signal-to-noise ratio of a DNA sequence can be used for distinguishing the protein coding regions from the noncoding regions. All the exon and intron sequences in whole chromosomes 1, 2, 3 and 4 of Caenorhabditis elegans have been tested and the histograms and tables from the computational results illustrate the reliability of our method. In addition, we have analyzed theoretically and gotten the conclusion that the algorithm for calculating discrete Ramanujan spectrum owns the lower computational complexity and higher computational accuracy. The computational experiments show that the technique by using discrete Ramanujan spectrum for classifying different DNA sequences is a fast and effective method. Copyright © 2014 Elsevier Ltd. All rights reserved.
Lyons, Eli; Sheridan, Paul; Tremmel, Georg; Miyano, Satoru; Sugano, Sumio
2017-10-24
High-throughput screens allow for the identification of specific biomolecules with characteristics of interest. In barcoded screens, DNA barcodes are linked to target biomolecules in a manner allowing for the target molecules making up a library to be identified by sequencing the DNA barcodes using Next Generation Sequencing. To be useful in experimental settings, the DNA barcodes in a library must satisfy certain constraints related to GC content, homopolymer length, Hamming distance, and blacklisted subsequences. Here we report a novel framework to quickly generate large-scale libraries of DNA barcodes for use in high-throughput screens. We show that our framework dramatically reduces the computation time required to generate large-scale DNA barcode libraries, compared with a naїve approach to DNA barcode library generation. As a proof of concept, we demonstrate that our framework is able to generate a library consisting of one million DNA barcodes for use in a fragment antibody phage display screening experiment. We also report generating a general purpose one billion DNA barcode library, the largest such library yet reported in literature. Our results demonstrate the value of our novel large-scale DNA barcode library generation framework for use in high-throughput screening applications.
Enzyme-guided DNA Sewing Architecture
Song, In Hyun; Shin, Seung Won; Park, Kyung Soo; Lansac, Yves; Jang, Yun Hee; Um, Soong Ho
2015-01-01
With the advent of nanotechnology, a variety of nanoarchitectures with varied physicochemical properties have been designed. Owing to the unique characteristics, DNAs have been used as a functional building block for novel nanoarchitecture. In particular, a self-assembly of long DNA molecules via a piece DNA staple has been utilized to attain such constructs. However, it needs many talented prerequisites (e.g., complicated computer program) with fewer yields of products. In addition, it has many limitations to overcome: for instance, (i) thermal instability under moderate environments and (ii) restraint in size caused by the restricted length of scaffold strands. Alternatively, the enzymatic sewing linkage of short DNA blocks is simply designed into long DNA assemblies but it is more error-prone due to the undeveloped sequence data. Here, we present, for the first time, a comprehensive study for directly combining DNA structures into higher DNA sewing constructs through the 5′-end cohesive ligation of T4 enzyme. Inspired by these achievements, the synthesized DNA nanomaterials were also utilized for effective detection and real-time diagnosis of cancer-specific and cytosolic RNA markers. This generalized protocol for generic DNA sewing is expected to be useful in several DNA nanotechnology as well as any nucleic acid-related fields. PMID:26634810
Enzyme-guided DNA Sewing Architecture
NASA Astrophysics Data System (ADS)
Song, In Hyun; Shin, Seung Won; Park, Kyung Soo; Lansac, Yves; Jang, Yun Hee; Um, Soong Ho
2015-12-01
With the advent of nanotechnology, a variety of nanoarchitectures with varied physicochemical properties have been designed. Owing to the unique characteristics, DNAs have been used as a functional building block for novel nanoarchitecture. In particular, a self-assembly of long DNA molecules via a piece DNA staple has been utilized to attain such constructs. However, it needs many talented prerequisites (e.g., complicated computer program) with fewer yields of products. In addition, it has many limitations to overcome: for instance, (i) thermal instability under moderate environments and (ii) restraint in size caused by the restricted length of scaffold strands. Alternatively, the enzymatic sewing linkage of short DNA blocks is simply designed into long DNA assemblies but it is more error-prone due to the undeveloped sequence data. Here, we present, for the first time, a comprehensive study for directly combining DNA structures into higher DNA sewing constructs through the 5‧-end cohesive ligation of T4 enzyme. Inspired by these achievements, the synthesized DNA nanomaterials were also utilized for effective detection and real-time diagnosis of cancer-specific and cytosolic RNA markers. This generalized protocol for generic DNA sewing is expected to be useful in several DNA nanotechnology as well as any nucleic acid-related fields.
NASA Technical Reports Server (NTRS)
Ho, P. S.; Ellison, M. J.; Quigley, G. J.; Rich, A.
1986-01-01
The ease with which a particular DNA segment adopts the left-handed Z-conformation depends largely on the sequence and on the degree of negative supercoiling to which it is subjected. We describe a computer program (Z-hunt) that is designed to search long sequences of naturally occurring DNA and retrieve those nucleotide combinations of up to 24 bp in length which show a strong propensity for Z-DNA formation. Incorporated into Z-hunt is a statistical mechanical model based on empirically determined energetic parameters for the B to Z transition accumulated to date. The Z-forming potential of a sequence is assessed by ranking its behavior as a function of negative superhelicity relative to the behavior of similar sized randomly generated nucleotide sequences assembled from over 80,000 combinations. The program makes it possible to compare directly the Z-forming potential of sequences with different base compositions and different sequence lengths. Using Z-hunt, we have analyzed the DNA sequences of the bacteriophage phi X174, plasmid pBR322, the animal virus SV40 and the replicative form of the eukaryotic adenovirus-2. The results are compared with those previously obtained by others from experiments designed to locate Z-DNA forming regions in these sequences using probes which show specificity for the left-handed DNA conformation.
Ryan, K; Williams, D Gareth; Balding, David J
2016-11-01
Many DNA profiles recovered from crime scene samples are of a quality that does not allow them to be searched against, nor entered into, databases. We propose a method for the comparison of profiles arising from two DNA samples, one or both of which can have multiple donors and be affected by low DNA template or degraded DNA. We compute likelihood ratios to evaluate the hypothesis that the two samples have a common DNA donor, and hypotheses specifying the relatedness of two donors. Our method uses a probability distribution for the genotype of the donor of interest in each sample. This distribution can be obtained from a statistical model, or we can exploit the ability of trained human experts to assess genotype probabilities, thus extracting much information that would be discarded by standard interpretation rules. Our method is compatible with established methods in simple settings, but is more widely applicable and can make better use of information than many current methods for the analysis of mixed-source, low-template DNA profiles. It can accommodate uncertainty arising from relatedness instead of or in addition to uncertainty arising from noisy genotyping. We describe a computer program GPMDNA, available under an open source licence, to calculate LRs using the method presented in this paper. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Modeling photoionization of aqueous DNA and its components.
Pluhařová, Eva; Slavíček, Petr; Jungwirth, Pavel
2015-05-19
Radiation damage to DNA is usually considered in terms of UVA and UVB radiation. These ultraviolet rays, which are part of the solar spectrum, can indeed cause chemical lesions in DNA, triggered by photoexcitation particularly in the UVB range. Damage can, however, be also caused by higher energy radiation, which can ionize directly the DNA or its immediate surroundings, leading to indirect damage. Thanks to absorption in the atmosphere, the intensity of such ionizing radiation is negligible in the solar spectrum at the surface of Earth. Nevertheless, such an ionizing scenario can become dangerously plausible for astronauts or flight personnel, as well as for persons present at nuclear power plant accidents. On the beneficial side, ionizing radiation is employed as means for destroying the DNA of cancer cells during radiation therapy. Quantitative information about ionization of DNA and its components is important not only for DNA radiation damage, but also for understanding redox properties of DNA in redox sensing or labeling, as well as charge migration along the double helix in nanoelectronics applications. Until recently, the vast majority of experimental and computational data on DNA ionization was pertinent to its components in the gas phase, which is far from its native aqueous environment. The situation has, however, changed for the better due to the advent of photoelectron spectroscopy in liquid microjets and its most recent application to photoionization of aqueous nucleosides, nucleotides, and larger DNA fragments. Here, we present a consistent and efficient computational methodology, which allows to accurately evaluate ionization energies and model photoelectron spectra of aqueous DNA and its individual components. After careful benchmarking, the method based on density functional theory and its time-dependent variant with properly chosen hybrid functionals and polarizable continuum solvent model provides ionization energies with accuracy of 0.2-0.3 eV, allowing for faithful modeling and interpretation of DNA photoionization. The key finding is that the aqueous medium is remarkably efficient in screening the interactions within DNA such that, unlike in the gas phase, ionization of a base, nucleoside, or nucleotide depends only very weakly on the particular DNA context. An exception is the electronic interaction between neighboring bases which can lead to sequence-specific effects, such as a partial delocalization of the cationic hole upon ionization enabled by presence of adjacent bases of the same type.
Nisha, J; Shanthi, V
2018-06-01
Mycobacterium leprae, the causal agent of leprosy is non-cultivable in vitro. Thus, the assessment of antibiotic activity against Mycobacterium leprae depends primarily upon the time-consuming mouse footpad system. The GyrA protein of Mycobacterium leprae is the target of the antimycobacterial drug, Ofloxacin. In recent times, the GyrA mutation (A91V) has been found to be resistant to Ofloxacin. This phenomenon has necessitated the development of new, long-acting antimycobacterial compounds. The underlying mechanism of drug resistance is not completely known. Currently, experimentally crystallized GyrA-DNA-OFLX models are not available for highlighting the binding and mechanism of Ofloxacin resistance. Hence, we employed computational approaches to characterize the Ofloxacin interaction with both the native and mutant forms of GyrA complexed with DNA. Binding energy measurements obtained from molecular docking studies highlights hydrogen bond-mediated efficient binding of Ofloxacin to Asp47 in the native GyrA-DNA complex in comparison with that of the mutant GyrA-DNA complex. Further, molecular dynamics studies highlighted the stable binding of Ofloxacin with native GyrA-DNA complex than with the mutant GyrA-DNA complex. This mechanism provided a plausible reason for the reported, reduced effect of Ofloxacin to control leprosy in individuals with the A91V mutation. Our report is the first of its kind wherein the basis for the Ofloxacin drug resistance mechanism has been explored with the help of ternary Mycobacterium leprae complex, GyrA-DNA-OFLX. These structural insights will provide useful information for designing new drugs to target the Ofloxacin-resistant DNA gyrase.
KiranKumar, Hulihalli N; RohitKumar, Heggodu G; Advirao, Gopal M
2018-01-01
Two new derivatives of pyrimido[4',5';4,5]thieno(2,3-b)quinoline (PTQ), 9-hydroxy-4-(3-diethylaminopropylamino)pyrimido[4',5';4,5]thieno(2,3-b)quinoline (Hydroxy-DPTQ) and 8-methoxy-4-(3-diethylaminopropylamino)pyrimido[4',5';4,5]thieno(2,3-b)quinoline (Methoxy-DPTQ) were synthesized and their DNA binding ability was analyzed using spectroscopy (UV-visible, fluorescence and circular dichroism), ethidium bromide dye displacement assay, melting temperature (T m ) analysis and computational docking studies. The hypochromism in UV-visible spectrum and increased fluorescence emission of Hydroxy-DPTQ and Methoxy-DPTQ in the presence of DNA suggested the molecule-DNA interaction. The association constants calculated from UV-visible and spectral titrations were of the order 10 4 to 10 6 M -1 . Circular dichroism studies corroborated the induced conformational changes in DNA upon addition of molecules. The change in the ellipticity was observed both in negative and positive peak of DNA, thus, suggesting the intercalation of molecules. The observed displacement of ethidium bromide from the DNA and increased T m , upon addition of DNA confirmed the intercalative mode of binding. This was further validated by computational docking, which showed clear intercalation of molecules into the d(GpC)-d(CpG) site of the receptor DNA. Anticancer activities of these molecules are evaluated by using MTT assay. Both molecules showed antiproliferative activity against all the three cancer cells studied, with Hydroxy-DPTQ being more potential molecule among the two. IC 50 value of Hydroxy-DPTQ and Methoxy-DPTQ were in the range of 3-5μM and 130-250μM, respectively. Copyright © 2017 Elsevier B.V. All rights reserved.
Iyer, Lakshminarayan M; Zhang, Dapeng; Burroughs, A Maxwell; Aravind, L
2013-09-01
Discovery of the TET/JBP family of dioxygenases that modify bases in DNA has sparked considerable interest in novel DNA base modifications and their biological roles. Using sensitive sequence and structure analyses combined with contextual information from comparative genomics, we computationally characterize over 12 novel biochemical systems for DNA modifications. We predict previously unidentified enzymes, such as the kinetoplastid J-base generating glycosyltransferase (and its homolog GREB1), the catalytic specificity of bacteriophage TET/JBP proteins and their role in complex DNA base modifications. We also predict the enzymes involved in synthesis of hypermodified bases such as alpha-glutamylthymine and alpha-putrescinylthymine that have remained enigmatic for several decades. Moreover, the current analysis suggests that bacteriophages and certain nucleo-cytoplasmic large DNA viruses contain an unexpectedly diverse range of DNA modification systems, in addition to those using previously characterized enzymes such as Dam, Dcm, TET/JBP, pyrimidine hydroxymethylases, Mom and glycosyltransferases. These include enzymes generating modified bases such as deazaguanines related to queuine and archaeosine, pyrimidines comparable with lysidine, those derived using modified S-adenosyl methionine derivatives and those using TET/JBP-generated hydroxymethyl pyrimidines as biosynthetic starting points. We present evidence that some of these modification systems are also widely dispersed across prokaryotes and certain eukaryotes such as basidiomycetes, chlorophyte and stramenopile alga, where they could serve as novel epigenetic marks for regulation or discrimination of self from non-self DNA. Our study extends the role of the PUA-like fold domains in recognition of modified nucleic acids and predicts versions of the ASCH and EVE domains to be novel 'readers' of modified bases in DNA. These results open opportunities for the investigation of the biology of these systems and their use in biotechnology.
Iyer, Lakshminarayan M.; Zhang, Dapeng; Maxwell Burroughs, A.; Aravind, L.
2013-01-01
Discovery of the TET/JBP family of dioxygenases that modify bases in DNA has sparked considerable interest in novel DNA base modifications and their biological roles. Using sensitive sequence and structure analyses combined with contextual information from comparative genomics, we computationally characterize over 12 novel biochemical systems for DNA modifications. We predict previously unidentified enzymes, such as the kinetoplastid J-base generating glycosyltransferase (and its homolog GREB1), the catalytic specificity of bacteriophage TET/JBP proteins and their role in complex DNA base modifications. We also predict the enzymes involved in synthesis of hypermodified bases such as alpha-glutamylthymine and alpha-putrescinylthymine that have remained enigmatic for several decades. Moreover, the current analysis suggests that bacteriophages and certain nucleo-cytoplasmic large DNA viruses contain an unexpectedly diverse range of DNA modification systems, in addition to those using previously characterized enzymes such as Dam, Dcm, TET/JBP, pyrimidine hydroxymethylases, Mom and glycosyltransferases. These include enzymes generating modified bases such as deazaguanines related to queuine and archaeosine, pyrimidines comparable with lysidine, those derived using modified S-adenosyl methionine derivatives and those using TET/JBP-generated hydroxymethyl pyrimidines as biosynthetic starting points. We present evidence that some of these modification systems are also widely dispersed across prokaryotes and certain eukaryotes such as basidiomycetes, chlorophyte and stramenopile alga, where they could serve as novel epigenetic marks for regulation or discrimination of self from non-self DNA. Our study extends the role of the PUA-like fold domains in recognition of modified nucleic acids and predicts versions of the ASCH and EVE domains to be novel ‘readers’ of modified bases in DNA. These results open opportunities for the investigation of the biology of these systems and their use in biotechnology. PMID:23814188
Yang, Bin; Zhang, Xiao-Bing; Kang, Li-Ping; Huang, Zhi-Mei; Shen, Guo-Li; Yu, Ru-Qin; Tan, Weihong
2014-08-07
DNA strand displacement cascades have been engineered to construct various fascinating DNA circuits. However, biological applications are limited by the insufficient cellular internalization of naked DNA structures, as well as the separated multicomponent feature. In this work, these problems are addressed by the development of a novel DNA nanodevice, termed intelligent layered nanoflare, which integrates DNA computing at the nanoscale, via the self-assembly of DNA flares on a single gold nanoparticle. As a "lab-on-a-nanoparticle", the intelligent layered nanoflare could be engineered to perform a variety of Boolean logic gate operations, including three basic logic gates, one three-input AND gate, and two complex logic operations, in a digital non-leaky way. In addition, the layered nanoflare can serve as a programmable strategy to sequentially tune the size of nanoparticles, as well as a new fingerprint spectrum technique for intelligent multiplex biosensing. More importantly, the nanoflare developed here can also act as a single entity for intracellular DNA logic gate delivery, without the need of commercial transfection agents or other auxiliary carriers. By incorporating DNA circuits on nanoparticles, the presented layered nanoflare will broaden the applications of DNA circuits in biological systems, and facilitate the development of DNA nanotechnology.
NASA Astrophysics Data System (ADS)
Khajeh, Masoumeh Ashrafi; Dehghan, Gholamreza; Dastmalchi, Siavoush; Shaghaghi, Masoomeh; Iranshahi, Mehrdad
2018-03-01
DNA is a major target for a number of anticancer substances. Interaction studies between small molecules and DNA are essential for rational drug designing to influence main biological processes and also introducing new probes for the assay of DNA. Tschimgine (TMG) is a monoterpene derivative with anticancer properties. In the present study we tried to elucidate the interaction of TMG with calf thymus DNA (CT-DNA) using different spectroscopic methods. UV-visible absorption spectrophotometry, fluorescence and circular dichroism (CD) spectroscopies as well as molecular docking study revealed formation of complex between TMG and CT-DNA. Binding constant (Kb) between TMG and DNA was 2.27 × 104 M- 1, that is comparable to groove binding agents. The fluorescence spectroscopic data revealed that the quenching mechanism of fluorescence of TMG by CT-DNA is static quenching. Thermodynamic parameters (ΔH < 0 and ΔS < 0) at different temperatures indicated that van der Waals forces and hydrogen bonds were involved in the binding process of TMG with CT-DNA. Competitive binding assay with methylene blue (MB) and Hoechst 33258 using fluorescence spectroscopy displayed that TMG possibly binds to the minor groove of CT-DNA. These observations were further confirmed by CD spectral analysis, viscosity measurements and molecular docking.
Signatures of DNA Methylation across Insects Suggest Reduced DNA Methylation Levels in Holometabola
Provataris, Panagiotis; Meusemann, Karen; Niehuis, Oliver; Grath, Sonja; Misof, Bernhard
2018-01-01
Abstract It has been experimentally shown that DNA methylation is involved in the regulation of gene expression and the silencing of transposable element activity in eukaryotes. The variable levels of DNA methylation among different insect species indicate an evolutionarily flexible role of DNA methylation in insects, which due to a lack of comparative data is not yet well-substantiated. Here, we use computational methods to trace signatures of DNA methylation across insects by analyzing transcriptomic and genomic sequence data from all currently recognized insect orders. We conclude that: 1) a functional methylation system relying exclusively on DNA methyltransferase 1 is widespread across insects. 2) DNA methylation has potentially been lost or extremely reduced in species belonging to springtails (Collembola), flies and relatives (Diptera), and twisted-winged parasites (Strepsiptera). 3) Holometabolous insects display signs of reduced DNA methylation levels in protein-coding sequences compared with hemimetabolous insects. 4) Evolutionarily conserved insect genes associated with housekeeping functions tend to display signs of heavier DNA methylation in comparison to the genomic/transcriptomic background. With this comparative study, we provide the much needed basis for experimental and detailed comparative analyses required to gain a deeper understanding on the evolution and function of DNA methylation in insects. PMID:29697817
NASA Astrophysics Data System (ADS)
Shafirovich, Vladimir; Singh, Carolyn; Geacintov, Nicholas E.
2003-11-01
Oxidative damage of DNA molecules associated with electron-transfer reactions is an important phenomenon in living cells, which can lead to mutations and contribute to carcinogenesis and the aging processes. This article describes the design of several simple experiments to explore DNA damage initiated by photoinduced electron-transfer reactions sensitized by the acridine derivative, proflavine (PF). A supercoiled DNA agarose gel nicking assay is employed as a sensitive probe of DNA strand cleavage. A low-cost experimental and computer-interfaced imaging apparatus is described allowing for the digital recording and analysis of the gel electrophoresis results. The first experiment describes the formation of direct strand breaks in double-stranded DNA induced by photoexcitation of the intercalated PF molecules. The second experiment demonstrates that the addition of the well-known electron acceptor, methylviologen, gives rise to a significant enhancement of the photochemical DNA strand cleavage effect. This occurs by an electron transfer step to methylviologen that renders the inital photoinduced charge separation between photoexcited PF and DNA irreversible. The third experiment demonstrates that the action spectrum of the DNA photocleavage matches the absorption spectrum of DNA-bound, intercalated PF molecules, which differs from that of free PF molecules. This result demonstrates that the photoinduced DNA strand cleavage is initiated by intercalated rather than free PF molecules.
Qin, Chao; Kang, Fuxing; Zhang, Wei; Shou, Weijun; Hu, Xiaojie; Gao, Yanzheng
2017-10-15
Environmental persistence of free DNA is influenced by its complexation with other chemical species and its aggregation mechanisms. However, it is not well-known how naturally-abundant metal ions, e.g., Al(III) and Fe(III), influence DNA aggregation. This study investigated aggregation behaviors of model DNA from salmon testes as influenced by metal cations, and elucidated the predominant mechanism responsible for DNA aggregation. Compared to monovalent (K + and Na + ) and divalent (Ca 2+ and Mg 2+ ) cations, Al(III) and Fe(III) species in aqueous solution caused rapid DNA aggregations. The maximal DNA aggregation occurred at 0.05 mmol/L Al(III) or 0.075 mmol/L Fe(III), respectively. A combination of atomic force microscopy, transmission electron microscopy, Fourier transform infrared spectroscopy, and X-ray photoelectron spectroscopy revealed that Al(III) and Fe(III) complexed with negatively charged phosphate groups to neutralize DNA charges, resulting in decreased electrostatic repulsion and subsequent DNA aggregation. Zeta potential measurements and molecular computation further support this mechanism. Furthermore, DNA aggregation was enhanced at higher temperature and near neutral pH. Therefore, DNA aggregation is collectively determined by many environmental factors such as ion species, temperature, and solution pH. Copyright © 2017 Elsevier Ltd. All rights reserved.
Transitional circuitry for studying the properties of DNA
NASA Astrophysics Data System (ADS)
Trubochkina, N.
2018-01-01
The article is devoted to a new view of the structure of DNA as an intellectual scheme possessing the properties of logic and memory. The theory of transient circuitry, developed by the author for optimal computer circuits, revealed an amazing structural similarity between mathematical models of transition silicon elements and logic and memory circuits of solid state transient circuitry and atomic models of parts of DNA.
"First generation" automated DNA sequencing technology.
Slatko, Barton E; Kieleczawa, Jan; Ju, Jingyue; Gardner, Andrew F; Hendrickson, Cynthia L; Ausubel, Frederick M
2011-10-01
Beginning in the 1980s, automation of DNA sequencing has greatly increased throughput, reduced costs, and enabled large projects to be completed more easily. The development of automation technology paralleled the development of other aspects of DNA sequencing: better enzymes and chemistry, separation and imaging technology, sequencing protocols, robotics, and computational advancements (including base-calling algorithms with quality scores, database developments, and sequence analysis programs). Despite the emergence of high-throughput sequencing platforms, automated Sanger sequencing technology remains useful for many applications. This unit provides background and a description of the "First-Generation" automated DNA sequencing technology. It also includes protocols for using the current Applied Biosystems (ABI) automated DNA sequencing machines. © 2011 by John Wiley & Sons, Inc.
DNA algorithms of implementing biomolecular databases on a biological computer.
Chang, Weng-Long; Vasilakos, Athanasios V
2015-01-01
In this paper, DNA algorithms are proposed to perform eight operations of relational algebra (calculus), which include Cartesian product, union, set difference, selection, projection, intersection, join, and division, on biomolecular relational databases.
Engineering bacteria to solve the Burnt Pancake Problem
Haynes, Karmella A; Broderick, Marian L; Brown, Adam D; Butner, Trevor L; Dickson, James O; Harden, W Lance; Heard, Lane H; Jessen, Eric L; Malloy, Kelly J; Ogden, Brad J; Rosemond, Sabriya; Simpson, Samantha; Zwack, Erin; Campbell, A Malcolm; Eckdahl, Todd T; Heyer, Laurie J; Poet, Jeffrey L
2008-01-01
Background We investigated the possibility of executing DNA-based computation in living cells by engineering Escherichia coli to address a classic mathematical puzzle called the Burnt Pancake Problem (BPP). The BPP is solved by sorting a stack of distinct objects (pancakes) into proper order and orientation using the minimum number of manipulations. Each manipulation reverses the order and orientation of one or more adjacent objects in the stack. We have designed a system that uses site-specific DNA recombination to mediate inversions of genetic elements that represent pancakes within plasmid DNA. Results Inversions (or "flips") of the DNA fragment pancakes are driven by the Salmonella typhimurium Hin/hix DNA recombinase system that we reconstituted as a collection of modular genetic elements for use in E. coli. Our system sorts DNA segments by inversions to produce different permutations of a promoter and a tetracycline resistance coding region; E. coli cells become antibiotic resistant when the segments are properly sorted. Hin recombinase can mediate all possible inversion operations on adjacent flippable DNA fragments. Mathematical modeling predicts that the system reaches equilibrium after very few flips, where equal numbers of permutations are randomly sorted and unsorted. Semiquantitative PCR analysis of in vivo flipping suggests that inversion products accumulate on a time scale of hours or days rather than minutes. Conclusion The Hin/hix system is a proof-of-concept demonstration of in vivo computation with the potential to be scaled up to accommodate larger and more challenging problems. Hin/hix may provide a flexible new tool for manipulating transgenic DNA in vivo. PMID:18492232
Jana, Kalyanashis; Ganguly, Bishwajit
2014-10-16
DNA nucleobases are reactive in nature and undergo modifications by deamination, oxidation, alkylation, or hydrolysis processes. Many such modified bases are susceptible to mutagenesis when formed in cellular DNA. The mutagenesis can occur by mispairing with DNA nucleobases by a DNA polymerase during replication. We have performed a study of mispairing of DNA bases with unnatural bases computationally. 5-Halo uracils have been studied as mispairs in mutagenesis; however, the reports on their different forms are scarce in the literature. The stability of mispairs with keto form, enol form, and ionized form of 5-halo-uracil has been computed with the M06-2X/6-31+G** level of theory. The enol form of 5-halo-uracil showed remarkable stability toward DNA mispair compared to the corresponding keto and ionized forms. (F)U-G mispair showed the highest stability in the series and (Halo)(U(enol/ionized)-G mispair interactions energies are more stable than the natural G-C basepair of DNA. To enhance the stability of DNA mispairs, we have introduced the hydroxyl group in the place of halogen atoms, which provides additional hydrogen-bonding interactions in the system while forming the 5-membered ring. The study has been further extended with lithiated 5-hydroxymethyl-uracil to stabilize the DNA mispair. (CH2OLi)U(ionized)-G mispair has shown the highest stability (ΔG = -32.4 kcal/mol) with multi O-Li interactions. AIM (atoms in molecules) and EDA (energy decomposition analysis) analysis has been performed to examine the nature of noncovalent interactions in such mispairs. EDA analysis has shown that electrostatic energy mainly contributes toward the interaction energy of mispairs. The higher stability achieved in these studied mispairs can play a pivotal role in the mutagenesis and can help to attain the mutation for many desired biological processes.
FY02 CBNP Annual Report Input: Bioinformatics Support for CBNP Research and Deployments
DOE Office of Scientific and Technical Information (OSTI.GOV)
Slezak, T; Wolinsky, M
2002-10-31
The events of FY01 dynamically reprogrammed the objectives of the CBNP bioinformatics support team, to meet rapidly-changing Homeland Defense needs and requests from other agencies for assistance: Use computational techniques to determine potential unique DNA signature candidates for microbial and viral pathogens of interest to CBNP researcher and to our collaborating partner agencies such as the Centers for Disease Control and Prevention (CDC), U.S. Department of Agriculture (USDA), Department of Defense (DOD), and Food and Drug Administration (FDA). Develop effective electronic screening measures for DNA signatures to reduce the cost and time of wet-bench screening. Build a comprehensive system formore » tracking the development and testing of DNA signatures. Build a chain-of-custody sample tracking system for field deployment of the DNA signatures as part of the BASIS project. Provide computational tools for use by CBNP Biological Foundations researchers.« less
DNA strand displacement system running logic programs.
Rodríguez-Patón, Alfonso; Sainz de Murieta, Iñaki; Sosík, Petr
2014-01-01
The paper presents a DNA-based computing model which is enzyme-free and autonomous, not requiring a human intervention during the computation. The model is able to perform iterated resolution steps with logical formulae in conjunctive normal form. The implementation is based on the technique of DNA strand displacement, with each clause encoded in a separate DNA molecule. Propositions are encoded assigning a strand to each proposition p, and its complementary strand to the proposition ¬p; clauses are encoded comprising different propositions in the same strand. The model allows to run logic programs composed of Horn clauses by cascading resolution steps. The potential of the model is demonstrated also by its theoretical capability of solving SAT. The resulting SAT algorithm has a linear time complexity in the number of resolution steps, whereas its spatial complexity is exponential in the number of variables of the formula. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
A DNA-based molecular motor that can navigate a network of tracks
NASA Astrophysics Data System (ADS)
Wickham, Shelley F. J.; Bath, Jonathan; Katsuda, Yousuke; Endo, Masayuki; Hidaka, Kumi; Sugiyama, Hiroshi; Turberfield, Andrew J.
2012-03-01
Synthetic molecular motors can be fuelled by the hydrolysis or hybridization of DNA. Such motors can move autonomously and programmably, and long-range transport has been observed on linear tracks. It has also been shown that DNA systems can compute. Here, we report a synthetic DNA-based system that integrates long-range transport and information processing. We show that the path of a motor through a network of tracks containing four possible routes can be programmed using instructions that are added externally or carried by the motor itself. When external control is used we find that 87% of the motors follow the correct path, and when internal control is used 71% of the motors follow the correct path. Programmable motion will allow the development of computing networks, molecular systems that can sort and process cargoes according to instructions that they carry, and assembly lines that can be reconfigured dynamically in response to changing demands.
A Theoretical and Experimental Study of DNA Self-assembly
NASA Astrophysics Data System (ADS)
Chandran, Harish
The control of matter and phenomena at the nanoscale is fast becoming one of the most important challenges of the 21st century with wide-ranging applications from energy and health care to computing and material science. Conventional top-down approaches to nanotechnology, having served us well for long, are reaching their inherent limitations. Meanwhile, bottom-up methods such as self-assembly are emerging as viable alternatives for nanoscale fabrication and manipulation. A particularly successful bottom up technique is DNA self-assembly where a set of carefully designed DNA strands form a nanoscale object as a consequence of specific, local interactions among the different components, without external direction. The final product of the self-assembly process might be a static nanostructure or a dynamic nanodevice that performs a specific function. Over the past two decades, DNA self-assembly has produced stunning nanoscale objects such as 2D and 3D lattices, polyhedra and addressable arbitrary shaped substrates, and a myriad of nanoscale devices such as molecular tweezers, computational circuits, biosensors and molecular assembly lines. In this dissertation we study multiple problems in the theory, simulations and experiments of DNA self-assembly. We extend the Turing-universal mathematical framework of self-assembly known as the Tile Assembly Model by incorporating randomization during the assembly process. This allows us to reduce the tile complexity of linear assemblies. We develop multiple techniques to build linear assemblies of expected length N using far fewer tile types than previously possible. We abstract the fundamental properties of DNA and develop a biochemical system, which we call meta-DNA, based entirely on strands of DNA as the only component molecule. We further develop various enzyme-free protocols to manipulate meta-DNA systems and provide strand level details along with abstract notations for these mechanisms. We simulate DNA circuits by providing detailed designs for local molecular computations that involve spatially contiguous molecules arranged on addressable substrates via enzyme-free DNA hybridization reaction cascades. We use the Visual DSD simulation software in conjunction with localized reaction rates obtained from biophysical modeling to create chemical reaction networks of localized hybridization circuits that are then model checked using the PRISM model checking software. We develop a DNA detection system employing the triggered self-assembly of a novel DNA dendritic nanostructure. Detection begins when a specific, single-stranded target DNA strand triggers a hybridization chain reaction between two distinct DNA hairpins. Each hairpin opens and hybridizes up to two copies of the other, and hence each layer of the growing dendritic nanostructure can in principle accommodate an exponentially increasing number of cognate molecules, generating a nanostructure with high molecular weight. We build linear activatable assemblies employing a novel protection/deprotection strategy to strictly enforce the direction of tiling assembly growth to ensure the robustness of the assembly process. Our system consists of two tiles that can form a linear co-polymer. These tiles, which are initially protected such that they do not react with each other, can be activated to form linear co-polymers via the use of a strand displacing enzyme.
XLS (c9orf142) is a new component of mammalian DNA double-stranded break repair
Craxton, A; Somers, J; Munnur, D; Jukes-Jones, R; Cain, K; Malewicz, M
2015-01-01
Repair of double-stranded DNA breaks (DSBs) in mammalian cells primarily occurs by the non-homologous end-joining (NHEJ) pathway, which requires seven core proteins (Ku70/Ku86, DNA-PKcs (DNA-dependent protein kinase catalytic subunit), Artemis, XRCC4-like factor (XLF), XRCC4 and DNA ligase IV). Here we show using combined affinity purification and mass spectrometry that DNA-PKcs co-purifies with all known core NHEJ factors. Furthermore, we have identified a novel evolutionary conserved protein associated with DNA-PKcs—c9orf142. Computer-based modelling of c9orf142 predicted a structure very similar to XRCC4, hence we have named c9orf142—XLS (XRCC4-like small protein). Depletion of c9orf142/XLS in cells impaired DSB repair consistent with a defect in NHEJ. Furthermore, c9orf142/XLS interacted with other core NHEJ factors. These results demonstrate the existence of a new component of the NHEJ DNA repair pathway in mammalian cells. PMID:25941166
New t-gap insertion-deletion-like metrics for DNA hybridization thermodynamic modeling.
D'yachkov, Arkadii G; Macula, Anthony J; Pogozelski, Wendy K; Renz, Thomas E; Rykov, Vyacheslav V; Torney, David C
2006-05-01
We discuss the concept of t-gap block isomorphic subsequences and use it to describe new abstract string metrics that are similar to the Levenshtein insertion-deletion metric. Some of the metrics that we define can be used to model a thermodynamic distance function on single-stranded DNA sequences. Our model captures a key aspect of the nearest neighbor thermodynamic model for hybridized DNA duplexes. One version of our metric gives the maximum number of stacked pairs of hydrogen bonded nucleotide base pairs that can be present in any secondary structure in a hybridized DNA duplex without pseudoknots. Thermodynamic distance functions are important components in the construction of DNA codes, and DNA codes are important components in biomolecular computing, nanotechnology, and other biotechnical applications that employ DNA hybridization assays. We show how our new distances can be calculated by using a dynamic programming method, and we derive a Varshamov-Gilbert-like lower bound on the size of some of codes using these distance functions as constraints. We also discuss software implementation of our DNA code design methods.
Multi-scale Modeling of Chromosomal DNA in Living Cells
NASA Astrophysics Data System (ADS)
Spakowitz, Andrew
The organization and dynamics of chromosomal DNA play a pivotal role in a range of biological processes, including gene regulation, homologous recombination, replication, and segregation. Establishing a quantitative theoretical model of DNA organization and dynamics would be valuable in bridging the gap between the molecular-level packaging of DNA and genome-scale chromosomal processes. Our research group utilizes analytical theory and computational modeling to establish a predictive theoretical model of chromosomal organization and dynamics. In this talk, I will discuss our efforts to develop multi-scale polymer models of chromosomal DNA that are both sufficiently detailed to address specific protein-DNA interactions while capturing experimentally relevant time and length scales. I will demonstrate how these modeling efforts are capable of quantitatively capturing aspects of behavior of chromosomal DNA in both prokaryotic and eukaryotic cells. This talk will illustrate that capturing dynamical behavior of chromosomal DNA at various length scales necessitates a range of theoretical treatments that accommodate the critical physical contributions that are relevant to in vivo behavior at these disparate length and time scales. National Science Foundation, Physics of Living Systems Program (PHY-1305516).
Building block synthesis using the polymerase chain assembly method.
Marchand, Julie A; Peccoud, Jean
2012-01-01
De novo gene synthesis allows the creation of custom DNA molecules without the typical constraints of traditional cloning assembly: scars, restriction site incompatibility, and the quest to find all the desired parts to name a few. Moreover, with the help of computer-assisted design, the perfect DNA molecule can be created along with its matching sequence ready to download. The challenge is to build the physical DNA molecules that have been designed with the software. Although there are several DNA assembly methods, this section presents and describes a method using the polymerase chain assembly (PCA).
Simulations Meet Experiment to Reveal New Insights into DNA Intrinsic Mechanics
Ben Imeddourene, Akli; Elbahnsi, Ahmad; Guéroult, Marc; Oguey, Christophe; Foloppe, Nicolas; Hartmann, Brigitte
2015-01-01
The accurate prediction of the structure and dynamics of DNA remains a major challenge in computational biology due to the dearth of precise experimental information on DNA free in solution and limitations in the DNA force-fields underpinning the simulations. A new generation of force-fields has been developed to better represent the sequence-dependent B-DNA intrinsic mechanics, in particular with respect to the BI ↔ BII backbone equilibrium, which is essential to understand the B-DNA properties. Here, the performance of MD simulations with the newly updated force-fields Parmbsc0εζOLI and CHARMM36 was tested against a large ensemble of recent NMR data collected on four DNA dodecamers involved in nucleosome positioning. We find impressive progress towards a coherent, realistic representation of B-DNA in solution, despite residual shortcomings. This improved representation allows new and deeper interpretation of the experimental observables, including regarding the behavior of facing phosphate groups in complementary dinucleotides, and their modulation by the sequence. It also provides the opportunity to extensively revisit and refine the coupling between backbone states and inter base pair parameters, which emerges as a common theme across all the complementary dinucleotides. In sum, the global agreement between simulations and experiment reveals new aspects of intrinsic DNA mechanics, a key component of DNA-protein recognition. PMID:26657165
Wendelsdorf, Katherine V.; Song, Zhuo; Cao, Yang; Samuels, David C.
2009-01-01
Nucleoside analogs used in antiretroviral treatment have been associated with mitochondrial toxicity. The polymerase-γ hypothesis states that this toxicity stems from the analogs' inhibition of the mitochondrial DNA polymerase (polymerase-γ) leading to mitochondrial DNA (mtDNA) depletion. We have constructed a computational model of the interaction of polymerase-γ with activated nucleoside and nucleotide analog drugs, based on experimentally measured reaction rates and base excision rates, together with the mtDNA genome size, the human mtDNA sequence, and mitochondrial dNTP concentrations. The model predicts an approximately 1000-fold difference in the activated drug concentration required for a 50% probability of mtDNA strand termination between the activated di-deoxy analogs d4T, ddC, and ddI (activated to ddA) and the activated forms of the analogs 3TC, TDF, AZT, FTC, and ABC. These predictions are supported by experimental and clinical data showing significantly greater mtDNA depletion in cell culture and patient samples caused by the di-deoxy analog drugs. For zidovudine (AZT) we calculated a very low mtDNA replication termination probability, in contrast to its reported mitochondrial toxicity in vitro and clinically. Therefore AZT mitochondrial toxicity is likely due to a mechanism that does not involve strand termination of mtDNA replication. PMID:19132079
NASA Astrophysics Data System (ADS)
Vologodskii, Alexander
2016-09-01
The widespread circular form of DNA molecules inside cells creates very serious topological problems during replication. Due to the helical structure of the double helix the parental strands of circular DNA form a link of very high order, and yet they have to be unlinked before the cell division. DNA topoisomerases, the enzymes that catalyze passing of one DNA segment through another, solve this problem in principle. However, it is very difficult to remove all entanglements between the replicated DNA molecules due to huge length of DNA comparing to the cell size. One strategy that nature uses to overcome this problem is to create the topoisomerases that can dramatically reduce the fraction of linked circular DNA molecules relative to the corresponding fraction at thermodynamic equilibrium. This striking property of the enzymes means that the enzymes that interact with DNA only locally can access their topology, a global property of circular DNA molecules. This review considers the experimental studies of the phenomenon and analyzes the theoretical models that have been suggested in attempts to explain it. We describe here how various models of enzyme action can be investigated computationally. There is no doubt at the moment that we understand basic principles governing enzyme action. Still, there are essential quantitative discrepancies between the experimental data and the theoretical predictions. We consider how these discrepancies can be overcome.
Shi, Jie-Hua; Lou, Yan-Yue; Zhou, Kai-Li; Pan, Dong-Qi
2018-06-18
As a sulfonylurea herbicide, sulfosulfuron is extensively applied in controlling broad-leaves and weeds in agriculture. It may cause a potential risk for human and herbivores health due to its widely application and residue in crops and fruits. The study of the binding characteristics of calf thymus DNA (ct-DNA) with sulfosulfuron was performed through a series of spectroscopic techniques and computer simulation. The experimental results showed sulfosulfuron interacted with ct-DNA through the groove binding. The negative values of thermodynamic parameter (ΔH 0 , ΔS 0 and ΔG 0 ) revealed that the reaction of sulfosulfuron with DNA could proceed spontaneously, and the hydrogen bonding and van der Waals forces were essential to sulfosulfuron-ct-DNA binding, which was further verified by molecular docking study. Meanwhile, the electrostatic and hydrophobic interactions also played a supporting function for the interaction of sulfosulfuron with ct-DNA. The circular dichroism (CD) results exhibited a minor change in the secondary structure of ct-DNA during interaction process. Moreover, the conformation of sulfosulfuron had the obvious change after binding to DNA, which suggested that the flexibility of sulfosulfuron contributed to stabilizing the sulfosulfuron-ct-DNA complex. Copyright © 2018 Elsevier B.V. All rights reserved.
Dedkov, V S
2009-01-01
The specificity of DNA-methyltransferase M.Bsc4I was defined in cellular lysate of Bacillus schlegelii 4. For this purpose, we used methylation sensitivity of restriction endonucleases, and also modeling of methylation. The modeling consisted in editing sequences of DNA using replacements of methylated bases and their complementary bases. The substratum DNA processed by M.Bsc4I also were used for studying sensitivity of some restriction endonucleases to methylation. Thus, it was shown that M.Bsc4I methylated 5'-Cm4CNNNNNNNGG-3' and the overlapped dcm-methylation blocked its activity. The offered approach can appear universal enough and simple for definition of specificity of DNA-methyltransferases.
NASA Astrophysics Data System (ADS)
Strom, Richard A.; Zimmerly, Andrew T.; Andrianarijaona, Vola M.
2014-05-01
It is known that ionizing radiation generates low-energy secondary electrons, which may interact with the surrounding area, including biomolecules, such as triggering DNA single strand and double strand breaks as demonstrated by Sanche and coworkers (Radiat. Res. 157, 227(2002)). The bio-effects of low-energy electrons are currently a topic of high interest. Most of the studies are dedicated to dissociative electron attachments; however, the area is still mostly unexplored and still not well understood. We are computationally investigating the effect of ionizing radiation on DNA, such as its ionization to DNA+. More specifically, we are exploring the possibility of the dissociative recombination of the temporary DNA+ with one of the low-energy secondary electrons, produced by the ionizing radiation, to be another process of DNA strand breaks. Our preliminary results, which are performed with the binaries of ORCA, will be presented. Authors wish to give special thanks to Pacific Union College Student Senate in Angwin, California, for their financial support.
An experimental study of the putative mechanism of a synthetic autonomous rotary DNA nanomotor
NASA Astrophysics Data System (ADS)
Dunn, K. E.; Leake, M. C.; Wollman, A. J. M.; Trefzer, M. A.; Johnson, S.; Tyrrell, A. M.
2017-03-01
DNA has been used to construct a wide variety of nanoscale molecular devices. Inspiration for such synthetic molecular machines is frequently drawn from protein motors, which are naturally occurring and ubiquitous. However, despite the fact that rotary motors such as ATP synthase and the bacterial flagellar motor play extremely important roles in nature, very few rotary devices have been constructed using DNA. This paper describes an experimental study of the putative mechanism of a rotary DNA nanomotor, which is based on strand displacement, the phenomenon that powers many synthetic linear DNA motors. Unlike other examples of rotary DNA machines, the device described here is designed to be capable of autonomous operation after it is triggered. The experimental results are consistent with operation of the motor as expected, and future work on an enhanced motor design may allow rotation to be observed at the single-molecule level. The rotary motor concept presented here has potential applications in molecular processing, DNA computing, biosensing and photonics.
DNA packaging in viral capsids with peptide arms.
Cao, Qianqian; Bachmann, Michael
2017-01-18
Strong chain rigidity and electrostatic self-repulsion of packed double-stranded DNA in viruses require a molecular motor to pull the DNA into the capsid. However, what is the role of electrostatic interactions between different charged components in the packaging process? Though various theories and computer simulation models were developed for the understanding of viral assembly and packaging dynamics of the genome, long-range electrostatic interactions and capsid structure have typically been neglected or oversimplified. By means of molecular dynamics simulations, we explore the effects of electrostatic interactions on the packaging dynamics of DNA based on a coarse-grained DNA and capsid model by explicitly including peptide arms (PAs), linked to the inner surface of the capsid, and counterions. Our results indicate that the electrostatic interactions between PAs, DNA, and counterions have a significant influence on the packaging dynamics. We also find that the packed DNA conformations are largely affected by the structure of the PA layer, but the packaging rate is insensitive to the layer structure.
Khara, Dinesh C; Berger, Yaron; Ouldridge, Thomas E
2018-01-01
Abstract We present a detailed coarse-grained computer simulation and single molecule fluorescence study of the walking dynamics and mechanism of a DNA bipedal motor striding on a DNA origami. In particular, we study the dependency of the walking efficiency and stepping kinetics on step size. The simulations accurately capture and explain three different experimental observations. These include a description of the maximum possible step size, a decrease in the walking efficiency over short distances and a dependency of the efficiency on the walking direction with respect to the origami track. The former two observations were not expected and are non-trivial. Based on this study, we suggest three design modifications to improve future DNA walkers. Our study demonstrates the ability of the oxDNA model to resolve the dynamics of complex DNA machines, and its usefulness as an engineering tool for the design of DNA machines that operate in the three spatial dimensions. PMID:29294083
Electronic Transport in Single-Stranded DNA Molecule Related to Huntington's Disease
NASA Astrophysics Data System (ADS)
Sarmento, R. G.; Silva, R. N. O.; Madeira, M. P.; Frazão, N. F.; Sousa, J. O.; Macedo-Filho, A.
2018-04-01
We report a numerical analysis of the electronic transport in single chain DNA molecule consisting of 182 nucleotides. The DNA chains studied were extracted from a segment of the human chromosome 4p16.3, which were modified by expansion of CAG (cytosine-adenine-guanine) triplet repeats to mimics Huntington's disease. The mutated DNA chains were connected between two platinum electrodes to analyze the relationship between charge propagation in the molecule and Huntington's disease. The computations were performed within a tight-binding model, together with a transfer matrix technique, to investigate the current-voltage (I-V) of 23 types of DNA sequence and compare them with the distributions of the related CAG repeat numbers with the disease. All DNA sequences studied have a characteristic behavior of a semiconductor. In addition, the results showed a direct correlation between the current-voltage curves and the distributions of the CAG repeat numbers, suggesting possible applications in the development of DNA-based biosensors for molecular diagnostics.
Papavasileiou, Konstantinos D; Avramopoulos, Aggelos; Leonis, Georgios; Papadopoulos, Manthos G
2017-06-01
DNA is the building block of life, as it carries the biological information controlling development, function and reproduction of all organisms. However, its central role in storing and transferring genetic information can be severely hindered by molecules with structure altering abilities. Fullerenes are nanoparticles that find a broad spectrum of uses, but their toxicological effects on living organisms upon exposure remain unclear. The present study examines the interactions of a diverse array of fullerenes with DNA, by means of Molecular Dynamics and MM-PBSA methodologies, with special focus on structural deformations that may hint toxicity implications. Our results show that pristine and hydroxylated fullerenes have no unwinding effects upon DNA structure, with the latter displaying binding preference to the DNA major groove, achieved by both direct formation of hydrogen bonds and water molecule mediation. Fluorinated derivatives are capable of penetrating DNA structure, forming intercalative complexes with high binding affinities. Copyright © 2017 Elsevier Inc. All rights reserved.
Khajeh, Masoumeh Ashrafi; Dehghan, Gholamreza; Dastmalchi, Siavoush; Shaghaghi, Masoomeh; Iranshahi, Mehrdad
2018-03-05
DNA is a major target for a number of anticancer substances. Interaction studies between small molecules and DNA are essential for rational drug designing to influence main biological processes and also introducing new probes for the assay of DNA. Tschimgine (TMG) is a monoterpene derivative with anticancer properties. In the present study we tried to elucidate the interaction of TMG with calf thymus DNA (CT-DNA) using different spectroscopic methods. UV-visible absorption spectrophotometry, fluorescence and circular dichroism (CD) spectroscopies as well as molecular docking study revealed formation of complex between TMG and CT-DNA. Binding constant (K b ) between TMG and DNA was 2.27×10 4 M -1 , that is comparable to groove binding agents. The fluorescence spectroscopic data revealed that the quenching mechanism of fluorescence of TMG by CT-DNA is static quenching. Thermodynamic parameters (ΔH<0 and ΔS<0) at different temperatures indicated that van der Waals forces and hydrogen bonds were involved in the binding process of TMG with CT-DNA. Competitive binding assay with methylene blue (MB) and Hoechst 33258 using fluorescence spectroscopy displayed that TMG possibly binds to the minor groove of CT-DNA. These observations were further confirmed by CD spectral analysis, viscosity measurements and molecular docking. Copyright © 2017 Elsevier B.V. All rights reserved.
DNA-Binding Kinetics Determines the Mechanism of Noise-Induced Switching in Gene Networks
Tse, Margaret J.; Chu, Brian K.; Roy, Mahua; Read, Elizabeth L.
2015-01-01
Gene regulatory networks are multistable dynamical systems in which attractor states represent cell phenotypes. Spontaneous, noise-induced transitions between these states are thought to underlie critical cellular processes, including cell developmental fate decisions, phenotypic plasticity in fluctuating environments, and carcinogenesis. As such, there is increasing interest in the development of theoretical and computational approaches that can shed light on the dynamics of these stochastic state transitions in multistable gene networks. We applied a numerical rare-event sampling algorithm to study transition paths of spontaneous noise-induced switching for a ubiquitous gene regulatory network motif, the bistable toggle switch, in which two mutually repressive genes compete for dominant expression. We find that the method can efficiently uncover detailed switching mechanisms that involve fluctuations both in occupancies of DNA regulatory sites and copy numbers of protein products. In addition, we show that the rate parameters governing binding and unbinding of regulatory proteins to DNA strongly influence the switching mechanism. In a regime of slow DNA-binding/unbinding kinetics, spontaneous switching occurs relatively frequently and is driven primarily by fluctuations in DNA-site occupancies. In contrast, in a regime of fast DNA-binding/unbinding kinetics, switching occurs rarely and is driven by fluctuations in levels of expressed protein. Our results demonstrate how spontaneous cell phenotype transitions involve collective behavior of both regulatory proteins and DNA. Computational approaches capable of simulating dynamics over many system variables are thus well suited to exploring dynamic mechanisms in gene networks. PMID:26488666
Zhi, Hui; Li, Xin; Wang, Peng; Gao, Yue; Gao, Baoqing; Zhou, Dianshuang; Zhang, Yan; Guo, Maoni; Yue, Ming; Shen, Weitao
2018-01-01
Abstract Lnc2Meth (http://www.bio-bigdata.com/Lnc2Meth/), an interactive resource to identify regulatory relationships between human long non-coding RNAs (lncRNAs) and DNA methylation, is not only a manually curated collection and annotation of experimentally supported lncRNAs-DNA methylation associations but also a platform that effectively integrates tools for calculating and identifying the differentially methylated lncRNAs and protein-coding genes (PCGs) in diverse human diseases. The resource provides: (i) advanced search possibilities, e.g. retrieval of the database by searching the lncRNA symbol of interest, DNA methylation patterns, regulatory mechanisms and disease types; (ii) abundant computationally calculated DNA methylation array profiles for the lncRNAs and PCGs; (iii) the prognostic values for each hit transcript calculated from the patients clinical data; (iv) a genome browser to display the DNA methylation landscape of the lncRNA transcripts for a specific type of disease; (v) tools to re-annotate probes to lncRNA loci and identify the differential methylation patterns for lncRNAs and PCGs with user-supplied external datasets; (vi) an R package (LncDM) to complete the differentially methylated lncRNAs identification and visualization with local computers. Lnc2Meth provides a timely and valuable resource that can be applied to significantly expand our understanding of the regulatory relationships between lncRNAs and DNA methylation in various human diseases. PMID:29069510
ERIC Educational Resources Information Center
King, Angela G.
2007-01-01
This article presents three reports of research advances. The first report describes a deoxyribonucleic acid (DNA)-based computer that could lead to faster, more accurate tests for diagnosing West Nile Virus and bird flu. Representing the first "medium-scale integrated molecular circuit," it is the most powerful computing device of its type to…
Petkevičiūtė, D; Pasi, M; Gonzalez, O; Maddocks, J H
2014-11-10
cgDNA is a package for the prediction of sequence-dependent configuration-space free energies for B-form DNA at the coarse-grain level of rigid bases. For a fragment of any given length and sequence, cgDNA calculates the configuration of the associated free energy minimizer, i.e. the relative positions and orientations of each base, along with a stiffness matrix, which together govern differences in free energies. The model predicts non-local (i.e. beyond base-pair step) sequence dependence of the free energy minimizer. Configurations can be input or output in either the Curves+ definition of the usual helical DNA structural variables, or as a PDB file of coordinates of base atoms. We illustrate the cgDNA package by comparing predictions of free energy minimizers from (a) the cgDNA model, (b) time-averaged atomistic molecular dynamics (or MD) simulations, and (c) NMR or X-ray experimental observation, for (i) the Dickerson-Drew dodecamer and (ii) three oligomers containing A-tracts. The cgDNA predictions are rather close to those of the MD simulations, but many orders of magnitude faster to compute. Both the cgDNA and MD predictions are in reasonable agreement with the available experimental data. Our conclusion is that cgDNA can serve as a highly efficient tool for studying structural variations in B-form DNA over a wide range of sequences. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Significance of DNA bond strength in programmable nanoparticle thermodynamics and dynamics.
Yu, Qiuyan; Hu, Jinglei; Hu, Yi; Wang, Rong
2018-04-04
Assembly of nanoparticles (NPs) coated with complementary DNA strands leads to novel crystals with nanosized basic units rather than classic atoms, ions or molecules. The assembly process is mediated by hybridization of DNA via specific base pairing interaction, and is kinetically linked to the disassociation of DNA duplexes. DNA-level physiochemical quantities, both thermodynamic and kinetic, are key to understanding this process and essential for the design of DNA-NP crystals. The melting transition properties are helpful to judge the thermostability and sensitivity of relative DNA probes or other applications. Three different cases are investigated by changing the linker length and the spacer length on which the melting properties depend using the molecular dynamics method. Melting temperature is determined by sigmoidal melting curves based on hybridization percentage versus temperature and the Lindemann melting rule simultaneously. We provide a computational strategy based on a coarse-grained model to estimate the hybridization enthalpy, entropy and free energy from percentages of hybridizations which are readily accessible in experiments. Importantly, the lifetime of DNA bond dehybridization based on temperature and the activation energy depending on DNA bond strength are also calculated. The simulation results are in good agreement with the theoretical analysis and the present experimental data. Our study provides a good strategy to predict the melting temperature which is important for the DNA-directed nanoparticle system, and bridges the dynamics and thermodynamics of DNA-directed nanoparticle systems by estimating the equilibrium constant from the hybridization of DNA bonds quantitatively.
Nanoscale Bio-engineering Solutions for Space Exploration: The Nanopore Sequencer
NASA Technical Reports Server (NTRS)
Stolc, Viktor; Cozmuta, Ioana
2004-01-01
Characterization of biological systems at the molecular level and extraction of essential information for nano-engineering design to guide the nano-fabrication of solid-state sensors and molecular identification devices is a computational challenge. The alpha hemolysin protein ion channel is used as a model system for structural analysis of nucleic acids like DNA. Applied voltage draws a DNA strand and surrounding ionic solution through the biological nanopore. The subunits in the DNA strand block ion flow by differing amounts. Atomistic scale simulations are employed using NASA supercomputers to study DNA translocation, with the aim to enhance single DNA subunit identification. Compared to protein channels, solid-state nanopores offer a better temporal control of the translocation of DNA and the possibility to easily tune its chemistry to increase the signal resolution. Potential applications for NASA missions, besides real-time genome sequencing include astronaut health, life detection and decoding of various genomes.
RDNAnalyzer: A tool for DNA secondary structure prediction and sequence analysis.
Afzal, Muhammad; Shahid, Ahmad Ali; Shehzadi, Abida; Nadeem, Shahid; Husnain, Tayyab
2012-01-01
RDNAnalyzer is an innovative computer based tool designed for DNA secondary structure prediction and sequence analysis. It can randomly generate the DNA sequence or user can upload the sequences of their own interest in RAW format. It uses and extends the Nussinov dynamic programming algorithm and has various application for the sequence analysis. It predicts the DNA secondary structure and base pairings. It also provides the tools for routinely performed sequence analysis by the biological scientists such as DNA replication, reverse compliment generation, transcription, translation, sequence specific information as total number of nucleotide bases, ATGC base contents along with their respective percentages and sequence cleaner. RDNAnalyzer is a unique tool developed in Microsoft Visual Studio 2008 using Microsoft Visual C# and Windows Presentation Foundation and provides user friendly environment for sequence analysis. It is freely available. http://www.cemb.edu.pk/sw.html RDNAnalyzer - Random DNA Analyser, GUI - Graphical user interface, XAML - Extensible Application Markup Language.
Functional specificity of a Hox protein mediated by the recognition of minor groove structure.
Joshi, Rohit; Passner, Jonathan M; Rohs, Remo; Jain, Rinku; Sosinsky, Alona; Crickmore, Michael A; Jacob, Vinitha; Aggarwal, Aneel K; Honig, Barry; Mann, Richard S
2007-11-02
The recognition of specific DNA-binding sites by transcription factors is a critical yet poorly understood step in the control of gene expression. Members of the Hox family of transcription factors bind DNA by making nearly identical major groove contacts via the recognition helices of their homeodomains. In vivo specificity, however, often depends on extended and unstructured regions that link Hox homeodomains to a DNA-bound cofactor, Extradenticle (Exd). Using a combination of structure determination, computational analysis, and in vitro and in vivo assays, we show that Hox proteins recognize specific Hox-Exd binding sites via residues located in these extended regions that insert into the minor groove but only when presented with the correct DNA sequence. Our results suggest that these residues, which are conserved in a paralog-specific manner, confer specificity by recognizing a sequence-dependent DNA structure instead of directly reading a specific DNA sequence.
A coarse-grained DNA model for the prediction of current signals in DNA translocation experiments
NASA Astrophysics Data System (ADS)
Weik, Florian; Kesselheim, Stefan; Holm, Christian
2016-11-01
We present an implicit solvent coarse-grained double-stranded DNA (dsDNA) model confined to an infinite cylindrical pore that reproduces the experimentally observed current modulations of a KaCl solution at various concentrations. Our model extends previous coarse-grained and mean-field approaches by incorporating a position dependent friction term on the ions, which Kesselheim et al. [Phys. Rev. Lett. 112, 018101 (2014)] identified as an essential ingredient to correctly reproduce the experimental data of Smeets et al. [Nano Lett. 6, 89 (2006)]. Our approach reduces the computational effort by orders of magnitude compared with all-atom simulations and serves as a promising starting point for modeling the entire translocation process of dsDNA. We achieve a consistent description of the system's electrokinetics by using explicitly parameterized ions, a friction term between the DNA beads and the ions, and a lattice-Boltzmann model for the solvent.
Heinmets, F; Leary, R H
1991-06-01
A model system (1) was established to analyze purine and pyrimidine metabolism. This system has been expanded to include macrosimulation of DNA synthesis and the study of its regulation by terminal deoxynucleoside triphosphates (dNTPs) via a complex set of interactions. Computer experiments reveal that our model exhibits adequate and reasonable sensitivity in terms of dNTP pool levels and rates of DNA synthesis when inputs to the system are varied. These simulation experiments reveal that in order to achieve maximum DNA synthesis (in terms of purine metabolism), a proper balance is required in guanine and adenine input into this metabolic system. Excessive inputs will become inhibitory to DNA synthesis. In addition, studies are carried out on rates of DNA synthesis when various parameters are changed quantitatively. The current system is formulated by 110 differential equations.
Nanoscale Bioengineering Solutions for Space Exploration the Nanopore Sequencer
NASA Technical Reports Server (NTRS)
Ioana, Cozmuta; Viktor, Stoic
2005-01-01
Characterization of biological systems at the molecular level and extraction of essential information for nano-engineering design to guide the nano-fabrication of solid-state sensors and molecular identification devices is a computational challenge. The alpha hemolysin protein ion channel is used as a model system for structural analysis of nucleic acids like DNA. Applied voltage draws a DNA strand and surrounding ionic solution through the biological nanopore. The subunits in the DNA strand block ion flow by differing amounts. Atomistic scale simulations are employed using NASA supercomputers to study DNA translocation. with the aim to enhance single DNA subunit identification. Compared to protein channels, solid-state nanopores offer a better temporal control of the translocation of DNA and the possibility to easily tune its chemistry to increase the signal resolution. Potential applications for NASA missions, besides real-time genome sequencing include astronaut health, life detection and decoding of various genomes. http://phenomrph.arc.nasa.gov/index.php
Wang, Pengfei; Gaitanaros, Stavros; Lee, Seungwoo; Bathe, Mark; Shih, William M; Ke, Yonggang
2016-06-22
Scaffolded DNA origami has proven to be a versatile method for generating functional nanostructures with prescribed sub-100 nm shapes. Programming DNA-origami tiles to form large-scale 2D lattices that span hundreds of nanometers to the micrometer scale could provide an enabling platform for diverse applications ranging from metamaterials to surface-based biophysical assays. Toward this end, here we design a family of hexagonal DNA-origami tiles using computer-aided design and demonstrate successful self-assembly of micrometer-scale 2D honeycomb lattices and tubes by controlling their geometric and mechanical properties including their interconnecting strands. Our results offer insight into programmed self-assembly of low-defect supra-molecular DNA-origami 2D lattices and tubes. In addition, we demonstrate that these DNA-origami hexagon tiles and honeycomb lattices are versatile platforms for assembling optical metamaterials via programmable spatial arrangement of gold nanoparticles (AuNPs) into cluster and superlattice geometries.
Antibody-controlled actuation of DNA-based molecular circuits.
Engelen, Wouter; Meijer, Lenny H H; Somers, Bram; de Greef, Tom F A; Merkx, Maarten
2017-02-17
DNA-based molecular circuits allow autonomous signal processing, but their actuation has relied mostly on RNA/DNA-based inputs, limiting their application in synthetic biology, biomedicine and molecular diagnostics. Here we introduce a generic method to translate the presence of an antibody into a unique DNA strand, enabling the use of antibodies as specific inputs for DNA-based molecular computing. Our approach, antibody-templated strand exchange (ATSE), uses the characteristic bivalent architecture of antibodies to promote DNA-strand exchange reactions both thermodynamically and kinetically. Detailed characterization of the ATSE reaction allowed the establishment of a comprehensive model that describes the kinetics and thermodynamics of ATSE as a function of toehold length, antibody-epitope affinity and concentration. ATSE enables the introduction of complex signal processing in antibody-based diagnostics, as demonstrated here by constructing molecular circuits for multiplex antibody detection, integration of multiple antibody inputs using logic gates and actuation of enzymes and DNAzymes for signal amplification.
Antibody-controlled actuation of DNA-based molecular circuits
NASA Astrophysics Data System (ADS)
Engelen, Wouter; Meijer, Lenny H. H.; Somers, Bram; de Greef, Tom F. A.; Merkx, Maarten
2017-02-01
DNA-based molecular circuits allow autonomous signal processing, but their actuation has relied mostly on RNA/DNA-based inputs, limiting their application in synthetic biology, biomedicine and molecular diagnostics. Here we introduce a generic method to translate the presence of an antibody into a unique DNA strand, enabling the use of antibodies as specific inputs for DNA-based molecular computing. Our approach, antibody-templated strand exchange (ATSE), uses the characteristic bivalent architecture of antibodies to promote DNA-strand exchange reactions both thermodynamically and kinetically. Detailed characterization of the ATSE reaction allowed the establishment of a comprehensive model that describes the kinetics and thermodynamics of ATSE as a function of toehold length, antibody-epitope affinity and concentration. ATSE enables the introduction of complex signal processing in antibody-based diagnostics, as demonstrated here by constructing molecular circuits for multiplex antibody detection, integration of multiple antibody inputs using logic gates and actuation of enzymes and DNAzymes for signal amplification.
Role of DNA secondary structures in fragile site breakage along human chromosome 10
Dillon, Laura W.; Pierce, Levi C. T.; Ng, Maggie C. Y.; Wang, Yuh-Hwa
2013-01-01
The formation of alternative DNA secondary structures can result in DNA breakage leading to cancer and other diseases. Chromosomal fragile sites, which are regions of the genome that exhibit chromosomal breakage under conditions of mild replication stress, are predicted to form stable DNA secondary structures. DNA breakage at fragile sites is associated with regions that are deleted, amplified or rearranged in cancer. Despite the correlation, unbiased examination of the ability to form secondary structures has not been evaluated in fragile sites. Here, using the Mfold program, we predict potential DNA secondary structure formation on the human chromosome 10 sequence, and utilize this analysis to compare fragile and non-fragile DNA. We found that aphidicolin (APH)-induced common fragile sites contain more sequence segments with potential high secondary structure-forming ability, and these segments clustered more densely than those in non-fragile DNA. Additionally, using a threshold of secondary structure-forming ability, we refined legitimate fragile sites within the cytogenetically defined boundaries, and identified potential fragile regions within non-fragile DNA. In vitro detection of alternative DNA structure formation and a DNA breakage cell assay were used to validate the computational predictions. Many of the regions identified by our analysis coincide with genes mutated in various diseases and regions of copy number alteration in cancer. This study supports the role of DNA secondary structures in common fragile site instability, provides a systematic method for their identification and suggests a mechanism by which DNA secondary structures can lead to human disease. PMID:23297364
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sherman, W.B.
2012-04-16
Synthetic DNA nanostructures are typically held together primarily by Holliday junctions. One of the most basic types of structures possible to assemble with only DNA and Holliday junctions is the triangle. To date, however, only equilateral triangles have been assembled in this manner - primarily because it is difficult to figure out what configurations of Holliday triangles have low strain. Early attempts at identifying such configurations relied upon calculations that followed the strained helical paths of DNA. Those methods, however, were computationally expensive, and failed to find many of the possible solutions. I have developed a new approach to identifyingmore » Holliday triangles that is computationally faster, and finds well over 95% of the possible solutions. The new approach is based on splitting the problem into two parts. The first part involves figuring out all the different ways that three featureless rods of the appropriate length and diameter can weave over and under one another to form a triangle. The second part of the computation entails seeing whether double helical DNA backbones can fit into the shape dictated by the rods in such a manner that the strands can cross over from one domain to the other at the appropriate spots. Structures with low strain (that is, good fit between the rods and the helices) on all three edges are recorded as promising for assembly.« less
Looping and clustering model for the organization of protein-DNA complexes on the bacterial genome
NASA Astrophysics Data System (ADS)
Walter, Jean-Charles; Walliser, Nils-Ole; David, Gabriel; Dorignac, Jérôme; Geniet, Frédéric; Palmeri, John; Parmeggiani, Andrea; Wingreen, Ned S.; Broedersz, Chase P.
2018-03-01
The bacterial genome is organized by a variety of associated proteins inside a structure called the nucleoid. These proteins can form complexes on DNA that play a central role in various biological processes, including chromosome segregation. A prominent example is the large ParB-DNA complex, which forms an essential component of the segregation machinery in many bacteria. ChIP-Seq experiments show that ParB proteins localize around centromere-like parS sites on the DNA to which ParB binds specifically, and spreads from there over large sections of the chromosome. Recent theoretical and experimental studies suggest that DNA-bound ParB proteins can interact with each other to condense into a coherent 3D complex on the DNA. However, the structural organization of this protein-DNA complex remains unclear, and a predictive quantitative theory for the distribution of ParB proteins on DNA is lacking. Here, we propose the looping and clustering model, which employs a statistical physics approach to describe protein-DNA complexes. The looping and clustering model accounts for the extrusion of DNA loops from a cluster of interacting DNA-bound proteins that is organized around a single high-affinity binding site. Conceptually, the structure of the protein-DNA complex is determined by a competition between attractive protein interactions and loop closure entropy of this protein-DNA cluster on the one hand, and the positional entropy for placing loops within the cluster on the other. Indeed, we show that the protein interaction strength determines the ‘tightness’ of the loopy protein-DNA complex. Thus, our model provides a theoretical framework for quantitatively computing the binding profiles of ParB-like proteins around a cognate (parS) binding site.
iDNA-Prot: Identification of DNA Binding Proteins Using Random Forest with Grey Model
Lin, Wei-Zhong; Fang, Jian-An; Xiao, Xuan; Chou, Kuo-Chen
2011-01-01
DNA-binding proteins play crucial roles in various cellular processes. Developing high throughput tools for rapidly and effectively identifying DNA-binding proteins is one of the major challenges in the field of genome annotation. Although many efforts have been made in this regard, further effort is needed to enhance the prediction power. By incorporating the features into the general form of pseudo amino acid composition that were extracted from protein sequences via the “grey model” and by adopting the random forest operation engine, we proposed a new predictor, called iDNA-Prot, for identifying uncharacterized proteins as DNA-binding proteins or non-DNA binding proteins based on their amino acid sequences information alone. The overall success rate by iDNA-Prot was 83.96% that was obtained via jackknife tests on a newly constructed stringent benchmark dataset in which none of the proteins included has pairwise sequence identity to any other in a same subset. In addition to achieving high success rate, the computational time for iDNA-Prot is remarkably shorter in comparison with the relevant existing predictors. Hence it is anticipated that iDNA-Prot may become a useful high throughput tool for large-scale analysis of DNA-binding proteins. As a user-friendly web-server, iDNA-Prot is freely accessible to the public at the web-site on http://icpr.jci.edu.cn/bioinfo/iDNA-Prot or http://www.jci-bioinfo.cn/iDNA-Prot. Moreover, for the convenience of the vast majority of experimental scientists, a step-by-step guide is provided on how to use the web-server to get the desired results. PMID:21935457
Simulation studies of DNA at the nanoscale: Interactions with proteins, polycations, and surfaces
NASA Astrophysics Data System (ADS)
Elder, Robert M.
Understanding the nanoscale interactions of DNA, a multifunctional biopolymer with sequence-dependent properties, with other biological and synthetic substrates and molecules is essential to advancing these technologies. This doctoral thesis research is aimed at understanding the thermodynamics and molecular-level structure when DNA interacts with proteins, polycations, and functionalized surfaces. First, we investigate the ability of a DNA damage recognition protein (HMGB1a) to bind to anti-cancer drug-induced DNA damage, seeking to explain how HMGB1a differentiates between the drugs in vivo. Using atomistic molecular dynamics simulations, we show that the structure of the drug-DNA molecule exhibits drug- and base sequence-dependence that explains some of the experimentally observed differential recognition of the drugs in various sequence contexts. Then, we show how steric hindrance from the drug decreases the deformability of the drug-DNA molecule, which decreases recognition by the protein, a concept that can be applied to rational drug design. Second, we study how polycation architecture and chemistry affect polycation-DNA binding so as to design optimal polycations for high efficiency gene (DNA) delivery. Using a multiscale computational approach involving atomistic and coarse-grained simulations, we examine how rearranging polylysine from a linear to a grafted architecture, and several aspects of the grafted architecture, affect polycation-DNA binding and the structure of polycation-DNA complexes. Next, going beyond lysine we examine how oligopeptide chemistry and sequence in the grafted architecture affects polycation-DNA binding and find that strategic placement of hydrophobic peptides might be used to tailor binding strength. Third, we study the adsorption and conformations of single-stranded DNA (an amphiphilic biopolymer) on model hydrophilic and hydrophobic surfaces. Short ssDNA oligomers adsorb to both surfaces with similar strength, with the strength of adsorption to the hydrophobic surface depending on the composition of the DNA strands, i.e. purine or pyrimidine bases. Additionally, DNA-surface and DNA-water interactions near the surfaces govern the adsorption. For longer ssDNA oligomers, the effects of surface chemistry and temperature on ssDNA conformations are rather small, but either the hydrophilic surface or increased temperature favor slightly more compact conformations due to energetic and entropic effects, respectively.
On the topology of chromatin fibres
Barbi, Maria; Mozziconacci, Julien; Victor, Jean-Marc; Wong, Hua; Lavelle, Christophe
2012-01-01
The ability of cells to pack, use and duplicate DNA remains one of the most fascinating questions in biology. To understand DNA organization and dynamics, it is important to consider the physical and topological constraints acting on it. In the eukaryotic cell nucleus, DNA is organized by proteins acting as spools on which DNA can be wrapped. These proteins can subsequently interact and form a structure called the chromatin fibre. Using a simple geometric model, we propose a general method for computing topological properties (twist, writhe and linking number) of the DNA embedded in those fibres. The relevance of the method is reviewed through the analysis of magnetic tweezers single molecule experiments that revealed unexpected properties of the chromatin fibre. Possible biological implications of these results are discussed. PMID:24098838
Computational Model for DNA Organization Mediated by Protein Interaction in Prokaryotes
NASA Astrophysics Data System (ADS)
Garimella, Karthik; Kharel, Savan
2016-03-01
In Escherichia Coli, there are several mechanisms that drive chromosomal organization. We know through experiments that the E. Coli chromosome is condensed into highly structured regions known as macrodomains (MDs). One of the regions known as the Terminus undergoes DNA-bridging condensation that form loops between distant DNA sites and it is known to be mediated by a Terminus specific protein, which binds to specific markers within the Terminus region. In the absence of Terminus specific protein, however, the Terminus region is known to not condense nearly as much, which will likely impede several biological processes including DNA replication. In order to understand the molecular basis of protein mediation in vivo several models of Terminus specific segregation have been constructed in silico which model DNA as polymer chains.
Noise reduction in single time frame optical DNA maps
Müller, Vilhelm; Westerlund, Fredrik
2017-01-01
In optical DNA mapping technologies sequence-specific intensity variations (DNA barcodes) along stretched and stained DNA molecules are produced. These “fingerprints” of the underlying DNA sequence have a resolution of the order one kilobasepairs and the stretching of the DNA molecules are performed by surface adsorption or nano-channel setups. A post-processing challenge for nano-channel based methods, due to local and global random movement of the DNA molecule during imaging, is how to align different time frames in order to produce reproducible time-averaged DNA barcodes. The current solutions to this challenge are computationally rather slow. With high-throughput applications in mind, we here introduce a parameter-free method for filtering a single time frame noisy barcode (snap-shot optical map), measured in a fraction of a second. By using only a single time frame barcode we circumvent the need for post-processing alignment. We demonstrate that our method is successful at providing filtered barcodes which are less noisy and more similar to time averaged barcodes. The method is based on the application of a low-pass filter on a single noisy barcode using the width of the Point Spread Function of the system as a unique, and known, filtering parameter. We find that after applying our method, the Pearson correlation coefficient (a real number in the range from -1 to 1) between the single time-frame barcode and the time average of the aligned kymograph increases significantly, roughly by 0.2 on average. By comparing to a database of more than 3000 theoretical plasmid barcodes we show that the capabilities to identify plasmids is improved by filtering single time-frame barcodes compared to the unfiltered analogues. Since snap-shot experiments and computational time using our method both are less than a second, this study opens up for high throughput optical DNA mapping with improved reproducibility. PMID:28640821
Raisali, Gholamreza; Mirzakhanian, Lalageh; Masoudi, Seyed Farhad; Semsarha, Farid
2013-01-01
In this work the number of DNA single-strand breaks (SSB) and double-strand breaks (DSB) due to direct and indirect effects of Auger electrons from incorporated (123)I and (125)I have been calculated by using the Geant4-DNA toolkit. We have performed and compared the calculations for several cases: (125)I versus (123)I, source positions and direct versus indirect breaks to study the capability of the Geant4-DNA in calculations of DNA damage yields. Two different simple geometries of a 41 base pair of B-DNA have been simulated. The location of (123)I has been considered to be in (123)IdUrd and three different locations for (125)I. The results showed that the simpler geometry is sufficient for direct break calculations while indirect damage yield is more sensitive to the helical shape of DNA. For (123)I Auger electrons, the average number of DSB due to the direct hits is almost twice the DSB due to the indirect hits. Furthermore, a comparison between the average number of SSB or DSB caused by Auger electrons of (125)I and (123)I in (125)IdUrd and (123)IdUrd shows that (125)I is 1.5 times more effective than (123)I per decay. The results are in reasonable agreement with previous experimental and theoretical results which shows the applicability of the Geant-DNA toolkit in nanodosimetry calculations which benefits from the open-source accessibility with the advantage that the DNA models used in this work enable us to save the computational time. Also, the results showed that the simpler geometry is suitable for direct break calculations, while for the indirect damage yield, the more precise model is preferred.
Shi, Zhenyu; Wedd, Anthony G.; Gras, Sally L.
2013-01-01
The development of synthetic biology requires rapid batch construction of large gene networks from combinations of smaller units. Despite the availability of computational predictions for well-characterized enzymes, the optimization of most synthetic biology projects requires combinational constructions and tests. A new building-brick-style parallel DNA assembly framework for simple and flexible batch construction is presented here. It is based on robust recombination steps and allows a variety of DNA assembly techniques to be organized for complex constructions (with or without scars). The assembly of five DNA fragments into a host genome was performed as an experimental demonstration. PMID:23468883
Moghadam, Neda Hosseinpour; Salehzadeh, Sadegh; Shahabadi, Nahid; Golbedaghi, Reza
2017-07-03
The possible interaction between the antiviral drug oseltamivir and calf thymus DNA at physiological pH was studied by spectrophotometry, competitive spectrofluorimetry, differential pulse voltammogram (DPV), circular dichroism spectroscopy (CD), viscosity measurements, salt effect, and computational studies. Intercalation of oseltamivir between the base pairs of DNA was shown by a sharp increase in specific viscosity of DNA and a decrease of the peak current and a positive shift in differential pulse voltammogram. Competitive fluorescence experiments were performed using neutral red (NR) as a probe for the intercalation binding mode. The studies showed that oseltamivir is able to release the NR.
Although recent technological advances in DNA sequencing and computational biology now allow scientists to compare entire microbial genomes, the use of these approaches to discern key genomic differences between natural microbial communities remains prohibitively expensive for mo...
Technological advances in DNA sequencing and computational biology allow scientists to compare entire microbial genomes. However, the use of these approaches to discern key genomic differences between natural microbial communities remains prohibitively expensive for most laborato...
IDENTIFICATION OF BACTERIAL DNA MARKERS FOR THE DETECTION OF HUMAN AND CATTLE FECAL POLLUTION
Technological advances in DNA sequencing and computational biology allow scientists to compare entire microbial genomes. However, the use of these approaches to discern key genomic differences between natural microbial communities remains prohibitively expensive for most laborato...
mtDNA-Server: next-generation sequencing data analysis of human mitochondrial DNA in the cloud.
Weissensteiner, Hansi; Forer, Lukas; Fuchsberger, Christian; Schöpf, Bernd; Kloss-Brandstätter, Anita; Specht, Günther; Kronenberg, Florian; Schönherr, Sebastian
2016-07-08
Next generation sequencing (NGS) allows investigating mitochondrial DNA (mtDNA) characteristics such as heteroplasmy (i.e. intra-individual sequence variation) to a higher level of detail. While several pipelines for analyzing heteroplasmies exist, issues in usability, accuracy of results and interpreting final data limit their usage. Here we present mtDNA-Server, a scalable web server for the analysis of mtDNA studies of any size with a special focus on usability as well as reliable identification and quantification of heteroplasmic variants. The mtDNA-Server workflow includes parallel read alignment, heteroplasmy detection, artefact or contamination identification, variant annotation as well as several quality control metrics, often neglected in current mtDNA NGS studies. All computational steps are parallelized with Hadoop MapReduce and executed graphically with Cloudgene. We validated the underlying heteroplasmy and contamination detection model by generating four artificial sample mix-ups on two different NGS devices. Our evaluation data shows that mtDNA-Server detects heteroplasmies and artificial recombinations down to the 1% level with perfect specificity and outperforms existing approaches regarding sensitivity. mtDNA-Server is currently able to analyze the 1000G Phase 3 data (n = 2,504) in less than 5 h and is freely accessible at https://mtdna-server.uibk.ac.at. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Macroscopic modeling and simulations of supercoiled DNA with bound proteins
NASA Astrophysics Data System (ADS)
Huang, Jing; Schlick, Tamar
2002-11-01
General methods are presented for modeling and simulating DNA molecules with bound proteins on the macromolecular level. These new approaches are motivated by the need for accurate and affordable methods to simulate slow processes (on the millisecond time scale) in DNA/protein systems, such as the large-scale motions involved in the Hin-mediated inversion process. Our approaches, based on the wormlike chain model of long DNA molecules, introduce inhomogeneous potentials for DNA/protein complexes based on available atomic-level structures. Electrostatically, treat those DNA/protein complexes as sets of effective charges, optimized by our discrete surface charge optimization package, in which the charges are distributed on an excluded-volume surface that represents the macromolecular complex. We also introduce directional bending potentials as well as non-identical bead hydrodynamics algorithm to further mimic the inhomogeneous effects caused by protein binding. These models thus account for basic elements of protein binding effects on DNA local structure but remain computational tractable. To validate these models and methods, we reproduce various properties measured by both Monte Carlo methods and experiments. We then apply the developed models to study the Hin-mediated inversion system in long DNA. By simulating supercoiled, circular DNA with or without bound proteins, we observe significant effects of protein binding on global conformations and long-time dynamics of the DNA on the kilo basepair length.
Lee, Hwan Young; Song, Injee; Ha, Eunho; Cho, Sung-Bae; Yang, Woo Ick; Shin, Kyoung-Jin
2008-01-01
Background For the past few years, scientific controversy has surrounded the large number of errors in forensic and literature mitochondrial DNA (mtDNA) data. However, recent research has shown that using mtDNA phylogeny and referring to known mtDNA haplotypes can be useful for checking the quality of sequence data. Results We developed a Web-based bioinformatics resource "mtDNAmanager" that offers a convenient interface supporting the management and quality analysis of mtDNA sequence data. The mtDNAmanager performs computations on mtDNA control-region sequences to estimate the most-probable mtDNA haplogroups and retrieves similar sequences from a selected database. By the phased designation of the most-probable haplogroups (both expected and estimated haplogroups), mtDNAmanager enables users to systematically detect errors whilst allowing for confirmation of the presence of clear key diagnostic mutations and accompanying mutations. The query tools of mtDNAmanager also facilitate database screening with two options of "match" and "include the queried nucleotide polymorphism". In addition, mtDNAmanager provides Web interfaces for users to manage and analyse their own data in batch mode. Conclusion The mtDNAmanager will provide systematic routines for mtDNA sequence data management and analysis via easily accessible Web interfaces, and thus should be very useful for population, medical and forensic studies that employ mtDNA analysis. mtDNAmanager can be accessed at . PMID:19014619
2011-09-30
DNA profiles. Referred to as geneGIS, the program will provide the ability to display, browse, select, filter and summarize spatial or temporal...of the SPLASH photo-identification records and available DNA profiles is underway through integration and crosschecking by Cascadia and MMI . An...Darwin Core standards where possible and can accommodate the current databases developed for telemetry data at MMI and SPLASH collection records at
Lin, Xiaodong; Liu, Yaqing; Deng, Jiankang; Lyu, Yanlong; Qian, Pengcheng; Li, Yunfei; Wang, Shuo
2018-02-21
The integration of multiple DNA logic gates on a universal platform to implement advance logic functions is a critical challenge for DNA computing. Herein, a straightforward and powerful strategy in which a guanine-rich DNA sequence lighting up a silver nanocluster and fluorophore was developed to construct a library of logic gates on a simple DNA-templated silver nanoclusters (DNA-AgNCs) platform. This library included basic logic gates, YES, AND, OR, INHIBIT, and XOR, which were further integrated into complex logic circuits to implement diverse advanced arithmetic/non-arithmetic functions including half-adder, half-subtractor, multiplexer, and demultiplexer. Under UV irradiation, all the logic functions could be instantly visualized, confirming an excellent repeatability. The logic operations were entirely based on DNA hybridization in an enzyme-free and label-free condition, avoiding waste accumulation and reducing cost consumption. Interestingly, a DNA-AgNCs-based multiplexer was, for the first time, used as an intelligent biosensor to identify pathogenic genes, E. coli and S. aureus genes, with a high sensitivity. The investigation provides a prototype for the wireless integration of multiple devices on even the simplest single-strand DNA platform to perform diverse complex functions in a straightforward and cost-effective way.
Temporal patterns of damage and decay kinetics of DNA retrieved from plant herbarium specimens.
Weiß, Clemens L; Schuenemann, Verena J; Devos, Jane; Shirsekar, Gautam; Reiter, Ella; Gould, Billie A; Stinchcombe, John R; Krause, Johannes; Burbano, Hernán A
2016-06-01
Herbaria archive a record of changes of worldwide plant biodiversity harbouring millions of specimens that contain DNA suitable for genome sequencing. To profit from this resource, it is fundamental to understand in detail the process of DNA degradation in herbarium specimens. We investigated patterns of DNA fragmentation and nucleotide misincorporation by analysing 86 herbarium samples spanning the last 300 years using Illumina shotgun sequencing. We found an exponential decay relationship between DNA fragmentation and time, and estimated a per nucleotide fragmentation rate of 1.66 × 10(-4) per year, which is six times faster than the rate estimated for ancient bones. Additionally, we found that strand breaks occur specially before purines, and that depurination-driven DNA breakage occurs constantly through time and can to a great extent explain decreasing fragment length over time. Similar to what has been found analysing ancient DNA from bones, we found a strong correlation between the deamination-driven accumulation of cytosine to thymine substitutions and time, which reinforces the importance of substitution patterns to authenticate the ancient/historical nature of DNA fragments. Accurate estimations of DNA degradation through time will allow informed decisions about laboratory and computational procedures to take advantage of the vast collection of worldwide herbarium specimens.
Large-Scale Concatenation cDNA Sequencing
Yu, Wei; Andersson, Björn; Worley, Kim C.; Muzny, Donna M.; Ding, Yan; Liu, Wen; Ricafrente, Jennifer Y.; Wentland, Meredith A.; Lennon, Greg; Gibbs, Richard A.
1997-01-01
A total of 100 kb of DNA derived from 69 individual human brain cDNA clones of 0.7–2.0 kb were sequenced by concatenated cDNA sequencing (CCS), whereby multiple individual DNA fragments are sequenced simultaneously in a single shotgun library. The method yielded accurate sequences and a similar efficiency compared with other shotgun libraries constructed from single DNA fragments (>20 kb). Computer analyses were carried out on 65 cDNA clone sequences and their corresponding end sequences to examine both nucleic acid and amino acid sequence similarities in the databases. Thirty-seven clones revealed no DNA database matches, 12 clones generated exact matches (≥98% identity), and 16 clones generated nonexact matches (57%–97% identity) to either known human or other species genes. Of those 28 matched clones, 8 had corresponding end sequences that failed to identify similarities. In a protein similarity search, 27 clone sequences displayed significant matches, whereas only 20 of the end sequences had matches to known protein sequences. Our data indicate that full-length cDNA insert sequences provide significantly more nucleic acid and protein sequence similarity matches than expressed sequence tags (ESTs) for database searching. [All 65 cDNA clone sequences described in this paper have been submitted to the GenBank data library under accession nos. U79240–U79304.] PMID:9110174
Comparison of computational methods to model DNA minor groove binders.
Srivastava, Hemant Kumar; Chourasia, Mukesh; Kumar, Devesh; Sastry, G Narahari
2011-03-28
There has been a profound interest in designing small molecules that interact in sequence-selective fashion with DNA minor grooves. However, most in silico approaches have not been parametrized for DNA ligand interaction. In this regard, a systematic computational analysis of 57 available PDB structures of noncovalent DNA minor groove binders has been undertaken. The study starts with a rigorous benchmarking of GOLD, GLIDE, CDOCKER, and AUTODOCK docking protocols followed by developing QSSR models and finally molecular dynamics simulations. In GOLD and GLIDE, the orientation of the best score pose is closer to the lowest rmsd pose, and the deviation in the conformation of various poses is also smaller compared to other docking protocols. Efficient QSSR models were developed with constitutional, topological, and quantum chemical descriptors on the basis of B3LYP/6-31G* optimized geometries, and with this ΔT(m) values of 46 ligands were predicted. Molecular dynamics simulations of the 14 DNA-ligand complexes with Amber 8.0 show that the complexes are stable in aqueous conditions and do not undergo noticeable fluctuations during the 5 ns production run, with respect to their initial placement in the minor groove region.
Universal computing by DNA origami robots in a living animal
Levner, Daniel; Ittah, Shmulik; Abu-Horowitz, Almogit; Bachelet, Ido
2014-01-01
Biological systems are collections of discrete molecular objects that move around and collide with each other. Cells carry out elaborate processes by precisely controlling these collisions, but developing artificial machines that can interface with and control such interactions remains a significant challenge. DNA is a natural substrate for computing and has been used to implement a diverse set of mathematical problems1-3, logic circuits4-6 and robotics7-9. The molecule also naturally interfaces with living systems, and different forms of DNA-based biocomputing have previously been demonstrated10-13. Here we show that DNA origami14-16 can be used to fabricate nanoscale robots that are capable of dynamically interacting with each other17-18 in a living animal. The interactions generate logical outputs, which are relayed to switch molecular payloads on or off. As a proof-of-principle, we use the system to create architectures that emulate various logic gates (AND, OR, XOR, NAND, NOT, CNOT, and a half adder). Following an ex vivo prototyping phase, we successfully employed the DNA origami robots in living cockroaches (Blaberus discoidalis) to control a molecule that targets the cells of the animal. PMID:24705510
Gu, Jiande; Wang, Jing; Leszczynski, Jerzy
2014-01-30
Computational chemistry approach was applied to explore the nature of electron attachment to cytosine-rich DNA single strands. An oligomer dinucleoside phosphate deoxycytidylyl-3',5'-deoxycytidine (dCpdC) was selected as a model system for investigations by density functional theory. Electron distribution patterns for the radical anions of dCpdC in aqueous solution were explored. The excess electron may reside on the nucleobase at the 5' position (dC(•-)pdC) or at the 3' position (dCpdC(•-)). From comparison with electron attachment to the cytosine related DNA fragments, the electron affinity for the formation of the cytosine-centered radical anion in DNA is estimated to be around 2.2 eV. Electron attachment to cytosine sites in DNA single strands might cause perturbations of local structural characteristics. Visible absorption spectroscopy may be applied to validate computational results and determine experimentally the existence of the base-centered radical anion. The time-dependent DFT study shows the absorption around 550-600 nm for the cytosine-centered radical anions of DNA oligomers. This indicates that if such species are detected experimentally they would be characterized by a distinctive color.
Nishimoto, Sachiko; Fukuda, Daiju; Higashikuni, Yasutomi; Tanaka, Kimie; Hirata, Yoichiro; Murata, Chie; Kim-Kaneyama, Joo-Ri; Sato, Fukiko; Bando, Masahiro; Yagi, Shusuke; Soeki, Takeshi; Hayashi, Tetsuya; Imoto, Issei; Sakaue, Hiroshi; Shimabukuro, Michio; Sata, Masataka
2016-03-01
Obesity stimulates chronic inflammation in adipose tissue, which is associated with insulin resistance, although the underlying mechanism remains largely unknown. Here we showed that obesity-related adipocyte degeneration causes release of cell-free DNA (cfDNA), which promotes macrophage accumulation in adipose tissue via Toll-like receptor 9 (TLR9), originally known as a sensor of exogenous DNA fragments. Fat-fed obese wild-type mice showed increased release of cfDNA, as determined by the concentrations of single-stranded DNA (ssDNA) and double-stranded DNA (dsDNA) in plasma. cfDNA released from degenerated adipocytes promoted monocyte chemoattractant protein-1 (MCP-1) expression in wild-type macrophages, but not in TLR9-deficient (Tlr9 (-/-) ) macrophages. Fat-fed Tlr9 (-/-) mice demonstrated reduced macrophage accumulation and inflammation in adipose tissue and better insulin sensitivity compared with wild-type mice, whereas bone marrow reconstitution with wild-type bone marrow restored the attenuation of insulin resistance observed in fat-fed Tlr9 (-/-) mice. Administration of a TLR9 inhibitory oligonucleotide to fat-fed wild-type mice reduced the accumulation of macrophages in adipose tissue and improved insulin resistance. Furthermore, in humans, plasma ssDNA level was significantly higher in patients with computed tomography-determined visceral obesity and was associated with homeostasis model assessment of insulin resistance (HOMA-IR), which is the index of insulin resistance. Our study may provide a novel mechanism for the development of sterile inflammation in adipose tissue and a potential therapeutic target for insulin resistance.
NGS-based likelihood ratio for identifying contributors in two- and three-person DNA mixtures.
Chan Mun Wei, Joshua; Zhao, Zicheng; Li, Shuai Cheng; Ng, Yen Kaow
2018-06-01
DNA fingerprinting, also known as DNA profiling, serves as a standard procedure in forensics to identify a person by the short tandem repeat (STR) loci in their DNA. By comparing the STR loci between DNA samples, practitioners can calculate a probability of match to identity the contributors of a DNA mixture. Most existing methods are based on 13 core STR loci which were identified by the Federal Bureau of Investigation (FBI). Analyses based on these loci of DNA mixture for forensic purposes are highly variable in procedures, and suffer from subjectivity as well as bias in complex mixture interpretation. With the emergence of next-generation sequencing (NGS) technologies, the sequencing of billions of DNA molecules can be parallelized, thus greatly increasing throughput and reducing the associated costs. This allows the creation of new techniques that incorporate more loci to enable complex mixture interpretation. In this paper, we propose a computation for likelihood ratio that uses NGS (next generation sequencing) data for DNA testing on mixed samples. We have applied the method to 4480 simulated DNA mixtures, which consist of various mixture proportions of 8 unrelated whole-genome sequencing data. The results confirm the feasibility of utilizing NGS data in DNA mixture interpretations. We observed an average likelihood ratio as high as 285,978 for two-person mixtures. Using our method, all 224 identity tests for two-person mixtures and three-person mixtures were correctly identified. Copyright © 2018 Elsevier Ltd. All rights reserved.
Christen, Matthias; Deutsch, Samuel; Christen, Beat
2015-08-21
Recent advances in synthetic biology have resulted in an increasing demand for the de novo synthesis of large-scale DNA constructs. Any process improvement that enables fast and cost-effective streamlining of digitized genetic information into fabricable DNA sequences holds great promise to study, mine, and engineer genomes. Here, we present Genome Calligrapher, a computer-aided design web tool intended for whole genome refactoring of bacterial chromosomes for de novo DNA synthesis. By applying a neutral recoding algorithm, Genome Calligrapher optimizes GC content and removes obstructive DNA features known to interfere with the synthesis of double-stranded DNA and the higher order assembly into large DNA constructs. Subsequent bioinformatics analysis revealed that synthesis constraints are prevalent among bacterial genomes. However, a low level of codon replacement is sufficient for refactoring bacterial genomes into easy-to-synthesize DNA sequences. To test the algorithm, 168 kb of synthetic DNA comprising approximately 20 percent of the synthetic essential genome of the cell-cycle bacterium Caulobacter crescentus was streamlined and then ordered from a commercial supplier of low-cost de novo DNA synthesis. The successful assembly into eight 20 kb segments indicates that Genome Calligrapher algorithm can be efficiently used to refactor difficult-to-synthesize DNA. Genome Calligrapher is broadly applicable to recode biosynthetic pathways, DNA sequences, and whole bacterial genomes, thus offering new opportunities to use synthetic biology tools to explore the functionality of microbial diversity. The Genome Calligrapher web tool can be accessed at https://christenlab.ethz.ch/GenomeCalligrapher .
Streamlining the Design-to-Build Transition with Build-Optimization Software Tools.
Oberortner, Ernst; Cheng, Jan-Fang; Hillson, Nathan J; Deutsch, Samuel
2017-03-17
Scaling-up capabilities for the design, build, and test of synthetic biology constructs holds great promise for the development of new applications in fuels, chemical production, or cellular-behavior engineering. Construct design is an essential component in this process; however, not every designed DNA sequence can be readily manufactured, even using state-of-the-art DNA synthesis methods. Current biological computer-aided design and manufacture tools (bioCAD/CAM) do not adequately consider the limitations of DNA synthesis technologies when generating their outputs. Designed sequences that violate DNA synthesis constraints may require substantial sequence redesign or lead to price-premiums and temporal delays, which adversely impact the efficiency of the DNA manufacturing process. We have developed a suite of build-optimization software tools (BOOST) to streamline the design-build transition in synthetic biology engineering workflows. BOOST incorporates knowledge of DNA synthesis success determinants into the design process to output ready-to-build sequences, preempting the need for sequence redesign. The BOOST web application is available at https://boost.jgi.doe.gov and its Application Program Interfaces (API) enable integration into automated, customized DNA design processes. The herein presented results highlight the effectiveness of BOOST in reducing DNA synthesis costs and timelines.
NASA Astrophysics Data System (ADS)
Sumarsono, Danardono A.; Ibrahim, Fera; Santoso, Satria P.; Sari, Gema P.
2018-02-01
Gene gun is a mechanical device which has been used to deliver DNA vaccine into the cells and tissues by increasing the uptake of DNA plasmid so it can generate a high immune response with less amount of DNA. Nozzle is an important part of the gene gun which used to accelerate DNA in particle form with a gas flow to reach adequate momentum to enter the epidermis of human skin and elicit immune response. We developed new designs of nozzle for gene gun to make DNA uptake more efficient in vaccination. We used Computational Fluid Dynamics (CFD) by Autodesk® Simulation 2015 to simulate static fluid pressure and velocity contour of supersonic wave and parametric distance to predict the accuracy of the new nozzle. The result showed that the nozzle could create a shockwave at the distance parametric to the object from 4 to 5 cm using fluid pressure varied between 0.8-1.2 MPa. This is indication a possibility that the DNA particle could penetrate under the mammalian skin. For the future research step, this new nozzle model could be considered for development the main component of the DNA delivery system in vaccination in vivo
3DNALandscapes: a database for exploring the conformational features of DNA.
Zheng, Guohui; Colasanti, Andrew V; Lu, Xiang-Jun; Olson, Wilma K
2010-01-01
3DNALandscapes, located at: http://3DNAscapes.rutgers.edu, is a new database for exploring the conformational features of DNA. In contrast to most structural databases, which archive the Cartesian coordinates and/or derived parameters and images for individual structures, 3DNALandscapes enables searches of conformational information across multiple structures. The database contains a wide variety of structural parameters and molecular images, computed with the 3DNA software package and known to be useful for characterizing and understanding the sequence-dependent spatial arrangements of the DNA sugar-phosphate backbone, sugar-base side groups, base pairs, base-pair steps, groove structure, etc. The data comprise all DNA-containing structures--both free and bound to proteins, drugs and other ligands--currently available in the Protein Data Bank. The web interface allows the user to link, report, plot and analyze this information from numerous perspectives and thereby gain insight into DNA conformation, deformability and interactions in different sequence and structural contexts. The data accumulated from known, well-resolved DNA structures can serve as useful benchmarks for the analysis and simulation of new structures. The collective data can also help to understand how DNA deforms in response to proteins and other molecules and undergoes conformational rearrangements.
Shi, Liang; Khandurina, Julia; Ronai, Zsolt; Li, Bi-Yu; Kwan, Wai King; Wang, Xun; Guttman, András
2003-01-01
A capillary gel electrophoresis based automated DNA fraction collection technique was developed to support a novel DNA fragment-pooling strategy for expressed sequence tag (EST) library construction. The cDNA population is first cleaved by BsaJ I and EcoR I restriction enzymes, and then subpooled by selective ligation with specific adapters followed by polymerase chain reaction (PCR) amplification and labeling. Combination of this cDNA fingerprinting method with high-resolution capillary gel electrophoresis separation and precise fractionation of individual cDNA transcript representatives avoids redundant fragment selection and concomitant repetitive sequencing of abundant transcripts. Using a computer-controlled capillary electrophoresis device the transcript representatives were separated by their size and fractions were automatically collected in every 30 s into 96-well plates. The high resolving power of the sieving matrix ensured sequencing grade separation of the DNA fragments (i.e., single-base resolution) and successful fraction collection. Performance and precision of the fraction collection procedure was validated by PCR amplification of the collected DNA fragments followed by capillary electrophoresis analysis for size and purity verification. The collected and PCR-amplified transcript representatives, ranging up to several hundred base pairs, were then sequenced to create an EST library.
Simulations of DNA stretching by flow field in microchannels with complex geometry.
Huang, Chiou-De; Kang, Dun-Yen; Hsieh, Chih-Chen
2014-01-01
Recently, we have reported the experimental results of DNA stretching by flow field in three microchannels (C. H. Lee and C. C. Hsieh, Biomicrofluidics 7(1), 014109 (2013)) designed specifically for the purpose of preconditioning DNA conformation for easier stretching. The experimental results do not only demonstrate the superiority of the new devices but also provides detailed observation of DNA behavior in complex flow field that was not available before. In this study, we use Brownian dynamics-finite element method (BD-FEM) to simulate DNA behavior in these microchannels, and compare the results against the experiments. Although the hydrodynamic interaction (HI) between DNA segments and between DNA and the device boundaries was not included in the simulations, the simulation results are in fairly good agreement with the experimental data from either the aspect of the single molecule behavior or from the aspect of ensemble averaged properties. The discrepancy between the simulation and the experimental results can be explained by the neglect of HI effect in the simulations. Considering the huge savings on the computational cost from neglecting HI, we conclude that BD-FEM can be used as an efficient and economic designing tool for developing new microfluidic device for DNA manipulation.
NASA Astrophysics Data System (ADS)
Suenaga, A.; Yatsu, C.; Komeiji, Y.; Uebayasi, M.; Meguro, T.; Yamato, I.
2000-08-01
Molecular dynamics simulation of Escherichia colitrp-repressor/operator complex was performed to elucidate protein-DNA interactions in solution for 800 ps on special-purpose computer MD-GRAPE. The Ewald summation method was employed to treat the electrostatic interaction without cutoff. DNA kept stable conformation in comparison with the result of the conventional cutoff method. Thus, the trajectories obtained were used to analyze the protein-DNA interaction and to understand the role of dynamics of water molecules forming sequence specific recognition interface. The dynamical cross-correlation map showed a significant positive correlation between the helix-turn-helix DNA-binding motifs and the major grooves of operator DNA. The extensive contact surface was stable during the simulation. Most of the contacts consisted of direct interactions between phosphates of DNA and the protein, but several water-mediated polar contacts were also observed. These water-mediated interactions, which were also seen in the crystal structure (Z. Otwinowski, et al., Nature, 335 (1998) 321) emerged spontaneously from the randomized initial configuration of the solvent. This result suggests the importance of the water-mediated interaction in specific recognition of DNA by the trp-repressor, consistent with X-ray structural information.
NASA Astrophysics Data System (ADS)
Mielke, Steven P.; Grønbech-Jensen, Niels; Krishnan, V. V.; Fink, William H.; Benham, Craig J.
2005-09-01
The topological state of DNA in vivo is dynamically regulated by a number of processes that involve interactions with bound proteins. In one such process, the tracking of RNA polymerase along the double helix during transcription, restriction of rotational motion of the polymerase and associated structures, generates waves of overtwist downstream and undertwist upstream from the site of transcription. The resulting superhelical stress is often sufficient to drive double-stranded DNA into a denatured state at locations such as promoters and origins of replication, where sequence-specific duplex opening is a prerequisite for biological function. In this way, transcription and other events that actively supercoil the DNA provide a mechanism for dynamically coupling genetic activity with regulatory and other cellular processes. Although computer modeling has provided insight into the equilibrium dynamics of DNA supercoiling, to date no model has appeared for simulating sequence-dependent DNA strand separation under the nonequilibrium conditions imposed by the dynamic introduction of torsional stress. Here, we introduce such a model and present results from an initial set of computer simulations in which the sequences of dynamically superhelical, 147 base pair DNA circles were systematically altered in order to probe the accuracy with which the model can predict location, extent, and time of stress-induced duplex denaturation. The results agree both with well-tested statistical mechanical calculations and with available experimental information. Additionally, we find that sites susceptible to denaturation show a propensity for localizing to supercoil apices, suggesting that base sequence determines locations of strand separation not only through the energetics of interstrand interactions, but also by influencing the geometry of supercoiling.
Musset, Lise; Hubert, Véronique; Le Bras, Jacques
2014-01-01
The usefulness of atovaquone-proguanil (AP) as an antimalarial treatment is compromised by the emergence of atovaquone resistance during therapy. However, the origin of the parasite mitochondrial DNA (mtDNA) mutation conferring atovaquone resistance remains elusive. Here, we report a patient-based stochastic model that tracks the intrahost emergence of mutations in the multicopy mtDNA during the first erythrocytic parasite cycles leading to the malaria febrile episode. The effect of mtDNA copy number, mutation rate, mutation cost, and total parasite load on the mutant parasite load per patient was evaluated. Computer simulations showed that almost any infected patient carried, after four to seven erythrocytic cycles, de novo mutant parasites at low frequency, with varied frequencies of parasites carrying varied numbers of mutant mtDNA copies. A large interpatient variability in the size of this mutant reservoir was found; this variability was due to the different parameters tested but also to the relaxed replication and partitioning of mtDNA copies during mitosis. We also report seven clinical cases in which AP-resistant infections were treated by AP. These provided evidence that parasiticidal drug concentrations against AP-resistant parasites were transiently obtained within days after treatment initiation. Altogether, these results suggest that each patient carries new mtDNA mutant parasites that emerge before treatment but are killed by high starting drug concentrations. However, because the size of this mutant reservoir is highly variable from patient to patient, we propose that some patients fail to eliminate all of the mutant parasites, repeatedly producing de novo AP treatment failures. PMID:24867967
Mielke, Steven P; Grønbech-Jensen, Niels; Krishnan, V V; Fink, William H; Benham, Craig J
2005-09-22
The topological state of DNA in vivo is dynamically regulated by a number of processes that involve interactions with bound proteins. In one such process, the tracking of RNA polymerase along the double helix during transcription, restriction of rotational motion of the polymerase and associated structures, generates waves of overtwist downstream and undertwist upstream from the site of transcription. The resulting superhelical stress is often sufficient to drive double-stranded DNA into a denatured state at locations such as promoters and origins of replication, where sequence-specific duplex opening is a prerequisite for biological function. In this way, transcription and other events that actively supercoil the DNA provide a mechanism for dynamically coupling genetic activity with regulatory and other cellular processes. Although computer modeling has provided insight into the equilibrium dynamics of DNA supercoiling, to date no model has appeared for simulating sequence-dependent DNA strand separation under the nonequilibrium conditions imposed by the dynamic introduction of torsional stress. Here, we introduce such a model and present results from an initial set of computer simulations in which the sequences of dynamically superhelical, 147 base pair DNA circles were systematically altered in order to probe the accuracy with which the model can predict location, extent, and time of stress-induced duplex denaturation. The results agree both with well-tested statistical mechanical calculations and with available experimental information. Additionally, we find that sites susceptible to denaturation show a propensity for localizing to supercoil apices, suggesting that base sequence determines locations of strand separation not only through the energetics of interstrand interactions, but also by influencing the geometry of supercoiling.
Zhi, Hui; Li, Xin; Wang, Peng; Gao, Yue; Gao, Baoqing; Zhou, Dianshuang; Zhang, Yan; Guo, Maoni; Yue, Ming; Shen, Weitao; Ning, Shangwei; Jin, Lianhong; Li, Xia
2018-01-04
Lnc2Meth (http://www.bio-bigdata.com/Lnc2Meth/), an interactive resource to identify regulatory relationships between human long non-coding RNAs (lncRNAs) and DNA methylation, is not only a manually curated collection and annotation of experimentally supported lncRNAs-DNA methylation associations but also a platform that effectively integrates tools for calculating and identifying the differentially methylated lncRNAs and protein-coding genes (PCGs) in diverse human diseases. The resource provides: (i) advanced search possibilities, e.g. retrieval of the database by searching the lncRNA symbol of interest, DNA methylation patterns, regulatory mechanisms and disease types; (ii) abundant computationally calculated DNA methylation array profiles for the lncRNAs and PCGs; (iii) the prognostic values for each hit transcript calculated from the patients clinical data; (iv) a genome browser to display the DNA methylation landscape of the lncRNA transcripts for a specific type of disease; (v) tools to re-annotate probes to lncRNA loci and identify the differential methylation patterns for lncRNAs and PCGs with user-supplied external datasets; (vi) an R package (LncDM) to complete the differentially methylated lncRNAs identification and visualization with local computers. Lnc2Meth provides a timely and valuable resource that can be applied to significantly expand our understanding of the regulatory relationships between lncRNAs and DNA methylation in various human diseases. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Connecting localized DNA strand displacement reactions
NASA Astrophysics Data System (ADS)
Mullor Ruiz, Ismael; Arbona, Jean-Michel; Lad, Amitkumar; Mendoza, Oscar; Aimé, Jean-Pierre; Elezgaray, Juan
2015-07-01
Logic circuits based on DNA strand displacement reactions have been shown to be versatile enough to compute the square root of four-bit numbers. The implementation of these circuits as a set of bulk reactions faces difficulties which include leaky reactions and intrinsically slow, diffusion-limited reaction rates. In this paper, we consider simple examples of these circuits when they are attached to platforms (DNA origamis). As expected, constraining distances between DNA strands leads to faster reaction rates. However, it also induces side-effects that are not detectable in the solution-phase version of this circuitry. Appropriate design of the system, including protection and asymmetry between input and fuel strands, leads to a reproducible behaviour, at least one order of magnitude faster than the one observed under bulk conditions.Logic circuits based on DNA strand displacement reactions have been shown to be versatile enough to compute the square root of four-bit numbers. The implementation of these circuits as a set of bulk reactions faces difficulties which include leaky reactions and intrinsically slow, diffusion-limited reaction rates. In this paper, we consider simple examples of these circuits when they are attached to platforms (DNA origamis). As expected, constraining distances between DNA strands leads to faster reaction rates. However, it also induces side-effects that are not detectable in the solution-phase version of this circuitry. Appropriate design of the system, including protection and asymmetry between input and fuel strands, leads to a reproducible behaviour, at least one order of magnitude faster than the one observed under bulk conditions. Electronic supplementary information (ESI) available. See DOI: 10.1039/C5NR02434J
Liu, Ying; Matthews, Kathleen S.; Bondos, Sarah E.
2008-01-01
During animal development, distinct tissues, organs, and appendages are specified through differential gene transcription by Hox transcription factors. However, the conserved Hox homeodomains bind DNA with high affinity yet low specificity. We have therefore explored the structure of the Drosophila melanogaster Hox protein Ultrabithorax and the impact of its nonhomeodomain regions on DNA binding properties. Computational and experimental approaches identified several conserved, intrinsically disordered regions outside the homeodomain of Ultrabithorax that impact DNA binding by the homeodomain. Full-length Ultrabithorax bound to target DNA 2.5-fold weaker than its isolated homeodomain. Using N-terminal and C-terminal deletion mutants, we demonstrate that the YPWM region and the disordered microexons (termed the I1 region) inhibit DNA binding ∼2-fold, whereas the disordered I2 region inhibits homeodomain-DNA interaction a further ∼40-fold. Binding is restored almost to homeodomain affinity by the mostly disordered N-terminal 174 amino acids (R region) in a length-dependent manner. Both the I2 and R regions contain portions of the activation domain, functionally linking DNA binding and transcription regulation. Given that (i) the I1 region and a portion of the R region alter homeodomain-DNA binding as a function of pH and (ii) an internal deletion within I1 increases Ultrabithorax-DNA affinity, I1 must directly impact homeodomain-DNA interaction energetics. However, I2 appears to indirectly affect DNA binding in a manner countered by the N terminus. The amino acid sequences of I2 and much of the I1 and R regions vary significantly among Ultrabithorax orthologues, potentially diversifying Hox-DNA interactions. PMID:18508761
Benesova, L; Belsanova, B; Suchanek, S; Kopeckova, M; Minarikova, P; Lipska, L; Levy, M; Visokai, V; Zavoral, M; Minarik, M
2013-02-15
Prognosis of solid cancers is generally more favorable if the disease is treated early and efficiently. A key to long cancer survival is in radical surgical therapy directed at the primary tumor followed by early detection of possible progression, with swift application of subsequent therapeutic intervention reducing the risk of disease generalization. The conventional follow-up care is based on regular observation of tumor markers in combination with computed tomography/endoscopic ultrasound/magnetic resonance/positron emission tomography imaging to monitor potential tumor progression. A recent development in methodologies allowing screening for a presence of cell-free DNA (cfDNA) brings a new viable tool in early detection and management of major cancers. It is believed that cfDNA is released from tumors primarily due to necrotization, whereas the origin of nontumorous cfDNA is mostly apoptotic. The process of cfDNA detection starts with proper collection and treatment of blood and isolation and storage of blood plasma. The next important steps include cfDNA extraction from plasma and its detection and/or quantification. To distinguish tumor cfDNA from nontumorous cfDNA, specific somatic DNA mutations, previously localized in the primary tumor tissue, are identified in the extracted cfDNA. Apart from conventional mutation detection approaches, several dedicated techniques have been presented to detect low levels of cfDNA in an excess of nontumorous (nonmutated) DNA, including real-time polymerase chain reaction (PCR), "BEAMing" (beads, emulsion, amplification, and magnetics), and denaturing capillary electrophoresis. Techniques to facilitate the mutant detection, such as mutant-enriched PCR and COLD-PCR (coamplification at lower denaturation temperature PCR), are also applicable. Finally, a number of newly developed miniaturized approaches, such as single-molecule sequencing, are promising for the future. Copyright © 2012 Elsevier Inc. All rights reserved.
Dynamics of bacteriophage genome ejection in vitro and in vivo
NASA Astrophysics Data System (ADS)
Panja, Debabrata; Molineux, Ian J.
2010-12-01
Bacteriophages, phages for short, are viruses of bacteria. The majority of phages contain a double-stranded DNA genome packaged in a capsid at a density of ~500 mg ml-1. This high density requires substantial compression of the normal B-form helix, leading to the conjecture that DNA in mature phage virions is under significant pressure, and that pressure is used to eject the DNA during infection. A large number of theoretical, computer simulation and in vitro experimental studies surrounding this conjecture have revealed many—though often isolated and/or contradictory—aspects of packaged DNA. This prompts us to present a unified view of the statistical physics and thermodynamics of DNA packaged in phage capsids. We argue that the DNA in a mature phage is in a (meta)stable state, wherein electrostatic self-repulsion is balanced by curvature stress due to confinement in the capsid. We show that in addition to the osmotic pressure associated with the packaged DNA and its counterions, there are four different pressures within the capsid: pressure on the DNA, hydrostatic pressure, the pressure experienced by the capsid and the pressure associated with the chemical potential of DNA ejection. Significantly, we analyze the mechanism of force transmission in the packaged DNA and demonstrate that the pressure on DNA is not important for ejection. We derive equations showing a strong hydrostatic pressure difference across the capsid shell. We propose that when a phage is triggered to eject by interaction with its receptor in vitro, the (thermodynamic) incentive of water molecules to enter the phage capsid flushes the DNA out of the capsid. In vivo, the difference between the osmotic pressures in the bacterial cell cytoplasm and the culture medium similarly results in a water flow that drags the DNA out of the capsid and into the bacterial cell.
A new method for enhancer prediction based on deep belief network.
Bu, Hongda; Gan, Yanglan; Wang, Yang; Zhou, Shuigeng; Guan, Jihong
2017-10-16
Studies have shown that enhancers are significant regulatory elements to play crucial roles in gene expression regulation. Since enhancers are unrelated to the orientation and distance to their target genes, it is a challenging mission for scholars and researchers to accurately predicting distal enhancers. In the past years, with the high-throughout ChiP-seq technologies development, several computational techniques emerge to predict enhancers using epigenetic or genomic features. Nevertheless, the inconsistency of computational models across different cell-lines and the unsatisfactory prediction performance call for further research in this area. Here, we propose a new Deep Belief Network (DBN) based computational method for enhancer prediction, which is called EnhancerDBN. This method combines diverse features, composed of DNA sequence compositional features, DNA methylation and histone modifications. Our computational results indicate that 1) EnhancerDBN outperforms 13 existing methods in prediction, and 2) GC content and DNA methylation can serve as relevant features for enhancer prediction. Deep learning is effective in boosting the performance of enhancer prediction.
DNA-based construction at the nanoscale: emerging trends and applications
NASA Astrophysics Data System (ADS)
Lourdu Xavier, P.; Chandrasekaran, Arun Richard
2018-02-01
The field of structural DNA nanotechnology has evolved remarkably—from the creation of artificial immobile junctions to the recent DNA-protein hybrid nanoscale shapes—in a span of about 35 years. It is now possible to create complex DNA-based nanoscale shapes and large hierarchical assemblies with greater stability and predictability, thanks to the development of computational tools and advances in experimental techniques. Although it started with the original goal of DNA-assisted structure determination of difficult-to-crystallize molecules, DNA nanotechnology has found its applications in a myriad of fields. In this review, we cover some of the basic and emerging assembly principles: hybridization, base stacking/shape complementarity, and protein-mediated formation of nanoscale structures. We also review various applications of DNA nanostructures, with special emphasis on some of the biophysical applications that have been reported in recent years. In the outlook, we discuss further improvements in the assembly of such structures, and explore possible future applications involving super-resolved fluorescence, single-particle cryo-electron (cryo-EM) and x-ray free electron laser (XFEL) nanoscopic imaging techniques, and in creating new synergistic designer materials.
DNA-based construction at the nanoscale: emerging trends and applications.
Xavier, P Lourdu; Chandrasekaran, Arun Richard
2018-02-09
The field of structural DNA nanotechnology has evolved remarkably-from the creation of artificial immobile junctions to the recent DNA-protein hybrid nanoscale shapes-in a span of about 35 years. It is now possible to create complex DNA-based nanoscale shapes and large hierarchical assemblies with greater stability and predictability, thanks to the development of computational tools and advances in experimental techniques. Although it started with the original goal of DNA-assisted structure determination of difficult-to-crystallize molecules, DNA nanotechnology has found its applications in a myriad of fields. In this review, we cover some of the basic and emerging assembly principles: hybridization, base stacking/shape complementarity, and protein-mediated formation of nanoscale structures. We also review various applications of DNA nanostructures, with special emphasis on some of the biophysical applications that have been reported in recent years. In the outlook, we discuss further improvements in the assembly of such structures, and explore possible future applications involving super-resolved fluorescence, single-particle cryo-electron (cryo-EM) and x-ray free electron laser (XFEL) nanoscopic imaging techniques, and in creating new synergistic designer materials.
Jakubec, David; Laskowski, Roman A.; Vondrasek, Jiri
2016-01-01
Decades of intensive experimental studies of the recognition of DNA sequences by proteins have provided us with a view of a diverse and complicated world in which few to no features are shared between individual DNA-binding protein families. The originally conceived direct readout of DNA residue sequences by amino acid side chains offers very limited capacity for sequence recognition, while the effects of the dynamic properties of the interacting partners remain difficult to quantify and almost impossible to generalise. In this work we investigated the energetic characteristics of all DNA residue—amino acid side chain combinations in the conformations found at the interaction interface in a very large set of protein—DNA complexes by the means of empirical potential-based calculations. General specificity-defining criteria were derived and utilised to look beyond the binding motifs considered in previous studies. Linking energetic favourability to the observed geometrical preferences, our approach reveals several additional amino acid motifs which can distinguish between individual DNA bases. Our results remained valid in environments with various dielectric properties. PMID:27384774
Competition between B-Z and B-L transitions in a single DNA molecule: Computational studies
NASA Astrophysics Data System (ADS)
Kwon, Ah-Young; Nam, Gi-Moon; Johner, Albert; Kim, Seyong; Hong, Seok-Cheol; Lee, Nam-Kyung
2016-02-01
Under negative torsion, DNA adopts left-handed helical forms, such as Z-DNA and L-DNA. Using the random copolymer model developed for a wormlike chain, we represent a single DNA molecule with structural heterogeneity as a helical chain consisting of monomers which can be characterized by different helical senses and pitches. By Monte Carlo simulation, where we take into account bending and twist fluctuations explicitly, we study sequence dependence of B-Z transitions under torsional stress and tension focusing on the interaction with B-L transitions. We consider core sequences, (GC) n repeats or (TG) n repeats, which can interconvert between the right-handed B form and the left-handed Z form, imbedded in a random sequence, which can convert to left-handed L form with different (tension dependent) helical pitch. We show that Z-DNA formation from the (GC) n sequence is always supported by unwinding torsional stress but Z-DNA formation from the (TG) n sequence, which are more costly to convert but numerous, can be strongly influenced by the quenched disorder in the surrounding random sequence.
Diagnosis of Lung Cancer by Fractal Analysis of Damaged DNA
Namazi, Hamidreza; Kiminezhadmalaie, Mona
2015-01-01
Cancer starts when cells in a part of the body start to grow out of control. In fact cells become cancer cells because of DNA damage. A DNA walk of a genome represents how the frequency of each nucleotide of a pairing nucleotide couple changes locally. In this research in order to study the cancer genes, DNA walk plots of genomes of patients with lung cancer were generated using a program written in MATLAB language. The data so obtained was checked for fractal property by computing the fractal dimension using a program written in MATLAB. Also, the correlation of damaged DNA was studied using the Hurst exponent measure. We have found that the damaged DNA sequences are exhibiting higher degree of fractality and less correlation compared with normal DNA sequences. So we confirmed this method can be used for early detection of lung cancer. The method introduced in this research not only is useful for diagnosis of lung cancer but also can be applied for detection and growth analysis of different types of cancers. PMID:26539245
Fabrication of DNA/Hydroxyapatite nanocomposites by simulated body fluid for gene delivery
NASA Astrophysics Data System (ADS)
Takeshita, Takayuki; Okamoto, Masami
2015-05-01
The hydroxyapatite (HA) formation on the surface of DNA molecules in simulated body fluid (SBF) was examined. The osteoconductivity is estimated using SBF having ion concentrations approximately equal to those of human blood plasma. After immersion for 4 weeks in SBF at 36.5 °C, the HA crystallites possessing 1-14 micrometer in diameter grew on the surface of DNA molecules. The leaf flake-like and spherical shapes morphologies were observed through scanning electron microscopy analysis. Original peaks of both of DNA and HA were characterized by fourier transform infrared spectroscopy. The Ca/P ratio (1.1-1.5) in HA was estimated by energy dispersive X-ray analysis. After biomineralization, the calculated weight ratio of DNA/HA was 18/82 by thermogravimetry/differential thermal analysis. The molecular orbital computer simulation has been used to probe the interaction of DNA with two charge-balancing ions, CaOH+ and C a H2P O4+ . The adsorption enthalpy of the two ions on DNA having negative value was the evidence for the interface in mineralization of HA in SBF.
Artificial Intelligence, DNA Mimicry, and Human Health.
Stefano, George B; Kream, Richard M
2017-08-14
The molecular evolution of genomic DNA across diverse plant and animal phyla involved dynamic registrations of sequence modifications to maintain existential homeostasis to increasingly complex patterns of environmental stressors. As an essential corollary, driver effects of positive evolutionary pressure are hypothesized to effect concerted modifications of genomic DNA sequences to meet expanded platforms of regulatory controls for successful implementation of advanced physiological requirements. It is also clearly apparent that preservation of updated registries of advantageous modifications of genomic DNA sequences requires coordinate expansion of convergent cellular proofreading/error correction mechanisms that are encoded by reciprocally modified genomic DNA. Computational expansion of operationally defined DNA memory extends to coordinate modification of coding and previously under-emphasized noncoding regions that now appear to represent essential reservoirs of untapped genetic information amenable to evolutionary driven recruitment into the realm of biologically active domains. Additionally, expansion of DNA memory potential via chemical modification and activation of noncoding sequences is targeted to vertical augmentation and integration of an expanded cadre of transcriptional and epigenetic regulatory factors affecting linear coding of protein amino acid sequences within open reading frames.
Although recent technological advances in DNA sequencing and computational biology now allow scientists to compare entire microbial genomes, comparisons of closely related bacterial species and individual isolates by whole-genome sequencing approaches remains prohibitively expens...
Computer Center: 2 HyperCard Stacks for Biology.
ERIC Educational Resources Information Center
Duhrkopf, Richard, Ed.
1989-01-01
Two Hypercard stacks are reviewed including "Amino Acids," created to help students associate amino acid names with their structures, and "DNA Teacher," a tutorial on the structure and function of DNA. Availability, functions, hardware requirements, and general comments on these stacks are provided. (CW)
DeYonker, Nathan J; Webster, Charles Edwin
2015-07-14
Tyrosyl-DNA phosphodiesterase I (Tdp1) is a DNA repair enzyme conserved across eukaryotes that catalyzes the hydrolysis of the phosphodiester bond between the tyrosine residue of topoisomerase I and the 3'-phosphate of DNA. Atomic level details of the mechanism of Tdp1 are proposed and analyzed using a fully quantum mechanical, geometrically constrained model. The structural basis for the computational model is the vanadate-inhibited crystal structure of human Tdp1 (hTdp1, Protein Data Bank entry 1RFF ). Density functional theory computations are used to acquire thermodynamic and kinetic data along the catalytic pathway, including the phosphoryl transfer and subsequent hydrolysis. Located transition states and intermediates along the reaction coordinate suggest an associative phosphoryl transfer mechanism with five-coordinate phosphorane intermediates. Similar to both theoretical and experimental results for phospholipase D, the proposed mechanism for hTdp1 also includes the thermodynamically favorable possibility of a four-coordinate phosphohistidine "dead-end" product.
A DNA network as an information processing system.
Santini, Cristina Costa; Bath, Jonathan; Turberfield, Andrew J; Tyrrell, Andy M
2012-01-01
Biomolecular systems that can process information are sought for computational applications, because of their potential for parallelism and miniaturization and because their biocompatibility also makes them suitable for future biomedical applications. DNA has been used to design machines, motors, finite automata, logic gates, reaction networks and logic programs, amongst many other structures and dynamic behaviours. Here we design and program a synthetic DNA network to implement computational paradigms abstracted from cellular regulatory networks. These show information processing properties that are desirable in artificial, engineered molecular systems, including robustness of the output in relation to different sources of variation. We show the results of numerical simulations of the dynamic behaviour of the network and preliminary experimental analysis of its main components.
2013-09-30
profiles of right whales Eubalaena glacialis from the North Atlantic Right Whale Consortium; 2) DNA profiles of sperm whales Physeter macrocephalus...of other cetacean databases in Wildbook format (e.g., North Atlantic right whales, sperm whales and Hector’s dolphins); 8) Supported continuing...of sperm whales, using samples collected during the 5-year Voyage of the Odyssey; and 3) DNA profiles of Hector’s dolphins from Cloudy Bay, New
Cloud-based MOTIFSIM: Detecting Similarity in Large DNA Motif Data Sets.
Tran, Ngoc Tam L; Huang, Chun-Hsi
2017-05-01
We developed the cloud-based MOTIFSIM on Amazon Web Services (AWS) cloud. The tool is an extended version from our web-based tool version 2.0, which was developed based on a novel algorithm for detecting similarity in multiple DNA motif data sets. This cloud-based version further allows researchers to exploit the computing resources available from AWS to detect similarity in multiple large-scale DNA motif data sets resulting from the next-generation sequencing technology. The tool is highly scalable with expandable AWS.
Application of permanents of square matrices for DNA identification in multiple-fatality cases
2013-01-01
Background DNA profiling is essential for individual identification. In forensic medicine, the likelihood ratio (LR) is commonly used to identify individuals. The LR is calculated by comparing two hypotheses for the sample DNA: that the sample DNA is identical or related to a reference DNA, and that it is randomly sampled from a population. For multiple-fatality cases, however, identification should be considered as an assignment problem, and a particular sample and reference pair should therefore be compared with other possibilities conditional on the entire dataset. Results We developed a new method to compute the probability via permanents of square matrices of nonnegative entries. As the exact permanent is known as a #P-complete problem, we applied the Huber–Law algorithm to approximate the permanents. We performed a computer simulation to evaluate the performance of our method via receiver operating characteristic curve analysis compared with LR under the assumption of a closed incident. Differences between the two methods were well demonstrated when references provided neither obligate alleles nor impossible alleles. The new method exhibited higher sensitivity (0.188 vs. 0.055) at a threshold value of 0.999, at which specificity was 1, and it exhibited higher area under a receiver operating characteristic curve (0.990 vs. 0.959, P = 9.6E-15). Conclusions Our method therefore offers a solution for a computationally intensive assignment problem and may be a viable alternative to LR-based identification for closed-incident multiple-fatality cases. PMID:23962363
ProbeDesigner: for the design of probesets for branched DNA (bDNA) signal amplification assays.
Bushnell, S; Budde, J; Catino, T; Cole, J; Derti, A; Kelso, R; Collins, M L; Molino, G; Sheridan, P; Monahan, J; Urdea, M
1999-05-01
The sensitivity and specificity of branched DNA (bDNA) assays are derived in part through the judicious design of the capture and label extender probes. To minimize non-specific hybridization (NSH) events, which elevate assay background, candidate probes must be computer screened for complementarity with generic sequences present in the assay. We present a software application which allows for rapid and flexible design of bDNA probesets for novel targets. It includes an algorithm for estimating the magnitude of NSH contribution to background, a mechanism for removing probes with elevated contributions, a methodology for the simultaneous design of probesets for multiple targets, and a graphical user interface which guides the user through the design steps. The program is available as a commercial package through the Pharmaceutical Drug Discovery program at Chiron Diagnostics.
Genetic Constructor: An Online DNA Design Platform.
Bates, Maxwell; Lachoff, Joe; Meech, Duncan; Zulkower, Valentin; Moisy, Anaïs; Luo, Yisha; Tekotte, Hille; Franziska Scheitz, Cornelia Johanna; Khilari, Rupal; Mazzoldi, Florencio; Chandran, Deepak; Groban, Eli
2017-12-15
Genetic Constructor is a cloud Computer Aided Design (CAD) application developed to support synthetic biologists from design intent through DNA fabrication and experiment iteration. The platform allows users to design, manage, and navigate complex DNA constructs and libraries, using a new visual language that focuses on functional parts abstracted from sequence. Features like combinatorial libraries and automated primer design allow the user to separate design from construction by focusing on functional intent, and design constraints aid iterative refinement of designs. A plugin architecture enables contributions from scientists and coders to leverage existing powerful software and connect to DNA foundries. The software is easily accessible and platform agnostic, free for academics, and available in an open-source community edition. Genetic Constructor seeks to democratize DNA design, manufacture, and access to tools and services from the synthetic biology community.
Interlocked DNA nanostructures controlled by a reversible logic circuit.
Li, Tao; Lohmann, Finn; Famulok, Michael
2014-09-17
DNA nanostructures constitute attractive devices for logic computing and nanomechanics. An emerging interest is to integrate these two fields and devise intelligent DNA nanorobots. Here we report a reversible logic circuit built on the programmable assembly of a double-stranded (ds) DNA [3]pseudocatenane that serves as a rigid scaffold to position two separate branched-out head-motifs, a bimolecular i-motif and a G-quadruplex. The G-quadruplex only forms when preceded by the assembly of the i-motif. The formation of the latter, in turn, requires acidic pH and unhindered mobility of the head-motif containing dsDNA nanorings with respect to the central ring to which they are interlocked, triggered by release oligodeoxynucleotides. We employ these features to convert the structural changes into Boolean operations with fluorescence labelling. The nanostructure behaves as a reversible logic circuit consisting of tandem YES and AND gates. Such reversible logic circuits integrated into functional nanodevices may guide future intelligent DNA nanorobots to manipulate cascade reactions in biological systems.
Interlocked DNA nanostructures controlled by a reversible logic circuit
Li, Tao; Lohmann, Finn; Famulok, Michael
2014-01-01
DNA nanostructures constitute attractive devices for logic computing and nanomechanics. An emerging interest is to integrate these two fields and devise intelligent DNA nanorobots. Here we report a reversible logic circuit built on the programmable assembly of a double-stranded (ds) DNA [3]pseudocatenane that serves as a rigid scaffold to position two separate branched-out head-motifs, a bimolecular i-motif and a G-quadruplex. The G-quadruplex only forms when preceded by the assembly of the i-motif. The formation of the latter, in turn, requires acidic pH and unhindered mobility of the head-motif containing dsDNA nanorings with respect to the central ring to which they are interlocked, triggered by release oligodeoxynucleotides. We employ these features to convert the structural changes into Boolean operations with fluorescence labelling. The nanostructure behaves as a reversible logic circuit consisting of tandem YES and AND gates. Such reversible logic circuits integrated into functional nanodevices may guide future intelligent DNA nanorobots to manipulate cascade reactions in biological systems. PMID:25229207
RDNAnalyzer: A tool for DNA secondary structure prediction and sequence analysis
Afzal, Muhammad; Shahid, Ahmad Ali; Shehzadi, Abida; Nadeem, Shahid; Husnain, Tayyab
2012-01-01
RDNAnalyzer is an innovative computer based tool designed for DNA secondary structure prediction and sequence analysis. It can randomly generate the DNA sequence or user can upload the sequences of their own interest in RAW format. It uses and extends the Nussinov dynamic programming algorithm and has various application for the sequence analysis. It predicts the DNA secondary structure and base pairings. It also provides the tools for routinely performed sequence analysis by the biological scientists such as DNA replication, reverse compliment generation, transcription, translation, sequence specific information as total number of nucleotide bases, ATGC base contents along with their respective percentages and sequence cleaner. RDNAnalyzer is a unique tool developed in Microsoft Visual Studio 2008 using Microsoft Visual C# and Windows Presentation Foundation and provides user friendly environment for sequence analysis. It is freely available. Availability http://www.cemb.edu.pk/sw.html Abbreviations RDNAnalyzer - Random DNA Analyser, GUI - Graphical user interface, XAML - Extensible Application Markup Language. PMID:23055611
Analysis of mutational spectra by denaturant capillary electrophoresis
Ekstrøm, Per O.; Khrapko, Konstantin; Li-Sucholeiki, Xiao-Cheng; Hunter, Ian W.; Thilly, William G.
2009-01-01
Numbers and kinds of point mutant within DNA from cells, tissues and human population may be discovered for nearly any 75–250bp DNA sequence. High fidelity DNA amplification incorporating a thermally stable DNA “clamp” is followed by separation by denaturing capillary electrophoresis (DCE). DCE allows for peak collection and verification sequencing. DCE in a mode of cycling temperature, e.g.+/− 5°C, CyDCE, permits high resolution of mutant sequences using computer defined analytes without preliminary optimization experiments. DNA sequencers have been modified to permit higher throughput CyDCE and a massively parallel,~25,000 capillary system, has been designed for pangenomic scans in large human populations. DCE has been used to define quantitative point mutational spectra for study a wide variety of genetic phenomena: errors of DNA polymerases, mutations induced in human cells by chemicals and irradiation, testing of human gene-common disease associations and the discovery of origins of point mutations in human development and carcinogenesis. PMID:18600220
Imaging and sizing of single DNA molecules on a mobile phone.
Wei, Qingshan; Luo, Wei; Chiang, Samuel; Kappel, Tara; Mejia, Crystal; Tseng, Derek; Chan, Raymond Yan Lok; Yan, Eddie; Qi, Hangfei; Shabbir, Faizan; Ozkan, Haydar; Feng, Steve; Ozcan, Aydogan
2014-12-23
DNA imaging techniques using optical microscopy have found numerous applications in biology, chemistry and physics and are based on relatively expensive, bulky and complicated set-ups that limit their use to advanced laboratory settings. Here we demonstrate imaging and length quantification of single molecule DNA strands using a compact, lightweight and cost-effective fluorescence microscope installed on a mobile phone. In addition to an optomechanical attachment that creates a high contrast dark-field imaging setup using an external lens, thin-film interference filters, a miniature dovetail stage and a laser-diode for oblique-angle excitation, we also created a computational framework and a mobile phone application connected to a server back-end for measurement of the lengths of individual DNA molecules that are labeled and stretched using disposable chips. Using this mobile phone platform, we imaged single DNA molecules of various lengths to demonstrate a sizing accuracy of <1 kilobase-pairs (kbp) for 10 kbp and longer DNA samples imaged over a field-of-view of ∼2 mm2.
Pauthenier, Cyrille; Faulon, Jean-Loup
2014-07-01
PrecisePrimer is a web-based primer design software made to assist experimentalists in any repetitive primer design task such as preparing, cloning and shuffling DNA libraries. Unlike other popular primer design tools, it is conceived to generate primer libraries with popular PCR polymerase buffers proposed as pre-set options. PrecisePrimer is also meant to design primers in batches, such as for DNA libraries creation of DNA shuffling experiments and to have the simplest interface possible. It integrates the most up-to-date melting temperature algorithms validated with experimental data, and cross validated with other computational tools. We generated a library of primers for the extraction and cloning of 61 genes from yeast DNA genomic extract using default parameters. All primer pairs efficiently amplified their target without any optimization of the PCR conditions. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Alkylpurine glycosylase D employs DNA sculpting as a strategy to extrude and excise damaged bases.
Kossmann, Bradley; Ivanov, Ivaylo
2014-07-01
Alkylpurine glycosylase D (AlkD) exhibits a unique base excision strategy. Instead of interacting directly with the lesion, the enzyme engages the non-lesion DNA strand. AlkD induces flipping of the alkylated and opposing base accompanied by DNA stack compression. Since this strategy leaves the alkylated base solvent exposed, the means to achieve enzymatic cleavage had remained unclear. We determined a minimum energy path for flipping out a 3-methyl adenine by AlkD and computed a potential of mean force along this path to delineate the energetics of base extrusion. We show that AlkD acts as a scaffold to stabilize three distinct DNA conformations, including the final extruded state. These states are almost equivalent in free energy and separated by low barriers. Thus, AlkD acts by sculpting the global DNA conformation to achieve lesion expulsion from DNA. N-glycosidic bond scission is then facilitated by a backbone phosphate group proximal to the alkylated base.
High-speed DNA-based rolling motors powered by RNase H
Yehl, Kevin; Mugler, Andrew; Vivek, Skanda; Liu, Yang; Zhang, Yun; Fan, Mengzhen; Weeks, Eric R.
2016-01-01
DNA-based machines that walk by converting chemical energy into controlled motion could be of use in applications such as next generation sensors, drug delivery platforms, and biological computing. Despite their exquisite programmability, DNA-based walkers are, however, challenging to work with due to their low fidelity and slow rates (~1 nm/min). Here, we report DNA-based machines that roll rather than walk, and consequently have a maximum speed and processivity that is three-orders of magnitude greater than conventional DNA motors. The motors are made from DNA-coated spherical particles that hybridise to a surface modified with complementary RNA; motion is achieved through the addition of RNase H, which selectively hydrolyses hybridised RNA. Spherical motors move in a self-avoiding manner, whereas anisotropic particles, such as dimerised particles or rod-shaped particles travel linearly without a track or external force. Finally, we demonstrate detection of single nucleotide polymorphism by measuring particle displacement using a smartphone camera. PMID:26619152
Matsumoto, Atsushi; Tobias, Irwin; Olson, Wilma K
2005-01-01
Fine structural and energetic details embedded in the DNA base sequence, such as intrinsic curvature, are important to the packaging and processing of the genetic material. Here we investigate the internal dynamics of a 200 bp closed circular molecule with natural curvature using a newly developed normal-mode treatment of DNA in terms of neighboring base-pair "step" parameters. The intrinsic curvature of the DNA is described by a 10 bp repeating pattern of bending distortions at successive base-pair steps. We vary the degree of intrinsic curvature and the superhelical stress on the molecule and consider the normal-mode fluctuations of both the circle and the stable figure-8 configuration under conditions where the energies of the two states are similar. To extract the properties due solely to curvature, we ignore other important features of the double helix, such as the extensibility of the chain, the anisotropy of local bending, and the coupling of step parameters. We compare the computed normal modes of the curved DNA model with the corresponding dynamical features of a covalently closed duplex of the same chain length constructed from naturally straight DNA and with the theoretically predicted dynamical properties of a naturally circular, inextensible elastic rod, i.e., an O-ring. The cyclic molecules with intrinsic curvature are found to be more deformable under superhelical stress than rings formed from naturally straight DNA. As superhelical stress is accumulated in the DNA, the frequency, i.e., energy, of the dominant bending mode decreases in value, and if the imposed stress is sufficiently large, a global configurational rearrangement of the circle to the figure-8 form takes place. We combine energy minimization with normal-mode calculations of the two states to decipher the configurational pathway between the two states. We also describe and make use of a general analytical treatment of the thermal fluctuations of an elastic rod to characterize the motions of the minicircle as a whole from knowledge of the full set of normal modes. The remarkable agreement between computed and theoretically predicted values of the average deviation and dispersion of the writhe of the circular configuration adds to the reliability in the computational approach. Application of the new formalism to the computed modes of the figure-8 provides insights into macromolecular motions which are beyond the scope of current theoretical treatments.
Nanopore Logic Operation with DNA to RNA Transcription in a Droplet System.
Ohara, Masayuki; Takinoue, Masahiro; Kawano, Ryuji
2017-07-21
This paper describes an AND logic operation with amplification and transcription from DNA to RNA, using T7 RNA polymerase. All four operations, (0 0) to (1 1), with an enzyme reaction can be performed simultaneously, using four-droplet devices that are directly connected to a patch-clamp amplifier. The output RNA molecule is detected using a biological nanopore with single-molecule translocation. Channel current recordings can be obtained using the enzyme solution. The integration of DNA logic gates into electrochemical devices is necessary to obtain output information in a human-recognizable form. Our method will be useful for rapid and confined DNA computing applications, including the development of programmable diagnostic devices.
Would Dissociative Recombination of DNA+ be a Possible Pathway of DNA Damage?
NASA Astrophysics Data System (ADS)
Kwon, H. C.; Chen, Z. P.; Strom, R. A.; Andrianarijaona, V. M.
2015-05-01
It is known that dissociative recombination (DR) is one of the very efficient processes of destruction of molecular cations into neutral particles. During the past few years, the focus of DR has been expanded from small inorganic molecules to macromolecular cation. We are probing the possibility of the DR of DNA+ after ionization of DNA, for example due to ionizing radiation. Therefore we are investigating the existence of autoionization states within nucleotide bases (Guanine, Adenine, Cytosine, and Thymine). Our results from computational analysis using the modern electronic structure program ORCA will be presented. Authors wish to give special thanks to Pacific Union College Student Senate for their financial support.
Self-Assembling Molecular Logic Gates Based on DNA Crossover Tiles.
Campbell, Eleanor A; Peterson, Evan; Kolpashchikov, Dmitry M
2017-07-05
DNA-based computational hardware has attracted ever-growing attention due to its potential to be useful in the analysis of complex mixtures of biological markers. Here we report the design of self-assembling logic gates that recognize DNA inputs and assemble into crossover tiles when the output signal is high; the crossover structures disassemble to form separate DNA stands when the output is low. The output signal can be conveniently detected by fluorescence using a molecular beacon probe as a reporter. AND, NOT, and OR logic gates were designed. We demonstrate that the gates can connect to each other to produce other logic functions. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
NASA Astrophysics Data System (ADS)
Enayatifar, Rasul; Sadaei, Hossein Javedani; Abdullah, Abdul Hanan; Lee, Malrey; Isnin, Ismail Fauzi
2015-08-01
Currently, there are many studies have conducted on developing security of the digital image in order to protect such data while they are sending on the internet. This work aims to propose a new approach based on a hybrid model of the Tinkerbell chaotic map, deoxyribonucleic acid (DNA) and cellular automata (CA). DNA rules, DNA sequence XOR operator and CA rules are used simultaneously to encrypt the plain-image pixels. To determine rule number in DNA sequence and also CA, a 2-dimension Tinkerbell chaotic map is employed. Experimental results and computer simulations, both confirm that the proposed scheme not only demonstrates outstanding encryption, but also resists various typical attacks.
DNA packaging and ejection forces in bacteriophage
Kindt, James; Tzlil, Shelly; Ben-Shaul, Avinoam; Gelbart, William M.
2001-01-01
We calculate the forces required to package (or, equivalently, acting to eject) DNA into (from) a bacteriophage capsid, as a function of the loaded (ejected) length, under conditions for which the DNA is either self-repelling or self-attracting. Through computer simulation and analytical theory, we find the loading force to increase more than 10-fold (to tens of piconewtons) during the final third of the loading process; correspondingly, the internal pressure drops 10-fold to a few atmospheres (matching the osmotic pressure in the cell) upon ejection of just a small fraction of the phage genome. We also determine an evolution of the arrangement of packaged DNA from toroidal to spool-like structures. PMID:11707588
NASA Astrophysics Data System (ADS)
Meyer, Sam; Everaers, Ralf
2015-02-01
The histone-DNA interaction in the nucleosome is a fundamental mechanism of genomic compaction and regulation, which remains largely unknown despite increasing structural knowledge of the complex. In this paper, we propose a framework for the extraction of a nanoscale histone-DNA force-field from a collection of high-resolution structures, which may be adapted to a larger class of protein-DNA complexes. We applied the procedure to a large crystallographic database extended by snapshots from molecular dynamics simulations. The comparison of the structural models first shows that, at histone-DNA contact sites, the DNA base-pairs are shifted outwards locally, consistent with locally repulsive forces exerted by the histones. The second step shows that the various force profiles of the structures under analysis derive locally from a unique, sequence-independent, quadratic repulsive force-field, while the sequence preferences are entirely due to internal DNA mechanics. We have thus obtained the first knowledge-derived nanoscale interaction potential for histone-DNA in the nucleosome. The conformations obtained by relaxation of nucleosomal DNA with high-affinity sequences in this potential accurately reproduce the experimental values of binding preferences. Finally we address the more generic binding mechanisms relevant to the 80% genomic sequences incorporated in nucleosomes, by computing the conformation of nucleosomal DNA with sequence-averaged properties. This conformation differs from those found in crystals, and the analysis suggests that repulsive histone forces are related to local stretch tension in nucleosomal DNA, mostly between adjacent contact points. This tension could play a role in the stability of the complex.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Vaithiyalingam, Sivaraja; Warren, Eric M.; Eichman, Brandt F.
2010-10-19
DNA replication requires priming of DNA templates by enzymes known as primases. Although DNA primase structures are available from archaea and bacteria, the mechanism of DNA priming in higher eukaryotes remains poorly understood in large part due to the absence of the structure of the unique, highly conserved C-terminal regulatory domain of the large subunit (p58C). Here, we present the structure of this domain determined to 1.7-{angstrom} resolution by X-ray crystallography. The p58C structure reveals a novel arrangement of an evolutionarily conserved 4Fe-4S cluster buried deeply within the protein core and is not similar to any known protein structure. Analysismore » of the binding of DNA to p58C by fluorescence anisotropy measurements revealed a strong preference for ss/dsDNA junction substrates. This approach was combined with site-directed mutagenesis to confirm that the binding of DNA occurs to a distinctively basic surface on p58C. A specific interaction of p58C with the C-terminal domain of the intermediate subunit of replication protein A (RPA32C) was identified and characterized by isothermal titration calorimetry and NMR. Restraints from NMR experiments were used to drive computational docking of the two domains and generate a model of the p58C-RPA32C complex. Together, our results explain functional defects in human DNA primase mutants and provide insights into primosome loading on RPA-coated ssDNA and regulation of primase activity.« less
Jalili, Seifollah; Karami, Leila
2012-03-01
The proline-rich homeodomain (PRH)-DNA complex consists of a protein with 60 residues and a 13-base-pair DNA. The PRH protein is a transcription factor that plays a key role in the regulation of gene expression. PRH is a significant member of the Q50 class of homeodomain proteins. The homeodomain section of PRH is essential for binding to DNA and mediates sequence-specific DNA binding. Three 20-ns molecular dynamics (MD) simulations (free protein, free DNA and protein-DNA complex) in explicit solvent water were performed to elucidate the intermolecular contacts in the PRH-DNA complex and the role of dynamics of water molecules forming water-mediated contacts. The simulation provides a detailed explanation of the trajectory of hydration water molecules. The simulations show that some water molecules in the protein-DNA interface exchange with bulk waters. The simulation identifies that most of the contacts consisted of direct interactions between the protein and DNA including specific and non-specific contacts, but several water-mediated polar contacts were also observed. The specific interaction between Gln50 and C18 and water-mediated hydrogen bond between Gln50 and T7 were found to be present during almost the entire time of the simulation. These results show good consistency with experimental and previous computational studies. Structural properties such as root-mean-square deviations (RMSD), root-mean-square fluctuations (RMSF) and secondary structure were also analyzed as a function of time. Analyses of the trajectories showed that the dynamic fluctuations of both the protein and the DNA were lowered by the complex formation.
Single molecule fluorescence burst detection of DNA fragments separated by capillary electrophoresis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Haab, B.B.; Mathies, R.A.
A method has been developed for detecting DNA separated by capillary gel electrophoresis (CGE) using single molecule photon burst counting. A confocal fluorescence microscope was used to observe the fluorescence bursts from single molecules of DNA multiply labeled with the thiazole orange derivative TO6 as they passed through the nearly 2-{mu}m diameter focused laser beam. Amplified photo-electron pulses from the photomultiplier are grouped into bins of 360-450 {mu}s in duration, and the resulting histogram is stored in a computer for analysis. Solutions of M13 DNA were first flowed through the capillary at various concentrations, and the resulting data were usedmore » to optimize the parameters for digital filtering using a low-pass Fourier filter, selecting a discriminator level for peak detection, and applying a peak-calling algorithm. The optimized single molecule counting method was then applied to an electrophoretic separation of M13 DNA and to a separation of pBR 322 DNA from pRL 277 DNA. Clusters of discreet fluorescence bursts were observed at the expected appearance time of each DNA band. The auto-correlation function of these data indicated transit times that were consistent with the observed electrophoretic velocity. These separations were easily detected when only 50-100 molecules of DNA per band traveled through the detection region. This new detection technology should lead to the routine analysis of DNA in capillary columns with an on-column sensitivity of nearly 100 DNA molecules/band or better. 45 refs., 10 figs.« less
An in silico DNA cloning experiment for the biochemistry laboratory.
Elkins, Kelly M
2011-01-01
This laboratory exercise introduces students to concepts in recombinant DNA technology while accommodating a major semester project in protein purification, structure, and function in a biochemistry laboratory for junior- and senior-level undergraduate students. It is also suitable for forensic science courses focused in DNA biology and advanced high school biology classes. Students begin by examining a plasmid map with the goal of identifying which restriction enzymes may be used to clone a piece of foreign DNA containing a gene of interest into the vector. From the National Center for Biotechnology Initiative website, students are instructed to retrieve a protein sequence and use Expasy's Reverse Translate program to reverse translate the protein to cDNA. Students then use Integrated DNA Technologies' OligoAnalyzer to predict the complementary DNA strand and obtain DNA recognition sequences for the desired restriction enzymes from New England Biolabs' website. Students add the appropriate DNA restriction sequences to the double-stranded foreign DNA for cloning into the plasmid and infecting Escherichia coli cells. Students are introduced to computational biology tools, molecular biology terminology and the process of DNA cloning in this valuable single session, in silico experiment. This project develops students' understanding of the cloning process as a whole and contrasts with other laboratory and internship experiences in which the students may be involved in only a piece of the cloning process/techniques. Students interested in pursuing postgraduate study and research or employment in an academic biochemistry or molecular biology laboratory or industry will benefit most from this experience. Copyright © 2010 Wiley Periodicals, Inc.
Probing of miniPEGγ-PNA-DNA Hybrid Duplex Stability with AFM Force Spectroscopy.
Dutta, Samrat; Armitage, Bruce A; Lyubchenko, Yuri L
2016-03-15
Peptide nucleic acids (PNA) are synthetic polymers, the neutral peptide backbone of which provides elevated stability to PNA-PNA and PNA-DNA hybrid duplexes. It was demonstrated that incorporation of diethylene glycol (miniPEG) at the γ position of the peptide backbone increased the thermal stability of the hybrid duplexes (Sahu, B. et al. J. Org. Chem. 2011, 76, 5614-5627). Here, we applied atomic force microscopy (AFM) based single molecule force spectroscopy and dynamic force spectroscopy (DFS) to test the strength and stability of the hybrid 10 bp duplex. This hybrid duplex consisted of miniPEGγ-PNA and DNA of the same length (γ(MP)PNA-DNA), which we compared to a DNA duplex with a homologous sequence. AFM force spectroscopy data obtained at the same conditions showed that the γ(MP)PNA-DNA hybrid is more stable than the DNA counterpart, 65 ± 15 pN vs 47 ± 15 pN, respectively. The DFS measurements performed in a range of pulling speeds analyzed in the framework of the Bell-Evans approach yielded a dissociation constant, koff ≈ 0.030 ± 0.01 s⁻¹ for γ(MP)PNA-DNA hybrid duplex vs 0.375 ± 0.18 s⁻¹ for the DNA-DNA duplex suggesting that the hybrid duplex is much more stable. Correlating the high affinity of γ(MP)PNA-DNA to slow dissociation kinetics is consistent with prior bulk characterization by surface plasmon resonance. Given the growing interest in γ(MP)PNA as well as other synthetic DNA analogues, the use of single molecule experiments along with computational analysis of force spectroscopy data will provide direct characterization of various modifications as well as higher order structures such as triplexes and quadruplexes.
NASA Astrophysics Data System (ADS)
Aminzadeh, Mohammad; Eslami, Abbas; Kia, Reza; Aleeshah, Roghayeh
2017-10-01
Diquaternarization of dipyrido-[2,3-a:2‧,3‧-c]-phenazine,(dppz) and its analogous dipyrido-[2,3-a:2‧,3‧-c]-dimethylphenazine,(dppx) using 1,3-dibromopropane afford new water-soluble derivatives of phenazine, propylene-bipyridyldiylium-phenazine (1) and propylene-bipyridyldiylium-dimethylphenazine (2). The compounds have been characterized by means of FT-IR, NMR, elemental analysis and conductometric measurements and their structure were determined by X-ray crystallography. The experimental studies on the compounds have been accompanied computationally by Density Functional Theory (DFT) calculations. The DNA binding properties of both compounds to calf thymus DNA (ctDNA) were investigated by UV-Vis absorption and emission methods. The expanded UV-Vis spectral data matrix was analyzed by multivariate curve resolution-alternating least squares (MCR-ALS) technique to obtain the concentration profile and pure spectra of all reaction species which existed in the interaction procedure. Multivariate curve resolution may help us to give a better understanding of the 1(Cl)2-ctDNA and 2(Cl)2-ctDNA interaction mechanism. The results suggest that both compounds bind tightly to DNA through intercalation mechanism and the DNA binding affinity of 2 is slightly lower than that of 1 due to steric hindrance of the methyl group. Also, thermal denaturation studies reveal that these compounds show strong affinity for binding with calf thymus DNA. The thermodynamic parameters of the DNA binding process were obtained from the temperature dependence of the binding constants and the results showed that binding of both compounds to DNA is an enthalpically driven process that is in agreement with proposed DNA intercalation capability of these compounds.
Directed nucleation assembly of DNA tile complexes for barcode-patterned lattices
NASA Astrophysics Data System (ADS)
Yan, Hao; Labean, Thomas H.; Feng, Liping; Reif, John H.
2003-07-01
The programmed self-assembly of patterned aperiodic molecular structures is a major challenge in nanotechnology and has numerous potential applications for nanofabrication of complex structures and useful devices. Here we report the construction of an aperiodic patterned DNA lattice (barcode lattice) by a self-assembly process of directed nucleation of DNA tiles around a scaffold DNA strand. The input DNA scaffold strand, constructed by ligation of shorter synthetic oligonucleotides, provides layers of the DNA lattice with barcode patterning information represented by the presence or absence of DNA hairpin loops protruding out of the lattice plane. Self-assembly of multiple DNA tiles around the scaffold strand was shown to result in a patterned lattice containing barcode information of 01101. We have also demonstrated the reprogramming of the system to another patterning. An inverted barcode pattern of 10010 was achieved by modifying the scaffold strands and one of the strands composing each tile. A ribbon lattice, consisting of repetitions of the barcode pattern with expected periodicity, was also constructed by the addition of sticky ends. The patterning of both classes of lattices was clearly observable via atomic force microscopy. These results represent a step toward implementation of a visual readout system capable of converting information encoded on a 1D DNA strand into a 2D form readable by advanced microscopic techniques. A functioning visual output method would not only increase the readout speed of DNA-based computers, but may also find use in other sequence identification techniques such as mutation or allele mapping.
Directed nucleation assembly of DNA tile complexes for barcode-patterned lattices.
Yan, Hao; LaBean, Thomas H; Feng, Liping; Reif, John H
2003-07-08
The programmed self-assembly of patterned aperiodic molecular structures is a major challenge in nanotechnology and has numerous potential applications for nanofabrication of complex structures and useful devices. Here we report the construction of an aperiodic patterned DNA lattice (barcode lattice) by a self-assembly process of directed nucleation of DNA tiles around a scaffold DNA strand. The input DNA scaffold strand, constructed by ligation of shorter synthetic oligonucleotides, provides layers of the DNA lattice with barcode patterning information represented by the presence or absence of DNA hairpin loops protruding out of the lattice plane. Self-assembly of multiple DNA tiles around the scaffold strand was shown to result in a patterned lattice containing barcode information of 01101. We have also demonstrated the reprogramming of the system to another patterning. An inverted barcode pattern of 10010 was achieved by modifying the scaffold strands and one of the strands composing each tile. A ribbon lattice, consisting of repetitions of the barcode pattern with expected periodicity, was also constructed by the addition of sticky ends. The patterning of both classes of lattices was clearly observable via atomic force microscopy. These results represent a step toward implementation of a visual readout system capable of converting information encoded on a 1D DNA strand into a 2D form readable by advanced microscopic techniques. A functioning visual output method would not only increase the readout speed of DNA-based computers, but may also find use in other sequence identification techniques such as mutation or allele mapping.
Lin, Xiaodong; Deng, Jiankang; Lyu, Yanlong; Qian, Pengcheng; Li, Yunfei
2018-01-01
The integration of multiple DNA logic gates on a universal platform to implement advance logic functions is a critical challenge for DNA computing. Herein, a straightforward and powerful strategy in which a guanine-rich DNA sequence lighting up a silver nanocluster and fluorophore was developed to construct a library of logic gates on a simple DNA-templated silver nanoclusters (DNA-AgNCs) platform. This library included basic logic gates, YES, AND, OR, INHIBIT, and XOR, which were further integrated into complex logic circuits to implement diverse advanced arithmetic/non-arithmetic functions including half-adder, half-subtractor, multiplexer, and demultiplexer. Under UV irradiation, all the logic functions could be instantly visualized, confirming an excellent repeatability. The logic operations were entirely based on DNA hybridization in an enzyme-free and label-free condition, avoiding waste accumulation and reducing cost consumption. Interestingly, a DNA-AgNCs-based multiplexer was, for the first time, used as an intelligent biosensor to identify pathogenic genes, E. coli and S. aureus genes, with a high sensitivity. The investigation provides a prototype for the wireless integration of multiple devices on even the simplest single-strand DNA platform to perform diverse complex functions in a straightforward and cost-effective way. PMID:29675221
Lin, Ru-Wei; Yang, Chia-Ning; Ku, ShengYu; Ho, Cheng-Jung; Huang, Shih-Bo; Yang, Min-Chi; Chang, Hsin-Wen; Lin, Chun-Mao; Hwang, Jaulang; Chen, Yeh-Long; Tzeng, Cherg-Chyi; Wang, Chihuei
2014-01-01
CFS-1686 (chemical name (E)-N-(2-(diethylamino)ethyl)-4-(2-(2-(5-nitrofuran-2-yl)vinyl)quinolin-4-ylamino)benzamide) inhibits cell proliferation and triggers late apoptosis in prostate cancer cell lines. Comparing the effect of CFS-1686 on cell cycle progression with the topoisomerase 1 inhibitor camptothecin revealed that CFS-1686 and camptothecin reduced DNA synthesis in S-phase, resulting in cell cycle arrest at the intra-S phase and G1-S boundary, respectively. The DNA damage in CFS-1686 and camptothecin treated cells was evaluated by the level of ATM phosphorylation, γH2AX, and γH2AX foci, showing that camptothecin was more effective than CFS-1686. However, despite its lower DNA damage capacity, CFS-1686 demonstrated 4-fold higher inhibition of topoisomerase 1 than camptothecin in a DNA relaxation assay. Unlike camptothecin, CFS-1686 demonstrated no activity on topoisomerase 1 in a DNA cleavage assay, but nevertheless it reduced the camptothecin-induced DNA cleavage of topoisomerase 1 in a dose-dependent manner. Our results indicate that CFS-1686 might bind to topoisomerase 1 to inhibit this enzyme from interacting with DNA relaxation activity, unlike campothecin's induction of a topoisomerase 1-DNA cleavage complex. Finally, we used a computer docking strategy to localize the potential binding site of CFS-1686 to topoisomerase 1, further indicating that CFS-1686 might inhibit the binding of Top1 to DNA.
DNA fragmentation and sperm head morphometry in cat epididymal spermatozoa.
Vernocchi, Valentina; Morselli, Maria Giorgia; Lange Consiglio, Anna; Faustini, Massimo; Luvoni, Gaia Cecilia
2014-10-15
Sperm DNA fragmentation is an important parameter to assess sperm quality and can be a putative fertility predictor. Because the sperm head consists almost entirely of DNA, subtle differences in sperm head morphometry might be related to DNA status. Several techniques are available to analyze sperm DNA fragmentation, but they are labor-intensive and require expensive instrumentations. Recently, a kit (Sperm-Halomax) based on the sperm chromatin dispersion test and developed for spermatozoa of different species, but not for cat spermatozoa, became commercially available. The first aim of the present study was to verify the suitability of Sperm-Halomax assay, specifically developed for canine semen, for the evaluation of DNA fragmentation of epididymal cat spermatozoa. For this purpose, DNA fragmentation indexes (DFIs) obtained with Sperm-Halomax and terminal deoxynucleotidyl transferase-mediated nick-end labeling (TUNEL) were compared. The second aim was to investigate whether a correlation between DNA status, sperm head morphology, and morphometry assessed by computer-assisted semen analysis exists in cat epididymal spermatozoa. No differences were observed in DFIs obtained with Sperm-Halomax and TUNEL. This result indicates that Sperm-Halomax assay provides a reliable evaluation of DNA fragmentation of epididymal feline spermatozoa. The DFI seems to be independent from all the measured variables of sperm head morphology and morphometry. Thus, the evaluation of the DNA status of spermatozoa could effectively contribute to the completion of the standard analysis of fresh or frozen semen used in assisted reproductive technologies. Copyright © 2014 Elsevier Inc. All rights reserved.
Computer-aided design of nano-filter construction using DNA self-assembly
NASA Astrophysics Data System (ADS)
Mohammadzadegan, Reza; Mohabatkar, Hassan
2007-01-01
Computer-aided design plays a fundamental role in both top-down and bottom-up nano-system fabrication. This paper presents a bottom-up nano-filter patterning process based on DNA self-assembly. In this study we designed a new method to construct fully designed nano-filters with the pores between 5 nm and 9 nm in diameter. Our calculations illustrated that by constructing such a nano-filter we would be able to separate many molecules.
2012-09-30
computational tools provide the ability to display, browse, select, filter and summarize spatio-temporal relationships of these individual-based...her research assistant at Esri, Shaun Walbridge, and members of the Marine Mammal Institute ( MMI ), including Tomas Follet and Debbie Steel. This...Genomics Laboratory, MMI , OSU. 4 As part of the geneGIS initiative, these SPLASH photo-identification records and the geneSPLASH DNA profiles
DCJ-indel and DCJ-substitution distances with distinct operation costs
2013-01-01
Background Classical approaches to compute the genomic distance are usually limited to genomes with the same content and take into consideration only rearrangements that change the organization of the genome (i.e. positions and orientation of pieces of DNA, number and type of chromosomes, etc.), such as inversions, translocations, fusions and fissions. These operations are generically represented by the double-cut and join (DCJ) operation. The distance between two genomes, in terms of number of DCJ operations, can be computed in linear time. In order to handle genomes with distinct contents, also insertions and deletions of fragments of DNA – named indels – must be allowed. More powerful than an indel is a substitution of a fragment of DNA by another fragment of DNA. Indels and substitutions are called content-modifying operations. It has been shown that both the DCJ-indel and the DCJ-substitution distances can also be computed in linear time, assuming that the same cost is assigned to any DCJ or content-modifying operation. Results In the present study we extend the DCJ-indel and the DCJ-substitution models, considering that the content-modifying cost is distinct from and upper bounded by the DCJ cost, and show that the distance in both models can still be computed in linear time. Although the triangular inequality can be disrupted in both models, we also show how to efficiently fix this problem a posteriori. PMID:23879938
DNA Photo Lithography with Cinnamate-based Photo-Bio-Nano-Glue
NASA Astrophysics Data System (ADS)
Feng, Lang; Li, Minfeng; Romulus, Joy; Sha, Ruojie; Royer, John; Wu, Kun-Ta; Xu, Qin; Seeman, Nadrian; Weck, Marcus; Chaikin, Paul
2013-03-01
We present a technique to make patterned functional surfaces, using a cinnamate photo cross-linker and photolithography. We have designed and modified a complementary set of single DNA strands to incorporate a pair of opposing cinnamate molecules. On exposure to 360nm UV, the cinnamate makes a highly specific covalent bond permanently linking only the complementary strands containing the cinnamates. We have studied this specific and efficient crosslinking with cinnamate-containing DNA in solution and on particles. UV addressability allows us to pattern surfaces functionally. The entire surface is coated with a DNA sequence A incorporating cinnamate. DNA strands A'B with one end containing a complementary cinnamated sequence A' attached to another sequence B, are then hybridized to the surface. UV photolithography is used to bind the A'B strand in a specific pattern. The system is heated and the unbound DNA is washed away. The pattern is then observed by thermo-reversibly hybridizing either fluorescently dyed B' strands complementary to B, or colloids coated with B' strands. Our techniques can be used to reversibly and/or permanently bind, via DNA linkers, an assortment of molecules, proteins and nanostructures. Potential applications range from advanced self-assembly, such as templated self-replication schemes recently reported, to designed physical and chemical patterns, to high-resolution multi-functional DNA surfaces for genetic detection or DNA computing.
Sehgal, Manika; Singh, Tiratha Raj
2014-04-01
We present DR-GAS(1), a unique, consolidated and comprehensive DNA repair genetic association studies database of human DNA repair system. It presents information on repair genes, assorted mechanisms of DNA repair, linkage disequilibrium, haplotype blocks, nsSNPs, phosphorylation sites, associated diseases, and pathways involved in repair systems. DNA repair is an intricate process which plays an essential role in maintaining the integrity of the genome by eradicating the damaging effect of internal and external changes in the genome. Hence, it is crucial to extensively understand the intact process of DNA repair, genes involved, non-synonymous SNPs which perhaps affect the function, phosphorylated residues and other related genetic parameters. All the corresponding entries for DNA repair genes, such as proteins, OMIM IDs, literature references and pathways are cross-referenced to their respective primary databases. DNA repair genes and their associated parameters are either represented in tabular or in graphical form through images elucidated by computational and statistical analyses. It is believed that the database will assist molecular biologists, biotechnologists, therapeutic developers and other scientific community to encounter biologically meaningful information, and meticulous contribution of genetic level information towards treacherous diseases in human DNA repair systems. DR-GAS is freely available for academic and research purposes at: http://www.bioinfoindia.org/drgas. Copyright © 2014 Elsevier B.V. All rights reserved.
The role of correlation and solvation in ion interactions with B-DNA
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sushko, Maria L.; Thomas, Dennis G.; Pabit, Suzette
Ionic atmosphere around nucleic acids plays important roles in biological function. Large-scale explicit solvent simulations coupled to experimental assays such as anomalous small-angle X-ray scattering (ASAXS) can provide important insights into the structure and energetics of the ionic atmosphere but are time- and resource-intensive. In this paper, we demonstrate the use of classical density functional theory to model DNA-ion interactions and explore the balance between ion-DNA, ion-water, and ion-ion interactions. In particular, we compute the distribution of RbCl, SrCl2, and CoHexCl3 (cobalt hexammine chlo- ride) around a B-form DNA molecule. The accuracy of the DFT calculations was assessed by comparisonmore » between simulated and experimental ASAXS curves. As expected, these calculations revealed significant differences between the monovalent, divalent, and trivalent cations. About half of the DNA-bound Rb+ ions penetrate into the minor groove of the DNA and half adsorb on the DNA strands. The fraction of cations in the minor groove decreases for the larger Sr2+ ions and becomes zero for CoHex3+ ions, which all adsorb on the DNA strands. The distribution of CoHex3+ ions is mainly determined by Coulomb interactions, while ion-correlation forces play a central role in the monovalent Rb+ distribution and a combination of ion-correlation and hydration forces affect the Sr2+ distribution around DNA.« less
NASA Astrophysics Data System (ADS)
Al-Otaibi, Jamelah S.; EL Gogary, Tarek M.
2017-02-01
Anthraquinones are well-known anticancer drugs. Anthraquinones anticancer drugs carry out their cytotoxic activities through their interaction with DNA, and inhibition of topoisomerase II activity. Anthraquinones (AQ5 and AQ5H) were synthesized and studied with 1,5-DAAQ by computational and experimental tools. The purpose of this study is to shade more light on mechanism of interaction between anthraquinone DNA affinic agents and different types of DNA. This study will lead to gain of information useful for drug design and development. Molecular structures were optimized using DFT B3LYP/6-31 + G(d). Depending on intramolecular hydrogen bonding interactions four conformers of AQ5 were detected within the range of about 42 kcal/mol. Molecular reactivity of the anthraquinone compounds was explored using global and condensed descriptors (electrophilicity and Fukui functions). NMR and UV-VIS electronic absorption spectra of anthraquinones/DNA were investigated at the physiological pH. The interaction of the anthraquinones (AQ5 and AQ5H) were studied with different DNA namely, calf thymus DNA, (Poly[dA].Poly[dT]) and (Poly[dG].Poly[dC]). UV-VIS electronic absorption spectral data were employed to measure the affinity constants of drug/DNA binding using Scatchard analysis. NMR study confirms qualitatively the drug/DNA interaction in terms of peak shift and broadening.
Simulations Using Random-Generated DNA and RNA Sequences
ERIC Educational Resources Information Center
Bryce, C. F. A.
1977-01-01
Using a very simple computer program written in BASIC, a very large number of random-generated DNA or RNA sequences are obtained. Students use these sequences to predict complementary sequences and translational products, evaluate base compositions, determine frequencies of particular triplet codons, and suggest possible secondary structures.…
Modulating the DNA affinity of Elk-1 with computationally selected mutations.
Park, Sheldon; Boder, Eric T; Saven, Jeffery G
2005-04-22
In order to regulate gene expression, transcription factors must first bind their target DNA sequences. The affinity of this binding is determined by both the network of interactions at the interface and the entropy change associated with the complex formation. To study the role of structural fluctuation in fine-tuning DNA affinity, we performed molecular dynamics simulations of two highly homologous proteins, Elk-1 and SAP-1, that exhibit different sequence specificity. Simulation studies show that several residues in Elk have significantly higher main-chain root-mean-square deviations than their counterparts in SAP. In particular, a single residue, D69, may contribute to Elk's lower DNA affinity for P(c-fos) by structurally destabilizing the carboxy terminus of the recognition helix. While D69 does not contact DNA directly, the increased mobility in the region may contribute to its weaker binding. We measured the ability of single point mutants of Elk to bind P(c-fos) in a reporter assay, in which D69 of wild-type Elk has been mutated to other residues with higher helix propensity in order to stabilize the local conformation. The gains in transcriptional activity and the free energy of binding suggested from these measurements correlate well with stability gains computed from helix propensity and charge-macrodipole interactions. The study suggests that residues that are distal to the binding interface may indirectly modulate the binding affinity by stabilizing the protein scaffold required for efficient DNA interaction.
Quantitative fluorescence correlation spectroscopy on DNA in living cells
NASA Astrophysics Data System (ADS)
Hodges, Cameron; Kafle, Rudra P.; Meiners, Jens-Christian
2017-02-01
FCS is a fluorescence technique conventionally used to study the kinetics of fluorescent molecules in a dilute solution. Being a non-invasive technique, it is now drawing increasing interest for the study of more complex systems like the dynamics of DNA or proteins in living cells. Unlike an ordinary dye solution, the dynamics of macromolecules like proteins or entangled DNA in crowded environments is often slow and subdiffusive in nature. This in turn leads to longer residence times of the attached fluorophores in the excitation volume of the microscope and artifacts from photobleaching abound that can easily obscure the signature of the molecular dynamics of interest and make quantitative analysis challenging.We discuss methods and procedures to make FCS applicable to quantitative studies of the dynamics of DNA in live prokaryotic and eukaryotic cells. The intensity autocorrelation is computed function from weighted arrival times of the photons on the detector that maximizes the information content while simultaneously correcting for the effect of photobleaching to yield an autocorrelation function that reflects only the underlying dynamics of the sample. This autocorrelation function in turn is used to calculate the mean square displacement of the fluorophores attached to DNA. The displacement data is more amenable to further quantitative analysis than the raw correlation functions. By using a suitable integral transform of the mean square displacement, we can then determine the viscoelastic moduli of the DNA in its cellular environment. The entire analysis procedure is extensively calibrated and validated using model systems and computational simulations.
Biomolecular Assembly of Gold Nanocrystals
DOE Office of Scientific and Technical Information (OSTI.GOV)
Micheel, Christine Marya
2005-05-20
Over the past ten years, methods have been developed to construct discrete nanostructures using nanocrystals and biomolecules. While these frequently consist of gold nanocrystals and DNA, semiconductor nanocrystals as well as antibodies and enzymes have also been used. One example of discrete nanostructures is dimers of gold nanocrystals linked together with complementary DNA. This type of nanostructure is also known as a nanocrystal molecule. Discrete nanostructures of this kind have a number of potential applications, from highly parallel self-assembly of electronics components and rapid read-out of DNA computations to biological imaging and a variety of bioassays. My research focused inmore » three main areas. The first area, the refinement of electrophoresis as a purification and characterization method, included application of agarose gel electrophoresis to the purification of discrete gold nanocrystal/DNA conjugates and nanocrystal molecules, as well as development of a more detailed understanding of the hydrodynamic behavior of these materials in gels. The second area, the development of methods for quantitative analysis of transmission electron microscope data, used computer programs written to find pair correlations as well as higher order correlations. With these programs, it is possible to reliably locate and measure nanocrystal molecules in TEM images. The final area of research explored the use of DNA ligase in the formation of nanocrystal molecules. Synthesis of dimers of gold particles linked with a single strand of DNA possible through the use of DNA ligase opens the possibility for amplification of nanostructures in a manner similar to polymerase chain reaction. These three areas are discussed in the context of the work in the Alivisatos group, as well as the field as a whole.« less
NASA Astrophysics Data System (ADS)
Voityuk, Alexander A.
2008-03-01
The electron hole transfer (HT) properties of DNA are substantially affected by thermal fluctuations of the π stack structure. Depending on the mutual position of neighboring nucleobases, electronic coupling V may change by several orders of magnitude. In the present paper, we report the results of systematic QM/molecular dynamic (MD) calculations of the electronic couplings and on-site energies for the hole transfer. Based on 15ns MD trajectories for several DNA oligomers, we calculate the average coupling squares ⟨V2⟩ and the energies of basepair triplets XG +Y and XA +Y, where X, Y =G, A, T, and C. For each of the 32 systems, 15 000 conformations separated by 1ps are considered. The three-state generalized Mulliken-Hush method is used to derive electronic couplings for HT between neighboring basepairs. The adiabatic energies and dipole moment matrix elements are computed within the INDO/S method. We compare the rms values of V with the couplings estimated for the idealized B-DNA structure and show that in several important cases the couplings calculated for the idealized B-DNA structure are considerably underestimated. The rms values for intrastrand couplings G-G, A-A, G-A, and A-G are found to be similar, ˜0.07eV, while the interstrand couplings are quite different. The energies of hole states G+ and A+ in the stack depend on the nature of the neighboring pairs. The XG +Y are by 0.5eV more stable than XA +Y. The thermal fluctuations of the DNA structure facilitate the HT process from guanine to adenine. The tabulated couplings and on-site energies can be used as reference parameters in theoretical and computational studies of HT processes in DNA.
Pan, Gaofeng; Jiang, Limin; Tang, Jijun; Guo, Fei
2018-02-08
DNA methylation is an important biochemical process, and it has a close connection with many types of cancer. Research about DNA methylation can help us to understand the regulation mechanism and epigenetic reprogramming. Therefore, it becomes very important to recognize the methylation sites in the DNA sequence. In the past several decades, many computational methods-especially machine learning methods-have been developed since the high-throughout sequencing technology became widely used in research and industry. In order to accurately identify whether or not a nucleotide residue is methylated under the specific DNA sequence context, we propose a novel method that overcomes the shortcomings of previous methods for predicting methylation sites. We use k -gram, multivariate mutual information, discrete wavelet transform, and pseudo amino acid composition to extract features, and train a sparse Bayesian learning model to do DNA methylation prediction. Five criteria-area under the receiver operating characteristic curve (AUC), Matthew's correlation coefficient (MCC), accuracy (ACC), sensitivity (SN), and specificity-are used to evaluate the prediction results of our method. On the benchmark dataset, we could reach 0.8632 on AUC, 0.8017 on ACC, 0.5558 on MCC, and 0.7268 on SN. Additionally, the best results on two scBS-seq profiled mouse embryonic stem cells datasets were 0.8896 and 0.9511 by AUC, respectively. When compared with other outstanding methods, our method surpassed them on the accuracy of prediction. The improvement of AUC by our method compared to other methods was at least 0.0399 . For the convenience of other researchers, our code has been uploaded to a file hosting service, and can be downloaded from: https://figshare.com/s/0697b692d802861282d3.
An accurate algorithm for the detection of DNA fragments from dilution pool sequencing experiments.
Bansal, Vikas
2018-01-01
The short read lengths of current high-throughput sequencing technologies limit the ability to recover long-range haplotype information. Dilution pool methods for preparing DNA sequencing libraries from high molecular weight DNA fragments enable the recovery of long DNA fragments from short sequence reads. These approaches require computational methods for identifying the DNA fragments using aligned sequence reads and assembling the fragments into long haplotypes. Although a number of computational methods have been developed for haplotype assembly, the problem of identifying DNA fragments from dilution pool sequence data has not received much attention. We formulate the problem of detecting DNA fragments from dilution pool sequencing experiments as a genome segmentation problem and develop an algorithm that uses dynamic programming to optimize a likelihood function derived from a generative model for the sequence reads. This algorithm uses an iterative approach to automatically infer the mean background read depth and the number of fragments in each pool. Using simulated data, we demonstrate that our method, FragmentCut, has 25-30% greater sensitivity compared with an HMM based method for fragment detection and can also detect overlapping fragments. On a whole-genome human fosmid pool dataset, the haplotypes assembled using the fragments identified by FragmentCut had greater N50 length, 16.2% lower switch error rate and 35.8% lower mismatch error rate compared with two existing methods. We further demonstrate the greater accuracy of our method using two additional dilution pool datasets. FragmentCut is available from https://bansal-lab.github.io/software/FragmentCut. vibansal@ucsd.edu. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Theory of high-force DNA stretching and overstretching.
Storm, C; Nelson, P C
2003-05-01
Single-molecule experiments on single- and double-stranded DNA have sparked a renewed interest in the force versus extension of polymers. The extensible freely jointed chain (FJC) model is frequently invoked to explain the observed behavior of single-stranded DNA, but this model does not satisfactorily describe recent high-force stretching data. We instead propose a model (the discrete persistent chain) that borrows features from both the FJC and the wormlike chain, and show that it resembles the data more closely. We find that most of the high-force behavior previously attributed to stretch elasticity is really a feature of the corrected entropic elasticity; the true stretch compliance of single-stranded DNA is several times smaller than that found by previous authors. Next we elaborate our model to allow coexistence of two conformational states of DNA, each with its own stretch and bend elastic constants. Our model is computationally simple and gives an excellent fit through the entire overstretching transition of nicked, double-stranded DNA. The fit gives the first value for the bend stiffness of the overstretched state. In particular, we find the effective bend stiffness for DNA in this state to be about 12 nm k(B)T, a value quite different from either the B-form or single-stranded DNA.
New Modeling Approaches to Study DNA Damage by the Direct and Indirect Effects of Ionizing Radiation
NASA Technical Reports Server (NTRS)
Plante, Ianik; Cucinotta, Francis A.
2012-01-01
DNA is damaged both by the direct and indirect effects of radiation. In the direct effect, the DNA itself is ionized, whereas the indirect effect involves the radiolysis of the water molecules surrounding the DNA and the subsequent reaction of the DNA with radical products. While this problem has been studied for many years, many unknowns still exist. To study this problem, we have developed the computer code RITRACKS [1], which simulates the radiation track structure for heavy ions and electrons, calculating all energy deposition events and the coordinates of all species produced by the water radiolysis. In this work, we plan to simulate DNA damage by using the crystal structure of a nucleosome and calculations performed by RITRACKS. The energy deposition events are used to calculate the dose deposited in nanovolumes [2] and therefore can be used to simulate the direct effect of the radiation. Using the positions of the radiolytic species with a radiation chemistry code [3] it will be possible to simulate DNA damage by indirect effect. The simulation results can be compared with results from previous calculations such as the frequencies of simple and complex strand breaks [4] and with newer experimental data using surrogate markers of DNA double ]strand breaks such as . ]H2AX foci [5].
The Essential Component in DNA-Based Information Storage System: Robust Error-Tolerating Module
Yim, Aldrin Kay-Yuen; Yu, Allen Chi-Shing; Li, Jing-Woei; Wong, Ada In-Chun; Loo, Jacky F. C.; Chan, King Ming; Kong, S. K.; Yip, Kevin Y.; Chan, Ting-Fung
2014-01-01
The size of digital data is ever increasing and is expected to grow to 40,000 EB by 2020, yet the estimated global information storage capacity in 2011 is <300 EB, indicating that most of the data are transient. DNA, as a very stable nano-molecule, is an ideal massive storage device for long-term data archive. The two most notable illustrations are from Church et al. and Goldman et al., whose approaches are well-optimized for most sequencing platforms – short synthesized DNA fragments without homopolymer. Here, we suggested improvements on error handling methodology that could enable the integration of DNA-based computational process, e.g., algorithms based on self-assembly of DNA. As a proof of concept, a picture of size 438 bytes was encoded to DNA with low-density parity-check error-correction code. We salvaged a significant portion of sequencing reads with mutations generated during DNA synthesis and sequencing and successfully reconstructed the entire picture. A modular-based programing framework – DNAcodec with an eXtensible Markup Language-based data format was also introduced. Our experiments demonstrated the practicability of long DNA message recovery with high error tolerance, which opens the field to biocomputing and synthetic biology. PMID:25414846
Molecular modelling study of changes induced by netropsin binding to nucleosome core particles.
Pérez, J J; Portugal, J
1990-01-01
It is well known that certain sequence-dependent modulators in structure appear to determine the rotational positioning of DNA on the nucleosome core particle. That preference is rather weak and could be modified by some ligands as netropsin, a minor-groove binding antibiotic. We have undertaken a molecular modelling approach to calculate the relative energy of interaction between a DNA molecule and the protein core particle. The histones particle is considered as a distribution of positive charges on the protein surface that interacts with the DNA molecule. The molecular electrostatic potentials for the DNA, simulated as a discontinuous cylinder, were calculated using the values for all the base pairs. Computing these parameters, we calculated the relative energy of interaction and the more stable rotational setting of DNA. The binding of four molecules of netropsin to this model showed that a new minimum of energy is obtained when the DNA turns toward the protein surface by about 180 degrees, so a new energetically favoured structure appears where netropsin binding sites are located facing toward the histones surface. The effect of netropsin could be explained in terms of an induced change in the phasing of DNA on the core particle. The induced rotation is considered to optimize non-bonded contacts between the netropsin molecules and the DNA backbone. PMID:2165249
Computational characterization of DNA/peptide/nanotube self assembly for bioenergy applications
NASA Astrophysics Data System (ADS)
Ortiz, Vanessa; Araki, Ruriko; Collier, Galen
2012-02-01
Multi-enzyme pathways have become a subject of increasing interest for their role in the engineering of biomimetic systems for applications including biosensors, bioelectronics, and bioenergy. The efficiencies found in natural metabolic pathways partially arise from biomolecular self-assembly of the component enzymes in an effort to avoid transport limitations. The ultimate goal of this effort is to design and build biofuel cells with efficiencies similar to those of native systems by introducing biomimetic structures that immobilize multiple enzymes in specific orientations on a bioelectrode. To achieve site-specific immobilization, the specificity of DNA-binding domains is exploited with an approach that allows any redox enzyme to be modified to site-specifically bind to double stranded (ds) DNA while retaining activity. Because of its many desirable properties, the bioelectrode of choice is single-wall carbon nanotubes (SWNTs), but little is known about dsDNA/SWNT assembly and how this might affect the activity of the DNA-binding domains. Here we evaluate the feasibility of the proposed assembly by performing atomistic molecular dynamics simulations to look at the stability and conformations adopted by dsDNA when bound to a SWNT. We also evaluate the effects of the presence of a SWNT on the stability of the complex formed by a DNA-binding domain and DNA.
Performing SELEX experiments in silico
NASA Astrophysics Data System (ADS)
Wondergem, J. A. J.; Schiessel, H.; Tompitak, M.
2017-11-01
Due to the sequence-dependent nature of the elasticity of DNA, many protein-DNA complexes and other systems in which DNA molecules must be deformed have preferences for the type of DNA sequence they interact with. SELEX (Systematic Evolution of Ligands by EXponential enrichment) experiments and similar sequence selection experiments have been used extensively to examine the (indirect readout) sequence preferences of, e.g., nucleosomes (protein spools around which DNA is wound for compactification) and DNA rings. We show how recently developed computational and theoretical tools can be used to emulate such experiments in silico. Opening up this possibility comes with several benefits. First, it allows us a better understanding of our models and systems, specifically about the roles played by the simulation temperature and the selection pressure on the sequences. Second, it allows us to compare the predictions made by the model of choice with experimental results. We find agreement on important features between predictions of the rigid base-pair model and experimental results for DNA rings and interesting differences that point out open questions in the field. Finally, our simulations allow application of the SELEX methodology to systems that are experimentally difficult to realize because they come with high energetic costs and are therefore unlikely to form spontaneously, such as very short or overwound DNA rings.
Abolfath, Ramin M; Biswas, P K; Rajnarayanam, R; Brabec, Thomas; Kodym, Reinhard; Papiez, Lech
2012-04-19
Understanding the damage of DNA bases from hydrogen abstraction by free OH radicals is of particular importance to understanding the indirect effect of ionizing radiation. Previous studies address the problem with truncated DNA bases as ab initio quantum simulations required to study such electronic-spin-dependent processes are computationally expensive. Here, for the first time, we employ a multiscale and hybrid quantum mechanical-molecular mechanical simulation to study the interaction of OH radicals with a guanine-deoxyribose-phosphate DNA molecular unit in the presence of water, where all of the water molecules and the deoxyribose-phosphate fragment are treated with the simplistic classical molecular mechanical scheme. Our result illustrates that the presence of water strongly alters the hydrogen-abstraction reaction as the hydrogen bonding of OH radicals with water restricts the relative orientation of the OH radicals with respect to the DNA base (here, guanine). This results in an angular anisotropy in the chemical pathway and a lower efficiency in the hydrogen-abstraction mechanisms than previously anticipated for identical systems in vacuum. The method can easily be extended to single- and double-stranded DNA without any appreciable computational cost as these molecular units can be treated in the classical subsystem, as has been demonstrated here. © 2012 American Chemical Society
Sub-Terrahertz Spectroscopy of E.COLI Dna: Experiment, Statistical Model, and MD Simulations
NASA Astrophysics Data System (ADS)
Sizov, I.; Dorofeeva, T.; Khromova, T.; Gelmont, B.; Globus, T.
2012-06-01
We will present result of combined experimental and computational study of sub-THz absorption spectra from Escherichia coli (E.coli) DNA. Measurements were conducted using a Bruker FTIR spectrometer with a liquid helium cooled bolometer and a recently developed frequency domain sensor operating at room temperature, with spectral resolution of 0.25 cm-1 and 0.03 cm-1, correspondingly. We have earlier demonstrated that molecular dynamics (MD) simulation can be effectively applied for characterizing relatively small biological molecules, such as transfer RNA or small protein thioredoxin from E. coli , and help to understand and predict their absorption spectra. Large size of DNA macromolecules ( 5 million base pairs for E. coli DNA) prevents, however, direct application of MD simulation at the current level of computational capabilities. Therefore, by applying a second order Markov chain approach and Monte-Carlo technique, we have developed a new statistical model to construct DNA sequences from biological cells. These short representative sequences (20-60 base pairs) are built upon the most frequently repeated fragments (2-10 base pairs) in the original DNA. Using this new approach, we constructed DNA sequences for several non-pathogenic strains of E.coli, including a well-known strain BL21, uro-pathogenic strain, CFT073, and deadly EDL933 strain (O157:H7), and used MD simulations to calculate vibrational absorption spectra of these strains. Significant differences are clearly present in spectra of strains in averaged spectra and in all components for particular orientations. The mechanism of interaction of THz radiation with a biological molecule is studied by analyzing dynamics of atoms and correlation of local vibrations in the modeled molecule. Simulated THz vibrational spectra of DNA are compared with experimental results. With the spectral resolution of 0.1 cm-1 or better, which is now available in experiments, the very easy discrimination between different strains of the same bacteria becomes possible.
Polymorphic design of DNA origami structures through mechanical control of modular components.
Lee, Chanseok; Lee, Jae Young; Kim, Do-Nyun
2017-12-12
Scaffolded DNA origami enables the bottom-up fabrication of diverse DNA nanostructures by designing hundreds of staple strands, comprised of complementary sequences to the specific binding locations of a scaffold strand. Despite its exceptionally high design flexibility, poor reusability of staples has been one of the major hurdles to fabricate assorted DNA constructs in an effective way. Here we provide a rational module-based design approach to create distinct bent shapes with controllable geometries and flexibilities from a single, reference set of staples. By revising the staple connectivity within the desired module, we can control the location, stiffness, and included angle of hinges precisely, enabling the construction of dozens of single- or multiple-hinge structures with the replacement of staple strands up to 12.8% only. Our design approach, combined with computational shape prediction and analysis, can provide a versatile and cost-effective procedure in the design of DNA origami shapes with stiffness-tunable units.
Lareau, Caleb A; Aryee, Martin J; Berger, Bonnie
2018-02-15
The 3D architecture of DNA within the nucleus is a key determinant of interactions between genes, regulatory elements, and transcriptional machinery. As a result, differences in DNA looping structure are associated with variation in gene expression and cell state. To systematically assess changes in DNA looping architecture between samples, we introduce diffloop, an R/Bioconductor package that provides a suite of functions for the quality control, statistical testing, annotation, and visualization of DNA loops. We demonstrate this functionality by detecting differences between ENCODE ChIA-PET samples and relate looping to variability in epigenetic state. Diffloop is implemented as an R/Bioconductor package available at https://bioconductor.org/packages/release/bioc/html/diffloop.html. aryee.martin@mgh.harvard.edu. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Maffeo, C.; Yoo, J.; Comer, J.; Wells, D. B.; Luan, B.; Aksimentiev, A.
2014-01-01
Over the past ten years, the all-atom molecular dynamics method has grown in the scale of both systems and processes amenable to it and in its ability to make quantitative predictions about the behavior of experimental systems. The field of computational DNA research is no exception, witnessing a dramatic increase in the size of systems simulated with atomic resolution, the duration of individual simulations and the realism of the simulation outcomes. In this topical review, we describe the hallmark physical properties of DNA from the perspective of all-atom simulations. We demonstrate the amazing ability of such simulations to reveal the microscopic physical origins of experimentally observed phenomena and we review the frustrating limitations associated with imperfections of present atomic force fields and inadequate sampling. The review is focused on the following four physical properties of DNA: effective electric charge, response to an external mechanical force, interaction with other DNA molecules and behavior in an external electric field. PMID:25238560
Designing nucleosomal force sensors
NASA Astrophysics Data System (ADS)
Tompitak, M.; de Bruin, L.; Eslami-Mossallam, B.; Schiessel, H.
2017-05-01
About three quarters of our DNA is wrapped into nucleosomes: DNA spools with a protein core. It is well known that the affinity of a given DNA stretch to be incorporated into a nucleosome depends on the geometry and elasticity of the basepair sequence involved, causing the positioning of nucleosomes. Here we show that DNA elasticity can have a much deeper effect on nucleosomes than just their positioning: it affects their "identities". Employing a recently developed computational algorithm, the mutation Monte Carlo method, we design nucleosomes with surprising physical characteristics. Unlike any other nucleosomes studied so far, these nucleosomes are short-lived when put under mechanical tension whereas other physical properties are largely unaffected. This suggests that the nucleosome, the most abundant DNA-protein complex in our cells, might more properly be considered a class of complexes with a wide array of physical properties, and raises the possibility that evolution has shaped various nucleosome species according to their genomic context.
Maffeo, C; Yoo, J; Comer, J; Wells, D B; Luan, B; Aksimentiev, A
2014-10-15
Over the past ten years, the all-atom molecular dynamics method has grown in the scale of both systems and processes amenable to it and in its ability to make quantitative predictions about the behavior of experimental systems. The field of computational DNA research is no exception, witnessing a dramatic increase in the size of systems simulated with atomic resolution, the duration of individual simulations and the realism of the simulation outcomes. In this topical review, we describe the hallmark physical properties of DNA from the perspective of all-atom simulations. We demonstrate the amazing ability of such simulations to reveal the microscopic physical origins of experimentally observed phenomena. We also discuss the frustrating limitations associated with imperfections of present atomic force fields and inadequate sampling. The review is focused on the following four physical properties of DNA: effective electric charge, response to an external mechanical force, interaction with other DNA molecules and behavior in an external electric field.
Construction of a fuzzy and Boolean logic gates based on DNA.
Zadegan, Reza M; Jepsen, Mette D E; Hildebrandt, Lasse L; Birkedal, Victoria; Kjems, Jørgen
2015-04-17
Logic gates are devices that can perform logical operations by transforming a set of inputs into a predictable single detectable output. The hybridization properties, structure, and function of nucleic acids can be used to make DNA-based logic gates. These devices are important modules in molecular computing and biosensing. The ideal logic gate system should provide a wide selection of logical operations, and be integrable in multiple copies into more complex structures. Here we show the successful construction of a small DNA-based logic gate complex that produces fluorescent outputs corresponding to the operation of the six Boolean logic gates AND, NAND, OR, NOR, XOR, and XNOR. The logic gate complex is shown to work also when implemented in a three-dimensional DNA origami box structure, where it controlled the position of the lid in a closed or open position. Implementation of multiple microRNA sensitive DNA locks on one DNA origami box structure enabled fuzzy logical operation that allows biosensing of complex molecular signals. Integrating logic gates with DNA origami systems opens a vast avenue to applications in the fields of nanomedicine for diagnostics and therapeutics. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Cardamone, Francesca; Pizzi, Simone; Iacovelli, Federico; Falconi, Mattia; Desideri, Alessandro
2017-01-01
Human topoisomerase IB is an important target in cancer therapy and drugs selectively stabilizing the topoisomerase IB-DNA covalent complex are in clinical use for several cancer types. Tyrosyl- DNA phosphodiesterase 1 is involved in the DNA repair resolving the topoisomerase IB-DNA covalent complex that is extremely dangerous for the survival of the cells since it produces an irreversible DNA damage. Given the close biological relationship between these two enzymes, the development of synergistic inhibitors, called dual-inhibitors, is an important challenge in cancer therapy and computer-aided drug design may help in the identification of the best compounds. In this review, an overview of the compounds inhibiting one of the two enzymes or acting as dual inhibitors is provided. Moreover, the general procedures of the virtual screening approach, providing a description of two widely used opensource programs, namely AutoDock4 and AutoDock Vina, are described. Finally, an application of the two programs on a selected number of dual inhibitors for tyrosyl-DNA phosphodiesterase 1 and topoisomerase IB and their performance is briefly discussed. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Modular structural elements in the replication origin region of Tetrahymena rDNA.
Du, C; Sanzgiri, R P; Shaiu, W L; Choi, J K; Hou, Z; Benbow, R M; Dobbs, D L
1995-01-01
Computer analyses of the DNA replication origin region in the amplified rRNA genes of Tetrahymena thermophila identified a potential initiation zone in the 5'NTS [Dobbs, Shaiu and Benbow (1994), Nucleic Acids Res. 22, 2479-2489]. This region consists of a putative DNA unwinding element (DUE) aligned with predicted bent DNA segments, nuclear matrix or scaffold associated region (MAR/SAR) consensus sequences, and other common modular sequence elements previously shown to be clustered in eukaryotic chromosomal origin regions. In this study, two mung bean nuclease-hypersensitive sites in super-coiled plasmid DNA were localized within the major DUE-like element predicted by thermodynamic analyses. Three restriction fragments of the 5'NTS region predicted to contain bent DNA segments exhibited anomalous migration characteristic of bent DNA during electrophoresis on polyacrylamide gels. Restriction fragments containing the 5'NTS region bound Tetrahymena nuclear matrices in an in vitro binding assay, consistent with an association of the replication origin region with the nuclear matrix in vivo. The direct demonstration in a protozoan origin region of elements previously identified in Drosophila, chick and mammalian origin regions suggests that clusters of modular structural elements may be a conserved feature of eukaryotic chromosomal origins of replication. Images PMID:7784181
Verma, Kapil; Sharma, Sapna; Sharma, Arun; Dalal, Jyoti; Bhardwaj, Tapeshwar
2018-06-01
Genetic variations among humans occur both within and among populations and range from single nucleotide changes to multiple-nucleotide variants. These multiple-nucleotide variants are useful for studying the relationships among individuals or various population groups. The study of human genetic variations can help scientists understand how different population groups are biologically related to one another. Sequence analysis of hypervariable regions of human mitochondrial DNA (mtDNA) has been successfully used for the genetic characterization of different population groups for forensic purposes. It is well established that different ethnic or population groups differ significantly in their mtDNA distributions. In the last decade, very little research has been conducted on mtDNA variations in the Indian population, although such data would be useful for elucidating the history of human population expansion across the world. Moreover, forensic studies on mtDNA variations in the Indian subcontinent are also scarce, particularly in the northern part of India. In this report, variations in the hypervariable regions of mtDNA were analyzed in the Yadav population of Haryana. Different molecular diversity indices were computed. Further, the obtained haplotypes were classified into different haplogroups and the phylogenetic relationship between different haplogroups was inferred.
JavaScript DNA translator: DNA-aligned protein translations.
Perry, William L
2002-12-01
There are many instances in molecular biology when it is necessary to identify ORFs in a DNA sequence. While programs exist for displaying protein translations in multiple ORFs in alignment with a DNA sequence, they are often expensive, exist as add-ons to software that must be purchased, or are only compatible with a particular operating system. JavaScript DNA Translator is a shareware application written in JavaScript, a scripting language interpreted by the Netscape Communicator and Internet Explorer Web browsers, which makes it compatible with several different operating systems. While the program uses a familiar Web page interface, it requires no connection to the Internet since calculations are performed on the user's own computer. The program analyzes one or multiple DNA sequences and generates translations in up to six reading frames aligned to a DNA sequence, in addition to displaying translations as separate sequences in FASTA format. ORFs within a reading frame can also be displayed as separate sequences. Flexible formatting options are provided, including the ability to hide ORFs below a minimum size specified by the user. The program is available free of charge at the BioTechniques Software Library (www.Biotechniques.com).
Developing DNA nanotechnology using single-molecule fluorescence.
Tsukanov, Roman; Tomov, Toma E; Liber, Miran; Berger, Yaron; Nir, Eyal
2014-06-17
CONSPECTUS: An important effort in the DNA nanotechnology field is focused on the rational design and manufacture of molecular structures and dynamic devices made of DNA. As is the case for other technologies that deal with manipulation of matter, rational development requires high quality and informative feedback on the building blocks and final products. For DNA nanotechnology such feedback is typically provided by gel electrophoresis, atomic force microscopy (AFM), and transmission electron microscopy (TEM). These analytical tools provide excellent structural information; however, usually they do not provide high-resolution dynamic information. For the development of DNA-made dynamic devices such as machines, motors, robots, and computers this constitutes a major problem. Bulk-fluorescence techniques are capable of providing dynamic information, but because only ensemble averaged information is obtained, the technique may not adequately describe the dynamics in the context of complex DNA devices. The single-molecule fluorescence (SMF) technique offers a unique combination of capabilities that make it an excellent tool for guiding the development of DNA-made devices. The technique has been increasingly used in DNA nanotechnology, especially for the analysis of structure, dynamics, integrity, and operation of DNA-made devices; however, its capabilities are not yet sufficiently familiar to the community. The purpose of this Account is to demonstrate how different SMF tools can be utilized for the development of DNA devices and for structural dynamic investigation of biomolecules in general and DNA molecules in particular. Single-molecule diffusion-based Förster resonance energy transfer and alternating laser excitation (sm-FRET/ALEX) and immobilization-based total internal reflection fluorescence (TIRF) techniques are briefly described and demonstrated. To illustrate the many applications of SMF to DNA nanotechnology, examples of SMF studies of DNA hairpins and Holliday junctions and of the interactions of DNA strands with DNA origami and origami-related devices such as a DNA bipedal motor are provided. These examples demonstrate how SMF can be utilized for measurement of distances and conformational distributions and equilibrium and nonequilibrium kinetics, to monitor structural integrity and operation of DNA devices, and for isolation and investigation of minor subpopulations including malfunctioning and nonreactive devices. Utilization of a flow-cell to achieve measurements of dynamics with increased time resolution and for convenient and efficient operation of DNA devices is discussed briefly. We conclude by summarizing the various benefits provided by SMF for the development of DNA nanotechnology and suggest that the method can significantly assist in the design and manufacture and evaluation of operation of DNA devices.
Understanding DNA under oxidative stress and sensitization: the role of molecular modeling
Dumont, Elise; Monari, Antonio
2015-01-01
DNA is constantly exposed to damaging threats coming from oxidative stress, i.e., from the presence of free radicals and reactive oxygen species. Sensitization from exogenous and endogenous compounds that strongly enhance the frequency of light-induced lesions also plays an important role. The experimental determination of DNA lesions, though a difficult subject, is somehow well established and allows to elucidate even extremely rare DNA lesions. In parallel, molecular modeling has become fundamental to clearly understand the fine mechanisms related to DNA defects induction. Indeed, it offers an unprecedented possibility to get access to an atomistic or even electronic resolution. Ab initio molecular dynamics may also describe the time-evolution of the molecular system and its reactivity. Yet the modeling of DNA (photo-)reactions does necessitate elaborate multi-scale methodologies to tackle a damage induction reactivity that takes place in a complex environment. The double-stranded DNA environment is first characterized by a very high flexibility, but also a strongly inhomogeneous electrostatic embedding. Additionally, one aims at capturing more subtle effects, such as the sequence selectivity which is of critical important for DNA damage. The structure and dynamics of the DNA/sensitizers complexes, as well as the photo-induced electron- and energy-transfer phenomena taking place upon sensitization, should be carefully modeled. Finally the factors inducing different repair ratios for different lesions should also be rationalized. In this review we will critically analyze the different computational strategies used to model DNA lesions. A clear picture of the complex interplay between reactivity and structural factors will be sketched. The use of proper multi-scale modeling leads to the in-depth comprehension of DNA lesions mechanisms and also to the rational design of new chemo-therapeutic agents. PMID:26236706
Single DNA imaging and length quantification through a mobile phone microscope
NASA Astrophysics Data System (ADS)
Wei, Qingshan; Luo, Wei; Chiang, Samuel; Kappel, Tara; Mejia, Crystal; Tseng, Derek; Chan, Raymond Yan L.; Yan, Eddie; Qi, Hangfei; Shabbir, Faizan; Ozkan, Haydar; Feng, Steve; Ozcan, Aydogan
2016-03-01
The development of sensitive optical microscopy methods for the detection of single DNA molecules has become an active research area which cultivates various promising applications including point-of-care (POC) genetic testing and diagnostics. Direct visualization of individual DNA molecules usually relies on sophisticated optical microscopes that are mostly available in well-equipped laboratories. For POC DNA testing/detection, there is an increasing need for the development of new single DNA imaging and sensing methods that are field-portable, cost-effective, and accessible for diagnostic applications in resource-limited or field-settings. For this aim, we developed a mobile-phone integrated fluorescence microscopy platform that allows imaging and sizing of single DNA molecules that are stretched on a chip. This handheld device contains an opto-mechanical attachment integrated onto a smartphone camera module, which creates a high signal-to-noise ratio dark-field imaging condition by using an oblique illumination/excitation configuration. Using this device, we demonstrated imaging of individual linearly stretched λ DNA molecules (48 kilobase-pair, kbp) over 2 mm2 field-of-view. We further developed a robust computational algorithm and a smartphone app that allowed the users to quickly quantify the length of each DNA fragment imaged using this mobile interface. The cellphone based device was tested by five different DNA samples (5, 10, 20, 40, and 48 kbp), and a sizing accuracy of <1 kbp was demonstrated for DNA strands longer than 10 kbp. This mobile DNA imaging and sizing platform can be very useful for various diagnostic applications including the detection of disease-specific genes and quantification of copy-number-variations at POC settings.
NASA Technical Reports Server (NTRS)
Hu, Shaowen; Cucinotta, Francis A.
2009-01-01
The Ku70/80 heterodimer is the first repair protein in the initial binding of double-strand break (DSB) ends following DNA damage, and is a component of nonhomologous end joining repair, the primary pathway for DSB repair in mammalian cells. In this study we constructed a full-length human Ku70 structure based on its crystal structure, and performed 20 ns conventional molecular dynamic (CMD) simulations on this protein and several other complexes with short DNA duplexes of different sequences. The trajectories of these simulations indicated that, without the topological support of Ku80, the residues in the bridge and C-terminal arm of Ku70 are more flexible than other experimentally identified domains. We studied the two missing loops in the crystal structure and predicted that they are also very flexible. Simulations revealed that they make an important contribution to the Ku70 interaction with DNA. Dislocation of the previously studied SAP domain was observed in several systems, implying its role in DNA binding. Targeted molecular dynamic (TMD) simulation was also performed for one system with a far-away 14bp DNA duplex. The TMD trajectory and energetic analysis disclosed detailed interactions of the DNA-binding residues during the DNA dislocation, and revealed a possible conformational transition for a DSB end when encountering Ku70 in solution. Compared to experimentally based analysis, this study identified more detailed interactions between DNA and Ku70. Free energy analysis indicated Ku70 alone is able to bind DNA with relatively high affinity, with consistent contributions from various domains of Ku70 in different systems. The functional implications of these domains in the processes of Ku heterodimerization and DNA damage recognition and repair can be characterized in detail based upon this analysis.
NASA Astrophysics Data System (ADS)
Smith, Jarrod Anson
2D homonuclear 1H NMR methods and restrained molecular dynamics (rMD) calculations have been applied to determining the three-dimensional structures of DNA and minor groove-binding ligand-DNA complexes in solution. The structure of the DNA decamer sequence d(GCGTTAACGC)2 has been solved both with a distance-based rMD protocol and an NOE relaxation matrix backcalculation-based protocol in order to probe the relative merits of the different refinement methods. In addition, three minor groove binding ligand-DNA complexes have been examined. The solution structure of the oligosaccharide moiety of the antitumor DNA scission agent calicheamicin γ1I has been determined in complex with a decamer duplex containing its high affinity 5'-TCCT- 3' binding sequence. The structure of the complex reinforces the belief that the oligosaccharide moiety is responsible for the sequence selective minor-groove binding activity of the agent, and critical intermolecular contacts are revealed. The solution structures of both the (+) and (-) enantiomers of the minor groove binding DNA alkylating agent duocarmycin SA have been determined in covalent complex with the undecamer DNA duplex d(GACTAATTGTC).d(GAC AATTAGTC). The results support the proposal that the alkylation activity of the duocarmycin antitumor antibiotics is catalyzed by a binding-induced conformational change in the ligand which activates the cyclopropyl group for reaction with the DNA. Comparisons between the structures of the two enantiomers covalently bound to the same DNA sequence at the same 5'-AATTA-3 ' site have provided insight into the binding orientation and site selectivity, as well as the relative rates of reactivity of these two agents.
A System Architecture for Efficient Transmission of Massive DNA Sequencing Data.
Sağiroğlu, Mahmut Şamİl; Külekcİ, M Oğuzhan
2017-11-01
The DNA sequencing data analysis pipelines require significant computational resources. In that sense, cloud computing infrastructures appear as a natural choice for this processing. However, the first practical difficulty in reaching the cloud computing services is the transmission of the massive DNA sequencing data from where they are produced to where they will be processed. The daily practice here begins with compressing the data in FASTQ file format, and then sending these data via fast data transmission protocols. In this study, we address the weaknesses in that daily practice and present a new system architecture that incorporates the computational resources available on the client side while dynamically adapting itself to the available bandwidth. Our proposal considers the real-life scenarios, where the bandwidth of the connection between the parties may fluctuate, and also the computing power on the client side may be of any size ranging from moderate personal computers to powerful workstations. The proposed architecture aims at utilizing both the communication bandwidth and the computing resources for satisfying the ultimate goal of reaching the results as early as possible. We present a prototype implementation of the proposed architecture, and analyze several real-life cases, which provide useful insights for the sequencing centers, especially on deciding when to use a cloud service and in what conditions.
Modeling Structure-Function Relationships in Synthetic DNA Sequences using Attribute Grammars
Cai, Yizhi; Lux, Matthew W.; Adam, Laura; Peccoud, Jean
2009-01-01
Recognizing that certain biological functions can be associated with specific DNA sequences has led various fields of biology to adopt the notion of the genetic part. This concept provides a finer level of granularity than the traditional notion of the gene. However, a method of formally relating how a set of parts relates to a function has not yet emerged. Synthetic biology both demands such a formalism and provides an ideal setting for testing hypotheses about relationships between DNA sequences and phenotypes beyond the gene-centric methods used in genetics. Attribute grammars are used in computer science to translate the text of a program source code into the computational operations it represents. By associating attributes with parts, modifying the value of these attributes using rules that describe the structure of DNA sequences, and using a multi-pass compilation process, it is possible to translate DNA sequences into molecular interaction network models. These capabilities are illustrated by simple example grammars expressing how gene expression rates are dependent upon single or multiple parts. The translation process is validated by systematically generating, translating, and simulating the phenotype of all the sequences in the design space generated by a small library of genetic parts. Attribute grammars represent a flexible framework connecting parts with models of biological function. They will be instrumental for building mathematical models of libraries of genetic constructs synthesized to characterize the function of genetic parts. This formalism is also expected to provide a solid foundation for the development of computer assisted design applications for synthetic biology. PMID:19816554
Parker, Trent M; Hohenstein, Edward G; Parrish, Robert M; Hud, Nicholas V; Sherrill, C David
2013-01-30
Symmetry-adapted perturbation theory (SAPT) is applied to pairs of hydrogen-bonded nucleobases to obtain the energetic components of base stacking (electrostatic, exchange-repulsion, induction/polarization, and London dispersion interactions) and how they vary as a function of the helical parameters Rise, Twist, and Slide. Computed average values of Rise and Twist agree well with experimental data for B-form DNA from the Nucleic Acids Database, even though the model computations omitted the backbone atoms (suggesting that the backbone in B-form DNA is compatible with having the bases adopt their ideal stacking geometries). London dispersion forces are the most important attractive component in base stacking, followed by electrostatic interactions. At values of Rise typical of those in DNA (3.36 Å), the electrostatic contribution is nearly always attractive, providing further evidence for the importance of charge-penetration effects in π-π interactions (a term neglected in classical force fields). Comparison of the computed stacking energies with those from model complexes made of the "parent" nucleobases purine and 2-pyrimidone indicates that chemical substituents in DNA and RNA account for 20-40% of the base-stacking energy. A lack of correspondence between the SAPT results and experiment for Slide in RNA base-pair steps suggests that the backbone plays a larger role in determining stacking geometries in RNA than in B-form DNA. In comparisons of base-pair steps with thymine versus uracil, the thymine methyl group tends to enhance the strength of the stacking interaction through a combination of dispersion and electrosatic interactions.
Directed nucleation assembly of DNA tile complexes for barcode-patterned lattices
Yan, Hao; LaBean, Thomas H.; Feng, Liping; Reif, John H.
2003-01-01
The programmed self-assembly of patterned aperiodic molecular structures is a major challenge in nanotechnology and has numerous potential applications for nanofabrication of complex structures and useful devices. Here we report the construction of an aperiodic patterned DNA lattice (barcode lattice) by a self-assembly process of directed nucleation of DNA tiles around a scaffold DNA strand. The input DNA scaffold strand, constructed by ligation of shorter synthetic oligonucleotides, provides layers of the DNA lattice with barcode patterning information represented by the presence or absence of DNA hairpin loops protruding out of the lattice plane. Self-assembly of multiple DNA tiles around the scaffold strand was shown to result in a patterned lattice containing barcode information of 01101. We have also demonstrated the reprogramming of the system to another patterning. An inverted barcode pattern of 10010 was achieved by modifying the scaffold strands and one of the strands composing each tile. A ribbon lattice, consisting of repetitions of the barcode pattern with expected periodicity, was also constructed by the addition of sticky ends. The patterning of both classes of lattices was clearly observable via atomic force microscopy. These results represent a step toward implementation of a visual readout system capable of converting information encoded on a 1D DNA strand into a 2D form readable by advanced microscopic techniques. A functioning visual output method would not only increase the readout speed of DNA-based computers, but may also find use in other sequence identification techniques such as mutation or allele mapping. PMID:12821776
Structure solution of DNA-binding proteins and complexes with ARCIMBOLDO libraries
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pröpper, Kevin; Instituto de Biologia Molecular de Barcelona; Meindl, Kathrin
2014-06-01
The structure solution of DNA-binding protein structures and complexes based on the combination of location of DNA-binding protein motif fragments with density modification in a multi-solution frame is described. Protein–DNA interactions play a major role in all aspects of genetic activity within an organism, such as transcription, packaging, rearrangement, replication and repair. The molecular detail of protein–DNA interactions can be best visualized through crystallography, and structures emphasizing insight into the principles of binding and base-sequence recognition are essential to understanding the subtleties of the underlying mechanisms. An increasing number of high-quality DNA-binding protein structure determinations have been witnessed despite themore » fact that the crystallographic particularities of nucleic acids tend to pose specific challenges to methods primarily developed for proteins. Crystallographic structure solution of protein–DNA complexes therefore remains a challenging area that is in need of optimized experimental and computational methods. The potential of the structure-solution program ARCIMBOLDO for the solution of protein–DNA complexes has therefore been assessed. The method is based on the combination of locating small, very accurate fragments using the program Phaser and density modification with the program SHELXE. Whereas for typical proteins main-chain α-helices provide the ideal, almost ubiquitous, small fragments to start searches, in the case of DNA complexes the binding motifs and DNA double helix constitute suitable search fragments. The aim of this work is to provide an effective library of search fragments as well as to determine the optimal ARCIMBOLDO strategy for the solution of this class of structures.« less
A 3D puzzle approach to building protein-DNA structures.
Hinton, Deborah M
2017-03-15
Despite recent advances in structural analysis, it is still challenging to obtain a high-resolution structure for a complex of RNA polymerase, transcriptional factors, and DNA. However, using biochemical constraints, 3D printed models of available structures, and computer modeling, one can build biologically relevant models of such supramolecular complexes.
DMINDA: an integrated web server for DNA motif identification and analyses
Ma, Qin; Zhang, Hanyuan; Mao, Xizeng; Zhou, Chuan; Liu, Bingqiang; Chen, Xin; Xu, Ying
2014-01-01
DMINDA (DNA motif identification and analyses) is an integrated web server for DNA motif identification and analyses, which is accessible at http://csbl.bmb.uga.edu/DMINDA/. This web site is freely available to all users and there is no login requirement. This server provides a suite of cis-regulatory motif analysis functions on DNA sequences, which are important to elucidation of the mechanisms of transcriptional regulation: (i) de novo motif finding for a given set of promoter sequences along with statistical scores for the predicted motifs derived based on information extracted from a control set, (ii) scanning motif instances of a query motif in provided genomic sequences, (iii) motif comparison and clustering of identified motifs, and (iv) co-occurrence analyses of query motifs in given promoter sequences. The server is powered by a backend computer cluster with over 150 computing nodes, and is particularly useful for motif prediction and analyses in prokaryotic genomes. We believe that DMINDA, as a new and comprehensive web server for cis-regulatory motif finding and analyses, will benefit the genomic research community in general and prokaryotic genome researchers in particular. PMID:24753419
Kawano, Tomonori
2013-03-01
There have been a wide variety of approaches for handling the pieces of DNA as the "unplugged" tools for digital information storage and processing, including a series of studies applied to the security-related area, such as DNA-based digital barcodes, water marks and cryptography. In the present article, novel designs of artificial genes as the media for storing the digitally compressed data for images are proposed for bio-computing purpose while natural genes principally encode for proteins. Furthermore, the proposed system allows cryptographical application of DNA through biochemically editable designs with capacity for steganographical numeric data embedment. As a model case of image-coding DNA technique application, numerically and biochemically combined protocols are employed for ciphering the given "passwords" and/or secret numbers using DNA sequences. The "passwords" of interest were decomposed into single letters and translated into the font image coded on the separate DNA chains with both the coding regions in which the images are encoded based on the novel run-length encoding rule, and the non-coding regions designed for biochemical editing and the remodeling processes revealing the hidden orientation of letters composing the original "passwords." The latter processes require the molecular biological tools for digestion and ligation of the fragmented DNA molecules targeting at the polymerase chain reaction-engineered termini of the chains. Lastly, additional protocols for steganographical overwriting of the numeric data of interests over the image-coding DNA are also discussed.
NASA Astrophysics Data System (ADS)
Wadams, Robert Christopher
DNA nanotechnology is one of the most flourishing interdisciplinary research fields. Through the features of programmability and predictability, DNA nanostructures can be designed to self-assemble into a variety of periodic or aperiodic patterns of different shapes and length scales, and more importantly, they can be used as scaffolds for organizing other nanoparticles, proteins and chemical groups. By leveraging these molecules, DNA nanostructures can be used to direct the organization of complex bio-inspired materials that may serve as smart drug delivery systems and in vitro or in vivo bio-molecular computing and diagnostic devices. In this dissertation I describe a systematic study of the thermodynamic properties of complex DNA nanostructures, including 2D and 3D DNA origami, in order to understand their assembly, stability and functionality and inform future design endeavors. It is conceivable that a more thorough understanding of DNA self-assembly can be used to guide the structural design process and optimize the conditions for assembly, manipulation, and functionalization, thus benefiting both upstream design and downstream applications. As a biocompatible nanoscale motif, the successful integration, stabilization and separation of DNA nanostructures from cells/cell lysate suggests its potential to serve as a diagnostic platform at the cellular level. Here, DNA origami was used to capture and identify multiple T cell receptor mRNA species from single cells within a mixed cell population. This demonstrates the potential of DNA nanostructure as an ideal nano scale tool for biological applications.
Gener: a minimal programming module for chemical controllers based on DNA strand displacement
Kahramanoğulları, Ozan; Cardelli, Luca
2015-01-01
Summary: Gener is a development module for programming chemical controllers based on DNA strand displacement. Gener is developed with the aim of providing a simple interface that minimizes the opportunities for programming errors: Gener allows the user to test the computations of the DNA programs based on a simple two-domain strand displacement algebra, the minimal available so far. The tool allows the user to perform stepwise computations with respect to the rules of the algebra as well as exhaustive search of the computation space with different options for exploration and visualization. Gener can be used in combination with existing tools, and in particular, its programs can be exported to Microsoft Research’s DSD tool as well as to LaTeX. Availability and implementation: Gener is available for download at the Cosbi website at http://www.cosbi.eu/research/prototypes/gener as a windows executable that can be run on Mac OS X and Linux by using Mono. Contact: ozan@cosbi.eu PMID:25957353
MIGS-GPU: Microarray Image Gridding and Segmentation on the GPU.
Katsigiannis, Stamos; Zacharia, Eleni; Maroulis, Dimitris
2017-05-01
Complementary DNA (cDNA) microarray is a powerful tool for simultaneously studying the expression level of thousands of genes. Nevertheless, the analysis of microarray images remains an arduous and challenging task due to the poor quality of the images that often suffer from noise, artifacts, and uneven background. In this study, the MIGS-GPU [Microarray Image Gridding and Segmentation on Graphics Processing Unit (GPU)] software for gridding and segmenting microarray images is presented. MIGS-GPU's computations are performed on the GPU by means of the compute unified device architecture (CUDA) in order to achieve fast performance and increase the utilization of available system resources. Evaluation on both real and synthetic cDNA microarray images showed that MIGS-GPU provides better performance than state-of-the-art alternatives, while the proposed GPU implementation achieves significantly lower computational times compared to the respective CPU approaches. Consequently, MIGS-GPU can be an advantageous and useful tool for biomedical laboratories, offering a user-friendly interface that requires minimum input in order to run.
Gener: a minimal programming module for chemical controllers based on DNA strand displacement.
Kahramanoğulları, Ozan; Cardelli, Luca
2015-09-01
: Gener is a development module for programming chemical controllers based on DNA strand displacement. Gener is developed with the aim of providing a simple interface that minimizes the opportunities for programming errors: Gener allows the user to test the computations of the DNA programs based on a simple two-domain strand displacement algebra, the minimal available so far. The tool allows the user to perform stepwise computations with respect to the rules of the algebra as well as exhaustive search of the computation space with different options for exploration and visualization. Gener can be used in combination with existing tools, and in particular, its programs can be exported to Microsoft Research's DSD tool as well as to LaTeX. Gener is available for download at the Cosbi website at http://www.cosbi.eu/research/prototypes/gener as a windows executable that can be run on Mac OS X and Linux by using Mono. ozan@cosbi.eu. © The Author 2015. Published by Oxford University Press.
Effective Design of Multifunctional Peptides by Combining Compatible Functions
Diener, Christian; Garza Ramos Martínez, Georgina; Moreno Blas, Daniel; Castillo González, David A.; Corzo, Gerardo; Castro-Obregon, Susana; Del Rio, Gabriel
2016-01-01
Multifunctionality is a common trait of many natural proteins and peptides, yet the rules to generate such multifunctionality remain unclear. We propose that the rules defining some protein/peptide functions are compatible. To explore this hypothesis, we trained a computational method to predict cell-penetrating peptides at the sequence level and learned that antimicrobial peptides and DNA-binding proteins are compatible with the rules of our predictor. Based on this finding, we expected that designing peptides for CPP activity may render AMP and DNA-binding activities. To test this prediction, we designed peptides that embedded two independent functional domains (nuclear localization and yeast pheromone activity), linked by optimizing their composition to fit the rules characterizing cell-penetrating peptides. These peptides presented effective cell penetration, DNA-binding, pheromone and antimicrobial activities, thus confirming the effectiveness of our computational approach to design multifunctional peptides with potential therapeutic uses. Our computational implementation is available at http://bis.ifc.unam.mx/en/software/dcf. PMID:27096600
Computational Approach to Explore the B/A Junction Free Energy in DNA.
Kulkarni, Mandar; Mukherjee, Arnab
2016-01-04
Protein-DNA interactions induce conformational changes in DNA such as B- to A-form transitions at a local level. Such transitions are associated with a junction free energy cost at the boundary of two different conformations in a DNA molecule. In this study, we performed umbrella sampling simulations to find the free energy values of the B-A transition at the dinucleotide and trinucleotide level of DNA. Using a combination of dinucleotide and trinucleotide free energy costs obtained from simulations, we calculated the B/A junction free energy. Our study shows that the B/A junction free energy is 0.52 kcal mol(-1) for the A-philic GG step and 1.59 kcal mol(-1) for the B-philic AA step. This observation is in agreement with experimentally derived values. After excluding junction effects, we obtained an absolute free energy cost for the B- to A-form conversion for all the dinucleotide steps. These absolute free energies may be used for predicting the propensity of structural transitions in DNA. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
A Reversible DNA Logic Gate Platform Operated by One- and Two-Photon Excitations.
Tam, Dick Yan; Dai, Ziwen; Chan, Miu Shan; Liu, Ling Sum; Cheung, Man Ching; Bolze, Frederic; Tin, Chung; Lo, Pik Kwan
2016-01-04
We demonstrate the use of two different wavelength ranges of excitation light as inputs to remotely trigger the responses of the self-assembled DNA devices (D-OR). As an important feature of this device, the dependence of the readout fluorescent signals on the two external inputs, UV excitation for 1 min and/or near infrared irradiation (NIR) at 800 nm fs laser pulses, can mimic function of signal communication in OR logic gates. Their operations could be reset easily to its initial state. Furthermore, these DNA devices exhibit efficient cellular uptake, low cytotoxicity, and high bio-stability in different cell lines. They are considered as the first example of a photo-responsive DNA logic gate system, as well as a biocompatible, multi-wavelength excited system in response to UV and NIR. This is an important step to explore the concept of photo-responsive DNA-based systems as versatile tools in DNA computing, display devices, optical communication, and biology. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
DNA nanotechnology: understanding and optimisation through simulation
NASA Astrophysics Data System (ADS)
Ouldridge, Thomas E.
2015-01-01
DNA nanotechnology promises to provide controllable self-assembly on the nanoscale, allowing for the design of static structures, dynamic machines and computational architectures. In this article, I review the state-of-the art of DNA nanotechnology, highlighting the need for a more detailed understanding of the key processes, both in terms of theoretical modelling and experimental characterisation. I then consider coarse-grained models of DNA, mesoscale descriptions that have the potential to provide great insight into the operation of DNA nanotechnology if they are well designed. In particular, I discuss a number of nanotechnological systems that have been studied with oxDNA, a recently developed coarse-grained model, highlighting the subtle interplay of kinetic, thermodynamic and mechanical factors that can determine behaviour. Finally, new results highlighting the importance of mechanical tension in the operation of a two-footed walker are presented, demonstrating that recovery from an unintended 'overstepped' configuration can be accelerated by three to four orders of magnitude by application of a moderate tension to the walker's track. More generally, the walker illustrates the possibility of biasing strand-displacement processes to affect the overall rate.
Monte Carlo approach in assessing damage in higher order structures of DNA
NASA Technical Reports Server (NTRS)
Chatterjee, A.; Schmidt, J. B.; Holley, W. R.
1994-01-01
We have developed a computer monitor of nuclear DNA in the form of chromatin fibre. The fibres are modeled as a ideal solenoid consisting of twenty helical turns with six nucleosomes per turn. The chromatin model, in combination with are Monte Carlo theory of radiation damage induces by charged particles, based on general features of tack structure and stopping power theory, has been used to evaluate the influence of DNA structure on initial damage. An interesting has emerged from our calculations. Our calculated results predict the existence of strong spatial correlations in damage sites associated with the symmetries in the solenoidal model. We have calculated spectra of short fragments of double stranded DNA produced by multiple double strand breaks induced by both high and low LET radiation. The spectra exhibit peaks at multiples of approximately 85 base pairs (the nucleosome periodicity), and approximately 1000 base pairs (solenoid periodicity). Preliminary experiments to investigate the fragment distributions from irradiated DNA, made by B. Rydberg at Lawrence Berkeley Laboratory, confirm the existence of short DNA fragments and are in substantial agreement with the predictions of our theory.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Takeshita, Takayuki; Okamoto, Masami
The hydroxyapatite (HA) formation on the surface of DNA molecules in simulated body fluid (SBF) was examined. The osteoconductivity is estimated using SBF having ion concentrations approximately equal to those of human blood plasma. After immersion for 4 weeks in SBF at 36.5 °C, the HA crystallites possessing 1-14 micrometer in diameter grew on the surface of DNA molecules. The leaf flake-like and spherical shapes morphologies were observed through scanning electron microscopy analysis. Original peaks of both of DNA and HA were characterized by fourier transform infrared spectroscopy. The Ca/P ratio (1.1-1.5) in HA was estimated by energy dispersive X-raymore » analysis. After biomineralization, the calculated weight ratio of DNA/HA was 18/82 by thermogravimetry/differential thermal analysis. The molecular orbital computer simulation has been used to probe the interaction of DNA with two charge-balancing ions, CaOH{sup +} and CaH{sub 2}PO{sub 4}{sup +}. The adsorption enthalpy of the two ions on DNA having negative value was the evidence for the interface in mineralization of HA in SBF.« less
Energy barriers and rates of tautomeric transitions in DNA bases: ab initio quantum chemical study.
Basu, Soumalee; Majumdar, Rabi; Das, Gourab K; Bhattacharyya, Dhananjay
2005-12-01
Tautomeric transitions of DNA bases are proton transfer reactions, which are important in biology. These reactions are involved in spontaneous point mutations of the genetic material. In the present study, intrinsic reaction coordinates (IRC) analyses through ab initio quantum chemical calculations have been carried out for the individual DNA bases A, T, G, C and also A:T and G:C base pairs to estimate the kinetic and thermodynamic barriers using MP2/6-31G** method for tautomeric transitions. Relatively higher values of kinetic barriers (about 50-60 kcal/mol) have been observed for the single bases, indicating that tautomeric alterations of isolated single bases are quite unlikely. On the other hand, relatively lower values of the kinetic barriers (about 20-25 kcal/mol) for the DNA base pairs A:T and G:C clearly suggest that the tautomeric shifts are much more favorable in DNA base pairs than in isolated single bases. The unusual base pairing A':C, T':G, C':A or G':T in the daughter DNA molecule, resulting from a parent DNA molecule with tautomeric shifts, is found to be stable enough to result in a mutation. The transition rate constants for the single DNA bases in addition to the base pairs are also calculated by computing the free energy differences between the transition states and the reactants.
Transition between B-DNA and Z-DNA: free energy landscape for the B-Z junction propagation.
Lee, Juyong; Kim, Yang-Gyun; Kim, Kyeong Kyu; Seok, Chaok
2010-08-05
Canonical, right-handed B-DNA can be transformed into noncanonical, left-handed Z-DNA in vitro at high salt concentrations or in vivo under physiological conditions. The molecular mechanism of this drastic conformational transition is still unknown despite numerous studies. Inspired by the crystal structure of a B-Z junction and the previous zipper model, we show here, with the aid of molecular dynamics simulations, that a stepwise propagation of a B-Z junction is a highly probable pathway for the B-Z transition. In this paper, the movement of a B-Z junction by a two-base-pair step in a double-strand nonamer, [d(GpCpGpCpGpCpGpCpG)](2), is considered. Targeted molecular dynamics simulations and umbrella sampling for this transition resulted in a transition pathway with a free energy barrier of 13 kcal/mol. This barrier is much more favorable than those obtained from previous atomistic simulations that lead to concerted transitions of the whole strands. The free energy difference between B-DNA and Z-DNA evaluated from our simulation is 0.9 kcal/mol per dinucleotide unit, which is consistent with previous experiments. The current computation thus strongly supports the proposal that the B-Z transition involves a relatively fast extension of B-DNA or Z-DNA by sequential propagation of B-Z junctions once nucleation of junctions is established.
Litwin, S; Shahn, E; Kozinski, A W
1969-07-01
Mass distribution in a sucrose gradient of deoxyribonucleic acid (DNA) fragments arising as a result of random breaks is predicted by analytical means from which computer evaluations are plotted. The analytical results are compared with the results of verifying experiments: (i) a Monte Carlo computer experiment in which simulated molecules of DNA were individuals of unit length subjected to random "breaks" applied by a random number generator, and (ii) an in vitro experiment in which molecules of T4 DNA, highly labeled with (32)P, were stored in liquid nitrogen for variable periods of time during which a precisely known number of (32)P atoms decayed, causing single-stranded breaks. The distribution of sizes of the resulting fragments was measured in an alkaline sucrose gradient. The profiles obtained in this fashion were compared with the mathematical predictions. Both experiments agree with the analytical approach and thus permit the use of the graphs obtained from the latter as a means of determining the average number of random breaks in DNA from distributions obtained experimentally in a sucrose gradient. An example of the application of this procedure to a previously unresolved problem is provided in the case of DNA from ultraviolet-irradiated phage which undergoes a dose-dependent intracellular breakdown. The relationship between the number of lethal hits and the number of single-stranded breaks was not previously established. A comparison of the calculated number of nicks per strand of DNA with the known dose in phage-lethal hits reveals a relationship closely approximating one lethal hit to one single-stranded break.
Tumor purity and differential methylation in cancer epigenomics.
Wang, Fayou; Zhang, Naiqian; Wang, Jun; Wu, Hao; Zheng, Xiaoqi
2016-11-01
DNA methylation is an epigenetic modification of DNA molecule that plays a vital role in gene expression regulation. It is not only involved in many basic biological processes, but also considered an important factor for tumorigenesis and other human diseases. Study of DNA methylation has been an active field in cancer epigenomics research. With the advances of high-throughput technologies and the accumulation of enormous amount of data, method development for analyzing these data has gained tremendous interests in the fields of computational biology and bioinformatics. In this review, we systematically summarize the recent developments of computational methods and software tools in high-throughput methylation data analysis with focus on two aspects: differential methylation analysis and tumor purity estimation in cancer studies. © The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.
DNA-binding specificity prediction with FoldX.
Nadra, Alejandro D; Serrano, Luis; Alibés, Andreu
2011-01-01
With the advent of Synthetic Biology, a field between basic science and applied engineering, new computational tools are needed to help scientists reach their goal, their design, optimizing resources. In this chapter, we present a simple and powerful method to either know the DNA specificity of a wild-type protein or design new specificities by using the protein design algorithm FoldX. The only basic requirement is having a good resolution structure of the complex. Protein-DNA interaction design may aid the development of new parts designed to be orthogonal, decoupled, and precise in its target. Further, it could help to fine-tune the systems in terms of specificity, discrimination, and binding constants. In the age of newly developed devices and invented systems, computer-aided engineering promises to be an invaluable tool. Copyright © 2011 Elsevier Inc. All rights reserved.
A novel model for DNA sequence similarity analysis based on graph theory.
Qi, Xingqin; Wu, Qin; Zhang, Yusen; Fuller, Eddie; Zhang, Cun-Quan
2011-01-01
Determination of sequence similarity is one of the major steps in computational phylogenetic studies. As we know, during evolutionary history, not only DNA mutations for individual nucleotide but also subsequent rearrangements occurred. It has been one of major tasks of computational biologists to develop novel mathematical descriptors for similarity analysis such that various mutation phenomena information would be involved simultaneously. In this paper, different from traditional methods (eg, nucleotide frequency, geometric representations) as bases for construction of mathematical descriptors, we construct novel mathematical descriptors based on graph theory. In particular, for each DNA sequence, we will set up a weighted directed graph. The adjacency matrix of the directed graph will be used to induce a representative vector for DNA sequence. This new approach measures similarity based on both ordering and frequency of nucleotides so that much more information is involved. As an application, the method is tested on a set of 0.9-kb mtDNA sequences of twelve different primate species. All output phylogenetic trees with various distance estimations have the same topology, and are generally consistent with the reported results from early studies, which proves the new method's efficiency; we also test the new method on a simulated data set, which shows our new method performs better than traditional global alignment method when subsequent rearrangements happen frequently during evolutionary history.
Recurrence time statistics: versatile tools for genomic DNA sequence analysis.
Cao, Yinhe; Tung, Wen-Wen; Gao, J B
2004-01-01
With the completion of the human and a few model organisms' genomes, and the genomes of many other organisms waiting to be sequenced, it has become increasingly important to develop faster computational tools which are capable of easily identifying the structures and extracting features from DNA sequences. One of the more important structures in a DNA sequence is repeat-related. Often they have to be masked before protein coding regions along a DNA sequence are to be identified or redundant expressed sequence tags (ESTs) are to be sequenced. Here we report a novel recurrence time based method for sequence analysis. The method can conveniently study all kinds of periodicity and exhaustively find all repeat-related features from a genomic DNA sequence. An efficient codon index is also derived from the recurrence time statistics, which has the salient features of being largely species-independent and working well on very short sequences. Efficient codon indices are key elements of successful gene finding algorithms, and are particularly useful for determining whether a suspected EST belongs to a coding or non-coding region. We illustrate the power of the method by studying the genomes of E. coli, the yeast S. cervisivae, the nematode worm C. elegans, and the human, Homo sapiens. Computationally, our method is very efficient. It allows us to carry out analysis of genomes on the whole genomic scale by a PC.
Spreadsheet-based program for alignment of overlapping DNA sequences.
Anbazhagan, R; Gabrielson, E
1999-06-01
Molecular biology laboratories frequently face the challenge of aligning small overlapping DNA sequences derived from a long DNA segment. Here, we present a short program that can be used to adapt Excel spreadsheets as a tool for aligning DNA sequences, regardless of their orientation. The program runs on any Windows or Macintosh operating system computer with Excel 97 or Excel 98. The program is available for use as an Excel file, which can be downloaded from the BioTechniques Web site. Upon execution, the program opens a specially designed customized workbook and is capable of identifying overlapping regions between two sequence fragments and displaying the sequence alignment. It also performs a number of specialized functions such as recognition of restriction enzyme cutting sites and CpG island mapping without costly specialized software.
Bharti, Sanjay Kumar; Sommers, Joshua A.; Zhou, Jun; Kaplan, Daniel L.; Spelbrink, Johannes N.; Mergny, Jean-Louis; Brosh, Robert M.
2014-01-01
Mitochondrial DNA deletions are prominent in human genetic disorders, cancer, and aging. It is thought that stalling of the mitochondrial replication machinery during DNA synthesis is a prominent source of mitochondrial genome instability; however, the precise molecular determinants of defective mitochondrial replication are not well understood. In this work, we performed a computational analysis of the human mitochondrial genome using the “Pattern Finder” G-quadruplex (G4) predictor algorithm to assess whether G4-forming sequences reside in close proximity (within 20 base pairs) to known mitochondrial DNA deletion breakpoints. We then used this information to map G4P sequences with deletions characteristic of representative mitochondrial genetic disorders and also those identified in various cancers and aging. Circular dichroism and UV spectral analysis demonstrated that mitochondrial G-rich sequences near deletion breakpoints prevalent in human disease form G-quadruplex DNA structures. A biochemical analysis of purified recombinant human Twinkle protein (gene product of c10orf2) showed that the mitochondrial replicative helicase inefficiently unwinds well characterized intermolecular and intramolecular G-quadruplex DNA substrates, as well as a unimolecular G4 substrate derived from a mitochondrial sequence that nests a deletion breakpoint described in human renal cell carcinoma. Although G4 has been implicated in the initiation of mitochondrial DNA replication, our current findings suggest that mitochondrial G-quadruplexes are also likely to be a source of instability for the mitochondrial genome by perturbing the normal progression of the mitochondrial replication machinery, including DNA unwinding by Twinkle helicase. PMID:25193669
Methylation analyses in liquid biopsy
Lissa, Delphine
2016-01-01
Lung cancer is the leading cause of cancer-related deaths worldwide. Recent implementation of low-dose computed tomography (LDCT) screening is predicted to lead to diagnosis of lung cancer at an earlier stage, with survival benefit. However, there is still a pressing need for biomarkers that will identify individuals eligible for screening, as well as improve the diagnostic accuracy of LDCT. In addition, biomarkers for prognostic stratification of patients with early stage disease, and those that can be used as surrogates to monitor tumor evolution, will greatly improve clinical management. Molecular alterations found in the DNA of tumor cells, such as mutations, translocations and methylation, are reflected in DNA that is released from the tumor into the bloodstream. Thus, in recent years, circulating tumor DNA (ctDNA) has gained increasing attention as a noninvasive alternative to tissue biopsies and potential surrogate for the entire tumor genome. Activating gene mutations found in ctDNA have been proven effective in predicting response to targeted therapy. Analysis of ctDNA is also a valuable tool for longitudinal follow-up of cancer patients that does not require serial biopsies and may anticipate the acquisition of resistance. DNA methylation has also emerged as a promising marker for early detection, prognosis and real-time follow-up of tumor dynamics that is independent of the genomic composition of the primary tumor. This review summarizes the various investigational applications of methylated ctDNA in lung cancer reported to date. It also provides a brief overview of the technologies for analysis of DNA methylation in liquid biopsies, and the challenges that befall the implementation of methylated ctDNA into routine clinical practice. PMID:27826530
Vermaak, Danielle; Bayes, Joshua J.
2009-01-01
Comparative genomics provides a facile way to address issues of evolutionary constraint acting on different elements of the genome. However, several important DNA elements have not reaped the benefits of this new approach. Some have proved intractable to current day sequencing technology. These include centromeric and heterochromatic DNA, which are essential for chromosome segregation as well as gene regulation, but the highly repetitive nature of the DNA sequences in these regions make them difficult to assemble into longer contigs. Other sequences, like dosage compensation X chromosomal sites, origins of DNA replication, or heterochromatic sequences that encode piwi-associated RNAs, have proved difficult to study because they do not have recognizable DNA features that allow them to be described functionally or computationally. We have employed an alternate approach to the direct study of these DNA elements. By using proteins that specifically bind these noncoding DNAs as surrogates, we can indirectly assay the evolutionary constraints acting on these important DNA elements. We review the impact that such “surrogate strategies” have had on our understanding of the evolutionary constraints shaping centromeres, origins of DNA replication, and dosage compensation X chromosomal sites. These have begun to reveal that in contrast to the view that such structural DNA elements are either highly constrained (under purifying selection) or free to drift (under neutral evolution), some of them may instead be shaped by adaptive evolution and genetic conflicts (these are not mutually exclusive). These insights also help to explain why the same elements (e.g., centromeres and replication origins), which are so complex in some eukaryotic genomes, can be simple and well defined in other where similar conflicts do not exist. PMID:19635763
The Effect of Basepair Mismatch on DNA Strand Displacement.
Broadwater, D W Bo; Kim, Harold D
2016-04-12
DNA strand displacement is a key reaction in DNA homologous recombination and DNA mismatch repair and is also heavily utilized in DNA-based computation and locomotion. Despite its ubiquity in science and engineering, sequence-dependent effects of displacement kinetics have not been extensively characterized. Here, we measured toehold-mediated strand displacement kinetics using single-molecule fluorescence in the presence of a single basepair mismatch. The apparent displacement rate varied significantly when the mismatch was introduced in the invading DNA strand. The rate generally decreased as the mismatch in the invader was encountered earlier in displacement. Our data indicate that a single base pair mismatch in the invader stalls branch migration and displacement occurs via direct dissociation of the destabilized incumbent strand from the substrate strand. We combined both branch migration and direct dissociation into a model, which we term the concurrent displacement model, and used the first passage time approach to quantitatively explain the salient features of the observed relationship. We also introduce the concept of splitting probabilities to justify that the concurrent model can be simplified into a three-step sequential model in the presence of an invader mismatch. We expect our model to become a powerful tool to design DNA-based reaction schemes with broad functionality. Copyright © 2016 Biophysical Society. Published by Elsevier Inc. All rights reserved.
DNA dynamics in aqueous solution: opening the double helix
NASA Technical Reports Server (NTRS)
Pohorille, A.; Ross, W. S.; Tinoco, I. Jr; MacElroy, R. D. (Principal Investigator)
1990-01-01
The opening of a DNA base pair is a simple reaction that is a prerequisite for replication, transcription, and other vital biological functions. Understanding the molecular mechanisms of biological reactions is crucial for predicting and, ultimately, controlling them. Realistic computer simulations of the reactions can provide the needed understanding. To model even the simplest reaction in aqueous solution requires hundreds of hours of supercomputing time. We have used molecular dynamics techniques to simulate fraying of the ends of a six base pair double strand of DNA, [TCGCGA]2, where the four bases of DNA are denoted by T (thymine), C (cytosine), G (guanine), and A (adenine), and to estimate the free energy barrier to this process. The calculations, in which the DNA was surrounded by 2,594 water molecules, required 50 hours of CRAY-2 CPU time for every simulated 100 picoseconds. A free energy barrier to fraying, which is mainly characterized by the movement of adenine away from thymine into aqueous environment, was estimated to be 4 kcal/mol. Another fraying pathway, which leads to stacking between terminal adenine and thymine, was also observed. These detailed pictures of the motions and energetics of DNA base pair opening in water are a first step toward understanding how DNA will interact with any molecule.
Measurement of inelastic cross sections for low-energy electron scattering from DNA bases.
Michaud, Marc; Bazin, Marc; Sanche, Léon
2012-01-01
To determine experimentally the absolute cross sections (CS) to deposit various amount of energies into DNA bases by low-energy electron (LEE) impact. Electron energy loss (EEL) spectra of DNA bases were recorded for different LEE impact energies on the molecules deposited at very low coverage on an inert argon (Ar) substrate. Following their normalisation to the effective incident electron current and molecular surface number density, the EEL spectra were then fitted with multiple Gaussian functions in order to delimit the various excitation energy regions. The CS to excite a molecule into its various excitation modes were finally obtained from computing the area under the corresponding Gaussians. The EEL spectra and absolute CS for the electronic excitations of pyrimidine and the DNA bases thymine, adenine, and cytosine by electron impacts below 18 eV were reported for the molecules deposited at about monolayer coverage on a solid Ar substrate. The CS for electronic excitations of DNA bases by LEE impact were found to lie within the 10(216) to 10(218) cm(2) range. The large value of the total ionisation CS indicated that ionisation of DNA bases by LEE is an important dissipative process via which ionising radiation degrades and is absorbed in DNA.
Measurement of inelastic cross sections for low-energy electron scattering from DNA bases
Michaud, Marc; Bazin, Marc.; Sanche, Léon
2013-01-01
Purpose Determine experimentally the absolute cross sections (CS) to deposit various amount of energies into DNA bases by low-energy electron (LEE) impact. Materials and methods Electron energy loss (EEL) spectra of DNA bases are recorded for different LEE impact energies on the molecules deposited at very low coverage on an inert argon (Ar) substrate. Following their normalisation to the effective incident electron current and molecular surface number density, the EEL spectra are then fitted with multiple Gaussian functions in order to delimit the various excitation energy regions. The CS to excite a molecule into its various excitation modes are finally obtained from computing the area under the corresponding Gaussians. Results The EEL spectra and absolute CS for the electronic excitations of pyrimidine and the DNA bases thymine, adenine, and cytosine by electron impacts below 18 eV are reported for the molecules deposited at about monolayer coverage on a solid Ar substrate. Conclusions The CS for electronic excitations of DNA bases by LEE impact are found to lie within the 10−16 – 10−18 cm2 range. The large value of the total ionisation CS indicates that ionisation of DNA bases by LEE is an important dissipative process via which ionising radiation degrades and is absorbed in DNA. PMID:21615242
Diagnosis of skin cancer by correlation and complexity analyses of damaged DNA
Namazi, Hamidreza; Kulish, Vladimir V.; Delaviz, Fatemeh; Delaviz, Ali
2015-01-01
Skin cancer is a common, low-grade cancerous (malignant) growth of the skin. It starts from cells that begin as normal skin cells and transform into those with the potential to reproduce in an out-of-control manner. Cancer develops when DNA, the molecule found in cells that encodes genetic information, becomes damaged and the body cannot repair the damage. A DNA walk of a genome represents how the frequency of each nucleotide of a pairing nucleotide couple changes locally. In this research in order to diagnose the skin cancer, first DNA walk plots of genomes of patients with skin cancer were generated. Then, the data so obtained was checked for complexity by computing the fractal dimension. Furthermore, the Hurst exponent has been employed in order to study the correlation of damaged DNA. By analysing different samples it has been found that the damaged DNA sequences are exhibiting higher degree of complexity and less correlation compared to normal DNA sequences. This investigation confirms that this method can be used for diagnosis of skin cancer. The method discussed in this research is useful not only for diagnosis of skin cancer but can be applied for diagnosis and growth analysis of different types of cancers. PMID:26497203
Utro, Filippo; Di Benedetto, Valeria; Corona, Davide F V; Giancarlo, Raffaele
2016-03-15
Thanks to research spanning nearly 30 years, two major models have emerged that account for nucleosome organization in chromatin: statistical and sequence specific. The first is based on elegant, easy to compute, closed-form mathematical formulas that make no assumptions of the physical and chemical properties of the underlying DNA sequence. Moreover, they need no training on the data for their computation. The latter is based on some sequence regularities but, as opposed to the statistical model, it lacks the same type of closed-form formulas that, in this case, should be based on the DNA sequence only. We contribute to close this important methodological gap between the two models by providing three very simple formulas for the sequence specific one. They are all based on well-known formulas in Computer Science and Bioinformatics, and they give different quantifications of how complex a sequence is. In view of how remarkably well they perform, it is very surprising that measures of sequence complexity have not even been considered as candidates to close the mentioned gap. We provide experimental evidence that the intrinsic level of combinatorial organization and information-theoretic content of subsequences within a genome are strongly correlated to the level of DNA encoded nucleosome organization discovered by Kaplan et al Our results establish an important connection between the intrinsic complexity of subsequences in a genome and the intrinsic, i.e. DNA encoded, nucleosome organization of eukaryotic genomes. It is a first step towards a mathematical characterization of this latter 'encoding'. Supplementary data are available at Bioinformatics online. futro@us.ibm.com. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Hohenstein, Edward G; Parrish, Robert M; Sherrill, C David; Turney, Justin M; Schaefer, Henry F
2011-11-07
Symmetry-adapted perturbation theory (SAPT) provides a means of probing the fundamental nature of intermolecular interactions. Low-orders of SAPT (here, SAPT0) are especially attractive since they provide qualitative (sometimes quantitative) results while remaining tractable for large systems. The application of density fitting and Laplace transformation techniques to SAPT0 can significantly reduce the expense associated with these computations and make even larger systems accessible. We present new factorizations of the SAPT0 equations with density-fitted two-electron integrals and the first application of Laplace transformations of energy denominators to SAPT. The improved scalability of the DF-SAPT0 implementation allows it to be applied to systems with more than 200 atoms and 2800 basis functions. The Laplace-transformed energy denominators are compared to analogous partial Cholesky decompositions of the energy denominator tensor. Application of our new DF-SAPT0 program to the intercalation of DNA by proflavine has allowed us to determine the nature of the proflavine-DNA interaction. Overall, the proflavine-DNA interaction contains important contributions from both electrostatics and dispersion. The energetics of the intercalator interaction are are dominated by the stacking interactions (two-thirds of the total), but contain important contributions from the intercalator-backbone interactions. It is hypothesized that the geometry of the complex will be determined by the interactions of the intercalator with the backbone, because by shifting toward one side of the backbone, the intercalator can form two long hydrogen-bonding type interactions. The long-range interactions between the intercalator and the next-nearest base pairs appear to be negligible, justifying the use of truncated DNA models in computational studies of intercalation interaction energies.
NASA Astrophysics Data System (ADS)
Hohenstein, Edward G.; Parrish, Robert M.; Sherrill, C. David; Turney, Justin M.; Schaefer, Henry F.
2011-11-01
Symmetry-adapted perturbation theory (SAPT) provides a means of probing the fundamental nature of intermolecular interactions. Low-orders of SAPT (here, SAPT0) are especially attractive since they provide qualitative (sometimes quantitative) results while remaining tractable for large systems. The application of density fitting and Laplace transformation techniques to SAPT0 can significantly reduce the expense associated with these computations and make even larger systems accessible. We present new factorizations of the SAPT0 equations with density-fitted two-electron integrals and the first application of Laplace transformations of energy denominators to SAPT. The improved scalability of the DF-SAPT0 implementation allows it to be applied to systems with more than 200 atoms and 2800 basis functions. The Laplace-transformed energy denominators are compared to analogous partial Cholesky decompositions of the energy denominator tensor. Application of our new DF-SAPT0 program to the intercalation of DNA by proflavine has allowed us to determine the nature of the proflavine-DNA interaction. Overall, the proflavine-DNA interaction contains important contributions from both electrostatics and dispersion. The energetics of the intercalator interaction are are dominated by the stacking interactions (two-thirds of the total), but contain important contributions from the intercalator-backbone interactions. It is hypothesized that the geometry of the complex will be determined by the interactions of the intercalator with the backbone, because by shifting toward one side of the backbone, the intercalator can form two long hydrogen-bonding type interactions. The long-range interactions between the intercalator and the next-nearest base pairs appear to be negligible, justifying the use of truncated DNA models in computational studies of intercalation interaction energies.
Liu, Bin; Liu, Fule; Fang, Longyun; Wang, Xiaolong; Chou, Kuo-Chen
2015-04-15
In order to develop powerful computational predictors for identifying the biological features or attributes of DNAs, one of the most challenging problems is to find a suitable approach to effectively represent the DNA sequences. To facilitate the studies of DNAs and nucleotides, we developed a Python package called representations of DNAs (repDNA) for generating the widely used features reflecting the physicochemical properties and sequence-order effects of DNAs and nucleotides. There are three feature groups composed of 15 features. The first group calculates three nucleic acid composition features describing the local sequence information by means of kmers; the second group calculates six autocorrelation features describing the level of correlation between two oligonucleotides along a DNA sequence in terms of their specific physicochemical properties; the third group calculates six pseudo nucleotide composition features, which can be used to represent a DNA sequence with a discrete model or vector yet still keep considerable sequence-order information via the physicochemical properties of its constituent oligonucleotides. In addition, these features can be easily calculated based on both the built-in and user-defined properties via using repDNA. The repDNA Python package is freely accessible to the public at http://bioinformatics.hitsz.edu.cn/repDNA/. bliu@insun.hit.edu.cn or kcchou@gordonlifescience.org Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Kawano, Tomonori
2013-01-01
There have been a wide variety of approaches for handling the pieces of DNA as the “unplugged” tools for digital information storage and processing, including a series of studies applied to the security-related area, such as DNA-based digital barcodes, water marks and cryptography. In the present article, novel designs of artificial genes as the media for storing the digitally compressed data for images are proposed for bio-computing purpose while natural genes principally encode for proteins. Furthermore, the proposed system allows cryptographical application of DNA through biochemically editable designs with capacity for steganographical numeric data embedment. As a model case of image-coding DNA technique application, numerically and biochemically combined protocols are employed for ciphering the given “passwords” and/or secret numbers using DNA sequences. The “passwords” of interest were decomposed into single letters and translated into the font image coded on the separate DNA chains with both the coding regions in which the images are encoded based on the novel run-length encoding rule, and the non-coding regions designed for biochemical editing and the remodeling processes revealing the hidden orientation of letters composing the original “passwords.” The latter processes require the molecular biological tools for digestion and ligation of the fragmented DNA molecules targeting at the polymerase chain reaction-engineered termini of the chains. Lastly, additional protocols for steganographical overwriting of the numeric data of interests over the image-coding DNA are also discussed. PMID:23750303
Guo, Yan; Cai, Qiuyin; Samuels, David C; Ye, Fei; Long, Jirong; Li, Chung-I; Winther, Jeanette F; Tawn, E Janet; Stovall, Marilyn; Lähteenmäki, Päivi; Malila, Nea; Levy, Shawn; Shaffer, Christian; Shyr, Yu; Shu, Xiao-Ou; Boice, John D
2012-05-15
The human mitochondrial genome has an exclusively maternal mode of inheritance. Mitochondrial DNA (mtDNA) is particularly vulnerable to environmental insults due in part to an underdeveloped DNA repair system, limited to base excision and homologous recombination repair. Radiation exposure to the ovaries may cause mtDNA mutations in oocytes, which may in turn be transmitted to offspring. We hypothesized that the children of female cancer survivors who received radiation therapy may have an increased rate of mtDNA heteroplasmy mutations, which conceivably could increase their risk of developing cancer and other diseases. We evaluated 44 DNA blood samples from 17 Danish and 1 Finnish families (18 mothers and 26 children). All mothers had been treated for cancer as children and radiation doses to their ovaries were determined based on medical records and computational models. DNA samples were sequenced for the entire mitochondrial genome using the Illumina GAII system. Mother's age at sample collection was positively correlated with mtDNA heteroplasmy mutations. There was evidence of heteroplasmy inheritance in that 9 of the 18 families had at least one child who inherited at least one heteroplasmy site from his or her mother. No significant difference in single nucleotide polymorphisms between mother and offspring, however, was observed. Radiation therapy dose to ovaries also was not significantly associated with the heteroplasmy mutation rate among mothers and children. No evidence was found that radiotherapy for pediatric cancer is associated with the mitochondrial genome mutation rate in female cancer survivors and their children. Copyright © 2012 Elsevier B.V. All rights reserved.
Versatile RNA tetra-U helix linking motif as a toolkit for nucleic acid nanotechnology.
Bui, My N; Brittany Johnson, M; Viard, Mathias; Satterwhite, Emily; Martins, Angelica N; Li, Zhihai; Marriott, Ian; Afonin, Kirill A; Khisamutdinov, Emil F
2017-04-01
RNA nanotechnology employs synthetically modified ribonucleic acid (RNA) to engineer highly stable nanostructures in one, two, and three dimensions for medical applications. Despite the tremendous advantages in RNA nanotechnology, unmodified RNA itself is fragile and prone to enzymatic degradation. In contrast to use traditionally modified RNA strands e.g. 2'-fluorine, 2'-amine, 2'-methyl, we studied the effect of RNA/DNA hybrid approach utilizing a computer-assisted RNA tetra-uracil (tetra-U) motif as a toolkit to address questions related to assembly efficiency, versatility, stability, and the production costs of hybrid RNA/DNA nanoparticles. The tetra-U RNA motif was implemented to construct four functional triangles using RNA, DNA and RNA/DNA mixtures, resulting in fine-tunable enzymatic and thermodynamic stabilities, immunostimulatory activity and RNAi capability. Moreover, the tetra-U toolkit has great potential in the fabrication of rectangular, pentagonal, and hexagonal NPs, representing the power of simplicity of RNA/DNA approach for RNA nanotechnology and nanomedicine community. Copyright © 2017 Elsevier Inc. All rights reserved.
Ancient DNA sequence revealed by error-correcting codes.
Brandão, Marcelo M; Spoladore, Larissa; Faria, Luzinete C B; Rocha, Andréa S L; Silva-Filho, Marcio C; Palazzo, Reginaldo
2015-07-10
A previously described DNA sequence generator algorithm (DNA-SGA) using error-correcting codes has been employed as a computational tool to address the evolutionary pathway of the genetic code. The code-generated sequence alignment demonstrated that a residue mutation revealed by the code can be found in the same position in sequences of distantly related taxa. Furthermore, the code-generated sequences do not promote amino acid changes in the deviant genomes through codon reassignment. A Bayesian evolutionary analysis of both code-generated and homologous sequences of the Arabidopsis thaliana malate dehydrogenase gene indicates an approximately 1 MYA divergence time from the MDH code-generated sequence node to its paralogous sequences. The DNA-SGA helps to determine the plesiomorphic state of DNA sequences because a single nucleotide alteration often occurs in distantly related taxa and can be found in the alternative codon patterns of noncanonical genetic codes. As a consequence, the algorithm may reveal an earlier stage of the evolution of the standard code.
Ancient DNA sequence revealed by error-correcting codes
Brandão, Marcelo M.; Spoladore, Larissa; Faria, Luzinete C. B.; Rocha, Andréa S. L.; Silva-Filho, Marcio C.; Palazzo, Reginaldo
2015-01-01
A previously described DNA sequence generator algorithm (DNA-SGA) using error-correcting codes has been employed as a computational tool to address the evolutionary pathway of the genetic code. The code-generated sequence alignment demonstrated that a residue mutation revealed by the code can be found in the same position in sequences of distantly related taxa. Furthermore, the code-generated sequences do not promote amino acid changes in the deviant genomes through codon reassignment. A Bayesian evolutionary analysis of both code-generated and homologous sequences of the Arabidopsis thaliana malate dehydrogenase gene indicates an approximately 1 MYA divergence time from the MDH code-generated sequence node to its paralogous sequences. The DNA-SGA helps to determine the plesiomorphic state of DNA sequences because a single nucleotide alteration often occurs in distantly related taxa and can be found in the alternative codon patterns of noncanonical genetic codes. As a consequence, the algorithm may reveal an earlier stage of the evolution of the standard code. PMID:26159228
Free-energy simulations reveal molecular mechanism for functional switch of a DNA helicase
Ma, Wen; Whitley, Kevin D; Schulten, Klaus
2018-01-01
Helicases play key roles in genome maintenance, yet it remains elusive how these enzymes change conformations and how transitions between different conformational states regulate nucleic acid reshaping. Here, we developed a computational technique combining structural bioinformatics approaches and atomic-level free-energy simulations to characterize how the Escherichia coli DNA repair enzyme UvrD changes its conformation at the fork junction to switch its function from unwinding to rezipping DNA. The lowest free-energy path shows that UvrD opens the interface between two domains, allowing the bound ssDNA to escape. The simulation results predict a key metastable 'tilted' state during ssDNA strand switching. By simulating FRET distributions with fluorophores attached to UvrD, we show that the new state is supported quantitatively by single-molecule measurements. The present study deciphers key elements for the 'hyper-helicase' behavior of a mutant and provides an effective framework to characterize directly structure-function relationships in molecular machines. PMID:29664402
A stochastic DNA walker that traverses a microparticle surface
NASA Astrophysics Data System (ADS)
Jung, C.; Allen, P. B.; Ellington, A. D.
2016-02-01
Molecular machines have previously been designed that are propelled by DNAzymes, protein enzymes and strand displacement. These engineered machines typically move along precisely defined one- and two-dimensional tracks. Here, we report a DNA walker that uses hybridization to drive walking on DNA-coated microparticle surfaces. Through purely DNA:DNA hybridization reactions, the nanoscale movements of the walker can lead to the generation of a single-stranded product and the subsequent immobilization of fluorescent labels on the microparticle surface. This suggests that the system could be of use in analytical and diagnostic applications, similar to how strand exchange reactions in solution have been used for transducing and quantifying signals from isothermal molecular amplification assays. The walking behaviour is robust and the walker can take more than 30 continuous steps. The traversal of an unprogrammed, inhomogeneous surface is also due entirely to autonomous decisions made by the walker, behaviour analogous to amorphous chemical reaction network computations, which have been shown to lead to pattern formation.
Addressable configurations of DNA nanostructures for rewritable memory
Levchenko, Oksana; Patel, Dhruv S.; MacIsaac, Molly
2017-01-01
Abstract DNA serves as nature's information storage molecule, and has been the primary focus of engineered systems for biological computing and data storage. Here we combine recent efforts in DNA self-assembly and toehold-mediated strand displacement to develop a rewritable multi-bit DNA memory system. The system operates by encoding information in distinct and reversible conformations of a DNA nanoswitch and decoding by gel electrophoresis. We demonstrate a 5-bit system capable of writing, erasing, and rewriting binary representations of alphanumeric symbols, as well as compatibility with ‘OR’ and ‘AND’ logic operations. Our strategy is simple to implement, requiring only a single mixing step at room temperature for each operation and standard gel electrophoresis to read the data. We envision such systems could find use in covert product labeling and barcoding, as well as secure messaging and authentication when combined with previously developed encryption strategies. Ultimately, this type of memory has exciting potential in biomedical sciences as data storage can be coupled to sensing of biological molecules. PMID:28977499
Kong, Muwen; Van Houten, Bennett
2017-08-01
Since Robert Brown's first observations of random walks by pollen particles suspended in solution, the concept of diffusion has been subject to countless theoretical and experimental studies in diverse fields from finance and social sciences, to physics and biology. Diffusive transport of macromolecules in cells is intimately linked to essential cellular functions including nutrient uptake, signal transduction, gene expression, as well as DNA replication and repair. Advancement in experimental techniques has allowed precise measurements of these diffusion processes. Mathematical and physical descriptions and computer simulations have been applied to model complicated biological systems in which anomalous diffusion, in addition to simple Brownian motion, was observed. The purpose of this review is to provide an overview of the major physical models of anomalous diffusion and corresponding experimental evidence on the target search problem faced by DNA-binding proteins, with an emphasis on DNA repair proteins and the role of anomalous diffusion in DNA target recognition. Copyright © 2016 Elsevier Ltd. All rights reserved.
Long, Hannah K; Sims, David; Heger, Andreas; Blackledge, Neil P; Kutter, Claudia; Wright, Megan L; Grützner, Frank; Odom, Duncan T; Patient, Roger; Ponting, Chris P; Klose, Robert J
2013-01-01
Two-thirds of gene promoters in mammals are associated with regions of non-methylated DNA, called CpG islands (CGIs), which counteract the repressive effects of DNA methylation on chromatin. In cold-blooded vertebrates, computational CGI predictions often reside away from gene promoters, suggesting a major divergence in gene promoter architecture across vertebrates. By experimentally identifying non-methylated DNA in the genomes of seven diverse vertebrates, we instead reveal that non-methylated islands (NMIs) of DNA are a central feature of vertebrate gene promoters. Furthermore, NMIs are present at orthologous genes across vast evolutionary distances, revealing a surprising level of conservation in this epigenetic feature. By profiling NMIs in different tissues and developmental stages we uncover a unifying set of features that are central to the function of NMIs in vertebrates. Together these findings demonstrate an ancient logic for NMI usage at gene promoters and reveal an unprecedented level of epigenetic conservation across vertebrate evolution. DOI: http://dx.doi.org/10.7554/eLife.00348.001 PMID:23467541
Molecular Dynamics Studies of Self-Assembling Biomolecules and DNA-functionalized Gold Nanoparticles
NASA Astrophysics Data System (ADS)
Cho, Vince Y.
This thesis is organized as following. In Chapter 2, we use fully atomistic MD simulations to study the conformation of DNA molecules that link gold nanoparticles to form nanoparticle superlattice crystals. In Chapter 3, we study the self-assembly of peptide amphiphiles (PAs) into a cylindrical micelle fiber by using CGMD simulations. Compared to fully atomistic MD simulations, CGMD simulations prove to be computationally cost-efficient and reasonably accurate for exploring self-assembly, and are used in all subsequent chapters. In Chapter 4, we apply CGMD methods to study the self-assembly of small molecule-DNA hybrid (SMDH) building blocks into well-defined cage-like dimers, and reveal the role of kinetics and thermodynamics in this process. In Chapter 5, we extend the CGMD model for this system and find that the assembly of SMDHs can be fine-tuned by changing parameters. In Chapter 6, we explore superlattice crystal structures of DNA-functionalized gold nanoparticles (DNA-AuNP) with the CGMD model and compare the hybridization.
DNA melting profiles from a matrix method.
Poland, Douglas
2004-02-05
In this article we give a new method for the calculation of DNA melting profiles. Based on the matrix formulation of the DNA partition function, the method relies for its efficiency on the fact that the required matrices are very sparse, essentially reducing matrix multiplication to vector multiplication and thus making the computer time required to treat a DNA molecule containing N base pairs proportional to N(2). A key ingredient in the method is the result that multiplication by the inverse matrix can also be reduced to vector multiplication. The task of calculating the melting profile for the entire genome is further reduced by treating regions of the molecule between helix-plateaus, thus breaking the molecule up into independent parts that can each be treated individually. The method is easily modified to incorporate changes in the assignment of statistical weights to the different structural features of DNA. We illustrate the method using the genome of Haemophilus influenzae. Copyright 2003 Wiley Periodicals, Inc.
Chang, Y. Paul; Xu, Meng; Machado, Ana Carolina Dantas; Yu, Xian Jessica; Rohs, Remo; Chen, Xiaojiang S.
2013-01-01
SUMMARY The DNA tumor virus Simian virus 40 (SV40) is a model system for studying eukaryotic replication. SV40 large tumor antigen (LTag) is the initiator/helicase that is essential for genome replication. LTag recognizes and assembles at the viral replication origin. We determined the structure of two multidomain LTag subunits bound to origin DNA. The structure reveals that the origin binding domains (OBDs) and Zn and AAA+ domains are involved in origin recognition and assembly. Notably, the OBDs recognize the origin in an unexpected manner. The histidine residues of the AAA+ domains insert into a narrow minor groove region with enhanced negative electrostatic potential. Computational analysis indicates that this region is intrinsically narrow, demonstrating the role of DNA shape readout in origin recognition. Our results provide important insights into the assembly of the LTag initiator/ helicase at the replication origin and suggest that histidine contacts with the minor groove serve as a mechanism of DNA shape readout. PMID:23545501
Free-energy simulations reveal molecular mechanism for functional switch of a DNA helicase.
Ma, Wen; Whitley, Kevin D; Chemla, Yann R; Luthey-Schulten, Zaida; Schulten, Klaus
2018-04-17
Helicases play key roles in genome maintenance, yet it remains elusive how these enzymes change conformations and how transitions between different conformational states regulate nucleic acid reshaping. Here, we developed a computational technique combining structural bioinformatics approaches and atomic-level free-energy simulations to characterize how the Escherichia coli DNA repair enzyme UvrD changes its conformation at the fork junction to switch its function from unwinding to rezipping DNA. The lowest free-energy path shows that UvrD opens the interface between two domains, allowing the bound ssDNA to escape. The simulation results predict a key metastable 'tilted' state during ssDNA strand switching. By simulating FRET distributions with fluorophores attached to UvrD, we show that the new state is supported quantitatively by single-molecule measurements. The present study deciphers key elements for the 'hyper-helicase' behavior of a mutant and provides an effective framework to characterize directly structure-function relationships in molecular machines. © 2018, Ma et al.
Extension of nanoconfined DNA: Quantitative comparison between experiment and theory
NASA Astrophysics Data System (ADS)
Iarko, V.; Werner, E.; Nyberg, L. K.; Müller, V.; Fritzsche, J.; Ambjörnsson, T.; Beech, J. P.; Tegenfeldt, J. O.; Mehlig, K.; Westerlund, F.; Mehlig, B.
2015-12-01
The extension of DNA confined to nanochannels has been studied intensively and in detail. However, quantitative comparisons between experiments and model calculations are difficult because most theoretical predictions involve undetermined prefactors, and because the model parameters (contour length, Kuhn length, effective width) are difficult to compute reliably, leading to substantial uncertainties. Here we use a recent asymptotically exact theory for the DNA extension in the "extended de Gennes regime" that allows us to compare experimental results with theory. For this purpose, we performed experiments measuring the mean DNA extension and its standard deviation while varying the channel geometry, dye intercalation ratio, and ionic strength of the buffer. The experimental results agree very well with theory at high ionic strengths, indicating that the model parameters are reliable. At low ionic strengths, the agreement is less good. We discuss possible reasons. In principle, our approach allows us to measure the Kuhn length and the effective width of a single DNA molecule and more generally of semiflexible polymers in solution.
Twisting short dsDNA with applied tension
NASA Astrophysics Data System (ADS)
Zoli, Marco
2018-02-01
The twisting deformation of mechanically stretched DNA molecules is studied by a coarse grained Hamiltonian model incorporating the fundamental interactions that stabilize the double helix and accounting for the radial and angular base pair fluctuations. The latter are all the more important at short length scales in which DNA fragments maintain an intrinsic flexibility. The presented computational method simulates a broad ensemble of possible molecule conformations characterized by a specific average twist and determines the energetically most convenient helical twist by free energy minimization. As this is done for any external load, the method yields the characteristic twist-stretch profile of the molecule and also computes the changes in the macroscopic helix parameters i.e. average diameter and rise distance. It is predicted that short molecules under stretching should first over-twist and then untwist by increasing the external load. Moreover, applying a constant load and simulating a torsional strain which over-twists the helix, it is found that the average helix diameter shrinks while the molecule elongates, in agreement with the experimental trend observed in kilo-base long sequences. The quantitative relation between percent relative elongation and superhelical density at fixed load is derived. The proposed theoretical model and computational method offer a general approach to characterize specific DNA fragments and predict their macroscopic elastic response as a function of the effective potential parameters of the mesoscopic Hamiltonian.
Uncovering the self-assembly of DNA nanostructures by thermodynamics and kinetics.
Wei, Xixi; Nangreave, Jeanette; Liu, Yan
2014-06-17
CONSPECTUS: DNA nanotechnology is one of the most flourishing interdisciplinary research fields. DNA nanostructures can be designed to self-assemble into a variety of periodic or aperiodic patterns of different shapes and length scales. They can be used as scaffolds for organizing other nanoparticles, proteins, and chemical groups, leveraging their functions for creating complex bioinspired materials that may serve as smart drug delivery systems, in vitro or in vivo biomolecular computing platforms, and diagnostic devices. Achieving optimal structural features, efficient assembly protocols, and precise functional group positioning and modification requires a thorough understanding of the thermodynamics and kinetics of the DNA nanostructure self-assembly process. The most common real-time measurement strategies include monitoring changes in UV absorbance based on the hyperchromic effect of DNA, and the emission signal changes of DNA intercalating dyes or covalently conjugated fluorescent dyes/pairs that accompany temperature dependent structural changes. Thermodynamic studies of a variety of DNA nanostructures have been performed, from simple double stranded DNA formation to more complex origami assembly. The key parameters that have been evaluated in terms of stability and cooperativity include the overall dimensions, the folding path of the scaffold, crossover and nick point arrangement, length and sequence of single strands, and salt and ion concentrations. DNA tile-tile interactions through sticky end hybridization have also been analyzed, and the steric inhibition and rigidity of tiles turn out to be important factors. Many kinetic studies have also been reported, and most are based on double stranded DNA formation. A two-state assumption and the hypothesis of several intermediate states have been applied to determine the rate constant and activation energy of the DNA hybridization process. A few simulated models were proposed to represent the structural, mechanical, and kinetic properties of DNA hybridization. The kinetics of strand displacement reactions has also been studied as a special case of DNA hybridization. The thermodynamic and kinetic characteristics of DNA nanostructures have been exploited to develop rapid and isothermal annealing protocols. It is conceivable that a more thorough understanding of the DNA assembly process could be used to guide the structural design process and optimize the conditions for assembly, manipulation, and functionalization, thus benefiting both upstream design and downstream applications.
The 'Biologically-Inspired Computing' Column
NASA Technical Reports Server (NTRS)
Hinchey, Mike
2006-01-01
The field of Biology changed dramatically in 1953, with the determination by Francis Crick and James Dewey Watson of the double helix structure of DNA. This discovery changed Biology for ever, allowing the sequencing of the human genome, and the emergence of a "new Biology" focused on DNA, genes, proteins, data, and search. Computational Biology and Bioinformatics heavily rely on computing to facilitate research into life and development. Simultaneously, an understanding of the biology of living organisms indicates a parallel with computing systems: molecules in living cells interact, grow, and transform according to the "program" dictated by DNA. Moreover, paradigms of Computing are emerging based on modelling and developing computer-based systems exploiting ideas that are observed in nature. This includes building into computer systems self-management and self-governance mechanisms that are inspired by the human body's autonomic nervous system, modelling evolutionary systems analogous to colonies of ants or other insects, and developing highly-efficient and highly-complex distributed systems from large numbers of (often quite simple) largely homogeneous components to reflect the behaviour of flocks of birds, swarms of bees, herds of animals, or schools of fish. This new field of "Biologically-Inspired Computing", often known in other incarnations by other names, such as: Autonomic Computing, Pervasive Computing, Organic Computing, Biomimetics, and Artificial Life, amongst others, is poised at the intersection of Computer Science, Engineering, Mathematics, and the Life Sciences. Successes have been reported in the fields of drug discovery, data communications, computer animation, control and command, exploration systems for space, undersea, and harsh environments, to name but a few, and augur much promise for future progress.
Screening for Protein-DNA Interactions by Automatable DNA-Protein Interaction ELISA
Schüssler, Axel; Kolukisaoglu, H. Üner; Koch, Grit; Wallmeroth, Niklas; Hecker, Andreas; Thurow, Kerstin; Zell, Andreas; Harter, Klaus; Wanke, Dierk
2013-01-01
DNA-binding proteins (DBPs), such as transcription factors, constitute about 10% of the protein-coding genes in eukaryotic genomes and play pivotal roles in the regulation of chromatin structure and gene expression by binding to short stretches of DNA. Despite their number and importance, only for a minor portion of DBPs the binding sequence had been disclosed. Methods that allow the de novo identification of DNA-binding motifs of known DBPs, such as protein binding microarray technology or SELEX, are not yet suited for high-throughput and automation. To close this gap, we report an automatable DNA-protein-interaction (DPI)-ELISA screen of an optimized double-stranded DNA (dsDNA) probe library that allows the high-throughput identification of hexanucleotide DNA-binding motifs. In contrast to other methods, this DPI-ELISA screen can be performed manually or with standard laboratory automation. Furthermore, output evaluation does not require extensive computational analysis to derive a binding consensus. We could show that the DPI-ELISA screen disclosed the full spectrum of binding preferences for a given DBP. As an example, AtWRKY11 was used to demonstrate that the automated DPI-ELISA screen revealed the entire range of in vitro binding preferences. In addition, protein extracts of AtbZIP63 and the DNA-binding domain of AtWRKY33 were analyzed, which led to a refinement of their known DNA-binding consensi. Finally, we performed a DPI-ELISA screen to disclose the DNA-binding consensus of a yet uncharacterized putative DBP, AtTIFY1. A palindromic TGATCA-consensus was uncovered and we could show that the GATC-core is compulsory for AtTIFY1 binding. This specific interaction between AtTIFY1 and its DNA-binding motif was confirmed by in vivo plant one-hybrid assays in protoplasts. Thus, the value and applicability of the DPI-ELISA screen for de novo binding site identification of DBPs, also under automatized conditions, is a promising approach for a deeper understanding of gene regulation in any organism of choice. PMID:24146751
The development of miniplex primer sets for the analysis of degraded DNA
NASA Astrophysics Data System (ADS)
McCord, Bruce; Opel, Kerry; Chung, Denise; Drabek, Jiri; Tatarek, Nancy; Meadows Jantz, Lee; Butler, John
2005-05-01
In this project, a new set of multiplexed PCR reactions has been developed for the analysis of degraded DNA. These DNA markers, known as Miniplexes, utilize primers that have shorter amplicons for use in short tandem repeat (STR) analysis of degraded DNA. In our work we have defined six of these new STR multiplexes, each of which consists of 3 to 4 reduced size STR loci, and each labeled with a different fluorescent dye. When compared to commercially available STR systems, reductions in size of up to 300 basepairs are possible. In addition, these newly designed amplicons consist of loci that are fully compatible with the the national computer DNA database known as CODIS. To demonstrate compatibility with commercial STR kits, a concordance study of 532 DNA samples of Caucasian, African American, and Hispanic origin was undertaken There was 99.77% concordance between allele calls with the two methods. Of these 532 samples, only 15 samples showed discrepancies at one of 12 loci. These occurred predominantly at 2 loci, vWA and D13S317. DNA sequencing revealed that these locations had deletions between the two primer binding sites. Uncommon deletions like these can be expected in certain samples and will not affect the utility of the Miniplexes as tools for degraded DNA analysis. The Miniplexes were also applied to enzymatically digested DNA to assess their potential in degraded DNA analysis. The results demonstrated a greatly improved efficiency in the analysis of degraded DNA when compared to commercial STR genotyping kits. A series of human skeletal remains that had been exposed to a variety of environmental conditions were also examined. Sixty-four percent of the samples generated full profiles when amplified with the Miniplexes, while only sixteen percent of the samples tested generated full profiles with a commercial kit. In addition, complete profiles were obtained for eleven of the twelve Miniplex loci which had amplicon size ranges less than 200 base pairs. These data clearly demonstrate that smaller PCR amplicons provide an attractive alternative to mitochondrial DNA for forensic analysis of degraded DNA.
An efficient variational method to study the denaturation of DNA induced by superhelical stress
NASA Astrophysics Data System (ADS)
Jost, Daniel; Everaers, Ralf
2010-03-01
Many fundamental biological processes, like transcription or replication, need the opening of the double-stranded DNA. One common way to control the local denaturation is to impose superhelical stress to the DNA using protein machineries. To describe superhelical effect for circular molecules, Benham introduced a model where the standard thermodynamic description of base-pairing is coupled with torsional stress energetics. Here, we introduce an efficient mean-field approximation of the Benham model. Our self-consistent solution is confident and computationally-fast, compared to the full treatment of the model. In particular, our formulation allows to compute the probability of bubble formation for given length and position along the sequence. Evolution of this probability as a function of the superhelical stress could inform us on the ability for organisms to control the strength of superhelicity acting on their genomes.
Improved Force Fields for Peptide Nucleic Acids with Optimized Backbone Torsion Parameters.
Jasiński, Maciej; Feig, Michael; Trylska, Joanna
2018-06-06
Peptide nucleic acids are promising nucleic acid analogs for antisense therapies as they can form stable duplex and triplex structures with DNA and RNA. Computational studies of PNA-containing duplexes and triplexes are an important component for guiding their design, yet existing force fields have not been well validated and parametrized with modern computational capabilities. We present updated CHARMM and Amber force fields for PNA that greatly improve the stability of simulated PNA-containing duplexes and triplexes in comparison with experimental structures and allow such systems to be studied on microsecond time scales. The force field modifications focus on reparametrized PNA backbone torsion angles to match high-level quantum mechanics reference energies for a model compound. The microsecond simulations of PNA-PNA, PNA-DNA, PNA-RNA, and PNA-DNA-PNA complexes also allowed a comprehensive analysis of hydration and ion interactions with such systems.
Eldar, Amir; Rozenberg, Haim; Diskin-Posner, Yael; Rohs, Remo; Shakked, Zippora
2013-01-01
A p53 hot-spot mutation found frequently in human cancer is the replacement of R273 by histidine or cysteine residues resulting in p53 loss of function as a tumor suppressor. These mutants can be reactivated by the incorporation of second-site suppressor mutations. Here, we present high-resolution crystal structures of the p53 core domains of the cancer-related proteins, the rescued proteins and their complexes with DNA. The structures show that inactivation of p53 results from the incapacity of the mutated residues to form stabilizing interactions with the DNA backbone, and that reactivation is achieved through alternative interactions formed by the suppressor mutations. Detailed structural and computational analysis demonstrates that the rescued p53 complexes are not fully restored in terms of DNA structure and its interface with p53. Contrary to our previously studied wild-type (wt) p53-DNA complexes showing non-canonical Hoogsteen A/T base pairs of the DNA helix that lead to local minor-groove narrowing and enhanced electrostatic interactions with p53, the current structures display Watson–Crick base pairs associated with direct or water-mediated hydrogen bonds with p53 at the minor groove. These findings highlight the pivotal role played by R273 residues in supporting the unique geometry of the DNA target and its sequence-specific complex with p53. PMID:23863845
Biosensing via light scattering from plasmonic core-shell nanospheres coated with DNA molecules
NASA Astrophysics Data System (ADS)
Xie, Huai-Yi; Chen, Minfeng; Chang, Yia-Chung; Moirangthem, Rakesh Singh
2017-05-01
We present both experimental and theoretical studies for investigating DNA molecules attached on metallic nanospheres. We have developed an efficient and accurate numerical method to investigate light scattering from plasmonic nanospheres on a substrate covered by a shell, based on the Green's function approach with suitable spherical harmonic basis. Next, we use this method to study optical scattering from DNA molecules attached to metallic nanoparticles placed on a substrate and compare with experimental results. We obtain fairly good agreement between theoretical predictions and the measured ellipsometric spectra. The metallic nanoparticles were used to detect the binding with DNA molecules in a microfluidic setup via spectroscopic ellipsometry (SE), and a detectable change in ellipsometric spectra was found when DNA molecules are captured on Au nanoparticles. Our theoretical simulation indicates that the coverage of Au nanosphere by a submonolayer of DNA molecules, which is modeled by a thin layer of dielectric material (which may absorb light), can lead to a small but detectable spectroscopic shift in both the Ψ and Δ spectra with more significant change in Δ spectra in agreement with experimental results. Our studies demonstrated the ultrasensitive capability of SE for sensing submonolayer coverage of DNA molecules on Au nanospheres. Hence the spectroscopic ellipsometric measurements coupled with theoretical analysis via an efficient computation method can be an effective tool for detecting DNA molecules attached on Au nanoparticles, thus achieving label-free, non-destructive, and high-sensitivity biosensing with nanoscale resolution.
Ma, Xin; Guo, Jing; Sun, Xiao
2016-01-01
DNA-binding proteins are fundamentally important in cellular processes. Several computational-based methods have been developed to improve the prediction of DNA-binding proteins in previous years. However, insufficient work has been done on the prediction of DNA-binding proteins from protein sequence information. In this paper, a novel predictor, DNABP (DNA-binding proteins), was designed to predict DNA-binding proteins using the random forest (RF) classifier with a hybrid feature. The hybrid feature contains two types of novel sequence features, which reflect information about the conservation of physicochemical properties of the amino acids, and the binding propensity of DNA-binding residues and non-binding propensities of non-binding residues. The comparisons with each feature demonstrated that these two novel features contributed most to the improvement in predictive ability. Furthermore, to improve the prediction performance of the DNABP model, feature selection using the minimum redundancy maximum relevance (mRMR) method combined with incremental feature selection (IFS) was carried out during the model construction. The results showed that the DNABP model could achieve 86.90% accuracy, 83.76% sensitivity, 90.03% specificity and a Matthews correlation coefficient of 0.727. High prediction accuracy and performance comparisons with previous research suggested that DNABP could be a useful approach to identify DNA-binding proteins from sequence information. The DNABP web server system is freely available at http://www.cbi.seu.edu.cn/DNABP/.
Zgarbová, Marie; Luque, F. Javier; Šponer, Jiří; Cheatham, Thomas E.; Otyepka, Michal; Jurečka, Petr
2013-01-01
We present a refinement of the backbone torsion parameters ε and ζ of the Cornell et al. AMBER force field for DNA simulations. The new parameters, denoted as εζOL1, were derived from quantum-mechanical calculations with inclusion of conformation-dependent solvation effects according to the recently reported methodology (J. Chem. Theory Comput. 2012, 7(9), 2886-2902). The performance of the refined parameters was analyzed by means of extended molecular dynamics (MD) simulations for several representative systems. The results showed that the εζOL1 refinement improves the backbone description of B-DNA double helices and G-DNA stem. In B-DNA simulations, we observed an average increase of the helical twist and narrowing of the major groove, thus achieving better agreement with X-ray and solution NMR data. The balance between populations of BI and BII backbone substates was shifted towards the BII state, in better agreement with ensemble-refined solution experimental results. Furthermore, the refined parameters decreased the backbone RMS deviations in B-DNA MD simulations. In the antiparallel guanine quadruplex (G-DNA) the εζOL1 modification improved the description of non-canonical α/γ backbone substates, which were shown to be coupled to the ε/ζ torsion potential. Thus, the refinement is suggested as a possible alternative to the current ε/ζ torsion potential, which may enable more accurate modeling of nucleic acids. However, long-term testing is recommended before its routine application in DNA simulations. PMID:24058302
Role of geometrical shape in like-charge attraction of DNA.
Kuron, Michael; Arnold, Axel
2015-03-01
While the phenomenon of like-charge attraction of DNA is clearly observed experimentally and in simulations, mean-field theories fail to predict it. Kornyshev et al. argued that like-charge attraction is due to DNA's helical geometry and hydration forces. Strong-coupling (SC) theory shows that attraction of like-charged rods is possible through ion correlations alone at large coupling parameters, usually by multivalent counterions. However for SC theory to be applicable, counterion-counterion correlations perpendicular to the DNA strands need to be sufficiently small, which is not a priori the case for DNA even with trivalent counterions. We study a system containing infinitely long DNA strands and trivalent counterions by computer simulations employing varying degrees of coarse-graining. Our results show that there is always attraction between the strands, but its magnitude is indeed highly dependent on the specific shape of the strand. While discreteness of the charge distribution has little influence on the attractive forces, the role of the helical charge distribution is considerable: charged rods maintain a finite distance in equilibrium, while helices collapse to close contact with a phase shift of π, in full agreement with SC predictions. The SC limit is applicable because counterions strongly bind to the charged sites of the helices, so that helix-counterion interactions dominate over counterion-counterion interactions. Thus DNA's helical geometry is not crucial for like-charge DNA attraction, but strongly enhances it, and electrostatic interactions in the strong-coupling limit are sufficient to explain this attraction.
Liao, Wei-Ching; Chuang, Min-Chieh; Ho, Ja-An Annie
2013-12-15
Genetically modified (GM) technique, one of the modern biomolecular engineering technologies, has been deemed as profitable strategy to fight against global starvation. Yet rapid and reliable analytical method is deficient to evaluate the quality and potential risk of such resulting GM products. We herein present a biomolecular analytical system constructed with distinct biochemical activities to expedite the computational detection of genetically modified organisms (GMOs). The computational mechanism provides an alternative to the complex procedures commonly involved in the screening of GMOs. Given that the bioanalytical system is capable of processing promoter, coding and species genes, affirmative interpretations succeed to identify specified GM event in terms of both electrochemical and optical fashions. The biomolecular computational assay exhibits detection capability of genetically modified DNA below sub-nanomolar level and is found interference-free by abundant coexistence of non-GM DNA. This bioanalytical system, furthermore, sophisticates in array fashion operating multiplex screening against variable GM events. Such a biomolecular computational assay and biosensor holds great promise for rapid, cost-effective, and high-fidelity screening of GMO. Copyright © 2013 Elsevier B.V. All rights reserved.
Interplay of protein and DNA structure revealed in simulations of the lac operon.
Czapla, Luke; Grosner, Michael A; Swigon, David; Olson, Wilma K
2013-01-01
The E. coli Lac repressor is the classic textbook example of a protein that attaches to widely spaced sites along a genome and forces the intervening DNA into a loop. The short loops implicated in the regulation of the lac operon suggest the involvement of factors other than DNA and repressor in gene control. The molecular simulations presented here examine two likely structural contributions to the in-vivo looping of bacterial DNA: the distortions of the double helix introduced upon association of the highly abundant, nonspecific nucleoid protein HU and the large-scale deformations of the repressor detected in low-resolution experiments. The computations take account of the three-dimensional arrangements of nucleotides and amino acids found in crystal structures of DNA with the two proteins, the natural rest state and deformational properties of protein-free DNA, and the constraints on looping imposed by the conformation of the repressor and the orientation of bound DNA. The predicted looping propensities capture the complex, chain-length-dependent variation in repression efficacy extracted from gene expression studies and in vitro experiments and reveal unexpected chain-length-dependent variations in the uptake of HU, the deformation of repressor, and the folding of DNA. Both the opening of repressor and the presence of HU, at levels approximating those found in vivo, enhance the probability of loop formation. HU affects the global organization of the repressor and the opening of repressor influences the levels of HU binding to DNA. The length of the loop determines whether the DNA adopts antiparallel or parallel orientations on the repressor, whether the repressor is opened or closed, and how many HU molecules bind to the loop. The collective behavior of proteins and DNA is greater than the sum of the parts and hints of ways in which multiple proteins may coordinate the packaging and processing of genetic information.
Halper, Sean M; Cetnar, Daniel P; Salis, Howard M
2018-01-01
Engineering many-enzyme metabolic pathways suffers from the design curse of dimensionality. There are an astronomical number of synonymous DNA sequence choices, though relatively few will express an evolutionary robust, maximally productive pathway without metabolic bottlenecks. To solve this challenge, we have developed an integrated, automated computational-experimental pipeline that identifies a pathway's optimal DNA sequence without high-throughput screening or many cycles of design-build-test. The first step applies our Operon Calculator algorithm to design a host-specific evolutionary robust bacterial operon sequence with maximally tunable enzyme expression levels. The second step applies our RBS Library Calculator algorithm to systematically vary enzyme expression levels with the smallest-sized library. After characterizing a small number of constructed pathway variants, measurements are supplied to our Pathway Map Calculator algorithm, which then parameterizes a kinetic metabolic model that ultimately predicts the pathway's optimal enzyme expression levels and DNA sequences. Altogether, our algorithms provide the ability to efficiently map the pathway's sequence-expression-activity space and predict DNA sequences with desired metabolic fluxes. Here, we provide a step-by-step guide to applying the Pathway Optimization Pipeline on a desired multi-enzyme pathway in a bacterial host.
Pancoska, Petr; Moravek, Zdenek; Moll, Ute M
2004-01-01
Nucleic acids are molecules of choice for both established and emerging nanoscale technologies. These technologies benefit from large functional densities of 'DNA processing elements' that can be readily manufactured. To achieve the desired functionality, polynucleotide sequences are currently designed by a process that involves tedious and laborious filtering of potential candidates against a series of requirements and parameters. Here, we present a complete novel methodology for the rapid rational design of large sets of DNA sequences. This method allows for the direct implementation of very complex and detailed requirements for the generated sequences, thus avoiding 'brute force' filtering. At the same time, these sequences have narrow distributions of melting temperatures. The molecular part of the design process can be done without computer assistance, using an efficient 'human engineering' approach by drawing a single blueprint graph that represents all generated sequences. Moreover, the method eliminates the necessity for extensive thermodynamic calculations. Melting temperature can be calculated only once (or not at all). In addition, the isostability of the sequences is independent of the selection of a particular set of thermodynamic parameters. Applications are presented for DNA sequence designs for microarrays, universal microarray zip sequences and electron transfer experiments.
Synthetic Ion Channels and DNA Logic Gates as Components of Molecular Robots.
Kawano, Ryuji
2018-02-19
A molecular robot is a next-generation biochemical machine that imitates the actions of microorganisms. It is made of biomaterials such as DNA, proteins, and lipids. Three prerequisites have been proposed for the construction of such a robot: sensors, intelligence, and actuators. This Minireview focuses on recent research on synthetic ion channels and DNA computing technologies, which are viewed as potential candidate components of molecular robots. Synthetic ion channels, which are embedded in artificial cell membranes (lipid bilayers), sense ambient ions or chemicals and import them. These artificial sensors are useful components for molecular robots with bodies consisting of a lipid bilayer because they enable the interface between the inside and outside of the molecular robot to function as gates. After the signal molecules arrive inside the molecular robot, they can operate DNA logic gates, which perform computations. These functions will be integrated into the intelligence and sensor sections of molecular robots. Soon, these molecular machines will be able to be assembled to operate as a mass microrobot and play an active role in environmental monitoring and in vivo diagnosis or therapy. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Privacy-preserving microbiome analysis using secure computation.
Wagner, Justin; Paulson, Joseph N; Wang, Xiao; Bhattacharjee, Bobby; Corrada Bravo, Héctor
2016-06-15
Developing targeted therapeutics and identifying biomarkers relies on large amounts of research participant data. Beyond human DNA, scientists now investigate the DNA of micro-organisms inhabiting the human body. Recent work shows that an individual's collection of microbial DNA consistently identifies that person and could be used to link a real-world identity to a sensitive attribute in a research dataset. Unfortunately, the current suite of DNA-specific privacy-preserving analysis tools does not meet the requirements for microbiome sequencing studies. To address privacy concerns around microbiome sequencing, we implement metagenomic analyses using secure computation. Our implementation allows comparative analysis over combined data without revealing the feature counts for any individual sample. We focus on three analyses and perform an evaluation on datasets currently used by the microbiome research community. We use our implementation to simulate sharing data between four policy-domains. Additionally, we describe an application of our implementation for patients to combine data that allows drug developers to query against and compensate patients for the analysis. The software is freely available for download at: http://cbcb.umd.edu/∼hcorrada/projects/secureseq.html Supplementary data are available at Bioinformatics online. hcorrada@umiacs.umd.edu. © The Author 2016. Published by Oxford University Press.
DMINDA: an integrated web server for DNA motif identification and analyses.
Ma, Qin; Zhang, Hanyuan; Mao, Xizeng; Zhou, Chuan; Liu, Bingqiang; Chen, Xin; Xu, Ying
2014-07-01
DMINDA (DNA motif identification and analyses) is an integrated web server for DNA motif identification and analyses, which is accessible at http://csbl.bmb.uga.edu/DMINDA/. This web site is freely available to all users and there is no login requirement. This server provides a suite of cis-regulatory motif analysis functions on DNA sequences, which are important to elucidation of the mechanisms of transcriptional regulation: (i) de novo motif finding for a given set of promoter sequences along with statistical scores for the predicted motifs derived based on information extracted from a control set, (ii) scanning motif instances of a query motif in provided genomic sequences, (iii) motif comparison and clustering of identified motifs, and (iv) co-occurrence analyses of query motifs in given promoter sequences. The server is powered by a backend computer cluster with over 150 computing nodes, and is particularly useful for motif prediction and analyses in prokaryotic genomes. We believe that DMINDA, as a new and comprehensive web server for cis-regulatory motif finding and analyses, will benefit the genomic research community in general and prokaryotic genome researchers in particular. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Grunewald, J P; Röhl, F W; Kirches, E; Dietzmann, K
1998-02-01
Many studies dealing with extracranial cancer showed a strong correlation of DNA ploidy to a poor clinical outcome, recurrence, or malignancy. In brain tumors, analysis of DNA content did not always provided significant diagnostic information. In this study, DNA density and karyometric parameters of 50 meningiomas (26 Grade I, 10 Grade II, 14 Grade III) were quantitatively evaluated by digital cell image analyses of Feulgen-stained nuclei. In particular, the densitometric parameter SEXT, which describes nuclear DNA content, as well as the morphometric values LENG (a computer-assisted measurement of nuclear circumference), AREA (a computer-assisted measurement of nuclear area), FCON (a parameter that describes nuclear roundness), and CONC (a describing nuclear contour), evaluated with the software IMAGE C, were correlated to World Health Organization (WHO) grading using univariate and multivariate methods. AREA and LENG values showed significant differences between tumors of Grades I and III. FCON values were unable to distinguish WHO Grade III from Grade I/II but were useful in clearly separating Grade II from Grade I tumors. CONC values detected differences between WHO Grades II and I/III tumors but not between the latter. SEXT values clearly distinguished Grade III from Grade I/II tumors. The 1c, 2c, 2.5c, and 5c exceeding rates showed no predictive values. Only the 6c exceeding rate showed a significant difference between Grades I and III. These results outline the characteristic features of the atypical (Grade II) meningiomas, which make them a recognizable tumor entity distinct from benign and anaplastic meningiomas. The combination of DNA densitometric and morphometric findings seems to be a powerful addition to the histopathologic classification of meningiomas, as suggested by the WHO.
Schwämmle, Veit; Jensen, Ole Nørregaard
2013-01-01
Chromatin is a highly compact and dynamic nuclear structure that consists of DNA and associated proteins. The main organizational unit is the nucleosome, which consists of a histone octamer with DNA wrapped around it. Histone proteins are implicated in the regulation of eukaryote genes and they carry numerous reversible post-translational modifications that control DNA-protein interactions and the recruitment of chromatin binding proteins. Heterochromatin, the transcriptionally inactive part of the genome, is densely packed and contains histone H3 that is methylated at Lys 9 (H3K9me). The propagation of H3K9me in nucleosomes along the DNA in chromatin is antagonizing by methylation of H3 Lysine 4 (H3K4me) and acetylations of several lysines, which is related to euchromatin and active genes. We show that the related histone modifications form antagonized domains on a coarse scale. These histone marks are assumed to be initiated within distinct nucleation sites in the DNA and to propagate bi-directionally. We propose a simple computer model that simulates the distribution of heterochromatin in human chromosomes. The simulations are in agreement with previously reported experimental observations from two different human cell lines. We reproduced different types of barriers between heterochromatin and euchromatin providing a unified model for their function. The effect of changes in the nucleation site distribution and of propagation rates were studied. The former occurs mainly with the aim of (de-)activation of single genes or gene groups and the latter has the power of controlling the transcriptional programs of entire chromosomes. Generally, the regulatory program of gene transcription is controlled by the distribution of nucleation sites along the DNA string.
Nucleotide exchange and excision technology DNA shuffling and directed evolution.
Speck, Janina; Stebel, Sabine C; Arndt, Katja M; Müller, Kristian M
2011-01-01
Remarkable success in optimizing complex properties within DNA and proteins has been achieved by directed evolution. In contrast to various random mutagenesis methods and high-throughput selection methods, the number of available DNA shuffling procedures is limited, and protocols are often difficult to adjust. The strength of the nucleotide exchange and excision technology (NExT) DNA shuffling described here is the robust, efficient, and easily controllable DNA fragmentation step based on random incorporation of the so-called 'exchange nucleotides' by PCR. The exchange nucleotides are removed enzymatically, followed by chemical cleavage of the DNA backbone. The oligonucleotide pool is reassembled into full-length genes by internal primer extension, and the recombined gene library is amplified by standard PCR. The technique has been demonstrated by shuffling a defined gene library of chloramphenicol acetyltransferase variants using uridine as fragmentation defining exchange nucleotide. Substituting 33% of the dTTP with dUTP in the incorporation PCR resulted in shuffled clones with an average parental fragment size of 86 bases and revealed a mutation rate of only 0.1%. Additionally, a computer program (NExTProg) has been developed that predicts the fragment size distribution depending on the relative amount of the exchange nucleotide.
DR-Integrator: a new analytic tool for integrating DNA copy number and gene expression data.
Salari, Keyan; Tibshirani, Robert; Pollack, Jonathan R
2010-02-01
DNA copy number alterations (CNA) frequently underlie gene expression changes by increasing or decreasing gene dosage. However, only a subset of genes with altered dosage exhibit concordant changes in gene expression. This subset is likely to be enriched for oncogenes and tumor suppressor genes, and can be identified by integrating these two layers of genome-scale data. We introduce DNA/RNA-Integrator (DR-Integrator), a statistical software tool to perform integrative analyses on paired DNA copy number and gene expression data. DR-Integrator identifies genes with significant correlations between DNA copy number and gene expression, and implements a supervised analysis that captures genes with significant alterations in both DNA copy number and gene expression between two sample classes. DR-Integrator is freely available for non-commercial use from the Pollack Lab at http://pollacklab.stanford.edu/ and can be downloaded as a plug-in application to Microsoft Excel and as a package for the R statistical computing environment. The R package is available under the name 'DRI' at http://cran.r-project.org/. An example analysis using DR-Integrator is included as supplemental material. Supplementary data are available at Bioinformatics online.
Swigon, David; Coleman, Bernard D.; Olson, Wilma K.
2006-01-01
Repression of transcription of the Escherichia coli Lac operon by the Lac repressor (LacR) is accompanied by the simultaneous binding of LacR to two operators and the formation of a DNA loop. A recently developed theory of sequence-dependent DNA elasticity enables one to relate the fine structure of the LacR–DNA complex to a wide range of heretofore-unconnected experimental observations. Here, that theory is used to calculate the configuration and free energy of the DNA loop as a function of its length and base-pair sequence, its linking number, and the end conditions imposed by the LacR tetramer. The tetramer can assume two types of conformations. Whereas a rigid V-shaped structure is observed in the crystal, EM images show extended forms in which two dimer subunits are flexibly joined. Upon comparing our computed loop configurations with published experimental observations of permanganate sensitivities, DNase I cutting patterns, and loop stabilities, we conclude that linear DNA segments of short-to-medium chain length (50–180 bp) give rise to loops with the extended form of LacR and that loops formed within negatively supercoiled plasmids induce the V-shaped structure. PMID:16785444
Contribution of indirect effects to clustered damage in DNA irradiated with protons.
Pachnerová Brabcová, K; Štěpán, V; Karamitros, M; Karabín, M; Dostálek, P; Incerti, S; Davídková, M; Sihver, L
2015-09-01
Protons are the dominant particles both in galactic cosmic rays and in solar particle events and, furthermore, proton irradiation becomes increasingly used in tumour treatment. It is believed that complex DNA damage is the determining factor for the consequent cellular response to radiation. DNA plasmid pBR322 was irradiated at U120-M cyclotron with 30 MeV protons and treated with two Escherichia coli base excision repair enzymes. The yields of SSBs and DSBs were analysed using agarose gel electrophoresis. DNA has been irradiated in the presence of hydroxyl radical scavenger (coumarin-3-carboxylic acid) in order to distinguish between direct and indirect damage of the biological target. Pure scavenger solution was used as a probe for measurement of induced OH· radical yields. Experimental OH· radical yield kinetics was compared with predictions computed by two theoretical models-RADAMOL and Geant4-DNA. Both approaches use Geant4-DNA for description of physical stages of radiation action, and then each of them applies a distinct model for description of the pre-chemical and chemical stage. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Theoretical studies of protein-protein and protein-DNA binding rates
NASA Astrophysics Data System (ADS)
Alsallaq, Ramzi A.
Proteins are folded chains of amino acids. Some of the amino acids (e.g. Lys, Arg, His, Asp, and Glu) carry charges under physiological conditions. Proteins almost always function through binding to other proteins or ligands, for example barnase is a ribonuclease protein, found in the bacterium Bacillus amyloliquefaceus. Barnase degrades RNA by hydrolysis. For the bacterium to inhibit the potentially lethal action of Barnase within its own cell it co-produces another protein called barstar which binds quickly, and tightly, to barnase. The biological function of this binding is to block the active site of barnase. The speeds (rates) at which proteins associate are vital to many biological processes. They span a wide range (from less than 103 to 108 M-1s-1 ). Rates greater than ˜ 106 M -1s-1 are typically found to be manifestations of enhancements by long-range electrostatic interactions between the associating proteins. A different paradigm appears in the case of protein binding to DNA. The rate in this case is enhanced through attractive surface potential that effectively reduces the dimensionality of the available search space for the diffusing protein. This thesis presents computational and theoretical models on the rate of association of ligands/proteins to other proteins or DNA. For protein-protein association we present a general strategy for computing protein-protein rates of association. The main achievements of this strategy is the ability to obtain a stringent reaction criteria based on the landscape of short-range interactions between the associating proteins, and the ability to compute the effect of the electrostatic interactions on the rates of association accurately using the best known solvers for Poisson-Boltzmann equation presently available. For protein-DNA association we present a mathematical model for proteins targeting specific sites on a circular DNA topology. The main achievements are the realization that a linear DNA with reflecting ends and specific site in the middle of the chain is kinetically indistinguishable from its circularized topology, and the ability to predict the effect of the dissociation via the ends of linear DNA on the rate of association which is to reduce the rate.* *This dissertation is a compound document (contains both a paper copy and a CD as part of the dissertation). The CD requires the following system requirements: QuickTime.
Vladimirov, N V; Likhoshvaĭ, V A; Matushkin, Iu G
2007-01-01
Gene expression is known to correlate with degree of codon bias in many unicellular organisms. However, such correlation is absent in some organisms. Recently we demonstrated that inverted complementary repeats within coding DNA sequence must be considered for proper estimation of translation efficiency, since they may form secondary structures that obstruct ribosome movement. We have developed a program for estimation of potential coding DNA sequence expression in defined unicellular organism using its genome sequence. The program computes elongation efficiency index. Computation is based on estimation of coding DNA sequence elongation efficiency, taking into account three key factors: codon bias, average number of inverted complementary repeats, and free energy of potential stem-loop structures formed by the repeats. The influence of these factors on translation is numerically estimated. An optimal proportion of these factors is computed for each organism individually. Quantitative translational characteristics of 384 unicellular organisms (351 bacteria, 28 archaea, 5 eukaryota) have been computed using their annotated genomes from NCBI GenBank. Five potential evolutionary strategies of translational optimization have been determined among studied organisms. A considerable difference of preferred translational strategies between Bacteria and Archaea has been revealed. Significant correlations between elongation efficiency index and gene expression levels have been shown for two organisms (S. cerevisiae and H. pylori) using available microarray data. The proposed method allows to estimate numerically the coding DNA sequence translation efficiency and to optimize nucleotide composition of heterologous genes in unicellular organisms. http://www.mgs.bionet.nsc.ru/mgs/programs/eei-calculator/.
1987-08-01
READ COMPASS SEC. z Xl; CYC 0A,3B1818 0879’ CO 131A’ CMPRTN: LER SIRD ;READ SWITCHES SEC. =ALL; CYC 24,25 1819 387C’ CO 1322’ SWRTN: LBE AD ;A/D, NUX...GP:84); G? -> 35 282 OE08’ 50 STR DNA COMPASS TO DNA BUFFER 0B 2883 OE09’ 10 I1C DNA DNA BUFFER TO OC 2884 2885 DEGA’ 72 LDXA ;LOAD BOG (GP:85); GP...8217 3343 ID6’ 44 3D 3344 1iDS’ 8D DB 08DH RELAY POD STATUS 3345 1109’ 20 20 4 4D DB ’ CMPS:’ 3346 11DD’ 50 53 3D 3347 IlEC’ 8DCA DW 08DCAH COMPASS , CR
Facilitated Diffusion of Transcription Factor Proteins with Anomalous Bulk Diffusion.
Liu, Lin; Cherstvy, Andrey G; Metzler, Ralf
2017-02-16
What are the physical laws of the diffusive search of proteins for their specific binding sites on DNA in the presence of the macromolecular crowding in cells? We performed extensive computer simulations to elucidate the protein target search on DNA. The novel feature is the viscoelastic non-Brownian protein bulk diffusion recently observed experimentally. We examine the influence of the protein-DNA binding affinity and the anomalous diffusion exponent on the target search time. In all cases an optimal search time is found. The relative contribution of intermittent three-dimensional bulk diffusion and one-dimensional sliding of proteins along the DNA is quantified. Our results are discussed in the light of recent single molecule tracking experiments, aiming at a better understanding of the influence of anomalous kinetics of proteins on the facilitated diffusion mechanism.
MSuPDA: A Memory Efficient Algorithm for Sequence Alignment.
Khan, Mohammad Ibrahim; Kamal, Md Sarwar; Chowdhury, Linkon
2016-03-01
Space complexity is a million dollar question in DNA sequence alignments. In this regard, memory saving under pushdown automata can help to reduce the occupied spaces in computer memory. Our proposed process is that anchor seed (AS) will be selected from given data set of nucleotide base pairs for local sequence alignment. Quick splitting techniques will separate the AS from all the DNA genome segments. Selected AS will be placed to pushdown automata's (PDA) input unit. Whole DNA genome segments will be placed into PDA's stack. AS from input unit will be matched with the DNA genome segments from stack of PDA. Match, mismatch and indel of nucleotides will be popped from the stack under the control unit of pushdown automata. During the POP operation on stack, it will free the memory cell occupied by the nucleotide base pair.
Gao, Jinting; Liu, Yaqing; Lin, Xiaodong; Deng, Jiankang; Yin, Jinjin; Wang, Shuo
2017-10-25
Wiring a series of simple logic gates to process complex data is significantly important and a large challenge for untraditional molecular computing systems. The programmable property of DNA endows its powerful application in molecular computing. In our investigation, it was found that DNA exhibits excellent peroxidase-like activity in a colorimetric system of TMB/H 2 O 2 /Hemin (TMB, 3,3', 5,5'-Tetramethylbenzidine) in the presence of K + and Cu 2+ , which is significantly inhibited by the addition of an antioxidant. According to the modulated catalytic activity of this DNA-based catalyst, three cascade logic gates including AND-OR-INH (INHIBIT), AND-INH and OR-INH were successfully constructed. Interestingly, by only modulating the concentration of Cu 2+ , a majority logic gate with a single-vote veto function was realized following the same threshold value as that of the cascade logic gates. The strategy is quite straightforward and versatile and provides an instructive method for constructing multiple logic gates on a simple platform to implement complex molecular computing.
NASA Technical Reports Server (NTRS)
Ojha, R. P.; Dhingra, M. M.; Sarma, M. H.; Myer, Y. P.; Setlik, R. F.; Shibata, M.; Kazim, A. L.; Ornstein, R. L.; Rein, R.; Turner, C. J.;
1997-01-01
The structure of an anti-HIV-1 ribozyme-DNA abortive substrate complex was investigated by 750 MHz NMR and computer modeling experiments. The ribozyme was a chimeric molecule with 30 residues-18 DNA nucleotides, and 12 RNA residues in the conserved core. The DNA substrate analog had 17 residues. The chimeric ribozyme and the DNA substrate formed a shortened ribozyme-abortive substrate complex of 47 nucleotides with two DNA stems (stems I and III) and a loop consisting of the conserved core residues. Circular dichroism spectra showed that the DNA stems assume A-family conformation at the NMR concentration and a temperature of 15 degrees C, contrary to the conventional wisdom that DNA duplexes in aqueous solution populate entirely in the B-form. It is proposed that the A-family RNA residues at the core expand the A-family initiated at the core into the DNA stems because of the large free energy requirement for the formation of A/B junctions. Assignments of the base H8/H6 protons and H1' of the 47 residues were made by a NOESY walk. In addition to the methyl groups of all T's, the imino resonances of stems I and III and AH2's were assigned from appropriate NOESY walks. The extracted NMR data along with available crystallographic data, were used to derive a structural model of the complex. Stems I and III of the final model displayed a remarkable similarity to the A form of DNA; in stem III, a GC base pair was found to be moving into the floor of the minor groove defined by flanking AT pairs; data suggest the formation of a buckled rhombic structure with the adjacent pair; in addition, the base pair at the interface of stem III and the loop region displayed deformed geometry. The loop with the catalytic core, and the immediate region of the stems displayed conformational multiplicity within the NMR time scale. A catalytic mechanism for ribozyme action based on the derived structure, and consistent with biochemical data in the literature, is proposed. The complex between the anti HIV-1 gag ribozyme and its abortive DNA substrate manifests in the detection of a continuous track of A.T base pairs; this suggests that the interaction between the ribozyme and its DNA substrate is stronger than the one observed in the case of the free ribozyme where the bases in stem I and stem III regions interact strongly with the ribozyme core region (Sarma, R. H., et al. FEBS Letters 375, 317-23, 1995). The complex formation provides certain guidelines in the design of suitable therapeutic ribozymes. If the residues in the ribozyme stem regions interact with the conserved core, it may either prevent or interfere with the formation of a catalytically active tertiary structure.
Probabilistic switching circuits in DNA
Wilhelm, Daniel; Bruck, Jehoshua
2018-01-01
A natural feature of molecular systems is their inherent stochastic behavior. A fundamental challenge related to the programming of molecular information processing systems is to develop a circuit architecture that controls the stochastic states of individual molecular events. Here we present a systematic implementation of probabilistic switching circuits, using DNA strand displacement reactions. Exploiting the intrinsic stochasticity of molecular interactions, we developed a simple, unbiased DNA switch: An input signal strand binds to the switch and releases an output signal strand with probability one-half. Using this unbiased switch as a molecular building block, we designed DNA circuits that convert an input signal to an output signal with any desired probability. Further, this probability can be switched between 2n different values by simply varying the presence or absence of n distinct DNA molecules. We demonstrated several DNA circuits that have multiple layers and feedback, including a circuit that converts an input strand to an output strand with eight different probabilities, controlled by the combination of three DNA molecules. These circuits combine the advantages of digital and analog computation: They allow a small number of distinct input molecules to control a diverse signal range of output molecules, while keeping the inputs robust to noise and the outputs at precise values. Moreover, arbitrarily complex circuit behaviors can be implemented with just a single type of molecular building block. PMID:29339484
Pavani, Raphael Souza; da Silva, Marcelo Santos; Fernandes, Carlos Alexandre Henrique; Morini, Flavia Souza; Araujo, Christiane Bezerra; Fontes, Marcos Roberto de Mattos; Sant'Anna, Osvaldo Augusto; Machado, Carlos Renato; Cano, Maria Isabel; Fragoso, Stenio Perdigão; Elias, Maria Carolina
2016-12-01
Replication Protein A (RPA), the major single stranded DNA binding protein in eukaryotes, is composed of three subunits and is a fundamental player in DNA metabolism, participating in replication, transcription, repair, and the DNA damage response. In human pathogenic trypanosomatids, only limited studies have been performed on RPA-1 from Leishmania. Here, we performed in silico, in vitro and in vivo analysis of Trypanosoma cruzi RPA-1 and RPA-2 subunits. Although computational analysis suggests similarities in DNA binding and Ob-fold structures of RPA from T. cruzi compared with mammalian and fungi RPA, the predicted tridimensional structures of T. cruzi RPA-1 and RPA-2 indicated that these molecules present a more flexible tertiary structure, suggesting that T. cruzi RPA could be involved in additional responses. Here, we demonstrate experimentally that the T. cruzi RPA complex interacts with DNA via RPA-1 and is directly related to canonical functions, such as DNA replication and DNA damage response. Accordingly, a reduction of TcRPA-2 expression by generating heterozygous knockout cells impaired cell growth, slowing down S-phase progression. Moreover, heterozygous knockout cells presented a better efficiency in differentiation from epimastigote to metacyclic trypomastigote forms and metacyclic trypomastigote infection. Taken together, these findings indicate the involvement of TcRPA in the metacyclogenesis process and suggest that a delay in cell cycle progression could be linked with differentiation in T. cruzi.
Dynamics of genome size evolution in birds and mammals.
Kapusta, Aurélie; Suh, Alexander; Feschotte, Cédric
2017-02-21
Genome size in mammals and birds shows remarkably little interspecific variation compared with other taxa. However, genome sequencing has revealed that many mammal and bird lineages have experienced differential rates of transposable element (TE) accumulation, which would be predicted to cause substantial variation in genome size between species. Thus, we hypothesize that there has been covariation between the amount of DNA gained by transposition and lost by deletion during mammal and avian evolution, resulting in genome size equilibrium. To test this model, we develop computational methods to quantify the amount of DNA gained by TE expansion and lost by deletion over the last 100 My in the lineages of 10 species of eutherian mammals and 24 species of birds. The results reveal extensive variation in the amount of DNA gained via lineage-specific transposition, but that DNA loss counteracted this expansion to various extents across lineages. Our analysis of the rate and size spectrum of deletion events implies that DNA removal in both mammals and birds has proceeded mostly through large segmental deletions (>10 kb). These findings support a unified "accordion" model of genome size evolution in eukaryotes whereby DNA loss counteracting TE expansion is a major determinant of genome size. Furthermore, we propose that extensive DNA loss, and not necessarily a dearth of TE activity, has been the primary force maintaining the greater genomic compaction of flying birds and bats relative to their flightless relatives.
TaxI: a software tool for DNA barcoding using distance methods
Steinke, Dirk; Vences, Miguel; Salzburger, Walter; Meyer, Axel
2005-01-01
DNA barcoding is a promising approach to the diagnosis of biological diversity in which DNA sequences serve as the primary key for information retrieval. Most existing software for evolutionary analysis of DNA sequences was designed for phylogenetic analyses and, hence, those algorithms do not offer appropriate solutions for the rapid, but precise analyses needed for DNA barcoding, and are also unable to process the often large comparative datasets. We developed a flexible software tool for DNA taxonomy, named TaxI. This program calculates sequence divergences between a query sequence (taxon to be barcoded) and each sequence of a dataset of reference sequences defined by the user. Because the analysis is based on separate pairwise alignments this software is also able to work with sequences characterized by multiple insertions and deletions that are difficult to align in large sequence sets (i.e. thousands of sequences) by multiple alignment algorithms because of computational restrictions. Here, we demonstrate the utility of this approach with two datasets of fish larvae and juveniles from Lake Constance and juvenile land snails under different models of sequence evolution. Sets of ribosomal 16S rRNA sequences, characterized by multiple indels, performed as good as or better than cox1 sequence sets in assigning sequences to species, demonstrating the suitability of rRNA genes for DNA barcoding. PMID:16214755
NASA Technical Reports Server (NTRS)
Vecchioni, Simon; Toomey, Emily; Capece, Mark C.; Rothschild, Lynn; Wind, Shalom
2017-01-01
DNA is an ideal template for a biological nanowire-it has a linear structure several atoms thick; it possesses addressable nucleobase geometry that can be precisely defined; and it is massively scalable into branched networks. Until now, the drawback of DNA as a conducting nanowire been, simply put, its low conductance. To address this deficiency, we extensively characterize a chemical variant of canonical DNA that exploits the affinity of natural cytosine bases for silver ions. We successfully construct chains of single silver ions inside double-stranded DNA, confirm the basic dC-Ag+-dC bond geometry and kinetics, and show length-tunability dependent on mismatch distribution, ion availability and enzyme activity. An analysis of the absorbance spectra of natural DNA and silver-binding, poly-cytosine DNA demonstrates the heightened thermostability of the ion chain and its resistance to aqueous stresses such as precipitation, dialysis and forced reduction. These chemically critical traits lend themselves to an increase in electrical conductivity of over an order of magnitude for 11-base silver-paired duplexes over natural strands when assayed by STM break junction. We further construct and implement a genetic pathway in the E. coli bacterium for the biosynthesis of highly ionizable DNA sequences. Toward future circuits, we construct a model of transcription network architectures to determine the most efficient and robust connectivity for cell-based fabrication, and we perform sequence optimization with a genetic algorithm to identify oligonucleotides robust to changes in the base-pairing energy landscape. We propose that this system will serve as a synthetic biological fabrication platform for more complex DNA nanotechnology and nanoelectronics with applications to deep space and low resource environments.
Molecular Dynamics Study of the Opening Mechanism for DNA Polymerase I
Miller, Bill R.; Parish, Carol A.; Wu, Eugene Y.
2014-01-01
During DNA replication, DNA polymerases follow an induced fit mechanism in order to rapidly distinguish between correct and incorrect dNTP substrates. The dynamics of this process are crucial to the overall effectiveness of catalysis. Although X-ray crystal structures of DNA polymerase I with substrate dNTPs have revealed key structural states along the catalytic pathway, solution fluorescence studies indicate that those key states are populated in the absence of substrate. Herein, we report the first atomistic simulations showing the conformational changes between the closed, open, and ajar conformations of DNA polymerase I in the binary (enzyme∶DNA) state to better understand its dynamics. We have applied long time-scale, unbiased molecular dynamics to investigate the opening process of the fingers domain in the absence of substrate for B. stearothermophilis DNA polymerase in silico. These simulations are biologically and/or physiologically relevant as they shed light on the transitions between states in this important enzyme. All closed and ajar simulations successfully transitioned into the fully open conformation, which is known to be the dominant binary enzyme-DNA conformation from solution and crystallographic studies. Furthermore, we have detailed the key stages in the opening process starting from the open and ajar crystal structures, including the observation of a previously unknown key intermediate structure. Four backbone dihedrals were identified as important during the opening process, and their movements provide insight into the recognition of dNTP substrate molecules by the polymerase binary state. In addition to revealing the opening mechanism, this study also demonstrates our ability to study biological events of DNA polymerase using current computational methods without biasing the dynamics. PMID:25474643
Applications of statistical physics and information theory to the analysis of DNA sequences
NASA Astrophysics Data System (ADS)
Grosse, Ivo
2000-10-01
DNA carries the genetic information of most living organisms, and the of genome projects is to uncover that genetic information. One basic task in the analysis of DNA sequences is the recognition of protein coding genes. Powerful computer programs for gene recognition have been developed, but most of them are based on statistical patterns that vary from species to species. In this thesis I address the question if there exist universal statistical patterns that are different in coding and noncoding DNA of all living species, regardless of their phylogenetic origin. In search for such species-independent patterns I study the mutual information function of genomic DNA sequences, and find that it shows persistent period-three oscillations. To understand the biological origin of the observed period-three oscillations, I compare the mutual information function of genomic DNA sequences to the mutual information function of stochastic model sequences. I find that the pseudo-exon model is able to reproduce the mutual information function of genomic DNA sequences. Moreover, I find that a generalization of the pseudo-exon model can connect the existence and the functional form of long-range correlations to the presence and the length distributions of coding and noncoding regions. Based on these theoretical studies I am able to find an information-theoretical quantity, the average mutual information (AMI), whose probability distributions are significantly different in coding and noncoding DNA, while they are almost identical in all studied species. These findings show that there exist universal statistical patterns that are different in coding and noncoding DNA of all studied species, and they suggest that the AMI may be used to identify genes in different living species, irrespective of their taxonomic origin.
NASA Technical Reports Server (NTRS)
Ponomarev, A. L.; Huff, J. L.; Cucinotta, F. A.
2011-01-01
Future long-tem space travel will face challenges from radiation concerns as the space environment poses health risk to humans in space from radiations with high biological efficiency and adverse post-flight long-term effects. Solar particles events may dramatically affect the crew performance, while Galactic Cosmic Rays will induce a chronic exposure to high-linear-energy-transfer (LET) particles. These types of radiation, not present on the ground level, can increase the probability of a fatal cancer later in astronaut life. No feasible shielding is possible from radiation in space, especially for the heavy ion component, as suggested solutions will require a dramatic increase in the mass of the mission. Our research group focuses on fundamental research and strategic analysis leading to better shielding design and to better understanding of the biological mechanisms of radiation damage. We present our recent effort to model DNA damage and tissue damage using computational models based on the physics of heavy ion radiation, DNA structure and DNA damage and repair in human cells. Our particular area of expertise include the clustered DNA damage from high-LET radiation, the visualization of DSBs (DNA double strand breaks) via DNA damage foci, image analysis and the statistics of the foci for different experimental situations, chromosomal aberration formation through DSB misrepair, the kinetics of DSB repair leading to a model-derived spectrum of chromosomal aberrations, and, finally, the simulation of human tissue and the pattern of apoptotic cell damage. This compendium of theoretical and experimental data sheds light on the complex nature of radiation interacting with human DNA, cells and tissues, which can lead to mutagenesis and carcinogenesis later in human life after the space mission.
P-Hint-Hunt: a deep parallelized whole genome DNA methylation detection tool.
Peng, Shaoliang; Yang, Shunyun; Gao, Ming; Liao, Xiangke; Liu, Jie; Yang, Canqun; Wu, Chengkun; Yu, Wenqiang
2017-03-14
The increasing studies have been conducted using whole genome DNA methylation detection as one of the most important part of epigenetics research to find the significant relationships among DNA methylation and several typical diseases, such as cancers and diabetes. In many of those studies, mapping the bisulfite treated sequence to the whole genome has been the main method to study DNA cytosine methylation. However, today's relative tools almost suffer from inaccuracies and time-consuming problems. In our study, we designed a new DNA methylation prediction tool ("Hint-Hunt") to solve the problem. By having an optimal complex alignment computation and Smith-Waterman matrix dynamic programming, Hint-Hunt could analyze and predict the DNA methylation status. But when Hint-Hunt tried to predict DNA methylation status with large-scale dataset, there are still slow speed and low temporal-spatial efficiency problems. In order to solve the problems of Smith-Waterman dynamic programming and low temporal-spatial efficiency, we further design a deep parallelized whole genome DNA methylation detection tool ("P-Hint-Hunt") on Tianhe-2 (TH-2) supercomputer. To the best of our knowledge, P-Hint-Hunt is the first parallel DNA methylation detection tool with a high speed-up to process large-scale dataset, and could run both on CPU and Intel Xeon Phi coprocessors. Moreover, we deploy and evaluate Hint-Hunt and P-Hint-Hunt on TH-2 supercomputer in different scales. The experimental results illuminate our tools eliminate the deviation caused by bisulfite treatment in mapping procedure and the multi-level parallel program yields a 48 times speed-up with 64 threads. P-Hint-Hunt gain a deep acceleration on CPU and Intel Xeon Phi heterogeneous platform, which gives full play of the advantages of multi-cores (CPU) and many-cores (Phi).
Arias, María Elena; Sánchez-Villalba, Esther; Delgado, Andrea; Felmer, Ricardo
2017-02-01
Sperm-mediated gene transfer (SMGT) is based on the capacity of sperm to bind exogenous DNA and transfer it into the oocyte during fertilization. In bovines, the progress of this technology has been slow due to the poor reproducibility and efficiency of the production of transgenic embryos. The aim of the present study was to evaluate the effects of different sperm transfection systems on the quality and functional parameters of sperm. Additionally, the ability of sperm to bind and incorporate exogenous DNA was assessed. These analyses were carried out by flow cytometry and confocal fluorescence microscopy, and motility parameters were also evaluated by computer-assisted sperm analysis (CASA). Transfection was carried out using complexes of plasmid DNA with Lipofectamine, SuperFect and TurboFect for 0.5, 1, 2 or 4 h. The results showed that all of the transfection treatments promoted sperm binding and incorporation of exogenous DNA, similar to sperm incorporation of DNA alone, without affecting the viability. Nevertheless, the treatments and incubation times significantly affected the motility parameters, although no effect on the integrity of DNA or the levels of reactive oxygen species (ROS) was observed. Additionally, we observed that transfection using SuperFect and TurboFect negatively affected the acrosome integrity, and TurboFect affected the mitochondrial membrane potential of sperm. In conclusion, we demonstrated binding and incorporation of exogenous DNA by sperm after transfection and confirmed the capacity of sperm to spontaneously incorporate exogenous DNA. These findings will allow the establishment of the most appropriate method [intracytoplasmic sperm injection (ICSI) or in vitro fertilization (IVF)] of generating transgenic embryos via SMGT based on the fertilization capacity of transfected sperm.
Lando, Dmitri Y; Chang, Chun-Ling; Fridman, Alexander S; Grigoryan, Inessa E; Galyuk, Elena N; Hsueh, Ya-Wei; Hu, Chin-Kun
2014-08-01
Antitumor activity of cisplatin is exerted by covalent binding to DNA. For comparison, studies of cisplatin-DNA complexes often employ the very similar but inactive transplatin. In this work, thermal and thermodynamic properties of DNA complexes with these compounds were studied using differential scanning calorimetry (DSC) and computer modeling. DSC demonstrates that cisplatin decreases thermal stability (melting temperature, Tm) of long DNA, and transplatin increases it. At the same time, both compounds decrease the enthalpy and entropy of the helix-coil transition, and the impact of transplatin is much higher. From Pt/nucleotide molar ratio rb=0.001, both compounds destroy the fine structure of DSC profile and increase the temperature melting range (ΔT). For cisplatin and transplatin, the dependences δTm vs rb differ in sign, while δΔT vs rb are positive for both compounds. The change in the parameter δΔT vs rb demonstrates the GC specificity in the location of DNA distortions. Our experimental results and calculations show that 1) in contrast to [Pt(dien)Cl]Cl, monofunctional adducts formed by transplatin decrease the thermal stability of long DNA at [Na(+)]>30mM; 2) interstrand crosslinks of cisplatin and transplatin only slightly increase Tm; 3) the difference in thermal stability of DNA complexes with cisplatin vs DNA complexes with transplatin mainly arises from the different thermodynamic properties of their intrastrand crosslinks. This type of crosslink appears to be responsible for the antitumor activity of cisplatin. At any [Na(+)] from interval 10-210mM, cisplatin and transplatin intrastrand crosslinks give rise to destabilization and stabilization, respectively. Copyright © 2014 Elsevier Inc. All rights reserved.
Remote control of nanoscale devices
NASA Astrophysics Data System (ADS)
Högberg, Björn
2018-01-01
Processes that occur at the nanometer scale have a tremendous impact on our daily lives. Sophisticated evolved nanomachines operate in each of our cells; we also, as a society, increasingly rely on synthetic nanodevices for communication and computation. Scientists are still only beginning to master this scale, but, recently, DNA nanotechnology (1)—in particular, DNA origami (2)—has emerged as a powerful tool to build structures precise enough to help us do so. On page 296 of this issue, Kopperger et al. (3) show that they are now also able to control the motion of a DNA origami device from the outside by applying electric fields.
Implementation of Arithmetic and Nonarithmetic Functions on a Label-free and DNA-based Platform
NASA Astrophysics Data System (ADS)
Wang, Kun; He, Mengqi; Wang, Jin; He, Ronghuan; Wang, Jianhua
2016-10-01
A series of complex logic gates were constructed based on graphene oxide and DNA-templated silver nanoclusters to perform both arithmetic and nonarithmetic functions. For the purpose of satisfying the requirements of progressive computational complexity and cost-effectiveness, a label-free and universal platform was developed by integration of various functions, including half adder, half subtractor, multiplexer and demultiplexer. The label-free system avoided laborious modification of biomolecules. The designed DNA-based logic gates can be implemented with readout of near-infrared fluorescence, and exhibit great potential applications in the field of bioimaging as well as disease diagnosis.
Mukunthan, B; Nagaveni, N
2014-01-01
In genetic engineering, conventional techniques and algorithms employed by forensic scientists to assist in identification of individuals on the basis of their respective DNA profiles involves more complex computational steps and mathematical formulae, also the identification of location of mutation in a genomic sequence in laboratories is still an exigent task. This novel approach provides ability to solve the problems that do not have an algorithmic solution and the available solutions are also too complex to be found. The perfect blend made of bioinformatics and neural networks technique results in efficient DNA pattern analysis algorithm with utmost prediction accuracy.
Genome-wide association between DNA methylation and alternative splicing in an invertebrate
2012-01-01
Background Gene bodies are the most evolutionarily conserved targets of DNA methylation in eukaryotes. However, the regulatory functions of gene body DNA methylation remain largely unknown. DNA methylation in insects appears to be primarily confined to exons. Two recent studies in Apis mellifera (honeybee) and Nasonia vitripennis (jewel wasp) analyzed transcription and DNA methylation data for one gene in each species to demonstrate that exon-specific DNA methylation may be associated with alternative splicing events. In this study we investigated the relationship between DNA methylation, alternative splicing, and cross-species gene conservation on a genome-wide scale using genome-wide transcription and DNA methylation data. Results We generated RNA deep sequencing data (RNA-seq) to measure genome-wide mRNA expression at the exon- and gene-level. We produced a de novo transcriptome from this RNA-seq data and computationally predicted splice variants for the honeybee genome. We found that exons that are included in transcription are higher methylated than exons that are skipped during transcription. We detected enrichment for alternative splicing among methylated genes compared to unmethylated genes using fisher’s exact test. We performed a statistical analysis to reveal that the presence of DNA methylation or alternative splicing are both factors associated with a longer gene length and a greater number of exons in genes. In concordance with this observation, a conservation analysis using BLAST revealed that each of these factors is also associated with higher cross-species gene conservation. Conclusions This study constitutes the first genome-wide analysis exhibiting a positive relationship between exon-level DNA methylation and mRNA expression in the honeybee. Our finding that methylated genes are enriched for alternative splicing suggests that, in invertebrates, exon-level DNA methylation may play a role in the construction of splice variants by positively influencing exon inclusion during transcription. The results from our cross-species homology analysis suggest that DNA methylation and alternative splicing are genetic mechanisms whose utilization could contribute to a longer gene length and a slower rate of gene evolution. PMID:22978521
OSG-GEM: Gene Expression Matrix Construction Using the Open Science Grid.
Poehlman, William L; Rynge, Mats; Branton, Chris; Balamurugan, D; Feltus, Frank A
2016-01-01
High-throughput DNA sequencing technology has revolutionized the study of gene expression while introducing significant computational challenges for biologists. These computational challenges include access to sufficient computer hardware and functional data processing workflows. Both these challenges are addressed with our scalable, open-source Pegasus workflow for processing high-throughput DNA sequence datasets into a gene expression matrix (GEM) using computational resources available to U.S.-based researchers on the Open Science Grid (OSG). We describe the usage of the workflow (OSG-GEM), discuss workflow design, inspect performance data, and assess accuracy in mapping paired-end sequencing reads to a reference genome. A target OSG-GEM user is proficient with the Linux command line and possesses basic bioinformatics experience. The user may run this workflow directly on the OSG or adapt it to novel computing environments.
OSG-GEM: Gene Expression Matrix Construction Using the Open Science Grid
Poehlman, William L.; Rynge, Mats; Branton, Chris; Balamurugan, D.; Feltus, Frank A.
2016-01-01
High-throughput DNA sequencing technology has revolutionized the study of gene expression while introducing significant computational challenges for biologists. These computational challenges include access to sufficient computer hardware and functional data processing workflows. Both these challenges are addressed with our scalable, open-source Pegasus workflow for processing high-throughput DNA sequence datasets into a gene expression matrix (GEM) using computational resources available to U.S.-based researchers on the Open Science Grid (OSG). We describe the usage of the workflow (OSG-GEM), discuss workflow design, inspect performance data, and assess accuracy in mapping paired-end sequencing reads to a reference genome. A target OSG-GEM user is proficient with the Linux command line and possesses basic bioinformatics experience. The user may run this workflow directly on the OSG or adapt it to novel computing environments. PMID:27499617
A Review of Computational Intelligence Methods for Eukaryotic Promoter Prediction.
Singh, Shailendra; Kaur, Sukhbir; Goel, Neelam
2015-01-01
In past decades, prediction of genes in DNA sequences has attracted the attention of many researchers but due to its complex structure it is extremely intricate to correctly locate its position. A large number of regulatory regions are present in DNA that helps in transcription of a gene. Promoter is one such region and to find its location is a challenging problem. Various computational methods for promoter prediction have been developed over the past few years. This paper reviews these promoter prediction methods. Several difficulties and pitfalls encountered by these methods are also detailed, along with future research directions.
Grindon, Christina; Harris, Sarah; Evans, Tom; Novik, Keir; Coveney, Peter; Laughton, Charles
2004-07-15
Molecular modelling played a central role in the discovery of the structure of DNA by Watson and Crick. Today, such modelling is done on computers: the more powerful these computers are, the more detailed and extensive can be the study of the dynamics of such biological macromolecules. To fully harness the power of modern massively parallel computers, however, we need to develop and deploy algorithms which can exploit the structure of such hardware. The Large-scale Atomic/Molecular Massively Parallel Simulator (LAMMPS) is a scalable molecular dynamics code including long-range Coulomb interactions, which has been specifically designed to function efficiently on parallel platforms. Here we describe the implementation of the AMBER98 force field in LAMMPS and its validation for molecular dynamics investigations of DNA structure and flexibility against the benchmark of results obtained with the long-established code AMBER6 (Assisted Model Building with Energy Refinement, version 6). Extended molecular dynamics simulations on the hydrated DNA dodecamer d(CTTTTGCAAAAG)(2), which has previously been the subject of extensive dynamical analysis using AMBER6, show that it is possible to obtain excellent agreement in terms of static, dynamic and thermodynamic parameters between AMBER6 and LAMMPS. In comparison with AMBER6, LAMMPS shows greatly improved scalability in massively parallel environments, opening up the possibility of efficient simulations of order-of-magnitude larger systems and/or for order-of-magnitude greater simulation times.
Jermusek, Frank; Benedict, Chelsea; Dreischmeier, Emma; Brand, Michael; Uder, Michael; Jeffery, Justin J; Ranallo, Frank N; Fahl, William E
2018-05-21
While computed tomography (CT) is now commonly used and considered to be clinically valuable, significant DNA double-strand breaks (γ-H2AX foci) in white blood cells from adult and pediatric CT patients have been frequently reported. In this study to determine whether γ-H2AX foci and X-ray-induced naked DNA damage are suppressed by administration of the PrC-210 radioprotector, human blood samples were irradiated in a CT scanner at 50-150 mGy with or without PrC-210, and γ-H2AX foci were scored. X-ray-induced naked DNA damage was also studied, and the DNA protective efficacy of PrC-210 was compared against 12 other common "antioxidants." PrC-210 reduced CT radiation-induced γ-H2AX foci in white blood cells to near background ( P < 0.0001) at radiation doses of 50-150 mGy. PrC-210 was most effective among the 13 "antioxidants" in reducing naked DNA X-ray damage, and its addition at 30 s before an • OH pulse reduced to background the • OH insult that otherwise induced >95% DNA damage. A systemic PrC-210 dose known to confer 100% survival in irradiated mice had no discernible effect on micro-CT image signal-to-noise ratio and CT image integrity. PrC-210 suppressed DNA damage to background or near background in each of these assay systems, thus supporting its development as a radioprotector for humans in multiple radiation exposure settings.
Carbon-14 decay as a source of non-canonical bases in DNA.
Sassi, Michel; Carter, Damien J; Uberuaga, Blas P; Stanek, Chris R; Marks, Nigel A
2014-01-01
Significant experimental effort has been applied to study radioactive beta-decay in biological systems. Atomic-scale knowledge of this transmutation process is lacking due to the absence of computer simulations. Carbon-14 is an important beta-emitter, being ubiquitous in the environment and an intrinsic part of the genetic code. Over a lifetime, around 50 billion (14)C decays occur within human DNA. We apply ab initio molecular dynamics to quantify (14)C-induced bond rupture in a variety of organic molecules, including DNA base pairs. We show that double bonds and ring structures confer radiation resistance. These features, present in the canonical bases of the DNA, enhance their resistance to (14)C-induced bond-breaking. In contrast, the sugar group of the DNA and RNA backbone is vulnerable to single-strand breaking. We also show that Carbon-14 decay provides a mechanism for creating mutagenic wobble-type mispairs. The observation that DNA has a resistance to natural radioactivity has not previously been recognized. We show that (14)C decay can be a source for generating non-canonical bases. Our findings raise questions such as how the genetic apparatus deals with the appearance of an extra nitrogen in the canonical bases. It is not obvious whether or not the DNA repair mechanism detects this modification nor how DNA replication is affected by a non-canonical nucleobase. Accordingly, (14)C may prove to be a source of genetic alteration that is impossible to avoid due to the universal presence of radiocarbon in the environment. © 2013.
Yang, Changwon; Kim, Eunae; Pak, Youngshang
2015-01-01
Houghton (HG) base pairing plays a central role in the DNA binding of proteins and small ligands. Probing detailed transition mechanism from Watson–Crick (WC) to HG base pair (bp) formation in duplex DNAs is of fundamental importance in terms of revealing intrinsic functions of double helical DNAs beyond their sequence determined functions. We investigated a free energy landscape of a free B-DNA with an adenosine–thymine (A–T) rich sequence to probe its conformational transition pathways from WC to HG base pairing. The free energy landscape was computed with a state-of-art two-dimensional umbrella molecular dynamics simulation at the all-atom level. The present simulation showed that in an isolated duplex DNA, the spontaneous transition from WC to HG bp takes place via multiple pathways. Notably, base flipping into the major and minor grooves was found to play an important role in forming these multiple transition pathways. This finding suggests that naked B-DNA under normal conditions has an inherent ability to form HG bps via spontaneous base opening events. PMID:26250116
Highly Accurate Classification of Watson-Crick Basepairs on Termini of Single DNA Molecules
Winters-Hilt, Stephen; Vercoutere, Wenonah; DeGuzman, Veronica S.; Deamer, David; Akeson, Mark; Haussler, David
2003-01-01
We introduce a computational method for classification of individual DNA molecules measured by an α-hemolysin channel detector. We show classification with better than 99% accuracy for DNA hairpin molecules that differ only in their terminal Watson-Crick basepairs. Signal classification was done in silico to establish performance metrics (i.e., where train and test data were of known type, via single-species data files). It was then performed in solution to assay real mixtures of DNA hairpins. Hidden Markov Models (HMMs) were used with Expectation/Maximization for denoising and for associating a feature vector with the ionic current blockade of the DNA molecule. Support Vector Machines (SVMs) were used as discriminators, and were the focus of off-line training. A multiclass SVM architecture was designed to place less discriminatory load on weaker discriminators, and novel SVM kernels were used to boost discrimination strength. The tuning on HMMs and SVMs enabled biophysical analysis of the captured molecule states and state transitions; structure revealed in the biophysical analysis was used for better feature selection. PMID:12547778
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pluta, Radoslaw; Boer, D. Roeland; Lorenzo-Diaz, Fabian
Relaxases are metal-dependent nucleases that break and join DNA for the initiation and completion of conjugative bacterial gene transfer. Conjugation is the main process through which antibiotic resistance spreads among bacteria, with multidrug-resistant staphylococci and streptococci infections posing major threats to human health. The MOB V family of relaxases accounts for approximately 85% of all relaxases found in Staphylococcus aureus isolates. Here, we present six structures of the MOB V relaxase MobM from the promiscuous plasmid pMV158 in complex with several origin of transfer DNA fragments. A combined structural, biochemical, and computational approach reveals that MobM follows a previously uncharacterizedmore » histidine/metal-dependent DNA processing mechanism, which involves the formation of a covalent phosphoramidate histidine-DNA adduct for cell-to-cell transfer. In conclusion, we discuss how the chemical features of the high-energy phosphorus-nitrogen bond shape the dominant position of MOB V histidine relaxases among small promiscuous plasmids and their preference toward Gram-positive bacteria.« less
Pluta, Radoslaw; Boer, D. Roeland; Lorenzo-Diaz, Fabian; ...
2017-07-24
Relaxases are metal-dependent nucleases that break and join DNA for the initiation and completion of conjugative bacterial gene transfer. Conjugation is the main process through which antibiotic resistance spreads among bacteria, with multidrug-resistant staphylococci and streptococci infections posing major threats to human health. The MOB V family of relaxases accounts for approximately 85% of all relaxases found in Staphylococcus aureus isolates. Here, we present six structures of the MOB V relaxase MobM from the promiscuous plasmid pMV158 in complex with several origin of transfer DNA fragments. A combined structural, biochemical, and computational approach reveals that MobM follows a previously uncharacterizedmore » histidine/metal-dependent DNA processing mechanism, which involves the formation of a covalent phosphoramidate histidine-DNA adduct for cell-to-cell transfer. In conclusion, we discuss how the chemical features of the high-energy phosphorus-nitrogen bond shape the dominant position of MOB V histidine relaxases among small promiscuous plasmids and their preference toward Gram-positive bacteria.« less
Georgieva, Milena; Zagorchev, Plamen; Miloshev, George
2015-10-01
Comet assay is an invaluable tool in DNA research. It is widely used to detect DNA damage as an indicator of exposure to genotoxic stress. A canonical set of parameters and specialized software programs exist for Comet assay data quantification and analysis. None of them so far has proven its potential to employ a computer-based algorithm for assessment of the shape of the comet as an indicator of the exact mechanism by which the studied genotoxins cut in the molecule of DNA. Here, we present 14 unique measurements of the comet image based on the comet morphology. Their mathematical derivation and statistical analysis allowed precise description of the shape of the comet image which in turn discriminated the cause of genotoxic stress. This algorithm led to the development of the "CometShape" software which allowed easy discrimination among different genotoxins depending on the type of DNA damage they induce. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Addressable configurations of DNA nanostructures for rewritable memory.
Chandrasekaran, Arun Richard; Levchenko, Oksana; Patel, Dhruv S; MacIsaac, Molly; Halvorsen, Ken
2017-11-02
DNA serves as nature's information storage molecule, and has been the primary focus of engineered systems for biological computing and data storage. Here we combine recent efforts in DNA self-assembly and toehold-mediated strand displacement to develop a rewritable multi-bit DNA memory system. The system operates by encoding information in distinct and reversible conformations of a DNA nanoswitch and decoding by gel electrophoresis. We demonstrate a 5-bit system capable of writing, erasing, and rewriting binary representations of alphanumeric symbols, as well as compatibility with 'OR' and 'AND' logic operations. Our strategy is simple to implement, requiring only a single mixing step at room temperature for each operation and standard gel electrophoresis to read the data. We envision such systems could find use in covert product labeling and barcoding, as well as secure messaging and authentication when combined with previously developed encryption strategies. Ultimately, this type of memory has exciting potential in biomedical sciences as data storage can be coupled to sensing of biological molecules. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Fluctuations in the DNA double helix
NASA Astrophysics Data System (ADS)
Peyrard, M.; López, S. C.; Angelov, D.
2007-08-01
DNA is not the static entity suggested by the famous double helix structure. It shows large fluctuational openings, in which the bases, which contain the genetic code, are temporarily open. Therefore it is an interesting system to study the effect of nonlinearity on the physical properties of a system. A simple model for DNA, at a mesoscopic scale, can be investigated by computer simulation, in the same spirit as the original work of Fermi, Pasta and Ulam. These calculations raise fundamental questions in statistical physics because they show a temporary breaking of equipartition of energy, regions with large amplitude fluctuations being able to coexist with regions where the fluctuations are very small, even when the model is studied in the canonical ensemble. This phenomenon can be related to nonlinear excitations in the model. The ability of the model to describe the actual properties of DNA is discussed by comparing theoretical and experimental results for the probability that base pairs open an a given temperature in specific DNA sequences. These studies give us indications on the proper description of the effect of the sequence in the mesoscopic model.
NASA Astrophysics Data System (ADS)
Rodriguez, Jorge H.; Deligkaris, Christos
2013-03-01
Investigating the complementary, but different, effects of physical (non-covalent) and chemical (covalent) mutagen-DNA and carcinogen-DNA interactions is important for understanding possible mechanisms of development and prevention of mutagenesis and carcinogenesis. A highly mutagenic and carcinogenic metabolite of the polycyclic aromatic hydrocarbon benzo[ α]pyrene, namely (+)-anti-BPDE, is known to undergo both physical and chemical complexation with DNA. The major covalent adduct, a promutagenic, is known to be an external (+)-trans-anti-BPDE-N2-dGuanosine configuration whose origins are not fully understood. Thus, it is desirable to study the mechanisms of external non-covalent BPDE-DNA binding and their possible relationships to external covalent trans adduct formation. We present a detailed codon-by-codon computational study of the non-covalent interactions of (+)-anti-BPDE with DNA which explains and correctly predicts preferential (+)-anti-BPDE binding at minor groove guanosines. Due to its relevance to carcinogenesis, the interaction of (+)-anti-BPDE with exon 1 of the human K-ras gene has been studied in detail. Present address: Department of Physics, Drury University
Red light improves spermatozoa motility and does not induce oxidative DNA damage
NASA Astrophysics Data System (ADS)
Preece, Daryl; Chow, Kay W.; Gomez-Godinez, Veronica; Gustafson, Kyle; Esener, Selin; Ravida, Nicole; Durrant, Barbara; Berns, Michael W.
2017-04-01
The ability to successfully fertilize ova relies upon the swimming ability of spermatozoa. Both in humans and in animals, sperm motility has been used as a metric for the viability of semen samples. Recently, several studies have examined the efficacy of low dosage red light exposure for cellular repair and increasing sperm motility. Of prime importance to the practical application of this technique is the absence of DNA damage caused by radiation exposure. In this study, we examine the effect of 633 nm coherent, red laser light on sperm motility using a novel wavelet-based algorithm that allows for direct measurement of curvilinear velocity under red light illumination. This new algorithm gives results comparable to the standard computer-assisted sperm analysis (CASA) system. We then assess the safety of red light treatment of sperm by analyzing, (1) the levels of double-strand breaks in the DNA, and (2) oxidative damage in the sperm DNA. The results demonstrate that for the parameters used there are insignificant differences in oxidative DNA damage as a result of irradiation.
A multiple-alignment based primer design algorithm for genetically highly variable DNA targets
2013-01-01
Background Primer design for highly variable DNA sequences is difficult, and experimental success requires attention to many interacting constraints. The advent of next-generation sequencing methods allows the investigation of rare variants otherwise hidden deep in large populations, but requires attention to population diversity and primer localization in relatively conserved regions, in addition to recognized constraints typically considered in primer design. Results Design constraints include degenerate sites to maximize population coverage, matching of melting temperatures, optimizing de novo sequence length, finding optimal bio-barcodes to allow efficient downstream analyses, and minimizing risk of dimerization. To facilitate primer design addressing these and other constraints, we created a novel computer program (PrimerDesign) that automates this complex procedure. We show its powers and limitations and give examples of successful designs for the analysis of HIV-1 populations. Conclusions PrimerDesign is useful for researchers who want to design DNA primers and probes for analyzing highly variable DNA populations. It can be used to design primers for PCR, RT-PCR, Sanger sequencing, next-generation sequencing, and other experimental protocols targeting highly variable DNA samples. PMID:23965160
The complete DNA sequence of lymphocystis disease virus.
Tidona, C A; Darai, G
1997-04-14
Lymphocystis disease virus (LCDV) is the causative agent of lymphocystis disease, which has been reported to occur in over 100 different fish species worldwide. LCDV is a member of the family Iridoviridae and the type species of the genus Lymphocystivirus. The virions contain a single linear double-stranded DNA molecule, which is circularly permuted, terminally redundant, and heavily methylated at cytosines in CpG sequences. The complete nucleotide sequence of LCDV-1 (flounder isolate) was determined by automated cycle sequencing and primer walking. The genome of LCDV-1 is 102.653 bp in length and contains 195 open reading frames with coding capacities ranging from 40 to 1199 amino acids. Computer-assisted analyses of the deduced amino acid sequences led to the identification of several putative gene products with significant homologies to entries in protein data banks, such as the two major subunits of the viral DNA-dependent RNA polymerase, DNA polymerase, several protein kinases, two subunits of the ribonucleoside diphosphate reductase, DNA methyltransferase, the viral major capsid protein, insulin-like growth factor, and tumor necrosis factor receptor homolog.
Programmable autonomous synthesis of single-stranded DNA
NASA Astrophysics Data System (ADS)
Kishi, Jocelyn Y.; Schaus, Thomas E.; Gopalkrishnan, Nikhil; Xuan, Feng; Yin, Peng
2018-02-01
DNA performs diverse functional roles in biology, nanotechnology and biotechnology, but current methods for autonomously synthesizing arbitrary single-stranded DNA are limited. Here, we introduce the concept of primer exchange reaction (PER) cascades, which grow nascent single-stranded DNA with user-specified sequences following prescribed reaction pathways. PER synthesis happens in a programmable, autonomous, in situ and environmentally responsive fashion, providing a platform for engineering molecular circuits and devices with a wide range of sensing, monitoring, recording, signal-processing and actuation capabilities. We experimentally demonstrate a nanodevice that transduces the detection of a trigger RNA into the production of a DNAzyme that degrades an independent RNA substrate, a signal amplifier that conditionally synthesizes long fluorescent strands only in the presence of a particular RNA signal, molecular computing circuits that evaluate logic (AND, OR, NOT) combinations of RNA inputs, and a temporal molecular event recorder that records in the PER transcript the order in which distinct RNA inputs are sequentially detected.
Programmable autonomous synthesis of single-stranded DNA.
Kishi, Jocelyn Y; Schaus, Thomas E; Gopalkrishnan, Nikhil; Xuan, Feng; Yin, Peng
2018-02-01
DNA performs diverse functional roles in biology, nanotechnology and biotechnology, but current methods for autonomously synthesizing arbitrary single-stranded DNA are limited. Here, we introduce the concept of primer exchange reaction (PER) cascades, which grow nascent single-stranded DNA with user-specified sequences following prescribed reaction pathways. PER synthesis happens in a programmable, autonomous, in situ and environmentally responsive fashion, providing a platform for engineering molecular circuits and devices with a wide range of sensing, monitoring, recording, signal-processing and actuation capabilities. We experimentally demonstrate a nanodevice that transduces the detection of a trigger RNA into the production of a DNAzyme that degrades an independent RNA substrate, a signal amplifier that conditionally synthesizes long fluorescent strands only in the presence of a particular RNA signal, molecular computing circuits that evaluate logic (AND, OR, NOT) combinations of RNA inputs, and a temporal molecular event recorder that records in the PER transcript the order in which distinct RNA inputs are sequentially detected.
Antibacterial Drug Leads: DNA and Enzyme Multitargeting
Zhu, Wei; Wang, Yang; Li, Kai; ...
2015-01-09
Here, we report the results of an investigation of the activity of a series of amidine and bisamidine compounds against Staphylococcus aureus and Escherichia coli. The most active compounds bound to an AT-rich DNA dodecamer (CGCGAATTCGCG) 2 and using DSC were found to increase the melting transition by up to 24 °C. Several compounds also inhibited undecaprenyl diphosphate synthase (UPPS) with IC 50 values of 100–500 nM, and we found good correlations (R 2 = 0.89, S. aureus; R 2 = 0.79, E. coli) between experimental and predicted cell growth inhibition by using DNA ΔT m and UPPS IC 50more » experimental results together with one computed descriptor. Finally, we also solved the structures of three bisamidines binding to DNA as well as three UPPS structures. Overall, the results are of general interest in the context of the development of resistance-resistant antibiotics that involve multitargeting.« less
End-to-end distance and contour length distribution functions of DNA helices
NASA Astrophysics Data System (ADS)
Zoli, Marco
2018-06-01
I present a computational method to evaluate the end-to-end and the contour length distribution functions of short DNA molecules described by a mesoscopic Hamiltonian. The method generates a large statistical ensemble of possible configurations for each dimer in the sequence, selects the global equilibrium twist conformation for the molecule, and determines the average base pair distances along the molecule backbone. Integrating over the base pair radial and angular fluctuations, I derive the room temperature distribution functions as a function of the sequence length. The obtained values for the most probable end-to-end distance and contour length distance, providing a measure of the global molecule size, are used to examine the DNA flexibility at short length scales. It is found that, also in molecules with less than ˜60 base pairs, coiled configurations maintain a large statistical weight and, consistently, the persistence lengths may be much smaller than in kilo-base DNA.
Computational optimisation of targeted DNA sequencing for cancer detection
NASA Astrophysics Data System (ADS)
Martinez, Pierre; McGranahan, Nicholas; Birkbak, Nicolai Juul; Gerlinger, Marco; Swanton, Charles
2013-12-01
Despite recent progress thanks to next-generation sequencing technologies, personalised cancer medicine is still hampered by intra-tumour heterogeneity and drug resistance. As most patients with advanced metastatic disease face poor survival, there is need to improve early diagnosis. Analysing circulating tumour DNA (ctDNA) might represent a non-invasive method to detect mutations in patients, facilitating early detection. In this article, we define reduced gene panels from publicly available datasets as a first step to assess and optimise the potential of targeted ctDNA scans for early tumour detection. Dividing 4,467 samples into one discovery and two independent validation cohorts, we show that up to 76% of 10 cancer types harbour at least one mutation in a panel of only 25 genes, with high sensitivity across most tumour types. Our analyses demonstrate that targeting ``hotspot'' regions would introduce biases towards in-frame mutations and would compromise the reproducibility of tumour detection.
Adaptive resolution simulation of oligonucleotides
NASA Astrophysics Data System (ADS)
Netz, Paulo A.; Potestio, Raffaello; Kremer, Kurt
2016-12-01
Nucleic acids are characterized by a complex hierarchical structure and a variety of interaction mechanisms with other molecules. These features suggest the need of multiscale simulation methods in order to grasp the relevant physical properties of deoxyribonucleic acid (DNA) and RNA using in silico experiments. Here we report an implementation of a dual-resolution modeling of a DNA oligonucleotide in physiological conditions; in the presented setup only the nucleotide molecule and the solvent and ions in its proximity are described at the atomistic level; in contrast, the water molecules and ions far from the DNA are represented as computationally less expensive coarse-grained particles. Through the analysis of several structural and dynamical parameters, we show that this setup reliably reproduces the physical properties of the DNA molecule as observed in reference atomistic simulations. These results represent a first step towards a realistic multiscale modeling of nucleic acids and provide a quantitatively solid ground for their simulation using dual-resolution methods.
NASA Astrophysics Data System (ADS)
Talyzina, A. A.; Agapova, Yu. K.; Podshivalov, D. D.; Timofeev, V. I.; Sidorov-Biryukov, D. D.; Rakitina, T. V.
2017-11-01
DNA-Binding HU proteins are essential for the maintenance of genomic DNA supercoiling and compaction in prokaryotic cells and are promising pharmacological targets for the design of new antibacterial agents. The virtual screening for low-molecular-weight compounds capable of specifically interacting with the DNA-recognition loop of the HU protein from the mycoplasma Spiroplasma melliferum was performed. The ability of the initially selected ligands to form stable complexes with the protein target was assessed by molecular dynamics simulation. One compound, which forms an unstable complex, was eliminated by means of a combination of computational methods, resulting in a decrease in the number of compounds that will pass to the experimental test phase. This approach can be used to solve a wide range of problems related to the search for and validation of low-molecular-weight inhibitors specific for a particular protein target.
MSuPDA: A memory efficient algorithm for sequence alignment.
Khan, Mohammad Ibrahim; Kamal, Md Sarwar; Chowdhury, Linkon
2015-01-16
Space complexity is a million dollar question in DNA sequence alignments. In this regards, MSuPDA (Memory Saving under Pushdown Automata) can help to reduce the occupied spaces in computer memory. Our proposed process is that Anchor Seed (AS) will be selected from given data set of Nucleotides base pairs for local sequence alignment. Quick Splitting (QS) techniques will separate the Anchor Seed from all the DNA genome segments. Selected Anchor Seed will be placed to pushdown Automata's (PDA) input unit. Whole DNA genome segments will be placed into PDA's stack. Anchor Seed from input unit will be matched with the DNA genome segments from stack of PDA. Whatever matches, mismatches or Indel, of Nucleotides will be POP from the stack under the control of control unit of Pushdown Automata. During the POP operation on stack it will free the memory cell occupied by the Nucleotide base pair.
Determining the optimal forensic DNA analysis procedure following investigation of sample quality.
Hedell, Ronny; Hedman, Johannes; Mostad, Petter
2018-07-01
Crime scene traces of various types are routinely sent to forensic laboratories for analysis, generally with the aim of addressing questions about the source of the trace. The laboratory may choose to analyse the samples in different ways depending on the type and quality of the sample, the importance of the case and the cost and performance of the available analysis methods. Theoretically well-founded guidelines for the choice of analysis method are, however, lacking in most situations. In this paper, it is shown how such guidelines can be created using Bayesian decision theory. The theory is applied to forensic DNA analysis, showing how the information from the initial qPCR analysis can be utilized. It is assumed the alternatives for analysis are using a standard short tandem repeat (STR) DNA analysis assay, using the standard assay and a complementary assay, or the analysis may be cancelled following quantification. The decision is based on information about the DNA amount and level of DNA degradation of the forensic sample, as well as case circumstances and the cost for analysis. Semi-continuous electropherogram models are used for simulation of DNA profiles and for computation of likelihood ratios. It is shown how tables and graphs, prepared beforehand, can be used to quickly find the optimal decision in forensic casework.
Theory on the mechanism of site-specific DNA-protein interactions in the presence of traps
NASA Astrophysics Data System (ADS)
Niranjani, G.; Murugan, R.
2016-08-01
The speed of site-specific binding of transcription factor (TFs) proteins with genomic DNA seems to be strongly retarded by the randomly occurring sequence traps. Traps are those DNA sequences sharing significant similarity with the original specific binding sites (SBSs). It is an intriguing question how the naturally occurring TFs and their SBSs are designed to manage the retarding effects of such randomly occurring traps. We develop a simple random walk model on the site-specific binding of TFs with genomic DNA in the presence of sequence traps. Our dynamical model predicts that (a) the retarding effects of traps will be minimum when the traps are arranged around the SBS such that there is a negative correlation between the binding strength of TFs with traps and the distance of traps from the SBS and (b) the retarding effects of sequence traps can be appeased by the condensed conformational state of DNA. Our computational analysis results on the distribution of sequence traps around the putative binding sites of various TFs in mouse and human genome clearly agree well the theoretical predictions. We propose that the distribution of traps can be used as an additional metric to efficiently identify the SBSs of TFs on genomic DNA.
Ranjbar, Reza; Hafezi-Moghadam, Mohammad Sadegh
2016-02-01
With all of the developments on infectious diseases, tuberculosis (TB) remains a cause of death among people. One of the most promising assembly techniques in nano-technology is "scaffolded DNA origami" to design and construct a nano-scale drug delivery system. Because of the global health problems of tuberculosis, the development of potent new anti-tuberculosis drug delivery system without cross-resistance with known anti-mycobacterial agents is urgently needed. The aim of this study was to design a nano-scale drug delivery system for TB treatment using the DNA origami method. In this study, we presented an experimental research on a DNA drug delivery system for treating Tuberculosis. TEM images were visualized with an FEI Tecnai T12 BioTWIN at 120 kV. The model was designed by caDNAno software and computational prediction of the 3D solution shape and its flexibility was calculated with a CanDo server. Synthesizing the product was imaged using transmission electron microscopy after negative-staining by uranyl formate. We constructed a multilayer 3D DNA nanostructure system by designing square lattice geometry with the scaffolded-DNA-origami method. With changes in the lock and key sequences, we recommend that this system be used for other infectious diseases to target the pathogenic bacteria.
Development of novel vaccines using DNA shuffling and screening strategies.
Locher, Christopher P; Soong, Nay Wei; Whalen, Robert G; Punnonen, Juha
2004-02-01
DNA shuffling and screening technologies recombine and evolve genes in vitro to rapidly obtain molecules with improved biological activity and fitness. In this way, genes from related strains are bred like plants or livestock and their successive progeny are selected. These technologies have also been called molecular breeding-directed molecular evolution. Recent developments in bioinformatics-assisted computer programs have facilitated the design, synthesis and analysis of DNA shuffled libraries of chimeric molecules. New applications in vaccine development are among the key features of DNA shuffling and screening technologies because genes from several strains or antigenic variants of pathogens can be recombined to create novel molecules capable of inducing immune responses that protect against infections by multiple strains of pathogens. In addition, molecules such as co-stimulatory molecules and cytokines have been evolved to have improved T-cell proliferation and cytokine production compared with the wild-type human molecules. These molecules can be used to immunomodulate vaccine responsiveness and have multiple applications in infectious diseases, cancer, allergy and autoimmunity. Moreover, DNA shuffling and screening technologies can facilitate process development of vaccine manufacturing through increased expression of recombinant polypeptides and viruses. Therefore, DNA shuffling and screening technologies can overcome some of the challenges that vaccine development currently faces.
Alignment of high-throughput sequencing data inside in-memory databases.
Firnkorn, Daniel; Knaup-Gregori, Petra; Lorenzo Bermejo, Justo; Ganzinger, Matthias
2014-01-01
In times of high-throughput DNA sequencing techniques, performance-capable analysis of DNA sequences is of high importance. Computer supported DNA analysis is still an intensive time-consuming task. In this paper we explore the potential of a new In-Memory database technology by using SAP's High Performance Analytic Appliance (HANA). We focus on read alignment as one of the first steps in DNA sequence analysis. In particular, we examined the widely used Burrows-Wheeler Aligner (BWA) and implemented stored procedures in both, HANA and the free database system MySQL, to compare execution time and memory management. To ensure that the results are comparable, MySQL has been running in memory as well, utilizing its integrated memory engine for database table creation. We implemented stored procedures, containing exact and inexact searching of DNA reads within the reference genome GRCh37. Due to technical restrictions in SAP HANA concerning recursion, the inexact matching problem could not be implemented on this platform. Hence, performance analysis between HANA and MySQL was made by comparing the execution time of the exact search procedures. Here, HANA was approximately 27 times faster than MySQL which means, that there is a high potential within the new In-Memory concepts, leading to further developments of DNA analysis procedures in the future.
A Comparison Study for DNA Motif Modeling on Protein Binding Microarray.
Wong, Ka-Chun; Li, Yue; Peng, Chengbin; Wong, Hau-San
2016-01-01
Transcription factor binding sites (TFBSs) are relatively short (5-15 bp) and degenerate. Identifying them is a computationally challenging task. In particular, protein binding microarray (PBM) is a high-throughput platform that can measure the DNA binding preference of a protein in a comprehensive and unbiased manner; for instance, a typical PBM experiment can measure binding signal intensities of a protein to all possible DNA k-mers (k = 8∼10). Since proteins can often bind to DNA with different binding intensities, one of the major challenges is to build TFBS (also known as DNA motif) models which can fully capture the quantitative binding affinity data. To learn DNA motif models from the non-convex objective function landscape, several optimization methods are compared and applied to the PBM motif model building problem. In particular, representative methods from different optimization paradigms have been chosen for modeling performance comparison on hundreds of PBM datasets. The results suggest that the multimodal optimization methods are very effective for capturing the binding preference information from PBM data. In particular, we observe a general performance improvement if choosing di-nucleotide modeling over mono-nucleotide modeling. In addition, the models learned by the best-performing method are applied to two independent applications: PBM probe rotation testing and ChIP-Seq peak sequence prediction, demonstrating its biological applicability.
Quantum Mechanical Modeling of Ballistic MOSFETs
NASA Technical Reports Server (NTRS)
Svizhenko, Alexei; Anantram, M. P.; Govindan, T. R.; Biegel, Bryan (Technical Monitor)
2001-01-01
The objective of this project was to develop theory, approximations, and computer code to model quasi 1D structures such as nanotubes, DNA, and MOSFETs: (1) Nanotubes: Influence of defects on ballistic transport, electro-mechanical properties, and metal-nanotube coupling; (2) DNA: Model electron transfer (biochemistry) and transport experiments, and sequence dependence of conductance; and (3) MOSFETs: 2D doping profiles, polysilicon depletion, source to drain and gate tunneling, understand ballistic limit.
Hummer, G; García, A E; Soumpasis, D M
1995-01-01
A computationally efficient method to describe the organization of water around solvated biomolecules is presented. It is based on a statistical mechanical expression for the water-density distribution in terms of particle correlation functions. The method is applied to analyze the hydration of small nucleic acid molecules in the crystal environment, for which high-resolution x-ray crystal structures have been reported. Results for RNA [r(ApU).r(ApU)] and DNA [d(CpG).d(CpG) in Z form and with parallel strand orientation] and for DNA-drug complexes [d(CpG).d(CpG) with the drug proflavine intercalated] are described. A detailed comparison of theoretical and experimental data shows positional agreement for the experimentally observed water sites. The presented method can be used for refinement of the water structure in x-ray crystallography, hydration analysis of nuclear magnetic resonance structures, and theoretical modeling of biological macromolecules such as molecular docking studies. The speed of the computations allows hydration analyses of molecules of almost arbitrary size (tRNA, protein-nucleic acid complexes, etc.) in the crystal environment and in aqueous solution. Images FIGURE 1 FIGURE 2 FIGURE 5 FIGURE 6 FIGURE 9 FIGURE 12 FIGURE 13 PMID:7542034
Quantification of DNA cleavage specificity in Hi-C experiments.
Meluzzi, Dario; Arya, Gaurav
2016-01-08
Hi-C experiments produce large numbers of DNA sequence read pairs that are typically analyzed to deduce genomewide interactions between arbitrary loci. A key step in these experiments is the cleavage of cross-linked chromatin with a restriction endonuclease. Although this cleavage should happen specifically at the enzyme's recognition sequence, an unknown proportion of cleavage events may involve other sequences, owing to the enzyme's star activity or to random DNA breakage. A quantitative estimation of these non-specific cleavages may enable simulating realistic Hi-C read pairs for validation of downstream analyses, monitoring the reproducibility of experimental conditions and investigating biophysical properties that correlate with DNA cleavage patterns. Here we describe a computational method for analyzing Hi-C read pairs to estimate the fractions of cleavages at different possible targets. The method relies on expressing an observed local target distribution downstream of aligned reads as a linear combination of known conditional local target distributions. We validated this method using Hi-C read pairs obtained by computer simulation. Application of the method to experimental Hi-C datasets from murine cells revealed interesting similarities and differences in patterns of cleavage across the various experiments considered. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Validation of DNA-based identification software by computation of pedigree likelihood ratios.
Slooten, K
2011-08-01
Disaster victim identification (DVI) can be aided by DNA-evidence, by comparing the DNA-profiles of unidentified individuals with those of surviving relatives. The DNA-evidence is used optimally when such a comparison is done by calculating the appropriate likelihood ratios. Though conceptually simple, the calculations can be quite involved, especially with large pedigrees, precise mutation models etc. In this article we describe a series of test cases designed to check if software designed to calculate such likelihood ratios computes them correctly. The cases include both simple and more complicated pedigrees, among which inbred ones. We show how to calculate the likelihood ratio numerically and algebraically, including a general mutation model and possibility of allelic dropout. In Appendix A we show how to derive such algebraic expressions mathematically. We have set up these cases to validate new software, called Bonaparte, which performs pedigree likelihood ratio calculations in a DVI context. Bonaparte has been developed by SNN Nijmegen (The Netherlands) for the Netherlands Forensic Institute (NFI). It is available free of charge for non-commercial purposes (see www.dnadvi.nl for details). Commercial licenses can also be obtained. The software uses Bayesian networks and the junction tree algorithm to perform its calculations. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
Kuang, Hua; Ma, Wei; Xu, Liguang; Wang, Libing; Xu, Chuanlai
2013-11-19
Polymerase chain reaction (PCR) is an essential tool in biotechnology laboratories and is becoming increasingly important in other areas of research. Extensive data obtained over the last 12 years has shown that the combination of PCR with nanoscale dispersions can resolve issues in the preparation DNA-based materials that include both inorganic and organic nanoscale components. Unlike conventional DNA hybridization and antibody-antigen complexes, PCR provides a new, effective assembly platform that both increases the yield of DNA-based nanomaterials and allows researchers to program and control assembly with predesigned parameters including those assisted and automated by computers. As a result, this method allows researchers to optimize to the combinatorial selection of the DNA strands for their nanoparticle conjugates. We have developed a PCR approach for producing various nanoscale assemblies including organic motifs such as small molecules, macromolecules, and inorganic building blocks, such as nanorods (NRs), metal, semiconductor, and magnetic nanoparticles (NPs). We start with a nanoscale primer and then modify that building block using the automated steps of PCR-based assembly including initialization, denaturation, annealing, extension, final elongation, and final hold. The intermediate steps of denaturation, annealing, and extension are cyclic, and we use computer control so that the assembled superstructures reach their predetermined complexity. The structures assembled using a small number of PCR cycles show a lower polydispersity than similar discrete structures obtained by direct hybridization between the nanoscale building blocks. Using different building blocks, we assembled the following structural motifs by PCR: (1) discrete nanostructures (NP dimers, NP multimers including trimers, pyramids, tetramers or hexamers, etc.), (2) branched NP superstructures and heterochains, (3) NP satellite-like superstructures, (4) Y-shaped nanostructures and DNA networks, (5) protein-DNA co-assembly structures, and (6) DNA block copolymers including trimers and pentamers. These results affirm that this method can produce a variety of chemical structures and in yields that are tunable. Using PCR-based preparation of DNA-bridged nanostructures, we can program the assembly of the nanoscale blocks through the adjustment of the primer intensity on the assembled units, the number of PCR cycles, or both. The resulting structures are highly complex and diverse and have interesting dynamics and collective properties. Potential applications of these materials include chirooptical materials, probe fabrication, and environmental and biomedical sensors.
DNA as Sensors and Imaging Agents for Metal Ions
Xiang, Yu
2014-01-01
Increasing interests in detecting metal ions in many chemical and biomedical fields have created demands for developing sensors and imaging agents for metal ions with high sensitivity and selectivity. This review covers recent progress in DNA-based sensors and imaging agents for metal ions. Through both combinatorial selection and rational design, a number of metal ion-dependent DNAzymes and metal ion-binding DNA structures that can selectively recognize specific metal ions have been obtained. By attaching these DNA molecules with signal reporters such as fluorophores, chromophores, electrochemical tags, and Raman tags, a number of DNA-based sensors for both diamagnetic and paramagnetic metal ions have been developed for fluorescent, colorimetric, electrochemical, and surface Raman detections. These sensors are highly sensitive (with detection limit down to 11 ppt) and selective (with selectivity up to millions-fold) toward specific metal ions. In addition, through further development to simplify the operation, such as the use of “dipstick tests”, portable fluorometers, computer-readable discs, and widely available glucose meters, these sensors have been applied for on-site and real-time environmental monitoring and point-of-care medical diagnostics. The use of these sensors for in situ cellular imaging has also been reported. The generality of the combinatorial selection to obtain DNAzymes for almost any metal ion in any oxidation state, and the ease of modification of the DNA with different signal reporters make DNA an emerging and promising class of molecules for metal ion sensing and imaging in many fields of applications. PMID:24359450
Alchemical Free Energy Calculations for Nucleotide Mutations in Protein-DNA Complexes.
Gapsys, Vytautas; de Groot, Bert L
2017-12-12
Nucleotide-sequence-dependent interactions between proteins and DNA are responsible for a wide range of gene regulatory functions. Accurate and generalizable methods to evaluate the strength of protein-DNA binding have long been sought. While numerous computational approaches have been developed, most of them require fitting parameters to experimental data to a certain degree, e.g., machine learning algorithms or knowledge-based statistical potentials. Molecular-dynamics-based free energy calculations offer a robust, system-independent, first-principles-based method to calculate free energy differences upon nucleotide mutation. We present an automated procedure to set up alchemical MD-based calculations to evaluate free energy changes occurring as the result of a nucleotide mutation in DNA. We used these methods to perform a large-scale mutation scan comprising 397 nucleotide mutation cases in 16 protein-DNA complexes. The obtained prediction accuracy reaches 5.6 kJ/mol average unsigned deviation from experiment with a correlation coefficient of 0.57 with respect to the experimentally measured free energies. Overall, the first-principles-based approach performed on par with the molecular modeling approaches Rosetta and FoldX. Subsequently, we utilized the MD-based free energy calculations to construct protein-DNA binding profiles for the zinc finger protein Zif268. The calculation results compare remarkably well with the experimentally determined binding profiles. The software automating the structure and topology setup for alchemical calculations is a part of the pmx package; the utilities have also been made available online at http://pmx.mpibpc.mpg.de/dna_webserver.html .
Oakley, Jennifer A; Shaw, Kirsty J; Docker, Peter T; Dyer, Charlotte E; Greenman, John; Greenway, Gillian M; Haswell, Stephen J
2009-06-07
A silica monolith used to support both electro-osmotic pumping (EOP) and the extraction/elution of DNA coupled with gel-supported reagents is described. The benefits of the combined EOP extraction/elution system were illustrated by combining DNA extraction and gene amplification using the polymerase chain reaction (PCR) process. All the reagents necessary for both processes were supported within pre-loaded gels that allow the reagents to be stored at 4 degrees C for up to four weeks in the microfluidic device. When carrying out an analysis the crude sample only needed to be hydrodynamically introduced into the device which was connected to an external computer controlled power supply via platinum wire electrodes. DNA was extracted with 65% efficiency after loading lysed cells onto a silica monolith. Ethanol contained within an agarose gel matrix was then used to wash unwanted debris away from the sample by EOP (100 V cm(-1) for 5 min). The retained DNA was subsequently eluted from the monolith by water contained in a second agarose gel, again by EOP using an electric field of 100 V cm(-1) for 5 min, and transferred into the PCR reagent containing gel. The eluted DNA in solution was successfully amplified by PCR, confirming that the concept of a complete self-contained microfluidic device could be realised for DNA sample clean up and amplification, using a simple pumping and on-chip reagent storage methodology.
DNA barcoding gap: reliable species identification over morphological and geographical scales.
Čandek, Klemen; Kuntner, Matjaž
2015-03-01
The philosophical basis and utility of DNA barcoding have been a subject of numerous debates. While most literature embraces it, some studies continue to question its use in dipterans, butterflies and marine gastropods. Here, we explore the utility of DNA barcoding in identifying spider species that vary in taxonomic affiliation, morphological diagnosibility and geographic distribution. Our first test searched for a 'barcoding gap' by comparing intra- and interspecific means, medians and overlap in more than 75,000 computed Kimura 2-parameter (K2P) genetic distances in three families. Our second test compared K2P distances of congeneric species with high vs. low morphological distinctness in 20 genera of 11 families. Our third test explored the effect of enlarging geographical sampling area at a continental scale on genetic variability in DNA barcodes within 20 species of nine families. Our results generally point towards a high utility of DNA barcodes in identifying spider species. However, the size of the barcoding gap strongly depends on taxonomic groups and practices. It is becoming critical to define the barcoding gap statistically more consistently and to document its variation over taxonomic scales. Our results support models of independent patterns of morphological and molecular evolution by showing that DNA barcodes are effective in species identification regardless of their morphological diagnosibility. We also show that DNA barcodes represent an effective tool for identifying spider species over geographic scales, yet their variation contains useful biogeographic information. © 2014 John Wiley & Sons Ltd.
A discriminatory function for prediction of protein-DNA interactions based on alpha shape modeling.
Zhou, Weiqiang; Yan, Hong
2010-10-15
Protein-DNA interaction has significant importance in many biological processes. However, the underlying principle of the molecular recognition process is still largely unknown. As more high-resolution 3D structures of protein-DNA complex are becoming available, the surface characteristics of the complex become an important research topic. In our work, we apply an alpha shape model to represent the surface structure of the protein-DNA complex and developed an interface-atom curvature-dependent conditional probability discriminatory function for the prediction of protein-DNA interaction. The interface-atom curvature-dependent formalism captures atomic interaction details better than the atomic distance-based method. The proposed method provides good performance in discriminating the native structures from the docking decoy sets, and outperforms the distance-dependent formalism in terms of the z-score. Computer experiment results show that the curvature-dependent formalism with the optimal parameters can achieve a native z-score of -8.17 in discriminating the native structure from the highest surface-complementarity scored decoy set and a native z-score of -7.38 in discriminating the native structure from the lowest RMSD decoy set. The interface-atom curvature-dependent formalism can also be used to predict apo version of DNA-binding proteins. These results suggest that the interface-atom curvature-dependent formalism has a good prediction capability for protein-DNA interactions. The code and data sets are available for download on http://www.hy8.com/bioinformatics.htm kenandzhou@hotmail.com.
Dynamics of genome size evolution in birds and mammals
Feschotte, Cédric
2017-01-01
Genome size in mammals and birds shows remarkably little interspecific variation compared with other taxa. However, genome sequencing has revealed that many mammal and bird lineages have experienced differential rates of transposable element (TE) accumulation, which would be predicted to cause substantial variation in genome size between species. Thus, we hypothesize that there has been covariation between the amount of DNA gained by transposition and lost by deletion during mammal and avian evolution, resulting in genome size equilibrium. To test this model, we develop computational methods to quantify the amount of DNA gained by TE expansion and lost by deletion over the last 100 My in the lineages of 10 species of eutherian mammals and 24 species of birds. The results reveal extensive variation in the amount of DNA gained via lineage-specific transposition, but that DNA loss counteracted this expansion to various extents across lineages. Our analysis of the rate and size spectrum of deletion events implies that DNA removal in both mammals and birds has proceeded mostly through large segmental deletions (>10 kb). These findings support a unified “accordion” model of genome size evolution in eukaryotes whereby DNA loss counteracting TE expansion is a major determinant of genome size. Furthermore, we propose that extensive DNA loss, and not necessarily a dearth of TE activity, has been the primary force maintaining the greater genomic compaction of flying birds and bats relative to their flightless relatives. PMID:28179571
NASA Astrophysics Data System (ADS)
Urban, Matthias; Möller, Robert; Fritzsche, Wolfgang
2003-02-01
DNA analytics is a growing field based on the increasing knowledge about the genome with special implications for the understanding of molecular bases for diseases. Driven by the need for cost-effective and high-throughput methods for molecular detection, DNA chips are an interesting alternative to more traditional analytical methods in this field. The standard readout principle for DNA chips is fluorescence based. Fluorescence is highly sensitive and broadly established, but shows limitations regarding quantification (due to signal and/or dye instability) and the need for sophisticated (and therefore high-cost) equipment. This article introduces a readout system for an alternative detection scheme based on electrical detection of nanoparticle-labeled DNA. If labeled DNA is present in the analyte solution, it will bind on complementary capture DNA immobilized in a microelectrode gap. A subsequent metal enhancement step leads to a deposition of conductive material on the nanoparticles, and finally an electrical contact between the electrodes. This detection scheme offers the potential for a simple (low-cost as well as robust) and highly miniaturizable method, which could be well-suited for point-of-care applications in the context of lab-on-a-chip technologies. The demonstrated apparatus allows a parallel readout of an entire array of microstructured measurement sites. The readout is combined with data-processing by an embedded personal computer, resulting in an autonomous instrument that measures and presents the results. The design and realization of such a system is described, and first measurements are presented.
Pavani, Raphael Souza; da Silva, Marcelo Santos; Fernandes, Carlos Alexandre Henrique; Morini, Flavia Souza; Araujo, Christiane Bezerra; Fontes, Marcos Roberto de Mattos; Sant’Anna, Osvaldo Augusto; Machado, Carlos Renato; Cano, Maria Isabel; Fragoso, Stenio Perdigão; Elias, Maria Carolina
2016-01-01
Replication Protein A (RPA), the major single stranded DNA binding protein in eukaryotes, is composed of three subunits and is a fundamental player in DNA metabolism, participating in replication, transcription, repair, and the DNA damage response. In human pathogenic trypanosomatids, only limited studies have been performed on RPA-1 from Leishmania. Here, we performed in silico, in vitro and in vivo analysis of Trypanosoma cruzi RPA-1 and RPA-2 subunits. Although computational analysis suggests similarities in DNA binding and Ob-fold structures of RPA from T. cruzi compared with mammalian and fungi RPA, the predicted tridimensional structures of T. cruzi RPA-1 and RPA-2 indicated that these molecules present a more flexible tertiary structure, suggesting that T. cruzi RPA could be involved in additional responses. Here, we demonstrate experimentally that the T. cruzi RPA complex interacts with DNA via RPA-1 and is directly related to canonical functions, such as DNA replication and DNA damage response. Accordingly, a reduction of TcRPA-2 expression by generating heterozygous knockout cells impaired cell growth, slowing down S-phase progression. Moreover, heterozygous knockout cells presented a better efficiency in differentiation from epimastigote to metacyclic trypomastigote forms and metacyclic trypomastigote infection. Taken together, these findings indicate the involvement of TcRPA in the metacyclogenesis process and suggest that a delay in cell cycle progression could be linked with differentiation in T. cruzi. PMID:27984589
Li, Xingang; Lu, Hongming; Fan, Guilian; He, Miao; Sun, Yu; Xu, Kai; Shi, Fengjun
2017-11-01
Osteosarcoma (OS) is one of the most prevalent primary malignant bone tumors in adolescent. HOTAIR is highly expressed and associated with the epigenetic modifications, especially DNA methylation, in cancer. However, the regulation mechanism between HOTAIR and DNA methylation and the biological effects of them in the pathogenesis of osteosarcoma remains elusive. Through RNA-sequencing and computational analysis, followed by a variety of experimental validations, we report a novel interplay between HOTAIR, miR-126, and DNA methylation in OS. We found that HOTAIR is highly expressed in OS cells and the knockdown of HOTAIR leads to the down-regulation of DNMT1, as well as the decrease of global DNA methylation level. RNA-sequencing analysis of HOTAIR-regulated gene shows that CDKN2A is significantly repressed by HOTAIR. A series of experiments show that HOTAIR represses the expression of CDKN2A through inhibiting the promoter activity of CDKN2A by DNA hypermethylation. Further evidence shows that HOTAIR activates the expression of DNMT1 through repressing miR-126, which is the negative regulator of DNMT1. Functionally, HOTAIR depletion increases the sensibility of OS cells to DNMT1 inhibitor through regulating the viability and apoptosis of OS cells via HOTAIR-miR126-DNMT1-CDKN2A axis. These results not only enrich our understanding of the regulation relationship between non-coding RNA, DNA methylation, and gene expression, however, also provide a novel direction in developing more sophisticated therapeutic strategies for OS patients.
Identification of DNA-Binding Proteins Using Structural, Electrostatic and Evolutionary Features
Nimrod, Guy; Szilágyi, András; Leslie, Christina; Ben-Tal, Nir
2009-01-01
Summary DNA binding proteins (DBPs) often take part in various crucial processes of the cell's life cycle. Therefore, the identification and characterization of these proteins are of great importance. We present here a random forests classifier for identifying DBPs among proteins with known three-dimensional structures. First, clusters of evolutionarily conserved regions (patches) on the protein's surface are detected using the PatchFinder algorithm; previous studies showed that these regions are typically the proteins' functionally important regions. Next, we train a classifier using features like the electrostatic potential, cluster-based amino acid conservation patterns and the secondary structure content of the patches, as well as features of the whole protein including its dipole moment. Using 10-fold cross validation on a dataset of 138 DNA-binding proteins and 110 proteins which do not bind DNA, the classifier achieved a sensitivity and a specificity of 0.90, which is overall better than the performance of previously published methods. Furthermore, when we tested 5 different methods on 11 new DBPs which did not appear in the original dataset, only our method annotated all correctly. The resulting classifier was applied to a collection of 757 proteins of known structure and unknown function. Of these proteins, 218 were predicted to bind DNA, and we anticipate that some of them interact with DNA using new structural motifs. The use of complementary computational tools supports the notion that at least some of them do bind DNA. PMID:19233205