artificial dna sequences: Topics by Science.gov

Sample records for artificial dna sequences

Satellite DNA-based artificial chromosomes for use in gene therapy.

PubMed

Hadlaczky, G

2001-04-01

Satellite DNA-based artificial chromosomes (SATACs) can be made by induced de novo chromosome formation in cells of different mammalian species. These artificially generated accessory chromosomes are composed of predictable DNA sequences and they contain defined genetic information. Prototype human SATACs have been successfully constructed in different cell types from 'neutral' endogenous DNA sequences from the short arm of the human chromosome 15. SATACs have already passed a number of hurdles crucial to their further development as gene therapy vectors, including: large-scale purification; transfer of purified artificial chromosomes into different cells and embryos; generation of transgenic animals and germline transmission with purified SATACs; and the tissue-specific expression of a therapeutic gene from an artificial chromosome in the milk of transgenic animals.
Recognising promoter sequences using an artificial immune system

DOE Office of Scientific and Technical Information (OSTI.GOV)

Cooke, D.E.; Hunt, J.E.

1995-12-31

We have developed an artificial immune system (AIS) which is based on the human immune system. The AIS possesses an adaptive learning mechanism which enables antibodies to emerge which can be used for classification tasks. In this paper, we describe how the AIS has been used to evolve antibodies which can classify promoter-containing and promoter-negative DNA sequences. The DNA sequences used for teaching were 57 nucleotides in length and contained procaryotic promoters. The system classified previously unseen DNA sequences with an accuracy of approximately 90%.
High-speed all-optical DNA local sequence alignment based on a three-dimensional artificial neural network.

PubMed

Maleki, Ehsan; Babashah, Hossein; Koohi, Somayyeh; Kavehvash, Zahra

2017-07-01

This paper presents an optical processing approach for exploring a large number of genome sequences. Specifically, we propose an optical correlator for global alignment and an extended moiré matching technique for local analysis of spatially coded DNA, whose output is fed to a novel three-dimensional artificial neural network for local DNA alignment. All-optical implementation of the proposed 3D artificial neural network is developed and its accuracy is verified in Zemax. Thanks to its parallel processing capability, the proposed structure performs local alignment of 4 million sequences of 150 base pairs in a few seconds, which is much faster than its electrical counterparts, such as the basic local alignment search tool.
Fabrication and characterization of a solid state nanopore with self-aligned carbon nanoelectrodes for molecular detection

NASA Astrophysics Data System (ADS)

Spinney, Patrick; Collins, Scott D.; Howitt, David G.; Smith, Rosemary L.

2012-06-01

Rapid and cost-effective DNA sequencing is a pivotal prerequisite for the genomics era. Many of the recent advances in forensics, medicine, agriculture, taxonomy, and drug discovery have paralleled critical advances in DNA sequencing technology. Nanopore modalities for DNA sequencing have recently surfaced, including the electrical interrogation of protein ion channels and/or solid-state nanopores during translocation of DNA. However, to date, most of this work has met with mixed success. In this work, we present a unique nanofabrication strategy that realizes an artificial nanopore articulated with carbon electrodes to sense the current modulations during the transport of DNA through the nanopore. This embodiment overcomes most of the technical difficulties inherent in other artificial nanopore embodiments and presents a versatile platform for the testing of DNA single nucleotide detection. Characterization of the device using gold nanoparticles, silica nanoparticles, lambda dsDNA and 16-mer ssDNA is presented. Although single molecule DNA sequencing is still not demonstrated, the device shows a path towards this goal.
Multi-modulus algorithm based on global artificial fish swarm intelligent optimization of DNA encoding sequences.

PubMed

Guo, Y C; Wang, H; Wu, H P; Zhang, M Q

2015-12-21

To address the large mean square error (MSE) and slow convergence of the constant modulus algorithm (CMA) in equalizing multi-modulus signals, a multi-modulus algorithm (MMA) based on global artificial fish swarm (GAFS) intelligent optimization of DNA encoding sequences (GAFS-DNA-MMA) was proposed. To improve the convergence rate and reduce the MSE, the proposed algorithm adopted an encoding method based on DNA nucleotide chains to provide a possible solution to the problem. Furthermore, the GAFS algorithm, with its fast convergence and global search ability, was used to find the best sequence. The real and imaginary parts of the initial optimal weight vector of MMA were obtained through DNA coding of the best sequence. The simulation results show that the proposed algorithm has a faster convergence speed and smaller MSE in comparison with the CMA, the MMA, and the AFS-DNA-MMA.
Microbial identification by immunohybridization assay of artificial RNA labels

NASA Technical Reports Server (NTRS)

Kourentzi, Katerina D.; Fox, George E.; Willson, Richard C.

2002-01-01

Ribosomal RNA (rRNA) and engineered stable artificial RNAs (aRNAs) are frequently used to monitor bacteria in complex ecosystems. In this work, we describe a solid-phase immunocapture hybridization assay that can be used with low molecular weight RNA targets. A biotinylated DNA probe is efficiently hybridized in solution with the target RNA, and the DNA-RNA hybrids are captured on streptavidin-coated plates and quantified using a DNA-RNA heteroduplex-specific antibody conjugated to alkaline phosphatase. The assay was shown to be specific for both 5S rRNA and low molecular weight (LMW) artificial RNAs and highly sensitive, allowing detection of as little as 5.2 ng (0.15 pmol) in the case of 5S rRNA. Target RNAs were readily detected even in the presence of excess nontarget RNA. Detection using DNA probes as small as 17 bases targeting a repetitive artificial RNA sequence in an engineered RNA was more efficient than the detection of a unique sequence.
DNA capture elements for rapid detection and identification of biological agents

NASA Astrophysics Data System (ADS)

Kiel, Johnathan L.; Parker, Jill E.; Holwitt, Eric A.; Vivekananda, Jeeva

2004-08-01

DNA capture elements (DCEs; aptamers) are artificial DNA sequences, from a random pool of sequences, selected for their specific binding to potential biological warfare agents. These sequences were selected by an affinity method using filters to which the target agent was attached and the DNA isolated and amplified by polymerase chain reaction (PCR) in an iterative, increasingly stringent, process. Reporter molecules were attached to the finished sequences. To date, we have made DCEs to Bacillus anthracis spores, Shiga toxin, Venezuelan Equine Encephalitis (VEE) virus, and Francisella tularensis. These DCEs have demonstrated specificity and sensitivity equal to or better than antibody.
Storing data encoded DNA in living organisms

DOEpatents

Wong, Pak C.; Wong, Kwong K.; Foote, Harlan P. [Richland, WA

2006-06-06

Current technologies allow the generation of artificial DNA molecules and/or the ability to alter the DNA sequences of existing DNA molecules. With a careful coding scheme and arrangement, it is possible to encode important information as an artificial DNA strand and store it in a living host safely and permanently. This inventive technology can be used to identify origins and protect R&D investments. It can also be used in environmental research to track generations of organisms and observe the ecological impact of pollutants. Today, there are microorganisms that can survive under extreme conditions. It is also advantageous to consider multicellular organisms as hosts for stored information. These living organisms can serve as memory housing and protection for stored data or information. The present invention provides for data storage in a living organism wherein at least one DNA sequence is encoded to represent data and incorporated into a living organism.
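The idea of encoding information as a DNA strand can be illustrated with a minimal sketch. It assumes a simple two-bits-per-base mapping (A=00, C=01, G=10, T=11); this mapping and the function names `encode` and `decode` are illustrative assumptions and are not the coding scheme of the patent, which is not given in the record above.

```python
# Hypothetical 2-bit-per-base mapping; NOT the patent's actual coding scheme.
BASES = "ACGT"  # index 0..3 corresponds to bit pairs 00, 01, 10, 11


def encode(data: bytes) -> str:
    """Encode bytes as a DNA string, four bases per byte, high bits first."""
    return "".join(
        BASES[(byte >> shift) & 0b11]
        for byte in data
        for shift in (6, 4, 2, 0)
    )


def decode(strand: str) -> bytes:
    """Invert encode(); the strand length must be a multiple of four."""
    if len(strand) % 4:
        raise ValueError("strand length must be a multiple of 4")
    out = bytearray()
    for i in range(0, len(strand), 4):
        byte = 0
        for base in strand[i:i + 4]:
            byte = (byte << 2) | BASES.index(base)
        out.append(byte)
    return bytes(out)


strand = encode(b"data")
assert decode(strand) == b"data"
```

A practical scheme would also have to avoid sequences that are harmful or unstable in the host, which this sketch does not attempt.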
Programmable DNA-Guided Artificial Restriction Enzymes.

PubMed

Enghiad, Behnam; Zhao, Huimin

2017-05-19

Restriction enzymes are essential tools for recombinant DNA technology that have revolutionized modern biological research. However, they have limited sequence specificity and availability. Here we report a Pyrococcus furiosus Argonaute (PfAgo) based platform for generating artificial restriction enzymes (AREs) capable of recognizing and cleaving DNA sequences at virtually any arbitrary site and generating defined sticky ends of varying length. Short DNA guides are used to direct PfAgo to target sites for cleavage at high temperatures (>87 °C) followed by reannealing of the cleaved single stranded DNAs. We used this platform to generate over 18 AREs for DNA fingerprinting and molecular cloning of PCR-amplified or genomic DNAs. These AREs work as efficiently as their naturally occurring counterparts, and some of them have no naturally occurring counterparts at all, demonstrating easy programmability, generality, versatility, and high efficiency for this new technology.
Long-range correlations and charge transport properties of DNA sequences

NASA Astrophysics Data System (ADS)

Liu, Xiao-liang; Ren, Yi; Xie, Qiong-tao; Deng, Chao-sheng; Xu, Hui

2010-04-01

By using Hurst's analysis and transfer approach, the rescaled range functions and Hurst exponents of human chromosome 22 and enterobacteria phage lambda DNA sequences are investigated and the transmission coefficients, Landauer resistances and Lyapunov coefficients of finite segments based on the above genomic DNA sequences are calculated. In a comparison with quasiperiodic and random artificial DNA sequences, we find that λ-DNA exhibits anticorrelation behavior characterized by a Hurst exponent 0.5
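A rescaled range (R/S) estimate of a Hurst exponent can be sketched as follows. The purine/pyrimidine mapping to +1/-1, the window sizes, and the function names are assumptions made for illustration; the record does not state how the authors mapped bases to numbers or which windows they used.

```python
import math
import random


def to_steps(seq):
    """Map purines (A, G) to +1 and pyrimidines (C, T) to -1 (assumed mapping)."""
    return [1.0 if base in "AG" else -1.0 for base in seq.upper()]


def rescaled_range(x):
    """R/S of one window: range of cumulative deviations over the standard deviation."""
    n = len(x)
    mean = sum(x) / n
    cum = lo = hi = ss = 0.0
    for v in x:
        d = v - mean
        cum += d
        lo, hi = min(lo, cum), max(hi, cum)
        ss += d * d
    s = math.sqrt(ss / n)
    return (hi - lo) / s if s > 0 else None  # None for a constant window


def hurst_exponent(x, window_sizes=(16, 32, 64, 128, 256)):
    """Least-squares slope of log(mean R/S) against log(window size)."""
    pts = []
    for n in window_sizes:
        vals = [rescaled_range(x[i:i + n]) for i in range(0, len(x) - n + 1, n)]
        vals = [v for v in vals if v is not None]
        if vals:
            pts.append((math.log(n), math.log(sum(vals) / len(vals))))
    if len(pts) < 2:
        raise ValueError("sequence too short for the chosen window sizes")
    mx = sum(p[0] for p in pts) / len(pts)
    my = sum(p[1] for p in pts) / len(pts)
    sxx = sum((p[0] - mx) ** 2 for p in pts)
    return sum((p[0] - mx) * (p[1] - my) for p in pts) / sxx


random.seed(1)
random_dna = "".join(random.choice("ACGT") for _ in range(4096))
print(hurst_exponent(to_steps(random_dna)))
```

For short windows the R/S estimate of an uncorrelated sequence tends to sit somewhat above 0.5, so values should be compared against a random control of the same length rather than against 0.5 alone.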
DNASynth: a software application to optimization of artificial gene synthesis

NASA Astrophysics Data System (ADS)

Muczyński, Jan; Nowak, Robert M.

2017-08-01

DNASynth is a client-server software application in which the client runs in a web browser. The aim of this program is to support and optimize the process of artificial gene synthesis using the Ligase Chain Reaction (LCR). With LCR it is possible to obtain a DNA strand coding for a user-defined peptide. The DNA sequence is calculated by an optimization algorithm that considers optimal codon usage, minimal energy of secondary structures and a minimal number of required LCRs. Additionally, the absence of sequences characteristic of a user-defined set of restriction enzymes is guaranteed. The presented software was tested on synthetic and real data.
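The restriction-site constraint can be illustrated with a small greedy back-translation sketch. This is a stand-in, not DNASynth's optimization algorithm: the codon preference order in `CODONS` is a hypothetical placeholder rather than measured codon usage, secondary-structure energy and LCR count are ignored, and all names are illustrative. EcoRI (GAATTC) and BamHI (GGATCC) are used as the example forbidden sites.

```python
# Illustrative codon preferences (hypothetical order, NOT measured codon usage).
CODONS = {
    "M": ["ATG"],
    "E": ["GAA", "GAG"],
    "F": ["TTC", "TTT"],
    "G": ["GGC", "GGT"],
    "S": ["AGC", "TCT"],
    "K": ["AAA", "AAG"],
}
FORBIDDEN = ["GAATTC", "GGATCC"]  # EcoRI and BamHI recognition sites

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def has_site(seq, sites):
    """True if any site occurs on either strand of seq."""
    return any(site in seq or revcomp(site) in seq for site in sites)


def back_translate(peptide, sites=FORBIDDEN):
    """Greedily pick the first preferred codon that creates no forbidden site."""
    dna = ""
    for aa in peptide:
        for codon in CODONS[aa]:
            if not has_site(dna + codon, sites):
                dna += codon
                break
        else:
            raise ValueError(f"no admissible codon for {aa!r} after {dna!r}")
    return dna


# GAA + TTC would create an EcoRI site, so the second Phe codon is chosen.
print(back_translate("MEF"))
```

A greedy choice can fail where backtracking over earlier codons would succeed, which is one reason a real tool would treat this as a global optimization problem.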
mtDNA-Server: next-generation sequencing data analysis of human mitochondrial DNA in the cloud.

PubMed

Weissensteiner, Hansi; Forer, Lukas; Fuchsberger, Christian; Schöpf, Bernd; Kloss-Brandstätter, Anita; Specht, Günther; Kronenberg, Florian; Schönherr, Sebastian

2016-07-08

Next generation sequencing (NGS) allows investigating mitochondrial DNA (mtDNA) characteristics such as heteroplasmy (i.e. intra-individual sequence variation) to a higher level of detail. While several pipelines for analyzing heteroplasmies exist, issues in usability, accuracy of results and interpreting final data limit their usage. Here we present mtDNA-Server, a scalable web server for the analysis of mtDNA studies of any size with a special focus on usability as well as reliable identification and quantification of heteroplasmic variants. The mtDNA-Server workflow includes parallel read alignment, heteroplasmy detection, artefact or contamination identification, variant annotation as well as several quality control metrics, often neglected in current mtDNA NGS studies. All computational steps are parallelized with Hadoop MapReduce and executed graphically with Cloudgene. We validated the underlying heteroplasmy and contamination detection model by generating four artificial sample mix-ups on two different NGS devices. Our evaluation data shows that mtDNA-Server detects heteroplasmies and artificial recombinations down to the 1% level with perfect specificity and outperforms existing approaches regarding sensitivity. mtDNA-Server is currently able to analyze the 1000G Phase 3 data (n = 2,504) in less than 5 h and is freely accessible at https://mtdna-server.uibk.ac.at. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Sequence and Analysis of the Tomato JOINTLESS Locus

PubMed Central

Mao, Long; Begum, Dilara; Goff, Stephen A.; Wing, Rod A.

2001-01-01

A 119-kb bacterial artificial chromosome from the JOINTLESS locus on the tomato (Lycopersicon esculentum) chromosome 11 contained 15 putative genes. Repetitive sequences in this region include one copia-like LTR retrotransposon, 13 simple sequence repeats, three copies of a novel type III foldback transposon, and four putative short DNA repeats. Database searches showed that the foldback transposon and the short DNA repeats seemed to be associated preferably with genes. The predicted tomato genes were compared with the complete Arabidopsis genome. Eleven out of 15 tomato open reading frames were found to be colinear with segments on five Arabidopsis bacterial artificial chromosome/P1-derived artificial chromosome clones. The synteny patterns, however, did not reveal duplicated segments in Arabidopsis, where over half of the genome is duplicated. Our analysis indicated that the microsynteny between the tomato and Arabidopsis genomes was still conserved at a very small scale but was complicated by the large number of gene families in the Arabidopsis genome. PMID:11457984
Specific DNA duplex formation at an artificial lipid bilayer: towards a new DNA biosensor technology.

PubMed

Werz, Emma; Korneev, Sergei; Montilla-Martinez, Malayko; Wagner, Richard; Hemmler, Roland; Walter, Claudius; Eisfeld, Jörg; Gall, Karsten; Rosemeyer, Helmut

2012-02-01

A novel technique is described which comprises a base-specific DNA duplex formation at a lipid bilayer-H2O-phase boundary layer. Two different probes of oligonucleotides both carrying a double-tailed lipid at the 5'-terminus were incorporated into stable artificial lipid bilayers separating two compartments (cis/trans-channel) of an optically transparent microfluidic sample carrier with perfusion capabilities. Both the cis- and trans-channels are filled with saline buffer. Injection of a cyanine-5-labeled target DNA sequence, which is complementary to only one of the oligonucleotide probes, into the cis-channel, followed by a thorough perfusion, leads to an immobilization of the labeled complementary oligonucleotide on the membrane as detected by single-molecule fluorescence spectroscopy and microscopy. In the case of fluorescent but non-complementary DNA sequences, no immobilized fluorescent oligonucleotide duplex could be detected on the membrane. This clearly verifies a specific duplex formation at the membrane interface. Copyright © 2012 Verlag Helvetica Chimica Acta AG, Zürich.
ANN modeling of DNA sequences: new strategies using DNA shape code.

PubMed

Parbhane, R V; Tambe, S S; Kulkarni, B D

2000-09-01

Two new encoding strategies, namely, wedge and twist codes, which are based on the DNA helical parameters, are introduced to represent DNA sequences in artificial neural network (ANN)-based modeling of biological systems. The performance of the new coding strategies has been evaluated by conducting three case studies involving mapping (modeling) and classification applications of ANNs. The proposed coding schemes have been compared rigorously and shown to outperform the existing coding strategies especially in situations wherein limited data are available for building the ANN models.
Human Artificial Chromosomes with Alpha Satellite-Based De Novo Centromeres Show Increased Frequency of Nondisjunction and Anaphase Lag

PubMed Central

Rudd, M. Katharine; Mays, Robert W.; Schwartz, Stuart; Willard, Huntington F.

2003-01-01

Human artificial chromosomes have been used to model requirements for human chromosome segregation and to explore the nature of sequences competent for centromere function. Normal human centromeres require specialized chromatin that consists of alpha satellite DNA complexed with epigenetically modified histones and centromere-specific proteins. While several types of alpha satellite DNA have been used to assemble de novo centromeres in artificial chromosome assays, the extent to which they fully recapitulate normal centromere function has not been explored. Here, we have used two kinds of alpha satellite DNA, DXZ1 (from the X chromosome) and D17Z1 (from chromosome 17), to generate human artificial chromosomes. Although artificial chromosomes are mitotically stable over many months in culture, when we examined their segregation in individual cell divisions using an anaphase assay, artificial chromosomes exhibited more segregation errors than natural human chromosomes (P < 0.001). Naturally occurring, but abnormal small ring chromosomes derived from chromosome 17 and the X chromosome also missegregate more than normal chromosomes, implicating overall chromosome size and/or structure in the fidelity of chromosome segregation. As different artificial chromosomes missegregate over a fivefold range, the data suggest that variable centromeric DNA content and/or epigenetic assembly can influence the mitotic behavior of artificial chromosomes. PMID:14560014
Human artificial chromosomes with alpha satellite-based de novo centromeres show increased frequency of nondisjunction and anaphase lag.

PubMed

Rudd, M Katharine; Mays, Robert W; Schwartz, Stuart; Willard, Huntington F

2003-11-01

Human artificial chromosomes have been used to model requirements for human chromosome segregation and to explore the nature of sequences competent for centromere function. Normal human centromeres require specialized chromatin that consists of alpha satellite DNA complexed with epigenetically modified histones and centromere-specific proteins. While several types of alpha satellite DNA have been used to assemble de novo centromeres in artificial chromosome assays, the extent to which they fully recapitulate normal centromere function has not been explored. Here, we have used two kinds of alpha satellite DNA, DXZ1 (from the X chromosome) and D17Z1 (from chromosome 17), to generate human artificial chromosomes. Although artificial chromosomes are mitotically stable over many months in culture, when we examined their segregation in individual cell divisions using an anaphase assay, artificial chromosomes exhibited more segregation errors than natural human chromosomes (P < 0.001). Naturally occurring, but abnormal small ring chromosomes derived from chromosome 17 and the X chromosome also missegregate more than normal chromosomes, implicating overall chromosome size and/or structure in the fidelity of chromosome segregation. As different artificial chromosomes missegregate over a fivefold range, the data suggest that variable centromeric DNA content and/or epigenetic assembly can influence the mitotic behavior of artificial chromosomes.
Biocompatible artificial DNA linker that is read through by DNA polymerases and is functional in Escherichia coli

PubMed Central

El-Sagheer, Afaf H.; Sanzone, A. Pia; Gao, Rachel; Tavassoli, Ali; Brown, Tom

2011-01-01

A triazole mimic of a DNA phosphodiester linkage has been produced by templated chemical ligation of oligonucleotides functionalized with 5′-azide and 3′-alkyne. The individual azide and alkyne oligonucleotides were synthesized by standard phosphoramidite methods and assembled using a straightforward ligation procedure. This highly efficient chemical equivalent of enzymatic DNA ligation has been used to assemble a 300-mer from three 100-mer oligonucleotides, demonstrating the total chemical synthesis of very long oligonucleotides. The base sequences of the DNA strands containing this artificial linkage were copied during PCR with high fidelity and a gene containing the triazole linker was functional in Escherichia coli. PMID:21709264
Palindromic Sequence Artifacts Generated during Next Generation Sequencing Library Preparation from Historic and Ancient DNA

PubMed Central

Star, Bastiaan; Nederbragt, Alexander J.; Hansen, Marianne H. S.; Skage, Morten; Gilfillan, Gregor D.; Bradbury, Ian R.; Pampoulie, Christophe; Stenseth, Nils Chr; Jakobsen, Kjetill S.; Jentoft, Sissel

2014-01-01

Degradation-specific processes and variation in laboratory protocols can bias the DNA sequence composition from samples of ancient or historic origin. Here, we identify a novel artifact in sequences from historic samples of Atlantic cod (Gadus morhua), which forms interrupted palindromes consisting of reverse complementary sequence at the 5′ and 3′-ends of sequencing reads. The palindromic sequences themselves have specific properties – the bases at the 5′-end align well to the reference genome, whereas extensive misalignments exist among the bases at the terminal 3′-end. The terminal 3′ bases are artificial extensions likely caused by the occurrence of hairpin loops in single stranded DNA (ssDNA), which can be ligated and amplified in particular library creation protocols. We propose that such hairpin loops allow the inclusion of erroneous nucleotides, specifically at the 3′-end of DNA strands, with the 5′-end of the same strand providing the template. We also find these palindromes in previously published ancient DNA (aDNA) datasets, albeit at varying and substantially lower frequencies. This artifact can negatively affect the yield of endogenous DNA in these types of samples and introduces sequence bias. PMID:24608104
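A read carrying such an interrupted palindrome can be flagged by checking whether its 3′ end is the reverse complement of its 5′ end. The sketch below assumes exact matching and an arbitrary minimum length of 4 bases; the function name and threshold are illustrative and are not taken from the paper, whose actual detection procedure may tolerate mismatches.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement; characters other than A, C, G, T are left unchanged."""
    return seq.translate(_COMPLEMENT)[::-1]


def terminal_palindrome_length(read, min_len=4):
    """Longest k such that the last k bases are the reverse complement
    of the first k bases; returns 0 if no such k >= min_len exists."""
    read = read.upper()
    for k in range(len(read) // 2, min_len - 1, -1):
        if read[-k:] == revcomp(read[:k]):
            return k
    return 0


# 5' arm AACCGGTA, a spacer, then the reverse complement TACCGGTT at the 3' end.
read = "AACCGGTA" + "CATCAT" + "TACCGGTT"
print(terminal_palindrome_length(read))
```

Reads with a nonzero result could then be trimmed at the 3′ end or excluded before alignment, depending on the analysis.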
A discrete artificial bee colony algorithm for detecting transcription factor binding sites in DNA sequences.

PubMed

Karaboga, D; Aslan, S

2016-04-27

The great majority of biological sequences share significant similarity with other sequences as a result of evolutionary processes, and identifying these sequence similarities is one of the most challenging problems in bioinformatics. In this paper, we present a discrete artificial bee colony (ABC) algorithm, which is inspired by the intelligent foraging behavior of real honey bees, for the detection of highly conserved residue patterns or motifs within sequences. Experimental studies on three different data sets showed that the proposed discrete model, by adhering to the fundamental scheme of the ABC algorithm, produced competitive or better results than other metaheuristic motif discovery techniques.

Comparative analyses of genetic/epigenetic diversities and structures in a wild barley species (Hordeum brevisubulatum) using MSAP, SSAP and AFLP.

PubMed

Shan, X H; Li, Y D; Liu, X M; Wu, Y; Zhang, M Z; Guo, W L; Liu, B; Yuan, Y P

2012-08-17

We analyzed genetic diversity and population genetic structure of four artificial populations of wild barley (Hordeum brevisubulatum); 96 plants collected from the Songnen Prairie in northeastern China were analyzed using amplified fragment length polymorphism (AFLP), specific-sequence amplified polymorphism (SSAP) and methylation-sensitive amplified polymorphism (MSAP) markers. Indices of (epi-)genetic diversity, (epi-)genetic distance, gene flow, genotype frequency, cluster analysis, PCA analysis and AMOVA analysis generated from MSAP, AFLP and SSAP markers had the same trend. We found a high level of correlation in the artificial populations between MSAP, SSAP and AFLP markers by the Mantel test (r > 0.8). This is incongruent with previous findings showing that there is virtually no correlation between DNA methylation polymorphism and classical genetic variation; the high level of genetic polymorphism could be a result of epigenetic regulation. We compared our results with data from natural populations. The population diversity of the artificial populations was lower. However, different from what was found using AFLP and SSAP, based on MSAP results the methylation polymorphism of the artificial populations was not significantly reduced. This leads us to suggest that the DNA methylation pattern change in H. brevisubulatum populations is not only related to DNA sequence variation, but is also regulated by other controlling systems.
Statistical properties of DNA sequences

NASA Technical Reports Server (NTRS)

Peng, C. K.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Mantegna, R. N.; Simons, M.; Stanley, H. E.

1995-01-01

We review evidence supporting the idea that the DNA sequence in genes containing non-coding regions is correlated, and that the correlation is remarkably long range--indeed, nucleotides thousands of base pairs distant are correlated. We do not find such a long-range correlation in the coding regions of the gene. We resolve the problem of the "non-stationarity" feature of the sequence of base pairs by applying a new algorithm called detrended fluctuation analysis (DFA). We address the claim of Voss that there is no difference in the statistical properties of coding and non-coding regions of DNA by systematically applying the DFA algorithm, as well as standard FFT analysis, to every DNA sequence (33301 coding and 29453 non-coding) in the entire GenBank database. Finally, we describe briefly some recent work showing that the non-coding sequences have certain statistical features in common with natural and artificial languages. Specifically, we adapt to DNA the Zipf approach to analyzing linguistic texts. These statistical properties of non-coding sequences support the possibility that non-coding regions of DNA may carry biological information.
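Detrended fluctuation analysis can be sketched in a few steps: integrate the step sequence into a profile, split the profile into boxes of size n, remove a least-squares line from each box, and fit the slope of log F(n) against log n. The version below is a minimal first-order DFA in plain Python; the box sizes, the use of a random +1/-1 test series, and the function names are assumptions for illustration, not the authors' implementation.

```python
import math
import random


def fit_slope(points):
    """Least-squares slope through a list of (x, y) points."""
    n = len(points)
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n
    sxx = sum((x - mx) ** 2 for x, _ in points)
    return sum((x - mx) * (y - my) for x, y in points) / sxx


def dfa_exponent(steps, box_sizes=(8, 16, 32, 64, 128)):
    """Scaling exponent alpha of the detrended fluctuation F(n) ~ n**alpha."""
    mean = sum(steps) / len(steps)
    profile, total = [], 0.0
    for s in steps:  # integrate the mean-subtracted steps
        total += s - mean
        profile.append(total)
    points = []
    for n in box_sizes:
        t_mean = (n - 1) / 2
        stt = sum((t - t_mean) ** 2 for t in range(n))
        sq_sum, count = 0.0, 0
        for start in range(0, len(profile) - n + 1, n):
            box = profile[start:start + n]
            y_mean = sum(box) / n
            slope = sum((t - t_mean) * (y - y_mean) for t, y in enumerate(box)) / stt
            # squared residuals around the local linear trend
            sq_sum += sum((y - y_mean - slope * (t - t_mean)) ** 2
                          for t, y in enumerate(box))
            count += n
        if count and sq_sum > 0:
            points.append((math.log(n), math.log(math.sqrt(sq_sum / count))))
    if len(points) < 2:
        raise ValueError("sequence too short for the chosen box sizes")
    return fit_slope(points)


random.seed(0)
steps = [random.choice((-1.0, 1.0)) for _ in range(4096)]
print(dfa_exponent(steps))
```

An uncorrelated series should give an exponent near 0.5, while long-range correlations of the kind described above would show up as a value clearly above 0.5.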
Artificial Intelligence, DNA Mimicry, and Human Health.

PubMed

Stefano, George B; Kream, Richard M

2017-08-14

The molecular evolution of genomic DNA across diverse plant and animal phyla involved dynamic registrations of sequence modifications to maintain existential homeostasis to increasingly complex patterns of environmental stressors. As an essential corollary, driver effects of positive evolutionary pressure are hypothesized to effect concerted modifications of genomic DNA sequences to meet expanded platforms of regulatory controls for successful implementation of advanced physiological requirements. It is also clearly apparent that preservation of updated registries of advantageous modifications of genomic DNA sequences requires coordinate expansion of convergent cellular proofreading/error correction mechanisms that are encoded by reciprocally modified genomic DNA. Computational expansion of operationally defined DNA memory extends to coordinate modification of coding and previously under-emphasized noncoding regions that now appear to represent essential reservoirs of untapped genetic information amenable to evolutionary driven recruitment into the realm of biologically active domains. Additionally, expansion of DNA memory potential via chemical modification and activation of noncoding sequences is targeted to vertical augmentation and integration of an expanded cadre of transcriptional and epigenetic regulatory factors affecting linear coding of protein amino acid sequences within open reading frames.
Diatom centromeres suggest a mechanism for nuclear DNA acquisition

DOE PAGES

Diner, Rachel E.; Noddings, Chari M.; Lian, Nathan C.; ...

2017-07-18

Centromeres are essential for cell division and growth in all eukaryotes, and knowledge of their sequence and structure guides the development of artificial chromosomes for functional cellular biology studies. Centromeric proteins are conserved among eukaryotes; however, centromeric DNA sequences are highly variable. We combined forward and reverse genetic approaches with chromatin immunoprecipitation to identify centromeres of the model diatom Phaeodactylum tricornutum. We observed 25 unique centromere sequences typically occurring once per chromosome, a finding that helps to resolve nuclear genome organization and indicates monocentric regional centromeres. Diatom centromere sequences contain low-GC content regions but lack repeats or other conserved sequence features. Native and foreign sequences with similar GC content to P. tricornutum centromeres can maintain episomes and recruit the diatom centromeric histone protein CENH3, suggesting nonnative sequences can also function as diatom centromeres. Thus, simple sequence requirements may enable DNA from foreign sources to persist in the nucleus as extrachromosomal episomes, revealing a potential mechanism for organellar and foreign DNA acquisition.
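Locating low-GC regions in a sequence can be sketched with a sliding window. The window length, step, and threshold below are arbitrary illustrative values, not parameters reported in the study, and the function names are hypothetical.

```python
def gc_content(seq):
    """Fraction of G and C bases in seq (0.0 for an empty string)."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0


def low_gc_windows(seq, window=100, step=50, threshold=0.35):
    """Return (start, gc_fraction) for each window whose GC fraction is below threshold."""
    hits = []
    for start in range(0, len(seq) - window + 1, step):
        gc = gc_content(seq[start:start + window])
        if gc < threshold:
            hits.append((start, gc))
    return hits


# Toy sequence: a GC-rich flank, an AT-only core, and another GC-rich flank.
toy = "GC" * 50 + "AT" * 50 + "GC" * 50
print(low_gc_windows(toy))
```

In this toy input only the window fully inside the AT-only core falls below the threshold; overlapping windows that straddle a flank sit at a GC fraction of 0.5.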
An artificial neural network system to identify alleles in reference electropherograms.

PubMed

Taylor, Duncan; Harrison, Ash; Powers, David

2017-09-01

Electropherograms are produced in great numbers in forensic DNA laboratories as part of everyday criminal casework. Before the results of these electropherograms can be used they must be scrutinised by analysts to determine what the identified data tells them about the underlying DNA sequences and what is purely an artefact of the DNA profiling process. This process of interpreting the electropherograms can be time consuming and is prone to subjective differences between analysts. Recently it was demonstrated that artificial neural networks could be used to classify information within an electropherogram as allelic (i.e. representative of a DNA fragment present in the DNA extract) or as one of several different categories of artefactual fluorescence that arise as a result of generating an electropherogram. We extend that work here to demonstrate a series of algorithms and artificial neural networks that can be used to identify peaks on an electropherogram and classify them. We demonstrate the functioning of the system on several profiles and compare the results to a leading commercial DNA profile reading system. Copyright © 2017 Elsevier B.V. All rights reserved.
A modular DNA signal translator for the controlled release of a protein by an aptamer.

PubMed

Beyer, Stefan; Simmel, Friedrich C

2006-01-01

Owing to the intimate linkage of sequence and structure in nucleic acids, DNA is an extremely attractive molecule for the development of molecular devices, in particular when a combination of information processing and chemomechanical tasks is desired. Many of the previously demonstrated devices are driven by hybridization between DNA 'effector' strands and specific recognition sequences on the device. For applications it is of great interest to link several of such molecular devices together within artificial reaction cascades. Often it will not be possible to choose DNA sequences freely, e.g. when functional nucleic acids such as aptamers are used. In such cases translation of an arbitrary 'input' sequence into a desired effector sequence may be required. Here we demonstrate a molecular 'translator' for information encoded in DNA and show how it can be used to control the release of a protein by an aptamer using an arbitrarily chosen DNA input strand. The function of the translator is based on branch migration and the action of the endonuclease FokI. The modular design of the translator facilitates the adaptation of the device to various input or output sequences.
A modular DNA signal translator for the controlled release of a protein by an aptamer

PubMed Central

Beyer, Stefan; Simmel, Friedrich C.

2006-01-01

Owing to the intimate linkage of sequence and structure in nucleic acids, DNA is an extremely attractive molecule for the development of molecular devices, in particular when a combination of information processing and chemomechanical tasks is desired. Many of the previously demonstrated devices are driven by hybridization between DNA ‘effector’ strands and specific recognition sequences on the device. For applications it is of great interest to link several of such molecular devices together within artificial reaction cascades. Often it will not be possible to choose DNA sequences freely, e.g. when functional nucleic acids such as aptamers are used. In such cases translation of an arbitrary ‘input’ sequence into a desired effector sequence may be required. Here we demonstrate a molecular ‘translator’ for information encoded in DNA and show how it can be used to control the release of a protein by an aptamer using an arbitrarily chosen DNA input strand. The function of the translator is based on branch migration and the action of the endonuclease FokI. The modular design of the translator facilitates the adaptation of the device to various input or output sequences. PMID:16547201
Directing an artificial zinc finger protein to new targets by fusion to a non-DNA-binding domain.

PubMed

Lim, Wooi F; Burdach, Jon; Funnell, Alister P W; Pearson, Richard C M; Quinlan, Kate G R; Crossley, Merlin

2016-04-20

Transcription factors are often regarded as having two separable components: a DNA-binding domain (DBD) and a functional domain (FD), with the DBD thought to determine target gene recognition. While this holds true for DNA binding in vitro, it appears that in vivo FDs can also influence genomic targeting. We fused the FD from the well-characterized transcription factor Krüppel-like Factor 3 (KLF3) to an artificial zinc finger (AZF) protein originally designed to target the Vascular Endothelial Growth Factor-A (VEGF-A) gene promoter. We compared genome-wide occupancy of the KLF3FD-AZF fusion to that observed with AZF. AZF bound to the VEGF-A promoter as predicted, but was also found to occupy approximately 25,000 other sites, a large number of which contained the expected AZF recognition sequence, GCTGGGGGC. Interestingly, addition of the KLF3 FD re-distributes the fusion protein to new sites, with total DNA occupancy detected at around 50,000 sites. A portion of these sites correspond to known KLF3-bound regions, while others contained sequences similar but not identical to the expected AZF recognition sequence. These results show that FDs can influence and may be useful in directing AZF DNA-binding proteins to specific targets and provide insights into how natural transcription factors operate. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
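As a rough illustration of what it means for an occupied site to "contain the expected AZF recognition sequence", the sketch below scans a DNA string for exact matches to GCTGGGGGC on either strand. The function names and the toy sequence are hypothetical and are not taken from the study, which may have scored sites differently (for example, allowing mismatches).

```python
# Minimal sketch: exact-match scan for a recognition sequence on both strands.
# Function names and the example sequence are illustrative assumptions.

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_motif(seq: str, motif: str = "GCTGGGGGC") -> list[tuple[int, str]]:
    """Return (0-based start, strand) for every exact motif occurrence."""
    seq = seq.upper()
    hits = []
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        start = seq.find(pattern)
        while start != -1:
            hits.append((start, strand))
            # Step by one base so overlapping occurrences are also reported.
            start = seq.find(pattern, start + 1)
    return sorted(hits)


if __name__ == "__main__":
    toy = "AAGCTGGGGGCTTGCCCCCAGCAA"
    print(find_motif(toy))
```

Positions are reported on the given (top) strand, so a "-" hit marks where the reverse complement of the motif begins.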
DNA sequences and composition from 12 BAC clones-derived MUSB SSR markers mapped to cotton (Gossypium hirsutum L. x G. barbadense L.) chromosomes 11 and 21

USDA-ARS's Scientific Manuscript database

To discover resistance (R) and/or pathogen-induced (PR) genes involved in disease response, 12 bacterial artificial chromosome (BAC) clones from cv. Acala Maxxa (G. hirsutum) were sequenced at the Clemson University, Genomics Institute, Clemson, SC. These BACs derived MUSB single sequence repeat (SS...
Polymorphism of Paramecium pentaurelia (Ciliophora, Oligohymenophorea) strains revealed by rDNA and mtDNA sequences.

PubMed

Przyboś, Ewa; Tarcz, Sebastian; Greczek-Stachura, Magdalena; Surmacz, Marta

2011-05-01

Paramecium pentaurelia is one of 15 known sibling species of the Paramecium aurelia complex. It is recognized as a species showing no intra-specific differentiation on the basis of molecular fingerprint analyses, whereas the majority of other species are polymorphic. This study aimed at assessing genetic polymorphism within P. pentaurelia including new strains recently found in Poland (originating from two water bodies, different years, seasons, and clones of one strain) as well as strains collected from distant habitats (USA, Europe, Asia), and strains representing other species of the complex. We compared two DNA fragments: partial sequences (349 bp) of the LSU rDNA and partial sequences (618 bp) of cytochrome B gene. A correlation between the geographical origin of the strains and the genetic characteristics of their genotypes was not observed. Different genotypes were found in Kraków in two types of water bodies (Opatkowice-natural pond; Jordan's Park-artificial pond). Haplotype diversity within a single water body was not recorded. Likewise, seasonal haplotype differences between the strains within the artificial water body, as well as differences between clones originating from one strain, were not detected. The clustering of some strains belonging to different species was observed in the phylogenies. Copyright © 2010 Elsevier GmbH. All rights reserved.
Teaching artificial intelligence to read electropherograms.

PubMed

Taylor, Duncan; Powers, David

2016-11-01

Electropherograms are produced in great numbers in forensic DNA laboratories as part of everyday criminal casework. Before the results of these electropherograms can be used they must be scrutinised by analysts to determine what the identified data tells us about the underlying DNA sequences and what is purely an artefact of the DNA profiling process. A technique that lends itself well to such a task of classification in the face of vast amounts of data is the use of artificial neural networks. These networks, inspired by the workings of the human brain, have been increasingly successful in analysing large datasets, performing medical diagnoses, identifying handwriting, playing games, or recognising images. In this work we demonstrate the use of an artificial neural network which we train to 'read' electropherograms and show that it can generalise to unseen profiles. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Active role of a human genomic insert in replication of a yeast artificial chromosome.

PubMed

van Brabant, A J; Fangman, W L; Brewer, B J

1999-06-01

Yeast artificial chromosomes (YACs) are a common tool for cloning eukaryotic DNA. The manner by which large pieces of foreign DNA are assimilated by yeast cells into a functional chromosome is poorly understood, as is the reason why some of them are stably maintained and some are not. We examined the replication of a stable YAC containing a 240-kb insert of DNA from the human T-cell receptor beta locus. The human insert contains multiple sites that serve as origins of replication. The activity of these origins appears to require the yeast ARS consensus sequence and, as with yeast origins, additional flanking sequences. In addition, the origins in the human insert exhibit a spacing, a range of activation efficiencies, and a variation in times of activation during S phase similar to those found for normal yeast chromosomes. We propose that an appropriate combination of replication origin density, activation times, and initiation efficiencies is necessary for the successful maintenance of YAC inserts.
Recovery of infectious virus from full-length cowpox virus (CPXV) DNA cloned as a bacterial artificial chromosome (BAC)

PubMed Central

2011-01-01

Transmission from pet rats and cats to humans as well as severe infection in felids and other animal species have recently drawn increasing attention to cowpox virus (CPXV). We report the cloning of the entire genome of cowpox virus strain Brighton Red (BR) as a bacterial artificial chromosome (BAC) in Escherichia coli and the recovery of infectious virus from cloned DNA. Generation of a full-length CPXV DNA clone was achieved by first introducing a mini-F vector, which allows maintenance of large circular DNA in E. coli, into the thymidine kinase locus of CPXV by homologous recombination. Circular replication intermediates were then electroporated into E. coli DH10B cells. Upon successful establishment of the infectious BR clone, we modified the full-length clone such that recombination-mediated excision of bacterial sequences can occur upon transfection in eukaryotic cells. This self-excision of the bacterial replicon is made possible by a sequence duplication within mini-F sequences and allows recovery of recombinant virus progeny without remaining marker or vector sequences. The in vitro growth properties of viruses derived from both BAC clones were determined and found to be virtually indistinguishable from those of parental, wild-type BR. Finally, the complete genomic sequence of the infectious clone was determined and the cloned viral genome was shown to be identical to that of the parental virus. In summary, the generated infectious clone will greatly facilitate studies on individual genes and pathogenesis of CPXV. Moreover, the vector potential of CPXV can now be more systematically explored using this newly generated tool. PMID:21314965
Genetic stability of progeny from an artificial allotetraploid carp using sperm from five fish species.

PubMed

Ye, Yuzhen; Wang, Zhongwei; Zhou, Jianfeng; Wu, Qingjiang

2009-08-01

Microsatellite markers and D-loop sequences of mtDNA from a female allotetraploid parent carp and her progenies of generations 1 and 2 induced by sperm of five distant fish species were analyzed. Eleven microsatellite markers were used to identify 48 alleles from the allotetraploid female. The same number of alleles (48) appeared in the first and second generations of the gynogenetic offspring, regardless of the source of the sperm used as an activator. The mtDNA D-loop analysis was performed on the female tetraploid parent, 25 gynogenetic offspring, and 5 sperm-donor species. Fourteen variable sites from the 1,018 bp sequences were observed in the offspring as compared to the female tetraploid parent. Results from D-loop sequence and microsatellite marker analysis showed exclusive maternal transmission, and no genetic information was derived from the father. Our study suggests that progenies of artificial tetraploid carp are genetically stable, which is important for genetic breeding of this tetraploid fish.
Introduction to the Natural Anticipator and the Artificial Anticipator

NASA Astrophysics Data System (ADS)

Dubois, Daniel M.

2010-11-01

This short communication deals with the introduction of the concept of anticipator, which is one who anticipates, in the framework of computing anticipatory systems. The definition of anticipation deals with the concept of program. Indeed, the word program comes from "pro-gram", meaning "to write before" by anticipation, and means a plan for the programming of a mechanism, or a sequence of coded instructions that can be inserted into a mechanism, or a sequence of coded instructions, as genes or behavioural responses, that is part of an organism. Any natural or artificial programs are thus related to anticipatory rewriting systems, as shown in this paper. All the cells in the body, and the neurons in the brain, are programmed by the anticipatory genetic code, DNA, in a low-level language with four signs. The programs in computers are also computing anticipatory systems. It will be shown, on the one hand, that the genetic code DNA is a natural anticipator. As demonstrated by Nobel laureate McClintock [8], genomes are programmed. The fundamental program deals with the DNA genetic code. The properties of the DNA consist in self-replication and self-modification. The self-replicating process leads to reproduction of the species, while the self-modifying process leads to new species or evolution and adaptation in existing ones. The genetic code DNA keeps its instructions in memory in the DNA coding molecule. The genetic code DNA is a rewriting system, from DNA coding to DNA template molecule. The DNA template molecule is a rewriting system to the Messenger RNA molecule. The information is not destroyed during the execution of the rewriting program. On the other hand, it will be demonstrated that the Turing machine is an artificial anticipator. The Turing machine is a rewriting system. The head reads and writes, modifying the content of the tape. The information is destroyed during the execution of the program. This is an irreversible process. The input data are lost.
A simple method for semi-random DNA amplicon fragmentation using the methylation-dependent restriction enzyme MspJI.

PubMed

Shinozuka, Hiroshi; Cogan, Noel O I; Shinozuka, Maiko; Marshall, Alexis; Kay, Pippa; Lin, Yi-Han; Spangenberg, German C; Forster, John W

2015-04-11

Fragmentation at random nucleotide locations is an essential process for preparation of DNA libraries to be used on massively parallel short-read DNA sequencing platforms. Although instruments for physical shearing, such as the Covaris S2 focused-ultrasonicator system, and products for enzymatic shearing, such as the Nextera technology and NEBNext dsDNA Fragmentase kit, are commercially available, a simple and inexpensive method is desirable for high-throughput sequencing library preparation. MspJI is a recently characterised restriction enzyme which recognises the sequence motif CNNR (where R = G or A) when the first base is modified to 5-methylcytosine or 5-hydroxymethylcytosine. A semi-random enzymatic DNA amplicon fragmentation method was developed based on the unique cleavage properties of MspJI. In this method, random incorporation of 5-methyl-2'-deoxycytidine-5'-triphosphate is achieved through DNA amplification with DNA polymerase, followed by DNA digestion with MspJI. Due to the recognition sequence of the enzyme, DNA amplicons are fragmented in a relatively sequence-independent manner. The size range of the resulting fragments could be controlled by optimising the 5-methyl-2'-deoxycytidine-5'-triphosphate concentration in the reaction mixture. A library suitable for sequencing using the Illumina MiSeq platform was prepared and processed using the proposed method. Alignment of generated short reads to a reference sequence demonstrated a relatively high level of random fragmentation. The proposed method may be performed with standard laboratory equipment. Although the uniformity of coverage was slightly inferior to the Covaris physical shearing procedure, the method's lower cost and labour requirements may make it more suitable than existing approaches for implementation in large-scale sequencing activities, such as bacterial artificial chromosome (BAC)-based genome sequence assembly, pan-genomic studies and locus-targeted genotyping-by-sequencing.
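The idea that a short, degenerate motif plus random methylation gives semi-random fragmentation can be sketched in a few lines. The code below lists every CNNR position on the top strand of a sequence and then samples which of them carry a methylated cytosine. The independent per-site probability `p_methyl` is a simplifying assumption standing in for the 5-methyl-dCTP fraction in the reaction; the enzyme's cleavage offset and the bottom strand are deliberately not modelled, and all names are hypothetical.

```python
# Minimal sketch: candidate MspJI recognition sites (CNNR, R = A or G) and a
# toy model of random cytosine methylation. Not the authors' analysis code.
import random
import re


def candidate_sites(seq: str) -> list[int]:
    """0-based positions of the C in every top-strand CNNR motif."""
    # The lookahead lets overlapping motifs be reported as separate sites.
    return [m.start() for m in re.finditer(r"(?=C[ACGT]{2}[AG])", seq.upper())]


def sample_methylated_sites(seq: str, p_methyl: float, rng: random.Random) -> list[int]:
    """Keep each candidate site with probability p_methyl (assumed independent)."""
    return [pos for pos in candidate_sites(seq) if rng.random() < p_methyl]


if __name__ == "__main__":
    amplicon = "ACGTATCAAGGCTTCCGAACGTTAGC"
    rng = random.Random(0)  # fixed seed so the toy run is repeatable
    print("candidate sites:", candidate_sites(amplicon))
    print("methylated (p=0.3):", sample_methylated_sites(amplicon, 0.3, rng))
```

Raising `p_methyl` in this toy model marks more sites as cleavable and so implies shorter fragments, which mirrors the abstract's statement that fragment size is tuned through the modified nucleotide concentration.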
De novo formed satellite DNA-based mammalian artificial chromosomes and their possible applications.

PubMed

Katona, Robert L

2015-02-01

Mammalian artificial chromosomes (MACs) are non-integrating, autonomously replicating natural chromosome-based vectors that may carry a vast amount of genetic material, which in turn enable potentially prolonged, safe, and regulated therapeutic transgene expression and render MACs as attractive genetic vectors for "gene replacement" or for controlling differentiation pathways in target cells. Satellite-DNA-based artificial chromosomes (SATACs) can be made by induced de novo chromosome formation in cells of different mammalian and plant species. These artificially generated accessory chromosomes are composed of predictable DNA sequences, and they contain defined genetic information. SATACs have already passed a number of obstacles crucial to their further development as gene therapy vectors, including large-scale purification, transfer of purified artificial chromosomes into different cells and embryos, generation of transgenic animals and germline transmission with purified SATACs, and the tissue-specific expression of a therapeutic gene from an artificial chromosome in the milk of transgenic animals. SATACs could be used in cell therapy protocols. For these methods, the most versatile target cell would be one that was pluripotent and self-renewing to address multiple disease target cell types, thus making multilineage stem cells, such as adult derived early progenitor cells and embryonic stem cells, as attractive universal host cells.
Writing DNA with GenoCAD.

PubMed

Czar, Michael J; Cai, Yizhi; Peccoud, Jean

2009-07-01

Chemical synthesis of custom DNA made to order calls for software streamlining the design of synthetic DNA sequences. GenoCAD (www.genocad.org) is a free web-based application to design protein expression vectors, artificial gene networks and other genetic constructs composed of multiple functional blocks called genetic parts. By capturing design strategies in grammatical models of DNA sequences, GenoCAD guides the user through the design process. By successively clicking on icons representing structural features or actual genetic parts, complex constructs composed of dozens of functional blocks can be designed in a matter of minutes. GenoCAD automatically derives the construct sequence from its comprehensive libraries of genetic parts. Upon completion of the design process, users can download the sequence for synthesis or further analysis. Users who elect to create a personal account on the system can customize their workspace by creating their own parts libraries, adding new parts to the libraries, or reusing designs to quickly generate sets of related constructs.
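To make "grammatical models of DNA sequences" concrete, the following sketch checks whether an ordered list of part categories can be derived from a tiny context-free grammar. The grammar rules and category names are invented for illustration only; they are not GenoCAD's actual grammar or API.

```python
# Minimal sketch: validating a construct design against a toy grammar.
# The rules below are hypothetical and chosen only to illustrate the idea.

GRAMMAR = {
    # A construct is one or more expression cassettes.
    "construct": [["cassette"], ["cassette", "construct"]],
    # A cassette is a fixed order of four part categories (terminals).
    "cassette": [["promoter", "rbs", "gene", "terminator"]],
}


def parse(symbol: str, tokens: list[str], pos: int) -> set[int]:
    """Return every position reachable after deriving `symbol` from `pos`."""
    if symbol not in GRAMMAR:  # terminal: must match the next token exactly
        return {pos + 1} if pos < len(tokens) and tokens[pos] == symbol else set()
    ends: set[int] = set()
    for production in GRAMMAR[symbol]:
        positions = {pos}
        for part in production:
            positions = {e for p in positions for e in parse(part, tokens, p)}
        ends |= positions
    return ends


def is_valid(tokens: list[str]) -> bool:
    """True if the whole token list can be derived from 'construct'."""
    return len(tokens) in parse("construct", tokens, 0)


if __name__ == "__main__":
    design = ["promoter", "rbs", "gene", "terminator"]
    print(is_valid(design), is_valid(design[:-1]))
```

A design tool built on this principle can offer only the categories that keep the partial design derivable, which is one way to read "guides the user through the design process".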
Efficient production of artificially designed gelatins with a Bacillus brevis system.

PubMed

Kajino, T; Takahashi, H; Hirai, M; Yamada, Y

2000-01-01

Artificially designed gelatins comprising tandemly repeated 30-amino-acid peptide units derived from human alphaI collagen were successfully produced with a Bacillus brevis system. The DNA encoding the peptide unit was synthesized by taking into consideration the codon usage of the host cells, but no clones having a tandemly repeated gene were obtained through the above-mentioned strategy. Minirepeat genes could be selected in vivo from a mixture of every possible sequence encoding an artificial gelatin by randomly ligating the mixed sequence unit and transforming it into Escherichia coli. Larger repeat genes constructed by connecting minirepeat genes obtained by in vivo selection were also stable in the expression host cells. Gelatins derived from the eight-unit and six-unit repeat genes were extracellularly produced at the level of 0.5 g/liter and easily purified by ammonium sulfate fractionation and anion-exchange chromatography. The purified artificial gelatins had the predicted N-terminal sequences and amino acid compositions and a sol-gel property similar to that of the native gelatin. These results suggest that the selection of a repeat unit sequence stable in an expression host is a shortcut for the efficient production of repetitive proteins and that it can conveniently be achieved by the in vivo selection method. This study revealed the possible industrial application of artificially designed repetitive proteins.

Construction of trypanosome artificial mini-chromosomes.

PubMed Central

Lee, M G; E, Y; Axelrod, N

1995-01-01

We report the preparation of two linear constructs which, when transformed into the procyclic form of Trypanosoma brucei, become stably inherited artificial mini-chromosomes. Both of the two constructs, one of 10 kb and the other of 13 kb, contain a T. brucei PARP promoter driving a chloramphenicol acetyltransferase (CAT) gene. In the 10 kb construct the CAT gene is followed by one hygromycin phosphotransferase (Hph) gene, and in the 13 kb construct the CAT gene is followed by three tandemly linked Hph genes. At each end of these linear molecules are telomere repeats and subtelomeric sequences. Electroporation of these linear DNA constructs into the procyclic form of T. brucei generated hygromycin-B resistant cell lines. In these cell lines, the input DNA remained linear and bounded by the telomere ends, but it increased in size. In the cell lines generated by the 10 kb construct, the input DNA increased in size to 20-50 kb. In the cell lines generated by the 13 kb constructs, two sizes of linear DNAs containing the input plasmid were detected: one of 40-50 kb and the other of 150 kb. The increase in size was not the result of in vivo tandem repetitions of the input plasmid, but represented the addition of new sequences. These Hph containing linear DNA molecules were maintained stably in cell lines for at least 20 generations in the absence of drug selection and were subsequently referred to as trypanosome artificial mini-chromosomes, or TACs. PMID:8532534
Biomimetic Artificial Epigenetic Code for Targeted Acetylation of Histones.

PubMed

Taniguchi, Junichi; Feng, Yihong; Pandian, Ganesh N; Hashiya, Fumitaka; Hidaka, Takuya; Hashiya, Kaori; Park, Soyoung; Bando, Toshikazu; Ito, Shinji; Sugiyama, Hiroshi

2018-06-13

While the central role of locus-specific acetylation of histone proteins in eukaryotic gene expression is well established, the availability of designer tools to regulate acetylation at particular nucleosome sites remains limited. Here, we develop a unique strategy to introduce acetylation by constructing a bifunctional molecule designated Bi-PIP. Bi-PIP has a P300/CBP-selective bromodomain inhibitor (Bi) as a P300/CBP recruiter and a pyrrole-imidazole polyamide (PIP) as a sequence-selective DNA binder. Biochemical assays verified that Bi-PIPs recruit P300 to the nucleosomes having their target DNA sequences and extensively accelerate acetylation. Bi-PIPs also activated transcription of genes that have corresponding cognate DNA sequences inside living cells. Our results demonstrate that Bi-PIPs could act as a synthetic programmable histone code of acetylation, which emulates the bromodomain-mediated natural propagation system of histone acetylation to activate gene expression in a sequence-selective manner.
Non-B-Form DNA Is Enriched at Centromeres

PubMed Central

Henikoff, Steven

2018-01-01

Animal and plant centromeres are embedded in repetitive “satellite” DNA, but are thought to be epigenetically specified. To define genetic characteristics of centromeres, we surveyed satellite DNA from diverse eukaryotes and identified variation in <10-bp dyad symmetries predicted to adopt non-B-form conformations. Organisms lacking centromeric dyad symmetries had binding sites for sequence-specific DNA-binding proteins with DNA-bending activity. For example, human and mouse centromeres are depleted for dyad symmetries, but are enriched for non-B-form DNA and are associated with binding sites for the conserved DNA-binding protein CENP-B, which is required for artificial centromere function but is paradoxically nonessential. We also detected dyad symmetries and predicted non-B-form DNA structures at neocentromeres, which form at ectopic loci. We propose that centromeres form at non-B-form DNA because of dyad symmetries or are strengthened by sequence-specific DNA binding proteins. This may resolve the CENP-B paradox and provide a general basis for centromere specification. PMID:29365169
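A dyad symmetry is a stretch in which one arm is followed, after an optional short spacer, by its own reverse complement, so that the strand can in principle fold back on itself. The sketch below finds such sites with a fixed arm length; the default arm length of 4 and spacer of at most 1 (giving sites under 10 bp) are illustrative assumptions, and the survey's actual detection method is not described in the abstract.

```python
# Minimal sketch: locate short dyad symmetries (arm + spacer + reverse-
# complement arm). Parameter defaults are assumptions for illustration.

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def dyad_symmetries(seq: str, arm: int = 4, max_spacer: int = 1) -> list[tuple[int, int, int]]:
    """Return (start, arm length, spacer length) for every dyad symmetry."""
    seq = seq.upper()
    hits = []
    for i in range(len(seq) - 2 * arm + 1):
        target = reverse_complement(seq[i:i + arm])
        for spacer in range(max_spacer + 1):
            j = i + arm + spacer  # start of the second arm
            if j + arm > len(seq):
                break
            if seq[j:j + arm] == target:
                hits.append((i, arm, spacer))
    return hits


if __name__ == "__main__":
    print(dyad_symmetries("TTGACCGGTCAA"))
```

Counting hits per kilobase in centromeric versus flanking sequence would be one simple way to express "enrichment", though that comparison is not part of this sketch.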
Nanopore with Transverse Nanoelectrodes for Electrical Characterization and Sequencing of DNA

PubMed Central

Gierhart, Brian C.; Howitt, David G.; Chen, Shiahn J.; Zhu, Zhineng; Kotecki, David E.; Smith, Rosemary L.; Collins, Scott D.

2009-01-01

A DNA sequencing device which integrates transverse conducting electrodes for the measurement of electrode currents during DNA translocation through a nanopore has been nanofabricated and characterized. A focused electron beam (FEB) milling technique, capable of creating features on the order of 1 nm in diameter, was used to create the nanopore. The device was characterized electrically using gold nanoparticles as an artificial analyte with both DC and AC measurement methods. Single nanoparticle/electrode interaction events were recorded. A low-noise, high-speed transimpedance current amplifier for the detection of nano to picoampere currents at microsecond time scales was designed, fabricated and tested for future integration with the nanopore device. PMID:19584949
Nanopore with Transverse Nanoelectrodes for Electrical Characterization and Sequencing of DNA.

PubMed

Gierhart, Brian C; Howitt, David G; Chen, Shiahn J; Zhu, Zhineng; Kotecki, David E; Smith, Rosemary L; Collins, Scott D

2008-06-16

A DNA sequencing device which integrates transverse conducting electrodes for the measurement of electrode currents during DNA translocation through a nanopore has been nanofabricated and characterized. A focused electron beam (FEB) milling technique, capable of creating features on the order of 1 nm in diameter, was used to create the nanopore. The device was characterized electrically using gold nanoparticles as an artificial analyte with both DC and AC measurement methods. Single nanoparticle/electrode interaction events were recorded. A low-noise, high-speed transimpedance current amplifier for the detection of nano to picoampere currents at microsecond time scales was designed, fabricated and tested for future integration with the nanopore device.
Selecting Fully-Modified XNA Aptamers Using Synthetic Genetics.

PubMed

Taylor, Alexander I; Holliger, Philipp

2018-06-01

This unit describes the application of "synthetic genetics," i.e., the replication of xeno nucleic acids (XNAs), artificial analogs of DNA and RNA bearing alternative backbone or sugar congeners, to the directed evolution of synthetic oligonucleotide ligands (XNA aptamers) specific for target proteins or nucleic acid motifs, using a cross-chemistry selective exponential enrichment (X-SELEX) approach. Protocols are described for synthesis of diverse-sequence XNA repertoires (typically 10^14 molecules) using DNA templates, isolation and panning for functional XNA sequences using targets immobilized on solid phase or gel shift induced by target binding in solution, and XNA reverse transcription to allow cDNA amplification or sequencing. The method may be generally applied to select fully-modified XNA aptamers specific for a wide range of target molecules. © 2018 by John Wiley & Sons, Inc.
[Construction of a general AAV vector regulated by minimal and artificial hypoxic-responsive element].

PubMed

Nie, Xiao-wei; Sun, Li-jun; Hao, Yue-wen; Yang, Guang-xiao; Wang, Quan-ying

2011-03-01

To synthesize a minimal artificial hypoxia-responsive element (HRE), insert it upstream of the CMV promoter of an AAV plasmid, and construct an HRE-regulated AAV, which was introduced into 293 cells by the Ca3(PO4)2 method using three plasmids. The resulting HRE-regulated adeno-associated virus vector could potentially be used for gene therapy in ischemic cardiovascular and cerebrovascular disease. A 36 bp nucleotide sequence consisting of four tandem HIF-binding sites, A/GCGTG (4×HBS), and a 35 bp spacer sequence were artificially synthesized, inserted upstream of the TATA box of the CMV promoter, and then amplified by PCR. The cDNA fragment was confirmed to be correct by DNA sequencing. Routine molecular biology methods were used to construct an AAV vector regulated by the minimal HRE, in which the normal CMV promoter of the AAV vector was replaced by the CMV promoter containing the minimal HRE. The NT4-6His-PR39 fusogenic peptide was then inserted into the MCS of the plasmid, and the recombinant AAV vector was obtained by three-plasmid co-transfection of 293 cells, in which expression of 6×His in a hypoxic environment was also examined by immunochemistry. The artificial HRE was inserted upstream of the CMV promoter with the correct spacing between the HRE and the TATA box. DNA sequencing and restriction enzyme digestion indicated that the HRE-regulated AAV was successfully constructed. Compared with the control group, expression of 6×His was significantly increased in the experimental groups in a hypoxic environment, confirming that the AAV was effectively regulated by the minimal HRE inserted upstream of the CMV promoter. The HRE was inserted by PCR into the upstream end of the CMV promoter, which lacks a restriction enzyme recognition site, and a eukaryotic expression vector regulated by the hypoxia-responsive element was constructed. The AAV is effectively regulated by the minimal HRE inserted upstream of the CMV promoter. The vector was successfully constructed and has important theoretical and practical value in the prevention and therapy of ischemic cardiovascular and cerebrovascular disease.
New dye-labeled terminators for improved DNA sequencing patterns.

PubMed Central

Rosenblum, B B; Lee, L G; Spurgeon, S L; Khan, S H; Menchen, S M; Heiner, C R; Chen, S M

1997-01-01

We have used two new dye sets for automated dye-labeled terminator DNA sequencing. One set consists of four, 4,7-dichlororhodamine dyes (d-rhodamines). The second set consists of energy-transfer dyes that use the 5-carboxy-d-rhodamine dyes as acceptor dyes and the 5- or 6-carboxy isomers of 4'-aminomethylfluorescein as the donor dye. Both dye sets utilize a new linker between the dye and the nucleotide, and both provide more even peak heights in terminator sequencing than the dye-terminators consisting of unsubstituted rhodamine dyes. The unsubstituted rhodamine terminators produced electropherograms in which weak G peaks are observed after A peaks and occasionally C peaks. The number of weak G peaks has been reduced or eliminated with the new dye terminators. The general improvement in peak evenness improves accuracy for the automated base-calling software. The improved signal-to-noise ratio of the energy-transfer dye-labeled terminators combined with more even peak heights results in successful sequencing of high molecular weight DNA templates such as bacterial artificial chromosome DNA. PMID:9358158
DNA hypomethylation of individual sequences in aborted cloned bovine fetuses.

PubMed

Chen, Tao; Jiang, Yan; Zhang, Yan-Ling; Liu, Jing-He; Hou, Yi; Schatten, Heide; Chen, Da-Yuan; Sun, Qing-Yuan

2005-09-01

Cloned bovines have a much higher abortion rate than those derived in vivo. Available evidence indicates that inappropriate epigenetic reprogramming of donor nuclei is the primary cause of cloning failure. To gain a better understanding of the DNA methylation changes associated with the high abortion rate of cloned bovines, we examined the DNA methylation status of a repeated sequence (satellite I) and the promoter regions of two single-copy genes (interleukin 3/cytokeratin) in aborted cloned fetuses, aborted fetuses derived from artificial insemination (AI), cloned adults and AI adults by bisulfite sequencing and restriction enzyme analysis. Two of four aborted cloned fetuses show very low methylation levels in the two single-copy gene promoter regions. One of the two fetuses also showed undermethylated status in the satellite I sequence. The other two aborted cloned fetuses have similar methylation levels to those of aborted AI fetuses. However, no difference in methylation was observed between cloned adults and AI adults. Our results demonstrate for the first time the undermethylated status of individual sequences in aborted cloned fetuses. These findings suggest that aberrant DNA methylation may contribute to the developmental failure of cloned bovine fetuses.
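Bisulfite sequencing reads out methylation because unmethylated cytosines are converted and appear as T in the reads, while methylated cytosines remain C. Under that principle, a per-CpG methylation level can be estimated as the fraction of reads showing C at each reference CpG. The sketch below assumes reads are already aligned to the reference without gaps and cover the top strand only; the names and toy data are hypothetical and not taken from the study.

```python
# Minimal sketch: per-CpG methylation level from gap-free, top-strand
# bisulfite reads aligned to a reference. Toy data only.

def cpg_methylation(reference: str, reads: list[str]) -> dict[int, float]:
    """Map each CpG cytosine position to the fraction of reads showing C."""
    reference = reference.upper()
    cpg_positions = [i for i in range(len(reference) - 1)
                     if reference[i:i + 2] == "CG"]
    levels = {}
    for pos in cpg_positions:
        # Only C (methylated) and T (converted, unmethylated) calls count;
        # other bases are treated as sequencing errors and skipped.
        calls = [r[pos].upper() for r in reads
                 if len(r) > pos and r[pos].upper() in ("C", "T")]
        if calls:
            levels[pos] = calls.count("C") / len(calls)
    return levels


if __name__ == "__main__":
    ref = "ACGTTCGA"
    clones = ["ACGTTTGA", "ATGTTTGA", "ACGTTCGA", "ACGTTTGA"]
    print(cpg_methylation(ref, clones))
```

Averaging these values over a promoter region gives a single number per sample, which is the kind of summary that lets "very low methylation levels" in some fetuses be compared against others.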
Method of artificial DNA splicing by directed ligation (SDL).

PubMed Central

Lebedenko, E N; Birikh, K R; Plutalov, O V; Berlin, Yu A

1991-01-01

An approach to directed genetic recombination in vitro has been devised, which allows for joining together, in a predetermined way, a series of DNA segments to give a precisely spliced polynucleotide sequence (DNA splicing by directed ligation, SDL). The approach makes use of amplification, by means of several polymerase chain reactions (PCR), of a chosen set of DNA segments. Primers for the amplifications contain recognition sites of the class IIS restriction endonucleases, which transform blunt ends of the amplification products into protruding ends of unique primary structures, the ends to be used for joining segments together being mutually complementary. Ligation of the mixture of the segments so synthesized gives the desired sequence in an unambiguous way. The suggested approach has been exemplified by the synthesis of a totally processed (intronless) gene encoding human mature interleukin-1 alpha. PMID:1662363
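The requirement that protruding ends be unique and mutually complementary at each junction can be checked mechanically before ordering primers. In the sketch below each segment is a tuple of a name, a left overhang and a right overhang, with every overhang written 5' to 3' on its own strand; this representation, the function names and the example overhangs are all assumptions made for illustration and are not taken from the paper.

```python
# Minimal sketch: verify that an ordered set of segments can ligate in only
# one way. Data layout and example overhangs are hypothetical.

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def check_assembly(segments: list[tuple[str, str, str]]) -> bool:
    """True if neighbours pair up and every junction overhang is unambiguous."""
    junctions = []
    for (_, _, right), (_, left, _) in zip(segments, segments[1:]):
        # The right end of one segment must anneal to the left end of the next.
        if reverse_complement(right) != left.upper():
            return False
        junctions.append(right.upper())
    # Self-complementary overhangs would let a segment ligate to its own copy.
    if any(o == reverse_complement(o) for o in junctions):
        return False
    # Treat an overhang and its complement as the same junction when
    # testing uniqueness, so no end can pair at the wrong position.
    canonical = {min(o, reverse_complement(o)) for o in junctions}
    return len(canonical) == len(junctions)


if __name__ == "__main__":
    exons = [("exon1", "", "ACGA"), ("exon2", "TCGT", "GGAC"), ("exon3", "GTCC", "")]
    print(check_assembly(exons))
```

With four-base overhangs there are a limited number of usable non-palindromic junctions, so a check like this becomes more useful as the number of segments grows.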
Monitoring Arthrobacter protophormiae RKJ100 in a 'tag and chase' method during p-nitrophenol bio-remediation in soil microcosms.

PubMed

Pandey, Gunjan; Pandey, Janmejay; Jain, Rakesh K

2006-05-01

Monitoring of micro-organisms released deliberately into the environment is essential to assess their movement during the bio-remediation process. During the last few years, DNA-based genetic methods have emerged as the preferred method for such monitoring; however, their use is restricted in cases where organisms used for bio-remediation are not well characterized or where the public domain databases do not provide sufficient information regarding their sequence. For monitoring of such micro-organisms, alternate approaches have to be undertaken. In this study, we have specifically monitored a p-nitrophenol (PNP)-degrading organism, Arthrobacter protophormiae RKJ100, using molecular methods during PNP degradation in soil microcosm. Cells were tagged with a transposon-based foreign DNA sequence prior to their introduction into PNP-contaminated microcosms. Later, this artificially introduced DNA sequence was PCR-amplified to distinguish the bio-augmented organism from the indigenous microflora during PNP bio-remediation.
Design, synthesis and DNA interactions of a chimera between a platinum complex and an IHF mimicking peptide.

PubMed

Rao, Harita; Damian, Mariana S; Alshiekh, Alak; Elmroth, Sofi K C; Diederichsen, Ulf

2015-12-28

Conjugation of metal complexes with peptide scaffolds possessing high DNA binding affinity has been shown to modulate their biological activities and to enhance their interaction with DNA. In this work, a platinum complex/peptide chimera was synthesized based on a model of the Integration Host Factor (IHF), an architectural protein possessing sequence-specific DNA binding and bending abilities through its interaction with the minor groove. The model peptide consists of a cyclic unit resembling the minor groove binding subdomain of IHF, a positively charged lysine dendrimer for electrostatic interactions with the DNA phosphate backbone and a flexible glycine linker tethering the two units. A norvaline-derived artificial amino acid was designed to contain a dimethylethylenediamine as a bidentate platinum chelating unit, and introduced into the IHF mimicking peptides. The interaction of the chimeric peptides with various DNA sequences was studied using thermal melting studies, agarose gel electrophoresis for plasmid DNA unwinding experiments, and native and denaturing gel electrophoresis to visualize non-covalent and covalent peptide-DNA adducts, respectively. By incorporation of the platinum metal center within the model peptide mimicking IHF we have attempted to improve its specificity and DNA targeting ability, particularly towards those sequences containing adjacent guanine residues.
Two RNAs or DNAs May Artificially Fuse Together at a Short Homologous Sequence (SHS) during Reverse Transcription or Polymerase Chain Reactions, and Thus Reporting an SHS-Containing Chimeric RNA Requires Extra Caution

PubMed Central

Xie, Bingkun; Yang, Wei; Ouyang, Yongchang; Chen, Lichan; Jiang, Hesheng; Liao, Yuying; Liao, D. Joshua

2016-01-01

Tens of thousands of chimeric RNAs have been reported. Most of them contain a short homologous sequence (SHS) at the joining site of the two partner genes but are not associated with a fusion gene. We hypothesize that many of these chimeras may be technical artifacts derived from SHS-caused mis-priming in reverse transcription (RT) or polymerase chain reactions (PCR). We cloned six chimeric complementary DNAs (cDNAs) formed by human mitochondrial (mt) 16S rRNA sequences at an SHS, which were similar to several expressed sequence tags (ESTs). These chimeras, which could not be detected with cDNA protection assay, were likely formed because some regions of the 16S rRNA are reversely complementary to another region to form an SHS, which allows the downstream sequence to loop back and anneal at the SHS to prime the synthesis of its complementary strand, yielding a palindromic sequence that can form a hairpin-like structure. We identified that a 16S rRNA ending at the 4th nucleotide (nt) of the mt-tRNA-leu was dominant and thus should be the wild type. We also cloned a mouse Bcl2-Nek9 chimeric cDNA that contained a 5-nt unmatchable sequence between the two partners, contained two copies of the reverse primer in the same direction but did not contain the forward primer, making it unclear how this Bcl2-Nek9 was formed and amplified. Moreover, a cDNA was amplified because one primer has 4 nts matched to the template, suggesting that there may be many more artificial cDNAs than we have realized, because the nuclear and mt genomes have many more 4-nt than 5-nt or longer homologues. Altogether, the chimeric cDNAs we cloned are good examples suggesting that many cDNAs may be artifacts due to SHS-caused mis-priming and thus greater caution should be taken when new sequence is obtained from a technique involving DNA polymerization. PMID:27148738
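The remark that genomes contain many more 4-nt than 5-nt matches can be made concrete with a rough back-of-the-envelope sketch. It assumes a random sequence with equal base frequencies, which real genomes are not, and the function names and the example length are illustrative assumptions rather than anything taken from the paper.

```python
def count_sites(template: str, kmer: str) -> int:
    """Count exact, possibly overlapping, occurrences of kmer in template."""
    width = len(kmer)
    return sum(
        1
        for i in range(len(template) - width + 1)
        if template[i:i + width] == kmer
    )


def expected_random_sites(length: int, k: int) -> float:
    """Expected occurrences of one specific k-mer on one strand of a random
    sequence of the given length, assuming equal base frequencies."""
    return max(length - k + 1, 0) / 4 ** k


# Assumed example length: the human mtDNA reference sequence (16,569 bp);
# this figure is not stated in the abstract.
MT_LENGTH = 16_569
for k in (4, 5, 6):
    print(k, round(expected_random_sites(MT_LENGTH, k), 1))
```

Under this simple model each additional matched nucleotide divides the expected number of chance priming sites by four, which is consistent with the authors' concern about primers that match the template at only 4 nt.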
Unlinking the methylome pattern from nucleotide sequence, revealed by large-scale in vivo genome engineering and methylome editing in medaka fish

PubMed Central

Nakamura, Ryohei; Uno, Ayako; Kumagai, Masahiko; Fukushima, Hiroto S.; Morishita, Shinichi; Takeda, Hiroyuki

2017-01-01

The heavily methylated vertebrate genomes are punctuated by stretches of poorly methylated DNA sequences that usually mark gene regulatory regions. It is known that the methylation state of these regions confers transcriptional control over their associated genes. Given its governance on the transcriptome, cellular functions and identity, genome-wide DNA methylation pattern is tightly regulated and evidently predefined. However, how the methylation pattern is determined in vivo remains enigmatic. Based on in silico and in vitro evidence, recent studies proposed that the regional hypomethylated state is primarily determined by local DNA sequence, e.g., high CpG density and presence of specific transcription factor binding sites. Nonetheless, the dependency of DNA methylation on nucleotide sequence has not been carefully validated in vertebrates in vivo. Herein, with the use of medaka (Oryzias latipes) as a model, the sequence dependency of DNA methylation was intensively tested in vivo. Our statistical modeling confirmed the strong statistical association between nucleotide sequence pattern and methylation state in the medaka genome. However, by manipulating the methylation state of a number of genomic sequences and reintegrating them into medaka embryos, we demonstrated that artificially conferred DNA methylation states were predominantly and robustly maintained in vivo, regardless of their sequences and endogenous states. This feature was also observed in the medaka transgene that had passed across generations. Thus, despite the observed statistical association, nucleotide sequence was unable to autonomously determine its own methylation state in medaka in vivo. Our results apparently argue against the notion of the governance on the DNA methylation by nucleotide sequence, but instead suggest the involvement of other epigenetic factors in defining and maintaining the DNA methylation landscape. Further investigation in other vertebrate models in vivo will be needed for the generalization of our observations made in medaka. PMID:29267279
Nanoparticle-labeled DNA capture elements for detection and identification of biological agents

NASA Astrophysics Data System (ADS)

Kiel, Johnathan L.; Holwitt, Eric A.; Parker, Jill E.; Vivekananda, Jeevalatha; Franz, Veronica

2004-12-01

Aptamers, synthetic DNA capture elements (DCEs), can be made chemically or in genetically engineered bacteria. DNA capture elements are artificial DNA sequences, from a random pool of sequences, selected for their specific binding to potential biological warfare or terrorism agents. These sequences were selected by an affinity method using filters to which the target agent was attached and the DNA isolated and amplified by polymerase chain reaction (PCR) in an iterative, increasingly stringent, process. The probes can then be conjugated to Quantum Dots and superparamagnetic nanoparticles. The former provide intense, bleach-resistant fluorescent detection of bioagent and the latter provide a means to collect the bioagents with a magnet. The fluorescence can be detected in a flow cytometer, in a fluorescence plate reader, or with a fluorescence microscope. To date, we have made DCEs to Bacillus anthracis spores, Shiga toxin, Venezuelan Equine Encephalitis (VEE) virus, and Francisella tularensis. DCEs can easily distinguish Bacillus anthracis from its nearest relatives, Bacillus cereus and Bacillus thuringiensis. Development of a high-throughput process is currently being investigated.
The Centromere: Chromatin Foundation for the Kinetochore Machinery

PubMed Central

Fukagawa, Tatsuo; Earnshaw, William C.

2014-01-01

Since discovery of the centromere-specific histone H3 variant CENP-A, centromeres have come to be defined as chromatin structures that establish the assembly site for the complex kinetochore machinery. In most organisms, centromere activity is defined epigenetically, rather than by specific DNA sequences. In this review, we describe selected classic work and recent progress in studies of centromeric chromatin with a focus on vertebrates. We consider possible roles for repetitive DNA sequences found at most centromeres, chromatin factors and modifications that assemble and activate CENP-A chromatin for kinetochore assembly, plus the use of artificial chromosomes and kinetochores to study centromere function. PMID:25203206
Engineering and Application of Zinc Finger Proteins and TALEs for Biomedical Research.

PubMed

Kim, Moon-Soo; Kini, Anu Ganesh

2017-08-01

Engineered DNA-binding domains provide a powerful technology for numerous biomedical studies due to their ability to recognize specific DNA sequences. Zinc fingers (ZF) are one of the most common DNA-binding domains and have been extensively studied for a variety of applications, such as gene regulation, genome engineering and diagnostics. Another novel DNA-binding domain known as a transcriptional activator-like effector (TALE) has been more recently discovered, which has a previously undescribed DNA-binding mode. Due to their modular architecture and flexibility, TALEs have been rapidly developed into artificial gene targeting reagents. Here, we describe the methods used to design these DNA-binding proteins and their key applications in biomedical research.
Variation in the number of nucleoli and incomplete homogenization of 18S ribosomal DNA sequences in leaf cells of the cultivated Oriental ginseng (Panax ginseng Meyer).

PubMed

Chelomina, Galina N; Rozhkovan, Konstantin V; Voronova, Anastasia N; Burundukova, Olga L; Muzarok, Tamara I; Zhuravlev, Yuri N

2016-04-01

Wild ginseng, Panax ginseng Meyer, is an endangered species of medicinal plants. In the present study, we analyzed variations within the ribosomal DNA (rDNA) cluster to gain insight into the genetic diversity of the Oriental ginseng, P. ginseng, under artificial cultivation. The roots of wild P. ginseng plants were sampled from a nonprotected natural population of the Russian Far East. The slides were prepared from leaf tissues using the squash technique for cytogenetic analysis. The 18S rDNA sequences were cloned and sequenced. The distribution of nucleotide diversity, recombination events, and interspecific phylogenies for the total 18S rDNA sequence data set was also examined. In mesophyll cells, mononucleolar nuclei were estimated to be dominant (75.7%), while the remaining nuclei contained two to four nucleoli. Among the analyzed 18S rDNA clones, 20% were identical to the 18S rDNA sequence of P. ginseng from Japan, and other clones differed by one to six substitutions. The nucleotide polymorphism was more pronounced at positions 440-640 bp, and was distributed in variable regions, expansion segments, and conservative elements of the core structure. The phylogenetic analysis confirmed conspecificity of ginseng plants cultivated in different regions, with two fixed mutations between P. ginseng and other species. This study identified evidence of intragenomic nucleotide polymorphism in the 18S rDNA sequences of P. ginseng. These data suggest that, in cultivated plants, the observed genome instability may influence the synthesis of biologically active compounds, which are widely used in traditional medicine.
Variation in the number of nucleoli and incomplete homogenization of 18S ribosomal DNA sequences in leaf cells of the cultivated Oriental ginseng (Panax ginseng Meyer)

PubMed Central

Chelomina, Galina N.; Rozhkovan, Konstantin V.; Voronova, Anastasia N.; Burundukova, Olga L.; Muzarok, Tamara I.; Zhuravlev, Yuri N.

2015-01-01

Background: Wild ginseng, Panax ginseng Meyer, is an endangered species of medicinal plants. In the present study, we analyzed variations within the ribosomal DNA (rDNA) cluster to gain insight into the genetic diversity of the Oriental ginseng, P. ginseng, under artificial cultivation. Methods: The roots of wild P. ginseng plants were sampled from a nonprotected natural population of the Russian Far East. The slides were prepared from leaf tissues using the squash technique for cytogenetic analysis. The 18S rDNA sequences were cloned and sequenced. The distribution of nucleotide diversity, recombination events, and interspecific phylogenies for the total 18S rDNA sequence data set was also examined. Results: In mesophyll cells, mononucleolar nuclei were estimated to be dominant (75.7%), while the remaining nuclei contained two to four nucleoli. Among the analyzed 18S rDNA clones, 20% were identical to the 18S rDNA sequence of P. ginseng from Japan, and other clones differed by one to six substitutions. The nucleotide polymorphism was more pronounced at positions 440–640 bp, and was distributed in variable regions, expansion segments, and conservative elements of the core structure. The phylogenetic analysis confirmed conspecificity of ginseng plants cultivated in different regions, with two fixed mutations between P. ginseng and other species. Conclusion: This study identified evidence of intragenomic nucleotide polymorphism in the 18S rDNA sequences of P. ginseng. These data suggest that, in cultivated plants, the observed genome instability may influence the synthesis of biologically active compounds, which are widely used in traditional medicine. PMID:27158239
HLA genotyping by next-generation sequencing of complementary DNA.

PubMed

Segawa, Hidenobu; Kukita, Yoji; Kato, Kikuya

2017-11-28

Genotyping of the human leucocyte antigen (HLA) is indispensable for various medical treatments. However, unambiguous genotyping is technically challenging due to high polymorphism of the corresponding genomic region. Next-generation sequencing is changing the landscape of genotyping. In addition to high throughput of data, its additional advantage is that DNA templates are derived from single molecules, which is a strong merit for the phasing problem. Although most currently developed technologies use genomic DNA, use of cDNA could enable genotyping with reduced costs in data production and analysis. We thus developed an HLA genotyping system based on next-generation sequencing of cDNA. Each HLA gene was divided into 3 or 4 target regions subjected to PCR amplification and subsequent sequencing with Ion Torrent PGM. The sequence data were then subjected to an automated analysis. The principle of the analysis was to construct candidate sequences generated from all possible combinations of variable bases and arrange them in decreasing order of the number of reads. Upon collecting candidate sequences from all target regions, 2 haplotypes were usually assigned. Cases not assigned 2 haplotypes were forwarded to 4 additional processes: selection of candidate sequences applying more stringent criteria, removal of artificial haplotypes, selection of candidate sequences with a relaxed threshold for sequence matching, and countermeasure for incomplete sequences in the HLA database. The genotyping system was evaluated using 30 samples; the overall accuracy was 97.0% at the field 3 level and 98.3% at the G group level. With one sample, genotyping of DPB1 was not completed due to short read size. We then developed a method for complete sequencing of individual molecules of the DPB1 gene, using the molecular barcode technology. The performance of the automatic genotyping system was comparable to that of systems developed in previous studies. 
Thus, next-generation sequencing of cDNA is a viable option for HLA genotyping.
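The core principle stated in this abstract (build candidate sequences from all combinations of variable bases, then rank them by read count) can be sketched as follows. This is a simplified illustration, not the authors' pipeline: it assumes reads for one target region are already aligned and of equal length, and the `min_fraction` threshold, function names, and toy reads are hypothetical.

```python
from collections import Counter
from itertools import product


def variable_positions(reads, min_fraction=0.1):
    """Map each variable column to its bases, keeping bases seen in at least
    min_fraction of reads and columns where more than one base qualifies."""
    total = len(reads)
    result = {}
    for pos in range(len(reads[0])):
        counts = Counter(read[pos] for read in reads)
        bases = sorted(b for b, c in counts.items() if c / total >= min_fraction)
        if len(bases) > 1:
            result[pos] = bases
    return result


def ranked_candidates(reads, min_fraction=0.1):
    """Enumerate every combination of variable bases and sort the combinations
    by the number of reads that carry them, highest first."""
    variable = variable_positions(reads, min_fraction)
    positions = sorted(variable)
    observed = Counter(tuple(read[p] for p in positions) for read in reads)
    candidates = [
        (combo, observed.get(combo, 0))
        for combo in product(*(variable[p] for p in positions))
    ]
    candidates.sort(key=lambda item: (-item[1], item[0]))
    return positions, candidates


# Toy data: two well-supported sequences plus one read mixing the two.
reads = ["ACGTA"] * 5 + ["ATGTC"] * 4 + ["ACGTC"] * 1
positions, candidates = ranked_candidates(reads)
for combo, n_reads in candidates:
    print("".join(combo), n_reads)
```

In the toy data the two top-ranked combinations play the role of the two haplotypes, while the low-count combination resembles the kind of artificial haplotype that the abstract says is removed in a later step.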

Investigations of Escherichia coli promoter sequences with artificial neural networks: New signals discovered upstream of the transcriptional startpoint

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pedersen, A.G.; Engelbrecht, J.

1995-12-31

In this paper we present a novel method for using the learning ability of a neural network as a measure of information in local regions of input data. Using the method to analyze Escherichia coli promoters, we discover all previously described signals, and furthermore find new signals that are regularly spaced along the promoter region. The spacing of all signals corresponds to the helical periodicity of DNA, meaning that the signals are all present on the same face of the DNA helix in the promoter region. This is consistent with a model where the RNA polymerase contacts the promoter on one side of the DNA, and suggests that the regions important for promoter recognition may include more positions on the DNA than usually assumed. We furthermore analyze the E. coli promoters by calculating the Kullback-Leibler distance, and by constructing sequence logos.
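As an illustration of the Kullback-Leibler calculation mentioned in this abstract, the sketch below computes, for each column of a set of aligned sequences, the divergence between the observed base frequencies and a background distribution. The abstract does not say which reference distribution the authors used, so the uniform background here is an assumption, as are the function name and the toy sequences.

```python
import math
from collections import Counter

# Assumed background: equal base frequencies (not stated in the abstract).
UNIFORM = {"A": 0.25, "C": 0.25, "G": 0.25, "T": 0.25}


def kl_per_position(aligned, background=UNIFORM):
    """Return the Kullback-Leibler divergence (in bits) of each alignment
    column from the background distribution."""
    if not aligned:
        return []
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("sequences must be aligned to the same length")
    total = len(aligned)
    divergences = []
    for pos in range(length):
        counts = Counter(seq[pos] for seq in aligned)
        # Bases that are absent from a column contribute zero to the sum.
        divergences.append(
            sum(
                (c / total) * math.log2((c / total) / background[base])
                for base, c in counts.items()
            )
        )
    return divergences


# Toy alignment loosely resembling a -10 element; not real promoter data.
toy = ["TATAAT", "TATGAT", "TACAAT", "TATACT"]
for pos, bits in enumerate(kl_per_position(toy)):
    print(pos, round(bits, 3))
```

With a uniform background, the per-column value equals the information content plotted in a sequence logo (without small-sample correction), so fully conserved columns score 2 bits and uniformly mixed columns score 0.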
Identifying sites of replication initiation in yeast chromosomes: looking for origins in all the right places.

PubMed

van Brabant, A J; Hunt, S Y; Fangman, W L; Brewer, B J

1998-06-01

DNA fragments that contain an active origin of replication generate bubble-shaped replication intermediates with diverging forks. We describe two methods that use two-dimensional (2-D) agarose gel electrophoresis along with DNA sequence information to identify replication origins in natural and artificial Saccharomyces cerevisiae chromosomes. The first method uses 2-D gels of overlapping DNA fragments to locate an active chromosomal replication origin within a region known to confer autonomous replication on a plasmid. A variant form of 2-D gels can be used to determine the direction of fork movement, and the second method uses this technique to find restriction fragments that are replicated by diverging forks, indicating that a bidirectional replication origin is located between the two fragments. Either of these two methods can be applied to the analysis of any genomic region for which there is DNA sequence information or an adequate restriction map.
Development of positive control materials for DNA-based detection of cystic fibrosis: Cloning and sequencing of 31 mutations

DOE Office of Scientific and Technical Information (OSTI.GOV)

Iovannisci, D.; Brown, C.; Winn-Deen, E.

1994-09-01

The cloning and sequencing of the gene associated with cystic fibrosis (CF) now provides the opportunity for earlier detection and carrier screening through DNA-based detection schemes. To date, over 300 mutations have been reported to the CF Consortium; however, only 30 mutations have been observed frequently enough world-wide to warrant routine screening. Many of these mutations are not available as cloned material or as established tissue culture cell lines to aid in the development of DNA-based detection assays. We have therefore cloned the 30 most frequently reported mutations, plus the mutation R347H due to its association with male infertility (31 mutations, total). Two approaches were employed: direct PCR amplification, where mutations were available from patient sources, and site-directed PCR mutagenesis of normal genomic DNA to generate the remaining mutations. After amplification, products were cloned into a sequencing vector, bacterial transformants were screened by a novel method (PCR/oligonucleotide ligation assay/sequence-coded separation), and plasmid DNA sequences determined by automated fluorescent methods on the Applied Biosystems 373A. Mixing of the clones allows the construction of artificial genotypes useful as positive control material for assay validation. A second round of mutagenesis, resulting in the construction of plasmids bearing multiple mutations, will be evaluated for their utility as reagent control materials in kit development.
Sequence-selective DNA cleavage by a chimeric metallopeptide.

PubMed

Kovacic, Roger T; Welch, Joel T; Franklin, Sonya J

2003-06-04

A chimeric metallopeptide derived from the sequences of two structurally superimposable motifs was designed as an artificial nuclease. Both DNA recognition and nuclease activity have been incorporated into a small peptide sequence. P3W, a 33-mer peptide comprising helices alpha2 and alpha3 from the engrailed homeodomain and the consensus EF-hand Ca-binding loop, binds one equivalent of lanthanides or calcium and folds upon metal binding. The conditional formation constants (in the presence of 50 mM Tris) of P3W for Eu(III) (Ka = (2.1 ± 0.1) × 10^5 M^-1) and Ce(IV) (Ka = (2.6 ± 0.1) × 10^5 M^-1) are typical of isolated EF-hand peptides. Circular dichroism studies show that 1:1 CeP3W is 26% alpha-helical and EuP3W is up to 40% alpha-helical in the presence of excess metal. The predicted helicity of the folded peptide based on helix length and end effects is about 50%, showing the metallopeptides are significantly folded. EuP3W has considerably more secondary structure than our previously reported chimeras (Welch, J. T.; Sirish, M.; Lindstrom, K. M.; Franklin, S. J. Inorg. Chem. 2001, 40, 1982-1984). Eu(III)P3W and Ce(IV)P3W nick supercoiled DNA at pH 6.9, although EuP3W is more active at pH 8. CeP3W cleaves linearized, duplex DNA as well as supercoiled plasmid. The cleavage of a 5'-32P-labeled 121-mer DNA fragment was followed by polyacrylamide gel electrophoresis. The cleavage products are 3'-OPO3 termini exclusively, suggesting a regioselective or multistep mechanism. In contrast, uncomplexed Ce(IV) and Eu(III) ions produce both 3'-OPO3 and 3'-OH, and no evidence of 4'-oxidative cleavage termini with either metal. The complementary 3'-32P-labeled oligonucleotide experiment also showed both 5'-OPO3 and 5'-OH termini were produced by the free ions, whereas CeP3W produces only 5'-OPO3 termini. In addition to apparent regioselectivity, the metallopeptides cut DNA with modest sequence discrimination, which suggests that the HTH motif binds DNA as a folded domain and thus cleaves selected sequences. The de novo artificial nuclease LnP3W represents the first small, underivatized peptide that is both active as a nuclease and sequence selective.
Autoregulation of transcription of the hupA gene in Escherichia coli: evidence for steric hindrance of the functional promoter domains induced by HU.

PubMed

Kohno, K; Yasuzawa, K; Hirose, M; Kano, Y; Goshima, N; Tanaka, H; Imamoto, F

1994-06-01

The molecular mechanism of autoregulation of expression of the hupA gene in Escherichia coli was examined. The promoter of the gene contains a palindromic sequence with the potential to form a cruciform DNA structure in which the -35 sequence lies at the base of the stem and the -10 sequence forms a single-stranded loop. An artificial promoter lacking the palindrome, which was constructed by replacing a 10 nucleotide repeat for the predicted cruciform arm by a sequence in the opposite orientation, was not subject to HU-repression. DNA relaxation induced by deleting HU proteins and/or inhibiting DNA gyrase in cells results in increased expression from the hupA promoter. We propose that initiation of transcription of the hupA gene is negatively regulated by steric hindrance of the functional promoter domains for formation of the cruciform configuration, which is facilitated at least in part by negative supercoiling of the hupA promoter DNA region. The promoter region of the hupB gene also contains a palindromic sequence that can assume a cruciform configuration. Negative regulation of this gene by HU proteins may occur by a mechanism similar to that operating for the hupA gene.
[Analysis of free foetal DNA in maternal plasma using STR loci].

PubMed

Vodicka, R; Vrtel, R; Procházka, M; Santavá, A; Dusek, L; Vrbická, D; Singh, R; Krejciríková, E; Schneiderová, E; Santavý, J

2006-01-01

Differentiating maternal and foetal genotypes in the plasma of pregnant women is generally done with real-time systems, in which specific probes are used to distinguish a particular genotype. Mostly gonosomal sequences are used to recognise a male foetus. This work describes the possibilities of detecting and quantifying free foetal DNA by STR. Artificial genotype mixtures ranging from 0.2% to 100%, simulating maternal and paternal genotypes, and 27 DNA samples from pregnant women at different stages of pregnancy were used for DNA quantification and detection. The foetal genotype was confirmed by genotyping the biological father. Detection was performed at STR loci from the Down syndrome (DS) responsible region of chromosome 21 by innovated (I) QF PCR, which can reveal and quantify even very rare DNA mosaics. STR quantification was assessed in artificial mixtures of genotypes, and the discriminability of particular genotypes was at the level of a few percent. Foetal DNA was detected in 74% of tested samples. The application of IQF PCR to quantification and differentiation between maternal and foetal genotypes at STR loci could be important in non-invasive prenatal diagnostics as another possible marker for DS risk assessment.
A Comprehensive, Automatically Updated Fungal ITS Sequence Dataset for Reference-Based Chimera Control in Environmental Sequencing Efforts.

PubMed

Nilsson, R Henrik; Tedersoo, Leho; Ryberg, Martin; Kristiansson, Erik; Hartmann, Martin; Unterseher, Martin; Porter, Teresita M; Bengtsson-Palme, Johan; Walker, Donald M; de Sousa, Filipe; Gamper, Hannes Andres; Larsson, Ellen; Larsson, Karl-Henrik; Kõljalg, Urmas; Edgar, Robert C; Abarenkov, Kessy

2015-01-01

The nuclear ribosomal internal transcribed spacer (ITS) region is the most commonly chosen genetic marker for the molecular identification of fungi in environmental sequencing and molecular ecology studies. Several analytical issues complicate such efforts, one of which is the formation of chimeric (artificially joined) DNA sequences during PCR amplification or sequence assembly. Several software tools are currently available for chimera detection, but rely to various degrees on the presence of a chimera-free reference dataset for optimal performance. However, no such dataset is available for use with the fungal ITS region. This study introduces a comprehensive, automatically updated reference dataset for fungal ITS sequences based on the UNITE database for the molecular identification of fungi. This dataset supports chimera detection throughout the fungal kingdom and for full-length ITS sequences as well as partial (ITS1 or ITS2 only) datasets. The performance of the dataset on a large set of artificial chimeras was above 99.5%, and we subsequently used the dataset to remove nearly 1,000 compromised fungal ITS sequences from public circulation. The dataset is available at http://unite.ut.ee/repository.php and is subject to web-based third-party curation.
A Comprehensive, Automatically Updated Fungal ITS Sequence Dataset for Reference-Based Chimera Control in Environmental Sequencing Efforts

PubMed Central

Nilsson, R. Henrik; Tedersoo, Leho; Ryberg, Martin; Kristiansson, Erik; Hartmann, Martin; Unterseher, Martin; Porter, Teresita M.; Bengtsson-Palme, Johan; Walker, Donald M.; de Sousa, Filipe; Gamper, Hannes Andres; Larsson, Ellen; Larsson, Karl-Henrik; Kõljalg, Urmas; Edgar, Robert C.; Abarenkov, Kessy

2015-01-01

The nuclear ribosomal internal transcribed spacer (ITS) region is the most commonly chosen genetic marker for the molecular identification of fungi in environmental sequencing and molecular ecology studies. Several analytical issues complicate such efforts, one of which is the formation of chimeric—artificially joined—DNA sequences during PCR amplification or sequence assembly. Several software tools are currently available for chimera detection, but rely to various degrees on the presence of a chimera-free reference dataset for optimal performance. However, no such dataset is available for use with the fungal ITS region. This study introduces a comprehensive, automatically updated reference dataset for fungal ITS sequences based on the UNITE database for the molecular identification of fungi. This dataset supports chimera detection throughout the fungal kingdom and for full-length ITS sequences as well as partial (ITS1 or ITS2 only) datasets. The performance of the dataset on a large set of artificial chimeras was above 99.5%, and we subsequently used the dataset to remove nearly 1,000 compromised fungal ITS sequences from public circulation. The dataset is available at http://unite.ut.ee/repository.php and is subject to web-based third-party curation. PMID:25786896
Mechanical control of Renilla luciferase.

PubMed

Tseng, Chiao-Yu; Zocchi, Giovanni

2013-08-14

We report experiments where the activity of the enzyme luciferase from Renilla reniformis is controlled through a DNA spring attached to the enzyme. In the wake of previous work on kinases, these results establish that mechanical stress applied through the DNA springs is indeed a general method for the artificial control of enzymes, and for the quantitative study of mechano-chemical coupling in these molecules. We also show proof of concept of the luciferase construct as a sensitive molecular probe, detecting a specific DNA target sequence in an easy, one-step, homogeneous assay, as well as SNP detection without melting curve analysis.
Defiant: (DMRs: easy, fast, identification and ANnoTation) identifies differentially Methylated regions from iron-deficient rat hippocampus.

PubMed

Condon, David E; Tran, Phu V; Lien, Yu-Chin; Schug, Jonathan; Georgieff, Michael K; Simmons, Rebecca A; Won, Kyoung-Jae

2018-02-05

Identification of differentially methylated regions (DMRs) is the initial step towards the study of DNA methylation-mediated gene regulation. Previous approaches to call DMRs suffer from false predictions, use extreme resources, and/or require library installation and input conversion. We developed a new approach called Defiant to identify DMRs. Employing Weighted Welch Expansion (WWE), Defiant showed superior performance to other predictors in a series of benchmarking tests on artificial and real data. Defiant was subsequently used to investigate DNA methylation changes in iron-deficient rat hippocampus. Defiant identified DMRs close to genes associated with neuronal development and plasticity, which were not identified by its competitor. Importantly, Defiant runs between 5 and 479 times faster than currently available software packages. Also, Defiant accepts 10 different input formats widely used for DNA methylation data. Defiant effectively identifies DMRs for whole-genome bisulfite sequencing (WGBS), reduced-representation bisulfite sequencing (RRBS), Tet-assisted bisulfite sequencing (TAB-seq), and HpaII tiny fragment enrichment by ligation-mediated PCR-tag (HELP) assays.
Error baseline rates of five sample preparation methods used to characterize RNA virus populations.

PubMed

Kugelman, Jeffrey R; Wiley, Michael R; Nagle, Elyse R; Reyes, Daniel; Pfeffer, Brad P; Kuhn, Jens H; Sanchez-Lockhart, Mariano; Palacios, Gustavo F

2017-01-01

Individual RNA viruses typically occur as populations of genomes that differ slightly from each other due to mutations introduced by the error-prone viral polymerase. Understanding the variability of RNA virus genome populations is critical for understanding virus evolution because individual mutant genomes may gain evolutionary selective advantages and give rise to dominant subpopulations, possibly even leading to the emergence of viruses resistant to medical countermeasures. Reverse transcription of virus genome populations followed by next-generation sequencing is the only available method to characterize variation for RNA viruses. However, both steps may lead to the introduction of artificial mutations, thereby skewing the data. To better understand how such errors are introduced during sample preparation, we determined and compared error baseline rates of five different sample preparation methods by analyzing in vitro transcribed Ebola virus RNA from an artificial plasmid-based system. These methods included: shotgun sequencing from plasmid DNA or in vitro transcribed RNA as a basic "no amplification" method, amplicon sequencing from the plasmid DNA or in vitro transcribed RNA as a "targeted" amplification method, sequence-independent single-primer amplification (SISPA) as a "random" amplification method, rolling circle reverse transcription sequencing (CirSeq) as an advanced "no amplification" method, and Illumina TruSeq RNA Access as a "targeted" enrichment method. The measured error frequencies indicate that RNA Access offers the best tradeoff between sensitivity and sample preparation error (1.4 × 10^-5) of all compared methods.
Error baseline rates of five sample preparation methods used to characterize RNA virus populations

PubMed Central

Kugelman, Jeffrey R.; Wiley, Michael R.; Nagle, Elyse R.; Reyes, Daniel; Pfeffer, Brad P.; Kuhn, Jens H.; Sanchez-Lockhart, Mariano; Palacios, Gustavo F.

2017-01-01

Individual RNA viruses typically occur as populations of genomes that differ slightly from each other due to mutations introduced by the error-prone viral polymerase. Understanding the variability of RNA virus genome populations is critical for understanding virus evolution because individual mutant genomes may gain evolutionary selective advantages and give rise to dominant subpopulations, possibly even leading to the emergence of viruses resistant to medical countermeasures. Reverse transcription of virus genome populations followed by next-generation sequencing is the only available method to characterize variation for RNA viruses. However, both steps may lead to the introduction of artificial mutations, thereby skewing the data. To better understand how such errors are introduced during sample preparation, we determined and compared error baseline rates of five different sample preparation methods by analyzing in vitro transcribed Ebola virus RNA from an artificial plasmid-based system. These methods included: shotgun sequencing from plasmid DNA or in vitro transcribed RNA as a basic “no amplification” method, amplicon sequencing from the plasmid DNA or in vitro transcribed RNA as a “targeted” amplification method, sequence-independent single-primer amplification (SISPA) as a “random” amplification method, rolling circle reverse transcription sequencing (CirSeq) as an advanced “no amplification” method, and Illumina TruSeq RNA Access as a “targeted” enrichment method. The measured error frequencies indicate that RNA Access offers the best tradeoff between sensitivity and sample preparation error (1.4 × 10^-5) of all compared methods. PMID:28182717
Time series data analysis using DFA

NASA Astrophysics Data System (ADS)

Okumoto, A.; Akiyama, T.; Sekino, H.; Sumi, T.

2014-02-01

Detrended fluctuation analysis (DFA) was originally developed for the evaluation of DNA sequence and interval for heart rate variability (HRV), but it is now used to obtain various biological information. In this study we perform DFA on artificially generated data where we already know the relationship between signal and the physical event causing the signal. We generate artificial data using molecular dynamics. The Brownian motion of a polymer under an external force is investigated. In order to generate artificial fluctuation in the physical properties, we introduce obstacle pillars fixed to nanostructures. Using different conditions such as presence or absence of obstacles, external field, and the polymer length, we perform DFA on energies and positions of the polymer.
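As a rough illustration of the DFA procedure named in this abstract, the sketch below integrates a series, removes a linear trend in windows of several lengths, and fits the slope of log F(n) against log n. It is a minimal first-order DFA written for this listing, not the authors' implementation; the function names, the window lengths, and the synthetic test signals (white noise and its cumulative sum) are all assumptions.

```python
import math
import random


def detrended_mean_square(segment):
    """Mean squared residual of a segment after removing its least-squares line."""
    n = len(segment)
    t_mean = (n - 1) / 2
    y_mean = sum(segment) / n
    s_tt = sum((t - t_mean) ** 2 for t in range(n))
    s_ty = sum((t - t_mean) * (y - y_mean) for t, y in enumerate(segment))
    slope = s_ty / s_tt
    return sum(
        (y - y_mean - slope * (t - t_mean)) ** 2 for t, y in enumerate(segment)
    ) / n


def dfa_fluctuations(signal, scales):
    """Return the DFA fluctuation F(n) for each window length n in scales."""
    mean = sum(signal) / len(signal)
    profile, running = [], 0.0
    for value in signal:  # integrate the mean-centred series
        running += value - mean
        profile.append(running)
    fluctuations = []
    for n in scales:
        # non-overlapping windows of length n; a trailing remainder is dropped
        windows = [profile[i:i + n] for i in range(0, len(profile) - n + 1, n)]
        mean_squares = [detrended_mean_square(w) for w in windows]
        fluctuations.append(math.sqrt(sum(mean_squares) / len(mean_squares)))
    return fluctuations


def scaling_exponent(signal, scales):
    """Slope of log F(n) against log n, i.e. the DFA exponent alpha."""
    xs = [math.log(n) for n in scales]
    ys = [math.log(f) for f in dfa_fluctuations(signal, scales)]
    x_mean, y_mean = sum(xs) / len(xs), sum(ys) / len(ys)
    num = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    den = sum((x - x_mean) ** 2 for x in xs)
    return num / den


if __name__ == "__main__":
    random.seed(0)
    noise = [random.gauss(0.0, 1.0) for _ in range(4096)]  # synthetic input
    print(scaling_exponent(noise, [16, 32, 64, 128, 256]))
```

Uncorrelated noise is expected to give an exponent near 0.5 and its cumulative sum (a random walk) a value near 1.5, which is a convenient sanity check before applying the same code to simulated energies or positions.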
Replication of alpha-satellite DNA arrays in endogenous human centromeric regions and in human artificial chromosome

PubMed Central

Erliandri, Indri; Fu, Haiqing; Nakano, Megumi; Kim, Jung-Hyun; Miga, Karen H.; Liskovykh, Mikhail; Earnshaw, William C.; Masumoto, Hiroshi; Kouprina, Natalay; Aladjem, Mirit I.; Larionov, Vladimir

2014-01-01

In human chromosomes, centromeric regions comprise megabase-size arrays of 171 bp alpha-satellite DNA monomers. The large distances spanned by these arrays preclude their replication from external sites and imply that the repetitive monomers contain replication origins. However, replication within these arrays has not previously been profiled and the role of alpha-satellite DNA in initiation of DNA replication has not yet been demonstrated. Here, replication of alpha-satellite DNA in endogenous human centromeric regions and in de novo formed Human Artificial Chromosome (HAC) was analyzed. We showed that alpha-satellite monomers could function as origins of DNA replication and that replication of alphoid arrays organized into centrochromatin occurred earlier than those organized into heterochromatin. The distribution of inter-origin distances within centromeric alphoid arrays was comparable to the distribution of inter-origin distances on randomly selected non-centromeric chromosomal regions. Depletion of CENP-B, a kinetochore protein that binds directly to a 17 bp CENP-B box motif common to alpha-satellite DNA, resulted in enrichment of alpha-satellite sequences for proteins of the ORC complex, suggesting that CENP-B may have a role in regulating the replication of centromeric regions. Mapping of replication initiation sites in the HAC revealed that replication preferentially initiated in transcriptionally active regions. PMID:25228468
[Characterization of a bacterial biocontrol strain 1404 and its efficacy in controlling postharvest citrus anthracnose].

PubMed

Wang, Qian; Hu, Chunjin; Ke, Fanggang; Huang, Siliang; Li, Qiqin

2010-09-01

Anthracnose caused by Colletotrichum gloeosporioides (Penz.) Sacc. is a major disease in citrus production. To develop an effective biocontrol measure against citrus postharvest anthracnose, we screened antagonistic microbes and obtained a bacterial strain 1404 from the rhizospheric soil of chili plants in Nanning city, Guangxi, China. The objectives of the present study were to: (1) identify and characterize the antagonistic bacterium; and (2) evaluate the efficacy of the antagonistic strain in controlling citrus postharvest anthracnose disease. Strain 1404 was identified by comparing its 16S rDNA sequence with related bacteria from the GenBank database, as well as analyzing its morphological, physiological and biochemical characters. The antagonistic stability of strain 1404 was determined by continuously transferring it on artificial media. The effect of the strain on suppressing citrus anthracnose at postharvest stage was tested by the stab inoculation method. The 16S rDNA of strain 1404 was amplified with primers PF1 (5'-AGAGTTTGATCATGGCTCAG-3') and PR1 (5'-TACGGTTACCTTGTTACGACTT-3') and its sequence submitted to GenBank (accession number: GU361113). Strain 1404 clustered with the GenBank-derived Brevibacillus brevis strains in the 16S-rDNA-sequence-based phylogenetic tree at 100% bootstrap level. The morphological traits and the physiological and biochemical characters of strain 1404 agreed with those of Brevibacillus brevis. Little change in the suppressive ability of the antagonist against growth of Colletotrichum gloeosporioides was observed during four continuous transfers on artificial media. The average control efficacy of the strain was 64.9% against the disease 20 days after the antagonist application. Strain 1404 was identified as Brevibacillus brevis based on its morphological traits, physiological and biochemical characters as well as 16S rDNA sequence analysis. The antagonist proved to be a promising biocontrol agent. This is the first report of Brevibacillus brevis as an effective antagonist against citrus postharvest anthracnose disease.
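The two primer sequences quoted in this abstract can be checked with a few lines of code. The sketch below computes length, GC fraction, reverse complement, and a rule-of-thumb melting temperature; the Wallace rule (2 °C per A/T, 4 °C per G/C) is a coarse approximation chosen here for illustration and is not a method stated in the abstract, and the function names are hypothetical.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

# Primer sequences as given in the abstract (5' to 3').
PF1 = "AGAGTTTGATCATGGCTCAG"
PR1 = "TACGGTTACCTTGTTACGACTT"


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq):
    """Rough melting temperature in deg C: 2 per A/T plus 4 per G/C (assumed rule)."""
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


if __name__ == "__main__":
    for name, primer in (("PF1", PF1), ("PR1", PR1)):
        print(name, len(primer), round(gc_fraction(primer), 3),
              wallace_tm(primer), reverse_complement(primer))
```

Nearest-neighbour thermodynamic models give more reliable melting temperatures for primers of this length; the simple rule is used only to keep the example short.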
Massively parallel sequencing of 17 commonly used forensic autosomal STRs and amelogenin with small amplicons.

PubMed

Kim, Eun Hye; Lee, Hwan Young; Yang, In Seok; Jung, Sang-Eun; Yang, Woo Ick; Shin, Kyoung-Jin

2016-05-01

The next-generation sequencing (NGS) method has been utilized to analyze short tandem repeat (STR) markers, which are routinely used for human identification purposes in the forensic field. Some researchers have demonstrated the successful application of the NGS system to STR typing, suggesting that NGS technology may be an alternative or additional method to overcome limitations of capillary electrophoresis (CE)-based STR profiling. However, there has been no available multiplex PCR system that is optimized for NGS analysis of forensic STR markers. Thus, we constructed a multiplex PCR system for the NGS analysis of 18 markers (13 CODIS STRs, D2S1338, D19S433, Penta D, Penta E and amelogenin) by designing amplicons in the size range of 77-210 base pairs. Then, PCR products were generated from two single-source samples, mixed samples and artificially degraded DNA samples using the multiplex PCR system, and were prepared for sequencing on the MiSeq system through construction of a subsequent barcoded library. By performing NGS and analyzing the data, we confirmed that the resultant STR genotypes were consistent with those of CE-based typing. Moreover, sequence variations were detected in targeted STR regions. Through the use of small-sized amplicons, the developed multiplex PCR system enables researchers to obtain successful STR profiles even from artificially degraded DNA as well as STR loci which are analyzed with large-sized amplicons in the CE-based commercial kits. In addition, successful profiles can be obtained from mixtures up to a 1:19 ratio. Consequently, the developed multiplex PCR system, which produces small size amplicons, can be successfully applied to STR NGS analysis of forensic casework samples such as mixtures and degraded DNA samples.
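Sequence-based STR typing ultimately rests on counting repeat units in each read. The sketch below shows one simple way to do that with a regular expression; the motif, the example read, and the function name are hypothetical, and real pipelines also handle flanking-sequence matching, compound repeats, and sequencing errors, none of which is attempted here.

```python
import re


def longest_repeat_run(read, motif):
    """Return the largest number of consecutive copies of motif found in read."""
    pattern = re.compile(f"(?:{re.escape(motif)})+")
    runs = [len(match.group(0)) // len(motif) for match in pattern.finditer(read)]
    return max(runs, default=0)


if __name__ == "__main__":
    # Hypothetical read containing a four-base repeat between flanking bases.
    example_read = "GGAATGAATGAATGCC"
    print(longest_repeat_run(example_read, "AATG"))
```

Because the full repeat sequence is available, two alleles with the same length but different internal sequence can be told apart, which is the kind of variation the abstract reports finding in the targeted STR regions.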
Controlling gene networks and cell fate with precision-targeted DNA-binding proteins and small-molecule-based genome readers

PubMed Central

Eguchi, Asuka; Lee, Garrett O.; Wan, Fang; Erwin, Graham S.; Ansari, Aseem Z.

2014-01-01

Transcription factors control the fate of a cell by regulating the expression of genes and regulatory networks. Recent successes in inducing pluripotency in terminally differentiated cells as well as directing differentiation with natural transcription factors have lent credence to the efforts that aim to direct cell fate with rationally designed transcription factors. Because DNA-binding factors are modular in design, they can be engineered to target specific genomic sequences and perform pre-programmed regulatory functions upon binding. Such precision-tailored factors can serve as molecular tools to reprogramme or differentiate cells in a targeted manner. Using different types of engineered DNA binders, both regulatory transcriptional controls of gene networks, as well as permanent alteration of genomic content, can be implemented to study cell fate decisions. In the present review, we describe the current state of the art in artificial transcription factor design and the exciting prospect of employing artificial DNA-binding factors to manipulate the transcriptional networks as well as epigenetic landscapes that govern cell fate. PMID:25145439
Graphical classification of DNA sequences of HLA alleles by deep learning.

PubMed

Miyake, Jun; Kaneshita, Yuhei; Asatani, Satoshi; Tagawa, Seiichi; Niioka, Hirohiko; Hirano, Takashi

2018-04-01

Alleles of human leukocyte antigen (HLA)-A DNAs are classified and expressed graphically by using artificial intelligence "Deep Learning (Stacked autoencoder)". Nucleotide sequence data corresponding to the length of 822 bp, collected from the Immuno Polymorphism Database, were compressed to 2-dimensional representation and were plotted. Profiles of the two-dimensional plots indicate that the alleles can be classified as clusters are formed. The two-dimensional plot of HLA-A DNAs gives a clear outlook for characterizing the various alleles.
OligArch: A software tool to allow artificially expanded genetic information systems (AEGIS) to guide the autonomous self-assembly of long DNA constructs from multiple DNA single strands.

PubMed

Bradley, Kevin M; Benner, Steven A

2014-01-01

Synthetic biologists wishing to self-assemble large DNA (L-DNA) constructs from small DNA fragments made by automated synthesis need fragments that hybridize predictably. Such predictability is difficult to obtain with oligonucleotides built from just the four standard nucleotides. Natural DNA's peculiar combination of strong and weak G:C and A:T pairs, the context-dependence of the strengths of those pairs, unimolecular strand folding that competes with desired interstrand hybridization, and non-Watson-Crick interactions available to standard DNA, all contribute to this unpredictability. In principle, adding extra nucleotides to the genetic alphabet can improve the predictability and reliability of autonomous DNA self-assembly, simply by increasing the information density of oligonucleotide sequences. These extra nucleotides are now available as parts of artificially expanded genetic information systems (AEGIS), and tools are now available to generate entirely standard DNA from AEGIS DNA during PCR amplification. Here, we describe the OligArch (for "oligonucleotide architecting") software, an application that permits synthetic biologists to engineer optimally self-assembling DNA constructs from both six- and eight-letter AEGIS alphabets. This software has been used to design oligonucleotides that self-assemble to form complete genes from 20 or more single-stranded synthetic oligonucleotides. OligArch is therefore a key element of a scalable and integrated infrastructure for the rapid and designed engineering of biology.
A second generation integrated map of the rainbow trout (Oncorhynchus mykiss) genome: analysis of synteny with model fish genomes

USDA-ARS's Scientific Manuscript database

In this paper we generated DNA fingerprints and end sequences from bacterial artificial chromosomes (BACs) from two new libraries to improve the first generation integrated physical and genetic map of the rainbow trout (Oncorhynchus mykiss) genome. The current version of the physical map is compose...

Taming the Past: Ancient DNA and the Study of Animal Domestication.

PubMed

MacHugh, David E; Larson, Greger; Orlando, Ludovic

2017-02-08

During the last decade, ancient DNA research has been revolutionized by the availability of increasingly powerful DNA sequencing and ancillary genomics technologies, giving rise to the new field of paleogenomics. In this review, we show how our understanding of the genetic basis of animal domestication and the origins and dispersal of livestock and companion animals during the Upper Paleolithic and Neolithic periods is being rapidly transformed through new scientific knowledge generated with paleogenomic methods. These techniques have been particularly informative in revealing high-resolution patterns of artificial and natural selection and evidence for significant admixture between early domestic animal populations and their wild congeners.
Sequence selective capture, release and analysis of DNA using a magnetic microbead-assisted toehold-mediated DNA strand displacement reaction.

PubMed

Khodakov, Dmitriy A; Khodakova, Anastasia S; Linacre, Adrian; Ellis, Amanda V

2014-07-21

This paper reports on the modification of magnetic beads with oligonucleotide capture probes with a specially designed pendant toehold (overhang) aimed specifically to capture double-stranded PCR products. After capture, the PCR products were selectively released from the magnetic beads by means of a toehold-mediated strand displacement reaction using short artificial oligonucleotide triggers and analysed using capillary electrophoresis. The approach was successfully shown on two genes widely used in human DNA genotyping, namely human c-fms (macrophage colony-stimulating factor) proto-oncogene for the CSF-1 receptor (CSF1PO) and amelogenin.
Identification of unique repeated patterns, location of mutation in DNA finger printing using artificial intelligence technique.

PubMed

Mukunthan, B; Nagaveni, N

2014-01-01

In genetic engineering, the conventional techniques and algorithms employed by forensic scientists to identify individuals on the basis of their DNA profiles involve complex computational steps and mathematical formulae, and identifying the location of a mutation in a genomic sequence in the laboratory is still an exigent task. This novel approach makes it possible to solve problems that do not have an algorithmic solution or whose available solutions are too complex to be found. The blend of bioinformatics and neural network techniques results in an efficient DNA pattern analysis algorithm with utmost prediction accuracy.
Construction of a yeast artificial chromosome contig encompassing the chromosome 14 Alzheimer's disease locus

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sharma, V.; Bonnycastle, L.; Poorkai, P.

1994-09-01

We have constructed a yeast artificial chromosome (YAC) contig of chromosome 14q24.3 which encompasses the chromosome 14 Alzheimer's disease locus (AD3). Determined by linkage analysis of early-onset Alzheimer's disease kindreds, this interval is bounded by the genetic markers D14S61-D14S63 and spans approximately 15 centimorgans. The contig consists of 29 markers and 74 YACs of which 57 are defined by one or more sequence tagged sites (STSs). The STS markers comprise 5 genes, 16 short tandem repeat polymorphisms and 8 cDNA clones. An additional number of genes, expressed sequence tags and cDNA fragments have been identified and localized to the contig by hybridization and sequence analysis of anonymous clones isolated by cDNA direct selection techniques. A minimal contig of about 15 YACs averaging 0.5-1.5 megabase in length will span this interval and is, at first approximation, in rough agreement with the genetic map. For two regions of the contig, our coverage has relied on L1/THE fingerprint and Alu-PCR hybridization data of YACs provided by CEPH/Genethon. We are currently developing sequence tagged sites from these to confirm the overlaps revealed by the fingerprint data. Among the genes which map to the contig are transforming growth factor beta 3, c-fos, and heat shock protein 2A (HSPA2). C-fos is not a candidate gene for AD3 based on the sequence analysis of affected and unaffected individuals. HSPA2 maps to the proximal edge of the contig and Calmodulin 1, a candidate gene from 4q24.3, maps outside of the region. The YAC contig is a framework physical map from which cosmid or P1 clone contigs can be constructed. As more genes and cDNAs are mapped, a highly resolved transcription map will emerge, a necessary step towards positionally cloning the AD3 gene.
EdiPy: a resource to simulate the evolution of plant mitochondrial genes under the RNA editing.

PubMed

Picardi, Ernesto; Quagliariello, Carla

2006-02-01

EdiPy is an online resource appropriately designed to simulate the evolution of plant mitochondrial genes in a biologically realistic fashion. EdiPy takes into account the presence of sites subjected to RNA editing and provides multiple artificial alignments corresponding to both genomic and cDNA sequences. Each artificial data set can successively be submitted to main and widespread evolutionary and phylogenetic software packages such as PAUP, Phyml, PAML and Phylip. As an online bioinformatic resource, EdiPy is available at the following web page: http://biologia.unical.it/py_script/index.html.
A Universal Method for Species Identification of Mammals Utilizing Next Generation Sequencing for the Analysis of DNA Mixtures

PubMed Central

Tillmar, Andreas O.; Dell'Amico, Barbara; Welander, Jenny; Holmlund, Gunilla

2013-01-01

Species identification is of interest in a wide range of areas, for example, in forensic applications, food monitoring and archeology. The vast majority of existing DNA typing methods developed for species determination focus on a single species source. There are, however, many instances where all species from mixed sources need to be determined, even when the minority species constitutes less than 1% of the sample. The introduction of next generation sequencing opens new possibilities for such challenging samples. In this study we present a universal deep sequencing method using 454 GS Junior sequencing of a target on the mitochondrial gene 16S rRNA. The method was designed through phylogenetic analyses of DNA reference sequences from more than 300 mammal species. Experiments were performed on artificial species-species mixture samples in order to verify the method’s robustness and its ability to detect all species within a mixture. The method was also tested on samples from authentic forensic casework. The results were promising: the method discriminated over 99.9% of mammal species, detected multiple donors within a mixture, and detected minor components as low as 1% of a mixed sample. PMID:24358309
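Once reads have been assigned to species, reporting minor components reduces to comparing each species' share of the reads with a detection threshold. The sketch below illustrates that step only; the read counts are invented for illustration, the 1% default simply mirrors the figure quoted in the abstract, and the function names are hypothetical rather than taken from the study.

```python
def species_fractions(read_counts):
    """Convert per-species read counts into fractions of all assigned reads."""
    total = sum(read_counts.values())
    return {species: count / total for species, count in read_counts.items()}


def detect_species(read_counts, min_fraction=0.01):
    """List species whose read share meets the threshold, largest share first."""
    fractions = species_fractions(read_counts)
    kept = [s for s, f in fractions.items() if f >= min_fraction]
    return sorted(kept, key=lambda s: fractions[s], reverse=True)


if __name__ == "__main__":
    # Hypothetical counts of reads assigned to each species in one mixture.
    counts = {"Bos taurus": 9800, "Sus scrofa": 150, "Equus caballus": 50}
    print(detect_species(counts))
```

In practice the threshold has to be set above the background of misassigned and erroneous reads, so a value like this would be validated on known mixtures rather than chosen in advance.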
Targeting vector construction through recombineering.

PubMed

Malureanu, Liviu A

2011-01-01

Gene targeting in mouse embryonic stem cells is an essential, yet still very expensive and highly time-consuming, tool and method to study gene function at the organismal level or to create mouse models of human diseases. Conventional cloning-based methods have been largely used for generating targeting vectors, but are hampered by a number of limiting factors, including the variety and location of restriction enzymes in the gene locus of interest, the specific PCR amplification of repetitive DNA sequences, and cloning of large DNA fragments. Recombineering is a technique that exploits the highly efficient homologous recombination function encoded by λ phage in Escherichia coli. Bacteriophage-based recombination can recombine homologous sequences as short as 30-50 bases, allowing manipulations such as insertion, deletion, or mutation of virtually any genomic region. The large availability of mouse genomic bacterial artificial chromosome (BAC) libraries covering most of the genome facilitates the retrieval of genomic DNA sequences from the bacterial chromosomes through recombineering. This chapter describes a successfully applied protocol and aims to be a detailed guide through the steps of generation of targeting vectors through recombineering.
DNA G-Wire Formation Using an Artificial Peptide is Controlled by Protease Activity.

PubMed

Usui, Kenji; Okada, Arisa; Sakashita, Shungo; Shimooka, Masayuki; Tsuruoka, Takaaki; Nakano, Shu-Ichi; Miyoshi, Daisuke; Mashima, Tsukasa; Katahira, Masato; Hamada, Yoshio

2017-11-16

The development of a switching system for guanine nanowire (G-wire) formation by external signals is important for nanobiotechnological applications. Here, we demonstrate a DNA nanostructural switch (G-wire <--> particles) using a designed peptide and a protease. The peptide consists of a PNA sequence for inducing DNA to form DNA-PNA hybrid G-quadruplex structures, and a protease substrate sequence acting as a switching module that is dependent on the activity of a particular protease. Micro-scale analyses via TEM and AFM showed that G-rich DNA alone forms G-wires in the presence of Ca2+, and that the peptide disrupted this formation, resulting in the formation of particles. The addition of the protease and digestion of the peptide regenerated the G-wires. Macro-scale analyses by DLS, zeta potential, CD, and gel filtration were in agreement with the microscopic observations. These results imply that the secondary structure change (DNA G-quadruplex <--> DNA/PNA hybrid structure) induces a change in the well-formed nanostructure (G-wire <--> particles). Our findings demonstrate a control system for forming DNA G-wire structures dependent on protease activity using designed peptides. Such systems hold promise for regulating the formation of nanowire for various applications, including electronic circuits for use in nanobiotechnologies.
Lawyers' delights and geneticists' nightmares: at forty, the double helix shows some wrinkles.

PubMed

Sgaramella, V

1993-12-15

The National Institutes of Health (NIH) request to patent the base sequences of incomplete and uncharacterized fragments of DNA copied on messenger RNAs (cDNAs) extracted from human tissues, the refusal by the patent office, and the appeal placed by NIH, have incited a violent controversy, fueled by rational, as well as emotional elements. In a compromising mode between liberalism and protectionism, I propose that legal protection be considered only for those RNA/DNA sequences, either natural or artificial, which can generate practical applications per se, and not through their expression products. Another controversy is developing around a popular tool for genomic research: the fidelity of yeast artificial chromosome (YAC) libraries being distributed worldwide for physical mapping is being questioned. Some of these libraries have been shown to be affected by surprisingly high levels of co-cloning, in addition to more common gene reshuffling instances. Also in this case, scientific as well as non-scientific components have to be considered. Possible remedies for the underlying problems may be found in the proper use of kinetic, enzymatic and microbiological variables in the production of YACs. Here too, a sharper distinction between the secular and scientific gratifications of research could help.
Facile Recovery of Individual High-Molecular-Weight, Low-Copy-Number Natural Plasmids for Genomic Sequencing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Williams, L.E.; Detter, C.; Barrie, K.

2006-06-01

Sequencing of the large (>50 kb), low-copy-number (<5 per cell) plasmids that mediate horizontal gene transfer has been hindered by the difficulty and expense of isolating DNA from individual plasmids of this class. We report here that a kit method previously devised for purification of bacterial artificial chromosomes (BACs) can be adapted for effective preparation of individual plasmids up to 220 kb from wild gram-negative and gram-positive bacteria. Individual plasmid DNA recovered from less than 10 ml of Escherichia coli, Staphylococcus, and Corynebacterium cultures was of sufficient quantity and quality for construction of high-coverage libraries, as shown by sequencing five native plasmids ranging in size from 30 kb to 94 kb. We also report recommendations for vector screening to optimize plasmid sequence assembly, preliminary annotation of novel plasmid genomes, and insights on mobile genetic element biology derived from these sequences. Adaptation of this BAC method for large plasmid isolation removes one major technical hurdle to expanding our knowledge of the natural plasmid gene pool.
Clinical sequencing in leukemia with the assistance of artificial intelligence.

PubMed

Tojo, Arinobu

2017-01-01

Next generation sequencing (NGS) of cancer genomes is now becoming a prerequisite for accurate diagnosis and proper treatment in clinical oncology. Because the genomic regions for NGS expand from a certain set of genes to the whole exome or whole genome, the resulting sequence data becomes incredibly enormous and makes it quite laborious to translate the genomic data into medicine, so-called annotation and curation. We organized a clinical sequencing team and established a bidirectional (bed-to-bench and bench-to-bed) system to integrate clinical and genomic data for hematological malignancies. We also started a collaborative research project with IBM Japan to adopt the artificial intelligence Watson for Genomics (WfG) to the pipeline of medical informatics. Genomic DNA was prepared from malignant as well as normal tissues in each patient and subjected to NGS. Sequence data was analyzed using an in-house semi-automated pipeline in combination with WfG, which was used to identify candidate driver mutations and relevant pathways from which applicable drug information was deduced. Currently, we have analyzed more than 150 patients with hematological disorders, including AML and ALL, and obtained many informative findings. In this presentation, I will introduce some of the achievements we have made so far.
Characterization of Three Maize Bacterial Artificial Chromosome Libraries toward Anchoring of the Physical Map to the Genetic Map Using High-Density Bacterial Artificial Chromosome Filter Hybridization

PubMed Central

Yim, Young-Sun; Davis, Georgia L.; Duru, Ngozi A.; Musket, Theresa A.; Linton, Eric W.; Messing, Joachim W.; McMullen, Michael D.; Soderlund, Carol A.; Polacco, Mary L.; Gardiner, Jack M.; Coe, Edward H.

2002-01-01

Three maize (Zea mays) bacterial artificial chromosome (BAC) libraries were constructed from inbred line B73. High-density filter sets from all three libraries, made using different restriction enzymes (HindIII, EcoRI, and MboI, respectively), were evaluated with a set of complex probes including the 185-bp knob repeat, ribosomal DNA, two telomere-associated repeat sequences, four centromere repeats, the mitochondrial genome, a multifragment chloroplast DNA probe, and bacteriophage λ. The results indicate that the libraries are of high quality with low contamination by organellar and λ-sequences. The use of libraries from multiple enzymes increased the chance of recovering each region of the genome. Ninety maize restriction fragment-length polymorphism core markers were hybridized to filters of the HindIII library, representing 6× coverage of the genome, to initiate development of a framework for anchoring BAC contigs to the intermated B73 × Mo17 genetic map and to mark the bin boundaries on the physical map. All of the clones used as hybridization probes detected at least three BACs. Twenty-two single-copy number core markers identified an average of 7.4 ± 3.3 positive clones, consistent with the expectation of six clones. This information is integrated into fingerprinting data generated by the Arizona Genomics Institute to assemble the BAC contigs using fingerprint contig and contributed to the process of physical map construction. PMID:12481051
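The link between 6× coverage and an expectation of six positive clones per single-copy marker can be made explicit with a simple model. The sketch below assumes that clones covering a given locus follow a Poisson distribution with mean equal to the library coverage; that model is an assumption introduced here for illustration and is not stated in the abstract, and the function names are hypothetical.

```python
import math


def poisson_pmf(k, mean):
    """Probability of exactly k covering clones under a Poisson model."""
    return math.exp(-mean) * mean ** k / math.factorial(k)


def prob_locus_missed(coverage):
    """Probability that no clone in the library covers a given locus."""
    return poisson_pmf(0, coverage)


def prob_at_least(k, coverage):
    """Probability that a locus is covered by k or more clones."""
    return 1.0 - sum(poisson_pmf(i, coverage) for i in range(k))


if __name__ == "__main__":
    coverage = 6  # genome equivalents in the HindIII library, from the abstract
    print(prob_locus_missed(coverage))
    print(prob_at_least(3, coverage))
```

Under this assumed model the expected number of positive clones per single-copy probe equals the coverage, which is the figure against which the observed mean of 7.4 ± 3.3 is compared.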
Targeting of >1.5 Mb of Human DNA into the Mouse X Chromosome Reveals Presence of cis-Acting Regulators of Epigenetic Silencing

PubMed Central

Yang, Christine; McLeod, Andrea J.; Cotton, Allison M.; de Leeuw, Charles N.; Laprise, Stéphanie; Banks, Kathleen G.; Simpson, Elizabeth M.; Brown, Carolyn J.

2012-01-01

Regulatory sequences can influence the expression of flanking genes over long distances, and X chromosome inactivation is a classic example of cis-acting epigenetic gene regulation. Knock-ins directed to the Mus musculus Hprt locus offer a unique opportunity to analyze the spread of silencing into different human DNA sequences in the identical genomic environment. X chromosome inactivation of four knock-in constructs, including bacterial artificial chromosome (BAC) integrations of over 195 kb, was demonstrated by both the lack of expression from the inactive X chromosome in females with nonrandom X chromosome inactivation and promoter DNA methylation of the human transgene in females. We further utilized promoter DNA methylation to assess the inactivation status of 74 human reporter constructs comprising >1.5 Mb of DNA. Of the 47 genes examined, only the PHB gene showed female DNA hypomethylation approaching the level seen in males, and escape from X chromosome inactivation was verified by demonstration of expression from the inactive X chromosome. Integration of PHB resulted in lower DNA methylation of the flanking HPRT promoter in females, suggesting the action of a dominant cis-acting escape element. Female-specific DNA hypermethylation of CpG islands not associated with promoters implies a widespread imposition of DNA methylation during X chromosome inactivation; yet transgenes demonstrated differential capacities to accumulate DNA methylation when integrated into the identical location on the inactive X chromosome, suggesting additional cis-acting sequence effects. As only one of the human transgenes analyzed escaped X chromosome inactivation, we conclude that elements permitting ongoing expression from the inactive X are rare in the human genome. PMID:23023002
DNA-Catalyzed Amide Hydrolysis.

PubMed

Zhou, Cong; Avins, Joshua L; Klauser, Paul C; Brandsen, Benjamin M; Lee, Yujeong; Silverman, Scott K

2016-02-24

DNA catalysts (deoxyribozymes) for a variety of reactions have been identified by in vitro selection. However, for certain reactions this identification has not been achieved. One important example is DNA-catalyzed amide hydrolysis, for which a previous selection experiment instead led to DNA-catalyzed DNA phosphodiester hydrolysis. Subsequent efforts in which the selection strategy deliberately avoided phosphodiester hydrolysis led to DNA-catalyzed ester and aromatic amide hydrolysis, but aliphatic amide hydrolysis has been elusive. In the present study, we show that including modified nucleotides that bear protein-like functional groups (any one of primary amino, carboxyl, or primary hydroxyl) enables identification of amide-hydrolyzing deoxyribozymes. In one case, the same deoxyribozyme sequence without the modifications still retains substantial catalytic activity. Overall, these findings establish the utility of introducing protein-like functional groups into deoxyribozymes for identifying new catalytic function. The results also suggest the longer-term feasibility of deoxyribozymes as artificial proteases.
Whole genome nucleosome sequencing identifies novel types of forensic markers in degraded DNA samples

PubMed Central

Dong, Chun-nan; Yang, Ya-dong; Li, Shu-jin; Yang, Ya-ran; Zhang, Xiao-jing; Fang, Xiang-dong; Yan, Jiang-wei; Cong, Bin

2016-01-01

In the case of mass disasters, missing persons and forensic casework, highly degraded biological samples are often encountered. It can be a challenge to analyze and interpret the DNA profiles from these samples. Here we provide a new strategy to solve the problem by taking advantage of the intrinsic structural properties of DNA. We have assessed the in vivo positions of more than 35 million putative nucleosome cores in human leukocytes using high-throughput whole genome sequencing, and identified 2,462 single nucleotide variations (SNVs) and 128 insertion-deletion polymorphisms (indels). After comparing the sequence reads with 44 STR loci commonly used in forensics, five STRs (TH01, TPOX, D18S51, DYS391, and D10S1248) were matched. We compared these “nucleosome protected STRs” (NPSTRs) with five other non-NPSTRs using mini-STR primer design, real-time PCR, and capillary gel electrophoresis on artificially degraded DNA. Moreover, genotyping performance of the five NPSTRs and five non-NPSTRs was also tested with real casework samples. All results show that loci located in nucleosomes are more likely to be successfully genotyped in degraded samples. In conclusion, after further strict validation, these markers could be incorporated into future forensic and paleontology identification kits, resulting in higher discriminatory power for certain degraded sample types. PMID:27189082
An Improved Method for TAL Effectors DNA-Binding Sites Prediction Reveals Functional Convergence in TAL Repertoires of Xanthomonas oryzae Strains

PubMed Central

Pérez-Quintero, Alvaro L.; Rodriguez-R, Luis M.; Dereeper, Alexis; López, Camilo; Koebnik, Ralf; Szurek, Boris; Cunnac, Sebastien

2013-01-01

Transcription Activators-Like Effectors (TALEs) belong to a family of virulence proteins from the Xanthomonas genus of bacterial plant pathogens that are translocated into the plant cell. In the nucleus, TALEs act as transcription factors inducing the expression of susceptibility genes. A code for TALE-DNA binding specificity and high-resolution three-dimensional structures of TALE-DNA complexes were recently reported. Accurate prediction of TAL Effector Binding Elements (EBEs) is essential to elucidate the biological functions of the many sequenced TALEs as well as for robust design of artificial TALE DNA-binding domains in biotechnological applications. In this work a program with improved EBE prediction performances was developed using an updated specificity matrix and a position weight correction function to account for the matching pattern observed in a validation set of TALE-DNA interactions. To gain a systems perspective on the large TALE repertoires from X. oryzae strains, this program was used to predict rice gene targets for 99 sequenced family members. Integrating predictions and available expression data in a TALE-gene network revealed multiple candidate transcriptional targets for many TALEs as well as several possible instances of functional convergence among TALEs. PMID:23869221
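The scoring idea in this abstract (a specificity matrix combined with a position weight correction) can be sketched as follows. This is only an illustration: the probabilities in `RVD_SPECIFICITY` are made-up placeholder values rather than the paper's updated matrix, and `position_weight` is a hypothetical linear decay, not the correction function the authors fitted.

```python
import math

# Placeholder RVD -> nucleotide preferences (assumed values, NOT the published matrix).
RVD_SPECIFICITY = {
    "NI": {"A": 0.85, "C": 0.05, "G": 0.05, "T": 0.05},
    "HD": {"A": 0.05, "C": 0.85, "G": 0.05, "T": 0.05},
    "NG": {"A": 0.05, "C": 0.05, "G": 0.05, "T": 0.85},
    "NN": {"A": 0.35, "C": 0.05, "G": 0.55, "T": 0.05},
}


def position_weight(index, n_repeats, floor=0.5):
    """Hypothetical correction: weight falls linearly from 1.0 (first repeat) to `floor` (last)."""
    if n_repeats == 1:
        return 1.0
    return 1.0 - (1.0 - floor) * index / (n_repeats - 1)


def score_site(rvds, site):
    """Weighted log-probability that the RVD array binds `site` (higher is better)."""
    if len(rvds) != len(site):
        raise ValueError("site length must equal the number of RVDs")
    n = len(rvds)
    return sum(
        position_weight(i, n) * math.log(RVD_SPECIFICITY[rvd][base])
        for i, (rvd, base) in enumerate(zip(rvds, site))
    )


def best_site(rvds, sequence):
    """Slide the RVD array along `sequence`; return (start, score) of the best window."""
    n = len(rvds)
    if len(sequence) < n:
        raise ValueError("sequence is shorter than the RVD array")
    scored = [(score_site(rvds, sequence[i:i + n]), i) for i in range(len(sequence) - n + 1)]
    score, start = max(scored)
    return start, score
```

A real predictor would also scan the reverse strand and calibrate scores against validated TALE-DNA pairs; both steps are omitted here.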
Innovative Methods for Integrating Knowledge for Long-Term Monitoring of Contaminated Groundwater Sites: Understanding Microorganism Communities and their Associated Hydrochemical Environment

NASA Astrophysics Data System (ADS)

Mouser, P. J.; Rizzo, D. M.; Druschel, G.; O'Grady, P.; Stevens, L.

2005-12-01

This interdisciplinary study integrates hydrochemical and genome-based data to estimate the redox processes occurring at long-term monitoring sites. Groundwater samples have been collected from a well-characterized landfill-leachate contaminated aquifer in northeastern New York. Primers from the 16S rDNA gene were used to amplify Bacteria and Archaea in groundwater taken from monitoring wells located in clean, fringe, and contaminated locations within the aquifer. PCR-amplified rDNA were digested with restriction enzymes to evaluate terminal restriction fragment length polymorphism (T-RFLP) community profiles. The rDNA was cloned, sequenced, and partial sequences were matched against known organisms using the NCBI Blast database. Phylogenetic trees and bootstrapping were used to identify classifications of organisms and compare the communities from clean, fringe, and contaminated locations. We used Artificial Neural Network (ANN) models to incorporate microbial data with hydrochemical information for improving our understanding of subsurface processes.
Single molecule targeted sequencing for cancer gene mutation detection.

PubMed

Gao, Yan; Deng, Liwei; Yan, Qin; Gao, Yongqian; Wu, Zengding; Cai, Jinsen; Ji, Daorui; Li, Gailing; Wu, Ping; Jin, Huan; Zhao, Luyang; Liu, Song; Ge, Liangjin; Deem, Michael W; He, Jiankui

2016-05-19

With the rapid decline in cost of sequencing, it is now affordable to examine multiple genes in a single disease-targeted clinical test using next generation sequencing. Current targeted sequencing methods require a separate step of targeted capture enrichment during sample preparation before sequencing. Although there are fast sample preparation methods available on the market, the library preparation process is still relatively complicated for physicians to use routinely. Here, we introduced an amplification-free Single Molecule Targeted Sequencing (SMTS) technology, which combined targeted capture and sequencing in one step. We demonstrated that this technology can detect low-frequency mutations using an artificially synthesized DNA sample. SMTS has several potential advantages, including simple sample preparation, so that no biases or errors are introduced by a PCR reaction. SMTS has the potential to be an easy and quick sequencing technology for clinical diagnosis such as cancer gene mutation detection, infectious disease detection, inherited condition screening and noninvasive prenatal diagnosis.
Cloning, expression and activity analysis of a novel fibrinolytic serine protease from Arenicola cristata

NASA Astrophysics Data System (ADS)

Zhao, Chunling; Ju, Jiyu

2015-06-01

The full-length cDNA of a protease gene from a marine annelid Arenicola cristata was amplified through rapid amplification of cDNA ends technique and sequenced. The size of the cDNA was 936 bp in length, including an open reading frame encoding a polypeptide of 270 amino acid residues. The deduced amino acid sequence consisted of pro- and mature sequences. The protease belonged to the serine protease family because it contained the highly conserved sequence GDSGGP. This protease was novel as it showed a low amino acid sequence similarity (< 40%) to other serine proteases. The gene encoding the active form of A. cristata serine protease was cloned and expressed in E. coli. Purified recombinant protease in a supernatant could dissolve an artificial fibrin plate with plasminogen-rich fibrin, whereas the plasminogen-free fibrin showed no clear zone caused by hydrolysis. This result suggested that the recombinant protease showed an indirect fibrinolytic activity of dissolving fibrin, and was probably a plasminogen activator. A rat model with venous thrombosis was established to demonstrate that the recombinant protease could also hydrolyze blood clots in vivo. Therefore, this recombinant protease may be used as a thrombolytic agent for thrombosis treatment. To our knowledge, this is the first study to report the fibrinolytic serine protease gene in A. cristata.
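Two checks implied by this abstract can be sketched in a few lines: locating the conserved GDSGGP motif in a deduced protein sequence, and confirming that a 270-residue open reading frame fits inside a 936 bp cDNA. The toy protein string in the usage note is hypothetical and is not the A. cristata sequence.

```python
CONSERVED_MOTIF = "GDSGGP"  # serine protease motif named in the abstract


def find_motif(protein, motif=CONSERVED_MOTIF):
    """Return the 0-based start position of every occurrence of `motif` in `protein`."""
    positions = []
    start = protein.find(motif)
    while start != -1:
        positions.append(start)
        start = protein.find(motif, start + 1)
    return positions


def orf_length_bp(n_residues, include_stop=True):
    """Length in base pairs of an ORF encoding `n_residues` amino acids."""
    return 3 * n_residues + (3 if include_stop else 0)
```

For example, `find_motif("MKTAGDSGGPLVC")` locates the motif in a made-up peptide, and `orf_length_bp(270)` gives 813 bp, which leaves 123 bp of the 936 bp cDNA for untranslated regions.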
Artificial intelligence in hematology.

PubMed

Zini, Gina

2005-10-01

Artificial intelligence (AI) is a computer based science which aims to simulate human brain faculties using a computational system. A brief history of this new science goes from the creation of the first artificial neuron in 1943 to the first artificial neural network application to genetic algorithms. The potential for a similar technology in medicine has immediately been identified by scientists and researchers. The possibility to store and process all medical knowledge has made this technology very attractive to assist or even surpass clinicians in reaching a diagnosis. Applications of AI in medicine include devices applied to clinical diagnosis in neurology and cardiopulmonary diseases, as well as the use of expert or knowledge-based systems in routine clinical use for diagnosis, therapeutic management and for prognostic evaluation. Biological applications include genome sequencing or DNA gene expression microarrays, modeling gene networks, analysis and clustering of gene expression data, pattern recognition in DNA and proteins, protein structure prediction. In the field of hematology the first devices based on AI have been applied to the routine laboratory data management. New tools concern the differential diagnosis in specific diseases such as anemias, thalassemias and leukemias, based on neural networks trained with data from peripheral blood analysis. A revolution in cancer diagnosis, including the diagnosis of hematological malignancies, has been the introduction of the first microarray based and bioinformatic approach for molecular diagnosis: a systematic approach based on the monitoring of simultaneous expression of thousands of genes using DNA microarray, independently of previous biological knowledge, analysed using AI devices. Using gene profiling, the traditional diagnostic pathways move from clinical to molecular based diagnostic systems.

Authentication of forensic DNA samples.

PubMed

Frumkin, Dan; Wasserstrom, Adam; Davidson, Ariane; Grafit, Arnon

2010-02-01

Over the past twenty years, DNA analysis has revolutionized forensic science, and has become a dominant tool in law enforcement. Today, DNA evidence is key to the conviction or exoneration of suspects of various types of crime, from theft to rape and murder. However, the disturbing possibility that DNA evidence can be faked has been overlooked. It turns out that standard molecular biology techniques such as PCR, molecular cloning, and recently developed whole genome amplification (WGA), enable anyone with basic equipment and know-how to produce practically unlimited amounts of in vitro synthesized (artificial) DNA with any desired genetic profile. This artificial DNA can then be applied to surfaces of objects or incorporated into genuine human tissues and planted in crime scenes. Here we show that the current forensic procedure fails to distinguish between such samples of blood, saliva, and touched surfaces with artificial DNA, and corresponding samples with in vivo generated (natural) DNA. Furthermore, genotyping of both artificial and natural samples with Profiler Plus® yielded full profiles with no anomalies. In order to effectively deal with this problem, we developed an authentication assay, which distinguishes between natural and artificial DNA based on methylation analysis of a set of genomic loci: in natural DNA, some loci are methylated and others are unmethylated, while in artificial DNA all loci are unmethylated. The assay was tested on natural and artificial samples of blood, saliva, and touched surfaces, with complete success. Adopting an authentication assay for casework samples as part of the forensic procedure is necessary for maintaining the high credibility of DNA evidence in the judiciary system.
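The decision rule stated in this abstract (natural DNA shows a mixed methylation pattern, artificial DNA is unmethylated at every locus) can be sketched as a small classifier. The locus names, the 0.5 threshold, and the "inconclusive" outcome are assumptions made for illustration; the abstract does not give the assay's loci or cut-offs.

```python
def classify_sample(levels, expected_methylated, expected_unmethylated, threshold=0.5):
    """Classify a sample from per-locus methylation fractions (0.0 to 1.0).

    levels: dict mapping locus name -> measured methylation fraction.
    expected_methylated / expected_unmethylated: loci whose state in natural DNA
    is assumed known (hypothetical reference panel).
    """
    all_loci = list(expected_methylated) + list(expected_unmethylated)
    if all(levels[locus] < threshold for locus in all_loci):
        return "artificial"  # every locus unmethylated, as for in vitro synthesized DNA
    methylated_ok = all(levels[locus] >= threshold for locus in expected_methylated)
    unmethylated_ok = all(levels[locus] < threshold for locus in expected_unmethylated)
    if methylated_ok and unmethylated_ok:
        return "natural"  # mixed pattern matches the reference
    return "inconclusive"  # neither pattern; could indicate a mixture or assay failure
```

With hypothetical loci `L1` and `L2` expected methylated and `L3` expected unmethylated, a sample reading `{"L1": 0.9, "L2": 0.8, "L3": 0.1}` is classified as natural, while `{"L1": 0.0, "L2": 0.1, "L3": 0.0}` is classified as artificial.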
Evolutionary genomics of miniature inverted-repeat transposable elements (MITEs) in Brassica.

PubMed

Nouroz, Faisal; Noreen, Shumaila; Heslop-Harrison, J S

2015-12-01

Miniature inverted-repeat transposable elements (MITEs) are truncated derivatives of autonomous DNA transposons, and are dispersed abundantly in most eukaryotic genomes. We aimed to characterize various MITE families in Brassica in terms of their presence, sequence characteristics and evolutionary activity. Dot plot analyses involving comparison of homoeologous bacterial artificial chromosome (BAC) sequences allowed identification of 15 novel families of mobile MITEs. Of these, 5 were Stowaway-like with TA Target Site Duplications (TSDs), 4 Tourist-like with TAA/TTA TSDs, 5 Mutator-like with 9-10 bp TSDs and 1 novel MITE (BoXMITE1) flanked by 3 bp TSDs. Our data suggested that there are about 30,000 MITE-related sequences in Brassica rapa and B. oleracea genomes. In situ hybridization showed one abundant family was dispersed in the A-genome, while another was located near 45S rDNA sites. PCR analysis using primers flanking sequences of MITE elements detected MITE insertion polymorphisms between and within the three Brassica (AA, BB, CC) genomes, with many insertions being specific to single genomes and others showing evidence of more recent evolutionary insertions. Our BAC sequence comparison strategy enables identification of evolutionarily active MITEs with no prior knowledge of MITE sequences. The details of MITE families reported in Brassica enable their identification, characterization and annotation. Insertion polymorphisms of MITEs and their transposition activity indicate an important mechanism of genome evolution and diversification. MITE families derived from known Mariner, Harbinger and Mutator DNA transposons were discovered, as well as some novel structures. The identification of Brassica MITEs will have broad applications in Brassica genomics, breeding, hybridization and phylogeny through their use as DNA markers.
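The two structural hallmarks used to classify the MITE families above, terminal inverted repeats (TIRs) and flanking target site duplications (TSDs), can be checked with a short sketch. It requires exact matches, which is a simplification; real elements usually tolerate mismatches, and the example sequence in the usage note is invented.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def has_terminal_inverted_repeat(element, tir_len):
    """True if the first `tir_len` bases equal the reverse complement of the last `tir_len`."""
    if tir_len <= 0 or 2 * tir_len > len(element):
        return False
    return element[:tir_len] == reverse_complement(element[-tir_len:])


def flanking_tsd(sequence, start, end, tsd_len):
    """Return the TSD flanking the element at sequence[start:end], or None.

    A TSD is reported only when the `tsd_len` bases immediately upstream of
    `start` are identical to the `tsd_len` bases immediately downstream of `end`.
    """
    if start - tsd_len < 0 or end + tsd_len > len(sequence):
        return None
    left = sequence[start - tsd_len:start]
    right = sequence[end:end + tsd_len]
    return left if left == right else None
```

For an invented Stowaway-like insertion such as `"CCCTA" + "GGCAGTAAAAAAACTGCC" + "TAGGG"`, the element at positions 5 to 23 has a 6 bp TIR and `flanking_tsd(..., 5, 23, 2)` returns `"TA"`.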
The Chapel Hill hemophilia A dog colony exhibits a factor VIII gene inversion

PubMed Central

Lozier, Jay N.; Dutra, Amalia; Pak, Evgenia; Zhou, Nan; Zheng, Zhili; Nichols, Timothy C.; Bellinger, Dwight A.; Read, Marjorie; Morgan, Richard A.

2002-01-01

In the Chapel Hill colony of factor VIII-deficient dogs, abnormal sequence (ch8, for canine hemophilia 8, GenBank no. AF361485) follows exons 1–22 in the factor VIII transcript in place of exons 23–26. The canine hemophilia 8 locus (ch8) sequence was found in a 140-kb normal dog genomic DNA bacterial artificial chromosome (BAC) clone that was completely outside the factor VIII gene, but not in BAC clones containing the factor VIII gene. The BAC clone that contained ch8 also contained a homologue of F8A (factor 8 associated) sequence, which participates in a common inversion that causes severe hemophilia A in humans. Fluorescence in situ hybridization analysis indicated that exons 1–26 normally proceed sequentially from telomere to centromere at Xq28, and ch8 is telomeric to the factor VIII gene. The appearance of an “upstream” genomic sequence element (ch8) at the end of the aberrant factor VIII transcript suggested that an inversion of genomic DNA replaced factor VIII exons 22–26 with ch8. The F8A sequence appeared also in overlapping normal BAC clones containing factor VIII sequence. We hypothesized that homologous recombination between copies of canine F8A inside and outside the factor VIII gene had occurred, as in human hemophilia A. High-resolution fluorescent in situ hybridization on hemophilia A dog DNA revealed a pattern consistent with this inversion mechanism. We also identified a HindIII restriction fragment length polymorphism of F8A fragments that distinguished hemophilia A, carrier, and normal dogs' DNA. The Chapel Hill hemophilia A dog colony therefore replicates the factor VIII gene inversion commonly seen in humans with severe hemophilia A. PMID:12242334
Construction of BAC Libraries from Flow-Sorted Chromosomes.

PubMed

Šafář, Jan; Šimková, Hana; Doležel, Jaroslav

2016-01-01

Cloned DNA libraries in bacterial artificial chromosome (BAC) are the most widely used form of large-insert DNA libraries. BAC libraries are typically represented by ordered clones derived from genomic DNA of a particular organism. In the case of large eukaryotic genomes, whole-genome libraries consist of a hundred thousand to a million clones, which make their handling and screening a daunting task. The labor and cost of working with whole-genome libraries can be greatly reduced by constructing a library derived from a smaller part of the genome. Here we describe construction of BAC libraries from mitotic chromosomes purified by flow cytometric sorting. Chromosome-specific BAC libraries facilitate positional gene cloning, physical mapping, and sequencing in complex plant genomes.
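The saving from building a library on a sorted chromosome rather than the whole genome can be illustrated with the standard Clarke-Carbon estimate, N = ln(1 - P) / ln(1 - I/G), for the number of clones N needed to include any given sequence with probability P, given insert size I and target size G. The formula is not stated in the abstract, and the genome, chromosome, and insert sizes in the usage note are hypothetical.

```python
import math


def clones_needed(target_bp, insert_bp, probability=0.99):
    """Clarke-Carbon estimate of clones required to cover a target with given probability."""
    if not 0 < probability < 1:
        raise ValueError("probability must be strictly between 0 and 1")
    if not 0 < insert_bp < target_bp:
        raise ValueError("insert size must be positive and smaller than the target")
    fraction = insert_bp / target_bp  # share of the target carried by one clone
    return math.ceil(math.log(1 - probability) / math.log(1 - fraction))
```

Under these assumptions, a 1 Gb genome with 100 kb inserts needs roughly 46,000 clones for 99% probability, whereas a 100 Mb sorted chromosome needs roughly a tenth of that.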
Production and removal of superoxide anion radical by artificial metalloenzymes and redox-active metals

PubMed Central

Kawano, Tomonori; Kagenishi, Tomoko; Kadono, Takashi; Bouteau, François; Hiramatsu, Takuya; Lin, Cun; Tanaka, Kenichiro; Tanaka, Licca; Mancuso, Stefano; Uezu, Kazuya; Okobira, Tadashi; Furukawa, Hiroka; Iwase, Junichiro; Inokuchi, Reina; Baluška, Frantisek; Yokawa, Ken

2015-01-01

Generation of reactive oxygen species is useful for various medical, engineering and agricultural purposes. These include clinical modulation of immunological mechanisms, enhanced degradation of organic compounds released to the environment, removal of microorganisms for hygienic purposes, and agricultural pest control, both directly acting against pathogenic microorganisms and indirectly via stimulation of plant defense mechanisms represented by systemic acquired resistance and hypersensitive response. With the aim of developing novel classes of artificial redox-active biocatalysts involved in production and/or removal of superoxide anion radicals, this review article covers recent attempts to understand and modify natural catalytic proteins and functional DNA sequences of mammalian and plant origins. PMID:27066179
Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data.

PubMed

Favero, F; Joshi, T; Marquard, A M; Birkbak, N J; Krzystanek, M; Li, Q; Szallasi, Z; Eklund, A C

2015-01-01

Exome or whole-genome deep sequencing of tumor DNA along with paired normal DNA can potentially provide a detailed picture of the somatic mutations that characterize the tumor. However, analysis of such sequence data can be complicated by the presence of normal cells in the tumor specimen, by intratumor heterogeneity, and by the sheer size of the raw data. In particular, determination of copy number variations from exome sequencing data alone has proven difficult; thus, single nucleotide polymorphism (SNP) arrays have often been used for this task. Recently, algorithms to estimate absolute, but not allele-specific, copy number profiles from tumor sequencing data have been described. We developed Sequenza, a software package that uses paired tumor-normal DNA sequencing data to estimate tumor cellularity and ploidy, and to calculate allele-specific copy number profiles and mutation profiles. We applied Sequenza, as well as two previously published algorithms, to exome sequence data from 30 tumors from The Cancer Genome Atlas. We assessed the performance of these algorithms by comparing their results with those generated using matched SNP arrays and processed by the allele-specific copy number analysis of tumors (ASCAT) algorithm. Comparison between Sequenza/exome and SNP/ASCAT revealed strong correlation in cellularity (Pearson's r = 0.90) and ploidy estimates (r = 0.42, or r = 0.94 after manually inspecting alternative solutions). This performance was noticeably superior to previously published algorithms. In addition, in artificial data simulating normal-tumor admixtures, Sequenza detected the correct ploidy in samples with tumor content as low as 30%. The agreement between Sequenza and SNP array-based copy number profiles suggests that exome sequencing alone is sufficient not only for identifying small scale mutations but also for estimating cellularity and inferring DNA copy number aberrations. © The Author 2014. Published by Oxford University Press on behalf of the European Society for Medical Oncology.
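Why normal-cell admixture complicates copy number calling can be seen from a commonly used mixture model, sketched below. It assumes a diploid normal genome that is heterozygous at the site of interest; the function and parameter names are illustrative, and this is not presented as Sequenza's actual implementation.

```python
def expected_baf(cellularity, tumor_cn, tumor_b_copies):
    """Expected B-allele frequency at a germline heterozygous site in a tumor/normal mixture."""
    if not 0 <= cellularity <= 1:
        raise ValueError("cellularity must be between 0 and 1")
    b_alleles = cellularity * tumor_b_copies + (1 - cellularity) * 1
    all_alleles = cellularity * tumor_cn + (1 - cellularity) * 2
    if all_alleles == 0:
        raise ValueError("no DNA at this locus (pure tumor with homozygous deletion)")
    return b_alleles / all_alleles


def expected_depth_ratio(cellularity, tumor_cn, ploidy):
    """Expected tumor/normal depth ratio for a segment with `tumor_cn` copies.

    `ploidy` is the average tumor copy number, used to normalise total DNA content.
    """
    if not 0 <= cellularity <= 1:
        raise ValueError("cellularity must be between 0 and 1")
    segment = cellularity * tumor_cn + (1 - cellularity) * 2
    genome_average = cellularity * ploidy + (1 - cellularity) * 2
    return segment / genome_average
```

At 30% tumor content, a single-copy gain (3 copies, one B allele) in an otherwise diploid tumor shifts the depth ratio only from 1.0 to 1.15 and the B-allele frequency from 0.5 to about 0.43, which illustrates how weak the signal becomes in low-purity samples.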
Complementary DNA display selection of high-affinity peptides binding the vacuolating toxin (VacA) of Helicobacter pylori.

PubMed

Hayakawa, Yumiko; Matsuno, Mitsuhiro; Tanaka, Makoto; Wada, Akihiro; Kitamura, Koichiro; Takei, Osamu; Sasaki, Ryuzo; Mizukami, Tamio; Hasegawa, Makoto

2015-09-01

Artificial peptides designed for molecular recognition of a bacterial toxin have been developed. Vacuolating cytotoxin A protein (VacA) is a major virulence factor of Helicobacter pylori, a gram-negative microaerophilic bacterium inhabiting the upper gastrointestinal tract, particularly the stomach. This study attempted to identify specific peptide sequences with high affinity for VacA using systematic directed evolution in vitro, a cDNA display method. A surface plasmon resonance-based biosensor and fluorescence correlation spectroscopy to examine binding of peptides with VacA identified a peptide (GRVNQRL) with high affinity. Cyclization of the peptide by attaching cysteine residues to both termini improved its binding affinity to VacA, with a dissociation constant (Kd) of 58 nM. This study describes a new strategy for the development of artificial functional peptides, which are promising materials in biochemical analyses and medical applications. Copyright © 2015 European Peptide Society and John Wiley & Sons, Ltd.
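To give the reported dissociation constant a concrete meaning, the sketch below computes the fraction of target occupied at a given free peptide concentration. It assumes simple 1:1 equilibrium binding with the peptide in excess, which the abstract does not state; only the 58 nM value comes from the record.

```python
def fraction_bound(ligand_nM, kd_nM=58.0):
    """Fraction of target sites occupied at free ligand concentration `ligand_nM` (1:1 binding)."""
    if ligand_nM < 0 or kd_nM <= 0:
        raise ValueError("concentrations must be non-negative and Kd positive")
    return ligand_nM / (ligand_nM + kd_nM)
```

Under this model, half of the target is occupied at 58 nM peptide and about 91% at 580 nM.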
The genome of cultivated sweet potato contains Agrobacterium T-DNAs with expressed genes: An example of a naturally transgenic food crop

PubMed Central

Kyndt, Tina; Quispe, Dora; Zhai, Hong; Jarret, Robert; Ghislain, Marc; Liu, Qingchang; Gheysen, Godelieve

2015-01-01

Agrobacterium rhizogenes and Agrobacterium tumefaciens are plant pathogenic bacteria capable of transferring DNA fragments [transfer DNA (T-DNA)] bearing functional genes into the host plant genome. This naturally occurring mechanism has been adapted by plant biotechnologists to develop genetically modified crops that today are grown on more than 10% of the world’s arable land, although their use can result in considerable controversy. While assembling small interfering RNAs, or siRNAs, of sweet potato plants for metagenomic analysis, sequences homologous to T-DNA sequences from Agrobacterium spp. were discovered. Simple and quantitative PCR, Southern blotting, genome walking, and bacterial artificial chromosome library screening and sequencing unambiguously demonstrated that two different T-DNA regions (IbT-DNA1 and IbT-DNA2) are present in the cultivated sweet potato (Ipomoea batatas [L.] Lam.) genome and that these foreign genes are expressed at detectable levels in different tissues of the sweet potato plant. IbT-DNA1 was found to contain four open reading frames (ORFs) homologous to the tryptophan-2-monooxygenase (iaaM), indole-3-acetamide hydrolase (iaaH), C-protein (C-prot), and agrocinopine synthase (Acs) genes of Agrobacterium spp. IbT-DNA1 was detected in all 291 cultigens examined, but not in close wild relatives. IbT-DNA2 contained at least five ORFs with significant homology to the ORF14, ORF17n, rooting locus (Rol)B/RolC, ORF13, and ORF18/ORF17n genes of A. rhizogenes. IbT-DNA2 was detected in 45 of 217 genotypes that included both cultivated and wild species. Our finding, that sweet potato is naturally transgenic while being a widely and traditionally consumed food crop, could affect the current consumer distrust of the safety of transgenic food crops. PMID:25902487
Endogenous Hot Spots of De Novo Telomere Addition in the Yeast Genome Contain Proximal Enhancers That Bind Cdc13

PubMed Central

Obodo, Udochukwu C.; Epum, Esther A.; Platts, Margaret H.; Seloff, Jacob; Dahlson, Nicole A.; Velkovsky, Stoycho M.; Paul, Shira R.

2016-01-01

DNA double-strand breaks (DSBs) pose a threat to genome stability and are repaired through multiple mechanisms. Rarely, telomerase, the enzyme that maintains telomeres, acts upon a DSB in a mutagenic process termed telomere healing. The probability of telomere addition is increased at specific genomic sequences termed sites of repair-associated telomere addition (SiRTAs). By monitoring repair of an induced DSB, we show that SiRTAs on chromosomes V and IX share a bipartite structure in which a core sequence (Core) is directly targeted by telomerase, while a proximal sequence (Stim) enhances the probability of de novo telomere formation. The Stim and Core sequences are sufficient to confer a high frequency of telomere addition to an ectopic site. Cdc13, a single-stranded DNA binding protein that recruits telomerase to endogenous telomeres, is known to stimulate de novo telomere addition when artificially recruited to an induced DSB. Here we show that the ability of the Stim sequence to enhance de novo telomere addition correlates with its ability to bind Cdc13, indicating that natural sites at which telomere addition occurs at high frequency require binding by Cdc13 to a sequence 20 to 100 bp internal from the site at which telomerase acts to initiate de novo telomere addition. PMID:27044869
Structure, Function, and Evolution of Rice Centromeres

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jiang, Jiming

2010-02-04

The centromere is the most characteristic landmark of eukaryotic chromosomes. Centromeres function as the site for kinetochore assembly and spindle attachment, allowing for the faithful pairing and segregation of sister chromatids during cell division. Characterization of centromeric DNA is not only essential to understand the structure and organization of plant genomes, but it is also a critical step in the development of plant artificial chromosomes. The centromeres of most model eukaryotic species consist predominantly of long arrays of satellite DNA. Determining the precise DNA boundary of a centromere has proven to be a difficult task in multicellular eukaryotes. We have successfully cloned and sequenced the centromere of rice chromosome 8 (Cen8), representing the first fully sequenced centromere from any multicellular eukaryote. The functional core of Cen8 spans ~800 kb of DNA, which was determined by chromatin immunoprecipitation (ChIP) using an antibody against the rice centromere-specific H3 histone. We discovered 16 actively transcribed genes distributed throughout the Cen8 region. In addition to Cen8, we have characterized eight additional rice centromeres using the next generation sequencing technology. We discovered four subfamilies of the CRR retrotransposon that is highly enriched in rice centromeres. CRR elements are constitutively transcribed and different CRR subfamilies are differentially processed by RNAi. These results suggest that different CRR subfamilies may play different roles in the RNAi-mediated pathway for formation and maintenance of centromeric chromatin.
Constructing Smart Protocells with Built-In DNA Computational Core to Eliminate Exogenous Challenge.

PubMed

Lyu, Yifan; Wu, Cuichen; Heinke, Charles; Han, Da; Cai, Ren; Teng, I-Ting; Liu, Yuan; Liu, Hui; Zhang, Xiaobing; Liu, Qiaoling; Tan, Weihong

2018-06-06

A DNA reaction network is like a biological algorithm that can respond to "molecular input signals", such as biological molecules, while the artificial cell is like a microrobot whose function is powered by the encapsulated DNA reaction network. In this work, we describe the feasibility of using a DNA reaction network as the computational core of a protocell, which will perform an artificial immune response in a concise way to eliminate a mimicked pathogenic challenge. Such a DNA reaction network (RN)-powered protocell can realize the connection of logical computation and biological recognition due to the natural programmability and biological properties of DNA. Thus, the biological input molecules can be easily involved in the molecular computation and the computation process can be spatially isolated and protected by artificial bilayer membrane. We believe the strategy proposed in the current paper, i.e., using DNA RN to power artificial cells, will lay the groundwork for understanding the basic design principles of DNA algorithm-based nanodevices which will, in turn, inspire the construction of artificial cells, or protocells, that will find a place in future biomedical research.
AFEAP cloning: a precise and efficient method for large DNA sequence assembly.

PubMed

Zeng, Fanli; Zang, Jinping; Zhang, Suhua; Hao, Zhimin; Dong, Jingao; Lin, Yibin

2017-11-14

Recent development of DNA assembly technologies has spurred myriad advances in synthetic biology, but new tools are always required for complicated scenarios. Here, we have developed an alternative DNA assembly method named AFEAP cloning (Assembly of Fragment Ends After PCR), which allows scarless, modular, and reliable construction of biological pathways and circuits from basic genetic parts. The AFEAP method requires two rounds of PCR followed by ligation of the sticky ends of DNA fragments. The first PCR yields linear DNA fragments and is followed by a second asymmetric (one primer) PCR and subsequent annealing that inserts overlapping overhangs at both sides of each DNA fragment. The overlapping overhangs of the neighboring DNA fragments are annealed and the nicks are sealed by T4 DNA ligase, followed by bacterial transformation to yield the desired plasmids. We characterized the capability and limitations of the newly developed AFEAP cloning and demonstrated its application to assembling DNA in varying scenarios. Under the optimized conditions, AFEAP cloning allows assembly of an 8 kb plasmid from 1-13 fragments with high accuracy (between 80 and 100%), and 8.0, 11.6, 19.6, 28, and 35.6 kb plasmids from five fragments at 91.67, 91.67, 88.33, 86.33, and 81.67% fidelity, respectively. AFEAP cloning is also capable of constructing a bacterial artificial chromosome (BAC, 200 kb) with a fidelity of 46.7%. AFEAP cloning provides a powerful, efficient, seamless, and sequence-independent DNA assembly tool for multiple fragments up to 13 and large DNA up to 200 kb that expands the synthetic biologist's toolbox.
Resistance gene candidates identified by PCR with degenerate oligonucleotide primers map to clusters of resistance genes in lettuce.

PubMed

Shen, K A; Meyers, B C; Islam-Faridi, M N; Chin, D B; Stelly, D M; Michelmore, R W

1998-08-01

The recent cloning of genes for resistance against diverse pathogens from a variety of plants has revealed that many share conserved sequence motifs. This provides the possibility of isolating numerous additional resistance genes by polymerase chain reaction (PCR) with degenerate oligonucleotide primers. We amplified resistance gene candidates (RGCs) from lettuce with multiple combinations of primers with low degeneracy designed from motifs in the nucleotide binding sites (NBSs) of RPS2 of Arabidopsis thaliana and N of tobacco. Genomic DNA, cDNA, and bacterial artificial chromosome (BAC) clones were successfully used as templates. Four families of sequences were identified that had the same similarity to each other as to resistance genes from other species. The relationship of the amplified products to resistance genes was evaluated by several sequence and genetic criteria. The amplified products contained open reading frames with additional sequences characteristic of NBSs. Hybridization of RGCs to genomic DNA and to BAC clones revealed large numbers of related sequences. Genetic analysis demonstrated the existence of clustered multigene families for each of the four RGC sequences. This parallels classical genetic data on clustering of disease resistance genes. Two of the four families mapped to known clusters of resistance genes; these two families were therefore studied in greater detail. Additional evidence that these RGCs could be resistance genes was gained by the identification of leucine-rich repeat (LRR) regions in sequences adjoining the NBS similar to those in RPM1 and RPS2 of A. thaliana. Fluorescent in situ hybridization confirmed the clustered genomic distribution of these sequences. The use of PCR with degenerate oligonucleotide primers is therefore an efficient method to identify numerous RGCs in plants.
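"Low degeneracy" in this abstract refers to how many distinct oligonucleotides a degenerate primer represents. The sketch below computes that number and enumerates the variants using the standard IUPAC nucleotide codes; the example primer in the usage note is invented and is not one of the primers used in the study.

```python
from itertools import product
from math import prod

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(primer):
    """Number of distinct sequences encoded by a degenerate primer."""
    return prod(len(IUPAC[base]) for base in primer.upper())


def expand(primer):
    """List every concrete sequence encoded by a degenerate primer."""
    choices = [IUPAC[base] for base in primer.upper()]
    return ["".join(combo) for combo in product(*choices)]
```

For the made-up primer `GGRAAYTG`, `degeneracy` returns 4 and `expand` lists the four concrete oligonucleotides. Enumeration grows multiplicatively with each ambiguous position, so `degeneracy` alone is the safer call for primers containing several `N` positions.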
Non-coding-regulatory regions of human brain genes delineated by bacterial artificial chromosome knock-in mice.

PubMed

Schmouth, Jean-François; Castellarin, Mauro; Laprise, Stéphanie; Banks, Kathleen G; Bonaguro, Russell J; McInerny, Simone C; Borretta, Lisa; Amirabbasi, Mahsa; Korecki, Andrea J; Portales-Casamar, Elodie; Wilson, Gary; Dreolini, Lisa; Jones, Steven J M; Wasserman, Wyeth W; Goldowitz, Daniel; Holt, Robert A; Simpson, Elizabeth M

2013-10-14

The next big challenge in human genetics is understanding the 98% of the genome that comprises non-coding DNA. Hidden in this DNA are sequences critical for gene regulation, and new experimental strategies are needed to understand the functional role of gene-regulation sequences in health and disease. In this study, we build upon our HuGX ('high-throughput human genes on the X chromosome') strategy to expand our understanding of human gene regulation in vivo. In all, ten human genes known to express in therapeutically important brain regions were chosen for study. For eight of these genes, human bacterial artificial chromosome clones were identified, retrofitted with a reporter, knocked single-copy into the Hprt locus in mouse embryonic stem cells, and mouse strains derived. Five of these human genes expressed in mouse, and all expressed in the adult brain region for which they were chosen. This defined the boundaries of the genomic DNA sufficient for brain expression, and refined our knowledge regarding the complexity of gene regulation. We also characterized for the first time the expression of human MAOA and NR2F2, two genes for which the mouse homologs have been extensively studied in the central nervous system (CNS), and AMOTL1 and NOV, for which roles in CNS have been unclear. We have demonstrated the use of the HuGX strategy to functionally delineate non-coding-regulatory regions of therapeutically important human brain genes. Our results also show that a careful investigation, using publicly available resources and bioinformatics, can lead to accurate predictions of gene expression.
Revising the phylogenetic position of the extinct Mascarene Parrot Mascarinus mascarin (Linnaeus 1771) (Aves: Psittaciformes: Psittacidae).

PubMed

Podsiadlowski, Lars; Gamauf, Anita; Töpfer, Till

2017-02-01

The phylogenetic position of the extinct Mascarene Parrot Mascarinus mascarin from La Réunion has been unresolved for centuries. A recent molecular study unexpectedly placed M. mascarin within the clade of phenotypically very different Vasa parrots Coracopsis. Based on DNA extracted from the only other preserved Mascarinus specimen, we show that the previously obtained cytb sequence is probably an artificial composite of partial sequences from two other parrot species and that M. mascarin is indeed a part of the Psittacula diversification, placed close to P. eupatria and P. wardi. Copyright © 2016 Elsevier Inc. All rights reserved.
Developing de novo human artificial chromosomes in embryonic stem cells using HSV-1 amplicon technology.

PubMed

Moralli, Daniela; Monaco, Zoia L

2015-02-01

De novo artificial chromosomes expressing genes have been generated in human embryonic stem cells (hESc) and are maintained following differentiation into other cell types. Human artificial chromosomes (HAC) are small, functional, extrachromosomal elements, which behave as normal chromosomes in human cells. De novo HAC are generated following delivery of alpha satellite DNA into target cells. HAC are characterized by high levels of mitotic stability and are used as models to study centromere formation and chromosome organisation. They are successful and effective as gene expression vectors since they remain autonomous and can accommodate larger genes and regulatory regions for long-term expression studies in cells unlike other viral gene delivery vectors currently used. Transferring the essential DNA sequences for HAC formation intact across the cell membrane has been challenging for a number of years. A highly efficient delivery system based on HSV-1 amplicons has been used to target DNA directly to the ES cell nucleus and HAC stably generated in human embryonic stem cells (hESc) at high frequency. HAC were detected using an improved protocol for hESc chromosome harvesting, which consistently produced high-quality metaphase spreads that could routinely detect HAC in hESc. In tumour cells, the input DNA often integrated in the host chromosomes, but in the host ES genome, it remained intact. The hESc containing the HAC formed embryoid bodies, generated teratoma in mice, and differentiated into neuronal cells where the HAC were maintained. The HAC structure and chromatin composition were similar to those of the endogenous hESc chromosomes. This review will discuss the technological advances in HAC vector delivery using HSV-1 amplicons and the improvements in the identification of de novo HAC in hESc.
Single-Molecule Denaturation Mapping of DNA in Nanofluidic Channels

NASA Astrophysics Data System (ADS)

Reisner, Walter; Larsen, Niels; Silahtaroglu, Asli; Kristensen, Anders; Tommerup, Niels; Tegenfeldt, Jonas O.; Flyvbjerg, Henrik

2010-03-01

Nanochannel based DNA stretching can serve as a platform for a new optical mapping technique based on measuring the pattern of partial melting along the extended molecules. We partially melt DNA extended in nanofluidic channels via a combination of local heating and added chemical denaturants. The melted molecules, imaged via a standard fluorescence videomicroscopy setup, exhibit a nonuniform fluorescence profile corresponding to a series of local dips and peaks in the intensity trace along the stretched molecule. We show that this barcode is consistent with the presence of locally melted regions along the molecule and can be explained by calculations of sequence-dependent melting probability. Specifically, we obtain experimental melting profiles for T4, T7, lambda-phage and bacterial artificial chromosome DNA (from human chromosome 12) and compare these profiles to theory. In addition, we demonstrate that the BAC melting profile can be used to align the BAC to its correct position on chromosome 12.
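The study compares measured barcodes with calculations of sequence-dependent melting probability. The sketch below is a much cruder stand-in, assuming only that AT-rich regions tend to melt first: it computes a sliding-window AT fraction as a proxy profile and then finds the offset at which a short profile best matches a longer reference profile by least squares. The function names, window sizes and toy inputs are hypothetical, and this is not the model or alignment procedure used by the authors.

```python
def at_fraction_profile(seq, window, step=1):
    """Sliding-window AT fraction, used here as a crude melting proxy."""
    seq = seq.upper()
    if window <= 0 or window > len(seq):
        raise ValueError("window must be between 1 and len(seq)")
    profile = []
    for start in range(0, len(seq) - window + 1, step):
        chunk = seq[start:start + window]
        profile.append((chunk.count("A") + chunk.count("T")) / window)
    return profile


def best_offset(fragment, reference):
    """Offset into `reference` that minimises the squared difference to `fragment`."""
    if not fragment or len(fragment) > len(reference):
        raise ValueError("fragment must be non-empty and no longer than reference")
    best, best_score = 0, float("inf")
    for offset in range(len(reference) - len(fragment) + 1):
        score = sum((value - reference[offset + i]) ** 2
                    for i, value in enumerate(fragment))
        if score < best_score:
            best, best_score = offset, score
    return best
```

A realistic comparison would also need to account for optical resolution, molecule stretching and measurement noise, none of which this sketch models.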
Structural DNA Nanotechnology: Artificial Nanostructures for Biomedical Research.

PubMed

Ke, Yonggang; Castro, Carlos; Choi, Jong Hyun

2018-06-04

Structural DNA nanotechnology utilizes synthetic or biologic DNA as designer molecules for the self-assembly of artificial nanostructures. The field is founded upon the specific interactions between DNA molecules, known as Watson-Crick base pairing. After decades of active pursuit, DNA has demonstrated unprecedented versatility in constructing artificial nanostructures with significant complexity and programmability. The nanostructures could be either static, with well-controlled physicochemical properties, or dynamic, with the ability to reconfigure upon external stimuli. Researchers have devoted considerable effort to exploring the usability of DNA nanostructures in biomedical research. We review the basic design methods for fabricating both static and dynamic DNA nanostructures, along with their biomedical applications in fields such as biosensing, bioimaging, and drug delivery.
Spatial and temporal heterogeneity of microbial life in artificial landscapes

NASA Astrophysics Data System (ADS)

Sengupta, A.; Kaur, R.; Meredith, L. K.; Troch, P. A. A.

2017-12-01

The Landscape Evolution Observatory (LEO) project at Biosphere 2 consists of three replicated artificial landscapes which are sealed within a climate-controlled glass house. LEO is composed of basaltic soil material with low organic matter, nutrients, and microbes. The landscapes are built to resemble zero-order basins and enable researchers to observe hydrological, biological, and geochemical evolution of landscapes in a controlled environment. This study is focused on capturing microbial community dynamics in LEO soil, pre- and post-controlled rainfall episodes. Soil samples were collected from six different locations and at five depths in each of the three slopes followed by DNA extraction from 180 samples and sent for amplicon and minimal draft metagenome sequencing. The average concentration of DNA recovered from each sample was higher in the post-rainfall samples than the pre-rainfall samples, a trend consistent in all three slopes. The sequence data will be evaluated to reveal heterogeneity of the soil microbes, providing a more exact narrative of the microbes present in each slope and the spatiotemporal trends of microbial life in the landscapes. Next, functional traits will be predicted from the community data and metagenomes to determine whether consistent changes occur with respect to wetting and drying episodes. Together, these results will highlight the relevance of a unique terrestrial ecosystem research infrastructure in supporting interdisciplinary hydrobiogeochemical research.
An efficient approach to BAC based assembly of complex genomes.

PubMed

Visendi, Paul; Berkman, Paul J; Hayashi, Satomi; Golicz, Agnieszka A; Bayer, Philipp E; Ruperao, Pradeep; Hurgobin, Bhavna; Montenegro, Juan; Chan, Chon-Kit Kenneth; Staňková, Helena; Batley, Jacqueline; Šimková, Hana; Doležel, Jaroslav; Edwards, David

2016-01-01

There has been an exponential growth in the number of genome sequencing projects since the introduction of next generation DNA sequencing technologies. Genome projects have increasingly involved assembly of whole genome data which produces inferior assemblies compared to traditional Sanger sequencing of genomic fragments cloned into bacterial artificial chromosomes (BACs). While whole genome shotgun sequencing using next generation sequencing (NGS) is relatively fast and inexpensive, this method is extremely challenging for highly complex genomes, where polyploidy or high repeat content confounds accurate assembly, or where a highly accurate 'gold' reference is required. Several attempts have been made to improve genome sequencing approaches by incorporating NGS methods, with variable success. We present the application of a novel BAC sequencing approach which combines indexed pools of BACs, Illumina paired read sequencing, a sequence assembler specifically designed for complex BAC assembly, and a custom bioinformatics pipeline. We demonstrate this method by sequencing and assembling BAC cloned fragments from bread wheat and sugarcane genomes. We demonstrate that our assembly approach is accurate, robust, cost effective and scalable, with applications for complete genome sequencing in large and complex genomes.

More Genetic Engineering With Cloned Hemoglobin Genes

NASA Technical Reports Server (NTRS)

Bailey, James E.

1992-01-01

Cells modified to enhance growth and production of proteins. Method for enhancing both growth of micro-organisms in vitro and production of various proteins or metabolites in these micro-organisms provides for incorporation of selected chromosomal or extrachromosomal deoxyribonucleic acid (DNA) sequences into micro-organisms from other cells or from artificial sources. Incorporated DNA includes parts encoding desired product(s) or characteristic(s) of cells and parts that control expression of product- or characteristic-encoding parts in response to variations in environment. Extended method enables increased research into growth of organisms in oxygen-poor environments. Industrial applications found in enhancement of processing steps requiring oxygen in fermentation, enzymatic degradation, treatment of wastes containing toxic chemicals, brewing, and some oxidative chemical reactions.
A LDR-PCR approach for multiplex polymorphisms genotyping of severely degraded DNA with fragment sizes <100 bp.

PubMed

Zhang, Zhen; Wang, Bao-Jie; Guan, Hong-Yu; Pang, Hao; Xuan, Jin-Feng

2009-11-01

Reducing amplicon sizes has become a major strategy for analyzing degraded DNA typical of forensic samples. However, amplicon sizes in current mini-short tandem repeat-polymerase chain reaction (PCR) and mini-sequencing assays are still not suitable for analysis of severely degraded DNA. In this study, we present a multiplex typing method that couples ligase detection reaction with PCR that can be used to identify single nucleotide polymorphisms and small-scale insertion/deletions in a sample of severely fragmented DNA. This method adopts thermostable ligation for allele discrimination and subsequent PCR for signal enhancement. In this study, four polymorphic loci were used to assess the ability of this technique to discriminate alleles in an artificially degraded sample of DNA with fragment sizes <100 bp. Our results showed clear allelic discrimination of single or multiple loci, suggesting that this method might aid in the analysis of extremely degraded samples in which allelic drop out of larger fragments is observed.
Control of electrochemical signals from quantum dots conjugated to organic materials by using DNA structure in an analog logic gate.

PubMed

Chen, Qi; Yoo, Si-Youl; Chung, Yong-Ho; Lee, Ji-Young; Min, Junhong; Choi, Jeong-Woo

2016-10-01

Various bio-logic gates have been studied intensively to overcome the rigidity of single-function silicon-based logic devices arising from combinations of various gates. Here, a simple control tool using electrochemical signals from quantum dots (QDs) was constructed using DNA and organic materials for multiple logic functions. The electrochemical redox current generated from QDs was controlled by the DNA structure. DNA structure, in turn, was dependent on the components (organic materials) and the input signal (pH). Independent electrochemical signals from two different logic units containing QDs were merged into a single analog-type logic gate, which was controlled by two inputs. We applied this electrochemical biodevice to a simple logic system and achieved various logic functions from the controlled pH input sets. This could be further improved by choosing QDs, ionic conditions, or DNA sequences. This research provides a feasible method for fabricating an artificial intelligence system. Copyright © 2016 Elsevier B.V. All rights reserved.
A two-dimensional DNA lattice implanted polymer solar cell.

PubMed

Lee, Keun Woo; Kim, Kyung Min; Lee, Junwye; Amin, Rashid; Kim, Byeonghoon; Park, Sung Kye; Lee, Seok Kiu; Park, Sung Ha; Kim, Hyun Jae

2011-09-16

A double crossover tile based artificial two-dimensional (2D) DNA lattice was fabricated and the dry-wet method was introduced to recover an original DNA lattice structure in order to deposit DNA lattices safely on the organic layer without damaging the layer. The DNA lattice was then employed as an electron blocking layer in a polymer solar cell causing an increase of about 10% up to 160% in the power conversion efficiency. Consequently, the resulting solar cell which had an artificial 2D DNA blocking layer showed a significant enhancement in power conversion efficiency compared to conventional polymer solar cells. It should be clear that the artificial DNA nanostructure holds unique physical properties that are extremely attractive for various energy-related and photonic applications.
PNA binding to the non-template DNA strand interferes with transcription, suggesting a blockage mechanism mediated by R-loop formation.

PubMed

Belotserkovskii, Boris P; Hanawalt, Philip C

2015-11-01

Peptide Nucleic Acids (PNAs) are artificial DNA mimics with superior nucleic acid binding capabilities. T7 RNA polymerase (T7 RNAP) transcription upon encountering PNA bound to the non-template DNA strand was studied in vitro. A characteristic pattern of blockage signals was observed, extending downstream from the PNA binding site, similar to that produced by G-rich homopurine-homopyrimidine (hPu-hPy) sequences and likely caused by R-loop formation. Since blocked transcription complexes in association with stable R-loops may interfere with replication and in some cases trigger apoptosis, targeted R-loop formation might be employed to inactivate selected cells, such as those in tumors, based upon their unique complement of expressed genes. © 2014 The Authors. Molecular Carcinogenesis published by Wiley Periodicals, Inc.
In vitro gene expression by cationized derivatives of an artificial protein with repeated RGD sequences, Pronectin.

PubMed

Hosseinkhani, Hossein; Tabata, Yasuhiko

2003-01-09

The objective of this study is to investigate the efficiency of a non-viral gene carrier with RGD sequences, Pronectin F(+) for gene transfection. The Pronectin F(+) was cationized by introducing ethylenediamine (Ed), spermidine (Sd), and spermine (Sm) to the hydroxyl groups while the corresponding gelatin derivative was prepared similarly because gelatin also has one RGD sequence per molecule. The zeta potential and molecular size of Pronectin F(+) and gelatin derivatives were examined before and after polyion complexation with a plasmid DNA of luciferase. When complexed with the plasmid DNA at the Pronectin F(+)/plasmid DNA mixing ratio of 50, the complex exhibited a zeta potential of about 10 mV, which is similar to that of the gelatin derivative-plasmid DNA complex. Irrespective of the type of Pronectin F(+) and gelatin derivatives, their complexation enabled the apparent molecular size of plasmid DNA to reduce to about 200 nm, the size decreasing with the increased derivative/plasmid DNA weight mixing ratio. The rat gastric mucosal (RGM)-1 cells treated with both complexes exhibited significantly stronger luciferase activities than free plasmid DNA although the enhanced extent was significant for the Sm derivative compared with the corresponding Ed and Sd derivatives. Cell attachment was enhanced by the Pronectin F(+) derivative to a significant high extent compared with the gelatin derivative. The amount of plasmid DNA internalized into the cells was enhanced by the complexation with every Pronectin F(+) derivative compared with the gelatin derivative. For both of Pronectin F(+) and gelatin carriers, the buffering capacity of Sm derivatives was higher than that of Ed and Sd derivatives and comparable to that of polyethyleneimine. It is likely that the high efficiency of gene transfection for the Sm derivative is due to the superior buffering effect. We conclude that the Sm derivative of Pronectin F(+) is promising as a non-viral vector of gene transfection.
Acidovorax anthurii sp. nov., a new phytopathogenic bacterium which causes bacterial leaf-spot of anthurium.

PubMed

Gardan, L; Dauga, C; Prior, P; Gillis, M; Saddler, G S

2000-01-01

The bacterial leaf-spot of anthurium emerged during the 1980s, in the French West Indies and Trinidad. This new bacterial disease is presently widespread and constitutes a serious limiting factor for commercial anthurium production. Twenty-nine strains isolated from leaf-spots of naturally infected anthurium were characterized and compared with reference strains belonging to the Comamonadaceae family, the genera Ralstonia and Burkholderia, and representative fluorescent pseudomonads. From artificial inoculations 25 out of 29 strains were pathogenic on anthurium. Biochemical and physiological tests, fatty acid analysis, DNA-DNA hybridization, 16S rRNA gene sequence analysis, and DNA-16S RNA hybridization were performed. The 25 pathogenic strains on anthurium were clustered in one phenon closely related to phytopathogenic strains of the genus Acidovorax. Anthurium strains were 79-99% (deltaTm range 0.2-1.6) related to the strain CFBP 3232 and constituted a discrete DNA homology group indicating that they belong to the same species. DNA-rRNA hybridization, 16S rRNA sequence and fatty acid analysis confirmed that this new species belongs to the beta-subclass of Proteobacteria and to rRNA superfamily III, to the family of Comamonadaceae and to the genus Acidovorax. The name Acidovorax anthurii is proposed for this new phytopathogenic bacterium. The type strain has been deposited in the Collection Française des Bactéries Phytopathogènes as CFBP 3232T.
Artificial selection increased body weight but induced increase of runs of homozygosity in Hanwoo cattle

PubMed Central

Kim, Kwondo; Jung, Jaehoon; Caetano-Anollés, Kelsey; Sung, Samsun; Yoo, DongAhn; Choi, Bong-Hwan; Kim, Hyung-Chul; Jeong, Jin-Young; Cho, Yong-Min; Park, Eung-Woo; Choi, Tae-Jeong; Park, Byoungho; Lim, Dajeong

2018-01-01

Artificial selection has been demonstrated to have a rapid and significant effect on the phenotype and genome of an organism. However, most previous studies on artificial selection have focused solely on genomic sequences modified by artificial selection or genomic sequences associated with a specific trait. In this study, we generated whole genome sequencing data of 126 cattle under artificial selection, and 24,973,862 single nucleotide variants to investigate the relationship among artificial selection, genomic sequences and trait. Using runs of homozygosity detected by the variants, we showed increase of inbreeding for decades, and at the same time demonstrated a little influence of recent inbreeding on body weight. Also, we could identify ~0.2 Mb runs of homozygosity segment which may be created by recent artificial selection. This approach may aid in development of genetic markers directly influenced by artificial selection, and provide insight into the process of artificial selection. PMID:29561881
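To make the notion of a run of homozygosity concrete, here is a minimal sketch that scans genotype calls along one chromosome and reports stretches of consecutive homozygous variants, together with the fraction of the genome they cover (often written F_ROH). The 0/1/2 genotype coding, the thresholds and the toy data are assumptions for illustration; the abstract does not state the detection criteria used, and dedicated tools typically tolerate some heterozygous or missing calls, which this sketch does not.

```python
def find_roh(positions, genotypes, min_snps=3, min_length_bp=1):
    """Return (start_bp, end_bp) for each run of consecutive homozygous calls.

    Genotypes use an assumed 0/1/2 coding in which 1 means heterozygous.
    """
    if len(positions) != len(genotypes):
        raise ValueError("positions and genotypes must have equal length")
    runs = []
    run_start = None  # index of the first SNP in the current run
    # A sentinel heterozygous call at the end closes any run still open.
    for i, call in enumerate(list(genotypes) + [1]):
        if call != 1:
            if run_start is None:
                run_start = i
        elif run_start is not None:
            n_snps = i - run_start
            length = positions[i - 1] - positions[run_start]
            if n_snps >= min_snps and length >= min_length_bp:
                runs.append((positions[run_start], positions[i - 1]))
            run_start = None
    return runs


def f_roh(runs, genome_length_bp):
    """Fraction of the genome covered by the given runs."""
    return sum(end - start for start, end in runs) / genome_length_bp
```

With toy positions `[100, 200, ..., 700]` and calls `[0, 0, 2, 1, 2, 2, 0]`, the single heterozygous call splits the chromosome into two runs of three SNPs each.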
A genome-wide BAC-end sequence survey provides first insights into sweetpotato (Ipomoea batatas (L.) Lam.) genome composition.

PubMed

Si, Zengzhi; Du, Bing; Huo, Jinxi; He, Shaozhen; Liu, Qingchang; Zhai, Hong

2016-11-21

Sweetpotato, Ipomoea batatas (L.) Lam., is an important food crop widely grown in the world. However, little is known about the genome of this species because it is a highly heterozygous hexaploid. Gaining a more in-depth knowledge of sweetpotato genome is therefore necessary and imperative. In this study, the first bacterial artificial chromosome (BAC) library of sweetpotato was constructed. Clones from the BAC library were end-sequenced and analyzed to provide genome-wide information about this species. The BAC library contained 240,384 clones with an average insert size of 101 kb and had a 7.93-10.82 × coverage of the genome, and the probability of isolating any single-copy DNA sequence from the library was more than 99%. Both ends of 8310 BAC clones randomly selected from the library were sequenced to generate 11,542 high-quality BAC-end sequences (BESs), with an accumulative length of 7,595,261 bp and an average length of 658 bp. Analysis of the BESs revealed that 12.17% of the sweetpotato genome were known repetitive DNA, including 7.37% long terminal repeat (LTR) retrotransposons, 1.15% Non-LTR retrotransposons and 1.42% Class II DNA transposons etc., 18.31% of the genome were identified as sweetpotato-unique repetitive DNA and 10.00% of the genome were predicted to be coding regions. In total, 3,846 simple sequences repeats (SSRs) were identified, with a density of one SSR per 1.93 kb, from which 288 SSRs primers were designed and tested for length polymorphism using 20 sweetpotato accessions, 173 (60.07%) of them produced polymorphic bands. Sweetpotato BESs had significant hits to the genome sequences of I. trifida and more matches to the whole-genome sequences of Solanum lycopersicum than those of Vitis vinifera, Theobroma cacao and Arabidopsis thaliana. The first BAC library for sweetpotato has been successfully constructed. 
The high quality BESs provide first insights into sweetpotato genome composition, and have significant hits to the genome sequences of I. trifida and more matches to the whole-genome sequences of Solanum lycopersicum. These resources as a robust platform will be used in high-resolution mapping, gene cloning, assembly of genome sequences, comparative genomics and evolution for sweetpotato.
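The quoted coverage and the "more than 99%" probability can be related with the standard Clarke-Carbon estimate, P = 1 - (1 - f)^N, where f is the insert size divided by the genome size and N is the number of clones; for large N this is approximately 1 - e^(-coverage). The sketch below applies it to the figures in the abstract. The genome sizes it derives are only those implied by the stated coverage range, not values reported in the abstract.

```python
import math

N_CLONES = 240_384          # clones in the library (from the abstract)
INSERT_BP = 101_000         # average insert size, 101 kb (from the abstract)
COVERAGE_RANGE = (7.93, 10.82)  # stated genome coverage


def p_from_coverage(coverage):
    """Approximate probability of recovering a given single-copy sequence."""
    return 1.0 - math.exp(-coverage)


def p_clarke_carbon(n_clones, insert_bp, genome_bp):
    """Clarke-Carbon probability, P = 1 - (1 - f)^N with f = insert / genome."""
    fraction = insert_bp / genome_bp
    return 1.0 - (1.0 - fraction) ** n_clones


def implied_genome_size(n_clones, insert_bp, coverage):
    """Genome size (bp) implied by clone count, insert size and coverage."""
    return n_clones * insert_bp / coverage


for cov in COVERAGE_RANGE:
    size = implied_genome_size(N_CLONES, INSERT_BP, cov)
    print(f"{cov:>5}x  implied genome {size / 1e9:.2f} Gb  "
          f"P = {p_from_coverage(cov):.5f}")
```

Even at the lower coverage bound the estimated probability exceeds 0.999, consistent with the abstract's statement.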
A multi-scale analysis of bull sperm methylome revealed both species peculiarities and conserved tissue-specific features.

PubMed

Perrier, Jean-Philippe; Sellem, Eli; Prézelin, Audrey; Gasselin, Maxime; Jouneau, Luc; Piumi, François; Al Adhami, Hala; Weber, Michaël; Fritz, Sébastien; Boichard, Didier; Le Danvic, Chrystelle; Schibler, Laurent; Jammes, Hélène; Kiefer, Hélène

2018-05-29

Spermatozoa have a remarkable epigenome in line with their degree of specialization, their unique nature and different requirements for successful fertilization. Accordingly, perturbations in the establishment of DNA methylation patterns during male germ cell differentiation have been associated with infertility in several species. While bull semen is widely used in artificial insemination, the literature describing DNA methylation in bull spermatozoa is still scarce. The purpose of this study was therefore to characterize the bull sperm methylome relative to both bovine somatic cells and the sperm of other mammals through a multiscale analysis. The quantification of DNA methylation at CCGG sites using luminometric methylation assay (LUMA) highlighted the undermethylation of bull sperm compared to the sperm of rams, stallions, mice, goats and men. Total blood cells displayed a similarly high level of methylation in bulls and rams, suggesting that undermethylation of the bovine genome was specific to sperm. Annotation of CCGG sites in different species revealed no striking bias in the distribution of genome features targeted by LUMA that could explain undermethylation of bull sperm. To map DNA methylation at a genome-wide scale, bull sperm was compared with bovine liver, fibroblasts and monocytes using reduced representation bisulfite sequencing (RRBS) and immunoprecipitation of methylated DNA followed by microarray hybridization (MeDIP-chip). These two methods exhibited differences in terms of genome coverage, and consistently, two independent sets of sequences differentially methylated in sperm and somatic cells were identified for RRBS and MeDIP-chip. Remarkably, in the two sets most of the differentially methylated sequences were hypomethylated in sperm. 
In agreement with previous studies in other species, the sequences that were specifically hypomethylated in bull sperm targeted processes relevant to the germline differentiation program (piRNA metabolism, meiosis, spermatogenesis) and sperm functions (cell adhesion, fertilization), as well as satellites and rDNA repeats. These results highlight the undermethylation of bull spermatozoa when compared with both bovine somatic cells and the sperm of other mammals, and raise questions regarding the dynamics of DNA methylation in bovine male germline. Whether sperm undermethylation has potential interactions with structural variation in the cattle genome may deserve further attention.
Organization of Synthetic Alphoid DNA Array in Human Artificial Chromosome (HAC) with a Conditional Centromere

PubMed Central

Kouprina, Natalay; Samoshkin, Alexander; Erliandri, Indri; Nakano, Megumi; Lee, Hee-Sheung; Fu, Haiging; Iida, Yuichi; Aladjem, Mirit; Oshimura, Mitsuo; Masumoto, Hiroshi; Earnshaw, William C.; Larionov, Vladimir

2012-01-01

Human artificial chromosomes (HACs) represent a novel promising episomal system for functional genomics, gene therapy and synthetic biology. HACs are engineered from natural and synthetic alphoid DNA arrays upon transfection into human cells. The use of HACs for gene expression studies requires the knowledge of their structural organization. However, none of the de novo HACs constructed so far has been physically mapped in detail. Recently we constructed a synthetic alphoidtetO-HAC that was successfully used for expression of full-length genes to correct genetic deficiencies in human cells. The HAC can be easily eliminated from cell populations by inactivation of its conditional kinetochore. This unique feature provides a control for phenotypic changes attributed to expression of HAC-encoded genes. This work describes organization of a megabase-size synthetic alphoid DNA array in the alphoidtetO-HAC that has been formed from a ~50 kb synthetic alphoidtetO-construct. Our analysis showed that this array represents a 1.1 Mb continuous sequence assembled from multiple copies of input DNA, a significant part of which was rearranged before assembling. The tandem and inverted alphoid DNA repeats in the HAC range in size from 25 to 150 kb. In addition, we demonstrated that the structure and functional domains of the HAC remain unchanged after several rounds of its transfer into different host cells. The knowledge of the alphoidtetO-HAC structure provides a tool to control HAC integrity during different manipulations. Our results also shed light on a mechanism for de novo HAC formation in human cells. PMID:23411994
Transcription of telomeric DNA leads to high levels of homologous recombination and t-loops.

PubMed

Kar, Anirban; Willcox, Smaranda; Griffith, Jack D

2016-11-02

The formation of DNA loops at chromosome ends (t-loops) and the transcription of telomeres producing G-rich RNA (TERRA) represent two central features of telomeres. To explore a possible link between them we employed artificial human telomeres containing long arrays of TTAGGG repeats flanked by the T7 or T3 promoters. Transcription of these DNAs generates a high frequency of t-loops within individual molecules and homologous recombination events between different DNAs at their telomeric sequences. T-loop formation does not require a single strand overhang, arguing that both terminal strands insert into the preceding duplex. The loops are very stable and some RNase H resistant TERRA remains at the t-loop, likely adding to their stability. Transcription of DNAs containing TTAGTG or TGAGTG repeats showed greatly reduced loop formation. While in the cell multiple pathways may lead to t-loop formation, the pathway revealed here does not depend on the shelterins but rather on the unique character of telomeric DNA when it is opened for transcription. Hence, telomeric sequences may have evolved to facilitate their ability to loop back on themselves. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Microinjection of CRISPR/Cas9 Protein into Channel Catfish, Ictalurus punctatus, Embryos for Gene Editing.

PubMed

Elaswad, Ahmed; Khalil, Karim; Cline, David; Page-McCaw, Patrick; Chen, Wenbiao; Michel, Maximilian; Cone, Roger; Dunham, Rex

2018-01-20

The complete genome of the channel catfish, Ictalurus punctatus, has been sequenced, leading to greater opportunities for studying channel catfish gene function. Gene knockout has been used to study these gene functions in vivo. The clustered regularly interspaced short palindromic repeats/CRISPR associated protein 9 (CRISPR/Cas9) system is a powerful tool used to edit genomic DNA sequences to alter gene function. While the traditional approach has been to introduce CRISPR/Cas9 mRNA into the single cell embryos through microinjection, this can be a slow and inefficient process in catfish. Here, a detailed protocol for microinjection of channel catfish embryos with CRISPR/Cas9 protein is described. Briefly, eggs and sperm were collected and then artificial fertilization performed. Fertilized eggs were transferred to a Petri dish containing Holtfreter's solution. Injection volume was calibrated and then guide RNAs/Cas9 targeting the toll/interleukin 1 receptor domain-containing adapter molecule (TICAM 1) gene and rhamnose binding lectin (RBL) gene were microinjected into the yolk of one-cell embryos. The gene knockout was successful as indels were confirmed by DNA sequencing. The predicted protein sequence alterations due to these mutations included frameshift and truncated protein due to premature stop codons.
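The link between an indel, a frameshift and a premature stop codon can be sketched as follows: an insertion or deletion whose net length is not a multiple of three shifts the reading frame, and scanning the shifted frame codon by codon shows where the first stop now falls. The toy coding sequence is hypothetical and is not taken from the TICAM 1 or RBL genes.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def is_frameshift(ref_len, alt_len):
    """True if the net length change is not a multiple of three."""
    return (alt_len - ref_len) % 3 != 0


def first_stop_codon(cds):
    """Index (0-based, in codons) of the first in-frame stop, or None."""
    cds = cds.upper()
    for i in range(0, len(cds) - 2, 3):
        if cds[i:i + 3] in STOP_CODONS:
            return i // 3
    return None


def apply_deletion(cds, start, length):
    """Return the sequence with `length` bases removed starting at `start`."""
    return cds[:start] + cds[start + length:]


# Hypothetical wild-type coding sequence: ATG GCC TTG AAC GGT TAA
WILD_TYPE = "ATGGCCTTGAACGGTTAA"
# A 1-bp deletion after the start codon shifts the frame: ATG CCT TGA ...
MUTANT = apply_deletion(WILD_TYPE, 3, 1)
```

In this toy case the stop moves from the sixth codon to the third, so the predicted protein is truncated; a 3-bp deletion at the same site would instead remove one codon without shifting the frame.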
An Efficient Method for High-Fidelity BAC/PAC Retrofitting with a Selectable Marker for Mammalian Cell Transfection

PubMed Central

Wang, Zunde; Engler, Peter; Longacre, Angelika; Storb, Ursula

2001-01-01

Large-scale genomic sequencing projects have provided DNA sequence information for many genes, but the biological functions for most of them will only be known through functional studies. Bacterial artificial chromosomes (BACs) and P1-derived artificial chromosomes (PACs) are large genomic clones stably maintained in bacteria and are very important in functional studies through transfection because of their large size and stability. Because most BAC or PAC vectors do not have a mammalian selection marker, transfecting mammalian cells with genes cloned in BACs or PACs requires the insertion into the BAC/PAC of a mammalian selectable marker. However, currently available procedures are not satisfactory in efficiency and fidelity. We describe a very simple and efficient procedure that allows one to retrofit dozens of BACs in a day with no detectable deletions or unwanted recombination. We use a BAC/PAC retrofitting vector that, on transformation into competent BAC or PAC strains, will catalyze the specific insertion of itself into BAC/PAC vectors through in vivo cre/loxP site-specific recombination. PMID:11156622
Engineering artificial machines from designable DNA materials for biomedical applications.

PubMed

Qi, Hao; Huang, Guoyou; Han, Yulong; Zhang, Xiaohui; Li, Yuhui; Pingguan-Murphy, Belinda; Lu, Tian Jian; Xu, Feng; Wang, Lin

2015-06-01

Deoxyribonucleic acid (DNA) has emerged as a building block for fabricating nanostructures with fully artificial architecture and geometry. The remarkable ability of DNA to form two- and three-dimensional structures raises the possibility of developing smart nanomachines with versatile controllability for various applications. Here, we review recent progress in engineering DNA machines for specific bioengineering and biomedical applications.
Engineering Artificial Machines from Designable DNA Materials for Biomedical Applications

PubMed Central

Huang, Guoyou; Han, Yulong; Zhang, Xiaohui; Li, Yuhui; Pingguan-Murphy, Belinda; Lu, Tian Jian; Xu, Feng

2015-01-01

Deoxyribonucleic acid (DNA) has emerged as a building block for fabricating nanostructures with fully artificial architecture and geometry. The remarkable ability of DNA to form two- and three-dimensional structures raises the possibility of developing smart nanomachines with versatile controllability for various applications. Here, we review recent progress in engineering DNA machines for specific bioengineering and biomedical applications. PMID:25547514
Construction, Characterization, and Preliminary BAC-End Sequence Analysis of a Bacterial Artificial Chromosome Library of the Tea Plant (Camellia sinensis)

PubMed Central

Lin, Jinke; Kudrna, Dave; Wing, Rod A.

2011-01-01

We describe the construction and characterization of a publicly available BAC library for the tea plant, Camellia sinensis. Using modified methods, the library was constructed with the aim of developing public molecular resources to advance tea plant genomics research. The library consists of a total of 401,280 clones with an average insert size of 135 kb, providing an approximate coverage of 13.5 haploid genome equivalents. No empty vector clones were observed in a random sampling of 576 BAC clones. Further analysis of 182 BAC-end sequences from randomly selected clones revealed a GC content of 40.35% and low chloroplast and mitochondrial contamination. Repetitive sequence analyses indicated that LTR retrotransposons were the most predominant sequence class (86.93%–87.24%), followed by DNA retrotransposons (11.16%–11.69%). Additionally, we found 25 simple sequence repeats (SSRs) that could potentially be used as genetic markers. PMID:21234344
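The coverage figure above is the usual ratio of total cloned DNA to haploid genome size. The abstract does not state the genome size it assumed, so the short Python sketch below back-calculates the size implied by the reported numbers; the helper name is hypothetical.

```python
def library_coverage(n_clones: int, mean_insert_bp: float, genome_bp: float) -> float:
    """Genome equivalents = (number of clones x mean insert size) / haploid genome size."""
    return n_clones * mean_insert_bp / genome_bp


# Values reported in the abstract.
n_clones = 401_280
mean_insert_bp = 135_000
reported_coverage = 13.5

# Haploid genome size implied by those three values (not stated in the abstract).
implied_genome_bp = n_clones * mean_insert_bp / reported_coverage

print(f"implied genome size: {implied_genome_bp / 1e9:.2f} Gb")
print(f"coverage check: {library_coverage(n_clones, mean_insert_bp, implied_genome_bp):.1f}x")
```

The implied value of roughly 4 Gb is a consequence of the arithmetic, not an independent measurement.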
Self-Organizing Hidden Markov Model Map (SOHMMM).

PubMed

Ferles, Christos; Stafylopatis, Andreas

2013-12-01

A hybrid approach combining the Self-Organizing Map (SOM) and the Hidden Markov Model (HMM) is presented. The Self-Organizing Hidden Markov Model Map (SOHMMM) establishes a cross-section between the theoretic foundations and algorithmic realizations of its constituents. The respective architectures and learning methodologies are fused in an attempt to meet the increasing requirements imposed by the properties of deoxyribonucleic acid (DNA), ribonucleic acid (RNA), and protein chain molecules. The fusion and synergy of the SOM unsupervised training and the HMM dynamic programming algorithms bring forth a novel on-line gradient descent unsupervised learning algorithm, which is fully integrated into the SOHMMM. Since the SOHMMM carries out probabilistic sequence analysis with little or no prior knowledge, it can have a variety of applications in clustering, dimensionality reduction and visualization of large-scale sequence spaces, and also, in sequence discrimination, search and classification. Two series of experiments based on artificial sequence data and splice junction gene sequences demonstrate the SOHMMM's characteristics and capabilities. Copyright © 2013 Elsevier Ltd. All rights reserved.
Transcription factor trapping by RNA in gene regulatory elements.

PubMed

Sigova, Alla A; Abraham, Brian J; Ji, Xiong; Molinie, Benoit; Hannett, Nancy M; Guo, Yang Eric; Jangi, Mohini; Giallourakis, Cosmas C; Sharp, Phillip A; Young, Richard A

2015-11-20

Transcription factors (TFs) bind specific sequences in promoter-proximal and -distal DNA elements to regulate gene transcription. RNA is transcribed from both of these DNA elements, and some DNA binding TFs bind RNA. Hence, RNA transcribed from regulatory elements may contribute to stable TF occupancy at these sites. We show that the ubiquitously expressed TF Yin-Yang 1 (YY1) binds to both gene regulatory elements and their associated RNA species across the entire genome. Reduced transcription of regulatory elements diminishes YY1 occupancy, whereas artificial tethering of RNA enhances YY1 occupancy at these elements. We propose that RNA makes a modest but important contribution to the maintenance of certain TFs at gene regulatory elements and suggest that transcription of regulatory elements produces a positive-feedback loop that contributes to the stability of gene expression programs. Copyright © 2015, American Association for the Advancement of Science.
Mutation at a distance caused by homopolymeric guanine repeats in Saccharomyces cerevisiae

PubMed Central

McDonald, Michael J.; Yu, Yen-Hsin; Guo, Jheng-Fen; Chong, Shin Yen; Kao, Cheng-Fu; Leu, Jun-Yi

2016-01-01

Mutation provides the raw material from which natural selection shapes adaptations. The rate at which new mutations arise is therefore a key factor that determines the tempo and mode of evolution. However, an accurate assessment of the mutation rate of a given organism is difficult because mutation rate varies on a fine scale within a genome. A central challenge of evolutionary genetics is to determine the underlying causes of this variation. In earlier work, we had shown that repeat sequences not only are prone to a high rate of expansion and contraction but also can cause an increase in mutation rate (on the order of kilobases) of the sequence surrounding the repeat. We perform experiments that show that simple guanine repeats 13 bp (base pairs) in length or longer (G13+) increase the substitution rate 4- to 18-fold in the downstream DNA sequence, and this correlates with DNA replication timing (R = 0.89). We show that G13+ mutagenicity results from the interplay of both error-prone translesion synthesis and homologous recombination repair pathways. The mutagenic repeats that we study have the potential to be exploited for the artificial elevation of mutation rate in systems biology and synthetic biology applications. PMID:27386516
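Locating candidate G13+ tracts in a sequence is a simple pattern search. The sketch below is an illustrative Python helper (the name `find_g_runs` is hypothetical, and this is not the authors' pipeline); it scans only the strand it is given, so a poly-C run on that strand, which is a G run on the complementary strand, would need a second pass.

```python
import re


def find_g_runs(seq: str, min_len: int = 13):
    """Return (start, length) for every run of at least min_len consecutive G's."""
    pattern = re.compile(r"G{%d,}" % min_len)
    return [(m.start(), m.end() - m.start()) for m in pattern.finditer(seq.upper())]


# Toy sequence: one 13-bp G run (reported) and one 5-bp G run (ignored).
toy = "ACGT" + "G" * 13 + "TTT" + "G" * 5 + "A"
print(find_g_runs(toy))  # [(4, 13)]
```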

Evaluation of nearest-neighbor methods for detection of chimeric small-subunit rRNA sequences

NASA Technical Reports Server (NTRS)

Robison-Cox, J. F.; Bateson, M. M.; Ward, D. M.

1995-01-01

Detection of chimeric artifacts formed when PCR is used to retrieve naturally occurring small-subunit (SSU) rRNA sequences may rely on demonstrating that different sequence domains have different phylogenetic affiliations. We evaluated the CHECK_CHIMERA method of the Ribosomal Database Project and another method which we developed, both based on determining nearest neighbors of different sequence domains, for their ability to discern artificially generated SSU rRNA chimeras from authentic Ribosomal Database Project sequences. The reliability of both methods decreases when the parental sequences which contribute to chimera formation are more than 82 to 84% similar. Detection is also complicated by the occurrence of authentic SSU rRNA sequences that behave like chimeras. We developed a naive statistical test based on CHECK_CHIMERA output and used it to evaluate previously reported SSU rRNA chimeras. Application of this test also suggests that chimeras might be formed by retrieving SSU rRNAs as cDNA. The amount of uncertainty associated with nearest-neighbor analyses indicates that such tests alone are insufficient and that better methods are needed.
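The nearest-neighbor idea described above can be sketched in a few lines: split a query at a candidate breakpoint, find the most similar reference for each part, and flag the query when the two parts point to different references. This Python sketch is not CHECK_CHIMERA itself. It assumes pre-aligned sequences of equal length, uses plain percent identity, and all names and toy sequences are hypothetical.

```python
def identity(a: str, b: str) -> float:
    """Fraction of identical positions between two equal-length aligned fragments."""
    return sum(x == y for x, y in zip(a, b)) / len(a)


def nearest_neighbor(fragment: str, references: dict, start: int, end: int) -> str:
    """Name of the reference whose [start:end] slice is most similar to the fragment."""
    return max(references, key=lambda name: identity(fragment, references[name][start:end]))


def looks_chimeric(query: str, references: dict, breakpoint: int):
    """Compare the nearest neighbors of the two domains on either side of the breakpoint."""
    left = nearest_neighbor(query[:breakpoint], references, 0, breakpoint)
    right = nearest_neighbor(query[breakpoint:], references, breakpoint, len(query))
    return left != right, left, right


# Toy references and an artificial chimera made from their halves.
refs = {"A": "AAAAAAAAAA", "B": "CCCCCCCCCC"}
print(looks_chimeric("AAAAACCCCC", refs, 5))  # (True, 'A', 'B')
print(looks_chimeric("AAAAAAAAAA", refs, 5))  # (False, 'A', 'A')
```

With such well-separated parents the call is trivial. As the abstract notes, reliability drops once the parental sequences are more than about 82 to 84% similar, because both domains then score almost equally well against either parent.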
Comparative performance of high-density oligonucleotide sequencing and dideoxynucleotide sequencing of HIV type 1 pol from clinical samples.

PubMed

Günthard, H F; Wong, J K; Ignacio, C C; Havlir, D V; Richman, D D

1998-07-01

The performance of the high-density oligonucleotide array methodology (GeneChip) in detecting drug resistance mutations in HIV-1 pol was compared with that of automated dideoxynucleotide sequencing (ABI) of clinical samples, viral stocks, and plasmid-derived NL4-3 clones. Sequences from 29 clinical samples (plasma RNA, n = 17; lymph node RNA, n = 5; lymph node DNA, n = 7) from 12 patients, from 6 viral stock RNA samples, and from 13 NL4-3 clones were generated by both methods. Editing was done independently by a different investigator for each method before comparing the sequences. In addition, NL4-3 wild type (WT) and mutants were mixed in varying concentrations and sequenced by both methods. Overall, a concordance of 99.1% was found for a total of 30,865 bases compared. The comparison of clinical samples (plasma RNA and lymph node RNA and DNA) showed a slightly lower match of base calls, 98.8% for 19,831 nucleotides compared (protease region, 99.5%, n = 8272; RT region, 98.3%, n = 11,316), than for viral stocks and NL4-3 clones (protease region, 99.8%; RT region, 99.5%). Artificial mixing experiments showed a bias toward calling wild-type bases by GeneChip. Discordant base calls are most likely due to differential detection of mixtures. The concordance between GeneChip and ABI was high and appeared dependent on the nature of the templates (directly amplified versus cloned) and the complexity of mixes.
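The overall concordance for clinical samples is a base-weighted average of the per-region figures. The Python sketch below recomputes it from the numbers given in the abstract. Note that the two regional counts quoted there sum to 19,588 rather than the 19,831 nucleotides listed for the comparison; the sketch does not attempt to explain that difference.

```python
# (concordance, bases compared) per region for clinical samples, as reported.
regions = {
    "protease": (0.995, 8272),
    "RT": (0.983, 11316),
}

total_bases = sum(n for _, n in regions.values())
pooled = sum(rate * n for rate, n in regions.values()) / total_bases

print(total_bases)              # 19588
print(f"{pooled * 100:.1f}%")   # 98.8%
```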
DNA cytoskeleton for stabilizing artificial cells.

PubMed

Kurokawa, Chikako; Fujiwara, Kei; Morita, Masamune; Kawamata, Ibuki; Kawagishi, Yui; Sakai, Atsushi; Murayama, Yoshihiro; Nomura, Shin-Ichiro M; Murata, Satoshi; Takinoue, Masahiro; Yanagisawa, Miho

2017-07-11

Cell-sized liposomes and droplets coated with lipid layers have been used as platforms for understanding live cells, constructing artificial cells, and implementing functional biomedical tools such as biosensing platforms and drug delivery systems. However, these systems are very fragile, which results from the absence of cytoskeletons in these systems. Here, we construct an artificial cytoskeleton using DNA nanostructures. The designed DNA oligomers form a Y-shaped nanostructure and connect to each other with their complementary sticky ends to form networks. To undercoat lipid membranes with this DNA network, we used cationic lipids that attract negatively charged DNA. By encapsulating the DNA into the droplets, we successfully created a DNA shell underneath the membrane. The DNA shells increased interfacial tension, elastic modulus, and shear modulus of the droplet surface, consequently stabilizing the lipid droplets. Such drastic changes in stability were detected only when the DNA shell was in the gel phase. Furthermore, we demonstrate that liposomes with the DNA gel shell are substantially tolerant against outer osmotic shock. These results clearly show the DNA gel shell is a stabilizer of the lipid membrane akin to the cytoskeleton in live cells.
Genetic Divergence between Camellia sinensis and Its Wild Relatives Revealed via Genome-Wide SNPs from RAD Sequencing.

PubMed

Yang, Hua; Wei, Chao-Ling; Liu, Hong-Wei; Wu, Jun-Lan; Li, Zheng-Guo; Zhang, Liang; Jian, Jian-Bo; Li, Ye-Yun; Tai, Yu-Ling; Zhang, Jing; Zhang, Zheng-Zhu; Jiang, Chang-Jun; Xia, Tao; Wan, Xiao-Chun

2016-01-01

Tea is one of the most popular beverages across the world and is made exclusively from cultivars of Camellia sinensis. Many wild relatives of the genus Camellia that are closely related to C. sinensis are native to Southwest China. In this study, we first identified the distinct genetic divergence between C. sinensis and its wild relatives and provided a glimpse into the artificial selection of tea plants at a genome-wide level by analyzing 15,444 genomic SNPs that were identified from 18 cultivated and wild tea accessions using a high-throughput genome-wide restriction site-associated DNA sequencing (RAD-Seq) approach. Six distinct clusters were detected by phylogeny inference and principal component and genetic structural analyses, and these clusters corresponded to six Camellia species/varieties. Genetic divergence apparently indicated that C. taliensis var. bangwei is a semi-wild or transient landrace occupying a phylogenetic position between those wild and cultivated tea plants. Cultivated accessions exhibited greater heterozygosity than wild accessions, with the exception of C. taliensis var. bangwei. Thirteen genes with non-synonymous SNPs exhibited strong selective signals that were suggestive of putative artificial selective footprints for tea plants during domestication. The genome-wide SNPs provide a fundamental data resource for assessing genetic relationships, characterizing complex traits, comparing heterozygosity and analyzing putative artificial selection in tea plants.
Altered imprinted gene expression and methylation patterns in mid-gestation aborted cloned porcine fetuses and placentas.

PubMed

Zhang, Xiaoyang; Wang, Dongxu; Han, Yang; Duan, Feifei; Lv, Qinyan; Li, Zhanjun

2014-11-01

To determine the expression patterns of imprinted genes and their methylation status in aborted cloned porcine fetuses and placentas. RNA and DNA were prepared from fetuses and placentas that were produced by SCNT and controls from artificial insemination. The expression of 18 imprinted genes was determined by quantitative real-time PCR (q-PCR). Bisulfite sequencing PCR (BSP) was conducted to determine the methylation status of PRE-1 short interspersed repetitive element (SINE), satellite DNA and H19 differentially methylated region 3 (DMR3). The weight, imprinted gene expression and genome-wide DNA methylation patterns were compared between the mid-gestation aborted and normal control samples. The results showed hypermethylation of PRE-1 and satellite sequences, the aberrant expression of imprinted genes, and the hypomethylation of H19 DMR3 occurred in mid-gestation aborted fetuses and placentas. Cloned pigs generated by somatic cell nuclear transfer (SCNT) showed a greater ratio of early abortion during mid-gestation than did normal controls because of the incomplete epigenetic reprogramming of the donor cells. Altered expression of imprinted genes and the hypermethylation profile of the repetitive regions (PRE-1 and satellite DNA) may be associated with defective development and early abortion of cloned pigs, emphasizing the importance of epigenetics during pregnancy and implications thereof for patient-specific embryonic stem cells for human therapeutic cloning and improvement of human assisted reproduction.
Is plant mitochondrial RNA editing a source of phylogenetic incongruence? An answer from in silico and in vivo data sets.

PubMed

Picardi, Ernesto; Quagliariello, Carla

2008-03-26

In plant mitochondria, the post-transcriptional RNA editing process converts C to U at a number of specific sites of the mRNA sequence and usually restores phylogenetically conserved codons and the encoded amino acid residues. Sites undergoing RNA editing evolve at a higher rate than sites not modified by the process. As a result, editing sites strongly affect the evolution of plant mitochondrial genomes, representing an important source of sequence variability and potentially informative characters. To date no clear and convincing evidence has established whether or not editing sites really affect the topology of reconstructed phylogenetic trees. For this reason, we investigated the effect of RNA editing on the tree-building process using twenty different plant mitochondrial gene sequences and computer simulations. Based on our simulation study we suggest that the editing 'noise' in tree topology inference is mainly manifested at the cDNA level. In particular, editing sites tend to confuse tree topologies when the artificial genomic and cDNA sequences generated are shorter than 500 bp and have an editing percentage higher than 5.0%. Similar results have also been obtained with genuine plant mitochondrial genes. In this latter instance, the topology incongruence increases when the editing percentage goes up from about 3.0 to 14.0%. However, when the average gene length is higher than 1,000 bp (rps3, matR and atp1) no differences between inferred genomic and cDNA topologies could be detected. Our findings from the in silico and in vivo data sets reported here seem to strongly suggest that editing sites contribute to the generation of misleading phylogenetic trees if the analyzed mitochondrial gene sequence is highly edited (higher than 3.0%) and reduced in length (shorter than 500 bp).
In the current absence of direct experimental evidence, the results presented here thus encourage the use of genomic mitochondrial rather than cDNA sequences for reconstructing phylogenetic events in land plants.
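The relationship between a genomic sequence and its edited cDNA can be illustrated with a small Python sketch. It applies C-to-U edits at chosen positions (written as C to T, since U in the mRNA is read as T in cDNA) and reports the share of positions that differ. Defining "editing percentage" as edited sites over sequence length is an assumption made for this example, and the function names and toy sequence are hypothetical.

```python
def apply_c_to_u_editing(genomic: str, edit_sites) -> str:
    """Return the cDNA obtained by editing C to U (T in cDNA) at the given 0-based positions."""
    seq = list(genomic.upper())
    for pos in edit_sites:
        if seq[pos] != "C":
            raise ValueError(f"position {pos} is {seq[pos]!r}, not C")
        seq[pos] = "T"
    return "".join(seq)


def editing_percentage(genomic: str, cdna: str) -> float:
    """Percentage of aligned positions that differ between genomic and cDNA sequences."""
    diffs = sum(g != c for g, c in zip(genomic.upper(), cdna.upper()))
    return 100.0 * diffs / len(genomic)


genomic = "ACCGTCAACC"
cdna = apply_c_to_u_editing(genomic, [1, 5])
print(cdna)                               # ATCGTTAACC
print(editing_percentage(genomic, cdna))  # 20.0
```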
Isolation and sequence analysis of the wheat B genome subtelomeric DNA.

PubMed

Salina, Elena A; Sergeeva, Ekaterina M; Adonina, Irina G; Shcherban, Andrey B; Afonnikov, Dmitry A; Belcram, Harry; Huneau, Cecile; Chalhoub, Boulos

2009-09-05

Telomeric and subtelomeric regions are essential for genome stability and regular chromosome replication. In this work, we have characterized the wheat BAC (bacterial artificial chromosome) clones containing Spelt1 and Spelt52 sequences, which belong to the subtelomeric repeats of the B/G genomes of wheats and Aegilops species from the section Sitopsis. The BAC library from Triticum aestivum cv. Renan was screened using Spelt1 and Spelt52 as probes. Nine positive clones were isolated; of them, clone 2050O8 was localized mainly to the distal parts of wheat chromosomes by in situ hybridization. The distribution of the other clones indicated the presence of different types of repetitive sequences in BACs. Use of different approaches allowed us to prove that seven of the nine isolated clones belonged to the subtelomeric chromosomal regions. Clone 2050O8 was sequenced and its sequence of 119,737 bp was annotated. It is composed of 33% transposable elements (TEs), 8.2% Spelt52 (namely, the subfamily Spelt52.2) and five non-TE-related genes. DNA transposons are predominant, making up 24.6% of the entire BAC clone, whereas retroelements account for 8.4% of the clone length. The full-length CACTA transposon Caspar covers 11,666 bp, encoding a transposase and CTG-2 proteins, and this transposon accounts for 40% of the DNA transposons. The in situ hybridization data for 2050O8 derived subclones in combination with the BLAST search against wheat mapped ESTs (expressed sequence tags) suggest that clone 2050O8 is located in the terminal bin 4BL-10 (0.95-1.0). Additionally, four of the predicted 2050O8 genes showed significant homology to four putative orthologous rice genes in the distal part of rice chromosome 3S and confirm the synteny to wheat 4BL. Satellite DNA sequences from the subtelomeric regions of diploid wheat progenitor can be used for selecting the BAC clones from the corresponding regions of hexaploid wheat chromosomes. 
It has been demonstrated for the first time that Spelt52 sequences were involved in the evolution of terminal regions of common wheat chromosomes. Our research provides new insights into the microcollinearity in the terminal regions of wheat chromosomes 4BL and rice chromosome 3S.
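The composition figures quoted for clone 2050O8 can be cross-checked with simple arithmetic. The Python sketch below uses only the numbers given in the abstract; the variable names are illustrative.

```python
clone_bp = 119_737            # annotated length of clone 2050O8
dna_transposon_frac = 0.246   # DNA transposons, fraction of the clone
retroelement_frac = 0.084     # retroelements, fraction of the clone
caspar_bp = 11_666            # full-length CACTA transposon Caspar

# The two element classes together should match the reported 33% TE content.
te_frac = dna_transposon_frac + retroelement_frac

# Caspar as a share of all DNA-transposon sequence in the clone.
dna_transposon_bp = dna_transposon_frac * clone_bp
caspar_share = caspar_bp / dna_transposon_bp

print(f"TE content: {te_frac:.0%}")
print(f"Caspar share of DNA transposons: {caspar_share:.1%}")
```

Both results agree with the abstract: 24.6% plus 8.4% gives the reported 33%, and Caspar comes out at roughly 40% of the DNA-transposon sequence.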
Biotechnology: Opportunities to Enhance Army Capabilities

DTIC Science & Technology

1989-12-01

without gene vectors Hand pollination Physical methods for DNA or RNA introduction Artificial insemination Cross-species cell fusion and embryo rescue...antibodies and DNA probes to test blood and analyze body fluids for more accurate identification of both suspects and victims. Artificial blood for...the Army include: *combat casualty care - wound repair, organ regeneration, nerve cell repair, and artificial blood *prophylaxis - protection from
Cloning of the anhidrotic ectodermal dysplasia gene: Identification of cDNAs associated with CpG islands mapped near translocation breakpoint in two female patients

DOE Office of Scientific and Technical Information (OSTI.GOV)

Srivastava, A.K.; Schlessinger, D.; Kere, J.

1994-09-01

The gene for the X chromosomal developmental disorder anhidrotic ectodermal dysplasia (EDA) has been mapped to Xq12-q13 by linkage analysis and is expressed in a few females with chromosomal translocations involving band Xq12-q13. A yeast artificial chromosome (YAC) contig (2.0 Mb) spanning two translocation breakpoints has been assembled by sequence-tagged site (STS)-based chromosomal walking. The two translocation breakpoints (X:autosome translocations from the affected female patients) have been mapped less than 60 kb apart within a YAC contig. Unique probes and intragenic STSs (mapped between the two translocations) have been developed and a somatic cell hybrid carrying the translocated X chromosome from the AK patient has been analyzed by isolating unique probes that span the breakpoint. Several STSs made from intragenic sequences have been found to be conserved in mouse, hamster and monkey, but we have detected no mRNAs in a number of tissues tested. However, a probe and STS developed from the DNA spanning the AK breakpoint is conserved in mouse, hamster and monkey, and we have detected expressed sequences in skin cells and cDNA libraries. In addition, unique sequences have been obtained from two CpG islands in the region that maps proximal to the breakpoints. cDNAs containing these sequences are being studied as candidates for the gene affected in the etiology of EDA.
Evolutionary force of AT-rich repeats to trap genomic and episomal DNAs into the rice genome: lessons from endogenous pararetrovirus.

PubMed

Liu, Ruifang; Koyanagi, Kanako O; Chen, Sunlu; Kishima, Yuji

2012-12-01

In plant genomes, the incorporation of DNA segments is not a common method of artificial gene transfer. Nevertheless, various segments of pararetroviruses have been found in plant genomes in recent decades. The rice genome contains a number of segments of endogenous rice tungro bacilliform virus-like sequences (ERTBVs), many of which are present between AT dinucleotide repeats (ATrs). Comparison of genomic sequences between two closely related rice subspecies, japonica and indica, allowed us to verify the preferential insertion of ERTBVs into ATrs. In addition to ERTBVs, the comparative analyses showed that ATrs occasionally incorporate repeat sequences including transposable elements, and a wide range of other sequences. Besides the known genomic sequences, the insertion sequences also represented DNAs of unclear origins together with ERTBVs, suggesting that ATrs have integrated episomal DNAs that would have been suspended in the nucleus. Such insertion DNAs might be trapped by ATrs in the genome in a host-dependent manner. Conversely, other simple mono- and dinucleotide sequence repeats (SSR) were less frequently involved in insertion events relative to ATrs. Therefore, ATrs could be regarded as hot spots of double-strand breaks that induce non-homologous end joining. The insertions within ATrs occasionally generated new gene-related sequences or involved structural modifications of existing genes. Likewise, in a comparison between Arabidopsis thaliana and Arabidopsis lyrata, the insertions preferred ATrs to other SSRs. Therefore ATrs in plant genomes could be considered as genomic dumping sites that have trapped various DNA molecules and may have exerted a powerful evolutionary force. © 2012 The Authors. The Plant Journal © 2012 Blackwell Publishing Ltd.
A sequence of 'factishes': the media-metaphorical knowledge dynamics structuring the German press coverage of the human genome.

PubMed

Doring, Martin

2005-12-01

This article deals with the cultural framing of the near sequencing of the human genome and its impact on the media coverage in Germany. It investigates in particular the way in which the weekly journal Die Zeit and the daily newspaper Frankfurter Rundschau reported this media event and its aftermath between June 2000 and June 2001. Both newspapers are quality papers that played an essential role in framing the human genome debate--alongside the Frankfurter Allgemeine Zeitung--which became the most prominent genomic forum. The decoding of the human genome prompted a huge controversy concerning the ethics of human engineering, research on stem cells and Preimplantation Genetic Diagnosis. The main aim of this article is to show how this controversy was structured by metaphor. The media coverage of the genome generated DNA-factishes--a neologism designating the ambivalence of something as fact (fait) and as a fetish (fetiche)--that mostly propagated images of a new DNA-scienticism or biological determinism. Mediated by cultural experiences, the human genome became a highly artificial and social construct of a 'NatureCulture'.
Genome medicine: gene therapy for the millennium, 30 September-3 October 2001, Rome, Italy.

PubMed

Gruenert, D C; Novelli, G; Dallapiccola, B; Colosimo, A

2002-06-01

The recent surge of DNA sequence information resulting from the efforts of agencies interested in deciphering the human genetic code has facilitated technological developments that have been critical in the identification of genes associated with numerous disease pathologies. In addition, these efforts have opened the door to the opportunity to develop novel genetic therapies to treat a broad range of inherited disorders. Through a joint effort by the University of Vermont, the University of Rome, Tor Vergata, University of Rome, La Sapienza, and the CSS Mendel Institute, Rome, an international meeting, 'Genome Medicine: Gene Therapy for the Millennium' was organized. This meeting provided a forum for the discussion of scientific and clinical advances stimulated by the explosion of sequence information generated by the Human Genome Project and the implications these advances have for gene therapy. The meeting had six sessions that focused on the functional evaluation of specific genes via biochemical analysis and through animal models, the development of novel therapeutic strategies involving gene targeting, artificial chromosomes, DNA delivery systems and non-embryonic stem cells, and on the ethical and social implications of these advances.
Base modifications affecting RNA polymerase and reverse transcriptase fidelity.

PubMed

Potapov, Vladimir; Fu, Xiaoqing; Dai, Nan; Corrêa, Ivan R; Tanner, Nathan A; Ong, Jennifer L

2018-06-20

Ribonucleic acid (RNA) is capable of hosting a variety of chemically diverse modifications, in both naturally-occurring post-transcriptional modifications and artificial chemical modifications used to expand the functionality of RNA. However, few studies have addressed how base modifications affect RNA polymerase and reverse transcriptase activity and fidelity. Here, we describe the fidelity of RNA synthesis and reverse transcription of modified ribonucleotides using an assay based on Pacific Biosciences Single Molecule Real-Time sequencing. Several modified bases, including methylated (m6A, m5C and m5U), hydroxymethylated (hm5U) and isomeric bases (pseudouridine), were examined. By comparing each modified base to the equivalent unmodified RNA base, we can determine how the modification affected cumulative RNA polymerase and reverse transcriptase fidelity. 5-hydroxymethyluridine and N6-methyladenosine both increased the combined error rate of T7 RNA polymerase and reverse transcriptases, while pseudouridine specifically increased the error rate of RNA synthesis by T7 RNA polymerase. In addition, we examined the frequency, mutational spectrum and sequence context of reverse transcription errors on DNA templates from an analysis of second strand DNA synthesis.
One-Dimensional Multichromophor Arrays Based on DNA: From Self-Assembly to Light-Harvesting.

PubMed

Ensslen, Philipp; Wagenknecht, Hans-Achim

2015-10-20

Light-harvesting complexes collect light energy and deliver it by a cascade of energy and electron transfer processes to the reaction center where charge separation leads to storage as chemical energy. The design of artificial light-harvesting assemblies faces enormous challenges because several antenna chromophores need to be kept in close proximity but self-quenching needs to be avoided. Double stranded DNA as a supramolecular scaffold plays a promising role due to its characteristic structural properties. Automated DNA synthesis allows incorporation of artificial chromophore-modified building blocks, and sequence design allows precise control of the distances and orientations between the chromophores. The helical twist between the chromophores, which is induced by the DNA framework, controls energy and electron transfer and thereby reduces the self-quenching that is typically observed in chromophore aggregates. This Account summarizes covalently multichromophore-modified DNA and describes how such multichromophore arrays were achieved by Watson-Crick-specific and DNA-templated self-assembly. The covalent DNA systems were prepared by incorporation of chromophores as DNA base substitutions (either as C-nucleosides or with acyclic linkers as substitutes for the 2'-deoxyribofuranoside) and as DNA base modifications. Studies with DNA base substitutions revealed that distances but more importantly relative orientations of the chromophores govern the energy transfer efficiencies and thereby the light-harvesting properties. With DNA base substitutions, duplex stabilization was faced and could be overcome, for instance, by zipper-like placement of the chromophores in both strands. For both principal structural approaches, DNA-based light-harvesting antenna could be realized. 
The major disadvantages, however, for covalent multichromophore DNA conjugates are the poor yields of synthesis and the solubility issues for oligonucleotides with more than 5-10 chromophore modifications in a row. A logical alternative approach is to leave out the phosphodiester bridges between the chromophores and let chromophore-nucleoside conjugates self-assemble specifically along single stranded DNA as template. The self-organization of chromophores along the DNA template based on canonical base pairing would be advantageous because sequence selective base pairing could provide a structural basis for programmed complexity within the chromophore assembly. The self-assembly is governed by two interactions. The chromophore-nucleoside conjugates as guest molecules are recognized via hydrogen bonds to the corresponding counter bases in the single stranded DNA template. Moreover, the π-π interactions between the stacked chromophores stabilize these self-assembled constructs with increasing length. Longer DNA templates are more attractive for self-assembled antenna. The helicity in the stack of porphyrins as guest molecules assembled on the DNA template can be switched by environmental changes, such as pH variations. DNA-templated stacks of ethynyl pyrene and nile red exhibit left-handed chirality, which stands in contrast to similar covalent multichromophore-DNA conjugates with enforced right-handed helicity. With ethynyl nile red, it is possible to occupy every available binding site on the templates. Mixed assemblies of ethynyl pyrene and nile red show energy transfer and thereby provide a proof-of-principle that simple light-harvesting antennae can be obtained in a noncovalent and self-assembled fashion. With respect to the next important step, chemical storage of the absorbed light energy, future research has to focus on the coupling of sophisticated DNA-based light-harvesting antenna to reaction centers.
Genomics and museum specimens.

PubMed

Nachman, Michael W

2013-12-01

Nearly 25 years ago, Allan Wilson and colleagues isolated DNA sequences from museum specimens of kangaroo rats (Dipodomys panamintinus) and compared these sequences with those from freshly collected animals (Thomas et al. 1990). The museum specimens had been collected up to 78 years earlier, so the two samples provided a direct temporal comparison of patterns of genetic variation. This was not the first time DNA sequences had been isolated from preserved material, but it was the first time it had been carried out with a population sample. Population geneticists often try to make inferences about the influence of historical processes such as selection, drift, mutation and migration on patterns of genetic variation in the present. The work of Wilson and colleagues was important in part because it suggested a way in which population geneticists could actually study genetic change in natural populations through time, much the same way that experimentalists can do with artificial populations in the laboratory. Indeed, the work of Thomas et al. (1990) spawned dozens of studies in which museum specimens were used to compare historical and present-day genetic diversity (reviewed in Wandeler et al. 2007). All of these studies, however, were limited by the same fundamental problem: old DNA is degraded into short fragments. As a consequence, these studies mostly involved PCR amplification of short templates, usually short stretches of mitochondrial DNA or microsatellites. In this issue, Bi et al. (2013) report a breakthrough that should open the door to studies of genomic variation in museum specimens. They used target enrichment (exon capture) and next-generation (Illumina) sequencing to compare patterns of genetic variation in historic and present-day population samples of alpine chipmunks (Tamias alpinus) (Fig. 1). The historic samples came from specimens collected in 1915, so the temporal span of this comparison is nearly 100 years. © 2013 John Wiley & Sons Ltd.
Generation of an approximately 2.4 Mb human X centromere-based minichromosome by targeted telomere-associated chromosome fragmentation in DT40.

PubMed

Mills, W; Critcher, R; Lee, C; Farr, C J

1999-05-01

A linear mammalian artificial chromosome (MAC) will require at least three types of functional element: a centromere, two telomeres and origins of replication. As yet, our understanding of these elements, as well as many other aspects of structure and organization which may be critical for a fully functional mammalian chromosome, remains poor. As a way of defining these various requirements, minichromosome reagents are being developed and analysed. Approaches for minichromosome generation fall into two broad categories: de novo assembly from candidate DNA sequences, or the fragmentation of an existing chromosome to reduce it to a minimal size. Here we describe the generation of a human minichromosome using the latter, top-down, approach. A human X chromosome, present in a DT40-human microcell hybrid, has been manipulated using homologous recombination and the targeted seeding of a de novo telomere. This strategy has generated a linear approximately 2.4 Mb human X centromere-based minichromosome capped by two artificially seeded telomeres: one immediately flanking the centromeric alpha-satellite DNA and the other targeted to the zinc finger gene ZXDA in Xp11.21. The chromosome retains an alpha-satellite domain of approximately 1.8 Mb, a small array of gamma-satellite repeat (approximately 40 kb) and approximately 400 kb of Xp proximal DNA sequence. The mitotic stability of this minichromosome has been examined, both in DT40 and following transfer into hamster and human cell lines. In all three backgrounds, the minichromosome is retained efficiently, but in the human and hamster microcell hybrids its copy number is poorly regulated. This approach of engineering well-defined chromosome reagents will allow key questions in MAC development (such as whether a lower size limit exists) to be addressed. In addition, the 2.4 Mb minichromosome described here has potential to be developed as a vector for gene delivery.
Low X/Y divergence in four pairs of papaya sex-linked genes.

PubMed

Yu, Qingyi; Hou, Shaobin; Feltus, F Alex; Jones, Meghan R; Murray, Jan E; Veatch, Olivia; Lemke, Cornelia; Saw, Jimmy H; Moore, Richard C; Thimmapuram, Jyothi; Liu, Lei; Moore, Paul H; Alam, Maqsudul; Jiang, Jiming; Paterson, Andrew H; Ming, Ray

2008-01-01

Sex chromosomes in flowering plants, in contrast to those in animals, evolved relatively recently and only a few are heteromorphic. The homomorphic sex chromosomes of papaya show features of incipient sex chromosome evolution. We investigated the features of paired X- and Y-specific bacterial artificial chromosomes (BACs), and estimated the time of divergence in four pairs of sex-linked genes. We report the results of a comparative analysis of long contiguous genomic DNA sequences between the X and hermaphrodite Y (Y(h)) chromosomes. Numerous chromosomal rearrangements were detected in the male-specific region of the Y chromosome (MSY), including inversions, deletions, insertions, duplications and translocations, showing the dynamic evolutionary process on the MSY after recombination ceased. DNA sequence expansion was documented in the two regions of the MSY, demonstrating that the cytologically homomorphic sex chromosomes are heteromorphic at the molecular level. Analysis of sequence divergence between four X and Y(h) gene pairs resulted in an estimated age of divergence of between 0.5 and 2.2 million years, supporting a recent origin of the papaya sex chromosomes. Our findings indicate that sex chromosomes did not evolve at the family level in Caricaceae, and reinforce the theory that sex chromosomes evolve at the species level in some lineages.
A cascade reaction network mimicking the basic functional steps of adaptive immune response

NASA Astrophysics Data System (ADS)

Han, Da; Wu, Cuichen; You, Mingxu; Zhang, Tao; Wan, Shuo; Chen, Tao; Qiu, Liping; Zheng, Zheng; Liang, Hao; Tan, Weihong

2015-10-01

Biological systems use complex ‘information-processing cores’ composed of molecular networks to coordinate their external environment and internal states. An example of this is the acquired, or adaptive, immune system (AIS), which is composed of both humoral and cell-mediated components. Here we report the step-by-step construction of a prototype mimic of the AIS that we call an adaptive immune response simulator (AIRS). DNA and enzymes are used as simple artificial analogues of the components of the AIS to create a system that responds to specific molecular stimuli in vitro. We show that this network of reactions can function in a manner that is superficially similar to the most basic responses of the vertebrate AIS, including reaction sequences that mimic both humoral and cellular responses. As such, AIRS provides guidelines for the design and engineering of artificial reaction networks and molecular devices.
Codon Optimization of the Human Papillomavirus E7 Oncogene Induces a CD8+ T Cell Response to a Cryptic Epitope Not Harbored by Wild-Type E7

PubMed Central

Lorenz, Felix K. M.; Wilde, Susanne; Voigt, Katrin; Kieback, Elisa; Mosetter, Barbara; Schendel, Dolores J.; Uckert, Wolfgang

2015-01-01

Codon optimization of nucleotide sequences is a widely used method to achieve high levels of transgene expression for basic and clinical research. Until now, immunological side effects have not been described. To trigger T cell responses against human papillomavirus, we incubated T cells with dendritic cells that were pulsed with RNA encoding the codon-optimized E7 oncogene. All T cell receptors isolated from responding T cell clones recognized target cells expressing the codon-optimized E7 gene but not the wild type E7 sequence. Epitope mapping revealed recognition of a cryptic epitope from the +3 alternative reading frame of codon-optimized E7, which is not encoded by the wild type E7 sequence. The introduction of a stop codon into the +3 alternative reading frame protected the transgene product from recognition by T cell receptor gene-modified T cells. This is the first experimental study demonstrating that codon optimization can render a transgene artificially immunogenic through generation of a dominant cryptic epitope. This finding may be of great importance for the clinical field of gene therapy to avoid rejection of gene-corrected cells and for the design of DNA- and RNA-based vaccines, where codon optimization may artificially add a strong immunogenic component to the vaccine. PMID:25799237
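The reading-frame effect described in this abstract can be illustrated with a minimal sketch. It assumes that "+3" denotes the forward frame offset by two nucleotides from the annotated start, and it uses short hypothetical toy sequences (not the E7 gene) in which a single synonymous codon change leaves the frame +1 protein unchanged but alters the peptide encoded in frame +3. The function name `translate` and both sequences are illustrative assumptions.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(seq, frame=1):
    """Translate a DNA sequence in forward frame 1, 2 or 3 ('*' marks a stop codon)."""
    offset = frame - 1
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(offset, len(seq) - 2, 3))


# Hypothetical toy sequences (NOT E7): one synonymous change, CTT -> CTG (both Leu).
wild_type = "ATGCTTGAAACC"
optimized = "ATGCTGGAAACC"

for name, seq in (("wild type", wild_type), ("optimized", optimized)):
    print(name, {f"+{f}": translate(seq, f) for f in (1, 2, 3)})
```

In this toy case the frame +1 translation is identical for both sequences, while frame +3 contains a stop codon only in the unmodified sequence. Screening all three forward frames of a recoded transgene in this way is one simple approach to flagging newly opened alternative reading frames.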
TALE: a tale of genome editing.

PubMed

Zhang, Mingjie; Wang, Feng; Li, Shifei; Wang, Yan; Bai, Yun; Xu, Xueqing

2014-01-01

Transcription activator-like effectors (TALEs), first identified in Xanthomonas bacteria, are naturally occurring or artificially designed proteins that modulate gene transcription. These proteins recognize and bind DNA sequences based on a variable number of tandem repeats. Each repeat is comprised of a set of ∼34 conserved amino acids; within this conserved domain, there are usually two amino acids that distinguish one TALE from another. Interestingly, TALEs have revealed a simple cipher for the one-to-one recognition of proteins for DNA bases. Synthetic TALEs have been used to successfully target genes in a variety of species, including humans. Depending on the type of functional domain that is fused to the TALE of interest, these proteins can have diverse biological effects. For example, after binding DNA, TALEs fused to transcriptional activation domains can function as robust transcription factors (TALE-TFs), while TALEs fused to restriction endonucleases (TALENs) can cut DNA. Targeted genome editing, in theory, is capable of modifying any endogenous gene sequence of interest; this can be performed in cells or organisms, and may be applied to clinical gene-based therapies in the future. With current technologies, highly accurate, specific, and reliable gene editing cannot be achieved. Thus, recognition and binding mechanisms governing TALE biology are currently hot research areas. In this review, we summarize the major advances in TALE technology over the past several years with a focus on the interaction between TALEs and DNA, TALE design and construction, potential applications for this technology, and unique characteristics that make TALEs superior to zinc finger endonucleases. Copyright © 2013 Elsevier Ltd. All rights reserved.

Draft Sequences of the Radish (Raphanus sativus L.) Genome

PubMed Central

Kitashiba, Hiroyasu; Li, Feng; Hirakawa, Hideki; Kawanabe, Takahiro; Zou, Zhongwei; Hasegawa, Yoichi; Tonosaki, Kaoru; Shirasawa, Sachiko; Fukushima, Aki; Yokoi, Shuji; Takahata, Yoshihito; Kakizaki, Tomohiro; Ishida, Masahiko; Okamoto, Shunsuke; Sakamoto, Koji; Shirasawa, Kenta; Tabata, Satoshi; Nishio, Takeshi

2014-01-01

Radish (Raphanus sativus L., n = 9) is one of the major vegetables in Asia. Since the genomes of Brassica and related species including radish underwent genome rearrangement, it is quite difficult to perform functional analysis based on the reported genomic sequence of Brassica rapa. Therefore, we performed genome sequencing of radish. Short reads of genomic sequences of 191.1 Gb were obtained by next-generation sequencing (NGS) for a radish inbred line, and 76,592 scaffolds of ≥300 bp were constructed along with the bacterial artificial chromosome-end sequences. Finally, the whole draft genomic sequence of 402 Mb spanning 75.9% of the estimated genomic size and containing 61,572 predicted genes was obtained. Subsequently, 221 single nucleotide polymorphism markers and 768 PCR-RFLP markers were used together with the 746 markers produced in our previous study for the construction of a linkage map. The map was combined further with another radish linkage map constructed mainly with expressed sequence tag-simple sequence repeat markers into a high-density integrated map of 1,166 cM with 2,553 DNA markers. A total of 1,345 scaffolds were assigned to the linkage map, spanning 116.0 Mb. Bulked PCR products amplified by 2,880 primer pairs were sequenced by NGS, and SNPs in eight inbred lines were identified. PMID:24848699
Quantitative Detection of Small Molecule/DNA Complexes Employing a Force-Based and Label-Free DNA-Microarray

PubMed Central

Ho, Dominik; Dose, Christian; Albrecht, Christian H.; Severin, Philip; Falter, Katja; Dervan, Peter B.; Gaub, Hermann E.

2009-01-01

Force-based ligand detection is a promising method to characterize molecular complexes label-free at physiological conditions. Because conventional implementations of this technique, e.g., based on atomic force microscopy or optical traps, are low-throughput and require extremely sensitive and sophisticated equipment, this approach has to date found only limited application. We present a low-cost, chip-based assay, which combines high-throughput force-based detection of dsDNA·ligand interactions with the ease of fluorescence detection. Within the comparative unbinding force assay, many duplicates of a target DNA duplex are probed against a defined reference DNA duplex each. The fractions of broken target and reference DNA duplexes are determined via fluorescence. With this assay, we investigated the DNA binding behavior of artificial pyrrole-imidazole polyamides. These small compounds can be programmed to target specific dsDNA sequences and distinguish between D- and L-DNA. We found that titration with polyamides specific for a binding motif, which is present in the target DNA duplex and not in the reference DNA duplex, reliably resulted in a shift toward larger fractions of broken reference bonds. From the concentration dependence, nanomolar to picomolar dissociation constants of dsDNA·ligand complexes were determined, agreeing well with prior quantitative DNase footprinting experiments. This finding corroborates that the forced unbinding of dsDNA in the presence of a ligand is a nonequilibrium process that produces a snapshot of the equilibrium distribution between dsDNA and dsDNA·ligand complexes. PMID:19486688
Bacterial genomes lacking long-range correlations may not be modeled by low-order Markov chains: the role of mixing statistics and frame shift of neighboring genes.

PubMed

Cocho, Germinal; Miramontes, Pedro; Mansilla, Ricardo; Li, Wentian

2014-12-01

We examine the relationship between exponential correlation functions and Markov models in a bacterial genome in detail. Despite the well known fact that Markov models generate sequences with correlation function that decays exponentially, simply constructed Markov models based on nearest-neighbor dimer (first-order), trimer (second-order), up to hexamer (fifth-order), and treating the DNA sequence as being homogeneous all fail to predict the value of exponential decay rate. Even reading-frame-specific Markov models (both first- and fifth-order) could not explain the fact that the exponential decay is very slow. Starting with the in-phase coding-DNA-sequence (CDS), we investigated correlation within a fixed-codon-position subsequence, and in artificially constructed sequences by packing CDSs with out-of-phase spacers, as well as altering CDS length distribution by imposing an upper limit. From these targeted analyses, we conclude that the correlation in the bacterial genomic sequence is mainly due to a mixing of heterogeneous statistics at different codon positions, and the decay of correlation is due to the possible out-of-phase between neighboring CDSs. There are also small contributions to the correlation from bases at the same codon position, as well as by non-coding sequences. These show that the seemingly simple exponential correlation functions in bacterial genome hide a complexity in correlation structure which is not suitable for a modeling by Markov chain in a homogeneous sequence. Other results include: use of the (absolute value) second largest eigenvalue to represent the 16 correlation functions and the prediction of a 10-11 base periodicity from the hexamer frequencies. Copyright © 2014 Elsevier Ltd. All rights reserved.
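The link between a first-order (dimer-based) Markov model and an exponentially decaying correlation function can be sketched as follows. This is an illustrative sketch only, not the authors' analysis: the function names, the toy sequence, and the use of the largest absolute base-base correlation as a summary are all assumptions. For a homogeneous chain with transition matrix P and stationary frequencies f, the model correlation at lag k is C_ab(k) = f_a (P^k)_ab - f_a f_b, and at large lags it shrinks by a factor equal to the second-largest eigenvalue modulus of P per step.

```python
from collections import Counter

BASES = "ACGT"


def transition_matrix(seq):
    """Estimate P[a][b] = P(next base = b | current base = a) from dimer counts."""
    counts = Counter(zip(seq, seq[1:]))
    matrix = []
    for a in BASES:
        row = [counts[(a, b)] for b in BASES]
        total = sum(row)
        # Fall back to a uniform row if base `a` is never followed by anything.
        matrix.append([c / total for c in row] if total else [0.25] * 4)
    return matrix


def matmul(x, y):
    """Multiply two 4x4 matrices stored as nested lists."""
    return [[sum(x[i][k] * y[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def stationary(P, iterations=500):
    """Approximate the stationary base frequencies by repeated application of P."""
    f = [0.25] * 4
    for _ in range(iterations):
        f = [sum(f[i] * P[i][j] for i in range(4)) for j in range(4)]
    return f


def model_correlation(P, max_lag):
    """Largest |C_ab(k)| over all base pairs (a, b), for lags k = 1..max_lag."""
    f = stationary(P)
    decay, Pk = [], P
    for _ in range(max_lag):
        decay.append(max(abs(f[i] * Pk[i][j] - f[i] * f[j])
                         for i in range(4) for j in range(4)))
        Pk = matmul(Pk, P)
    return decay


# Hypothetical toy sequence, far too short for any real inference.
toy = "ATGCGATACGCTTGAGGCTAACGTTCAGGATCCATGGCAATTCGGA"
print(model_correlation(transition_matrix(toy), max_lag=5))
```

Comparing the decay rate predicted this way with the decay measured directly on a genome is the kind of check the abstract reports as failing for homogeneous low-order models.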
A Synthetic DNA-Binding Domain Guides Distinct Chromatin-Modifying Small Molecules to Activate an Identical Gene Network.

PubMed

Han, Le; Pandian, Ganesh N; Chandran, Anandhakumar; Sato, Shinsuke; Taniguchi, Junichi; Kashiwazaki, Gengo; Sawatani, Yoshito; Hashiya, Kaori; Bando, Toshikazu; Xu, Yufang; Qian, Xuhong; Sugiyama, Hiroshi

2015-07-20

Synthetic dual-function ligands targeting specific DNA sequences and histone-modifying enzymes were applied to achieve regulatory control over multi-gene networks in living cells. Unlike the broad array of targeting small molecules for histone deacetylases (HDACs), few modulators are known for histone acetyltransferases (HATs), which play a central role in transcriptional control. As a novel chemical approach to induce selective HAT-regulated genes, we conjugated a DNA-binding domain (DBD) "I" to N-(4-chloro-3-trifluoromethyl-phenyl)-2-ethoxy-benzamide (CTB), an artificial HAT activator. In vitro enzyme activity assays and microarray studies were used to demonstrate that distinct functional small molecules could be transformed to have identical bioactivity when conjugated with a targeting DBD. This proof-of-concept synthetic strategy validates the switchable functions of HDACs and HATs in gene regulation and provides a molecular basis for developing versatile bioactive ligands. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Rapid construction of a Bacterial Artificial Chromosomal (BAC) expression vector using designer DNA fragments.

PubMed

Chen, Chao; Zhao, Xinqing; Jin, Yingyu; Zhao, Zongbao Kent; Suh, Joo-Won

2014-11-01

Bacterial artificial chromosomal (BAC) vectors are increasingly being used in cloning large DNA fragments containing complex biosynthetic pathways to facilitate heterologous production of microbial metabolites for drug development. To express inserted genes using Streptomyces species as the production hosts, an integration expression cassette is required to be inserted into the BAC vector, which includes genetic elements encoding a phage-specific attachment site, an integrase, an origin of transfer, a selection marker and a promoter. Due to the large sizes of DNA inserted into the BAC vectors, it is normally inefficient and time-consuming to assemble these fragments by routine PCR amplifications and restriction-ligations. Here we present a rapid method to insert fragments to construct BAC-based expression vectors. A DNA fragment of about 130 bp was designed, which contains upstream and downstream homologous sequences of both BAC vector and pIB139 plasmid carrying the whole integration expression cassette. In-Fusion cloning was performed using the designer DNA fragment to modify pIB139, followed by λ-RED-mediated recombination to obtain the BAC-based expression vector. We demonstrated the effectiveness of this method by rapid construction of a BAC-based expression vector with an insert of about 120 kb that contains the entire gene cluster for biosynthesis of immunosuppressant FK506. The empty BAC-based expression vector constructed in this study can be conveniently used for construction of BAC libraries using either microbial pure culture or environmental DNA, and the selected BAC clones can be directly used for heterologous expression. Alternatively, if a BAC library has already been constructed using a commercial BAC vector, the selected BAC vectors can be manipulated using the method described here to get the BAC-based expression vectors with desired gene clusters for heterologous expression. 
The rapid construction of a BAC-based expression vector facilitates heterologous expression of large gene clusters for drug discovery. Copyright © 2014 Elsevier Inc. All rights reserved.
The nucleoid protein Dps binds genomic DNA of Escherichia coli in a non-random manner

PubMed Central

Kondrashov, F. A.; Toshchakov, S. V.; Dominova, I.; Shvyreva, U. S.; Vrublevskaya, V. V.; Morenkov, O. S.; Panyukov, V. V.

2017-01-01

Dps is a multifunctional homododecameric protein that oxidizes Fe2+ ions, accumulating them in the form of Fe2O3 within its protein cavity, interacts with DNA, tightly condensing the bacterial nucleoid upon starvation, and performs some other functions. In the two decades since the discovery of this protein, its ferroxidase activity has become rather well studied, but the mechanism of Dps interaction with DNA still remains enigmatic. The crucial role of lysine residues in the unstructured N-terminal tails led to the conventional point of view that Dps binds DNA without sequence or structural specificity. However, deletion of dps changed the profile of proteins in starved cells, a SELEX screen revealed genomic regions preferentially bound in vitro, and a certain affinity of Dps for artificial branched molecules was detected by atomic force microscopy. Here we report a non-random distribution of Dps binding sites across the bacterial chromosome in exponentially growing cells and show their enrichment with inverted repeats prone to form secondary structures. We found that the Dps-bound regions overlap with sites occupied by other nucleoid proteins, and contain overrepresented motifs typical for their consensus sequences. Of the two types of genomic domains with extensive protein occupancy, which can be highly expressed or transcriptionally silent, only those that are enriched with RNA polymerase molecules were preferentially occupied by Dps. In the dps-null mutant we, therefore, observed a differentially altered expression of several targeted genes and found suppressed transcription from the dps promoter. In most cases this can be explained by the relieved interference with Dps for nucleoid proteins exploiting sequence-specific modes of DNA binding.
Thus, protecting bacterial cells from different stresses during exponential growth, Dps can modulate transcriptional integrity of the bacterial chromosome hampering RNA biosynthesis from some genes via competition with RNA polymerase or, vice versa, competing with inhibitors to activate transcription. PMID:28800583
Single-molecule nanopore enzymology

PubMed Central

Wloka, Carsten; Maglia, Giovanni

2017-01-01

Biological nanopores are a class of membrane proteins that open nanoscale water-conduits in biological membranes. When they are reconstituted in artificial membranes and a bias voltage is applied across the membrane, the ionic current passing through individual nanopores can be used to monitor chemical reactions, to recognize individual molecules and, of most interest, to sequence DNA. More recently, proteins and enzymes have started being analysed with nanopores. Monitoring enzymatic reactions with nanopores, i.e. nanopore enzymology, has the unique advantage that it allows long-timescale observations of native proteins at the single-molecule level. Here we describe the approaches and challenges in nanopore enzymology. PMID:28630164
Exploring the loblolly pine (Pinus taeda L.) genome by BAC sequencing and Cot analysis.

PubMed

Perera, Dinum; Magbanua, Zenaida V; Thummasuwan, Supaphan; Mukherjee, Dipaloke; Arick, Mark; Chouvarine, Philippe; Nairn, Campbell J; Schmutz, Jeremy; Grimwood, Jane; Dean, Jeffrey F D; Peterson, Daniel G

2018-07-15

Loblolly pine (LP; Pinus taeda L.) is an economically and ecologically important tree in the southeastern U.S. To advance understanding of the loblolly pine (LP; Pinus taeda L.) genome, we sequenced and analyzed 100 BAC clones and performed a Cot analysis. The Cot analysis indicates that the genome is composed of 57, 24, and 10% highly-repetitive, moderately-repetitive, and single/low-copy sequences, respectively (the remaining 9% of the genome is a combination of fold back and damaged DNA). Although single/low-copy DNA only accounts for 10% of the LP genome, the amount of single/low-copy DNA in LP is still 14 times the size of the Arabidopsis genome. Since gene numbers in LP are similar to those in Arabidopsis, much of the single/low-copy DNA of LP would appear to be composed of DNA that is both gene- and repeat-poor. Macroarrays prepared from a LP bacterial artificial chromosome (BAC) library were hybridized with probes designed from cell wall synthesis/wood development cDNAs, and 50 of the "targeted" clones were selected for further analysis. An additional 25 clones were selected because they contained few repeats, while 25 more clones were selected at random. The 100 BAC clones were Sanger sequenced and assembled. Of the targeted BACs, 80% contained all or part of the cDNA used to target them. One targeted BAC was found to contain fungal DNA and was eliminated from further analysis. Combinations of similarity-based and ab initio gene prediction approaches were utilized to identify and characterize potential coding regions in the 99 BACs containing LP DNA. From this analysis, we identified 154 gene models (GMs) representing both putative protein-coding genes and likely pseudogenes. Ten of the GMs (all of which were specifically targeted) had enough support to be classified as intact genes. 
Interestingly, the 154 GMs had statistically indistinguishable (α = 0.05) distributions in the targeted and random BAC clones (15.18 and 12.61 GM/Mb, respectively), whereas the low-repeat BACs contained significantly fewer GMs (7.08 GM/Mb). However, when GM length was considered, the targeted BACs had a significantly greater percentage of their length in GMs (3.26%) when compared to random (1.63%) and low-repeat (0.62%) BACs. The results of our study provide insight into LP evolution and inform ongoing efforts to produce a reference genome sequence for LP, while characterization of genes involved in cell wall production highlights carbon metabolism pathways that can be leveraged for increasing wood production. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
A methylation status analysis of the apomixis-specific region in Paspalum spp. suggests an epigenetic control of parthenogenesis.

PubMed

Podio, Maricel; Cáceres, Maria E; Samoluk, Sergio S; Seijo, José G; Pessino, Silvina C; Ortiz, Juan Pablo A; Pupilli, Fulvio

2014-12-01

Apomixis, a clonal plant reproduction by seeds, is controlled in Paspalum spp. by a single locus which is blocked in terms of recombination. Partial sequence analysis of the apomixis locus revealed structural features of heterochromatin, namely the presence of repetitive elements, gene degeneration, and de-regulation. To test the epigenetic control of apomixis, a study on the distribution of cytosine methylation at the apomixis locus and the effect of artificial DNA demethylation on the mode of reproduction was undertaken in two apomictic Paspalum species. The 5-methylcytosine distribution in the apomixis-controlling genomic region was studied in P. simplex by methylation-sensitive restriction fragment length polymorphism (RFLP) analysis and in P. notatum by fluorescence in situ hybridization (FISH). The effect of DNA demethylation was studied on the mode of reproduction of P. simplex by progeny test analysis of apomictic plants treated with the demethylating agent 5'-azacytidine. A high level of cytosine methylation was detected at the apomixis-controlling genomic region in both species. By analysing a total of 374 open pollination progeny, it was found that artificial demethylation had little or no effect on apospory, whereas it induced a significant depression of parthenogenesis. The results suggested that factors controlling repression of parthenogenesis might be inactivated in apomictic Paspalum by DNA methylation. © The Author 2014. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Variations in 5S rDNAs in diploid and tetraploid offspring of red crucian carp × common carp.

PubMed

Ye, Lihai; Zhang, Chun; Tang, Xiaojun; Chen, Yiyi; Liu, Shaojun

2017-08-08

The allotetraploid hybrid fish (4nAT) that was created in a previous study through an intergeneric cross between red crucian carp (Carassius auratus red var., ♀) and common carp (Cyprinus carpio L., ♂) provided an excellent platform to investigate the effect of hybridization and polyploidization on the evolution of 5S rDNA. The 5S rDNAs of paternal common carp were made up of a coding sequence (CDS) and a non-transcribed spacer (NTS) unit, and while the 5S rDNAs of maternal red crucian carp contained a CDS and a NTS unit, they also contained a variable number of interposed regions (IPRs). The CDSs of the 5S rDNAs in both parental fishes were conserved, while their NTS units seemed to have been subjected to rapid evolution. The diploid hybrid 2nF1 inherited all the types of 5S rDNAs in both progenitors and there were no signs of homeologous recombination in the 5S rDNAs of 2nF1 by sequencing of PCR products. We obtained two segments of 5S rDNA with a total length of 16,457 bp from allotetraploid offspring 4nAT through bacterial artificial chromosome (BAC) sequencing. Using this sequence together with the 5S rDNA sequences amplified from the genomic DNA of 4nAT, we deduced that the 5S rDNAs of 4nAT might be inherited from the maternal progenitor red crucian carp. Additionally, the IPRs in the 5S rDNAs of 4nAT contained A-repeats and TA-repeats, which was not the case for the IPRs in the 5S rDNAs of 2nF1. We also detected two signals of a 200-bp fragment of 5S rDNA in the chromosomes of parental progenitors and hybrid progenies by fluorescence in situ hybridization (FISH). We deduced that during the evolution of 5S rDNAs in different ploidy hybrid fishes, interlocus gene conversion events and tandem repeat insertion events might have occurred in the process of polyploidization.
This study provided new insights into the relationship among the evolution of 5S rDNAs, hybridization and polyploidization, which were significant in clarifying the genome evolution of polyploid fish.
Metal-mediated DNA base pairing: alternatives to hydrogen-bonded Watson-Crick base pairs.

PubMed

Takezawa, Yusuke; Shionoya, Mitsuhiko

2012-12-18

With its capacity to store and transfer the genetic information within a sequence of monomers, DNA forms its central role in chemical evolution through replication and amplification. This elegant behavior is largely based on highly specific molecular recognition between nucleobases through the specific hydrogen bonds in the Watson-Crick base pairing system. While the native base pairs have been amazingly sophisticated through the long history of evolution, synthetic chemists have devoted considerable efforts to create alternative base pairing systems in recent decades. Most of these new systems were designed based on the shape complementarity of the pairs or the rearrangement of hydrogen-bonding patterns. We wondered whether metal coordination could serve as an alternative driving force for DNA base pairing and why hydrogen bonding was selected on Earth in the course of molecular evolution. Therefore, we envisioned an alternative design strategy: we replaced hydrogen bonding with another important scheme in biological systems, metal-coordination bonding. In this Account, we provide an overview of the chemistry of metal-mediated base pairing including basic concepts, molecular design, characteristic structures and properties, and possible applications of DNA-based molecular systems. We describe several examples of artificial metal-mediated base pairs, such as Cu(2+)-mediated hydroxypyridone base pair, H-Cu(2+)-H (where H denotes a hydroxypyridone-bearing nucleoside), developed by us and other researchers. To design the metallo-base pairs we carefully chose appropriate combinations of ligand-bearing nucleosides and metal ions. As expected from their stronger bonding through metal coordination, DNA duplexes possessing metallo-base pairs exhibited higher thermal stability than natural hydrogen-bonded DNAs. Furthermore, we could also use metal-mediated base pairs to construct or induce other high-order structures. 
These features could lead to metal-responsive functional DNA molecules such as artificial DNAzymes and DNA machines. In addition, the metallo-base pairing system is a powerful tool for the construction of homogeneous and heterogeneous metal arrays, which can lead to DNA-based nanomaterials such as electronic wires and magnetic devices. Recently researchers have investigated these systems as enzyme replacements, which may offer an additional contribution to chemical biology and synthetic biology through the expansion of the genetic alphabet.
A physical map, including a BAC/PAC clone contig, of the Williams-Beuren syndrome--deletion region at 7q11.23.

PubMed

Peoples, R; Franke, Y; Wang, Y K; Pérez-Jurado, L; Paperna, T; Cisco, M; Francke, U

2000-01-01

Williams-Beuren syndrome (WBS) is a developmental disorder caused by haploinsufficiency for genes in a 2-cM region of chromosome band 7q11.23. With the exception of vascular stenoses due to deletion of the elastin gene, the various features of WBS have not yet been attributed to specific genes. Although ≥16 genes have been identified within the WBS deletion, completion of a physical map of the region has been difficult because of the large duplicated regions flanking the deletion. We present a physical map of the WBS deletion and flanking regions, based on assembly of a bacterial artificial chromosome/P1-derived artificial chromosome contig, analysis of high-throughput genome-sequence data, and long-range restriction mapping of genomic and cloned DNA by pulsed-field gel electrophoresis. Our map encompasses 3 Mb, including 1.6 Mb within the deletion. Two large duplicons, flanking the deletion, of ≥320 kb contain unique sequence elements from the internal border regions of the deletion, such as sequences from GTF2I (telomeric) and FKBP6 (centromeric). A third copy of this duplicon exists in inverted orientation distal to the telomeric flanking one. These duplicons show stronger sequence conservation with regard to each other than to the presumptive ancestral loci within the common deletion region. Sequence elements originating from beyond 7q11.23 are also present in these duplicons. Although the duplicons are not present in mice, the order of the single-copy genes in the conserved syntenic region of mouse chromosome 5 is inverted relative to the human map. A model is presented for a mechanism of WBS-deletion formation, based on the orientation of duplicons' components relative to each other and to the ancestral elements within the deletion region.
Gadd45a Is an RNA Binding Protein and Is Localized in Nuclear Speckles

PubMed Central

Sytnikova, Yuliya A.; Kubarenko, Andriy V.; Schäfer, Andrea; Weber, Alexander N. R.; Niehrs, Christof

2011-01-01

Background The Gadd45 proteins play important roles in growth control, maintenance of genomic stability, DNA repair, and apoptosis. Recently, Gadd45 proteins have also been implicated in epigenetic gene regulation by promoting active DNA demethylation. Gadd45 proteins have sequence homology with the L7Ae/L30e/S12e RNA binding superfamily of ribosomal proteins, which raises the question of whether they may interact directly with nucleic acids. Principal Findings Here we show that Gadd45a binds RNA but not single- or double-stranded DNA or methylated DNA in vitro. Sucrose density gradient centrifugation experiments demonstrate that Gadd45a is present in high molecular weight particles, which are RNase sensitive. Gadd45a displays RNase-sensitive colocalization in nuclear speckles with the RNA helicase p68 and the RNA binding protein SC35. A K45A point mutation defective in RNA binding was still active in DNA demethylation. This suggests that RNA binding is not absolutely essential for demethylation of an artificial substrate. A point mutation at G39 impaired RNA binding, nuclear speckle localization and DNA demethylation, emphasizing its relevance for Gadd45a function. Significance The results implicate RNA in Gadd45a function and suggest that Gadd45a is associated with a ribonucleoprotein particle. PMID:21249130
Assessment of primer/template mismatch effects on real-time PCR amplification of target taxa for GMO quantification.

PubMed

Ghedira, Rim; Papazova, Nina; Vuylsteke, Marnik; Ruttink, Tom; Taverniers, Isabel; De Loose, Marc

2009-10-28

GMO quantification, based on real-time PCR, relies on the amplification of an event-specific transgene assay and a species-specific reference assay. The uniformity of the nucleotide sequences targeted by both assays across various transgenic varieties is an important prerequisite for correct quantification. Single nucleotide polymorphisms (SNPs) frequently occur in the maize genome and might lead to nucleotide variation in regions used to design primers and probes for reference assays. Further, they may affect the annealing of the primer to the template and reduce the efficiency of DNA amplification. We assessed the effect of a minor DNA template modification, such as a single base pair mismatch in the primer attachment site, on real-time PCR quantification. A model system was used based on the introduction of artificial mismatches between the forward primer and the DNA template in the reference assay targeting the maize starch synthase (SSIIb) gene. The results show that the presence of a mismatch between the primer and the DNA template causes partial to complete failure of the amplification of the initial DNA template depending on the type and location of the nucleotide mismatch. With this study, we show that the presence of a primer/template mismatch affects the estimated total DNA quantity to a varying degree.
PCR cycles above routine numbers do not compromise high-throughput DNA barcoding results.

PubMed

Vierna, J; Doña, J; Vizcaíno, A; Serrano, D; Jovani, R

2017-10-01

High-throughput DNA barcoding has become essential in ecology and evolution, but some technical questions still remain. Increasing the number of PCR cycles above the routine 20-30 cycles is a common practice when working with old-type specimens, which provide small amounts of DNA, or when facing annealing issues with the primers. However, increasing the number of cycles can raise the number of artificial mutations due to polymerase errors. In this work, we sequenced 20 COI libraries on the Illumina MiSeq platform. Libraries were prepared with 40, 45, 50, 55, and 60 PCR cycles from four individuals belonging to four species of four genera of cephalopods. We found no relationship between the number of PCR cycles and the number of mutations despite using a nonproofreading polymerase. Moreover, even when using a high number of PCR cycles, the resulting number of mutations was low enough not to be an issue in the context of high-throughput DNA barcoding (but may still remain an issue in DNA metabarcoding due to chimera formation). We conclude that the common practice of increasing the number of PCR cycles should not negatively impact the outcome of a high-throughput DNA barcoding study in terms of the occurrence of point mutations.
Computational Tools and Algorithms for Designing Customized Synthetic Genes

PubMed Central

Gould, Nathan; Hendy, Oliver; Papamichail, Dimitris

2014-01-01

Advances in DNA synthesis have enabled the construction of artificial genes, gene circuits, and genomes of bacterial scale. Freedom in de novo design of synthetic constructs provides significant power in studying the impact of mutations in sequence features, and verifying hypotheses on the functional information that is encoded in nucleic and amino acids. To aid this goal, a large number of software tools of variable sophistication have been implemented, enabling the design of synthetic genes for sequence optimization based on rationally defined properties. The first generation of tools dealt predominantly with singular objectives such as codon usage optimization and unique restriction site incorporation. Recent years have seen the emergence of sequence design tools that aim to evolve sequences toward combinations of objectives. The design of optimal protein-coding sequences adhering to multiple objectives is computationally hard, and most tools rely on heuristics to sample the vast sequence design space. In this review, we study some of the algorithmic issues behind gene optimization and the approaches that different tools have adopted to redesign genes and optimize desired coding features. We utilize test cases to demonstrate the efficiency of each approach, as well as identify their strengths and limitations. PMID:25340050
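To make the single-objective design mentioned in the abstract above concrete, here is a minimal Python sketch of codon usage optimization that back-translates a protein by always choosing the most frequent codon. The `USAGE` table is a small hypothetical excerpt with placeholder frequencies, and the function name is illustrative; neither comes from the review or from any specific tool it surveys.

```python
# Hypothetical codon-usage excerpt: amino acid -> {codon: relative frequency}.
# The frequencies are illustrative placeholders, not measured values.
USAGE = {
    "M": {"ATG": 1.00},
    "K": {"AAA": 0.74, "AAG": 0.26},
    "L": {"CTG": 0.47, "TTA": 0.14, "TTG": 0.13,
          "CTT": 0.12, "CTC": 0.10, "CTA": 0.04},
    "*": {"TAA": 0.61, "TGA": 0.30, "TAG": 0.09},
}


def most_frequent_codon_design(protein, usage=USAGE):
    """Back-translate a protein using the most frequent codon per residue."""
    codons = []
    for aa in protein:
        if aa not in usage:
            raise KeyError(f"no codon usage entry for residue {aa!r}")
        table = usage[aa]
        # Pick the codon with the highest relative frequency.
        codons.append(max(table, key=table.get))
    return "".join(codons)


design = most_frequent_codon_design("MKL*")
```

A greedy rule like this optimizes one objective only; the multi-objective tools described in the abstract would instead have to trade it off against other constraints, such as restriction sites, which is where heuristics become necessary.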
In vitro DNA binding studies of Aspartame, an artificial sweetener.

PubMed

Kashanian, Soheila; Khodaei, Mohammad Mehdi; Kheirdoosh, Fahimeh

2013-03-05

A number of small molecules bind directly and selectively to DNA, inhibiting replication, transcription or topoisomerase activity. In this work the interaction of native calf thymus DNA (CT-DNA) with Aspartame (APM), an artificial sweetener, was studied at physiological pH. A DNA binding study of APM is useful for understanding the APM-DNA interaction mechanism and for guiding the application and design of new and safer artificial sweeteners. The interaction was investigated using spectrophotometry, a spectrofluorometric competition experiment and circular dichroism (CD). Hypochromism and a red shift were observed in the UV absorption band of APM. A strong fluorescence quenching reaction of DNA to APM was observed, and the binding constants (Kf) of DNA with APM and the corresponding number of binding sites (n) were calculated at different temperatures. The thermodynamic parameters, enthalpy change (ΔH) and entropy change (ΔS), were calculated to be +181 kJ mol⁻¹ and +681 J mol⁻¹ K⁻¹ according to the Van't Hoff equation, which indicated that the reaction is predominantly entropically driven. Moreover, the spectrofluorometric competition experiment and circular dichroism (CD) results are indicative of the non-intercalative DNA binding nature of APM. We suggest that APM interacts with calf thymus DNA via a groove binding mode with an intrinsic binding constant of 5×10⁴ M⁻¹. Copyright © 2013 Elsevier B.V. All rights reserved.
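The "entropically driven" conclusion in the abstract above can be checked with a short calculation. The Python sketch below applies ΔG = ΔH - TΔS and the Van't Hoff relation ln K = -ΔH/(RT) + ΔS/R to the reported ΔH and ΔS. The temperature of 298.15 K is an assumption made for illustration (the abstract only says "different temperatures"), and the sketch is not meant to reproduce the reported binding constants.

```python
R = 8.314  # gas constant, J mol^-1 K^-1

DELTA_H = 181e3  # J/mol, value reported in the abstract
DELTA_S = 681.0  # J/(mol K), value reported in the abstract


def gibbs_free_energy(delta_h, delta_s, temperature):
    """Return dG = dH - T*dS in J/mol (T in kelvin)."""
    return delta_h - temperature * delta_s


def ln_binding_constant(delta_h, delta_s, temperature):
    """Van't Hoff relation: ln K = -dH/(R*T) + dS/R."""
    return -delta_h / (R * temperature) + delta_s / R


def crossover_temperature(delta_h, delta_s):
    """Temperature (K) at which dG changes sign, T = dH/dS."""
    return delta_h / delta_s


T_ASSUMED = 298.15  # K, assumed for illustration only
dg = gibbs_free_energy(DELTA_H, DELTA_S, T_ASSUMED)   # about -22 kJ/mol
t_cross = crossover_temperature(DELTA_H, DELTA_S)     # about 266 K
```

With both ΔH and ΔS positive, ΔG is negative only because the TΔS term outweighs the unfavourable enthalpy, which is what "entropically driven" means here.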
Integration of Lupinus angustifolius L. (narrow-leafed lupin) genome maps and comparative mapping within legumes.

PubMed

Wyrwa, Katarzyna; Książkiewicz, Michał; Szczepaniak, Anna; Susek, Karolina; Podkowiński, Jan; Naganowska, Barbara

2016-09-01

Narrow-leafed lupin (Lupinus angustifolius L.) has recently been considered a reference genome for the Lupinus genus. In the present work, genetic and cytogenetic maps of L. angustifolius were supplemented with 30 new molecular markers representing lupin genome regions, harboring genes involved in nitrogen fixation during the symbiotic interaction of legumes and soil bacteria (Rhizobiaceae). Our studies resulted in the precise localization of bacterial artificial chromosomes (BACs) carrying sequence variants for early nodulin 40, nodulin 26, nodulin 45, aspartate aminotransferase P2, asparagine synthetase, cytosolic glutamine synthetase, and phosphoenolpyruvate carboxylase. Together with previously mapped chromosomes, the integrated L. angustifolius map encompasses 73 chromosome markers, including 5S ribosomal DNA (rDNA) and 45S rDNA, and anchors 20 L. angustifolius linkage groups to corresponding chromosomes. Chromosomal identification using BAC fluorescence in situ hybridization identified two BAC clones as narrow-leafed lupin centromere-specific markers, which served as templates for preliminary studies of centromere composition within the genus. Bioinformatic analysis of these two BACs revealed that centromeric/pericentromeric regions of narrow-leafed lupin chromosomes consisted of simple sequence repeats ordered into tandem repeats containing the trinucleotide and pentanucleotide simple sequence repeats AGG and GATAC, structured into long arrays. Moreover, cross-genus microsynteny analysis revealed syntenic patterns of 31 single-locus BAC clones among several legume species. The gene and chromosome level findings provide evidence of ancient duplication events that must have occurred very early in the divergence of papilionoid lineages. This work provides a strong foundation for future comparative mapping among legumes and may facilitate understanding of mechanisms involved in shaping legume chromosomes.
Highly Efficient CRISPR/Cas9-Mediated Cloning and Functional Characterization of Gastric Cancer-Derived Epstein-Barr Virus Strains.

PubMed

Kanda, Teru; Furuse, Yuki; Oshitani, Hitoshi; Kiyono, Tohru

2016-05-01

The Epstein-Barr virus (EBV) is etiologically linked to approximately 10% of gastric cancers, in which viral genomes are maintained as multicopy episomes. EBV-positive gastric cancer cells are incompetent for progeny virus production, making viral DNA cloning extremely difficult. Here we describe a highly efficient strategy for obtaining bacterial artificial chromosome (BAC) clones of EBV episomes by utilizing a CRISPR/Cas9-mediated strand break of the viral genome and subsequent homology-directed repair. EBV strains maintained in two gastric cancer cell lines (SNU719 and YCCEL1) were cloned, and their complete viral genome sequences were determined. Infectious viruses of gastric cancer cell-derived EBVs were reconstituted, and the viruses established stable latent infections in immortalized keratinocytes. While Ras oncoprotein overexpression caused massive vacuolar degeneration and cell death in control keratinocytes, EBV-infected keratinocytes survived in the presence of Ras expression. These results implicate EBV infection in predisposing epithelial cells to malignant transformation by inducing resistance to oncogene-induced cell death. Recent progress in DNA-sequencing technology has accelerated EBV whole-genome sequencing, and the repertoire of sequenced EBV genomes is increasing progressively. Accordingly, the presence of EBV variant strains that may be relevant to EBV-associated diseases has begun to attract interest. Clearly, the determination of additional disease-associated viral genome sequences will facilitate the identification of any disease-specific EBV variants. We found that CRISPR/Cas9-mediated cleavage of EBV episomal DNA enabled the cloning of disease-associated viral strains with unprecedented efficiency. As a proof of concept, two gastric cancer cell-derived EBV strains were cloned, and the infection of epithelial cells with reconstituted viruses provided important clues about the mechanism of EBV-mediated epithelial carcinogenesis. 
This experimental system should contribute to establishing the relationship between viral genome variation and EBV-associated diseases. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Ribo HRM--detection of inter- and intra-species polymorphisms within ribosomal DNA by high resolution melting analysis supported by application of artificial allelic standards.

PubMed

Masny, Aleksander; Jagiełło, Agata; Płucienniczak, Grażyna; Golab, Elzbieta

2012-09-01

Ribo HRM, a single-tube PCR and high resolution melting (HRM) assay for detection of polymorphisms in the large subunit ribosomal DNA expansion segment V, was developed on a Trichinella model. Four Trichinella species: T. spiralis (isolates ISS3 and ISS160), T. nativa (isolates ISS10 and ISS70), T. britovi (isolates ISS2 and ISS392) and T. pseudospiralis (isolates ISS13 and ISS1348) were genotyped. Cloned allelic variants of the expansion segment V were used as standards to prepare reference HRM curves characteristic for single sequences and mixtures of several cloned sequences imitating allelic composition detected in Trichinella isolates. Using the primer pair Tsr1 and Trich1bi, it was possible to amplify a fragment of the ESV and detect PCR products obtained from the genomic DNA of pools of larvae belonging to the four investigated species: T. pseudospiralis, T. spiralis, T. britovi and T. nativa, in a single tube Real-Time PCR reaction. Differences in the shape of the HRM curves of Trichinella isolates suggested the presence of differences between examined isolates of T. nativa, T. britovi and T. pseudospiralis species. No differences were observed between T. spiralis isolates. The presence of polymorphisms within the amplified ESV sequence fragment of T. nativa, T. britovi and T. pseudospiralis was confirmed by sequencing of the cloned PCR products. Novel sequences were discovered and deposited in GenBank (GenBank IDs: JN971020-JN971027, JN120902.1, JN120903.1, JN120904.1, JN120906.1, JN120905.1). Screening the ESV region of Trichinella for polymorphism is possible using the genotyping assay Ribo HRM at the current state of its development. The Ribo HRM assay could be useful in phylogenetic studies of the Trichinella genus. Copyright © 2012 Elsevier B.V. All rights reserved.

Emerging Biomimetic Applications of DNA Nanotechnology.

PubMed

Shen, Haijing; Wang, Yingqian; Wang, Jie; Li, Zhihao; Yuan, Quan

2018-06-25

Re-engineering cellular components and biological processes has received great interest and promised compelling advantages in applications ranging from basic cell biology to biomedicine. With the advent of DNA nanotechnology, the programmable self-assembly ability makes DNA an appealing candidate for rational design of artificial components with different structures and functions. This Forum Article summarizes recent developments of DNA nanotechnology in mimicking the structures and functions of existing cellular components. We highlight key successes in the achievements of DNA-based biomimetic membrane proteins and discuss the assembly behavior of these artificial proteins. Then, we focus on the construction of higher-order structures by DNA nanotechnology to recreate cell-like structures. Finally, we explore the current challenges and speculate on future directions of DNA nanotechnology in biomimetics.
Novel Rhizosphere Soil Alleles for the Enzyme 1-Aminocyclopropane-1-Carboxylate Deaminase Queried for Function with an In Vivo Competition Assay

PubMed Central

Jin, Zhao; Di Rienzi, Sara C.; Janzon, Anders; Werner, Jeff J.; Angenent, Largus T.; Dangl, Jeffrey L.; Fowler, Douglas M.

2015-01-01

Metagenomes derived from environmental microbiota encode a vast diversity of protein homologs. How this diversity impacts protein function can be explored through selection assays aimed to optimize function. While artificially generated gene sequence pools are typically used in selection assays, their usage may be limited because of technical or ethical reasons. Here, we investigate an alternative strategy, the use of soil microbial DNA as a starting point. We demonstrate this approach by optimizing the function of a widely occurring soil bacterial enzyme, 1-aminocyclopropane-1-carboxylate (ACC) deaminase. We identified a specific ACC deaminase domain region (ACCD-DR) that, when PCR amplified from the soil, produced a variant pool that we could swap into functional plasmids carrying ACC deaminase-encoding genes. Functional clones of ACC deaminase were selected for in a competition assay based on their capacity to provide nitrogen to Escherichia coli in vitro. The most successful ACCD-DR variants were identified after multiple rounds of selection by sequence analysis. We observed that previously identified essential active-site residues were fixed in the original unselected library and that additional residues went to fixation after selection. We identified a divergent essential residue whose presence hints at the possible use of alternative substrates and a cluster of neutral residues that did not influence ACCD performance. Using an artificial ACCD-DR variant library generated by DNA oligomer synthesis, we validated the same fixation patterns. Our study demonstrates that soil metagenomes are useful starting pools of protein-coding-gene diversity that can be utilized for protein optimization and functional characterization when synthetic libraries are not appropriate. PMID:26637602
Performance of the ForenSeq™ DNA Signature Prep kit on highly degraded samples.

PubMed

Fattorini, Paolo; Previderé, Carlo; Carboni, Ilaria; Marrubini, Giorgio; Sorçaburu-Cigliero, Solange; Grignani, Pierangela; Bertoglio, Barbara; Vatta, Paolo; Ricci, Ugo

2017-04-01

Next generation sequencing (NGS) is the emerging technology in forensic genomics laboratories. It offers higher resolution to address most problems of human identification, greater efficiency and potential ability to interrogate very challenging forensic casework samples. In this study, a trial set of DNA samples was artificially degraded by progressive aqueous hydrolysis, and analyzed together with the corresponding unmodified DNA sample and control sample 2800M, to test the performance and reliability of the ForenSeq™ DNA Signature Prep kit using the MiSeq Sequencer (Illumina). The results of replicate tests performed on the unmodified sample (1.0 ng) and on scalar dilutions (1.0, 0.5 and 0.1 ng) of the reference sample 2800M showed the robustness and the reliability of the NGS approach even from sub-optimal amounts of high quality DNA. The degraded samples showed a very limited number of reads/sample, 2.9- to 10.2-fold lower than that reported for the least concentrated 2800M DNA dilution (0.1 ng). In addition, it was impossible to assign up to 78.2% of the genotypes in the degraded samples as the software identified the corresponding loci as "low coverage" (< 50x). Amplification artifacts such as allelic imbalances, allele drop-outs and a single allele drop-in were also scored in the degraded samples. However, the ForenSeq™ DNA Sequencing kit, on the Illumina MiSeq, was able to generate data which led to the correct typing of 5.1-44.8% and 10.9-58.7% of 58 of the STRs and 92 SNPs, respectively. In all trial samples, the SNP markers were more likely to be typed correctly than the STRs. This NGS approach showed very promising results in terms of ability to recover genetic information from heavily degraded DNA samples for which the conventional PCR/CE approach gave no results. The frequency of genetic mistyping was very low, reaching the value of 1.4% for only one of the degraded samples. 
However, these results suggest that further validation studies and a definition of interpretation criteria for NGS data are needed before implementation of this technique in forensic genetics. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Evaluation of Novel Broad-Range Real-Time PCR Assay for Rapid Detection of Human Pathogenic Fungi in Various Clinical Specimens

PubMed Central

Vollmer, Tanja; Störmer, Melanie; Kleesiek, Knut; Dreier, Jens

2008-01-01

In the present study, a novel broad-range real-time PCR was developed for the rapid detection of human pathogenic fungi. The assay targets a part of the 28S large-subunit ribosomal RNA (rDNA) gene. We investigated its application for the most important human pathogenic fungal genera, including Aspergillus, Candida, Cryptococcus, Mucor, Penicillium, Pichia, Microsporum, Trichophyton, and Scopulariopsis. Species were identified in PCR-positive reactions by direct DNA sequencing. A noncompetitive internal control was applied to prevent false-negative results due to PCR inhibition. The minimum detection limit for the PCR was determined to be one 28S rDNA copy per PCR, and the 95% detection limit was calculated to 15 copies per PCR. To assess the clinical applicability of the PCR method, intensive-care patients with artificial respiration and patients with infective endocarditis were investigated. For this purpose, 76 tracheal secretion samples and 70 heart valve tissues were analyzed in parallel by real-time PCR and cultivation. No discrepancies in results were observed between PCR analysis and cultivation methods. Furthermore, the application of the PCR method was investigated for other clinical specimens, including cervical swabs, nail and horny skin scrapings, and serum, blood, and urine samples. The combination of a broad-range real-time PCR and direct sequencing facilitates rapid screening for fungal infection in various clinical specimens. PMID:18385440
Evaluation of novel broad-range real-time PCR assay for rapid detection of human pathogenic fungi in various clinical specimens.

PubMed

Vollmer, Tanja; Störmer, Melanie; Kleesiek, Knut; Dreier, Jens

2008-06-01

In the present study, a novel broad-range real-time PCR was developed for the rapid detection of human pathogenic fungi. The assay targets a part of the 28S large-subunit ribosomal RNA (rDNA) gene. We investigated its application for the most important human pathogenic fungal genera, including Aspergillus, Candida, Cryptococcus, Mucor, Penicillium, Pichia, Microsporum, Trichophyton, and Scopulariopsis. Species were identified in PCR-positive reactions by direct DNA sequencing. A noncompetitive internal control was applied to prevent false-negative results due to PCR inhibition. The minimum detection limit for the PCR was determined to be one 28S rDNA copy per PCR, and the 95% detection limit was calculated to 15 copies per PCR. To assess the clinical applicability of the PCR method, intensive-care patients with artificial respiration and patients with infective endocarditis were investigated. For this purpose, 76 tracheal secretion samples and 70 heart valve tissues were analyzed in parallel by real-time PCR and cultivation. No discrepancies in results were observed between PCR analysis and cultivation methods. Furthermore, the application of the PCR method was investigated for other clinical specimens, including cervical swabs, nail and horny skin scrapings, and serum, blood, and urine samples. The combination of a broad-range real-time PCR and direct sequencing facilitates rapid screening for fungal infection in various clinical specimens.
Identification and characterization of TF1(phox), a DNA-binding protein that increases expression of gp91(phox) in PLB985 myeloid leukemia cells.

PubMed

Eklund, E A; Kakar, R

1997-04-04

The CYBB gene encodes gp91(phox), the heavy chain of the phagocyte-specific NADPH oxidase. CYBB is transcriptionally inactive until the promyelocyte stage of myelopoiesis, and in mature phagocytes, expression of gp91(phox) is further increased by interferon-gamma (IFN-gamma) and other inflammatory mediators. The CYBB promoter region contains several lineage-specific cis-elements involved in the IFN-gamma response. We screened a leukocyte cDNA expression library for proteins able to bind to one of these cis-elements (-214 to -262 base pairs) and identified TF1(phox), a protein with sequence-specific binding to the CYBB promoter. Electrophoretic mobility shift assay with nuclear proteins from a variety of cell lines demonstrated binding of a protein to the CYBB promoter that was cross-immunoreactive with TF1(phox). DNA binding of this protein was increased by IFN-gamma treatment in the myeloid cell line PLB985, but not in the non-myeloid cell line HeLa. Overexpression of recombinant TF1(phox) in PLB985 cells increased endogenous gp91(phox) message abundance, but did not lead to cellular differentiation. Overexpression of TF1(phox) in myeloid leukemia cell lines increased reporter gene expression from artificial promoter constructs containing CYBB promoter sequence. These data suggested that TF1(phox) increased expression of gp91(phox).
Scar-less multi-part DNA assembly design automation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hillson, Nathan J.

The present invention provides a method of designing an implementation of a DNA assembly. In an exemplary embodiment, the method includes (1) receiving a list of DNA sequence fragments to be assembled together and an order in which to assemble the DNA sequence fragments, (2) designing DNA oligonucleotides (oligos) for each of the DNA sequence fragments, and (3) creating a plan for adding flanking homology sequences to each of the DNA oligos. In an exemplary embodiment, the method includes (1) receiving a list of DNA sequence fragments to be assembled together and an order in which to assemble the DNA sequence fragments, (2) designing DNA oligonucleotides (oligos) for each of the DNA sequence fragments, and (3) creating a plan for adding optimized overhang sequences to each of the DNA oligos.
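The three steps listed in the abstract above can be sketched roughly in Python as follows. This is a simplified illustration for a linear assembly and not the patented method. The function name, the `homology_len` and `anneal_len` parameters and their default values are all hypothetical. For each junction, the oligo prepends the tail of the upstream fragment (the flanking homology) to the head of the downstream fragment.

```python
def plan_flanking_homology(fragments, homology_len=20, anneal_len=18):
    """Build one forward oligo per junction of an ordered fragment list.

    Each oligo is the last `homology_len` bases of the upstream fragment
    followed by the first `anneal_len` bases of the downstream fragment.
    Hypothetical simplification: linear assembly, forward oligos only.
    """
    plan = []
    for upstream, downstream in zip(fragments, fragments[1:]):
        flank = upstream[-homology_len:]   # homology to the upstream part
        anneal = downstream[:anneal_len]   # region that primes the fragment
        plan.append({"flank": flank, "anneal": anneal,
                     "oligo": flank + anneal})
    return plan


# Toy fragments, short enough to read by eye.
toy_plan = plan_flanking_homology(
    ["AAAACCCC", "GGGGTTTT", "ACGTACGT"], homology_len=4, anneal_len=4
)
```

A real design would also need reverse oligos, melting-temperature checks and, for the overhang variant, a search for overhangs that do not cross-anneal. None of that is attempted here.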
Disambiguate: An open-source application for disambiguating two species in next generation sequencing data from grafted samples.

PubMed

Ahdesmäki, Miika J; Gray, Simon R; Johnson, Justin H; Lai, Zhongwu

2016-01-01

Grafting of cell lines and primary tumours is a crucial step in the drug development process between cell line studies and clinical trials. Disambiguate is a program for computationally separating the sequencing reads of two species derived from grafted samples. Disambiguate operates on DNA or RNA-seq alignments to the two species and separates the components at very high sensitivity and specificity as illustrated in artificially mixed human-mouse samples. This allows for maximum recovery of data from target tumours for more accurate variant calling and gene expression quantification. Given that no general use open source algorithm accessible to the bioinformatics community exists for the purposes of separating the two species data, the proposed Disambiguate tool presents a novel approach and improvement to performing sequence analysis of grafted samples. Both Python and C++ implementations are available and they are integrated into several open and closed source pipelines. Disambiguate is open source and is freely available at https://github.com/AstraZeneca-NGS/disambiguate.
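The core idea of separating reads aligned to two species can be sketched as below. This is a deliberately simplified illustration, not the Disambiguate implementation. It assumes each read has already been reduced to one best alignment score per species (higher is better), and the function name and the dictionary input format are hypothetical.

```python
def disambiguate_reads(scores_a, scores_b):
    """Assign each read to species 'a', 'b', or 'ambiguous'.

    scores_a / scores_b map read name -> best alignment score against the
    respective reference; a read missing from a dict did not align there.
    """
    assignment = {}
    for read in set(scores_a) | set(scores_b):
        a = scores_a.get(read)
        b = scores_b.get(read)
        if b is None or (a is not None and a > b):
            assignment[read] = "a"
        elif a is None or b > a:
            assignment[read] = "b"
        else:
            # Equal scores: the read cannot be attributed to either species.
            assignment[read] = "ambiguous"
    return assignment


calls = disambiguate_reads(
    {"r1": 60, "r2": 30, "r3": 50},
    {"r2": 55, "r3": 50, "r4": 40},
)
```

Keeping an explicit "ambiguous" class instead of forcing a call is one way to favour specificity, at the cost of discarding some reads.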
Detection of Splice Sites Using Support Vector Machine

NASA Astrophysics Data System (ADS)

Varadwaj, Pritish; Purohit, Neetesh; Arora, Bhumika

Automatic identification and annotation of the exon and intron regions of genes from DNA sequences has been an important research area in the field of computational biology. Several approaches, viz. Hidden Markov Model (HMM), Artificial Intelligence (AI) based machine learning and Digital Signal Processing (DSP) techniques, have been used extensively and independently by various researchers to address this challenging task. In this work, we propose a Support Vector Machine based kernel learning approach for detection of splice sites (the exon-intron boundaries) in a gene. Electron-Ion Interaction Potential (EIIP) values of nucleotides have been used for mapping character sequences to corresponding numeric sequences. A Radial Basis Function (RBF) kernel SVM is trained using the EIIP numeric sequences. It was then tested on a test gene dataset for detection of splice sites by shifting a window of 12 residues. The window size and various important parameters of the SVM kernel were optimized for better accuracy. Receiver Operating Characteristic (ROC) curves have been utilized for displaying the sensitivity rate of the classifier, and the results showed 94.82% accuracy for splice site detection on the test dataset.
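The numeric mapping and window shifting described in the abstract above might look like the following Python sketch. The EIIP values are the ones commonly cited in the signal-processing literature and are not given in the abstract itself. The function names are illustrative, and the classifier training step is left out.

```python
# Commonly cited EIIP values per nucleotide (assumed, not from the abstract).
EIIP = {"A": 0.1260, "C": 0.1340, "G": 0.0806, "T": 0.1335}


def eiip_encode(seq):
    """Map a DNA string to its EIIP numeric sequence."""
    return [EIIP[base] for base in seq.upper()]


def sliding_windows(seq, size=12):
    """Yield (start, numeric_window) for every window of `size` residues."""
    for start in range(len(seq) - size + 1):
        yield start, eiip_encode(seq[start:start + size])


# Each window becomes one fixed-length feature vector for a classifier.
features = [window for _, window in sliding_windows("ACGTACGTACGTACG")]
```

Vectors of this kind could then be passed to any RBF-kernel SVM implementation, for example scikit-learn's `SVC(kernel="rbf")`. The abstract does not say which software the authors used.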
A unique chromatin complex occupies young α-satellite arrays of human centromeres

PubMed Central

Henikoff, Jorja G.; Thakur, Jitendra; Kasinathan, Sivakanthan; Henikoff, Steven

2015-01-01

The intractability of homogeneous α-satellite arrays has impeded understanding of human centromeres. Artificial centromeres are produced from higher-order repeats (HORs) present at centromere edges, although the exact sequences and chromatin conformations of centromere cores remain unknown. We use high-resolution chromatin immunoprecipitation (ChIP) of centromere components followed by clustering of sequence data as an unbiased approach to identify functional centromere sequences. We find that specific dimeric α-satellite units shared by multiple individuals dominate functional human centromeres. We identify two recently homogenized α-satellite dimers that are occupied by precisely positioned CENP-A (cenH3) nucleosomes with two ~100–base pair (bp) DNA wraps in tandem separated by a CENP-B/CENP-C–containing linker, whereas pericentromeric HORs show diffuse positioning. Precise positioning is largely maintained, whereas abundance decreases exponentially with divergence, which suggests that young α-satellite dimers with paired ~100-bp particles mediate evolution of functional human centromeres. Our unbiased strategy for identifying functional centromeric sequences should be generally applicable to tandem repeat arrays that dominate the centromeres of most eukaryotes. PMID:25927077
Run-length encoding graphic rules, biochemically editable designs and steganographical numeric data embedment for DNA-based cryptographical coding system.

PubMed

Kawano, Tomonori

2013-03-01

There have been a wide variety of approaches for handling pieces of DNA as "unplugged" tools for digital information storage and processing, including a series of studies in security-related areas such as DNA-based digital barcodes, watermarks and cryptography. In the present article, novel designs of artificial genes are proposed as media for storing digitally compressed image data for bio-computing purposes, whereas natural genes principally encode proteins. Furthermore, the proposed system allows cryptographical application of DNA through biochemically editable designs with capacity for steganographical numeric data embedment. As a model case of image-coding DNA technique application, numerically and biochemically combined protocols are employed for ciphering given "passwords" and/or secret numbers using DNA sequences. The "passwords" of interest were decomposed into single letters and translated into font images coded on separate DNA chains comprising both coding regions, in which the images are encoded based on the novel run-length encoding rule, and non-coding regions designed for biochemical editing and for the remodeling processes that reveal the hidden orientation of the letters composing the original "passwords." The latter processes require molecular biological tools for digestion and ligation of the fragmented DNA molecules, targeting the polymerase chain reaction-engineered termini of the chains. Lastly, additional protocols for steganographical overwriting of numeric data of interest over the image-coding DNA are also discussed.
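For readers unfamiliar with run-length encoding, the compression step that the abstract above builds on, here is a generic Python sketch applied to one row of a black-and-white font image. It shows textbook run-length encoding only. The article's own encoding rule and its mapping of runs onto nucleotides are not described in the abstract and are not reproduced here.

```python
def run_length_encode(bits):
    """Encode a string of '0'/'1' pixels as a list of (pixel, run_length)."""
    runs = []
    for pixel in bits:
        if runs and runs[-1][0] == pixel:
            # Same pixel as the current run: extend it by one.
            runs[-1] = (pixel, runs[-1][1] + 1)
        else:
            runs.append((pixel, 1))
    return runs


def run_length_decode(runs):
    """Rebuild the pixel string from (pixel, run_length) pairs."""
    return "".join(pixel * count for pixel, count in runs)


row = "0001111010"
encoded = run_length_encode(row)
```

Long uniform runs, which are typical of font bitmaps, collapse to a single pair each, which is why the scheme suits image data that must fit on short DNA chains.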
Systematic Characteristic Exploration of the Chimeras Generated in Multiple Displacement Amplification through Next Generation Sequencing Data Reanalysis

PubMed Central

Gao, Shen; Yao, Bei; Lu, Zuhong

2015-01-01

Background: The chimeric sequences produced by phi29 DNA polymerase, termed chimeras, impair the performance of multiple displacement amplification (MDA) and complicate sequence data processing. Although several articles have reported the existence of chimeric sequences, only one study has focused on the structure and generation mechanism of chimeras, and it was based on merely hundreds of chimeras found in the sequence data of the E. coli genome. Method: We mined a series of Next Generation Sequencing (NGS) reads that were used for whole-genome haplotype assembly in a primary study. We established a bioinformatics pipeline based on a subsection alignment strategy to discover all the chimeras in the data and visualize their structure. Then, we artificially defined two statistical indexes (the chimeric distance and the overlap length), whose regular abundance distributions helped illustrate the structural characteristics of the chimeras. Finally, we analyzed the relationship between the chimera type and the average insertion size, thereby illustrating a method to decrease the proportion of wasted data in DNA library construction. Results/Conclusion: 131.4 Gb of paired-end (PE) sequence data were reanalyzed for chimeras. In total, 40,259,438 read pairs (6.19%) with chimerism were discovered among 650,430,811 read pairs. The chimeric sequences consist of two or more parts that are located non-consecutively but adjacently on the chromosome. The chimeric distance between the locations of adjacent parts on the chromosome followed an approximately bimodal distribution ranging from 0 to over 5,000 nt, whose peak was at about 250 to 300 nt. The overlap length of adjacent parts followed an approximately Poisson distribution with a peak at 6 nt. Moreover, a linear correlation analysis indicated that unmapped chimeras, which were classified as wasted data, could be reduced by properly increasing the insertion segment size. Significance: This study profiled phi29 MDA chimeras using tens of millions of chimeric sequences and helped clarify the amplification mechanism of phi29 DNA polymerase. Our work also illustrated the importance of NGS data reanalysis, not only for improving data utilization efficiency but also for uncovering more potential genomic information. PMID:26440104
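The two indexes named in this abstract can be sketched for a read that aligns as two separate parts. The abstract does not give formal definitions, so the ones below are assumptions: overlap length is taken as the number of read bases shared by two adjacent parts, and chimeric distance as the gap between their locations on the same chromosome. The `Segment` type and all function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    """One aligned part of a read; all intervals are half-open [start, end)."""
    read_start: int
    read_end: int
    ref_start: int
    ref_end: int


def overlap_length(first, second):
    """Read bases shared by two adjacent parts (assumed definition)."""
    return max(0, first.read_end - second.read_start)


def chimeric_distance(first, second):
    """Gap on the reference between the two parts; 0 if they touch or overlap."""
    gap = max(first.ref_start, second.ref_start) - min(first.ref_end, second.ref_end)
    return max(0, gap)


def chimeric_percentage(chimeric_pairs, total_pairs):
    """Share of read pairs showing chimerism, in percent."""
    return 100.0 * chimeric_pairs / total_pairs


# Illustrative read, not taken from the study's data.
a = Segment(read_start=0, read_end=60, ref_start=1000, ref_end=1060)
b = Segment(read_start=54, read_end=100, ref_start=1310, ref_end=1356)
print(overlap_length(a, b), chimeric_distance(a, b))  # 6 250

# Counts quoted in the abstract.
print(round(chimeric_percentage(40_259_438, 650_430_811), 2))  # 6.19
```

The last line only confirms that the reported 6.19% follows from the two read-pair counts in the abstract.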
Identification of natural and artificial DNA substrates for the light-activated LOV-HTH transcription factor EL222

PubMed Central

Rivera-Cancel, Giomar; Motta-Mena, Laura B.; Gardner, Kevin H.

2012-01-01

Light-oxygen-voltage (LOV) domains serve as the photosensory modules for a wide range of plant and bacterial proteins, conferring blue light dependent regulation to effector activities as diverse as enzymes and DNA binding. LOV domains can also be engineered into a variety of exogenous targets, enabling similar regulation for new protein-based reagents. Common to these proteins is the ability for LOV domains to reversibly form a photochemical adduct between an internal flavin chromophore and the surrounding protein, using this to trigger conformational changes that affect output activity. Using the Erythrobacter litoralis protein EL222 model system which links LOV regulation to a helix-turn-helix (HTH) DNA binding domain, we demonstrated that the LOV domain binds and inhibits the HTH domain in the dark, releasing these interactions upon illumination [Nash et al. (2011) Proc. Natl. Acad. Sci. USA 108, 9449–9454]. Here we combine genomic and in vitro selection approaches to identify optimal DNA binding sites for EL222. Within the bacterial host, we observe binding at several genomic sites using a 12 bp sequence consensus that is also found by in vitro selection methods. Sequence-specific alterations in the DNA consensus reduce EL222-binding affinity in a manner consistent with the expected binding mode: a protein dimer binding to two repeats. Finally, we demonstrate the light-dependent activation of transcription of two genes adjacent to an EL222 binding site. Taken together, these results shed light on the native function of EL222 and provide useful reagents for further basic and applications research of this versatile protein. PMID:23205774
Learning of pitch and time structures in an artificial grammar setting.

PubMed

Prince, Jon B; Stevens, Catherine J; Jones, Mari Riess; Tillmann, Barbara

2018-04-12

Despite the empirical evidence for the power of the cognitive capacity of implicit learning of structures and regularities in several modalities and materials, it remains controversial whether implicit learning extends to the learning of temporal structures and regularities. We investigated whether (a) an artificial grammar can be learned equally well when expressed in duration sequences as when expressed in pitch sequences, (b) learning of the artificial grammar in either duration or pitch (as the primary dimension) sequences can be influenced by the properties of the secondary dimension (invariant vs. randomized), and (c) learning can be boosted when the artificial grammar is expressed in both pitch and duration. After an exposure phase with grammatical sequences, learning in a subsequent test phase was assessed in a grammaticality judgment task. Participants in both the pitch and duration conditions showed incidental (not fully implicit) learning of the artificial grammar when the secondary dimension was invariant, but randomizing the pitch sequence prevented learning of the artificial grammar in duration sequences. Expressing the artificial grammar in both pitch and duration resulted in disproportionately better performance, suggesting an interaction between the learning of pitch and temporal structure. The findings are relevant to research investigating the learning of temporal structures and the learning of structures presented simultaneously in 2 dimensions (e.g., space and time, space and objects). By investigating learning, the findings provide further insight into the potential specificity of pitch and time processing, and their integrated versus independent processing, as previously debated in music cognition research. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Droplet microfluidics for synthetic biology

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gach, Philip Charles; Iwai, Kosuke; Kim, Peter Wonhee

Here, synthetic biology is an interdisciplinary field that aims to engineer biological systems for useful purposes. Organism engineering often requires the optimization of individual genes and/or entire biological pathways (consisting of multiple genes). Advances in DNA sequencing and synthesis have recently begun to enable the possibility of evaluating thousands of gene variants and hundreds of thousands of gene combinations. However, such large-scale optimization experiments remain cost-prohibitive to researchers following traditional molecular biology practices, which are frequently labor-intensive and suffer from poor reproducibility. Liquid handling robotics may reduce labor and improve reproducibility, but are themselves expensive and thus inaccessible to most researchers. Microfluidic platforms offer a lower entry price point alternative to robotics, and maintain high throughput and reproducibility while further reducing operating costs through diminished reagent volume requirements. Droplet microfluidics have shown exceptional promise for synthetic biology experiments, including DNA assembly, transformation/transfection, culturing, cell sorting, phenotypic assays, artificial cells and genetic circuits.
Metal binding proteins, recombinant host cells and methods

DOEpatents

Summers, Anne O.; Caguiat, Jonathan J.

2004-06-15

The present disclosure provides artificial heavy metal binding proteins termed chelons by the inventors. These chelons bind cadmium and/or mercuric ions with relatively high affinity. Also disclosed are coding sequences, recombinant DNA molecules and recombinant host cells comprising those recombinant DNA molecules for expression of the chelon proteins. In the recombinant host cells or transgenic plants, the chelons can be used to bind heavy metals taken up from contaminated soil, groundwater or irrigation water and to concentrate and sequester those ions. Recombinant enteric bacteria can be used within the gastrointestinal tracts of animals or humans exposed to toxic metal ions such as mercury and/or cadmium, where the recombinantly expressed chelon is chosen in accordance with the ion to be remediated. Alternatively, the chelons can be immobilized to solid supports to bind and concentrate heavy metals from a contaminated aqueous medium including biological fluids.
Droplet microfluidics for synthetic biology

DOE PAGES

Gach, Philip Charles; Iwai, Kosuke; Kim, Peter Wonhee; ...

2017-08-10

Here, synthetic biology is an interdisciplinary field that aims to engineer biological systems for useful purposes. Organism engineering often requires the optimization of individual genes and/or entire biological pathways (consisting of multiple genes). Advances in DNA sequencing and synthesis have recently begun to enable the possibility of evaluating thousands of gene variants and hundreds of thousands of gene combinations. However, such large-scale optimization experiments remain cost-prohibitive to researchers following traditional molecular biology practices, which are frequently labor-intensive and suffer from poor reproducibility. Liquid handling robotics may reduce labor and improve reproducibility, but are themselves expensive and thus inaccessible to most researchers. Microfluidic platforms offer a lower entry price point alternative to robotics, and maintain high throughput and reproducibility while further reducing operating costs through diminished reagent volume requirements. Droplet microfluidics have shown exceptional promise for synthetic biology experiments, including DNA assembly, transformation/transfection, culturing, cell sorting, phenotypic assays, artificial cells and genetic circuits.
Developmental history and application of CRISPR in human disease.

PubMed

Liang, Puping; Zhang, Xiya; Chen, Yuxi; Huang, Junjiu

2017-06-01

Genome-editing tools are programmable artificial nucleases, mainly including zinc-finger nucleases, transcription activator-like effector nucleases and clustered regularly interspaced short palindromic repeat (CRISPR). By recognizing and cleaving specific DNA sequences, genome-editing tools make it possible to generate site-specific DNA double-strand breaks (DSBs) in the genome. DSBs will then be repaired by either error-prone nonhomologous end joining or high-fidelity homologous recombination mechanisms. Through these two different mechanisms, endogenous genes can be knocked out or precisely repaired/modified. Rapid developments in genome-editing tools, especially CRISPR, have revolutionized human disease models generation, for example, various zebrafish, mouse, rat, pig, monkey and human cell lines have been constructed. Here, we review the developmental history of CRISPR and its application in studies of human diseases. In addition, we also briefly discussed the therapeutic application of CRISPR in the near future. Copyright © 2017 John Wiley & Sons, Ltd.
Logical Framework of Forensic Identification: Ability to Resist Fabricated DNA.

PubMed

Wang, Zheng; Zhou, Di; Zhang, Suhua; Bian, Yingnan; Hu, Zhen; Zhu, Ruxin; Lu, Daru; Li, Chengtao

2015-12-01

Over the past 30 years, DNA analysis has revolutionized forensic science and has become the most useful single tool in the multifaceted fight against crime. Today, DNA profiling with sets of highly polymorphic autosomal short tandem repeat markers is widely employed and accepted in the courts due to its high discriminating power and reliability. However, an artificial bloodstain purposefully created using molecular biology techniques succeeded in tricking a leading forensic DNA laboratory. The disturbing possibility that a forensic DNA profile can be faked shocked the general public and the mass media, and generated serious discussion about the credibility of DNA evidence. Herein, we present two exemplary assays based on tissue-specific methylation patterns and cell-specific mRNA expression, respectively. These two assays can be integrated into DNA analysis pipelines without consumption of additional samples. We show that the two assays can not only distinguish between artificial and genuine samples, but also provide information on tissue origin. The two assays were tested on natural and artificial bloodstains (generated by polymerase chain reaction and whole genome amplification techniques), and the results illustrated that the logical framework of forensic identification remains useful for forensic identification with high credibility.
Experimental evolution reveals genome-wide spectrum and dynamics of mutations in the rice blast fungus, Magnaporthe oryzae.

PubMed

Jeon, Junhyun; Choi, Jaeyoung; Lee, Gir-Won; Dean, Ralph A; Lee, Yong-Hwan

2013-01-01

Knowledge on mutation processes is central to interpreting genetic analysis data as well as understanding the underlying nature of almost all evolutionary phenomena. However, studies on genome-wide mutational spectrum and dynamics in fungal pathogens are scarce, hindering our understanding of their evolution and biology. Here, we explored changes in the phenotypes and genome sequences of the rice blast fungus Magnaporthe oryzae during the forced in vitro evolution by weekly transfer of cultures on artificial media. Through combination of experimental evolution with high throughput sequencing technology, we found that mutations accumulate rapidly prior to visible phenotypic changes and that both genetic drift and selection seem to contribute to shaping mutational landscape, suggesting the buffering capacity of fungal genome against mutations. Inference of mutational effects on phenotypes through the use of T-DNA insertion mutants suggested that at least some of the DNA sequence mutations are likely associated with the observed phenotypic changes. Furthermore, our data suggest oxidative damages and UV as major sources of mutation during subcultures. Taken together, our work revealed important properties of original source of variation in the genome of the rice blast fungus. We believe that these results provide not only insights into stability of pathogenicity and genome evolution in plant pathogenic fungi but also a model in which evolution of fungal pathogens in natura can be comparatively investigated.

Construction and Analysis of Siberian Tiger Bacterial Artificial Chromosome Library with Approximately 6.5-Fold Genome Equivalent Coverage

PubMed Central

Liu, Changqing; Bai, Chunyu; Guo, Yu; Liu, Dan; Lu, Taofeng; Li, Xiangchen; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun

2014-01-01

Bacterial artificial chromosome (BAC) libraries are extremely valuable for the genome-wide genetic dissection of complex organisms. The Siberian tiger, one of the most well-known wild primitive carnivores in China, is an endangered animal. In order to promote research on its genome, a high-redundancy BAC library of the Siberian tiger was constructed and characterized. The library is divided into two sub-libraries prepared from blood cells and two sub-libraries prepared from fibroblasts. This BAC library contains 153,600 individually archived clones; for PCR-based screening of the library, BACs were placed into 40 superpools of 10 × 384-deep well microplates. The average insert size of BAC clones was estimated to be 116.5 kb, representing approximately 6.46 genome equivalents of the haploid genome and affording a 98.86% statistical probability of obtaining at least one clone containing a unique DNA sequence. Screening the library with 19 microsatellite markers and an SRY sequence revealed that each of these markers was present in the library; the average number of positive clones per marker was 6.74 (range 2 to 12), consistent with 6.46-fold coverage of the tiger genome. Additionally, we identified 72 microsatellite markers that could potentially be used as genetic markers. This BAC library will serve as a valuable resource for physical mapping, comparative genomic study and large-scale genome sequencing in the tiger. PMID:24608928
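The library arithmetic in this abstract can be checked with a short sketch. The clone count, insert size and coverage are taken from the abstract; the haploid genome size is not stated there, so it is back-calculated here as an assumption. The probability function uses the commonly cited Clarke-Carbon style estimate, which is also an assumption: it yields a somewhat higher value than the 98.86% quoted, so the authors presumably applied different assumptions, and this sketch does not try to reproduce their figure.

```python
# Figures quoted in the abstract.
CLONES = 153_600
AVG_INSERT_BP = 116_500
REPORTED_COVERAGE = 6.46


def clones_in_pools(superpools, plates_per_pool, wells_per_plate):
    """Total clones archived across all superpools."""
    return superpools * plates_per_pool * wells_per_plate


def implied_genome_size_bp(clones, insert_bp, coverage):
    """Haploid genome size implied by coverage = clones * insert / genome."""
    return clones * insert_bp / coverage


def prob_sequence_present(clones, insert_bp, genome_bp):
    """Estimate P = 1 - (1 - insert/genome)^clones for a unique sequence.

    Assumes clones sample the genome uniformly and independently.
    """
    return 1.0 - (1.0 - insert_bp / genome_bp) ** clones


# 40 superpools of 10 plates with 384 wells each.
print(clones_in_pools(40, 10, 384))  # 153600

genome_bp = implied_genome_size_bp(CLONES, AVG_INSERT_BP, REPORTED_COVERAGE)
print(round(genome_bp / 1e9, 2))  # 2.77

print(prob_sequence_present(CLONES, AVG_INSERT_BP, genome_bp))
```

The first result shows that the pooling scheme accounts for exactly the 153,600 archived clones, and the second that the reported 6.46-fold coverage corresponds to a haploid genome of roughly 2.77 Gb.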
Hymenobacter aquatilis sp. nov., isolated from a mesotrophic artificial lake.

PubMed

Kang, Heeyoung; Cha, Inseong; Kim, Haneul; Joh, Kiseong

2018-06-01

A novel strain, designated HMF3095 T , isolated from freshwater of a mesotrophic artificial lake in the Republic of Korea, was characterized by polyphasic taxonomy. The cells were Gram-stain-negative, aerobic, non-motile, straight rods and formed reddish colonies. Phylogenetic analysis based on its 16S rRNA gene sequence revealed that strain HMF3095 T fell within the cluster of the genus Hymenobacter and was most closely related to Hymenobacter seoulensis 16F7G T and Hymenobacter tenuis POB6 T (96.7 % sequence similarity). Sequence similarities to all other type strains were 96.3 % or less. The major fatty acids were iso-C15 : 0, C16 : 1ω5c, summed feature 4 (iso-C17 : 1 I and/or anteiso-C17 : 1 B), summed feature 3 (C16 : 1ω7c and/or C16 : 1ω6c) and anteiso-C15 : 0. The major isoprenoid quinone was menaquinone 7. The major polar lipids were phosphatidylethanolamine, three unidentified aminophospholipids and one unidentified phospholipid. The DNA G+C content was 58.9 mol%. On the basis of the evidence presented in this study, strain HMF3095 T represents a novel species of the genus Hymenobacter, for which the name Hymenobacter aquatilis sp. nov. is proposed. The type strain is HMF3095 T (=KCTC 52398 T =NBRC 112669 T ).
Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions

DOEpatents

Gardner, Shea N; Mariella, Jr., Raymond P; Christian, Allen T; Young, Jennifer A; Clague, David S

2013-06-25

A method of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths.
Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions

DOEpatents

Gardner, Shea N [San Leandro, CA]; Mariella, Jr., Raymond P.; Christian, Allen T [Tracy, CA]; Young, Jennifer A [Berkeley, CA]; Clague, David S [Livermore, CA]

2011-01-18

A method of fabricating a DNA molecule of user-defined sequence. The method comprises the steps of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an even or odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths. In one embodiment starting sequence fragments are of different lengths, n, n+1, n+2, etc.
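The computational side of this method, preselecting segments of length n that overlap by a user-specified number of bases, can be sketched as a simple tiling of the target sequence. This is only an illustration under stated assumptions: the function names are hypothetical, a single fixed segment length is used, and the temporal separation, polymerase extension, multiple reading frames and bioinformatics predictions described in the patent are not modeled.

```python
def split_into_segments(target, length, overlap):
    """Tile `target` with segments of `length` bases sharing `overlap` bases.

    The final segment may be shorter than `length`.
    """
    if not 0 <= overlap < length:
        raise ValueError("overlap must satisfy 0 <= overlap < length")
    step = length - overlap
    segments = []
    start = 0
    while True:
        segments.append(target[start:start + length])
        if start + length >= len(target):
            break
        start += step
    return segments


def reassemble(segments, overlap):
    """Join consecutive segments, checking that each overlap matches exactly."""
    assembled = segments[0]
    for segment in segments[1:]:
        if not assembled.endswith(segment[:overlap]):
            raise ValueError("segments do not overlap as expected")
        assembled += segment[overlap:]
    return assembled


# Illustrative 20-base target, not a sequence from the patent.
target = "ATGCGTACGTTAGCCTAGGA"
segments = split_into_segments(target, length=8, overlap=3)
print(segments)  # ['ATGCGTAC', 'TACGTTAG', 'TAGCCTAG', 'TAGGA']
print(reassemble(segments, overlap=3) == target)  # True
```

Changing `overlap` while keeping `length` fixed shows how the number of segments needed to span the same region grows as the hybridizing overlap gets longer.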
Data series embedding and scale invariant statistics.

PubMed

Michieli, I; Medved, B; Ristov, S

2010-06-01

Data sequences acquired from bio-systems, such as human gait data, heart rate interbeat data, or DNA sequences, exhibit complex dynamics that are frequently described by a long-memory or power-law decay of the autocorrelation function. One way of characterizing these dynamics is through scale invariant statistics or "fractal-like" behavior. Several methods have been proposed for quantifying scale invariant parameters of physiological signals. Among them the most common are detrended fluctuation analysis, sample mean variance analyses, power spectral density analysis, R/S analysis, and recently, in the realm of the multifractal approach, wavelet analysis. In this paper it is demonstrated that embedding the time series data in a high-dimensional pseudo-phase space reveals scale invariant statistics in a simple fashion. The procedure is applied to different stride interval data sets from human gait measurement time series (Physio-Bank data library). Results show that the introduced mapping adequately separates long-memory from random behavior. Smaller gait data sets were analyzed and scale-free trends for limited scale intervals were successfully detected. The method was verified on artificially produced time series with known scaling behavior and with varying noise content. The possibility for the method to falsely detect long-range dependence in artificially generated short-range dependence series was investigated. (c) 2009 Elsevier B.V. All rights reserved.
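Embedding a data series in a higher-dimensional pseudo-phase space is commonly done with a time-delay construction, sketched below. The abstract does not specify the mapping or the statistics the authors computed on the embedded points, so this standard delay embedding, the `delay_embed` name, and its parameters are assumptions offered only to show what such an embedding looks like.

```python
def delay_embed(series, dimension, delay=1):
    """Return delay vectors (x[i], x[i+delay], ..., x[i+(dimension-1)*delay])."""
    if dimension < 1 or delay < 1:
        raise ValueError("dimension and delay must be positive")
    n_vectors = len(series) - (dimension - 1) * delay
    if n_vectors < 1:
        raise ValueError("series too short for this dimension and delay")
    return [
        tuple(series[i + j * delay] for j in range(dimension))
        for i in range(n_vectors)
    ]


# Illustrative values, not data from the study.
stride_intervals = [1.10, 1.12, 1.08, 1.11, 1.09]
for point in delay_embed(stride_intervals, dimension=3):
    print(point)
# (1.1, 1.12, 1.08)
# (1.12, 1.08, 1.11)
# (1.08, 1.11, 1.09)
```

Each tuple is one point in the embedding space; any scale-invariance statistic would then be computed over these points rather than over the raw series.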
Multivariate Calibration Approach for Quantitative Determination of Cell-Line Cross Contamination by Intact Cell Mass Spectrometry and Artificial Neural Networks.

PubMed

Valletta, Elisa; Kučera, Lukáš; Prokeš, Lubomír; Amato, Filippo; Pivetta, Tiziana; Hampl, Aleš; Havel, Josef; Vaňhara, Petr

2016-01-01

Cross-contamination of eukaryotic cell lines used in biomedical research represents a highly relevant problem. Analysis of repetitive DNA sequences, such as Short Tandem Repeats (STR), or Simple Sequence Repeats (SSR), is a widely accepted, simple, and commercially available technique to authenticate cell lines. However, it provides only qualitative information that depends on the extent of reference databases for interpretation. In this work, we developed and validated a rapid and routinely applicable method for evaluation of cell culture cross-contamination levels based on mass spectrometric fingerprints of intact mammalian cells coupled with artificial neural networks (ANNs). We used human embryonic stem cells (hESCs) contaminated by either mouse embryonic stem cells (mESCs) or mouse embryonic fibroblasts (MEFs) as a model. We determined the contamination level using a mass spectra database of known calibration mixtures that served as training input for an ANN. The ANN was then capable of correct quantification of the level of contamination of hESCs by mESCs or MEFs. We demonstrate that MS analysis, when linked to proper mathematical instruments, is a tangible tool for unraveling and quantifying heterogeneity in cell cultures. The analysis is applicable in routine scenarios for cell authentication and/or cell phenotyping in general.
Multivariate Calibration Approach for Quantitative Determination of Cell-Line Cross Contamination by Intact Cell Mass Spectrometry and Artificial Neural Networks

PubMed Central

Prokeš, Lubomír; Amato, Filippo; Pivetta, Tiziana; Hampl, Aleš; Havel, Josef; Vaňhara, Petr

2016-01-01

Cross-contamination of eukaryotic cell lines used in biomedical research represents a highly relevant problem. Analysis of repetitive DNA sequences, such as Short Tandem Repeats (STR), or Simple Sequence Repeats (SSR), is a widely accepted, simple, and commercially available technique to authenticate cell lines. However, it provides only qualitative information that depends on the extent of reference databases for interpretation. In this work, we developed and validated a rapid and routinely applicable method for evaluation of cell culture cross-contamination levels based on mass spectrometric fingerprints of intact mammalian cells coupled with artificial neural networks (ANNs). We used human embryonic stem cells (hESCs) contaminated by either mouse embryonic stem cells (mESCs) or mouse embryonic fibroblasts (MEFs) as a model. We determined the contamination level using a mass spectra database of known calibration mixtures that served as training input for an ANN. The ANN was then capable of correct quantification of the level of contamination of hESCs by mESCs or MEFs. We demonstrate that MS analysis, when linked to proper mathematical instruments, is a tangible tool for unraveling and quantifying heterogeneity in cell cultures. The analysis is applicable in routine scenarios for cell authentication and/or cell phenotyping in general. PMID:26821236
PEGylation enhances tumor targeting of plasmid DNA by an artificial cationized protein with repeated RGD sequences, Pronectin.

PubMed

Hosseinkhani, Hossein; Tabata, Yasuhiko

2004-05-31

The objective of this study is to investigate feasibility of a non-viral gene carrier with repeated RGD sequences (Pronectin F+) in tumor targeting for gene expression. The Pronectin F+ was cationized by introducing spermine (Sm) to the hydroxyl groups to allow to polyionically complex with plasmid DNA. The cationized Pronectin F+ prepared was additionally modified with poly(ethylene glycol) (PEG) molecules which have active ester and methoxy groups at the terminal, to form various PEG-introduced cationized Pronectin F+. The cationized Pronectin F+ with or without PEGylation at different extents was mixed with a plasmid DNA of LacZ to form respective cationized Pronectin F+-plasmid DNA complexes. The plasmid DNA was electrophoretically complexed with cationized Pronectin F+ and PEG-introduced cationized Pronectin F+, irrespective of the PEGylation extent, although the higher N/P ratio of complexes was needed for complexation with the latter Pronectin F+. The molecular size and zeta potential measurements revealed that the plasmid DNA was reduced in size to about 250 nm and the charge was changed to be positive by the complexation with cationized Pronectin F+. For the complexation with PEG-introduced cationized Pronectin F+, the charge of complex became neutral being almost 0 mV with the increasing PEGylation extents, while the molecular size was similar to that of cationized Pronectin F+. When cationized Pronectin F+-plasmid DNA complexes with or without PEGylation were intravenously injected to mice carrying a subcutaneous Meth-AR-1 fibrosarcoma mass, the PEG-introduced cationized Pronectin F+-plasmid DNA complex specifically enhanced the level of gene expression in the tumor, to a significantly high extent compared with the cationized Pronectin F+-plasmid DNA complexes and free plasmid DNA. The enhanced level of gene expression depended on the percentage of PEG introduced, the N/P ratio, and the plasmid DNA dose. 
A fluorescent microscopic study revealed that localization of plasmid DNA in the tumor tissue was observed only for the injected PEG-introduced cationized Pronectin F+-plasmid DNA complex. We conclude that the PEGylation of cationized Pronectin F+ is a promising way to enable the plasmid DNA to target the tumor for gene expression. Copyright 2004 Elsevier B.V.
Rapid CRISPR/Cas9-Mediated Cloning of Full-Length Epstein-Barr Virus Genomes from Latently Infected Cells.

PubMed

Yajima, Misako; Ikuta, Kazufumi; Kanda, Teru

2018-04-03

Herpesviruses have relatively large DNA genomes of more than 150 kb that are difficult to clone and sequence. Bacterial artificial chromosome (BAC) cloning of herpesvirus genomes is a powerful technique that greatly facilitates whole viral genome sequencing as well as functional characterization of reconstituted viruses. We describe recently invented technologies for rapid BAC cloning of herpesvirus genomes using CRISPR/Cas9-mediated homology-directed repair. We focus on recent BAC cloning techniques of Epstein-Barr virus (EBV) genomes and discuss the possible advantages of a CRISPR/Cas9-mediated strategy comparatively with precedent EBV-BAC cloning strategies. We also describe the design decisions of this technology as well as possible pitfalls and points to be improved in the future. The obtained EBV-BAC clones are subjected to long-read sequencing analysis to determine complete EBV genome sequence including repetitive regions. Rapid cloning and sequence determination of various EBV strains will greatly contribute to the understanding of their global geographical distribution. This technology can also be used to clone disease-associated EBV strains and test the hypothesis that they have special features that distinguish them from strains that infect asymptomatically.
Rapid CRISPR/Cas9-Mediated Cloning of Full-Length Epstein-Barr Virus Genomes from Latently Infected Cells

PubMed Central

Ikuta, Kazufumi; Kanda, Teru

2018-01-01

Herpesviruses have relatively large DNA genomes of more than 150 kb that are difficult to clone and sequence. Bacterial artificial chromosome (BAC) cloning of herpesvirus genomes is a powerful technique that greatly facilitates whole viral genome sequencing as well as functional characterization of reconstituted viruses. We describe recently invented technologies for rapid BAC cloning of herpesvirus genomes using CRISPR/Cas9-mediated homology-directed repair. We focus on recent BAC cloning techniques of Epstein-Barr virus (EBV) genomes and discuss the possible advantages of a CRISPR/Cas9-mediated strategy comparatively with precedent EBV-BAC cloning strategies. We also describe the design decisions of this technology as well as possible pitfalls and points to be improved in the future. The obtained EBV-BAC clones are subjected to long-read sequencing analysis to determine complete EBV genome sequence including repetitive regions. Rapid cloning and sequence determination of various EBV strains will greatly contribute to the understanding of their global geographical distribution. This technology can also be used to clone disease-associated EBV strains and test the hypothesis that they have special features that distinguish them from strains that infect asymptomatically. PMID:29614006
Cloning of a very virulent plus, 686 strain of Marek’s disease virus as a bacterial artificial chromosome

USDA-ARS's Scientific Manuscript database

Bacterial artificial chromosome (BAC) vectors were first developed to facilitate propagation and manipulation of large DNA fragments. This technology was later used to clone full-length genomes of large DNA viruses to study viral gene function. Marek’s disease virus (MDV) is a highly oncogenic herpe...
Mitochondrial genome rearrangements in glomus species triggered by homologous recombination between distinct mtDNA haplotypes.

PubMed

Beaudet, Denis; Terrat, Yves; Halary, Sébastien; de la Providencia, Ivan Enrique; Hijri, Mohamed

2013-01-01

Comparative mitochondrial genomics of arbuscular mycorrhizal fungi (AMF) provide new avenues to overcome long-lasting obstacles that have hampered studies aimed at understanding the community structure, diversity, and evolution of these multinucleated and genetically polymorphic organisms. AMF mitochondrial (mt) genomes are homogeneous within isolates, and their intergenic regions harbor numerous mobile elements that have rapidly diverged, including homing endonuclease genes, small inverted repeats, and plasmid-related DNA polymerase genes (dpo), making them suitable targets for the development of reliable strain-specific markers. However, these elements may also lead to genome rearrangements through homologous recombination, although this has never previously been reported in this group of obligate symbiotic fungi. To investigate whether such rearrangements are present and caused by mobile elements in AMF, the mitochondrial genomes from two Glomeraceae members (i.e., Glomus cerebriforme and Glomus sp.) with substantial mtDNA synteny divergence, were sequenced and compared with available glomeromycotan mitochondrial genomes. We used an extensive nucleotide/protein similarity network-based approach to investigate dpo diversity in AMF as well as in other organisms for which sequences are publicly available. We provide strong evidence of dpo-induced inter-haplotype recombination, leading to a reshuffled mitochondrial genome in Glomus sp. These findings raise questions as to whether AMF single spore cultivations artificially underestimate mtDNA genetic diversity. We assessed potential dpo dispersal mechanisms in AMF and inferred a robust phylogenetic relationship with plant mitochondrial plasmids. Along with other indirect evidence, our analyses indicate that members of the Glomeromycota phylum are potential donors of mitochondrial plasmids to plants.
Mitochondrial Genome Rearrangements in Glomus Species Triggered by Homologous Recombination between Distinct mtDNA Haplotypes

PubMed Central

Beaudet, Denis; Terrat, Yves; Halary, Sébastien; de la Providencia, Ivan Enrique; Hijri, Mohamed

2013-01-01

Comparative mitochondrial genomics of arbuscular mycorrhizal fungi (AMF) provide new avenues to overcome long-lasting obstacles that have hampered studies aimed at understanding the community structure, diversity, and evolution of these multinucleated and genetically polymorphic organisms. AMF mitochondrial (mt) genomes are homogeneous within isolates, and their intergenic regions harbor numerous mobile elements that have rapidly diverged, including homing endonuclease genes, small inverted repeats, and plasmid-related DNA polymerase genes (dpo), making them suitable targets for the development of reliable strain-specific markers. However, these elements may also lead to genome rearrangements through homologous recombination, although this has never previously been reported in this group of obligate symbiotic fungi. To investigate whether such rearrangements are present and caused by mobile elements in AMF, the mitochondrial genomes from two Glomeraceae members (i.e., Glomus cerebriforme and Glomus sp.) with substantial mtDNA synteny divergence, were sequenced and compared with available glomeromycotan mitochondrial genomes. We used an extensive nucleotide/protein similarity network-based approach to investigate dpo diversity in AMF as well as in other organisms for which sequences are publicly available. We provide strong evidence of dpo-induced inter-haplotype recombination, leading to a reshuffled mitochondrial genome in Glomus sp. These findings raise questions as to whether AMF single spore cultivations artificially underestimate mtDNA genetic diversity. We assessed potential dpo dispersal mechanisms in AMF and inferred a robust phylogenetic relationship with plant mitochondrial plasmids. Along with other indirect evidence, our analyses indicate that members of the Glomeromycota phylum are potential donors of mitochondrial plasmids to plants. PMID:23925788
Toward a Molecular Cytogenetic Map for Cultivated Sunflower (Helianthus annuus L.) by Landed BAC/BIBAC Clones

PubMed Central

Feng, Jiuhuan; Liu, Zhao; Cai, Xiwen; Jan, Chao-Chien

2013-01-01

Conventional karyotypes and various genetic linkage maps have been established in sunflower (Helianthus annuus L., 2n = 34). However, the relationship between linkage groups and individual chromosomes of sunflower remains unknown and has considerable relevance for the sunflower research community. Recently, a set of linkage group-specific bacterial/binary bacterial artificial chromosome (BAC/BIBAC) clones was identified from two complementary BAC and BIBAC libraries constructed for cultivated sunflower cv. HA89. In the present study, we used these linkage group-specific clones (∼100 kb in size) as probes to in situ hybridize to HA89 mitotic chromosomes at metaphase using the BAC-fluorescence in situ hybridization (FISH) technique. Because a characteristic of the sunflower genome is the abundance of repetitive DNA sequences, a high ratio of blocking DNA to probe DNA was applied to hybridization reactions to minimize the background noise. As a result, all sunflower chromosomes were anchored by one or two BAC/BIBAC clones with specific FISH signals. FISH analysis based on tandem repetitive sequences, such as rRNA genes, has been previously reported; however, the BAC-FISH technique developed here using restriction fragment length polymorphism (RFLP)-derived BAC/BIBAC clones as probes to apply genome-wide analysis is new for sunflower. As chromosome-specific cytogenetic markers, the selected BAC/BIBAC clones that encompass the 17 linkage groups provide a valuable tool for identifying sunflower cytogenetic stocks (such as trisomics) and tracking alien chromosomes in interspecific crosses. This work also demonstrates the potential of using a large-insert DNA library for the development of molecular cytogenetic resources. PMID:23316437
Contig Maps and Genomic Sequencing Identify Candidate Genes in the Usher 1C Locus

PubMed Central

Higgins, Michael J.; Day, Colleen D.; Smilinich, Nancy J.; Ni, L.; Cooper, Paul R.; Nowak, Norma J.; Davies, Chris; de Jong, Pieter J.; Hejtmancik, Fielding; Evans, Glen A.; Smith, Richard J.H.; Shows, Thomas B.

1998-01-01

Usher syndrome 1C (USH1C) is a congenital condition manifesting profound hearing loss, the absence of vestibular function, and eventual retinal degeneration. The USH1C locus has been mapped genetically to a 2- to 3-cM interval in 11p14–15.1 between D11S899 and D11S861. In an effort to identify the USH1C disease gene we have isolated the region between these markers in yeast artificial chromosomes (YACs) using a combination of STS content mapping and Alu–PCR hybridization. The YAC contig is ∼3.5 Mb and has located several other loci within this interval, resulting in the order CEN-LDHA-SAA1-TPH-D11S1310-(D11S1888/KCNC1)-MYOD1-D11S902-D11S921-D11S1890-TEL. Subsequent haplotyping and homozygosity analysis refined the location of the disease gene to a 400-kb interval between D11S902 and D11S1890 with all affected individuals being homozygous for the internal marker D11S921. To facilitate gene identification, the critical region has been converted into P1 artificial chromosome (PAC) clones using sequence-tagged sites (STSs) mapped to the YAC contig, Alu–PCR products generated from the YACs, and PAC end probes. A contig of >50 PAC clones has been assembled between D11S1310 and D11S1890, confirming the order of markers used in haplotyping. Three PAC clones representing nearly two-thirds of the USH1C critical region have been sequenced. PowerBLAST analysis identified six clusters of expressed sequence tags (ESTs), two known genes (BIR, SUR1) mapped previously to this region, and a previously characterized but unmapped gene NEFA (DNA binding/EF hand/acidic amino-acid-rich). GRAIL analysis identified 11 CpG islands and 73 exons of excellent quality. These data allowed the construction of a transcription map for the USH1C critical region, consisting of three known genes and six or more novel transcripts. Based on their map location, these loci represent candidate disease loci for USH1C. The NEFA gene was assessed as the USH1C locus by the sequencing of an amplified NEFA cDNA from an USH1C patient; however, no mutations were detected. [The sequence data described in this paper have been submitted to GenBank under accession numbers AC000406–AC000407.] PMID:9445488
Large inserts for big data: artificial chromosomes in the genomic era.

PubMed

Tocchetti, Arianna; Donadio, Stefano; Sosio, Margherita

2018-05-01

The exponential increase in available microbial genome sequences coupled with predictive bioinformatic tools is underscoring the genetic capacity of bacteria to produce an unexpectedly large number of specialized bioactive compounds. Since most of the biosynthetic gene clusters (BGCs) present in microbial genomes are cryptic, i.e. not expressed under laboratory conditions, a variety of cloning systems and vectors have been devised to harbor DNA fragments large enough to carry entire BGCs and to allow their transfer into suitable heterologous hosts. This minireview provides an overview of the vectors and approaches that have been developed for cloning large BGCs, and successful examples of heterologous expression.
Improved detection of endoparasite DNA in soil sample PCR by the use of anti-inhibitory substances.

PubMed

Krämer, F; Vollrath, T; Schnieder, T; Epe, C

2002-09-26

Although there have been numerous microbial examinations of soil for the presence of human pathogenic developmental parasite stages of Ancylostoma caninum and Toxocara canis, molecular techniques (e.g. DNA extraction, purification and subsequent PCR) have scarcely been applied. Here, DNA preparations of soil samples artificially contaminated with genomic DNA or parasite eggs were examined by PCR. A. caninum and T. canis-specific primers based on the ITS-2 sequence were used for amplification. After DNA preparation alone, a high content of PCR-interfering substances was still detectable. Subsequently, two different inhibitors of PCR-interfering agents (GeneReleaser, Bioventures Inc. and Maximator, Connex GmbH) were compared in PCR. Both substances increased PCR sensitivity greatly. However, comparison of the increase in sensitivity achieved with the two compounds demonstrated the superiority of Maximator, which enhanced sensitivity to the point of permitting positive detection of a single A. caninum egg and three T. canis eggs in a soil sample. This degree of sensitivity could not be achieved with GeneReleaser for either parasite. Furthermore, Maximator not only increased sensitivity; it also cost less, required less time and had a lower risk of contamination. Future applications of molecular methods in epidemiological examinations of soil samples are discussed.
Application of Quaternion in improving the quality of global sequence alignment scores for an ambiguous sequence target in Streptococcus pneumoniae DNA

NASA Astrophysics Data System (ADS)

Lestari, D.; Bustamam, A.; Novianti, T.; Ardaneswari, G.

2017-07-01

A DNA sequence can be defined as a succession of letters representing the order of nucleotides within DNA, using a permutation of the four DNA base codes: adenine (A), guanine (G), cytosine (C), and thymine (T). The precise code of a sequence is determined using DNA sequencing methods and technologies, which have been developed since the 1970s and have now become advanced, high-throughput sequencing technologies. So far, DNA sequencing has greatly accelerated biological and medical research and discovery. However, in some cases DNA sequencing produces ambiguous results, making it difficult to determine whether a given code is A, T, G, or C. To solve this problem, in this study we introduce another representation of DNA codes, namely the Quaternion Q = (PA, PT, PG, PC), where PA, PT, PG, and PC are the probabilities that the bases A, T, G, and C appear in Q, and PA + PT + PG + PC = 1. Furthermore, using Quaternion representations we are able to construct an improved scoring matrix for global sequence alignment by applying a dot product method. Moreover, this scoring matrix produces better, higher-quality match and mismatch scores between two DNA base codes. In our implementation, we applied the Needleman-Wunsch global sequence alignment algorithm in Octave to analyze our target sequence, which contains some ambiguous sequence data. The subject sequences are the DNA sequences of the Streptococcus pneumoniae family obtained from GenBank, while the target DNA sequence was received from our collaborator's database. As a result, we found that the Quaternion representations improve the quality of the sequence alignment score, and we conclude that the target DNA sequence has maximum similarity with Streptococcus pneumoniae.
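
The idea in this abstract can be illustrated with a minimal sketch, written here in Python (the authors report using Octave). Each base code is a probability vector (PA, PT, PG, PC), the dot product of two vectors gives the chance that both positions hold the same base, and that value feeds a Needleman-Wunsch score. The ambiguity codes, the match/mismatch/gap values, and the way the dot product is turned into a score are all assumptions made for illustration; the abstract does not specify them.

```python
from typing import Dict, Tuple

Vec = Tuple[float, float, float, float]  # (P_A, P_T, P_G, P_C), sums to 1

# Illustrative code table (an assumption, not taken from the paper):
# unambiguous bases are unit vectors, ambiguity codes spread probability evenly.
CODES: Dict[str, Vec] = {
    "A": (1.0, 0.0, 0.0, 0.0),
    "T": (0.0, 1.0, 0.0, 0.0),
    "G": (0.0, 0.0, 1.0, 0.0),
    "C": (0.0, 0.0, 0.0, 1.0),
    "R": (0.5, 0.0, 0.5, 0.0),        # A or G
    "Y": (0.0, 0.5, 0.0, 0.5),        # T or C
    "N": (0.25, 0.25, 0.25, 0.25),    # any base
}


def dot_score(x: str, y: str, match: float = 1.0, mismatch: float = -1.0) -> float:
    """Expected score for two codes: p is the dot product of their vectors."""
    p = sum(a * b for a, b in zip(CODES[x], CODES[y]))
    return p * match + (1.0 - p) * mismatch


def needleman_wunsch_score(s: str, t: str, gap: float = -1.0) -> float:
    """Global alignment score with a linear gap penalty (score only, no traceback)."""
    prev = [j * gap for j in range(len(t) + 1)]
    for i in range(1, len(s) + 1):
        cur = [i * gap]
        for j in range(1, len(t) + 1):
            cur.append(max(
                prev[j - 1] + dot_score(s[i - 1], t[j - 1]),  # align two codes
                prev[j] + gap,                                # gap in t
                cur[j - 1] + gap,                             # gap in s
            ))
        prev = cur
    return prev[-1]
```

With these assumed values, an ambiguous `N` aligned to `T` costs 0.5 instead of a full mismatch, so `needleman_wunsch_score("GATNACA", "GATTACA")` sits between a perfect match and a hard mismatch.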
High-throughput multiplex cpDNA resequencing clarifies the genetic diversity and genetic relationships among Brassica napus, Brassica rapa and Brassica oleracea.

PubMed

Qiao, Jiangwei; Cai, Mengxian; Yan, Guixin; Wang, Nian; Li, Feng; Chen, Binyun; Gao, Guizhen; Xu, Kun; Li, Jun; Wu, Xiaoming

2016-01-01

Brassica napus (rapeseed) is a recent allotetraploid plant and the second most important oilseed crop worldwide. The origin of B. napus and the genetic relationships with its diploid ancestor species remain largely unresolved. Here, chloroplast DNA (cpDNA) from 488 B. napus accessions of global origin, 139 B. rapa accessions and 49 B. oleracea accessions were populationally resequenced using Illumina Solexa sequencing technologies. The intraspecific cpDNA variants and their allelic frequencies were called genomewide and further validated via EcoTILLING analyses of the rpo region. The cpDNA of the current global B. napus population comprises more than 400 variants (SNPs and short InDels) and maintains one predominant haplotype (Bncp1). Whole-genome resequencing of the cpDNA of Bncp1 haplotype eliminated its direct inheritance from any accession of the B. rapa or B. oleracea species. The distribution of the polymorphism information content (PIC) values for each variant demonstrated that B. napus has much lower cpDNA diversity than B. rapa; however, a vast majority of the wild and cultivated B. oleracea specimens appeared to share one same distinct cpDNA haplotype, in contrast to its wild C-genome relatives. This finding suggests that the cpDNA of the three Brassica species is well differentiated. The predominant B. napus cpDNA haplotype may have originated from uninvestigated relatives or from interactions between cpDNA mutations and natural/artificial selection during speciation and evolution. These exhaustive data on variation in cpDNA would provide fundamental data for research on cpDNA and chloroplasts. © 2015 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.
Large-Scale Concatenation cDNA Sequencing

PubMed Central

Yu, Wei; Andersson, Björn; Worley, Kim C.; Muzny, Donna M.; Ding, Yan; Liu, Wen; Ricafrente, Jennifer Y.; Wentland, Meredith A.; Lennon, Greg; Gibbs, Richard A.

1997-01-01

A total of 100 kb of DNA derived from 69 individual human brain cDNA clones of 0.7–2.0 kb were sequenced by concatenated cDNA sequencing (CCS), whereby multiple individual DNA fragments are sequenced simultaneously in a single shotgun library. The method yielded accurate sequences and a similar efficiency compared with other shotgun libraries constructed from single DNA fragments (>20 kb). Computer analyses were carried out on 65 cDNA clone sequences and their corresponding end sequences to examine both nucleic acid and amino acid sequence similarities in the databases. Thirty-seven clones revealed no DNA database matches, 12 clones generated exact matches (≥98% identity), and 16 clones generated nonexact matches (57%–97% identity) to either known human or other species genes. Of those 28 matched clones, 8 had corresponding end sequences that failed to identify similarities. In a protein similarity search, 27 clone sequences displayed significant matches, whereas only 20 of the end sequences had matches to known protein sequences. Our data indicate that full-length cDNA insert sequences provide significantly more nucleic acid and protein sequence similarity matches than expressed sequence tags (ESTs) for database searching. [All 65 cDNA clone sequences described in this paper have been submitted to the GenBank data library under accession nos. U79240–U79304.] PMID:9110174

Synthesis of DNA

DOEpatents

Mariella, Jr., Raymond P.

2008-11-18

A method of synthesizing a desired double-stranded DNA of a predetermined length and of a predetermined sequence. Preselected sequence segments that will complete the desired double-stranded DNA are determined. Preselected segment sequences of DNA that will be used to complete the desired double-stranded DNA are provided. The preselected segment sequences of DNA are assembled to produce the desired double-stranded DNA.
Acidovorax valerianellae sp. nov., a novel pathogen of lamb's lettuce [Valerianella locusta (L.) Laterr].

PubMed

Gardan, Louis; Stead, David E; Dauga, Catherine; Gillis, Moniek

2003-05-01

Bacterial spot disease of lamb's lettuce [Valerianella locusta (L.) Laterr.] was first observed in fields in 1991. This new bacterial disease is localized in western France in high-technology field production of lamb's lettuce for the preparation of ready-to-use salad. Nineteen strains isolated in 1992 and 1993 from typical black leaf spots of naturally infected lamb's lettuce were characterized and compared with reference strains of Acidovorax and Delftia. The pathogenicity of the 19 strains was confirmed by artificial inoculation. Biochemical and physiological tests, fatty acid profiles, DNA-DNA hybridization and other nucleic acid-based tests were performed. A numerical taxonomic analysis of the 19 lamb's lettuce strains showed a single homogeneous phenon closely related to previously described phytopathogenic taxa of the genus Acidovorax. DNA-DNA hybridization studies showed that the lamb's lettuce strains were 91-100% related to a representative strain, strain CFBP 4730(T), and constituted a discrete DNA hybridization group, indicating that they belong to the same novel species. Results from DNA-rRNA hybridization, 16S rRNA sequence analysis and fatty acid analysis studies confirmed that this novel species belongs to the beta-subclass of the Proteobacteria and, more specifically, to the family Comamonadaceae and the genus Acidovorax. The name Acidovorax valerianellae sp. nov. is proposed for this novel taxon of phytopathogenic bacteria. The type strain is strain CFBP 4730(T) (= NCPPB 4283(T)).
Development of SSR markers from Citrus clementina (Rutaceae) BAC end sequences and interspecific transferability in Citrus.

PubMed

Ollitrault, Frédérique; Terol, Javier; Pina, Jose Antonio; Navarro, Luis; Talon, Manuel; Ollitrault, Patrick

2010-11-01

Microsatellite primers were developed from bacterial artificial chromosome (BAC) end sequences of Citrus clementina and their transferability and polymorphism tested in the genus Citrus for future anchorage of physical and genetic maps and comparative interspecific genetic mapping. Using PAGE and DNA silver staining, 79 primer pairs were selected for their transferability and polymorphism among 526 microsatellites mined in BES. A preliminary diversity study in Citrus was conducted with 18 of them, in C. reticulata, C. maxima, C. medica, C. sinensis, C. aurantium, C. paradisi, C. lemon, C. aurantifolia, and some papedas (wild citrus), using a capillary electrophoresis fragment analyzer. Intra- and interspecific polymorphism was observed, and heterozygous markers were identified for the different genotypes to be used for genetic mapping. These results indicate the utility of the developed primers for comparative mapping studies and the integration of physical and genetic maps.
Nanopore Technology: A Simple, Inexpensive, Futuristic Technology for DNA Sequencing.

PubMed

Gupta, P D

2016-10-01

In health care, the importance of DNA sequencing has been fully established. Sanger's capillary electrophoresis DNA sequencing methodology is time-consuming and cumbersome, and hence more expensive. Lately, because of its versatility, DNA sequencing has become a household name, and there is therefore an urgent need for a simple, fast, inexpensive DNA sequencing technology. At the beginning of this century efforts were made, and nanopore DNA sequencing technology was developed; it is still in its infancy, but it is nevertheless the futuristic technology.
The genome-wide DNA sequence specificity of the anti-tumour drug bleomycin in human cells.

PubMed

Murray, Vincent; Chen, Jon K; Tanaka, Mark M

2016-07-01

The cancer chemotherapeutic agent, bleomycin, cleaves DNA at specific sites. For the first time, the genome-wide DNA sequence specificity of bleomycin breakage was determined in human cells. Utilising Illumina next-generation DNA sequencing techniques, over 200 million bleomycin cleavage sites were examined to elucidate the bleomycin genome-wide DNA selectivity. The genome-wide bleomycin cleavage data were analysed by four different methods to determine the cellular DNA sequence specificity of bleomycin strand breakage. For the most highly cleaved DNA sequences, the preferred site of bleomycin breakage was at 5'-GT* dinucleotide sequences (where the asterisk indicates the bleomycin cleavage site), with lesser cleavage at 5'-GC* dinucleotides. This investigation also determined longer bleomycin cleavage sequences, with preferred cleavage at 5'-GT*A and 5'-TGT* trinucleotide sequences, and 5'-TGT*A tetranucleotides. For cellular DNA, the hexanucleotide DNA sequence 5'-RTGT*AY (where R is a purine and Y is a pyrimidine) was the most highly cleaved DNA sequence. It was striking that alternating purine-pyrimidine sequences were highly cleaved by bleomycin. The highest intensity cleavage sites in cellular and purified DNA were very similar although there were some minor differences. Statistical nucleotide frequency analysis indicated a G nucleotide was present at the -3 position (relative to the cleavage site) in cellular DNA but was absent in purified DNA.
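
As an illustration of the hexanucleotide consensus above, the following sketch scans one strand of a sequence for 5'-RTGT*AY and reports the position of the starred T. It is only a motif search under the stated consensus, not the authors' analysis pipeline; the function name is hypothetical and the reverse strand is not examined.

```python
import re
from typing import List

# R = purine (A or G), Y = pyrimidine (C or T).
# The lookahead lets overlapping occurrences be reported as well.
RTGTAY = re.compile(r"(?=([AG]TGTA[CT]))")


def starred_t_positions(seq: str) -> List[int]:
    """Return 0-based indices of the T marked T* in each 5'-RTGT*AY match.

    Only the given strand is scanned (hypothetical helper for illustration).
    """
    seq = seq.upper()
    # Within the hexamer R-T-G-T*-A-Y the starred T is at offset 3.
    return [m.start() + 3 for m in RTGTAY.finditer(seq)]
```

For example, `starred_t_positions("CCATGTACGG")` locates the single `ATGTAC` hexamer and returns the index of its second T.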
Identification of single nucleotide polymorphism in protein phosphatase 1 regulatory subunit 11 gene in Murrah bulls

PubMed Central

Jain, Varsha; Patel, Brijesh; Umar, Farhat Paul; Ajithakumar, H. M.; Gurjar, Suraj K.; Gupta, I. D.; Verma, Archana

2017-01-01

Aim: This study was conducted with the objective of identifying single nucleotide polymorphisms (SNPs) in the protein phosphatase 1 regulatory subunit 11 (PPP1R11) gene in Murrah bulls. Materials and Methods: Genomic DNA was isolated by the phenol–chloroform extraction method from frozen semen samples of 65 Murrah bulls maintained at the Artificial Breeding Research Centre, ICAR-National Dairy Research Institute, Karnal. The quality and concentration of DNA were checked by spectrophotometer reading and agarose gel electrophoresis. The target region of the PPP1R11 gene was amplified using four sets of primers designed based on the Bos taurus reference sequence. The amplified products were sequenced and aligned using Clustal Omega for identification of SNPs. Animals were genotyped by polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) using the EcoNI restriction enzyme. Results: The sequences in the NCBI accession number NW_005785016.1 for Bubalus bubalis were compared and aligned with the edited sequences of Murrah bulls with Clustal Omega software. A total of 10 SNPs were found, of which 1 was in the 5’UTR, 3 in intron 1, and 6 in intron 2. PCR-RFLP using the restriction enzyme EcoNI revealed only the AA genotype, indicating monomorphism in the PPP1R11 gene of all Murrah animals included in the study. Conclusion: A total of 10 SNPs were found. PCR-RFLP revealed only the AA genotype, indicating monomorphism in the PPP1R11 gene of all Murrah animals included in the study, due to which association analysis with conception rate was not feasible. PMID:28344410
Molecular characterization of variant alpha-subunit of electron transfer flavoprotein in three patients with glutaric acidemia type II--and identification of glycine substitution for valine-157 in the sequence of the precursor, producing an unstable mature protein in a patient.

PubMed Central

Indo, Y; Glassberg, R; Yokota, I; Tanaka, K

1991-01-01

In our previous study of eight glutaric acidemia type II (GAII) fibroblast lines by using [35S]methionine labeling and immunoprecipitation, three of them had a defect in the synthesis of the alpha-subunit of electron transfer flavoprotein (alpha-ETF) (Ikeda et al. 1986). In one of them (YH1313) the labeling of the mature alpha-ETF was barely detectable, while that of the precursor (p) was stronger. In another (YH605) no synthesis of immunoreactive p alpha-ETF was detectable. In the third cell line (YH1391) the rate of variant p alpha-ETF synthesis was comparable to normal, but its electrophoretic mobility was slightly faster than normal. In the present study, the northern blot analysis revealed that all three mutant cell lines contained p alpha-ETF mRNA and that their size and amount were comparable to normal. In immunoblot analysis, both alpha- and beta-ETF bands were barely detectable in YH1313 and YH605 but were detectable in YH1391 in amounts comparable to normal. Sequencing of YH1313 p alpha-ETF cDNA via PCR identified a transversion of T-470 to G. We then devised a simple PCR method for the 119-bp section (T-443/G-561) for detecting this mutation. In the upstream primer, A-466 was artificially replaced with C, to introduce a BstNI site into the amplified copies in the presence of G-470 from the variant sequence. The genomic DNA analysis using this method demonstrated that YH1313 was homozygous for the T→G-470 transversion. It was not detected either in two other alpha-ETF-deficient GAII or in seven control cell lines. The alpha-ETF cDNA sequence in YH605 was identical to normal. PMID:1882842
Sequence and Structure Dependent DNA-DNA Interactions

NASA Astrophysics Data System (ADS)

Kopchick, Benjamin; Qiu, Xiangyun

Molecular forces between dsDNA strands are largely dominated by electrostatics and have been extensively studied. Quantitative knowledge has been accumulated on how DNA-DNA interactions are modulated by varied biological constituents such as ions, cationic ligands, and proteins. Despite its central role in biology, the sequence of DNA has not received substantial attention, and "random" DNA sequences are typically used in biophysical studies. However, ~50% of the human genome is composed of non-random-sequence DNAs, particularly repetitive sequences. Furthermore, covalent modifications of DNA such as methylation play key roles in gene functions. Such DNAs with specific sequences or modifications often take on structures other than the canonical B-form. Here we present a series of quantitative measurements of the DNA-DNA forces with the osmotic stress method on different DNA sequences, from short repeats to the most frequent sequences in the genome, and to modifications such as bromination and methylation. We observe peculiar behaviors that appear to be strongly correlated with the incurred structural changes. We speculate on the causes in terms of the differences in hydration shell and DNA surface structures.
Run-length encoding graphic rules, biochemically editable designs and steganographical numeric data embedment for DNA-based cryptographical coding system

PubMed Central

Kawano, Tomonori

2013-01-01

There have been a wide variety of approaches for handling pieces of DNA as “unplugged” tools for digital information storage and processing, including a series of studies applied to the security-related area, such as DNA-based digital barcodes, watermarks and cryptography. In the present article, novel designs of artificial genes as the media for storing digitally compressed image data are proposed for bio-computing purposes, whereas natural genes principally encode proteins. Furthermore, the proposed system allows cryptographical application of DNA through biochemically editable designs with capacity for steganographical numeric data embedment. As a model case of image-coding DNA technique application, numerically and biochemically combined protocols are employed for ciphering the given “passwords” and/or secret numbers using DNA sequences. The “passwords” of interest were decomposed into single letters and translated into font images coded on separate DNA chains with both the coding regions, in which the images are encoded based on the novel run-length encoding rule, and the non-coding regions designed for biochemical editing and the remodeling processes revealing the hidden orientation of letters composing the original “passwords.” The latter processes require molecular biological tools for digestion and ligation of the fragmented DNA molecules, targeting the polymerase chain reaction-engineered termini of the chains. Lastly, additional protocols for steganographical overwriting of the numeric data of interest over the image-coding DNA are also discussed. PMID:23750303
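
To make the notion of run-length encoding an image into DNA concrete, here is a generic sketch. It is not the encoding rule proposed in the article, which the abstract does not describe. The scheme below is a hypothetical one chosen for simplicity: each run of identical pixels in a bitmap row becomes three bases, one for the pixel value (A for 0, T for 1) and two base-4 digits (A=0, C=1, G=2, T=3) for the run length minus one.

```python
from itertools import groupby
from typing import List

DIGITS = "ACGT"   # hypothetical base-4 digit alphabet: A=0, C=1, G=2, T=3
MAX_RUN = 16      # two base-4 digits hold (run length - 1) in the range 0..15


def encode_row(bits: str) -> str:
    """Run-length encode a row of '0'/'1' pixels as DNA, three bases per run."""
    out: List[str] = []
    for pixel, group in groupby(bits):
        n = len(list(group))
        while n > 0:                      # split runs longer than MAX_RUN
            run = min(n, MAX_RUN)
            colour = "A" if pixel == "0" else "T"
            out.append(colour + DIGITS[(run - 1) // 4] + DIGITS[(run - 1) % 4])
            n -= run
    return "".join(out)


def decode_row(dna: str) -> str:
    """Invert encode_row: expand each three-base codon back into pixels."""
    bits: List[str] = []
    for i in range(0, len(dna), 3):
        pixel = "0" if dna[i] == "A" else "1"
        run = DIGITS.index(dna[i + 1]) * 4 + DIGITS.index(dna[i + 2]) + 1
        bits.append(pixel * run)
    return "".join(bits)
```

Under this assumed scheme the row `0001111100` has three runs and therefore encodes to nine bases, and `decode_row(encode_row(row))` returns the original row.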
A High-Throughput Process for the Solid-Phase Purification of Synthetic DNA Sequences

PubMed Central

Grajkowski, Andrzej; Cieślak, Jacek; Beaucage, Serge L.

2017-01-01

An efficient process for the purification of synthetic phosphorothioate and native DNA sequences is presented. The process is based on the use of an aminopropylated silica gel support functionalized with aminooxyalkyl functions to enable capture of DNA sequences through an oximation reaction with the keto function of a linker conjugated to the 5′-terminus of DNA sequences. Deoxyribonucleoside phosphoramidites carrying this linker, as a 5′-hydroxyl protecting group, have been synthesized for incorporation into DNA sequences during the last coupling step of a standard solid-phase synthesis protocol executed on a controlled pore glass (CPG) support. Solid-phase capture of the nucleobase- and phosphate-deprotected DNA sequences released from the CPG support is demonstrated to proceed near quantitatively. Shorter than full-length DNA sequences are first washed away from the capture support; the solid-phase purified DNA sequences are then released from this support upon reaction with tetra-n-butylammonium fluoride in dry dimethylsulfoxide (DMSO) and precipitated in tetrahydrofuran (THF). The purity of solid-phase-purified DNA sequences exceeds 98%. The simulated high-throughput and scalability features of the solid-phase purification process are demonstrated without sacrificing purity of the DNA sequences. PMID:28628204
Genomic resources for gene discovery, functional genome annotation, and evolutionary studies of maize and its close relatives.

PubMed

Wang, Chao; Shi, Xue; Liu, Lin; Li, Haiyan; Ammiraju, Jetty S S; Kudrna, David A; Xiong, Wentao; Wang, Hao; Dai, Zhaozhao; Zheng, Yonglian; Lai, Jinsheng; Jin, Weiwei; Messing, Joachim; Bennetzen, Jeffrey L; Wing, Rod A; Luo, Meizhong

2013-11-01

Maize is one of the most important food crops and a key model for genetics and developmental biology. A genetically anchored and high-quality draft genome sequence of maize inbred B73 has been obtained to serve as a reference sequence. To facilitate evolutionary studies in maize and its close relatives, much like the Oryza Map Alignment Project (OMAP) (www.OMAP.org) bacterial artificial chromosome (BAC) resource did for the rice community, we constructed BAC libraries for maize inbred lines Zheng58, Chang7-2, and Mo17 and maize wild relatives Zea mays ssp. parviglumis and Tripsacum dactyloides. Furthermore, to extend functional genomic studies to maize and sorghum, we also constructed binary BAC (BIBAC) libraries for the maize inbred B73 and the sorghum landrace Nengsi-1. The BAC/BIBAC vectors facilitate transfer of large intact DNA inserts from BAC clones to the BIBAC vector and functional complementation of large DNA fragments. These seven Zea Map Alignment Project (ZMAP) BAC/BIBAC libraries have average insert sizes ranging from 92 to 148 kb, organellar DNA from 0.17 to 2.3%, empty vector rates between 0.35 and 5.56%, and genome equivalents of 4.7- to 8.4-fold. The usefulness of the Parviglumis and Tripsacum BAC libraries was demonstrated by mapping clones to the reference genome. Novel genes and alleles present in these ZMAP libraries can now be used for functional complementation studies and positional or homology-based cloning of genes for translational genomics.
Automatic polymerase chain reaction product detection system for food safety monitoring using zinc finger protein fused to luciferase.

PubMed

Yoshida, Wataru; Kezuka, Aki; Murakami, Yoshiyuki; Lee, Jinhee; Abe, Koichi; Motoki, Hiroaki; Matsuo, Takafumi; Shimura, Nobuaki; Noda, Mamoru; Igimi, Shizunobu; Ikebukuro, Kazunori

2013-11-01

An automatic polymerase chain reaction (PCR) product detection system for food safety monitoring using zinc finger (ZF) protein fused to luciferase was developed. ZF protein fused to luciferase specifically binds to a target double-stranded DNA sequence and has luciferase enzymatic activity. Therefore, PCR products that comprise a ZF protein recognition sequence can be detected by measuring the luciferase activity of the fusion protein. We previously reported that PCR products from Legionella pneumophila and Escherichia coli (E. coli) O157 genomic DNA were detected by Zif268, a natural ZF protein, fused to luciferase. In this study, Zif268-luciferase was applied to detect the presence of Salmonella and coliforms. Moreover, an artificial zinc finger protein (B2) fused to luciferase was constructed for a Norovirus detection system. In the luciferase activity detection assay, several bound/free separation processes are required. Therefore, an analyzer that automatically performs the bound/free separation process was developed to detect PCR products using the ZF-luciferase fusion protein. By means of the automatic analyzer with ZF-luciferase fusion protein, target pathogenic genomes were specifically detected in the presence of other pathogenic genomes. Moreover, we succeeded in the detection of 10 copies of E. coli BL21 without extraction of genomic DNA by the automatic analyzer, and E. coli was detected with a logarithmic dependency in the range of 1.0×10 to 1.0×10^6 copies. Copyright © 2013 Elsevier B.V. All rights reserved.
An improved model for whole genome phylogenetic analysis by Fourier transform.

PubMed

Yin, Changchuan; Yau, Stephen S-T

2015-10-07

DNA sequence similarity comparison is one of the major steps in computational phylogenetic studies. The sequence comparison of closely related DNA sequences and genomes is usually performed by multiple sequence alignments (MSA). While the MSA method is accurate for some types of sequences, it may produce incorrect results when DNA sequences have undergone rearrangements, as in many bacterial and viral genomes. It is also limited by its computational complexity for comparing large volumes of data. Previously, we proposed an alignment-free method that exploits the full information contents of DNA sequences by Discrete Fourier Transform (DFT), but still with some limitations. Here, we present a significantly improved method for the similarity comparison of DNA sequences by DFT. In this method, we map DNA sequences into 2-dimensional (2D) numerical sequences and then apply DFT to transform the 2D numerical sequences into the frequency domain. In the 2D mapping, the nucleotide composition of a DNA sequence is a determinant factor and the 2D mapping reduces the nucleotide composition bias in the distance measure, thus improving the similarity measure of DNA sequences. To compare the DFT power spectra of DNA sequences with different lengths, we propose an improved even scaling algorithm to extend shorter DFT power spectra to the longest length of the underlying sequences. After the DFT power spectra are evenly scaled, the spectra have the same dimensionality in the Fourier frequency space, and the Euclidean distances between the full Fourier power spectra of the DNA sequences are used as the dissimilarity metric. The improved DFT method, with increased computational performance by 2D numerical representation, is applicable to any DNA sequences of different length ranges. We assess the accuracy of the improved DFT similarity measure in hierarchical clustering of different DNA sequences including simulated and real datasets. The method yields accurate and reliable phylogenetic trees and demonstrates that the improved DFT dissimilarity measure is an efficient and effective similarity measure of DNA sequences. Due to its high efficiency and accuracy, the proposed DFT similarity measure is successfully applied to phylogenetic analysis of individual genes and large whole bacterial genomes. Copyright © 2015 Elsevier Ltd. All rights reserved.
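The pipeline described in this abstract (2D numerical mapping, DFT power spectrum, scaling to a common length, Euclidean distance) can be illustrated with a minimal Python sketch. The mapping table `MAPPING` and the linear-interpolation `stretch` function are assumptions chosen for illustration; they stand in for, and are not, the authors' actual 2D mapping and even scaling algorithm, which the abstract does not specify. All function names are hypothetical, and sequences are assumed to be non-empty and to contain only A, C, G and T.

```python
import cmath
import math

# Hypothetical 2D mapping (an assumption, not the paper's table):
# each nucleotide is a point in the plane, encoded as a complex number.
MAPPING = {"A": complex(0, -1), "T": complex(0, 1),
           "C": complex(-1, 0), "G": complex(1, 0)}

def power_spectrum(seq):
    """Return the DFT power spectrum |X[k]|^2 of the mapped sequence."""
    x = [MAPPING[b] for b in seq.upper()]
    n = len(x)
    spectrum = []
    for k in range(n):
        xk = sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        spectrum.append(abs(xk) ** 2)
    return spectrum

def stretch(spectrum, m):
    """Linearly interpolate a spectrum of length n onto m points (m >= n).

    Stand-in for the paper's even scaling step (assumption).
    """
    n = len(spectrum)
    if n == m:
        return list(spectrum)
    out = []
    for k in range(m):
        q = k * (n - 1) / (m - 1)      # fractional index into the short spectrum
        lo = int(math.floor(q))
        hi = min(lo + 1, n - 1)
        out.append(spectrum[lo] + (q - lo) * (spectrum[hi] - spectrum[lo]))
    return out

def dft_distance(a, b):
    """Euclidean distance between length-matched power spectra of a and b."""
    pa, pb = power_spectrum(a), power_spectrum(b)
    m = max(len(pa), len(pb))
    pa, pb = stretch(pa, m), stretch(pb, m)
    return math.sqrt(sum((u - v) ** 2 for u, v in zip(pa, pb)))
```

A pairwise matrix of `dft_distance` values could then be passed to any hierarchical clustering routine. The direct O(n²) DFT used here is only practical for short sequences; an FFT would be needed at genome scale.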
Technique of laser chromosome welding for chromosome repair and artificial chromosome creation.

PubMed

Huang, Yao-Xiong; Li, Lin; Yang, Liu; Zhang, Yi

2018-04-01

Here we report a technique of laser chromosome welding that uses a violet pulse laser micro-beam for welding. The technique can integrate any size of a desired chromosome fragment into recipient chromosomes by combining with other techniques of laser chromosome manipulation such as chromosome cutting, moving, and stretching. We demonstrated that our method could perform chromosomal modifications with high precision, speed and ease of use in the absence of restriction enzymes, DNA ligases and DNA polymerases. Unlike the conventional methods such as de novo artificial chromosome synthesis, our method has no limitation on the size of the inserted chromosome fragment. The inserted DNA size can be precisely defined and the processed chromosome can retain its intrinsic structure and integrity. Therefore, our technique provides a high quality alternative approach to directed genetic recombination, and can be used for chromosomal repair, removal of defects and artificial chromosome creation. The technique may also have applicability on the manipulation and extension of large pieces of synthetic DNA.
Ribosomal RNA Genes Contribute to the Formation of Pseudogenes and Junk DNA in the Human Genome.

PubMed

Robicheau, Brent M; Susko, Edward; Harrigan, Amye M; Snyder, Marlene

2017-02-01

Approximately 35% of the human genome can be identified as sequence devoid of a selected-effect function, and not derived from transposable elements or repeated sequences. We provide evidence supporting a known origin for a fraction of this sequence. We show that: 1) highly degraded, but near full length, ribosomal DNA (rDNA) units, including both 45S and Intergenic Spacer (IGS), can be found at multiple sites in the human genome on chromosomes without rDNA arrays, 2) that these rDNA sequences have a propensity for being centromere proximal, and 3) that sequence at all human functional rDNA array ends is divergent from canonical rDNA to the point that it is pseudogenic. We also show that small sequence strings of rDNA (from 45S + IGS) can be found distributed throughout the genome and are identifiable as an "rDNA-like signal", representing 0.26% of the q-arm of HSA21 and ∼2% of the total sequence of other regions tested. The size of sequence strings found in the rDNA-like signal intergrade into the size of sequence strings that make up the full-length degrading rDNA units found scattered throughout the genome. We conclude that the displaced and degrading rDNA sequences are likely of a similar origin but represent different stages in their evolution towards random sequence. Collectively, our data suggests that over vast evolutionary time, rDNA arrays contribute to the production of junk DNA. The concept that the production of rDNA pseudogenes is a by-product of concerted evolution represents a previously under-appreciated process; we demonstrate here its importance. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Quantitative analysis of DNA methylation in the promoter region of the methylguanine-O(6) -DNA-methyltransferase gene by COBRA and subsequent native capillary gel electrophoresis.

PubMed

Goedecke, Simon; Mühlisch, Jörg; Hempel, Georg; Frühwald, Michael C; Wünsch, Bernhard

2015-12-01

Along with histone modifications, RNA interference and delayed replication timing, DNA methylation belongs to the key processes in epigenetic regulation of gene expression. Therefore, reliable information about the methylation level of particular DNA fragments is of major interest. Herein the methylation level at two positions of the promoter region of the gene methylguanine-O(6)-DNA-methyltransferase (MGMT) was investigated. Previously, it was demonstrated that the epigenetic status of this DNA region correlates with response to alkylating anticancer agents. An automated CGE method with LIF detection was established to separate the six DNA fragments resulting from combined bisulfite restriction analysis (COBRA) of the methylated and non-methylated MGMT promoter. In COBRA, the DNA was treated with bisulfite, converting cytosine into uracil. During PCR, uracil pairs with adenine, which changes the original recognition site of the restriction enzyme TaqI. Artificial probes generated by mixing appropriate amounts of DNA after bisulfite treatment and PCR amplification were used for validation of the method. The methylation levels of these samples could be determined with high accuracy and precision. DNA samples prepared by mixing the corresponding clones first and then performing PCR amplification led to non-linear correlation between the corrected peak areas and the methylation levels. This effect is explained by slightly different PCR amplification of DNA with different sequences present in the mixture. The superiority of CGE over PAGE was clearly demonstrated. Finally, the established method was used to analyze the methylation levels of human brain tumor tissue samples. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Representation of DNA sequences in genetic codon context with applications in exon and intron prediction.

PubMed

Yin, Changchuan

2015-04-01

To apply digital signal processing (DSP) methods to analyze DNA sequences, the sequences first must be specially mapped into numerical sequences. Thus, effective numerical mappings of DNA sequences play key roles in the effectiveness of DSP-based methods such as exon prediction. Despite numerous mappings of symbolic DNA sequences to numerical series, the existing mapping methods do not include the genetic coding features of DNA sequences. We present a novel numerical representation of DNA sequences using genetic codon context (GCC) in which the numerical values are optimized by simulation annealing to maximize the 3-periodicity signal to noise ratio (SNR). The optimized GCC representation is then applied in exon and intron prediction by Short-Time Fourier Transform (STFT) approach. The results show the GCC method enhances the SNR values of exon sequences and thus increases the accuracy of predicting protein coding regions in genomes compared with the commonly used 4D binary representation. In addition, this study offers a novel way to reveal specific features of DNA sequences by optimizing numerical mappings of symbolic DNA sequences.
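The 3-periodicity signal-to-noise ratio mentioned in this abstract can be sketched for the baseline it is compared against, the 4D binary (indicator) representation. The Python sketch below assumes one common definition, the total power at DFT index N/3 divided by the mean power over the non-zero frequencies; the abstract does not state the exact normalization, and the optimized GCC values themselves are not reproduced here. Function names are hypothetical.

```python
import cmath

def indicator_power(seq, k):
    """Total power at DFT index k, summed over the four binary indicator sequences."""
    n = len(seq)
    total = 0.0
    for base in "ACGT":
        # The indicator sequence is 1 where seq[t] == base and 0 elsewhere,
        # so only matching positions contribute to the DFT sum.
        xk = sum(cmath.exp(-2j * cmath.pi * k * t / n)
                 for t, s in enumerate(seq) if s == base)
        total += abs(xk) ** 2
    return total

def periodicity_snr(seq):
    """SNR of the period-3 peak: power at N/3 over mean power at k = 1..N-1 (assumed definition)."""
    seq = seq.upper()
    n = len(seq) - len(seq) % 3          # trim so that n/3 is an integer index
    seq = seq[:n]
    if n == 0:
        raise ValueError("sequence must contain at least 3 bases")
    peak = indicator_power(seq, n // 3)
    mean_power = sum(indicator_power(seq, k) for k in range(1, n)) / (n - 1)
    if mean_power < 1e-12:               # e.g. a homopolymer has no non-DC power
        return 0.0
    return peak / mean_power
```

In a Short-Time Fourier Transform setting, `periodicity_snr` would be evaluated on a sliding window along the genome, with windows of high SNR flagged as candidate coding regions.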
Single-cell genomic sequencing using Multiple Displacement Amplification.

PubMed

Lasken, Roger S

2007-10-01

Single microbial cells can now be sequenced using DNA amplified by the Multiple Displacement Amplification (MDA) reaction. The few femtograms of DNA in a bacterium are amplified into micrograms of high molecular weight DNA suitable for DNA library construction and Sanger sequencing. The MDA-generated DNA also performs well when used directly as template for pyrosequencing by the 454 Life Sciences method. While MDA from single cells loses some of the genomic sequence, this approach will greatly accelerate the pace of sequencing from uncultured microbes. The genetically linked sequences from single cells are also a powerful tool to be used in guiding genomic assembly of shotgun sequences of multiple organisms from environmental DNA extracts (metagenomic sequences).
[Research progress in human artificial chromosomes (HACs) and the potentials in application].

PubMed

Zuo, Guo-Wei; Lü, Feng-Lin

2005-11-01

Since the first report of the establishment of a human artificial chromosome (HAC) was published in 1997, several types of HAC have been created by different strategies. Compared to other artificial chromosomes, such as the yeast artificial chromosome (YAC) and bacterial artificial chromosome (BAC), a HAC exists in a cell independently; in other words, a HAC does not integrate into the cellular genome, and it can undergo normal mitosis and meiosis from generation to generation in vitro and in vivo. Recent results have shown that a HAC, as a DNA carrier, is able to host a large fragment of DNA or a mini-chromosome; thus it could be a very important tool in the study of human gene expression and regulation, human chromosome function and minimum functional elements, and animal models for human diseases. In the near future, HACs could also be used in gene therapy for human genetic diseases.
Virtual Genome Walking across the 32 Gb Ambystoma mexicanum genome; assembling gene models and intronic sequence.

PubMed

Evans, Teri; Johnson, Andrew D; Loose, Matthew

2018-01-12

Large repeat rich genomes present challenges for assembly using short read technologies. The 32 Gb axolotl genome is estimated to contain ~19 Gb of repetitive DNA making an assembly from short reads alone effectively impossible. Indeed, this model species has been sequenced to 20× coverage but the reads could not be conventionally assembled. Using an alternative strategy, we have assembled subsets of these reads into scaffolds describing over 19,000 gene models. We call this method Virtual Genome Walking as it locally assembles whole genome reads based on a reference transcriptome, identifying exons and iteratively extending them into surrounding genomic sequence. These assemblies are then linked and refined to generate gene models including upstream and downstream genomic, and intronic, sequence. Our assemblies are validated by comparison with previously published axolotl bacterial artificial chromosome (BAC) sequences. Our analyses of axolotl intron length, intron-exon structure, repeat content and synteny provide novel insights into the genic structure of this model species. This resource will enable new experimental approaches in axolotl, such as ChIP-Seq and CRISPR and aid in future whole genome sequencing efforts. The assembled sequences and annotations presented here are freely available for download from https://tinyurl.com/y8gydc6n . The software pipeline is available from https://github.com/LooseLab/iterassemble .

The contribution of the DNA microarray technology to gene expression profiling in Leishmania spp.: a retrospective.

PubMed

Alonso, Ana; Larraga, Vicente; Alcolea, Pedro J

2018-05-07

The first genome project of any living organism excluding viruses, that of the gammaproteobacterium Haemophilus influenzae, was completed in 1995. Until the last decade, genome sequencing was very tedious because genome survey sequences (GSS) and/or expressed sequence tags (ESTs) belonging to plasmid, cosmid and artificial chromosome genome libraries had to be sequenced and assembled in silico. Even now, no genome is truly completely assembled, because gaps and unassembled contigs always remain. However, most assemblies represent the whole genome of the organism of origin from a practical point of view. The first genome sequencing projects of trypanosomatid parasites were completed in 2005 following those strategies, and covered Leishmania major, Trypanosoma cruzi and T. brucei. The functional genomics era rapidly developed on the basis of microarray technology and has been evolving. In the case of the genus Leishmania, substantial biological information about differentiation in the digenetic life cycle of the parasite has been obtained. Later on, next generation sequencing revolutionized genome sequencing and functional genomics, leading to more sensitive, accurate results using far fewer resources. This new technology is more advantageous, but does not invalidate microarray results. In fact, promising vaccine candidates and drug targets have been found on the basis of microarray-based screening and preliminary proof-of-concept tests. Copyright © 2018. Published by Elsevier B.V.
Acquisition of New DNA Sequences After Infection of Chicken Cells with Avian Myeloblastosis Virus

PubMed Central

Shoyab, M.; Baluda, M. A.; Evans, R.

1974-01-01

DNA-RNA hybridization studies between 70S RNA from avian myeloblastosis virus (AMV) and an excess of DNA from (i) AMV-induced leukemic chicken myeloblasts or (ii) a mixture of normal and of congenitally infected K-137 chicken embryos producing avian leukosis viruses revealed the presence of fast- and slow-hybridizing virus-specific DNA sequences. However, the leukemic cells contained twice the level of AMV-specific DNA sequences observed in normal chicken embryonic cells. The fast-reacting sequences were two to three times more numerous in leukemic DNA than in DNA from the mixed embryos. The slow-reacting sequences had a reiteration frequency of approximately 9 and 6, in the two respective systems. Both the fast- and the slow-reacting DNA sequences in leukemic cells exhibited a higher Tm (2 C) than the respective DNA sequences in normal cells. In normal and leukemic cells the slow hybrid sequences appeared to have a Tm which was 2 C higher than that of the fast hybrid sequences. Individual non-virus-producing chicken embryos, either group-specific antigen positive or negative, contained 40 to 100 copies of the fast sequences and 2 to 6 copies of the slowly hybridizing sequences per cell genome. Normal rat cells did not contain DNA that hybridized with AMV RNA, whereas non-virus-producing rat cells transformed by B-77 avian sarcoma virus contained only the slowly reacting sequences. The results demonstrate that leukemic cells transformed by AMV contain new AMV-specific DNA sequences which were not present before infection. PMID:16789139
Chromosomal location and gene paucity of the male specific region on papaya Y chromosome.

PubMed

Yu, Qingyi; Hou, Shaobin; Hobza, Roman; Feltus, F Alex; Wang, Xiue; Jin, Weiwei; Skelton, Rachel L; Blas, Andrea; Lemke, Cornelia; Saw, Jimmy H; Moore, Paul H; Alam, Maqsudul; Jiang, Jiming; Paterson, Andrew H; Vyskot, Boris; Ming, Ray

2007-08-01

Sex chromosomes in flowering plants evolved recently and many of them remain homomorphic, including those in papaya. We investigated the chromosomal location of papaya's small male specific region of the hermaphrodite Y (Yh) chromosome (MSY) and its genomic features. We conducted chromosome fluorescence in situ hybridization mapping of Yh-specific bacterial artificial chromosomes (BACs) and placed the MSY near the centromere of the papaya Y chromosome. Then we sequenced five MSY BACs to examine the genomic features of this specialized region, which resulted in the largest collection of contiguous genomic DNA sequences of a Y chromosome in flowering plants. Extreme gene paucity was observed in the papaya MSY with no functional gene identified in 715 kb MSY sequences. A high density of retroelements and local sequence duplications were detected in the MSY that is suppressed for recombination. Location of the papaya MSY near the centromere might have provided recombination suppression and fostered paucity of genes in the male specific region of the Y chromosome. Our findings provide critical information for deciphering the sex chromosomes in papaya and reference information for comparative studies of other sex chromosomes in animals and plants.
Egg Case Silk Gene Sequences from Argiope Spiders: Evidence for Multiple Loci and a Loss of Function Between Paralogs

PubMed Central

Chaw, R. Crystal; Collin, Matthew; Wimmer, Marjorie; Helmrick, Kara-Leigh; Hayashi, Cheryl Y.

2017-01-01

Spiders swath their eggs with silk to protect developing embryos and hatchlings. Egg case silks, like other fibrous spider silks, are primarily composed of proteins called spidroins (spidroin = spider-fibroin). Silks, and thus spidroins, are important throughout the lives of spiders, yet the evolution of spidroin genes has been relatively understudied. Spidroin genes are notoriously difficult to sequence because they are typically very long (≥ 10 kb of coding sequence) and highly repetitive. Here, we investigate the evolution of spider silk genes through long-read sequencing of Bacterial Artificial Chromosome (BAC) clones. We demonstrate that the silver garden spider Argiope argentata has multiple egg case spidroin loci with a loss of function at one locus. We also use degenerate PCR primers to search the genomic DNA of congeneric species and find evidence for multiple egg case spidroin loci in other Argiope spiders. Comparative analyses show that these multiple loci are more similar at the nucleotide level within a species than between species. This pattern is consistent with concerted evolution homogenizing gene copies within a genome. More complicated explanations include convergent evolution or recent independent gene duplications within each species. PMID:29127108
Homogeneity of the 16S rDNA sequence among geographically disparate isolates of Taylorella equigenitalis

PubMed Central

Matsuda, M; Tazumi, A; Kagawa, S; Sekizuka, T; Murayama, O; Moore, JE; Millar, BC

2006-01-01

Background: At present, six accessible sequences of 16S rDNA from Taylorella equigenitalis (T. equigenitalis) are available, whose sequence differences occur at a few nucleotide positions. Thus it is important to determine these sequences from additional strains in other countries, if possible, in order to clarify any anomalies regarding 16S rDNA sequence heterogeneity. Here, we clone and sequence the approximate full-length 16S rDNA from additional strains of T. equigenitalis isolated in Japan, Australia and France and compare these sequences to the existing published sequences. Results: Clarification of any anomalies regarding 16S rDNA sequence heterogeneity of T. equigenitalis was carried out. Cloning, sequencing and comparison of the approximate full-length 16S rDNA from 17 strains of T. equigenitalis isolated in Japan, Australia and France demonstrated nucleotide sequence differences at six loci in the 1,469-nucleotide sequence. Moreover, 12 polymorphic sites occurred among 23 sequences of the 16S rDNA, including the six reference sequences. Conclusion: High sequence similarity (99.5% or more) was observed throughout, except from nucleotide positions 138 to 501 where substitutions and deletions were noted. PMID:16398935
Construction of a YAC contig and STS map spanning 2.5 Mbp in Xq25, the critical region for the X-linked lymphoproliferative (XLP) gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lanyi, A.; Li, B.F.; Li, S.

1994-09-01

X-linked lymphoproliferative disease (XLP) is characterized by a marked vulnerability in Epstein-Barr virus (EBV) infection. Infection of XLP patients with EBV invariably results in fatal mononucleosis, agammaglobulinemia or B-cell lymphoma. The XLP gene lies within a 10 cM region in Xq25 between DXS42 and DXS10. Initial chromosome studies revealed an interstitial, cytogenetically visible deletion in Xq25 in one XLP family (43-004). We estimated the size of the Xq25 deletion by dual laser flow karyotyping to involve 2% of the X chromosome, or approximately 3 Mbp of DNA sequences. To further delineate the deletion we performed a series of pulsed field gel electrophoresis (PFGE) analyses which showed that DXS6 and DXS100, two Xq25-specific markers, are missing from 45-004 DNA. Five yeast artificial chromosomes (YACs) from a chromosome X specific YAC library containing sequences deleted in patient's 43-004 DNA were isolated. These five YACs did not overlap, and their end fragments were used to screen the CEPH MegaYAC library. Seven YACs were isolated from the CEPH MegaYAC library. They could be arranged into a contig which spans between DXS6 and DXS100. The contig contains a minimum of 2.5 Mbp of human DNA. A total of 12 YAC end clones, lambda subclones and STS probes have been used to order clones within the contig. These reagents were also used in Southern blot and patients showed interstitial deletions in Xq25. The size of these deletions range between 0.5 and 2.5 Mbp. The shortest deletion probably represents the critical region for the XLP gene.
Genetic analysis among environmental strains of Balamuthia mandrillaris recovered from an artificial lagoon and from soil in Sonora, Mexico.

PubMed

Lares-Jiménez, Luis Fernando; Booton, Gregory C; Lares-Villa, Fernando; Velázquez-Contreras, Carlos Arturo; Fuerst, Paul A

2014-11-01

Since the first report of Balamuthia mandrillaris as a causative agent of granulomatous amoebic encephalitis in humans, the environmental niche of this amoeba was assumed to be restricted to soil and dust. A single isolation from water was recently made independently by us from Northern Mexico. Now we report the isolation of 8 new strains of B. mandrillaris from Mexico. This continues the pattern of an excess of isolates from North America, compared to other parts of the world. All of the new isolates are environmental isolates, 7 from water samples and one from soil. The identity of each isolate was confirmed by PCR and by examining the sequences of the mitochondrial 16S-like rRNA gene. Success in amplification was determined using comparisons of amplifications of DNA from the strain CDC: V039 and the water strain (ITSON-BM1) as positive controls. The DNA sequences of the new isolates were compared to older strains from clinical cases using phylogenetic analysis, showing very high sequence similarity. The similarity among the new isolates and with previous clinical and environmental isolates of B. mandrillaris was also examined using biochemical and immunological studies. High homogeneity of total protein products, and similarity in antigenic moiety among the eight new isolates and two controls was found. Taken together, the molecular and biochemical studies indicate very low levels of genetic variation within B. mandrillaris. Copyright © 2014 Elsevier Inc. All rights reserved.
An analysis of the positional distribution of DNA motifs in promoter regions and its biological relevance.

PubMed

Casimiro, Ana C; Vinga, Susana; Freitas, Ana T; Oliveira, Arlindo L

2008-02-07

Motif finding algorithms have developed in their ability to use computationally efficient methods to detect patterns in biological sequences. However, the posterior classification of the output still suffers from some limitations, which makes it difficult to assess the biological significance of the motifs found. Previous work has highlighted the existence of positional bias of motifs in DNA sequences, which might not only indicate that the pattern is important, but also provide hints of the positions where these patterns occur preferentially. We propose to integrate position uniformity tests and over-representation tests to improve the accuracy of the classification of motifs. Using artificial data, we have compared three different statistical tests (Chi-Square, Kolmogorov-Smirnov and a Chi-Square bootstrap) to assess whether a given motif occurs uniformly in the promoter region of a gene. Using the test that performed best on this dataset, we proceeded to study the positional distribution of several well-known cis-regulatory elements in the promoter sequences of different organisms (S. cerevisiae, H. sapiens, D. melanogaster, E. coli and several dicotyledonous plants). The results show that position conservation is relevant for the transcriptional machinery. We conclude that many biologically relevant motifs appear heterogeneously distributed in the promoter region of genes, and therefore, that non-uniformity is a good indicator of biological relevance and can be used to complement the commonly used over-representation tests. In this article we present the results obtained for the S. cerevisiae data sets.
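A position uniformity test of the kind compared in this abstract can be sketched in Python as follows. The sketch bins motif start positions along the promoter, computes a chi-square statistic against a uniform expectation, and estimates a p-value by simulating uniformly placed motifs. The simulation step is a generic Monte Carlo procedure and only an assumption about what a chi-square bootstrap might look like; the abstract does not describe the authors' exact procedure. The bin count, trial count and function names are all hypothetical choices.

```python
import random

def chi_square_stat(positions, length, bins):
    """Chi-square statistic of binned positions against a uniform expectation.

    Positions are 0-based motif start coordinates with 0 <= p < length.
    """
    counts = [0] * bins
    for p in positions:
        counts[min(p * bins // length, bins - 1)] += 1
    expected = len(positions) / bins
    return sum((c - expected) ** 2 / expected for c in counts)

def uniformity_p_value(positions, length, bins=10, trials=2000, seed=0):
    """Monte Carlo p-value: fraction of uniform simulations at least as extreme."""
    rng = random.Random(seed)            # fixed seed for reproducibility
    observed = chi_square_stat(positions, length, bins)
    n = len(positions)
    hits = 0
    for _ in range(trials):
        sim = [rng.randrange(length) for _ in range(n)]
        if chi_square_stat(sim, length, bins) >= observed:
            hits += 1
    return (hits + 1) / (trials + 1)     # add-one correction avoids p = 0
```

A small p-value indicates positional bias, which could then be combined with an over-representation score when ranking candidate motifs.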
Detection and quantitation of single nucleotide polymorphisms, DNA sequence variations, DNA mutations, DNA damage and DNA mismatches

DOEpatents

McCutchen-Maloney, Sandra L.

2002-01-01

DNA mutation binding proteins alone and as chimeric proteins with nucleases are used with solid supports to detect DNA sequence variations, DNA mutations and single nucleotide polymorphisms. The solid supports may be flow cytometry beads, DNA chips, glass slides or DNA dipsticks. DNA molecules are coupled to solid supports to form DNA-support complexes. Labeled DNA is used with unlabeled DNA mutation binding proteins such as TthMutS to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by binding which gives an increase in signal. Unlabeled DNA is utilized with labeled chimeras to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by nuclease activity of the chimera which gives a decrease in signal.
A Physical Map, Including a BAC/PAC Clone Contig, of the Williams-Beuren Syndrome–Deletion Region at 7q11.23

PubMed Central

Peoples, Risa; Franke, Yvonne; Wang, Yu-Ker; Pérez-Jurado, Luis; Paperna, Tamar; Cisco, Michael; Francke, Uta

2000-01-01

Summary Williams-Beuren syndrome (WBS) is a developmental disorder caused by haploinsufficiency for genes in a 2-cM region of chromosome band 7q11.23. With the exception of vascular stenoses due to deletion of the elastin gene, the various features of WBS have not yet been attributed to specific genes. Although ⩾16 genes have been identified within the WBS deletion, completion of a physical map of the region has been difficult because of the large duplicated regions flanking the deletion. We present a physical map of the WBS deletion and flanking regions, based on assembly of a bacterial artificial chromosome/P1-derived artificial chromosome contig, analysis of high-throughput genome-sequence data, and long-range restriction mapping of genomic and cloned DNA by pulsed-field gel electrophoresis. Our map encompasses 3 Mb, including 1.6 Mb within the deletion. Two large duplicons, flanking the deletion, of ⩾320 kb contain unique sequence elements from the internal border regions of the deletion, such as sequences from GTF2I (telomeric) and FKBP6 (centromeric). A third copy of this duplicon exists in inverted orientation distal to the telomeric flanking one. These duplicons show stronger sequence conservation with regard to each other than to the presumptive ancestral loci within the common deletion region. Sequence elements originating from beyond 7q11.23 are also present in these duplicons. Although the duplicons are not present in mice, the order of the single-copy genes in the conserved syntenic region of mouse chromosome 5 is inverted relative to the human map. A model is presented for a mechanism of WBS-deletion formation, based on the orientation of duplicons' components relative to each other and to the ancestral elements within the deletion region. PMID:10631136
Identifying a breeding habitat of a critically endangered fish, Acheilognathus typus, in a natural river in Japan

NASA Astrophysics Data System (ADS)

Sakata, Masayuki K.; Maki, Nobutaka; Sugiyama, Hideki; Minamoto, Toshifumi

2017-12-01

Freshwater biodiversity has been severely threatened in recent years, and to conserve endangered species, their distribution and breeding habitats need to be clarified. However, identifying breeding sites in a large area is generally difficult. Here, by combining the emerging environmental DNA (eDNA) analysis with subsequent traditional collection surveys, we successfully identified a breeding habitat for the critically endangered freshwater fish Acheilognathus typus in the mainstream of Omono River in Akita Prefecture, Japan, which is one of the original habitats of this species. Based on DNA cytochrome B sequences of A. typus and closely related species, we developed species-specific primers and a probe that were used in real-time PCR for detecting A. typus eDNA. After verifying the specificity and applicability of the primers and probe on water samples from known artificial habitats, eDNA analysis was applied to water samples collected at 99 sites along Omono River. Two of the samples were positive for A. typus eDNA, and thus, small fixed nets and bottle traps were set out to capture adult fish and verify egg deposition in bivalves (the preferred breeding substrate for A. typus) in the corresponding regions. Mature female and male individuals and bivalves containing laid eggs were collected at one of the eDNA-positive sites. This was the first record of adult A. typus in Omono River in 11 years. This study highlights the value of eDNA analysis to guide conventional monitoring surveys and shows that combining both methods can provide important information on breeding sites that is essential for species' conservation.
Identifying a breeding habitat of a critically endangered fish, Acheilognathus typus, in a natural river in Japan.

PubMed

Sakata, Masayuki K; Maki, Nobutaka; Sugiyama, Hideki; Minamoto, Toshifumi

2017-11-14

Freshwater biodiversity has been severely threatened in recent years, and to conserve endangered species, their distribution and breeding habitats need to be clarified. However, identifying breeding sites in a large area is generally difficult. Here, by combining the emerging environmental DNA (eDNA) analysis with subsequent traditional collection surveys, we successfully identified a breeding habitat for the critically endangered freshwater fish Acheilognathus typus in the mainstream of Omono River in Akita Prefecture, Japan, which is one of the original habitats of this species. Based on DNA cytochrome B sequences of A. typus and closely related species, we developed species-specific primers and a probe that were used in real-time PCR for detecting A. typus eDNA. After verifying the specificity and applicability of the primers and probe on water samples from known artificial habitats, eDNA analysis was applied to water samples collected at 99 sites along Omono River. Two of the samples were positive for A. typus eDNA, and thus, small fixed nets and bottle traps were set out to capture adult fish and verify egg deposition in bivalves (the preferred breeding substrate for A. typus) in the corresponding regions. Mature female and male individuals and bivalves containing laid eggs were collected at one of the eDNA-positive sites. This was the first record of adult A. typus in Omono River in 11 years. This study highlights the value of eDNA analysis to guide conventional monitoring surveys and shows that combining both methods can provide important information on breeding sites that is essential for species' conservation.
Supramolecular Hydrogels Based on DNA Self-Assembly.

PubMed

Shao, Yu; Jia, Haoyang; Cao, Tianyang; Liu, Dongsheng

2017-04-18

Extracellular matrix (ECM) provides essential three-dimensional support to the cells in living organs, including mechanical support and the transport of signals, nutrients, oxygen, and waste. Thus, using hydrogels to mimic its function has attracted much attention in recent years, especially in tissue engineering, cell biology, and drug screening. However, a hydrogel system that can meet all parameters of the natural ECM is still a challenge. In the past decade, deoxyribonucleic acid (DNA) has emerged as an outstanding building material for hydrogels, as it has unique properties compared to most synthetic or natural polymers, such as sequence designability, precise recognition, structural rigidity, and minimal toxicity. By simple attachment to polymers as a side chain, DNA has been widely used as a cross-link in hydrogel preparation. The formed secondary structures could confer on the hydrogel designable responsiveness, such as response to temperature, pH, metal ions, proteins, DNA, RNA, and small signal molecules like ATP. Moreover, single or multiple DNA restriction enzyme sites could be incorporated into the hydrogels by sequence design and greatly expand the latitude of their responses. Compared with most supramolecular hydrogels, these DNA cross-linked hydrogels could be relatively strong and easily adjustable via sequence variation, but it is noteworthy that these hydrogels still have excellent thixotropic properties and could be easily injected through a needle. In addition, the quick formation of duplexes has also enabled the multilayer three-dimensional injection printing of living cells with the hydrogel as matrix. When the matrix is built purely by DNA assembly structures, the hydrogel inherits all the previously described characteristics; however, the long persistence length of DNA structures excludes small meshes from the network and makes the hydrogel permeable to nutrients for cell proliferation.
This unique property greatly extends cell viability in the three-dimensional matrix to several weeks and also provides an easy way to prepare interpenetrating double network materials. In this Account, we outline the development of hydrogels based on DNA self-assembly and discuss the mechanism that brings outstanding properties to the materials. Unlike most reported hydrogel systems, the all-in-one character of the DNA hydrogel avoids the "cask effect" in the properties. We believe the hydrogel will greatly benefit cell behavior studies especially in the following aspects: (1) stem cell differentiation can be studied with solely tunable mechanical strength of the matrix; (2) the dynamic nature of the network can allow cell migration through the hydrogel, which will help to build a more realistic model to observe the migration of cancer cells in vivo; (3) in combination with rapidly developing three-dimensional printing technology, the hydrogel will boost the construction of three-dimensional tissues and artificial organs.
Pan-cancer analysis reveals technical artifacts in TCGA germline variant calls.

PubMed

Buckley, Alexandra R; Standish, Kristopher A; Bhutani, Kunal; Ideker, Trey; Lasken, Roger S; Carter, Hannah; Harismendy, Olivier; Schork, Nicholas J

2017-06-12

Cancer research to date has largely focused on somatically acquired genetic aberrations. In contrast, the degree to which germline, or inherited, variation contributes to tumorigenesis remains unclear, possibly due to a lack of accessible germline variant data. Here we called germline variants on 9618 cases from The Cancer Genome Atlas (TCGA) database representing 31 cancer types. We identified batch effects affecting loss of function (LOF) variant calls that can be traced back to differences in the way the sequence data were generated both within and across cancer types. Overall, LOF indel calls were more sensitive to technical artifacts than LOF Single Nucleotide Variant (SNV) calls. In particular, whole genome amplification of DNA prior to sequencing led to an artificially increased burden of LOF indel calls, which confounded association analyses relating germline variants to tumor type despite stringent indel filtering strategies. The samples affected by these technical artifacts include all acute myeloid leukemia and practically all ovarian cancer samples. We demonstrate how technical artifacts induced by whole genome amplification of DNA can lead to false positive germline-tumor type associations and suggest TCGA whole genome amplified samples be used with caution. This study draws attention to the need to be sensitive to problems associated with a lack of uniformity in data generation in TCGA data.
Enhancement of single guide RNA transcription for efficient CRISPR/Cas-based genomic engineering.

PubMed

Ui-Tei, Kumiko; Maruyama, Shohei; Nakano, Yuko

2017-06-01

Genomic engineering using clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated (Cas) protein is a promising approach for targeting the genomic DNA of virtually any organism in a sequence-specific manner. Recent remarkable advances in CRISPR/Cas technology have made it a feasible system for use in therapeutic applications and biotechnology. In the CRISPR/Cas system, a guide RNA (gRNA), interacting with the Cas protein, recognizes a genomic region with sequence complementarity, and the double-stranded DNA at the target site is cleaved by the Cas protein. A widely used gRNA is an RNA polymerase III (pol III)-driven single gRNA (sgRNA), which is produced by artificial fusion of CRISPR RNA (crRNA) and trans-activating crRNA (tracrRNA). However, we identified a TTTT stretch, known as a termination signal of RNA pol III, in the scaffold region of the sgRNA. Here, we revealed that sgRNA carrying a TTTT stretch reduces the efficiency of sgRNA transcription due to premature transcriptional termination, and decreases the efficiency of genome editing. Unexpectedly, it was also shown that the prematurely terminated sgRNA may have an adverse effect of inducing RNA interference. Such disadvantageous effects were avoided by substituting one base in the TTTT stretch.
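
The check described in this abstract can be illustrated with a minimal sketch that scans a DNA-form template for runs of four or more T's, the RNA pol III termination signal. The function name and the example fragment are hypothetical and are not taken from the paper.

```python
import re

def find_t_stretches(dna, min_len=4):
    """Return (start, length) for every run of at least min_len consecutive T's."""
    pattern = re.compile("T{%d,}" % min_len)
    return [(m.start(), len(m.group())) for m in pattern.finditer(dna.upper())]

# Hypothetical template fragment, used for illustration only.
example = "GTTTTAGAGCTAGAAATAGC"
for start, length in find_t_stretches(example):
    print("T-run of length", length, "at position", start)
```

A template that returns an empty list contains no such run; substituting a single base inside a reported run, as the abstract describes, would also make the list empty for that site.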
Characterization of the canine desmin (DES) gene and evaluation as a candidate gene for dilated cardiomyopathy in the Dobermann.

PubMed

Stabej, Polona; Imholz, Sandra; Versteeg, Serge A; Zijlstra, Carla; Stokhof, Arnold A; Domanjko-Petric, Aleksandra; Leegwater, Peter A J; van Oost, Bernard A

2004-10-13

Dilated cardiomyopathy (DCM) in dogs is a disease of the myocardium associated with dilatation and impaired contraction of the ventricles and is suspected to have a genetic cause. A missense mutation in the desmin gene (DES) causes DCM in a human family. Human DCM closely resembles the canine disease. In the present study, we evaluated whether DES gene mutations are responsible for DCM in Dobermann dogs. We have isolated bacterial artificial chromosome clones (BACs) containing the canine DES gene and determined the chromosomal location by fluorescence in situ hybridization (FISH). Using data deposited in the NCBI trace archive and GenBank, the canine DES gene DNA sequence was assembled and seven single nucleotide polymorphisms (SNPs) were identified. From the canine DES gene BAC clones, a polymorphic microsatellite marker was isolated. The microsatellite marker and four informative desmin SNPs were typed in a Dobermann family with frequent DCM occurrence, but the disease phenotype did not associate with a desmin haplotype. We concluded that mutations in the DES gene do not play a role in Dobermann DCM. Availability of the microsatellite marker, SNPs and DNA sequence reported in this study enables fast evaluation of the DES gene as a DCM candidate gene in other dog breeds with DCM occurrence.
Methylation patterns of repetitive DNA sequences in germ cells of Mus musculus.

PubMed

Sanford, J; Forrester, L; Chapman, V; Chandley, A; Hastie, N

1984-03-26

The major and the minor satellite sequences of Mus musculus were undermethylated in both sperm and oocyte DNAs relative to the amount of undermethylation observed in adult somatic tissue DNA. This hypomethylation was specific for satellite sequences in sperm DNA. Dispersed repetitive and low copy sequences show a high degree of methylation in sperm DNA; however, a dispersed repetitive sequence was undermethylated in oocyte DNA. This finding suggests a difference in the amount of total genomic DNA methylation between sperm and oocyte DNA. The methylation levels of the minor satellite sequences did not change during spermiogenesis, and were not associated with the onset of meiosis or a specific stage in sperm development.
Process of labeling specific chromosomes using recombinant repetitive DNA

DOEpatents

Moyzis, R.K.; Meyne, J.

1988-02-12

Chromosome preferential nucleotide sequences are first determined from a library of recombinant DNA clones having families of repetitive sequences. Library clones are identified with a low homology with a sequence of repetitive DNA families to which the first clones respectively belong and variant sequences are then identified by selecting clones having a pattern of hybridization with genomic DNA dissimilar to the hybridization pattern shown by the respective families. In another embodiment, variant sequences are selected from a sequence of a known repetitive DNA family. The selected variant sequence is classified as chromosome specific, chromosome preferential, or chromosome nonspecific. Sequences which are classified as chromosome preferential are further sequenced and regions are identified having a low homology with other regions of the chromosome preferential sequence or with known sequences of other family members and consensus sequences of the repetitive DNA families for the chromosome preferential sequences. The selected low homology regions are then hybridized with chromosomes to determine those low homology regions hybridized with a specific chromosome under normal stringency conditions.
Benchmark analysis of native and artificial NAD+-dependent enzymes generated by a sequence based design method with or without phylogenetic data.

PubMed

Nakano, Shogo; Motoyama, Tomoharu; Miyashita, Yurina; Ishizuka, Yuki; Matsuo, Naoya; Tokiwa, Hiroaki; Shinoda, Suguru; Asano, Yasuhisa; Ito, Sohei

2018-05-22

The expansion of protein sequence databases has enabled us to design artificial proteins by sequence-based design methods, such as full consensus design (FCD) and ancestral sequence reconstruction (ASR). Artificial proteins with enhanced activity levels compared with native ones can potentially be generated by such methods, but successful design is rare because preparing a sequence library by curating the database and selecting a method is difficult. Utilizing a curated library prepared by reducing conservation energies, we successfully designed two artificial L-threonine 3-dehydrogenases (SDR-TDHs) with higher activity levels than native SDR-TDH, FcTDH-N1 and AncTDH, using FCD and ASR, respectively. The artificial SDR-TDHs had excellent thermal stability and NAD+ recognition compared to native SDR-TDH from Cupriavidus necator (CnTDH): the melting temperatures of FcTDH-N1 and AncTDH were about 10 and 5°C higher than that of CnTDH, respectively, and the dissociation constants toward NAD+ of FcTDH-N1 and AncTDH were two- and seven-fold lower than that of CnTDH, respectively. Enzymatic efficiency of the artificial SDR-TDHs was comparable to that of CnTDH. Crystal structures of FcTDH-N1 and AncTDH were determined at 2.8 and 2.1 Å resolution, respectively. Structural and MD simulation analysis of the SDR-TDHs indicated that only the flexibility at specific regions was changed, suggesting that multiple mutations introduced in the artificial SDR-TDHs altered their flexibility and thereby affected their enzymatic properties. Benchmark analysis of the SDR-TDHs indicated that both FCD and ASR can generate highly functional proteins if a curated library is prepared appropriately.
Sequence-based prediction of protein-binding sites in DNA: comparative study of two SVM models.

PubMed

Park, Byungkyu; Im, Jinyong; Tuvshinjargal, Narankhuu; Lee, Wook; Han, Kyungsook

2014-11-01

As many structures of protein-DNA complexes have become known in recent years, several computational methods have been developed to predict DNA-binding sites in proteins. However, the inverse problem (i.e., predicting protein-binding sites in DNA) has received much less attention. One of the reasons is that the differences between the interaction propensities of nucleotides are much smaller than those between amino acids. Another reason is that DNA exhibits less diverse sequence patterns than protein. Therefore, predicting protein-binding DNA nucleotides is much harder than predicting DNA-binding amino acids. We computed the interaction propensity (IP) of nucleotide triplets with amino acids using an extensive dataset of protein-DNA complexes, and developed two support vector machine (SVM) models that predict protein-binding nucleotides from sequence data alone. One SVM model predicts protein-binding nucleotides using DNA sequence data alone, and the other SVM model predicts protein-binding nucleotides using both DNA and protein sequences. In a 10-fold cross-validation with 1519 DNA sequences, the SVM model that uses DNA sequence data only predicted protein-binding nucleotides with an accuracy of 67.0%, an F-measure of 67.1%, and a Matthews correlation coefficient (MCC) of 0.340. With an independent dataset of 181 DNAs that were not used in training, it achieved an accuracy of 66.2%, an F-measure of 66.3% and an MCC of 0.324. Another SVM model that uses both DNA and protein sequences achieved an accuracy of 69.6%, an F-measure of 69.6%, and an MCC of 0.383 in a 10-fold cross-validation with 1519 DNA sequences and 859 protein sequences. With an independent dataset of 181 DNAs and 143 proteins, it showed an accuracy of 67.3%, an F-measure of 66.5% and an MCC of 0.329. Both in cross-validation and independent testing, the second SVM model that used both DNA and protein sequence data showed better performance than the first model that used DNA sequence data. 
To the best of our knowledge, this is the first attempt to predict protein-binding nucleotides in a given DNA sequence from the sequence data alone. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
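
For readers unfamiliar with the three scores reported in this abstract, the sketch below computes accuracy, F-measure and MCC from a binary confusion matrix. It assumes the F-measure is the usual F1 score (harmonic mean of precision and recall); the abstract does not state which definition the authors used, and the function name is hypothetical.

```python
import math

def binary_metrics(tp, fp, tn, fn):
    """Return (accuracy, F1, MCC) for counts of true/false positives/negatives."""
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1 is assumed here; it is the harmonic mean of precision and recall.
    f1 = (2 * precision * recall / (precision + recall)) if (precision + recall) else 0.0
    # MCC ranges from -1 to 1, with 0 meaning no better than chance.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = ((tp * tn - fp * fn) / denom) if denom else 0.0
    return accuracy, f1, mcc

# Made-up counts, for illustration only.
print(binary_metrics(tp=25, fp=25, tn=25, fn=25))
```

Because MCC uses all four cells of the confusion matrix, it can stay modest (as in the values near 0.3 reported above) even when accuracy and F-measure look moderately high.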

Synthetic spike-in standards for high-throughput 16S rRNA gene amplicon sequencing

PubMed Central

Tourlousse, Dieter M.; Yoshiike, Satowa; Ohashi, Akiko; Matsukura, Satoko; Noda, Naohiro

2017-01-01

High-throughput sequencing of 16S rRNA gene amplicons (16S-seq) has become a widely deployed method for profiling complex microbial communities but technical pitfalls related to data reliability and quantification remain to be fully addressed. In this work, we have developed and implemented a set of synthetic 16S rRNA genes to serve as universal spike-in standards for 16S-seq experiments. The spike-ins represent full-length 16S rRNA genes containing artificial variable regions with negligible identity to known nucleotide sequences, permitting unambiguous identification of spike-in sequences in 16S-seq read data from any microbiome sample. Using defined mock communities and environmental microbiota, we characterized the performance of the spike-in standards and demonstrated their utility for evaluating data quality on a per-sample basis. Further, we showed that staggered spike-in mixtures added at the point of DNA extraction enable concurrent estimation of absolute microbial abundances suitable for comparative analysis. Results also underscored that template-specific Illumina sequencing artifacts may lead to biases in the perceived abundance of certain taxa. Taken together, the spike-in standards represent a novel bioanalytical tool that can substantially improve 16S-seq-based microbiome studies by enabling comprehensive quality control along with absolute quantification. PMID:27980100
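
The idea of absolute quantification with a spike-in can be sketched as a simple proportional scaling: if a known number of spike-in copies yields a known number of reads, each read is taken to represent a fixed number of copies. This is a deliberately simplified, single-spike-in illustration and not the authors' procedure (the abstract describes staggered mixtures); all names and numbers are hypothetical.

```python
def absolute_abundance(taxon_reads, spike_reads, spike_copies_added):
    """Scale per-taxon read counts to estimated 16S copy numbers.

    Assumes reads are proportional to template copies, which ignores
    amplification and sequencing biases.
    """
    if spike_reads <= 0:
        raise ValueError("spike-in read count must be positive")
    copies_per_read = spike_copies_added / spike_reads
    return {taxon: reads * copies_per_read for taxon, reads in taxon_reads.items()}

# Made-up read counts, for illustration only.
estimates = absolute_abundance({"taxonA": 2000, "taxonB": 500},
                               spike_reads=1000, spike_copies_added=1e5)
print(estimates)
```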
Enlightenment of Yeast Mitochondrial Homoplasmy: Diversified Roles of Gene Conversion

PubMed Central

Ling, Feng; Mikawa, Tsutomu; Shibata, Takehiko

2011-01-01

Mitochondria have their own genomic DNA. Unlike the nuclear genome, each cell contains hundreds to thousands of copies of mitochondrial DNA (mtDNA). The copies of mtDNA tend to have heterogeneous sequences, due to the high frequency of mutagenesis, but are quickly homogenized within a cell (“homoplasmy”) during vegetative cell growth or through a few sexual generations. Heteroplasmy is strongly associated with mitochondrial diseases, diabetes and aging. Recent studies revealed that the yeast cell has the machinery to homogenize mtDNA, using a common DNA processing pathway with gene conversion; i.e., both genetic events are initiated by a double-stranded break, which is processed into 3′ single-stranded tails. One of the tails is base-paired with the complementary sequence of the recipient double-stranded DNA to form a D-loop (homologous pairing), in which repair DNA synthesis is initiated to restore the sequence lost by the breakage. Gene conversion generates sequence diversity, depending on the divergence between the donor and recipient sequences, especially when it occurs among a number of copies of a DNA sequence family with some sequence variations, such as in immunoglobulin diversification in chicken. MtDNA can be regarded as a sequence family, in which the members tend to be diversified by a high frequency of spontaneous mutagenesis. Thus, it would be interesting to determine why and how double-stranded breakage and D-loop formation induce sequence homogenization in mitochondria and sequence diversification in nuclear DNA. We will review the mechanisms and roles of mtDNA homoplasmy, in contrast to nuclear gene conversion, which diversifies gene and genome sequences, to provide clues toward understanding how the common DNA processing pathway results in such divergent outcomes. PMID:24710143
The artificial zinc finger coding gene 'Jazz' binds the utrophin promoter and activates transcription.

PubMed

Corbi, N; Libri, V; Fanciulli, M; Tinsley, J M; Davies, K E; Passananti, C

2000-06-01

Up-regulation of utrophin gene expression is recognized as a plausible therapeutic approach in the treatment of Duchenne muscular dystrophy (DMD). We have designed and engineered new zinc finger-based transcription factors capable of binding and activating transcription from the promoter of the dystrophin-related gene, utrophin. Using the recognition 'code' that proposes specific rules between zinc finger primary structure and potential DNA binding sites, we engineered a new gene named 'Jazz' that encodes for a three-zinc finger peptide. Jazz belongs to the Cys2-His2 zinc finger type and was engineered to target the nine base pair DNA sequence: 5'-GCT-GCT-GCG-3', present in the promoter region of both the human and mouse utrophin gene. The entire zinc finger alpha-helix region, containing the amino acid positions that are crucial for DNA binding, was specifically chosen on the basis of the contacts more frequently represented in the available list of the 'code'. Here we demonstrate that Jazz protein binds specifically to the double-stranded DNA target, with a dissociation constant of about 32 nM. Band shift and super-shift experiments confirmed the high affinity and specificity of Jazz protein for its DNA target. Moreover, we show that chimeric proteins, named Gal4-Jazz and Sp1-Jazz, are able to drive the transcription of a test gene from the human utrophin promoter.
Structure and hydrodynamics of a DNA G-quadruplex with a cytosine bulge.

PubMed

Meier, Markus; Moya-Torres, Aniel; Krahn, Natalie J; McDougall, Matthew D; Orriss, George L; McRae, Ewan K S; Booy, Evan P; McEleney, Kevin; Patel, Trushar R; McKenna, Sean A; Stetefeld, Jörg

2018-06-01

The identification of four-stranded G-quadruplexes (G4s) has highlighted the fact that DNA has additional spatial organisations at its disposal other than double-stranded helices. Recently, it became clear that the formation of G4s is not limited to the traditional $G_{3+}N_{L1}G_{3+}N_{L2}G_{3+}N_{L3}G_{3+}$ sequence motif. Instead, the G3 triplets can be interrupted by deoxythymidylate (DNA) or uridylate (RNA) where the base forms a bulge that loops out from the G-quadruplex core. Here, we report the first high-resolution X-ray structure of a unique unimolecular DNA G4 with a cytosine bulge. The G4 forms a dimer that is stacked via its 5'-tetrads. Analytical ultracentrifugation, static light scattering and small angle X-ray scattering confirmed that the G4 adopts a predominantly dimeric structure in solution. We provide a comprehensive comparison of previously published G4 structures containing bulges and report a special γ torsion angle range preferentially populated by the G4 core guanylates adjacent to bulges. Since the penalty for introducing bulges appears to be negligible, it should be possible to functionalize G4s by introducing artificial or modified nucleotides at such positions. The presence of the bulge alters the surface of the DNA, providing an opportunity to develop drugs that can specifically target individual G4s.
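
The traditional motif mentioned in this abstract (four runs of three or more guanines separated by three loops) can be searched for with a regular expression. The sketch below is only an illustration: the loop length range of 1 to 7 nucleotides is a commonly used convention assumed here, not a value from the abstract, and the pattern does not capture the bulged G4s that the paper is about.

```python
import re

# Four G-runs (3 or more G's) separated by three loops of 1-7 nucleotides.
# The 1-7 loop range is an assumption made for this sketch.
G4_PATTERN = re.compile(r"G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}")

def find_g4_motifs(dna):
    """Return (start, end) spans of non-overlapping canonical G4 motif matches."""
    return [(m.start(), m.end()) for m in G4_PATTERN.finditer(dna.upper())]

# Telomere-like repeat flanked by two A's on each side, for illustration only.
print(find_g4_motifs("AAGGGTTAGGGTTAGGGTTAGGGAA"))
```

A sequence in which one G-run is interrupted by a single bulged base would be missed by this pattern, which is one way to see why bulged G4s fall outside the traditional motif.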
"First generation" automated DNA sequencing technology.

PubMed

Slatko, Barton E; Kieleczawa, Jan; Ju, Jingyue; Gardner, Andrew F; Hendrickson, Cynthia L; Ausubel, Frederick M

2011-10-01

Beginning in the 1980s, automation of DNA sequencing has greatly increased throughput, reduced costs, and enabled large projects to be completed more easily. The development of automation technology paralleled the development of other aspects of DNA sequencing: better enzymes and chemistry, separation and imaging technology, sequencing protocols, robotics, and computational advancements (including base-calling algorithms with quality scores, database developments, and sequence analysis programs). Despite the emergence of high-throughput sequencing platforms, automated Sanger sequencing technology remains useful for many applications. This unit provides background and a description of the "First-Generation" automated DNA sequencing technology. It also includes protocols for using the current Applied Biosystems (ABI) automated DNA sequencing machines. © 2011 by John Wiley & Sons, Inc.
Novel Phenanthrene-Degrading Bacteria Identified by DNA-Stable Isotope Probing

PubMed Central

Luo, Chunling; Zhang, Dayi; Zhang, Gan

2015-01-01

Microorganisms responsible for the degradation of phenanthrene in a clean forest soil sample were identified by DNA-based stable isotope probing (SIP). The soil was artificially amended with either 12C- or 13C-labeled phenanthrene, and soil DNA was extracted on days 3, 6 and 9. Terminal restriction fragment length polymorphism (TRFLP) results revealed that the fragments of 219- and 241-bp in HaeIII digests were distributed throughout the gradient profile at three different sampling time points, and both fragments were more dominant in the heavy fractions of the samples exposed to the 13C-labeled contaminant. 16S rRNA sequencing of the 13C-enriched fraction suggested that Acidobacterium spp. within the class Acidobacteria, and Collimonas spp. within the class Betaproteobacteria, were directly involved in the uptake and degradation of phenanthrene at different times. To our knowledge, this is the first report that the genus Collimonas has the ability to degrade PAHs. Two PAH-RHDα genes were identified in 13C-labeled DNA. However, isolation of pure cultures indicated that strains of Staphylococcus sp. PHE-3, Pseudomonas sp. PHE-1, and Pseudomonas sp. PHE-2 in the soil had high phenanthrene-degrading ability. This emphasizes the role of a culture-independent method in the functional understanding of microbial communities in situ. PMID:26098417
Influence of DNA sequence on the structure of minicircles under torsional stress

PubMed Central

Wang, Qian; Irobalieva, Rossitza N.; Chiu, Wah; Schmid, Michael F.; Fogg, Jonathan M.; Zechiedrich, Lynn

2017-01-01

The sequence dependence of the conformational distribution of DNA under various levels of torsional stress is an important unsolved problem. Combining theory and coarse-grained simulations shows that the DNA sequence and a structural correlation due to topology constraints of a circle are the main factors that dictate the 3D structure of a 336 bp DNA minicircle under torsional stress. We found that DNA minicircle topoisomers can have multiple bend locations under high torsional stress and that the positions of these sharp bends are determined by the sequence, and by a positive mechanical correlation along the sequence. We showed that simulations and theory are able to provide sequence-specific information about individual DNA minicircles observed by cryo-electron tomography (cryo-ET). We provided a sequence-specific cryo-ET tomogram fitting of DNA minicircles, registering the sequence within the geometric features. Our results indicate that the conformational distribution of minicircles under torsional stress can be designed, which has important implications for using minicircle DNA for gene therapy. PMID:28609782
Association of N2-fixing cyanobacteria and plants: towards novel symbioses of agricultural importance

DOE Office of Scientific and Technical Information (OSTI.GOV)

Elhai, Jeff

2001-06-25

Some nitrogen-fixing cyanobacteria are able to form symbioses with a wide variety of plants. Nostoc 2S9B is unusual in its ability to infect the roots of wheat, raising the prospect of a productive association with an important crop plant. The goal of the project was to lay the groundwork for the use of novel associations between Nostoc and crops of agronomic importance, thereby reducing our reliance on nitrogenous fertilizer. Nostoc 2S9B was found to enter roots through mechanical damage of roots and reside primarily in intercellular spaces. The strain could also be incorporated into wheat calli grown in tissue culture. In both cases, the rate of nitrogen fixation by the cyanobacterium was higher than that of the same strain grown with no plant present. Artificial nodules induced by the action of hormone 2,4D were readily infected by Nostoc 2S9B, and the cyanobacteria within such nodules fixed nitrogen under fully aerobic conditions. The nitrogen fixed was shown to be incorporated into the growing wheat seedlings. Nostoc thus differs from other bacteria in its ability to fix nitrogen in para-nodules without need for artificially microaerobic conditions. It would be useful to introduce foreign DNA into Nostoc 2S9B in order to make defined mutations to understand the genetic basis of its ability to infect wheat and to create strains that might facilitate the study of the infection process. Transfer of DNA into the cyanobacterium appears to be limited by the presence of four restriction enzymes, with recognition sequences the same as BamHI, BglI, BsaHI, and Tth111I. Genes encoding methyltransferases that protect DNA against these four enzymes have been cloned into helper plasmids to allow transfer of DNA from E. coli to Nostoc 2S9B.
Analysis of DNA Sequences by an Optical Time-Integrating Correlator: Proof-of-Concept Experiments.

DTIC Science & Technology

1992-05-01

DNA Analysis Strategy; 2.1 Representation of DNA Bases; 2.2 DNA Analysis Strategy; 3.0 Custom Generators for DNA Sequences; 3.1 Hardware Design ... of the DNA bases where each base is represented by a 7-bits long pseudorandom sequence. Figure 4: Coarse analysis of a DNA sequence. Figure 5: Fine ... a 20-bases long database. List of Tables: Table 1: Short representations of the DNA bases where each base is represented by 7-bits long
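
The record above mentions representing each DNA base by a 7-bit pseudorandom sequence and correlating sequences against a database. The sketch below shows, in software, how such a representation makes matches stand out in a correlation. The four codes are hypothetical (cyclic shifts of one length-7 maximal-length sequence, chosen for this illustration); the report's actual codes and its optical implementation are not given in the record.

```python
# Hypothetical 7-bit codes: cyclic shifts of one maximal-length sequence.
# In +1/-1 form, identical codes correlate to 7 and different codes to -1.
CODES = {"A": "1110100", "C": "0111010", "G": "0011101", "T": "1001110"}
CHIPS_PER_BASE = 7

def encode(seq):
    """Map a DNA string to a list of +1/-1 chips, 7 chips per base."""
    return [1 if bit == "1" else -1 for base in seq.upper() for bit in CODES[base]]

def correlate(database, query):
    """Correlation score of the query at every base-aligned offset in the database."""
    d, q = encode(database), encode(query)
    scores = []
    for offset in range(0, len(d) - len(q) + 1, CHIPS_PER_BASE):
        window = d[offset:offset + len(q)]
        scores.append(sum(a * b for a, b in zip(window, q)))
    return scores

def best_match(database, query):
    """Return (base position, score) of the highest correlation peak."""
    scores = correlate(database, query)
    peak = max(scores)
    return scores.index(peak), peak

# Made-up 20-base database and 4-base query, for illustration only.
print(best_match("ACGTTGCAACGGTCAGTACG", "GGTC"))
```

With these codes every matching base adds 7 to the score and every mismatching base subtracts 1, so an exact match of an n-base query produces a peak of 7n.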
Laser mass spectrometry for DNA sequencing, disease diagnosis, and fingerprinting

NASA Astrophysics Data System (ADS)

Chen, C. H. Winston; Taranenko, N. I.; Zhu, Y. F.; Chung, C. N.; Allman, S. L.

1997-05-01

Since laser mass spectrometry has the potential for achieving very fast DNA analysis, we recently applied it to DNA sequencing, DNA typing for fingerprinting, and DNA screening for disease diagnosis. Two different approaches for sequencing DNA have been successfully demonstrated. One is to sequence DNA with DNA ladders produced from Sanger's enzymatic method. The other is to do direct sequencing without DNA ladders. The need for quick DNA typing for identification purposes is critical for forensic application. Our preliminary results indicate laser mass spectrometry can possibly be used for rapid DNA fingerprinting applications at a much lower cost than gel electrophoresis. Population screening for certain genetic diseases can be a very efficient step toward reducing medical costs through prevention. Since laser mass spectrometry can provide very fast DNA analysis, we applied laser mass spectrometry to disease diagnosis. Clinical samples with both base deletion and point mutation have been tested with complete success.
Micronuclear DNA of Oxytricha nova contains sequences with autonomously replicating activity in Saccharomyces cerevisiae.

PubMed Central

Colombo, M M; Swanton, M T; Donini, P; Prescott, D M

1984-01-01

Oxytricha nova is a hypotrichous ciliate with micronuclei and macronuclei. Micronuclei, which contain large, chromosomal-sized DNA, are genetically inert but undergo meiosis and exchange during cell mating. Macronuclei, which contain only small, gene-sized DNA molecules, provide all of the nuclear RNA needed to run the cell. After cell mating the macronucleus is derived from a micronucleus, a derivation that includes excision of the genes from chromosomes and elimination of the remaining DNA. The eliminated DNA includes all of the repetitious sequences and approximately 95% of the unique sequences. We cloned large restriction fragments from the micronucleus that confer replication ability on a replication-deficient plasmid in Saccharomyces cerevisiae. Sequences that confer replication ability are called autonomously replicating sequences. The frequency and effectiveness of autonomously replicating sequences in micronuclear DNA are similar to those reported for DNAs of other organisms introduced into yeast cells. Of the 12 micronuclear fragments with autonomously replicating sequence activity, 9 also showed homology to macronuclear DNA, indicating that they contain a macronuclear gene sequence. We conclude from this that autonomously replicating sequence activity is nonrandomly distributed throughout micronuclear DNA and is preferentially associated with those regions of micronuclear DNA that contain genes. PMID:6092934
DNA sequence-dependent mechanics and protein-assisted bending in repressor-mediated loop formation

PubMed Central

Boedicker, James Q.; Garcia, Hernan G.; Johnson, Stephanie; Phillips, Rob

2014-01-01

As the chief informational molecule of life, DNA is subject to extensive physical manipulations. The energy required to deform double-helical DNA depends on sequence, and this mechanical code of DNA influences gene regulation, such as through nucleosome positioning. Here we examine the sequence-dependent flexibility of DNA in bacterial transcription factor-mediated looping, a context for which the role of sequence remains poorly understood. Using a suite of synthetic constructs repressed by the Lac repressor and two well-known sequences that show large flexibility differences in vitro, we make precise statistical mechanical predictions as to how DNA sequence influences loop formation and test these predictions using in vivo transcription and in vitro single-molecule assays. Surprisingly, sequence-dependent flexibility does not affect in vivo gene regulation. By theoretically and experimentally quantifying the relative contributions of sequence and the DNA-bending protein HU to DNA mechanical properties, we reveal that bending by HU dominates DNA mechanics and masks intrinsic sequence-dependent flexibility. Such a quantitative understanding of how mechanical regulatory information is encoded in the genome will be a key step towards a predictive understanding of gene regulation at single-base pair resolution. PMID:24231252
Divergent nuclear 18S rDNA paralogs in a turkey coccidium, Eimeria meleagrimitis, complicate molecular systematics and identification.

PubMed

El-Sherry, Shiem; Ogedengbe, Mosun E; Hafeez, Mian A; Barta, John R

2013-07-01

Multiple 18S rDNA sequences were obtained from two single-oocyst-derived lines of each of Eimeria meleagrimitis and Eimeria adenoeides. After analysing the 15 new 18S rDNA sequences from two lines of E. meleagrimitis and 17 new sequences from two lines of E. adenoeides, there were clear indications that divergent, paralogous 18S rDNA copies existed within the nuclear genome of E. meleagrimitis. In contrast, mitochondrial cytochrome c oxidase subunit I (COI) partial sequences from all lines of a particular Eimeria sp. were identical and, in phylogenetic analyses, COI sequences clustered unambiguously in monophyletic and highly-supported clades specific to individual Eimeria sp. Phylogenetic analysis of the new 18S rDNA sequences from E. meleagrimitis showed that they formed two distinct clades: Type A with four new sequences; and Type B with nine new sequences; both Types A and B sequences were obtained from each of the single-oocyst-derived lines of E. meleagrimitis. Together these rDNA types formed a well-supported E. meleagrimitis clade. Types A and B 18S rDNA sequences from E. meleagrimitis had a mean sequence identity of only 97.4% whereas mean sequence identity within types was 99.1-99.3%. The observed intraspecific sequence divergence among E. meleagrimitis 18S rDNA sequence types was even higher (approximately 2.6%) than the interspecific sequence divergence present between some well-recognized species such as Eimeria tenella and Eimeria necatrix (1.1%). Our observations suggest that, unlike COI sequences, 18S rDNA sequences are not reliable molecular markers to be used alone for species identification with coccidia, although 18S rDNA sequences have clear utility for phylogenetic reconstruction of apicomplexan parasites at the genus and higher taxonomic ranks. Copyright © 2013. Published by Elsevier Ltd.
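The within-type and between-type identity figures quoted above are means over pairwise comparisons of aligned sequences. The following is a minimal sketch of that calculation, not the authors' pipeline: the function names, the gap-handling rule (columns with a gap in either sequence are skipped) and the toy sequences are all assumptions made for illustration.

```python
from itertools import combinations, product


def identity(a: str, b: str) -> float:
    """Percent identity of two pre-aligned sequences, skipping gapped columns."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [(x, y) for x, y in zip(a.upper(), b.upper())
               if x != "-" and y != "-"]
    if not columns:
        raise ValueError("no comparable columns")
    return 100.0 * sum(x == y for x, y in columns) / len(columns)


def mean_identity_within(group):
    """Mean pairwise identity among all pairs drawn from one group."""
    values = [identity(a, b) for a, b in combinations(group, 2)]
    return sum(values) / len(values)


def mean_identity_between(group1, group2):
    """Mean pairwise identity over all cross-group pairs."""
    values = [identity(a, b) for a, b in product(group1, group2)]
    return sum(values) / len(values)


# Hypothetical toy alignments standing in for two paralog types.
type_a = ["ACGTACGTAC", "ACGTACGTAT"]
type_b = ["ACGAACGTAC", "ACGAACGTAC"]

print(mean_identity_within(type_a))
print(mean_identity_between(type_a, type_b))
```

Divergence in the sense used in the abstract is then 100 minus the mean identity.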
Affordable hands-on DNA sequencing and genotyping: an exercise for teaching DNA analysis to undergraduates.

PubMed

Shah, Kushani; Thomas, Shelby; Stein, Arnold

2013-01-01

In this report, we describe a 5-week laboratory exercise for undergraduate biology and biochemistry students in which students learn to sequence DNA and to genotype their DNA for selected single nucleotide polymorphisms (SNPs). Students use miniaturized DNA sequencing gels that require approximately 8 min to run. The students perform G, A, T, C Sanger sequencing reactions. They prepare and run the gels, perform Southern blots (which require only 10 min), and detect sequencing ladders using a colorimetric detection system. Students enlarge their sequencing ladders from digital images of their small nylon membranes, and read the sequence manually. They compare their reads with the actual DNA sequence using BLAST2. After mastering the DNA sequencing system, students prepare their own DNA from a cheek swab, polymerase chain reaction-amplify a region of their DNA that encompasses a SNP of interest, and perform sequencing to determine their genotype at the SNP position. A family pedigree can also be constructed. The SNP chosen by the instructor was rs17822931, which is in the ABCC11 gene and is the determinant of human earwax type. Genotypes at the rs17822931 site vary in different ethnic populations. © 2013 by The International Union of Biochemistry and Molecular Biology.
Phylogenetic characterization of a biogas plant microbial community integrating clone library 16S-rDNA sequences and metagenome sequence data obtained by 454-pyrosequencing.

PubMed

Kröber, Magdalena; Bekel, Thomas; Diaz, Naryttza N; Goesmann, Alexander; Jaenicke, Sebastian; Krause, Lutz; Miller, Dimitri; Runte, Kai J; Viehöver, Prisca; Pühler, Alfred; Schlüter, Andreas

2009-06-01

The phylogenetic structure of the microbial community residing in a fermentation sample from a production-scale biogas plant fed with maize silage, green rye and liquid manure was analysed by an integrated approach using clone library sequences and metagenome sequence data obtained by 454-pyrosequencing. Sequencing of 109 clones from a bacterial and an archaeal 16S-rDNA amplicon library revealed that the obtained nucleotide sequences are similar but not identical to 16S-rDNA database sequences derived from different anaerobic environments including digestors and bioreactors. Most of the bacterial 16S-rDNA sequences could be assigned to the phylum Firmicutes with the most abundant class Clostridia and to the class Bacteroidetes, whereas most archaeal 16S-rDNA sequences cluster close to the methanogen Methanoculleus bourgensis. Further sequences of the archaeal library most probably represent so far non-characterised species within the genus Methanoculleus. A similar result derived from phylogenetic analysis of mcrA clone sequences. The mcrA gene product encodes the alpha-subunit of methyl-coenzyme-M reductase involved in the final step of methanogenesis. BLASTn analysis applying stringent settings resulted in assignment of 16S-rDNA metagenome sequence reads to 62 16S-rDNA amplicon sequences thus enabling frequency of abundance estimations for 16S-rDNA clone library sequences. Ribosomal Database Project (RDP) Classifier processing of metagenome 16S-rDNA reads revealed abundance of the phyla Firmicutes, Bacteroidetes and Euryarchaeota and the orders Clostridiales, Bacteroidales and Methanomicrobiales. Moreover, a large fraction of 16S-rDNA metagenome reads could not be assigned to lower taxonomic ranks, demonstrating that numerous microorganisms in the analysed fermentation sample of the biogas plant are still unclassified or unknown.
Droplet microfluidics for synthetic biology

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gach, PC; Iwai, K; Kim, PW

2017-01-01

© 2017 The Royal Society of Chemistry. Synthetic biology is an interdisciplinary field that aims to engineer biological systems for useful purposes. Organism engineering often requires the optimization of individual genes and/or entire biological pathways (consisting of multiple genes). Advances in DNA sequencing and synthesis have recently begun to enable the possibility of evaluating thousands of gene variants and hundreds of thousands of gene combinations. However, such large-scale optimization experiments remain cost-prohibitive to researchers following traditional molecular biology practices, which are frequently labor-intensive and suffer from poor reproducibility. Liquid handling robotics may reduce labor and improve reproducibility, but are themselves expensive and thus inaccessible to most researchers. Microfluidic platforms offer a lower entry price point alternative to robotics, and maintain high throughput and reproducibility while further reducing operating costs through diminished reagent volume requirements. Droplet microfluidics have shown exceptional promise for synthetic biology experiments, including DNA assembly, transformation/transfection, culturing, cell sorting, phenotypic assays, artificial cells and genetic circuits.
LINE1 family member is negative regulator of HLA-G expression.

PubMed

Ikeno, Masashi; Suzuki, Nobutaka; Kamiya, Megumi; Takahashi, Yuji; Kudoh, Jun; Okazaki, Tsuneko

2012-11-01

Class Ia molecules of human leucocyte antigen (HLA-A, -B and -C) are widely expressed and play a central role in the immune system by presenting peptides derived from the lumen of the endoplasmic reticulum. In contrast, class Ib molecules such as HLA-G serve novel functions. The distribution of HLA-G is mostly limited to foetal trophoblastic tissues and some tumour tissues. The mechanism required for the tissue-specific regulation of the HLA-G gene has not been well understood. Here, we investigated the genomic regulation of HLA-G by manipulating one copy of a genomic DNA fragment on a human artificial chromosome. We identified a potential negative regulator of gene expression in a sequence upstream of HLA-G that overlapped with the long interspersed element (LINE1); silencing of HLA-G involved a DNA secondary structure generated in LINE1. The presence of a LINE1 gene silencer may explain the limited expression of HLA-G compared with other class I genes.
Relaxation of selective constraint on dog mitochondrial DNA following domestication.

PubMed

Björnerfeldt, Susanne; Webster, Matthew T; Vilà, Carles

2006-08-01

The domestication of dogs caused a dramatic change in their way of life compared with that of their ancestor, the gray wolf. We hypothesize that this new life style changed the selective forces that acted upon the species, which in turn had an effect on the dog's genome. We sequenced the complete mitochondrial DNA genome in 14 dogs, six wolves, and three coyotes. Here we show that dogs have accumulated nonsynonymous changes in mitochondrial genes at a faster rate than wolves, leading to elevated levels of variation in their proteins. This suggests that a major consequence of domestication in dogs was a general relaxation of selective constraint on their mitochondrial genome. If this change also affected other parts of the dog genome, it could have facilitated the generation of novel functional genetic diversity. This diversity could thus have contributed raw material upon which artificial selection has shaped modern breeds and may therefore be an important source of the extreme phenotypic variation present in modern-day dogs.
Influence of major-groove chemical modifications of DNA on transcription by bacterial RNA polymerases

PubMed Central

Raindlová, Veronika; Janoušková, Martina; Slavíčková, Michaela; Perlíková, Pavla; Boháčová, Soňa; Milisavljevič, Nemanja; Šanderová, Hana; Benda, Martin; Barvík, Ivan; Krásný, Libor; Hocek, Michal

2016-01-01

DNA templates containing a set of base modifications in the major groove (5-substituted pyrimidines or 7-substituted 7-deazapurines bearing H, methyl, vinyl, ethynyl or phenyl groups) were prepared by PCR using the corresponding base-modified 2′-deoxyribonucleoside triphosphates (dNTPs). The modified templates were used in an in vitro transcription assay using RNA polymerase from Bacillus subtilis and Escherichia coli. Some modified nucleobases bearing smaller modifications (H, Me in 7-deazapurines) were perfectly tolerated by both enzymes, whereas bulky modifications (Ph at any nucleobase) and, surprisingly, uracil blocked transcription. Some middle-sized modifications (vinyl or ethynyl) were partly tolerated mostly by the E. coli enzyme. In all cases where the transcription proceeded, full length RNA product with correct sequence was obtained indicating that the modifications of the template are not mutagenic and the inhibition is probably at the stage of initiation. The results are promising for the development of bioorthogonal reactions for artificial chemical switching of the transcription. PMID:27001521
Validation of qPCR Methods for the Detection of Mycobacterium in New World Animal Reservoirs.

PubMed

Housman, Genevieve; Malukiewicz, Joanna; Boere, Vanner; Grativol, Adriana D; Pereira, Luiz Cezar M; Silva, Ita de Oliveira; Ruiz-Miranda, Carlos R; Truman, Richard; Stone, Anne C

2015-11-01

Zoonotic pathogens that cause leprosy (Mycobacterium leprae) and tuberculosis (Mycobacterium tuberculosis complex, MTBC) continue to impact modern human populations. Therefore, methods able to survey mycobacterial infection in potential animal hosts are necessary for proper evaluation of human exposure threats. Here we tested for mycobacterial-specific single- and multi-copy loci using qPCR. In a trial study in which armadillos were artificially infected with M. leprae, these techniques were specific and sensitive to pathogen detection, while more traditional ELISAs were only specific. These assays were then employed in a case study to detect M. leprae as well as MTBC in wild marmosets. All marmosets were negative for M. leprae DNA, but 14 were positive for the mycobacterial rpoB gene assay. Targeted capture and sequencing of rpoB and other MTBC genes validated the presence of mycobacterial DNA in these samples and revealed that qPCR is useful for identifying mycobacterial-infected animal hosts.

DNA capture and next-generation sequencing can recover whole mitochondrial genomes from highly degraded samples for human identification

PubMed Central

2013-01-01

Background: Mitochondrial DNA (mtDNA) typing can be a useful aid for identifying people from compromised samples when nuclear DNA is too damaged, degraded or below detection thresholds for routine short tandem repeat (STR)-based analysis. Standard mtDNA typing, focused on PCR amplicon sequencing of the control region (HVS I and HVS II), is limited by the resolving power of this short sequence, which misses up to 70% of the variation present in the mtDNA genome. Methods: We used in-solution hybridisation-based DNA capture (using DNA capture probes prepared from modern human mtDNA) to recover mtDNA from post-mortem human remains in which the majority of DNA is both highly fragmented (<100 base pairs in length) and chemically damaged. The method ‘immortalises’ the finite quantities of DNA in valuable extracts as DNA libraries, which is followed by the targeted enrichment of endogenous mtDNA sequences and characterisation by next-generation sequencing (NGS). Results: We sequenced whole mitochondrial genomes for human identification from samples where standard nuclear STR typing produced only partial profiles or demonstrably failed and/or where standard mtDNA hypervariable region sequences lacked resolving power. Multiple rounds of enrichment can substantially improve coverage and sequencing depth of mtDNA genomes from highly degraded samples. The application of this method has led to the reliable mitochondrial sequencing of human skeletal remains from unidentified World War Two (WWII) casualties approximately 70 years old and from archaeological remains (up to 2,500 years old). Conclusions: This approach has potential applications in forensic science, historical human identification cases, archived medical samples, kinship analysis and population studies. In particular the methodology can be applied to any case, involving human or non-human species, where whole mitochondrial genome sequences are required to provide the highest level of maternal lineage discrimination.
Multiple rounds of in-solution hybridisation-based DNA capture can retrieve whole mitochondrial genome sequences from even the most challenging samples. PMID:24289217
RDNAnalyzer: A tool for DNA secondary structure prediction and sequence analysis.

PubMed

Afzal, Muhammad; Shahid, Ahmad Ali; Shehzadi, Abida; Nadeem, Shahid; Husnain, Tayyab

2012-01-01

RDNAnalyzer is an innovative computer-based tool designed for DNA secondary structure prediction and sequence analysis. It can randomly generate a DNA sequence, or users can upload sequences of their own interest in RAW format. It uses and extends the Nussinov dynamic programming algorithm and has various applications in sequence analysis. It predicts the DNA secondary structure and base pairings. It also provides tools for sequence analyses routinely performed by biological scientists, such as DNA replication, reverse complement generation, transcription, translation, sequence-specific information (total number of nucleotide bases and A, T, G, C base contents along with their respective percentages) and a sequence cleaner. RDNAnalyzer is a unique tool developed in Microsoft Visual Studio 2008 using Microsoft Visual C# and Windows Presentation Foundation and provides a user-friendly environment for sequence analysis. It is freely available at http://www.cemb.edu.pk/sw.html. Abbreviations: RDNAnalyzer - Random DNA Analyser; GUI - Graphical user interface; XAML - Extensible Application Markup Language.
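For readers unfamiliar with the Nussinov recursion mentioned above, the sketch below shows the textbook base-pair maximisation for a single DNA strand together with a reverse-complement helper. It is written in Python for illustration and is not RDNAnalyzer's C# code or its extended algorithm; the function names, the Watson-Crick-only pairing rule and the default minimum loop length of 3 are assumptions.

```python
# Watson-Crick pairs only (an assumption; no mismatches or wobble pairs).
PAIRS = {("A", "T"), ("T", "A"), ("G", "C"), ("C", "G")}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Complement every base, then reverse the strand."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def nussinov_max_pairs(seq: str, min_loop: int = 3) -> int:
    """Maximum number of nested base pairs (classic Nussinov recursion).

    dp[i][j] holds the best pair count for seq[i..j]; a pair (k, j) is
    allowed only if at least `min_loop` bases lie between k and j.
    """
    seq = seq.upper()
    n = len(seq)
    if n == 0:
        return 0
    dp = [[0] * n for _ in range(n)]
    for span in range(min_loop + 1, n):
        for i in range(n - span):
            j = i + span
            best = dp[i][j - 1]  # case 1: base j stays unpaired
            for k in range(i, j - min_loop):  # case 2: base j pairs with k
                if (seq[k], seq[j]) in PAIRS:
                    left = dp[i][k - 1] if k > i else 0
                    best = max(best, left + 1 + dp[k + 1][j - 1])
            dp[i][j] = best
    return dp[0][n - 1]


print(reverse_complement("ATGC"))
print(nussinov_max_pairs("GGGAAATCCC"))
```

The table fill is cubic in sequence length, which is acceptable for short oligonucleotides; recovering the actual pairings would require a traceback step that is omitted here.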
Direct Detection and Sequencing of Damaged DNA Bases

PubMed Central

2011-01-01

Products of various forms of DNA damage have been implicated in a variety of important biological processes, such as aging, neurodegenerative diseases, and cancer. Therefore, there exists great interest to develop methods for interrogating damaged DNA in the context of sequencing. Here, we demonstrate that single-molecule, real-time (SMRT®) DNA sequencing can directly detect damaged DNA bases in the DNA template - as a by-product of the sequencing method - through an analysis of the DNA polymerase kinetics that are altered by the presence of a modified base. We demonstrate the sequencing of several DNA templates containing products of DNA damage, including 8-oxoguanine, 8-oxoadenine, O6-methylguanine, 1-methyladenine, O4-methylthymine, 5-hydroxycytosine, 5-hydroxyuracil, 5-hydroxymethyluracil, or thymine dimers, and show that these base modifications can be readily detected with single-modification resolution and DNA strand specificity. We characterize the distinct kinetic signatures generated by these DNA base modifications. PMID:22185597
Direct detection and sequencing of damaged DNA bases.

PubMed

Clark, Tyson A; Spittle, Kristi E; Turner, Stephen W; Korlach, Jonas

2011-12-20

Products of various forms of DNA damage have been implicated in a variety of important biological processes, such as aging, neurodegenerative diseases, and cancer. Therefore, there exists great interest to develop methods for interrogating damaged DNA in the context of sequencing. Here, we demonstrate that single-molecule, real-time (SMRT®) DNA sequencing can directly detect damaged DNA bases in the DNA template - as a by-product of the sequencing method - through an analysis of the DNA polymerase kinetics that are altered by the presence of a modified base. We demonstrate the sequencing of several DNA templates containing products of DNA damage, including 8-oxoguanine, 8-oxoadenine, O6-methylguanine, 1-methyladenine, O4-methylthymine, 5-hydroxycytosine, 5-hydroxyuracil, 5-hydroxymethyluracil, or thymine dimers, and show that these base modifications can be readily detected with single-modification resolution and DNA strand specificity. We characterize the distinct kinetic signatures generated by these DNA base modifications.
A comprehensive list of cloned human DNA sequences

PubMed Central

Schmidtke, Jörg; Cooper, David N.

1987-01-01

A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:3575113
A comprehensive list of cloned human DNA sequences

PubMed Central

Schmidtke, Jörg; Cooper, David N.

1990-01-01

A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:2333227
A comprehensive list of cloned human DNA sequences

PubMed Central

Schmidtke, Jörg; Cooper, David N.

1988-01-01

A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:3368330
A comprehensive list of cloned human DNA sequences

PubMed Central

Schmidtke, Jörg; Cooper, David N.

1989-01-01

A list of DNA sequences cloned from the human genome is presented. Intended as a guide to clone availability, this list includes published reports of cDNA, genomic and synthetic clones comprising gene and pseudogene sequences, uncharacterised DNA segments and repetitive DNA elements. PMID:2654889
Kilo-sequencing: an ordered strategy for rapid DNA sequence data acquisition.

PubMed Central

Barnes, W M; Bevan, M

1983-01-01

A strategy for rapid DNA sequence acquisition in an ordered, nonrandom manner, while retaining all of the conveniences of the dideoxy method with M13 transducing phage DNA template, is described. Target DNA 3 to 14 kb in size can be stably carried by our M13 vectors. Suitable targets are stretches of DNA which lack an enzyme recognition site which is unique on our cloning vectors and adjacent to the sequencing primer; current sites that are so useful when lacking are Pst, Xba, HindIII, BglII, EcoRI. By an in vitro procedure, we cut RF DNA once randomly and once specifically, to create thousands of deletions which start at the unique restriction site adjacent to the dideoxy sequencing primer and extend various distances across the target DNA. Phage carrying a desired size of deletions, whose DNA as template will give rise to DNA sequence data in a desired location along the target DNA, may be purified by electrophoresis alive on agarose gels. Phage running in the same location on the agarose gel thus conveniently give rise to nucleotide sequence data from the same kilobase of target DNA. PMID:6298723
Silicene nanoribbon as a new DNA sequencing device

NASA Astrophysics Data System (ADS)

Alesheikh, Sara; Shahtahmassebi, Nasser; Roknabadi, Mahmood Rezaee; Pilevar Shahri, Raheleh

2018-02-01

The importance of DNA sequencing in many fields has driven the search for fast and cheap methods. Nanotechnology contributes to this development by introducing nanostructures that can be used for DNA sequencing. In this work we study the interaction between a zigzag silicene nanoribbon and DNA nucleobases using DFT and the non-equilibrium Green's function approach, to investigate the possibility of using zigzag silicene nanoribbons as biosensors for DNA sequencing.
Isolation and characterization of target sequences of the chicken CdxA homeobox gene.

PubMed Central

Margalit, Y; Yarus, S; Shapira, E; Gruenbaum, Y; Fainsod, A

1993-01-01

The DNA binding specificity of the chicken homeodomain protein CDXA was studied. Using a CDXA-glutathione-S-transferase fusion protein, DNA fragments containing the binding site for this protein were isolated. The sources of DNA were oligonucleotides with random sequence and chicken genomic DNA. The DNA fragments isolated were sequenced and tested in DNA binding assays. Sequencing revealed that most DNA fragments are AT rich, which is a common feature of homeodomain binding sites. By electrophoretic mobility shift assays it was shown that the different target sequences isolated bind to the CDXA protein with different affinities. The specific sequences bound by the CDXA protein in the genomic fragments isolated were determined by DNase I footprinting. From the footprinted sequences, the CDXA consensus binding site was determined. The CDXA protein binds the consensus sequence A, A/T, T, A/T, A, T, A/G. The CAUDAL binding site in the ftz promoter is also included in this consensus sequence. When tested, some of the genomic target sequences were capable of enhancing the transcriptional activity of reporter plasmids when introduced into CDXA expressing cells. This study determined the DNA sequence specificity of the CDXA protein and it also shows that this protein can further activate transcription in cells in culture. PMID:7909943
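The seven-position consensus reported above (A, A/T, T, A/T, A, T, A/G) can be written as the pattern A[AT]T[AT]AT[AG]. The sketch below scans one strand of a sequence for exact matches, including overlapping ones. It is only an illustration: the function name and test sequences are hypothetical, only the given strand is searched, and an exact-match scan says nothing about the differing binding affinities noted in the abstract.

```python
import re

# Consensus A, A/T, T, A/T, A, T, A/G wrapped in a lookahead so that
# overlapping occurrences are all reported.
CDXA_CONSENSUS = re.compile(r"(?=(A[AT]T[AT]AT[AG]))")


def find_consensus_sites(seq: str):
    """Return (0-based start, matched 7-mer) for every consensus match."""
    return [(m.start(), m.group(1))
            for m in CDXA_CONSENSUS.finditer(seq.upper())]


# Hypothetical test sequences.
print(find_consensus_sites("GGATTTATGCC"))
print(find_consensus_sites("AATAATAATA"))
```

Searching the opposite strand would amount to running the same scan on the reverse complement of the input.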
Sequence periodicity in nucleosomal DNA and intrinsic curvature.

PubMed

Nair, T Murlidharan

2010-05-17

Most eukaryotic DNA contained in the nucleus is packaged by wrapping DNA around histone octamers. Histones are ubiquitous and bind most regions of chromosomal DNA. In order to achieve smooth wrapping of the DNA around the histone octamer, the DNA duplex should be able to deform and should possess intrinsic curvature. The deformability of DNA is a result of the non-parallelness of base pair stacks. The stacking interaction between base pairs is sequence dependent. The higher the stacking energy the more rigid the DNA helix, thus it is natural to expect that sequences that are involved in wrapping around the histone octamer should be unstacked and possess intrinsic curvature. Intrinsic curvature has been shown to be dictated by the periodic recurrence of certain dinucleotides. Several genome-wide studies directed towards mapping of nucleosome positions have revealed periodicity associated with certain stretches of sequences. In the current study, these sequences have been analyzed with a view to understand their sequence-dependent structures. Higher order DNA structures and the distribution of molecular bend loci associated with 146 base nucleosome core DNA sequence from C. elegans and chicken have been analyzed using the theoretical model for DNA curvature. The curvature dispersion calculated by cyclically permuting the sequences revealed that the molecular bend loci were delocalized throughout the nucleosome core region and had varying degrees of intrinsic curvature. The higher order structures associated with nucleosomes of C. elegans and chicken calculated from the sequences revealed heterogeneity with respect to the deviation of the DNA axis. The results point to the possibility of context dependent curvature of varying degrees to be associated with nucleosomal DNA.
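One simple way to look for the periodic recurrence of a dinucleotide is to histogram the distances between its occurrences. The sketch below does only that; it is not the curvature model used in the study, and the function name, the choice of "AA" and the synthetic test sequence (built to repeat every 10 bases) are assumptions for illustration.

```python
from collections import Counter


def dinucleotide_distance_histogram(seq: str, dinuc: str = "AA",
                                    max_dist: int = 30) -> Counter:
    """Count distances (in bases) between all pairs of `dinuc` occurrences.

    Only distances up to `max_dist` are tallied; a periodic signal shows
    up as peaks at multiples of the period.
    """
    seq = seq.upper()
    positions = [i for i in range(len(seq) - 1) if seq[i:i + 2] == dinuc]
    histogram = Counter()
    for index, first in enumerate(positions):
        for second in positions[index + 1:]:
            distance = second - first
            if distance > max_dist:
                break  # positions are sorted, so later ones are farther
            histogram[distance] += 1
    return histogram


# Synthetic sequence: "AA" recurs every 10 bases by construction.
toy = ("AA" + "CGCGCGCG") * 3
print(dinucleotide_distance_histogram(toy))
```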
Detection and persistence of environmental DNA from an invasive, terrestrial mammal.

PubMed

Williams, Kelly E; Huyvaert, Kathryn P; Vercauteren, Kurt C; Davis, Amy J; Piaggio, Antoinette J

2018-01-01

Sus scrofa, a species commonly referred to as wild pig or feral swine, is a destructive invasive species with a rapidly expanding distribution across the United States. We used artificial wallows and small waterers to determine the minimum amount of time needed for pig eDNA to accumulate in the water source to a detectable level. We removed water from the artificial wallows and tested eDNA detection over the course of 2 weeks to understand eDNA persistence. We show that our method is sensitive enough to detect very low quantities of eDNA shed by a terrestrial mammal that has limited interaction with water. Our experiments suggest that the number of individuals shedding into a water system can affect persistence of eDNA. Use of an eDNA detection technique can benefit management efforts by providing a sensitive method for finding even small numbers of individuals that may be elusive using other methods.
Assessing the Fidelity of Ancient DNA Sequences Amplified From Nuclear Genes

PubMed Central

Binladen, Jonas; Wiuf, Carsten; Gilbert, M. Thomas P.; Bunce, Michael; Barnett, Ross; Larson, Greger; Greenwood, Alex D.; Haile, James; Ho, Simon Y. W.; Hansen, Anders J.; Willerslev, Eske

2006-01-01

To date, the field of ancient DNA has relied almost exclusively on mitochondrial DNA (mtDNA) sequences. However, a number of recent studies have reported the successful recovery of ancient nuclear DNA (nuDNA) sequences, thereby allowing the characterization of genetic loci directly involved in phenotypic traits of extinct taxa. It is well documented that postmortem damage in ancient mtDNA can lead to the generation of artifactual sequences. However, as yet no one has thoroughly investigated the damage spectrum in ancient nuDNA. By comparing clone sequences from 23 fossil specimens, recovered from environments ranging from permafrost to desert, we demonstrate the presence of miscoding lesion damage in both the mtDNA and nuDNA, resulting in insertion of erroneous bases during amplification. Interestingly, no significant differences in the frequency of miscoding lesion damage are recorded between mtDNA and nuDNA despite great differences in cellular copy numbers. For both mtDNA and nuDNA, we find significant positive correlations between total sequence heterogeneity and the rates of type 1 transitions (adenine → guanine and thymine → cytosine) and type 2 transitions (cytosine → thymine and guanine → adenine), respectively. Type 2 transitions are by far the most dominant and increase relative to those of type 1 with damage load. The results suggest that the deamination of cytosine (and 5-methyl cytosine) to uracil (and thymine) is the main cause of miscoding lesions in both ancient mtDNA and nuDNA sequences. We argue that the problems presented by postmortem damage, as well as problems with contamination from exogenous sources of conserved nuclear genes, allelic variation, and the reliance on single nucleotide polymorphisms, call for great caution in studies relying on ancient nuDNA sequences. PMID:16299392
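The type 1 and type 2 transition classes defined above can be tallied by comparing each clone sequence with a reference, column by column. The sketch below assumes the two sequences are already aligned and that the reference represents the undamaged state; the function name, the skipping of gaps and N bases, and the toy sequences are assumptions, and this is not the statistical procedure used in the study.

```python
from collections import Counter

# Substitution classes as defined in the abstract (reference -> clone).
TYPE1 = {("A", "G"), ("T", "C")}
TYPE2 = {("C", "T"), ("G", "A")}


def classify_substitutions(reference: str, clone: str) -> Counter:
    """Count type 1 transitions, type 2 transitions and other changes."""
    if len(reference) != len(clone):
        raise ValueError("sequences must be aligned to the same length")
    counts = Counter()
    for ref_base, clone_base in zip(reference.upper(), clone.upper()):
        if ref_base == clone_base:
            continue
        if ref_base in "-N" or clone_base in "-N":
            continue  # ignore gaps and ambiguous calls
        if (ref_base, clone_base) in TYPE1:
            counts["type1"] += 1
        elif (ref_base, clone_base) in TYPE2:
            counts["type2"] += 1
        else:
            counts["other"] += 1  # transversions
    return counts


# Hypothetical reference and clone read.
print(classify_substitutions("ACGTACGT", "ATGTGCAA"))
```

Because the direction of change is inferred from the reference, the counts are only as reliable as the choice of reference sequence.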
Small RNAs: artificial piRNAs for transcriptional silencing.

PubMed

Hirano, Takamasa; Siomi, Haruhiko

2015-03-30

Technologies have been developed in animal germ cells that produce artificial piRNAs from transgenes in piRNA clusters to silence target genes by cleaving their transcripts. A new study provides a simple way to generate artificial piRNAs to direct de novo DNA methylation in mice. Copyright © 2015 Elsevier Ltd. All rights reserved.
[Current applications of high-throughput DNA sequencing technology in antibody drug research].

PubMed

Yu, Xin; Liu, Qi-Gang; Wang, Ming-Rong

2012-03-01

Since a high-throughput DNA sequencing technology based on PCR carried out in oil emulsions was published in 2005, high-throughput DNA sequencing platforms have evolved into a robust technology for sequencing genomes and diverse DNA libraries. Antibody libraries with vast numbers of members currently serve as a foundation for discovering novel antibody drugs, and high-throughput DNA sequencing technology makes it possible to rapidly identify functional antibody variants with desired properties. Herein we review current applications of high-throughput DNA sequencing technology in the analysis of antibody library diversity, sequencing of CDR3 regions, identification of potent antibodies based on sequence frequency, discovery of functional genes, and combination with various display technologies, so as to provide an alternative approach to the discovery and development of antibody drugs.
Artificial Informational Polymers and Nanomaterials from Ring-Opening Metathesis Polymerization

NASA Astrophysics Data System (ADS)

James, Carrie Rae

Inspired by naturally occurring polymers (DNA, polypeptides, polysaccharides, etc.) that can self-assemble on the nanoscale into complex, information-rich architectures, we have synthesized nucleic acid based polymers using ROMP. These polymers were synthesized using a graft-through strategy, whereby nucleic acids bearing a strained cyclic olefin were directly polymerized. This is the first example of the graft-through polymerization of nucleic acids. Our approach takes advantage of non-charged peptide nucleic acids (PNAs) as elements to incorporate into ROMP polymer backbones. PNA is a synthetic nucleic acid analogue known for its increased affinity and specificity for complementary DNA or RNA. To accomplish the graft-through polymerization of PNA, we conjugated PNA to strained cyclic olefins using solid phase peptide conjugation chemistry. These PNA monomers were then directly polymerized into homo and block copolymers forming brushes, or comb-like arrangements, of information. Block copolymer amphiphiles of these materials, where the PNA brush served as the hydrophilic portion, were capable of self-assembly into spherical nanoparticles (PNA NPs). These PNA NPs were then studied with respect to their ability to hybridize complementary DNA sequences, as well as their ability to undergo cellular internalization. PNA NPs consisting of densely packed brushes of nucleic acids possessed increased thermal stability when mixed with their complementary DNA sequence, indicating a greater DNA binding affinity over their unpolymerized PNA counterparts. In addition, by arranging the PNA into dense brushes at the surface of the nanoparticle, Cy5.5 labeled PNA NPs were able to undergo cellular internalization into HeLa cells without the need for an additional cellular delivery device. Importantly, cellular internalization of PNA has remained a significant challenge in the literature due to the neutrally charged amino-ethyl glycine backbone of PNA. 
Therefore, this represents a novel way of facilitating cellular uptake of PNA. This materials strategy represents the first direct polymerization of nucleic acids, and presents a novel method for arranging biological information on the nanoscale at high density in order to confer novel attributes.
DNA fingerprinting, DNA barcoding, and next generation sequencing technology in plants.

PubMed

Sucher, Nikolaus J; Hennell, James R; Carles, Maria C

2012-01-01

DNA fingerprinting of plants has become an invaluable tool in forensic, scientific, and industrial laboratories all over the world. PCR has become part of virtually every variation of the plethora of approaches used for DNA fingerprinting today. DNA sequencing is increasingly used either in combination with or as a replacement for traditional DNA fingerprinting techniques. A prime example is the use of short, standardized regions of the genome as taxon barcodes for biological identification of plants. Rapid advances in "next generation sequencing" (NGS) technology are driving down the cost of sequencing and bringing large-scale sequencing projects into the reach of individual investigators. We present an overview of recent publications that demonstrate the use of "NGS" technology for DNA fingerprinting and DNA barcoding applications.
Mammalian DNA enriched for replication origins is enriched for snap-back sequences.

PubMed

Zannis-Hadjopoulos, M; Kaufmann, G; Martin, R G

1984-11-15

Using the instability of replication loops as a method for the isolation of double-stranded nascent DNA, extruded DNA enriched for replication origins was obtained and denatured. Snap-back DNA, single-stranded DNA with inverted repeats (palindromic sequences), reassociates rapidly into stem-loop structures with zero-order kinetics when conditions are changed from denaturing to renaturing, and can be assayed by chromatography on hydroxyapatite. Origin-enriched nascent DNA strands from mouse, rat and monkey cells growing either synchronously or asynchronously were purified and assayed for the presence of snap-back sequences. The results show that origin-enriched DNA is also enriched for snap-back sequences, implying that some origins for mammalian DNA replication contain or lie near palindromic sequences.
Artificial and Solar UV Radiation Induces Strand Breaks and Cyclobutane Pyrimidine Dimers in Bacillus subtilis Spore DNA

PubMed Central

Slieman, Tony A.; Nicholson, Wayne L.

2000-01-01

The loss of stratospheric ozone and the accompanying increase in solar UV flux have led to concerns regarding decreases in global microbial productivity. Central to understanding this process is determining the types and amounts of DNA damage in microbes caused by solar UV irradiation. While UV irradiation of dormant Bacillus subtilis endospores results mainly in formation of the “spore photoproduct” 5-thyminyl-5,6-dihydrothymine, genetic evidence indicates that an additional DNA photoproduct(s) may be formed in spores exposed to solar UV-B and UV-A radiation (Y. Xue and W. L. Nicholson, Appl. Environ. Microbiol. 62:2221–2227, 1996). We examined the occurrence of double-strand breaks, single-strand breaks, cyclobutane pyrimidine dimers, and apurinic-apyrimidinic sites in spore DNA under several UV irradiation conditions by using enzymatic probes and neutral or alkaline agarose gel electrophoresis. DNA from spores irradiated with artificial 254-nm UV-C radiation accumulated single-strand breaks, double-strand breaks, and cyclobutane pyrimidine dimers, while DNA from spores exposed to artificial UV-B radiation (wavelengths, 290 to 310 nm) accumulated only cyclobutane pyrimidine dimers. DNA from spores exposed to full-spectrum sunlight (UV-B and UV-A radiation) accumulated single-strand breaks, double-strand breaks, and cyclobutane pyrimidine dimers, whereas DNA from spores exposed to sunlight from which the UV-B component had been removed with a filter (“UV-A sunlight”) accumulated only single-strand breaks and double-strand breaks. Apurinic-apyrimidinic sites were not detected in spore DNA under any of the irradiation conditions used. Our data indicate that there is a complex spectrum of UV photoproducts in DNA of bacterial spores exposed to solar UV irradiation in the environment. PMID:10618224

DNA-Mediated Self-Organization of Polymeric Nanocompartments Leads to Interconnected Artificial Organelles.

PubMed

Liu, Juan; Postupalenko, Viktoriia; Lörcher, Samuel; Wu, Dalin; Chami, Mohamed; Meier, Wolfgang; Palivan, Cornelia G

2016-11-09

Self-organization of nanocomponents was mainly focused on solid nanoparticles, quantum dots, or liposomes to generate complex architectures with specific properties, but intrinsically limited or not developed enough, to mimic sophisticated structures with biological functions in cells. Here, we present a biomimetic strategy to self-organize synthetic nanocompartments (polymersomes) into clusters with controlled properties and topology by exploiting DNA hybridization to interconnect polymersomes. Molecular and external factors affecting the self-organization served to design clusters mimicking the connection of natural organelles: fine-tune of the distance between tethered polymersomes, different topologies, no fusion of clustered polymersomes, and no aggregation. Unexpected, extended DNA bridges that result from migration of the DNA strands inside the thick polymer membrane (about 12 nm) represent a key stability and control factor, not yet exploited for other synthetic nano-object networks. The replacement of the empty polymersomes with artificial organelles, already reported for single polymersome architecture, will provide an excellent platform for the development of artificial systems mimicking natural organelles or cells and represents a fundamental step in the engineering of molecular factories.
DNA sequence determinants controlling affinity, stability and shape of DNA complexes bound by the nucleoid protein Fis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hancock, Stephen P.; Stella, Stefano; Cascio, Duilio

The abundant Fis nucleoid protein selectively binds poorly related DNA sequences with high affinities to regulate diverse DNA reactions. Fis binds DNA primarily through DNA backbone contacts and selects target sites by reading conformational properties of DNA sequences, most prominently intrinsic minor groove widths. High-affinity binding requires Fis-stabilized DNA conformational changes that vary depending on DNA sequence. In order to better understand the molecular basis for high-affinity site recognition, we analyzed the effects of DNA sequence within and flanking the core Fis binding site on binding affinity and DNA structure. X-ray crystal structures of Fis-DNA complexes containing variable sequences in the noncontacted center of the binding site or variations within the major groove interfaces show that the DNA can adapt to the Fis dimer surface asymmetrically. We show that the presence and position of pyrimidine-purine base steps within the major groove interfaces affect both local DNA bending and minor groove compression to modulate affinities and lifetimes of Fis-DNA complexes. Sequences flanking the core binding site also modulate complex affinities, lifetimes, and the degree of local and global Fis-induced DNA bending. In particular, a G immediately upstream of the 15 bp core sequence inhibits binding and bending, and A-tracts within the flanking base pairs increase both complex lifetimes and global DNA curvatures. Taken together, our observations support a revised DNA motif specifying high-affinity Fis binding and highlight the range of conformations that Fis-bound DNA can adopt. Lastly, the affinities and DNA conformations of individual Fis-DNA complexes are likely to be tailored to their context-specific biological functions.
DNA sequence determinants controlling affinity, stability and shape of DNA complexes bound by the nucleoid protein Fis

DOE PAGES

Hancock, Stephen P.; Stella, Stefano; Cascio, Duilio; ...

2016-03-09

The abundant Fis nucleoid protein selectively binds poorly related DNA sequences with high affinities to regulate diverse DNA reactions. Fis binds DNA primarily through DNA backbone contacts and selects target sites by reading conformational properties of DNA sequences, most prominently intrinsic minor groove widths. High-affinity binding requires Fis-stabilized DNA conformational changes that vary depending on DNA sequence. In order to better understand the molecular basis for high-affinity site recognition, we analyzed the effects of DNA sequence within and flanking the core Fis binding site on binding affinity and DNA structure. X-ray crystal structures of Fis-DNA complexes containing variable sequences in the noncontacted center of the binding site or variations within the major groove interfaces show that the DNA can adapt to the Fis dimer surface asymmetrically. We show that the presence and position of pyrimidine-purine base steps within the major groove interfaces affect both local DNA bending and minor groove compression to modulate affinities and lifetimes of Fis-DNA complexes. Sequences flanking the core binding site also modulate complex affinities, lifetimes, and the degree of local and global Fis-induced DNA bending. In particular, a G immediately upstream of the 15 bp core sequence inhibits binding and bending, and A-tracts within the flanking base pairs increase both complex lifetimes and global DNA curvatures. Taken together, our observations support a revised DNA motif specifying high-affinity Fis binding and highlight the range of conformations that Fis-bound DNA can adopt. Lastly, the affinities and DNA conformations of individual Fis-DNA complexes are likely to be tailored to their context-specific biological functions.
Specific minor groove solvation is a crucial determinant of DNA binding site recognition

PubMed Central

Harris, Lydia-Ann; Williams, Loren Dean; Koudelka, Gerald B.

2014-01-01

The DNA sequence preferences of nearly all sequence specific DNA binding proteins are influenced by the identities of bases that are not directly contacted by protein. Discrimination between non-contacted base sequences is commonly based on the differential abilities of DNA sequences to allow narrowing of the DNA minor groove. However, the factors that govern the propensity of minor groove narrowing are not completely understood. Here we show that the differential abilities of various DNA sequences to support formation of a highly ordered and stable minor groove solvation network are a key determinant of non-contacted base recognition by a sequence-specific binding protein. In addition, disrupting the solvent network in the non-contacted region of the binding site alters the protein's ability to recognize contacted base sequences at positions 5–6 bases away. This observation suggests that DNA solvent interactions link contacted and non-contacted base recognition by the protein. PMID:25429976
A Method for Preparing DNA Sequencing Templates Using a DNA-Binding Microplate

PubMed Central

Yang, Yu; Hebron, Haroun R.; Hang, Jun

2009-01-01

A DNA-binding matrix was immobilized on the surface of a 96-well microplate and used for plasmid DNA preparation for DNA sequencing. The same DNA-binding plate was used for bacterial growth, cell lysis, DNA purification, and storage. In a single step using one buffer, bacterial cells were lysed by enzymes, and released DNA was captured on the plate simultaneously. After two wash steps, DNA was eluted and stored in the same plate. Inclusion of phosphates in the culture medium was found to enhance the yield of plasmid significantly. Purified DNA samples were used successfully in DNA sequencing with high consistency and reproducibility. Eleven vectors and nine libraries were tested using this method. In 10 μl sequencing reactions using 3 μl sample and 0.25 μl BigDye Terminator v3.1, the results from a 3730xl sequencer gave a success rate of 90–95% and read-lengths of 700 bases or more. The method is fully automatable and convenient for manual operation as well. It enables reproducible, high-throughput, rapid production of DNA with purity and yields sufficient for high-quality DNA sequencing at a substantially reduced cost. PMID:19568455
Dendritic Cell-Based Immunotherapy of Breast Cancer: Modulation by CpG DNA

DTIC Science & Technology

2005-09-01

tumor-associated antigens and bacterial DNA oligodeoxynucleotides containing unmethylated CpG sequences (CpG DNA) further augment the immune priming...associated antigens by cytotoxic T lymphocytes, and bacterial DNA oligodeoxynucleotides containing unmethylated CpG sequences (CpG DNA) can further...further amplify their immunostimulatory capacity and bacterial DNA oligodeoxynucleotides (ODN) containing unmethylated CpG sequences (CpG DNA) provide such
A rapid and cost-effective method for sequencing pooled cDNA clones by using a combination of transposon insertion and Gateway technology.

PubMed

Morozumi, Takeya; Toki, Daisuke; Eguchi-Ogawa, Tomoko; Uenishi, Hirohide

2011-09-01

Large-scale cDNA-sequencing projects require an efficient strategy for mass sequencing. Here we describe a method for sequencing pooled cDNA clones using a combination of transposon insertion and Gateway technology. Our method reduces the number of shotgun clones that are unsuitable for reconstruction of cDNA sequences, and has the advantage of reducing the total costs of the sequencing project.
Biological sequence compression algorithms.

PubMed

Matsumoto, T; Sadakane, K; Imai, H

2000-01-01

Today, more and more DNA sequences are becoming available. Information about DNA sequences is stored in molecular biology databases. These databases will keep growing in size and importance, so this information must be stored and communicated efficiently. Furthermore, sequence compression can be used to define similarities between biological sequences. Standard compression algorithms such as gzip or compress cannot compress DNA sequences; they only expand them in size. On the other hand, CTW (Context Tree Weighting Method) can compress DNA sequences to less than two bits per symbol. These algorithms do not use the special structures of biological sequences. Two characteristic structures of DNA sequences are known: palindromes (reverse complements) and approximate repeats. Several DNA-specific algorithms that use these structures can compress sequences to less than two bits per symbol. In this paper, we improve CTW so that it can exploit the characteristic structures of DNA sequences. Before encoding the next symbol, the algorithm searches for an approximate repeat and a palindrome using hashing and dynamic programming. If there is a palindrome or an approximate repeat of sufficient length, our algorithm represents it by its length and distance. With this preprocessing, the new program achieves a slightly higher compression ratio than existing DNA-oriented compression algorithms. We also describe a new compression algorithm for protein sequences.
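Two ideas in the abstract above lend themselves to a small illustration: the two-bits-per-symbol baseline that any DNA compressor has to beat, and the hash-based search for an earlier repeat or reverse complement ("palindrome") before the next symbol is encoded. The sketch below is not the authors' CTW-based algorithm. It only handles exact matches, skips the dynamic-programming step for approximate repeats, and all function names are hypothetical.

```python
# Illustrative sketch only, not the published algorithm.
CODE = {"A": 0, "C": 1, "G": 2, "T": 3}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def pack_2bit(seq: str) -> bytes:
    """Pack A/C/G/T into 2 bits each (4 bases per byte), the naive baseline."""
    out = bytearray()
    for i in range(0, len(seq), 4):
        chunk = seq[i:i + 4]
        byte = 0
        for base in chunk:
            byte = (byte << 2) | CODE[base]
        byte <<= 2 * (4 - len(chunk))  # left-align a short final chunk
        out.append(byte)
    return bytes(out)


def find_earlier_match(seq: str, pos: int, k: int):
    """Look for the k-mer starting at pos among k-mers that end at or before
    pos. Returns ('repeat', start), ('revcomp', start) or None."""
    kmer = seq[pos:pos + k]
    if len(kmer) < k:
        return None
    index = {}
    for i in range(0, pos - k + 1):
        index.setdefault(seq[i:i + k], i)  # keep the earliest occurrence
    if kmer in index:
        return ("repeat", index[kmer])
    rc = kmer.translate(COMPLEMENT)[::-1]
    if rc in index:
        return ("revcomp", index[rc])
    return None


if __name__ == "__main__":
    print(len(pack_2bit("ACGT" * 4)), "bytes for 16 bases")
    print(find_earlier_match("GATTACACCTGTAATC", 9, 7))
```

A match found this way could be stored as a (length, distance) pair instead of literal symbols, which is the substitution the abstract describes. Rebuilding the index on every call keeps the sketch short; a real encoder would maintain it incrementally.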
Detection of DNA Methylation by Whole-Genome Bisulfite Sequencing.

PubMed

Li, Qing; Hermanson, Peter J; Springer, Nathan M

2018-01-01

DNA methylation plays an important role in the regulation of the expression of transposons and genes. Various methods have been developed to assay DNA methylation levels. Bisulfite sequencing is considered to be the "gold standard" for single-base resolution measurement of DNA methylation levels. Coupled with next-generation sequencing, whole-genome bisulfite sequencing (WGBS) allows DNA methylation to be evaluated at a genome-wide scale. Here, we describe a protocol for WGBS in plant species with large genomes. This protocol has been successfully applied to assay genome-wide DNA methylation levels in maize and barley. It has also been successfully coupled with sequence capture technology to assay DNA methylation levels in a targeted set of genomic regions.
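As background to the single-base resolution mentioned above: bisulfite treatment converts unmethylated cytosines so that they are read as T, while methylated cytosines are still read as C. The methylation level at a reference cytosine is therefore commonly estimated as C reads divided by C plus T reads. The sketch below shows that arithmetic under simplifying assumptions (forward-strand reads, ungapped alignments given as start offsets). The function name and input layout are hypothetical and are not part of the protocol described in the abstract.

```python
# Illustrative sketch only: per-cytosine methylation level from bisulfite reads.
def methylation_levels(reference: str, reads):
    """reads is a list of (start, read_sequence) pairs aligned to the forward
    strand of reference without gaps. Returns {position: C / (C + T)} for
    every reference cytosine covered by at least one informative read."""
    counts = {i: [0, 0] for i, base in enumerate(reference) if base == "C"}
    for start, read in reads:
        for offset, base in enumerate(read):
            pos = start + offset
            if pos in counts:
                if base == "C":      # protected from conversion: methylated
                    counts[pos][0] += 1
                elif base == "T":    # converted: unmethylated
                    counts[pos][1] += 1
    return {pos: c / (c + t) for pos, (c, t) in counts.items() if c + t > 0}


if __name__ == "__main__":
    reference = "ACGTCA"  # hypothetical reference with cytosines at 1 and 4
    reads = [(0, "ACGTTA"), (0, "ACGTTA"), (0, "ATGTCA"), (1, "CGTT")]
    print(methylation_levels(reference, reads))
```

Real pipelines also handle reverse-strand reads (G to A conversion), incomplete conversion and sequence context (CG, CHG, CHH), all of which are left out here.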
Single-Molecule Electrical Random Resequencing of DNA and RNA

NASA Astrophysics Data System (ADS)

Ohshiro, Takahito; Matsubara, Kazuki; Tsutsui, Makusu; Furuhashi, Masayuki; Taniguchi, Masateru; Kawai, Tomoji

2012-07-01

Two paradigm shifts in DNA sequencing technologies--from bulk to single molecules and from optical to electrical detection--are expected to realize label-free, low-cost DNA sequencing that does not require PCR amplification. It will lead to development of high-throughput third-generation sequencing technologies for personalized medicine. Although nanopore devices have been proposed as third-generation DNA-sequencing devices, a significant milestone in these technologies has been attained by demonstrating a novel technique for resequencing DNA using electrical signals. Here we report single-molecule electrical resequencing of DNA and RNA using a hybrid method of identifying single-base molecules via tunneling currents and random sequencing. Our method reads sequences of nine types of DNA oligomers. The complete sequence of 5'-UGAGGUA-3' from the let-7 microRNA family was also identified by creating a composite of overlapping fragment sequences, which was randomly determined using tunneling current conducted by single-base molecules as they passed between a pair of nanoelectrodes.
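The "composite of overlapping fragment sequences" mentioned above can be pictured as a small assembly problem: short, randomly positioned reads are merged wherever the end of one matches the start of another. The sketch below uses a simple greedy merge. The fragment set is invented for illustration and does not come from the measurements, and the published method may well build its composite differently.

```python
# Illustrative sketch only: greedy assembly of overlapping fragments.
def overlap(a: str, b: str) -> int:
    """Length of the longest suffix of a that is also a prefix of b."""
    for n in range(min(len(a), len(b)), 0, -1):
        if a.endswith(b[:n]):
            return n
    return 0


def assemble(fragments):
    """Repeatedly merge the pair of fragments with the largest overlap.
    Returns the list of contigs left when no overlaps remain."""
    frags = sorted(set(fragments))  # sorted for a deterministic result
    # Drop fragments that are fully contained in another fragment.
    frags = [f for f in frags if not any(f != g and f in g for g in frags)]
    while len(frags) > 1:
        best_len, best_i, best_j = 0, None, None
        for i, a in enumerate(frags):
            for j, b in enumerate(frags):
                if i != j:
                    n = overlap(a, b)
                    if n > best_len:
                        best_len, best_i, best_j = n, i, j
        if best_len == 0:
            break  # nothing left to merge
        merged = frags[best_i] + frags[best_j][best_len:]
        frags = [f for k, f in enumerate(frags) if k not in (best_i, best_j)]
        frags.append(merged)
    return frags


if __name__ == "__main__":
    # Hypothetical 4-base reads covering the let-7 seed sequence.
    print(assemble(["UGAG", "AGGU", "GGUA", "GAGG"]))
```

Greedy merging is adequate for a 7-base target, but it can misassemble repetitive sequences, so it should be read as a toy model of the idea rather than a resequencing pipeline.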
Infectivity of porcine circovirus type 2 DNA in semen from experimentally-infected boars

PubMed Central

Madson, Darin M.; Ramamoorthy, Sheela; Kuster, Chris; Pal, Narinder; Meng, Xiang-Jin; Halbur, Patrick G.; Opriessnig, Tanja

2009-01-01

Porcine circovirus type 2 (PCV2) is an economically important pathogen. It has been demonstrated that PCV2 DNA can be detected in boar semen by PCR; however, the biological relevance of this is unknown. The objectives of this study were to determine if semen positive for PCV2 DNA is infectious (1) in a swine bioassay, or (2) when used for artificial insemination. For the first objective, 4-week-old pigs were inoculated intraperitoneally with PCV2 DNA-negative (bioassay-control; n = 3), PCV2a DNA-positive (bioassay-PCV2a; n = 3), or PCV2b DNA-positive (bioassay-PCV2b; n = 3) raw semen, or PCV2 live virus (bioassay-positive; n = 3), respectively. Pigs inoculated with PCV2 DNA-positive semen and PCV2 live virus became viremic and developed anti-PCV2 antibodies indicating that the PCV2 DNA present in semen was infectious. For the second objective, three Landrace gilts were inseminated with PCV2 DNA-negative semen (gilts-controls) from experimentally-infected boars, and six gilts were artificially inseminated with semen positive for PCV2a DNA (gilts-PCV2a; n = 3) or PCV2b DNA (gilts-PCV2b; n = 3). Serum samples collected from the gilts in all groups remained negative for anti-PCV2 antibodies for the duration of the experiment. In addition, fetal serum samples from all 105-day-gestation fetuses were negative for anti-PCV2 antibodies or PCV2 DNA. Under the conditions of this study, PCV2 DNA-positive semen was not infectious when used to artificially inseminate gilts; however, it was demonstrated to be infectious in a swine bioassay model and therefore is a potential means of PCV2 transmission amongst swine herds. PMID:18973743
DNA/RNA hybrid substrates modulate the catalytic activity of purified AID.

PubMed

Abdouni, Hala S; King, Justin J; Ghorbani, Atefeh; Fifield, Heather; Berghuis, Lesley; Larijani, Mani

2018-01-01

Activation-induced cytidine deaminase (AID) converts cytidine to uridine at Immunoglobulin (Ig) loci, initiating somatic hypermutation and class switching of antibodies. In vitro, AID acts on single stranded DNA (ssDNA), but neither double-stranded DNA (dsDNA) oligonucleotides nor RNA, and it is believed that transcription is the in vivo generator of ssDNA targeted by AID. It is also known that the Ig loci, particularly the switch (S) regions targeted by AID are rich in transcription-generated DNA/RNA hybrids. Here, we examined the binding and catalytic behavior of purified AID on DNA/RNA hybrid substrates bearing either random sequences or GC-rich sequences simulating Ig S regions. If substrates were made up of a random sequence, AID preferred substrates composed entirely of DNA over DNA/RNA hybrids. In contrast, if substrates were composed of S region sequences, AID preferred to mutate DNA/RNA hybrids over substrates composed entirely of DNA. Accordingly, AID exhibited a significantly higher affinity for binding DNA/RNA hybrid substrates composed specifically of S region sequences, than any other substrates composed of DNA. Thus, in the absence of any other cellular processes or factors, AID itself favors binding and mutating DNA/RNA hybrids composed of S region sequences. AID:DNA/RNA complex formation and supporting mutational analyses suggest that recognition of DNA/RNA hybrids is an inherent structural property of AID.
Characterization of the repetitive DNA elements in the genome of fish lymphocystis disease viruses.

PubMed

Schnitzler, P; Darai, G

1989-09-01

The complete DNA nucleotide sequence of the repetitive DNA elements in the genome of fish lymphocystis disease virus (FLDV) isolated from two different species (flounder and dab) was determined. The size of these repetitive DNA elements was found to be 1413 bp which corresponds to the DNA sequences of the 5' terminus of the EcoRI DNA fragment B (0.034 to 0.052 m.u.) and to the EcoRI DNA fragment M (0.718 to 0.736 m.u.) of the FLDV genome causing lymphocystis disease in flounder and plaice. The degree of DNA nucleotide homology between both regions was found to be 99%. The repetitive DNA element in the genome of FLDV isolated from other fish species (dab) was identified and is located within the EcoRI DNA fragment B and J of the viral genome. The DNA nucleotide sequence of one duplicate of this repetition (EcoRI DNA fragment J) was determined (1410 bp) and compared to the DNA nucleotide sequences of the repetitive DNA elements of the genome of FLDV isolated from flounder. It was found that the repetitive DNA elements of the genome of FLDV derived from two different fish species are highly conserved and possess a degree of DNA sequence homology of 94%. The DNA sequences of each strand of the individual repetitive element possess one open reading frame.
[Whole Genome Sequencing of Human mtDNA Based on Ion Torrent PGM™ Platform].

PubMed

Cao, Y; Zou, K N; Huang, J P; Ma, K; Ping, Y

2017-08-01

To analyze and detect the whole genome sequence of human mitochondrial DNA (mtDNA) using the Ion Torrent PGM™ platform and to study the differences in mtDNA sequence among different tissues. Samples were collected from 6 unrelated individuals by forensic postmortem examination, including chest blood, hair, costal cartilage, nail, skeletal muscle and oral epithelium. The whole genome sequence of mtDNA was amplified with 4 pairs of primers. Libraries were constructed with Ion Shear™ Plus Reagents kit and Ion Plus Fragment Library kit. Whole genome sequencing of mtDNA was performed using the Ion Torrent PGM™ platform. Sanger sequencing was used to determine the heteroplasmy positions and the mutation positions in the HVⅠ region. The whole genome sequences of mtDNA from all samples were amplified successfully. The six unrelated individuals belonged to 6 different haplotypes. Different tissues from one individual showed differences in heteroplasmy. The heteroplasmy positions and the mutation positions in the HVⅠ region were verified by Sanger sequencing. A consistency check by the Kappa method showed that the mtDNA sequencing results were highly consistent across different tissues. The testing method used in the present study for sequencing the whole genome of human mtDNA can detect heteroplasmy differences between tissues, and its results show good consistency. The results provide guidance for further applications of mtDNA in forensic science.
CRISPR Display: A modular method for locus-specific targeting of long noncoding RNAs and synthetic RNA devices in vivo

PubMed Central

Shechner, David M.; Hacisüleyman, Ezgi; Younger, Scott T.; Rinn, John L.

2016-01-01

Noncoding RNAs (ncRNAs) comprise an important class of regulatory molecules that mediate a vast array of biological processes. This broad functional capacity has also facilitated the design of artificial ncRNAs with novel functions. To further investigate and harness these capabilities, we developed CRISPR-Display (“CRISP-Disp”), a targeted localization method that uses Sp. Cas9 to deploy large RNA cargos to DNA loci. We demonstrate that exogenous RNA domains can be functionally appended onto the CRISPR scaffold at multiple insertion points, allowing the construction of Cas9 complexes with protein-binding cassettes, artificial aptamers, pools of random sequences, and RNAs up to 4.8 kilobases in length, including natural lncRNAs. Unlike most existing CRISPR methods, CRISP-Disp allows simultaneous multiplexing of distinct functions at multiple targets, limited only by the number of available functional RNA motifs. We anticipate that this technology will provide a powerful method with which to ectopically localize functional RNAs and ribonucleoprotein (RNP) complexes at specified genomic loci. PMID:26030444
Overexpression of host plant urease in transgenic silkworms.

PubMed

Jiang, Liang; Huang, Chunlin; Sun, Qiang; Guo, Huizhen; Peng, Zhengwen; Dang, Yinghui; Liu, Weiqiang; Xing, Dongxu; Xu, Guowen; Zhao, Ping; Xia, Qingyou

2015-06-01

Bombyx mori and mulberry constitute a model of insect-host plant interactions. Urease hydrolyzes urea to ammonia and is important for the nitrogen metabolism of silkworms because ammonia is assimilated into silk protein. Silkworms do not synthesize urease and acquire it from mulberry leaves. We synthesized the artificial DNA sequence ure-as using the codon bias of B. mori to encode the signal peptide and mulberry urease protein. A transgenic vector that overexpresses ure-as under control of the silkworm midgut-specific P2 promoter was constructed. Transgenic silkworms were created via embryo microinjection. RT-PCR results showed that urease was expressed during the larval stage and qPCR revealed the expression only in the midgut of transgenic lines. Urea concentration in the midgut and hemolymph of transgenic silkworms was significantly lower than in a nontransgenic line when silkworms were fed an artificial diet. Analysis of the daily body weight and food conversion efficiency of the fourth and fifth instar larvae and economic characteristics indicated no differences between transgenic silkworms and the nontransgenic line. These results suggested that overexpression of host plant urease promoted nitrogen metabolism in silkworms.
Environmental metabarcodes for insects: in silico PCR reveals potential for taxonomic bias.

PubMed

Clarke, Laurence J; Soubrier, Julien; Weyrich, Laura S; Cooper, Alan

2014-11-01

Studies of insect assemblages are suited to the simultaneous DNA-based identification of multiple taxa known as metabarcoding. To obtain accurate estimates of diversity, metabarcoding markers ideally possess appropriate taxonomic coverage to avoid PCR-amplification bias, as well as sufficient sequence divergence to resolve species. We used in silico PCR to compare the taxonomic coverage and resolution of newly designed insect metabarcodes (targeting 16S) with that of existing markers [16S and cytochrome oxidase c subunit I (COI)] and then compared their efficiency in vitro. Existing metabarcoding primers amplified in silico <75% of insect species with complete mitochondrial genomes available, whereas new primers targeting 16S provided >90% coverage. Furthermore, metabarcodes targeting COI appeared to introduce taxonomic PCR-amplification bias, typically amplifying a greater percentage of Lepidoptera and Diptera species, while failing to amplify certain orders in silico. To test whether bias predicted in silico was observed in vitro, we created an artificial DNA blend containing equal amounts of DNA from 14 species, representing 11 insect orders and one arachnid. We PCR-amplified the blend using five primer sets, targeting either COI or 16S, with high-throughput amplicon sequencing yielding more than 6 million reads. In vitro results typically corresponded to in silico PCR predictions, with newly designed 16S primers detecting 11 insect taxa present, thus providing equivalent or better taxonomic coverage than COI metabarcodes. Our results demonstrate that in silico PCR is a useful tool for predicting taxonomic bias in mixed template PCR and that researchers should be wary of potential bias when selecting metabarcoding markers.
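In silico PCR, as used in the abstract above, amounts to asking whether a primer pair would find binding sites on a template within a tolerated number of mismatches, and what it would amplify if so. The sketch below shows that core check. The function names, the mismatch threshold and the toy sequences are assumptions made for illustration; the study's actual software, primers and settings are not given in the abstract.

```python
# Illustrative sketch only: a toy in silico PCR on a single-stranded template.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq: str) -> str:
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def mismatches(a: str, b: str) -> int:
    """Number of differing positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def find_sites(template: str, primer: str, max_mm: int):
    """Start positions where primer matches template with <= max_mm mismatches."""
    n = len(primer)
    return [i for i in range(len(template) - n + 1)
            if mismatches(template[i:i + n], primer) <= max_mm]


def in_silico_pcr(template: str, fwd: str, rev: str,
                  max_mm: int = 1, max_len: int = 500):
    """Return predicted amplicons (primer sites included). The reverse primer
    is given 5'->3' as ordered, so its binding site on the template strand is
    its reverse complement."""
    rev_site = revcomp(rev)
    amplicons = []
    for start in find_sites(template, fwd, max_mm):
        for site in find_sites(template, rev_site, max_mm):
            end = site + len(rev)
            if site >= start + len(fwd) and end - start <= max_len:
                amplicons.append(template[start:end])
    return amplicons


if __name__ == "__main__":
    template = "TTTTACGTACGGGGGGGGCATGCATTTT"  # hypothetical template
    print(in_silico_pcr(template, "ACGTAC", "TGCATG", max_mm=0))
```

Running the same primer pair across one reference sequence per species and counting how many yield an amplicon gives a coverage percentage of the kind reported above. Production tools usually also weight mismatches near the 3' end more heavily, which this sketch does not do.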
Transovarial persistence of Babesia ovata DNA in a hard tick, Haemaphysalis longicornis, in a semi-artificial mouse skin membrane feeding system.

PubMed

Umemiya-Shirafuji, Rika; Hatta, Takeshi; Okubo, Kazuhiro; Sato, Moeko; Maeda, Hiroki; Kume, Aiko; Yokoyama, Naoaki; Igarashi, Ikuo; Tsuji, Naotoshi; Fujisaki, Kozo; Inoue, Noboru; Suzuki, Hiroshi

2017-12-20

Bovine piroplasmosis, a tick-borne protozoan disease, is a major concern for the cattle industry worldwide due to its negative effects on livestock productivity. Toward the development of novel therapeutic and vaccine approaches, tick-parasite experimental models have been established to clarify the development of parasites in the ticks and the transmission of the parasites by ticks. A novel tick-Babesia experimental infection model recently revealed the time course of Babesia ovata migration in its vector Haemaphysalis longicornis, which is a dominant tick species in Japan. However, there has been no research on the transovarial persistence of B. ovata DNA using this experimental infection model. Here we assessed the presence of B. ovata DNA in eggs derived from parthenogenetic H. longicornis female ticks that had engorged after semi-artificial mouse skin membrane feeding of B. ovata-infected bovine red blood cells. The oviposition period of the engorged female ticks was 21-24 days in the semi-artificial feeding. Total egg weight measured daily reached a peak by day 3 in all female ticks. Nested PCR revealed that 3 of 10 female ticks laid B. ovata DNA-positive eggs after the semi-artificial feeding. In addition, B. ovata DNA was detected at the peak of egg weight during oviposition, indicating that B. ovata exist in the eggs laid a few days after the onset of oviposition in the tick. These findings will contribute to the establishment of B. ovata-infected H. longicornis colonies under laboratory conditions.
Energy Landscapes: From Protein Folding to Molecular Assembly

Science.gov Websites

been used, for example, in DNA origami, in which artificial structures and machines are built in a mechanical processes and eventually to reproduce these in artificial machines. This conference will provide
Sequence periodicity in nucleosomal DNA and intrinsic curvature

PubMed Central

2010-01-01

Background: Most eukaryotic DNA contained in the nucleus is packaged by wrapping DNA around histone octamers. Histones are ubiquitous and bind most regions of chromosomal DNA. In order to achieve smooth wrapping of the DNA around the histone octamer, the DNA duplex should be able to deform and should possess intrinsic curvature. The deformability of DNA is a result of the non-parallelness of base pair stacks. The stacking interaction between base pairs is sequence dependent. The higher the stacking energy, the more rigid the DNA helix; thus it is natural to expect that sequences that are involved in wrapping around the histone octamer should be unstacked and possess intrinsic curvature. Intrinsic curvature has been shown to be dictated by the periodic recurrence of certain dinucleotides. Several genome-wide studies directed towards mapping of nucleosome positions have revealed periodicity associated with certain stretches of sequences. In the current study, these sequences have been analyzed with a view to understanding their sequence-dependent structures. Results: Higher order DNA structures and the distribution of molecular bend loci associated with 146 base nucleosome core DNA sequences from C. elegans and chicken have been analyzed using the theoretical model for DNA curvature. The curvature dispersion calculated by cyclically permuting the sequences revealed that the molecular bend loci were delocalized throughout the nucleosome core region and had varying degrees of intrinsic curvature. Conclusions: The higher order structures associated with nucleosomes of C. elegans and chicken calculated from the sequences revealed heterogeneity with respect to the deviation of the DNA axis. The results point to the possibility that context-dependent curvature of varying degrees is associated with nucleosomal DNA. PMID:20487515
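The "periodic recurrence of certain dinucleotides" mentioned in the Background can be checked with a simple distance histogram: record every position where a chosen dinucleotide occurs and count how often each spacing between occurrences is seen. The sketch below does this for AA and TT. That choice is an assumption made for illustration, since the abstract does not name the dinucleotides, and the code is not the theoretical curvature model used in the study.

```python
# Illustrative sketch only: spacing histogram for selected dinucleotides.
from collections import Counter


def dinucleotide_distance_counts(seq: str, dinucleotides=("AA", "TT"),
                                 max_dist: int = 30) -> Counter:
    """Count distances (in bases, up to max_dist) between all pairs of
    positions at which one of the given dinucleotides starts."""
    seq = seq.upper()
    positions = [i for i in range(len(seq) - 1) if seq[i:i + 2] in dinucleotides]
    counts = Counter()
    for idx, first in enumerate(positions):
        for second in positions[idx + 1:]:
            distance = second - first
            if distance > max_dist:
                break  # positions are sorted, so later ones are farther away
            counts[distance] += 1
    return counts


if __name__ == "__main__":
    # Synthetic sequence with an AA step every 10 bases.
    synthetic = ("AA" + "CGCGCGCG") * 5
    print(sorted(dinucleotide_distance_counts(synthetic).items()))
```

On real nucleosome core sequences, an excess of counts at spacings near one helical turn and its multiples would be the signature of the periodicity described above.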

A survey of the sequence-specific interaction of damaging agents with DNA: emphasis on antitumor agents.

PubMed

Murray, V

1999-01-01

This article reviews the literature concerning the sequence specificity of DNA-damaging agents. DNA-damaging agents are widely used in cancer chemotherapy. It is important to understand fully the determinants of DNA sequence specificity so that more effective DNA-damaging agents can be developed as antitumor drugs. There are five main methods of DNA sequence specificity analysis: cleavage of end-labeled fragments, linear amplification with Taq DNA polymerase, ligation-mediated polymerase chain reaction (PCR), single-strand ligation PCR, and footprinting. The DNA sequence specificity in purified DNA and in intact mammalian cells is reviewed for several classes of DNA-damaging agent. These include agents that form covalent adducts with DNA, free radical generators, topoisomerase inhibitors, intercalators and minor groove binders, enzymes, and electromagnetic radiation. The main sites of adduct formation are at the N-7 of guanine in the major groove of DNA and the N-3 of adenine in the minor groove, whereas free radical generators abstract hydrogen from the deoxyribose sugar and topoisomerase inhibitors cause enzyme-DNA cross-links to form. Several issues involved in the determination of the DNA sequence specificity are discussed. The future directions of the field, with respect to cancer chemotherapy, are also examined.
Deciphering the genomic targets of alkylating polyamide conjugates using high-throughput sequencing

PubMed Central

Chandran, Anandhakumar; Syed, Junetha; Taylor, Rhys D.; Kashiwazaki, Gengo; Sato, Shinsuke; Hashiya, Kaori; Bando, Toshikazu; Sugiyama, Hiroshi

2016-01-01

Chemically engineered small molecules targeting specific genomic sequences play an important role in drug development research. Pyrrole-imidazole polyamides (PIPs) are a group of molecules that can bind to the DNA minor-groove and can be engineered to target specific sequences. Their biological effects rely primarily on their selective DNA binding. However, the binding mechanism of PIPs at the chromatinized genome level is poorly understood. Herein, we report a method using high-throughput sequencing to identify the DNA-alkylating sites of PIP-indole-seco-CBI conjugates. High-throughput sequencing analysis of conjugate 2 showed highly similar DNA-alkylating sites on synthetic oligos (histone-free DNA) and on human genomes (chromatinized DNA context). To our knowledge, this is the first report identifying alkylation sites across genomic DNA by alkylating PIP conjugates using high-throughput sequencing. PMID:27098039
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies.

PubMed

Utturkar, Sagar M; Klingeman, Dawn M; Hurt, Richard A; Brown, Steven D

2017-01-01

This study characterized regions of DNA which remained unassembled by either PacBio or Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as the cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly, which included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
TAL effector-DNA specificity.

PubMed

Scholze, Heidi; Boch, Jens

2010-01-01

TAL effectors are important virulence factors of bacterial plant pathogenic Xanthomonas, which infect a wide variety of plants including valuable crops like pepper, rice, and citrus. TAL proteins are translocated via the bacterial type III secretion system into host cells and induce transcription of plant genes by binding to target gene promoters. Members of the TAL effector family differ mainly in their central domain of tandemly arranged repeats of typically 34 amino acids each with hypervariable di-amino acids at positions 12 and 13. We recently showed that target DNA-recognition specificity of TAL effectors is encoded in a modular and clearly predictable mode. The repeats of TAL effectors feature a surprising one repeat-to-one-bp correlation with different repeat types exhibiting a different DNA base pair specificity. Accordingly, we predicted DNA specificities of TAL effectors and generated artificial TAL proteins with novel DNA recognition specificities. We describe here novel artificial TALs and discuss implications for the DNA recognition specificity. The unique TAL-DNA binding domain allows design of proteins with potentially any given DNA recognition specificity enabling many uses for biotechnology.
Scalable whole-exome sequencing of cell-free DNA reveals high concordance with metastatic tumors.

PubMed

Adalsteinsson, Viktor A; Ha, Gavin; Freeman, Samuel S; Choudhury, Atish D; Stover, Daniel G; Parsons, Heather A; Gydush, Gregory; Reed, Sarah C; Rotem, Denisse; Rhoades, Justin; Loginov, Denis; Livitz, Dimitri; Rosebrock, Daniel; Leshchiner, Ignaty; Kim, Jaegil; Stewart, Chip; Rosenberg, Mara; Francis, Joshua M; Zhang, Cheng-Zhong; Cohen, Ofir; Oh, Coyin; Ding, Huiming; Polak, Paz; Lloyd, Max; Mahmud, Sairah; Helvie, Karla; Merrill, Margaret S; Santiago, Rebecca A; O'Connor, Edward P; Jeong, Seong H; Leeson, Rachel; Barry, Rachel M; Kramkowski, Joseph F; Zhang, Zhenwei; Polacek, Laura; Lohr, Jens G; Schleicher, Molly; Lipscomb, Emily; Saltzman, Andrea; Oliver, Nelly M; Marini, Lori; Waks, Adrienne G; Harshman, Lauren C; Tolaney, Sara M; Van Allen, Eliezer M; Winer, Eric P; Lin, Nancy U; Nakabayashi, Mari; Taplin, Mary-Ellen; Johannessen, Cory M; Garraway, Levi A; Golub, Todd R; Boehm, Jesse S; Wagle, Nikhil; Getz, Gad; Love, J Christopher; Meyerson, Matthew

2017-11-06

Whole-exome sequencing of cell-free DNA (cfDNA) could enable comprehensive profiling of tumors from blood but the genome-wide concordance between cfDNA and tumor biopsies is uncertain. Here we report ichorCNA, software that quantifies tumor content in cfDNA from 0.1× coverage whole-genome sequencing data without prior knowledge of tumor mutations. We apply ichorCNA to 1439 blood samples from 520 patients with metastatic prostate or breast cancers. In the earliest tested sample for each patient, 34% of patients have ≥10% tumor-derived cfDNA, sufficient for standard coverage whole-exome sequencing. Using whole-exome sequencing, we validate the concordance of clonal somatic mutations (88%), copy number alterations (80%), mutational signatures, and neoantigens between cfDNA and matched tumor biopsies from 41 patients with ≥10% cfDNA tumor content. In summary, we provide methods to identify patients eligible for comprehensive cfDNA profiling, revealing its applicability to many patients, and demonstrate high concordance of cfDNA and metastatic tumor whole-exome sequencing.
An evolution based biosensor receptor DNA sequence generation algorithm.

PubMed

Kim, Eungyeong; Lee, Malrey; Gatton, Thomas M; Lee, Jaewan; Zang, Yupeng

2010-01-01

A biosensor is composed of a bioreceptor, an associated recognition molecule, and a signal transducer that can selectively detect target substances for analysis. DNA based biosensors utilize receptor molecules that allow hybridization with the target analyte. However, most DNA biosensor research uses oligonucleotides as the target analytes and does not address the potential problems of real samples. The identification of recognition molecules suitable for real target analyte samples is an important step towards further development of DNA biosensors. This study examines the characteristics of DNA used as bioreceptors and proposes a hybrid evolution-based DNA sequence generating algorithm, based on DNA computing, to identify suitable DNA bioreceptor recognition molecules for stable hybridization with real target substances. The Traveling Salesman Problem (TSP) approach is applied in the proposed algorithm to evaluate the safety and fitness of the generated DNA sequences. This approach improves efficiency and stability for enhanced and variable-length DNA sequence generation and allows extension to generation of variable-length DNA sequences with diverse receptor recognition requirements.
RDNAnalyzer: A tool for DNA secondary structure prediction and sequence analysis

PubMed Central

Afzal, Muhammad; Shahid, Ahmad Ali; Shehzadi, Abida; Nadeem, Shahid; Husnain, Tayyab

2012-01-01

RDNAnalyzer is an innovative computer based tool designed for DNA secondary structure prediction and sequence analysis. It can randomly generate a DNA sequence, or users can upload sequences of their own interest in RAW format. It uses and extends the Nussinov dynamic programming algorithm and has various applications for sequence analysis. It predicts the DNA secondary structure and base pairings. It also provides tools for sequence analyses routinely performed by biological scientists, such as DNA replication, reverse complement generation, transcription, translation, sequence specific information such as the total number of nucleotide bases and ATGC base contents along with their respective percentages, and a sequence cleaner. RDNAnalyzer is a unique tool developed in Microsoft Visual Studio 2008 using Microsoft Visual C# and Windows Presentation Foundation and provides a user friendly environment for sequence analysis. It is freely available. Availability http://www.cemb.edu.pk/sw.html Abbreviations RDNAnalyzer - Random DNA Analyser, GUI - Graphical user interface, XAML - Extensible Application Markup Language. PMID:23055611
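The analyses named in this abstract can be illustrated with a short Python sketch. It is not RDNAnalyzer's code (the tool itself is written in C#), and the function names and the `min_loop` parameter are hypothetical choices made for illustration. It shows reverse complement generation, ATGC base contents with percentages, and the basic Nussinov recurrence for the maximum number of nested A-T and G-C pairs in a single strand; the tool's own extensions to Nussinov are not described in the abstract and are not reproduced here.

```python
from collections import Counter

COMPLEMENT = str.maketrans("ACGT", "TGCA")
# Watson-Crick pairs only; an assumption for this sketch.
PAIRS = {("A", "T"), ("T", "A"), ("G", "C"), ("C", "G")}


def reverse_complement(seq):
    """Complement every base, then reverse the strand."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def base_composition(seq):
    """Map each base to (count, percentage); expects a non-empty sequence."""
    seq = seq.upper()
    counts = Counter(seq)
    return {b: (counts[b], 100.0 * counts[b] / len(seq)) for b in "ATGC"}


def nussinov_max_pairs(seq, min_loop=3):
    """Basic Nussinov recurrence: maximum number of nested base pairs.

    min_loop is the minimum number of unpaired bases enclosed by a pair
    (a hypothetical default, not taken from the abstract).
    """
    seq = seq.upper()
    n = len(seq)
    if n == 0:
        return 0
    dp = [[0] * n for _ in range(n)]
    for span in range(min_loop + 1, n):
        for i in range(n - span):
            j = i + span
            best = dp[i][j - 1]  # case 1: base j stays unpaired
            for k in range(i, j - min_loop):  # case 2: base j pairs with k
                if (seq[k], seq[j]) in PAIRS:
                    left = dp[i][k - 1] if k > i else 0
                    best = max(best, left + 1 + dp[k + 1][j - 1])
            dp[i][j] = best
    return dp[0][n - 1]
```

For example, `nussinov_max_pairs("GGGAAAACCC")` counts the three G-C pairs of a simple hairpin whose loop is the run of four A bases.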
Structural and Thermodynamic Signatures of DNA Recognition by Mycobacterium tuberculosis DnaA

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tsodikov, Oleg V.; Biswas, Tapan

An essential protein, DnaA, binds to 9-bp DNA sites within the origin of replication oriC. These binding events are prerequisite to forming an enigmatic nucleoprotein scaffold that initiates replication. The number, sequences, positions, and orientations of these short DNA sites, or DnaA boxes, within the oriCs of different bacteria vary considerably. To investigate features of DnaA boxes that are important for binding Mycobacterium tuberculosis DnaA (MtDnaA), we have determined the crystal structures of the DNA binding domain (DBD) of MtDnaA bound to a cognate MtDnaA-box (at 2.0 Å resolution) and to a consensus Escherichia coli DnaA-box (at 2.3 Å). These structures, complemented by calorimetric equilibrium binding studies of MtDnaA DBD in a series of DnaA-box variants, reveal the main determinants of DNA recognition and establish the [T/C][T/A][G/A]TCCACA sequence as a high-affinity MtDnaA-box. Bioinformatic and calorimetric analyses indicate that DnaA-box sequences in mycobacterial oriCs generally differ from the optimal binding sequence. This sequence variation occurs commonly at the first 2 bp, making an in vivo mycobacterial DnaA-box effectively a 7-mer and not a 9-mer. We demonstrate that the decrease in the affinity of these MtDnaA-box variants for MtDnaA DBD relative to that of the highest-affinity box TTGTCCACA is less than 10-fold. The understanding of DnaA-box recognition by MtDnaA and E. coli DnaA enables one to map DnaA-box sequences in the genomes of M. tuberculosis and other eubacteria.
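The mapping of DnaA-box sequences mentioned at the end of this abstract can be sketched as a simple motif scan. The Python below is only an illustration and not the authors' procedure: it translates the reported high-affinity pattern [T/C][T/A][G/A]TCCACA into a regular expression and searches both strands. The function name `find_boxes` and the convention of reporting 0-based start positions on the forward strand are assumptions.

```python
import re

# [T/C][T/A][G/A]TCCACA as a regex; the lookahead allows overlapping hits.
MTDNAA_BOX = re.compile(r"(?=([TC][TA][GA]TCCACA))")
BOX_LENGTH = 9
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def find_boxes(seq):
    """Return sorted (start, strand, matched box) tuples.

    start is the 0-based position of the leftmost base of the site on the
    forward strand, for hits on either strand.
    """
    seq = seq.upper()
    n = len(seq)
    hits = [(m.start(), "+", m.group(1)) for m in MTDNAA_BOX.finditer(seq)]
    reverse_strand = seq.translate(COMPLEMENT)[::-1]
    for m in MTDNAA_BOX.finditer(reverse_strand):
        # Convert the reverse-strand coordinate back to the forward strand.
        hits.append((n - (m.start() + BOX_LENGTH), "-", m.group(1)))
    return sorted(hits)
```

Since the abstract notes that in vivo boxes often deviate at the first 2 bp, a looser scan could drop the first two positions of the pattern; that variation is left out of this sketch.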
An origin-deficient yeast artificial chromosome triggers a cell cycle checkpoint.

PubMed

van Brabant, A J; Buchanan, C D; Charboneau, E; Fangman, W L; Brewer, B J

2001-04-01

Checkpoint controls coordinate entry into mitosis with the completion of DNA replication. Depletion of nucleotide precursors by treatment with the drug hydroxyurea triggers such a checkpoint response. However, it is not clear whether the signal for this hydroxyurea-induced checkpoint pathway is the presence of unreplicated DNA, or rather the persistence of single-stranded or damaged DNA. In a yeast artificial chromosome (YAC) we have engineered an approximately 170 kb region lacking efficient replication origins that allows us to explore the specific effects of unreplicated DNA on cell cycle progression. Replication of this YAC extends the length of S phase and causes cells to engage an S/M checkpoint. In the absence of Rad9 the YAC becomes unstable, undergoing deletions within the origin-free region.
DNA barcode goes two-dimensions: DNA QR code web server.

PubMed

Liu, Chang; Shi, Linchun; Xu, Xiaolan; Li, Huan; Xing, Hang; Liang, Dong; Jiang, Kun; Pang, Xiaohui; Song, Jingyuan; Chen, Shilin

2012-01-01

The DNA barcoding technology uses a standard region of DNA sequence for species identification and discovery. At present, "DNA barcode" actually refers to DNA sequences, which are not amenable to information storage, recognition, and retrieval. Our aim is to identify the best symbology that can represent DNA barcode sequences in practical applications. A comprehensive set of sequences for five DNA barcode markers ITS2, rbcL, matK, psbA-trnH, and CO1 was used as the test data. Fifty-three different types of one-dimensional and ten two-dimensional barcode symbologies were compared based on different criteria, such as coding capacity, compression efficiency, and error detection ability. The quick response (QR) code was found to have the largest coding capacity and relatively high compression ratio. To facilitate the further usage of QR code-based DNA barcodes, a web server was developed and is accessible at http://qrfordna.dnsalias.org. The web server allows users to retrieve the QR code for a species of interests, convert a DNA sequence to and from a QR code, and perform species identification based on local and global sequence similarities. In summary, the first comprehensive evaluation of various barcode symbologies has been carried out. The QR code has been found to be the most appropriate symbology for DNA barcode sequences. A web server has also been constructed to allow biologists to utilize QR codes in practical DNA barcoding applications.
TaxI: a software tool for DNA barcoding using distance methods

PubMed Central

Steinke, Dirk; Vences, Miguel; Salzburger, Walter; Meyer, Axel

2005-01-01

DNA barcoding is a promising approach to the diagnosis of biological diversity in which DNA sequences serve as the primary key for information retrieval. Most existing software for evolutionary analysis of DNA sequences was designed for phylogenetic analyses and, hence, those algorithms do not offer appropriate solutions for the rapid, but precise analyses needed for DNA barcoding, and are also unable to process the often large comparative datasets. We developed a flexible software tool for DNA taxonomy, named TaxI. This program calculates sequence divergences between a query sequence (taxon to be barcoded) and each sequence of a dataset of reference sequences defined by the user. Because the analysis is based on separate pairwise alignments this software is also able to work with sequences characterized by multiple insertions and deletions that are difficult to align in large sequence sets (i.e. thousands of sequences) by multiple alignment algorithms because of computational restrictions. Here, we demonstrate the utility of this approach with two datasets of fish larvae and juveniles from Lake Constance and juvenile land snails under different models of sequence evolution. Sets of ribosomal 16S rRNA sequences, characterized by multiple indels, performed as good as or better than cox1 sequence sets in assigning sequences to species, demonstrating the suitability of rRNA genes for DNA barcoding. PMID:16214755
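The distance step described in this abstract can be sketched as follows. This Python is not TaxI's code: TaxI performs its own pairwise alignments and supports several models of sequence evolution, whereas this sketch assumes each query/reference pair has already been aligned (gaps written as `-`) and uses only the uncorrected p-distance. The names `p_distance` and `closest_reference` are hypothetical.

```python
def p_distance(aligned_a, aligned_b):
    """Uncorrected divergence: mismatches / compared sites, skipping gap columns."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    mismatches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue  # indel column: ignored in this simple model
        compared += 1
        if x != y:
            mismatches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return mismatches / compared


def closest_reference(pairwise_alignments):
    """pairwise_alignments maps reference name -> (aligned query, aligned reference).

    Returns the name of the least divergent reference and all distances.
    """
    distances = {
        name: p_distance(query, ref)
        for name, (query, ref) in pairwise_alignments.items()
    }
    best = min(distances, key=distances.get)
    return best, distances
```

Because every reference is compared through its own pairwise alignment, indel-rich markers such as 16S rRNA do not need to fit into one large multiple alignment, which is the design point the abstract emphasizes.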
DNA Sequencing

DOEpatents

Tabor, Stanley; Richardson, Charles C.

1995-04-25

A method for sequencing a strand of DNA, including the steps of: providing the strand of DNA; annealing the strand with a primer able to hybridize to the strand to give an annealed mixture; incubating the mixture with four deoxyribonucleoside triphosphates, a DNA polymerase, and at least three deoxyribonucleoside triphosphates in different amounts, under conditions favoring primer extension to form nucleic acid fragments complementary to the DNA to be sequenced; labelling the nucleic acid fragments; separating them and determining the position of the deoxyribonucleoside triphosphates by differences in the intensity of the labels, thereby to determine the DNA sequence.
High-fidelity target sequencing of individual molecules identified using barcode sequences: de novo detection and absolute quantitation of mutations in plasma cell-free DNA from cancer patients.

PubMed

Kukita, Yoji; Matoba, Ryo; Uchida, Junji; Hamakawa, Takuya; Doki, Yuichiro; Imamura, Fumio; Kato, Kikuya

2015-08-01

Circulating tumour DNA (ctDNA) is an emerging field of cancer research. However, current ctDNA analysis is usually restricted to one or a few mutation sites due to technical limitations. In the case of massively parallel DNA sequencers, the number of false positives caused by a high read error rate is a major problem. In addition, the final sequence reads do not represent the original DNA population due to the global amplification step during the template preparation. We established a high-fidelity target sequencing system of individual molecules identified in plasma cell-free DNA using barcode sequences; this system consists of the following two steps. (i) A novel target sequencing method that adds barcode sequences by adaptor ligation. This method uses linear amplification to eliminate the errors introduced during the early cycles of polymerase chain reaction. (ii) The monitoring and removal of erroneous barcode tags. This process involves the identification of individual molecules that have been sequenced and for which the number of mutations has been absolutely quantitated. Using plasma cell-free DNA from patients with gastric or lung cancer, we demonstrated that the system achieved near complete elimination of false positives and enabled de novo detection and absolute quantitation of mutations in plasma cell-free DNA. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Sequence-Dependent Diastereospecific and Diastereodivergent Crosslinking of DNA by Decarbamoylmitomycin C.

PubMed

Aguilar, William; Paz, Manuel M; Vargas, Anayatzinc; Clement, Cristina C; Cheng, Shu-Yuan; Champeil, Elise

2018-04-20

Mitomycin C (MC), a potent antitumor drug, and decarbamoylmitomycin C (DMC), a derivative lacking the carbamoyl group, form highly cytotoxic DNA interstrand crosslinks. The major interstrand crosslink formed by DMC is the C1'' epimer of the major crosslink formed by MC. The molecular basis for the stereochemical configuration exhibited by DMC was investigated using biomimetic synthesis. The formation of DNA-DNA crosslinks by DMC is diastereospecific and diastereodivergent: Only the 1''S-diastereomer of the initially formed monoadduct can form crosslinks at GpC sequences, and only the 1''R-diastereomer of the monoadduct can form crosslinks at CpG sequences. We also show that CpG and GpC sequences react with divergent diastereoselectivity in the first alkylation step: 1''S stereochemistry is favored at GpC sequences and 1''R stereochemistry is favored at CpG sequences. Therefore, the first alkylation step results, at each sequence, in the selective formation of the diastereomer able to generate an interstrand DNA-DNA crosslink after the "second arm" alkylation. Examination of the known DNA adduct pattern obtained after treatment of cancer cell cultures with DMC indicates that the GpC sequence is the major target for the formation of DNA-DNA crosslinks in vivo by this drug. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Sequencing historical specimens: successful preparation of small specimens with low amounts of degraded DNA.

PubMed

Sproul, John S; Maddison, David R

2017-11-01

Despite advances that allow DNA sequencing of old museum specimens, sequencing small-bodied, historical specimens can be challenging and unreliable as many contain only small amounts of fragmented DNA. Dependable methods to sequence such specimens are especially critical if the specimens are unique. We attempt to sequence small-bodied (3-6 mm) historical specimens (including nomenclatural types) of beetles that have been housed, dried, in museums for 58-159 years, and for which few or no suitable replacement specimens exist. To better understand ideal approaches of sample preparation and produce preparation guidelines, we compared different library preparation protocols using low amounts of input DNA (1-10 ng). We also explored low-cost optimizations designed to improve library preparation efficiency and sequencing success of historical specimens with minimal DNA, such as enzymatic repair of DNA. We report successful sample preparation and sequencing for all historical specimens despite our low-input DNA approach. We provide a list of guidelines related to DNA repair, bead handling, reducing adapter dimers and library amplification. We present these guidelines to facilitate more economical use of valuable DNA and enable more consistent results in projects that aim to sequence challenging, irreplaceable historical specimens. © 2017 John Wiley & Sons Ltd.
i-rDNA: alignment-free algorithm for rapid in silico detection of ribosomal gene fragments from metagenomic sequence data sets.

PubMed

Mohammed, Monzoorul Haque; Ghosh, Tarini Shankar; Chadaram, Sudha; Mande, Sharmila S

2011-11-30

Obtaining accurate estimates of microbial diversity using rDNA profiling is the first step in most metagenomics projects. Consequently, most metagenomic projects spend considerable amounts of time, money and manpower for experimentally cloning, amplifying and sequencing the rDNA content in a metagenomic sample. In the second step, the entire genomic content of the metagenome is extracted, sequenced and analyzed. Since DNA sequences obtained in this second step also contain rDNA fragments, rapid in silico identification of these rDNA fragments would drastically reduce the cost, time and effort of current metagenomic projects by entirely bypassing the experimental steps of primer based rDNA amplification, cloning and sequencing. In this study, we present an algorithm called i-rDNA that can facilitate the rapid detection of 16S rDNA fragments from amongst millions of sequences in metagenomic data sets with high detection sensitivity. Performance evaluation with data sets/database variants simulating typical metagenomic scenarios indicates the significantly high detection sensitivity of i-rDNA. Moreover, i-rDNA can process a million sequences in less than an hour on a simple desktop with modest hardware specifications. In addition to the speed of execution, high sensitivity and low false positive rate, the utility of the algorithmic approach discussed in this paper is immense given that it would help in bypassing the entire experimental step of primer-based rDNA amplification, cloning and sequencing. Application of this algorithmic approach would thus drastically reduce the cost, time and human efforts invested in all metagenomic projects. A web-server for the i-rDNA algorithm is available at http://metagenomics.atc.tcs.com/i-rDNA/
Biosensors for DNA sequence detection

NASA Technical Reports Server (NTRS)

Vercoutere, Wenonah; Akeson, Mark

2002-01-01

DNA biosensors are being developed as alternatives to conventional DNA microarrays. These devices couple signal transduction directly to sequence recognition. Some of the most sensitive and functional technologies use fibre optics or electrochemical sensors in combination with DNA hybridization. In a shift from sequence recognition by hybridization, two emerging single-molecule techniques read sequence composition using zero-mode waveguides or electrical impedance in nanoscale pores.
DNA Sequences from Formalin-Fixed Nematodes: Integrating Molecular and Morphological Approaches to Taxonomy

PubMed Central

Thomas, W. Kelley; Vida, J. T.; Frisse, Linda M.; Mundo, Manuel; Baldwin, James G.

1997-01-01

To effectively integrate DNA sequence analysis and classical nematode taxonomy, we must be able to obtain DNA sequences from formalin-fixed specimens. Microdissected sections of nematodes were removed from specimens fixed in formalin, using standard protocols and without destroying morphological features. The fixed sections provided sufficient template for multiple polymerase chain reaction-based DNA sequence analyses. PMID:19274156
High-Definition Medicine.

PubMed

Torkamani, Ali; Andersen, Kristian G; Steinhubl, Steven R; Topol, Eric J

2017-08-24

The foundation for a new era of data-driven medicine has been set by recent technological advances that enable the assessment and management of human health at an unprecedented level of resolution, which we refer to as high-definition medicine. Our ability to assess human health in high definition is enabled, in part, by advances in DNA sequencing, physiological and environmental monitoring, advanced imaging, and behavioral tracking. Our ability to understand and act upon these observations at equally high precision is driven by advances in genome editing, cellular reprogramming, tissue engineering, and information technologies, especially artificial intelligence. In this review, we will examine the core disciplines that enable high-definition medicine and project how these technologies will alter the future of medicine. Copyright © 2017 Elsevier Inc. All rights reserved.
Back to BAC: The Use of Infectious Clone Technologies for Viral Mutagenesis

PubMed Central

Hall, Robyn N.; Meers, Joanne; Fowler, Elizabeth; Mahony, Timothy

2012-01-01

Bacterial artificial chromosome (BAC) vectors were first developed to facilitate the propagation and manipulation of large DNA fragments in molecular biology studies for uses such as genome sequencing projects and genetic disease models. To facilitate these studies, methodologies have been developed to introduce specific mutations that can be directly applied to the mutagenesis of infectious clones (icBAC) using BAC technologies. This has resulted in rapid identification of gene function and expression at unprecedented rates. Here we review the major developments in BAC mutagenesis in vitro. This review summarises the technologies used to construct and introduce mutations into herpesvirus icBAC. It also explores developing technologies likely to provide the next leap in understanding these important viruses. PMID:22470833

A new family of satellite DNA sequences as a major component of centromeric heterochromatin in owls (Strigiformes).

PubMed

Yamada, Kazuhiko; Nishida-Umehara, Chizuko; Matsuda, Yoichi

2004-03-01

We isolated a new family of satellite DNA sequences from HaeIII- and EcoRI-digested genomic DNA of the Blakiston's fish owl (Ketupa blakistoni). The repetitive sequences were organized in tandem arrays of the 174 bp element, and localized to the centromeric regions of all macrochromosomes, including the Z and W chromosomes, and microchromosomes. This hybridization pattern was consistent with the distribution of C-band-positive centromeric heterochromatin, and the satellite DNA sequences occupied 10% of the total genome as a major component of centromeric heterochromatin. The sequences were homogenized between macro- and microchromosomes in this species, and therefore intraspecific divergence of the nucleotide sequences was low. The 174 bp element cross-hybridized to the genomic DNA of six other Strigidae species, but not to that of the Tytonidae, suggesting that the satellite DNA sequences are conserved in the same family but fairly divergent between the different families in the Strigiformes. Secondly, the centromeric satellite DNAs were cloned from eight Strigidae species, and the nucleotide sequences of 41 monomer fragments were compared within and between species. Molecular phylogenetic relationships of the nucleotide sequences were highly correlated with both the taxonomy based on morphological traits and the phylogenetic tree constructed by DNA-DNA hybridization. These results suggest that the satellite DNA sequence has evolved by concerted evolution in the Strigidae and that it is a good taxonomic and phylogenetic marker to examine genetic diversity between Strigiformes species.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Sobottka, Marcelo, E-mail: sobottka@mtm.ufsc.br; Hart, Andrew G., E-mail: ahart@dim.uchile.cl

Highlights: We propose a simple stochastic model to construct primitive DNA sequences. The model provides an explanation for Chargaff's second parity rule in primitive DNA sequences. The model is also used to predict a novel type of strand symmetry in primitive DNA sequences. We extend the results to bacterial DNA sequences and compare distributional properties intrinsic to the model to statistical estimates from 1049 bacterial genomes. We find statistical evidence that the novel type of strand symmetry holds for bacterial DNA sequences. Abstract: Chargaff's second parity rule for short oligonucleotides states that the frequency of any short nucleotide sequence on a strand is approximately equal to the frequency of its reverse complement on the same strand. Recent studies have shown that, with the exception of organellar DNA, this parity rule generally holds for double-stranded DNA genomes and fails to hold for single-stranded genomes. While Chargaff's first parity rule is fully explained by the Watson-Crick pairing in the DNA double helix, a definitive explanation for the second parity rule has not yet been determined. In this work, we propose a model based on a hidden Markov process for approximating the distributional structure of primitive DNA sequences. Then, we use the model to provide another possible theoretical explanation for Chargaff's second parity rule, and to predict novel distributional aspects of bacterial DNA sequences.
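Chargaff's second parity rule, as stated in this abstract, can be checked empirically on any single strand. The Python sketch below is only an illustration of that check and has nothing to do with the authors' hidden Markov model; the function names and the particular summary statistic (half the summed absolute count differences between each k-mer and its reverse complement, divided by the number of k-mers) are assumptions chosen for simplicity.

```python
from collections import Counter

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(kmer):
    return kmer.translate(COMPLEMENT)[::-1]


def kmer_counts(seq, k):
    """Count all overlapping k-mers on the given strand."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def parity_deviation(seq, k):
    """0.0 when every k-mer is exactly as frequent as its reverse complement
    on the same strand; 1.0 when no k-mer's reverse complement occurs at all.
    """
    counts = kmer_counts(seq, k)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence is shorter than k")
    words = set(counts) | {reverse_complement(w) for w in counts}
    diff = sum(abs(counts[w] - counts[reverse_complement(w)]) for w in words)
    return diff / (2 * total)  # each k-mer/complement pair is visited twice
```

Under the rule, a long double-stranded bacterial genome would be expected to give values near zero for small k, while single-stranded genomes would not; the sketch makes no claim about what threshold separates the two.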
A Simulation of DNA Sequencing Utilizing 3M Post-It[R] Notes

ERIC Educational Resources Information Center

Christensen, Doug

2009-01-01

An inexpensive and equipment-free approach to teaching the technical aspects of DNA sequencing. The activity described requires an instructor familiar with DNA sequencing technology but provides a straightforward method of teaching the technical aspects of sequencing in the absence of expensive sequencing equipment. The final sequence…
DNA and RNA sequencing by nanoscale reading through programmable electrophoresis and nanoelectrode-gated tunneling and dielectric detection

DOEpatents

Lee, James W.; Thundat, Thomas G.

2005-06-14

An apparatus and method for performing nucleic acid (DNA and/or RNA) sequencing on a single molecule. The genetic sequence information is obtained by probing through a DNA or RNA molecule base by base at nanometer scale as though looking through a strip of movie film. This DNA sequencing nanotechnology has the theoretical capability of performing DNA sequencing at a maximal rate of about 1,000,000 bases per second. This enhanced performance is made possible by a series of innovations including: novel applications of a fine-tuned nanometer gap for passage of a single DNA or RNA molecule; thin layer microfluidics for sample loading and delivery; and programmable electric fields for precise control of DNA or RNA movement. Detection methods include nanoelectrode-gated tunneling current measurements, dielectric molecular characterization, and atomic force microscopy/electrostatic force microscopy (AFM/EFM) probing for nanoscale reading of the nucleic acid sequences.
The sequence specificity of UV-induced DNA damage in a systematically altered DNA sequence.

PubMed

Khoe, Clairine V; Chung, Long H; Murray, Vincent

2018-06-01

The sequence specificity of UV-induced DNA damage was investigated in a specifically designed DNA plasmid using two procedures: end-labelling and linear amplification. Absorption of UV photons by DNA leads to dimerisation of pyrimidine bases and produces two major photoproducts, cyclobutane pyrimidine dimers (CPDs) and pyrimidine(6-4)pyrimidone photoproducts (6-4PPs). A previous study had determined that two hexanucleotide sequences, 5'-GCTC*AC and 5'-TATT*AA, were high intensity UV-induced DNA damage sites. The UV clone plasmid was constructed by systematically altering each nucleotide of these two hexanucleotide sequences. One of the main goals of this study was to determine the influence of single nucleotide alterations on the intensity of UV-induced DNA damage. The sequence 5'-GCTC*AC was designed to examine the sequence specificity of 6-4PPs and the highest intensity 6-4PP damage sites were found at 5'-GTTC*CC nucleotides. The sequence 5'-TATT*AA was devised to investigate the sequence specificity of CPDs and the highest intensity CPD damage sites were found at 5'-TTTT*CG nucleotides. It was proposed that the tetranucleotide DNA sequence, 5'-YTC*Y (where Y is T or C), was the consensus sequence for the highest intensity UV-induced 6-4PP adduct sites; while it was 5'-YTT*C for the highest intensity UV-induced CPD damage sites. These consensus tetranucleotides are composed entirely of consecutive pyrimidines and must have a DNA conformation that is highly productive for the absorption of UV photons. Crown Copyright © 2018. Published by Elsevier B.V. All rights reserved.
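The consensus tetranucleotides reported above can be located in an arbitrary sequence with a short pattern search. The sketch below is an illustration only: `find_sites` and the small `IUPAC` table are hypothetical names, the asterisk that appears in the source notation is dropped, and only one strand is scanned.

```python
import re

# Minimal IUPAC-to-regex table; Y (pyrimidine) is the only degenerate code
# needed for the consensus sequences quoted in the abstract.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "Y": "[CT]", "R": "[AG]"}


def find_sites(seq, consensus):
    """Return 0-based start positions of every (overlapping) consensus match."""
    pattern = "".join(IUPAC[base] for base in consensus)
    # A lookahead lets overlapping matches be reported as well.
    return [m.start() for m in re.finditer(f"(?=({pattern}))", seq)]


# Example on a made-up sequence (not data from the study).
example = "GCTCCATTTC"
sites_64pp = find_sites(example, "YTCY")  # consensus reported for 6-4PP sites
sites_cpd = find_sites(example, "YTTC")   # consensus reported for CPD sites
```

Matching the consensus only flags candidate sites; the damage intensities in the study were measured experimentally and cannot be inferred from the pattern alone.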
Genetic relationships among Hylocereus and Selenicereus vine cacti (Cactaceae): evidence from hybridization and cytological studies.

PubMed

Tel-Zur, Noemi; Abbo, Shahal; Bar-Zvi, Dudy; Mizrahi, Yosef

2004-10-01

Hylocereus and Selenicereus are native to tropical and sub-tropical America. Based on its taxonomic status and crossability relations it was postulated that H. megalanthus (syn. S. megalanthus) is an allotetraploid (2n = 4x = 44) derived from natural hybridization between two closely related diploid taxa. The present work aimed at elucidating the genetic relationships between species of the two genera. Crosses were performed and the putative hybrids were analysed by chromosome counts and morphological traits. The ploidy level of hybrids was confirmed by fluorescent in situ hybridization (FISH) of rDNA sites. Genomic in situ hybridization (GISH) was used in an attempt to identify the putative diploid genome donors of H. megalanthus and an artificial interploid hybrid. Reciprocal crosses among four diploid Hylocereus species (H. costaricensis, H. monacanthus (syn. H. polyrhizus), H. undatus and Hylocereus sp.) yielded viable diploid hybrids, with regular chromosome pairing. Reciprocal crosses between these Hylocereus spp. and H. megalanthus yielded viable triploid, pentaploid, hexaploid and aneuploid hybrids. Morphological and phenological traits confirm the hybrid origin. In situ detection of rDNA sites was in accord with the ploidy status of the species and hybrid studied. GISH results indicated that overall sequence composition of H. megalanthus is similar to that of H. ocamponis and S. grandiflorus. High sequence similarity was also found between the parental genomes of H. monacanthus and H. megalanthus in one triploid hybrid. The ease of obtaining partially fertile F1 hybrids and the relative sequence similarity (in GISH study) suggest close genetic relationships among the taxa analysed.
Randomized DNA libraries construction tool: a new 3-bp 'frequent cutter' TthHB27I/sinefungin endonuclease with chemically-induced specificity.

PubMed

Krefft, Daria; Papkov, Aliaksei; Prusinowski, Maciej; Zylicz-Stachula, Agnieszka; Skowron, Piotr M

2018-05-11

Acoustic or hydrodynamic shearing, sonication and enzymatic digestion are used to fragment DNA. However, these methods have several disadvantages, such as DNA damage, difficulties in fragmentation control, irreproducibility and under-representation of some DNA segments. The ideal DNA fragmentation tool would be a gentle enzymatic method, offering cleavage frequency high enough to eliminate DNA fragment distribution bias and allow for easy control of partial digests. Only three such frequently cleaving natural restriction endonucleases (REases) have been discovered: CviJI, SetI and FaiI. Therefore, we have previously developed two artificial enzymatic specificities, cleaving DNA approximately every ~3-bp: TspGWI/sinefungin (SIN) and TaqII/SIN. In this paper we present the third developed specificity: TthHB27I/SIN(SAM), a new genomic tool based on Type IIS/IIC/IIG Thermus-family REases-methyltransferases (MTases). In the presence of dimethyl sulfoxide (DMSO) and S-adenosyl-L-methionine (SAM) or its analogue SIN, the 6-bp cognate TthHB27I recognition sequence 5'-CAARCA-3' is converted into a combined 3.2-3.0-bp 'site' or its statistical equivalent, while a cleavage distance of 11/9 nt is retained. Protocols for various modes of limited DNA digestion were developed. In the presence of DMSO and SAM or SIN, TthHB27I is transformed from a rare 6-bp cutter into a very frequent one, approximately 3-bp. Thus, TthHB27I/SIN(SAM) is a new tool in the very sparsely represented segment of such prototype REase specificities. Moreover, this modified TthHB27I enzyme is uniquely suited for controlled DNA fragmentation, owing to partial DNA cleavage, which is an inherent feature of the Thermus-family enzymes. Such a tool can be used for generating quasi-random libraries as well as for other DNA manipulations requiring high-frequency cleavage and a uniform distribution of cuts along DNA.
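To make the contrast between a 6-bp and a roughly 3-bp cutter concrete, the sketch below estimates how often a recognition site is expected to occur. It assumes a uniform random sequence with independent, equiprobable bases and considers one strand only; neither assumption comes from the paper, and `expected_spacing` and `DEGENERACY` are hypothetical names.

```python
from fractions import Fraction

# Number of bases each IUPAC symbol can stand for (subset sufficient here).
DEGENERACY = {"A": 1, "C": 1, "G": 1, "T": 1, "R": 2, "Y": 2, "N": 4}


def expected_spacing(site):
    """Mean distance (bp) between occurrences of `site` on one strand.

    Assumes each base is drawn independently with probability 1/4,
    which real genomes only approximate.
    """
    probability = Fraction(1)
    for symbol in site:
        probability *= Fraction(DEGENERACY[symbol], 4)
    return 1 / probability


cognate = expected_spacing("CAARCA")  # 6-bp cognate site with one degenerate position
three_bp = expected_spacing("CAA")    # any fully specified 3-bp site, for comparison
```

Under these assumptions the cognate 6-bp site is expected about once per 2048 bp, while a fully specified 3-bp site is expected about once per 64 bp, which is the sense in which a relaxed specificity makes the enzyme a "frequent cutter".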
Torque measurements reveal sequence-specific cooperative transitions in supercoiled DNA

PubMed Central

Oberstrass, Florian C.; Fernandes, Louis E.; Bryant, Zev

2012-01-01

B-DNA becomes unstable under superhelical stress and is able to adopt a wide range of alternative conformations including strand-separated DNA and Z-DNA. Localized sequence-dependent structural transitions are important for the regulation of biological processes such as DNA replication and transcription. To directly probe the effect of sequence on structural transitions driven by torque, we have measured the torsional response of a panel of DNA sequences using single molecule assays that employ nanosphere rotational probes to achieve high torque resolution. The responses of Z-forming d(pGpC)n sequences match our predictions based on a theoretical treatment of cooperative transitions in helical polymers. “Bubble” templates containing 50–100 bp mismatch regions show cooperative structural transitions similar to B-DNA, although less torque is required to disrupt strand–strand interactions. Our mechanical measurements, including direct characterization of the torsional rigidity of strand-separated DNA, establish a framework for quantitative predictions of the complex torsional response of arbitrary sequences in their biological context. PMID:22474350
Ancient DNA from Pleistocene fossils: Preservation, recovery, and utility of ancient genetic information for Quaternary research

NASA Astrophysics Data System (ADS)

Yang, Hong

Until recently, recovery and analysis of genetic information encoded in ancient DNA sequences from Pleistocene fossils were impossible. Recent advances in molecular biology have provided technical tools to obtain ancient DNA sequences from well-preserved Quaternary fossils and opened the possibility of directly studying genetic changes in fossil species to address various biological and paleontological questions. Ancient DNA studies involving Pleistocene fossil material and ancient DNA degradation and preservation in Quaternary deposits are reviewed. The molecular technology applied to isolate, amplify, and sequence ancient DNA is also presented. Authentication of ancient DNA sequences and technical problems associated with modern and ancient DNA contamination are discussed. As illustrated in recent studies on ancient DNA from proboscideans, it is apparent that fossil DNA sequence data can shed light on many aspects of Quaternary research such as systematics and phylogeny, conservation biology, evolutionary theory, molecular taphonomy, and forensic sciences. Improvement of molecular techniques and a better understanding of DNA degradation during fossilization are likely to build on current strengths and to overcome existing problems, making fossil DNA data a unique source of information for Quaternary scientists.
Enantiospecific recognition of DNA sequences by a proflavine Tröger base.

PubMed

Bailly, C; Laine, W; Demeunynck, M; Lhomme, J

2000-07-05

The DNA interaction of a chiral Tröger base derived from proflavine was investigated by DNA melting temperature measurements and complementary biochemical assays. DNase I footprinting experiments demonstrate that the binding of the proflavine-based Tröger base is both enantio- and sequence-specific. The (+)-isomer poorly interacts with DNA in a non-sequence-selective fashion. In sharp contrast, the corresponding (-)-isomer recognizes preferentially certain DNA sequences containing both A·T and G·C base pairs, such as the motifs 5'-GTT·AAC and 5'-ATGA·TCAT. This is the first experimental demonstration that acridine-type Tröger bases can be used for enantiospecific recognition of DNA sequences. Copyright 2000 Academic Press.
Sensitive detection of mercury and copper ions by fluorescent DNA/Ag nanoclusters in guanine-rich DNA hybridization

NASA Astrophysics Data System (ADS)

Peng, Jun; Ling, Jian; Zhang, Xiu-Qing; Bai, Hui-Ping; Zheng, Liyan; Cao, Qiu-E.; Ding, Zhong-Tao

2015-02-01

In this work, we designed a new fluorescent oligonucleotide-stabilized silver nanocluster (DNA/AgNCs) probe for sensitive detection of mercury and copper ions. This probe contains two tailored DNA sequences. One is a signal probe that contains a cytosine-rich sequence template for AgNCs synthesis and a link sequence at both ends. The other is a guanine-rich sequence for signal enhancement with a link sequence complementary to the link sequence of the signal probe. After hybridization, the fluorescence of the hybridized double-strand DNA/AgNCs is enhanced 200-fold, based on the fluorescence enhancement effect of DNA/AgNCs in proximity to a guanine-rich DNA sequence. The double-strand DNA/AgNCs probe is brighter and more stable than the single-strand DNA/AgNCs and, more importantly, can be used as a novel fluorescent probe for detecting mercury and copper ions. Mercury and copper ions in the ranges of 6.0-160.0 and 6-240 nM, respectively, can be detected linearly, with detection limits of 2.1 and 3.4 nM, respectively. Our results indicate that the analytical parameters of the method for mercury and copper ion detection are much better than those obtained using single-strand DNA/AgNCs.
Ab initio DNA synthesis by Bst polymerase in the presence of nicking endonucleases Nt.AlwI, Nb.BbvCI, and Nb.BsmI.

PubMed

Antipova, Valeriya N; Zheleznaya, Lyudmila A; Zyrina, Nadezhda V

2014-08-01

In the absence of added DNA, thermophilic DNA polymerases synthesize, from free dNTPs, double-stranded DNA that consists of numerous repetitive units (ab initio DNA synthesis). The addition of a thermophilic restriction endonuclease (REase) or nicking endonuclease (NEase) effectively stimulates ab initio DNA synthesis and determines the nucleotide sequence of the reaction products. We have found that the NEases Nt.AlwI, Nb.BbvCI, and Nb.BsmI, which have non-palindromic recognition sites, stimulate the synthesis of sequences organized mainly as palindromes. Moreover, the nucleotide sequence of the palindromes appeared to depend on the NEase recognition/cleavage modes. Thus, the heterodimeric Nb.BbvCI stimulated the synthesis of palindromes composed of two recognition sites of this NEase, separated by AT-rich sequences or (A)n (T)m spacers. Palindromic DNA sequences obtained in ab initio DNA synthesis with the monomeric NEases Nb.BsmI and Nt.AlwI contained, along with the sites of these NEases, randomly synthesized sequences consisting of blocks of short repeats. These findings could help in investigating the potential of highly productive ab initio DNA synthesis for creating DNA molecules with a desired sequence. © 2014 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
Mitochondrial genome of the moon jelly Aurelia aurita (Cnidaria, Scyphozoa): A linear DNA molecule encoding a putative DNA-dependent DNA polymerase.

PubMed

Shao, Zhiyong; Graf, Shannon; Chaga, Oleg Y; Lavrov, Dennis V

2006-10-15

The 16,937-nucleotide sequence of the linear mitochondrial DNA (mt-DNA) molecule of the moon jelly Aurelia aurita (Cnidaria, Scyphozoa) - the first mtDNA sequence from the class Scyphozoa and the first sequence of a linear mtDNA from Metazoa - has been determined. This sequence contains genes for 13 energy pathway proteins, small and large subunit rRNAs, and methionine and tryptophan tRNAs. In addition, two open reading frames of 324 and 969 base pairs in length have been found. The deduced amino-acid sequence of one of them, ORF969, displays extensive sequence similarity with the polymerase [but not the exonuclease] domain of family B DNA polymerases, and this ORF has been tentatively identified as dnab. This is the first report of dnab in animal mtDNA. The genes in A. aurita mtDNA are arranged in two clusters with opposite transcriptional polarities, with transcription proceeding toward the ends of the molecule. The determined sequences at the ends of the molecule are nearly identical but inverted and lack any obvious potential secondary structures or telomere-like repeat elements. The acquisition of mitochondrial genomic data for the second class of Cnidaria allows us to reconstruct characteristic features of mitochondrial evolution in this animal phylum.
Recent patents of nanopore DNA sequencing technology: progress and challenges.

PubMed

Zhou, Jianfeng; Xu, Bingqian

2010-11-01

DNA sequencing techniques have developed rapidly in recent decades, primarily driven by the Human Genome Project. Among the proposed new techniques, the nanopore has been considered a suitable candidate for single-molecule DNA sequencing with ultrahigh speed and very low cost. Several fabrication and modification techniques have been developed to produce robust and well-defined nanopore devices. Many efforts have also been made to apply nanopores to analyzing the properties of DNA molecules. Compared with traditional sequencing techniques, the nanopore has demonstrated distinct advantages in the main practical issues, such as sample preparation, sequencing speed, cost-effectiveness and read length. Although challenges remain, recent research on improving the capabilities of nanopores has shed light on how to achieve the ultimate goal: sequencing an individual DNA strand at the single-nucleotide level. This patent review briefly highlights recent developments and technological achievements for DNA analysis and sequencing at the single-molecule level, focusing on nanopore-based methods.
Small tandemly repeated DNA sequences of higher plants likely originate from a tRNA gene ancestor.

PubMed Central

Benslimane, A A; Dron, M; Hartmann, C; Rode, A

1986-01-01

Several monomers (177 bp) of a tandemly arranged repetitive nuclear DNA sequence of Brassica oleracea have been cloned and sequenced. They share up to 95% homology with one another and up to 80% with other satellite DNA sequences of Cruciferae, suggesting a common ancestor. Both strands of these monomers show more than 50% homology with many tRNA genes; the best homologies have been obtained with Lys and His yeast mitochondrial tRNA genes (64% and 60%, respectively). These results suggest that small tandemly repeated DNA sequences of plants may have evolved from a tRNA gene ancestor. These tandem repeats have probably arisen via a process involving reverse transcription of polymerase III RNA intermediates, as is the case for interspersed DNA sequences of mammals. A model is proposed to explain the formation of such small tandemly repeated DNA sequences. PMID:3774553
Integration of Myeloblastosis Associated Virus proviral sequences occurs in the vicinity of genes encoding signaling proteins and regulators of cell proliferation

PubMed Central

Li, Chang Long; Coullin, Philippe; Bernheim, Alain; Joliot, Véronique; Auffray, Charles; Zoroob, Rima; Perbal, Bernard

2006-01-01

Aims: Myeloblastosis Associated Virus type 1 (N) [MAV 1(N)] specifically induces nephroblastomas within 8–10 weeks when injected into newborn chickens. The MAV-induced nephroblastomas constitute a unique animal model of the pediatric Wilms' tumor. We have made use of three independent nephroblastomas that represent increasing tumor grades to identify the host DNA regions in which MAV proviral sequences were integrated. Methods: Cellular sequences localized next to MAV-integration sites in the tumor DNAs were used to screen a Bacterial Artificial Chromosomes (BACs) library and isolate BACs containing about 150 kilobases of normal DNA corresponding to MAV integration regions (MIRs). These BACs were mapped on the chicken chromosomes by Fluorescent In Situ Hybridization (FISH) and used for molecular studies. Results: The different MAV integration sites that were conserved after tumor cell selection identify genes involved in the control of cell signaling and proliferation. Syntenic fragments in human DNA contain genes whose products have been involved in normal and pathological kidney development, and several oncogenes responsible for tumorigenesis in humans. Conclusion: The identification of putative target genes for MAV provides important clues for the understanding of the MAV pathogenic potential. These studies identified ADAMTS1 as a gene upregulated in MAV-induced nephroblastoma and established that ccn3/nov is not a preferential site of integration for MAV as previously thought. The present results support our hypothesis that the highly efficient and specific MAV-induced tumorigenesis results from the alteration of multiple target genes in differentiating blastemal cells, some of which are required for the progression to highly aggressive stages. This study reinforces our previous conclusions that the MAV-induced nephroblastoma constitutes an excellent model in which to characterize new potential oncogenes and tumor suppressors involved in the establishment and maintenance of tumors. PMID:16403231
Next-Generation Sequencing Platforms

NASA Astrophysics Data System (ADS)

Mardis, Elaine R.

2013-06-01

Automated DNA sequencing instruments embody an elegant interplay among chemistry, engineering, software, and molecular biology and have built upon Sanger's founding discovery of dideoxynucleotide sequencing to perform once-unfathomable tasks. Combined with innovative physical mapping approaches that helped to establish long-range relationships between cloned stretches of genomic DNA, fluorescent DNA sequencers produced reference genome sequences for model organisms and for the reference human genome. New types of sequencing instruments that permit amazing acceleration of data-collection rates for DNA sequencing have been developed. The ability to generate genome-scale data sets is now transforming the nature of biological inquiry. Here, I provide an historical perspective of the field, focusing on the fundamental developments that predated the advent of next-generation sequencing instruments and providing information about how these instruments work, their application to biological research, and the newest types of sequencers that can extract data from single DNA molecules.
Regulatory link between DNA methylation and active demethylation in Arabidopsis

PubMed Central

Lei, Mingguang; Zhang, Huiming; Julian, Russell; Tang, Kai; Xie, Shaojun; Zhu, Jian-Kang

2015-01-01

De novo DNA methylation through the RNA-directed DNA methylation (RdDM) pathway and active DNA demethylation play important roles in controlling genome-wide DNA methylation patterns in plants. Little is known about how cells manage the balance between DNA methylation and active demethylation activities. Here, we report the identification of a unique RdDM target sequence, where DNA methylation is required for maintaining proper active DNA demethylation of the Arabidopsis genome. In a genetic screen for cellular antisilencing factors, we isolated several REPRESSOR OF SILENCING 1 (ros1) mutant alleles, as well as many RdDM mutants, which showed drastically reduced ROS1 gene expression and, consequently, transcriptional silencing of two reporter genes. A helitron transposon element (TE) in the ROS1 gene promoter negatively controls ROS1 expression, whereas DNA methylation of an RdDM target sequence between ROS1 5′ UTR and the promoter TE region antagonizes this helitron TE in regulating ROS1 expression. This RdDM target sequence is also targeted by ROS1, and defective DNA demethylation in loss-of-function ros1 mutant alleles causes DNA hypermethylation of this sequence and concomitantly causes increased ROS1 expression. Our results suggest that this sequence in the ROS1 promoter region serves as a DNA methylation monitoring sequence (MEMS) that senses DNA methylation and active DNA demethylation activities. Therefore, the ROS1 promoter functions like a thermostat (i.e., methylstat) to sense DNA methylation levels and regulates DNA methylation by controlling ROS1 expression. PMID:25733903
Efficient engineering of chromosomal ribosome binding site libraries in mismatch repair proficient Escherichia coli.

PubMed

Oesterle, Sabine; Gerngross, Daniel; Schmitt, Steven; Roberts, Tania Michelle; Panke, Sven

2017-09-26

Multiplexed gene expression optimization via modulation of gene translation efficiency through ribosome binding site (RBS) engineering is a valuable approach for optimizing artificial properties in bacteria, ranging from genetic circuits to production pathways. Established algorithms design smart RBS-libraries based on a single partially-degenerate sequence that efficiently samples the entire space of translation initiation rates. However, the sequence space that is accessible when integrating the library by CRISPR/Cas9-based genome editing is severely restricted by DNA mismatch repair (MMR) systems. MMR efficiency depends on the type and length of the mismatch and thus effectively removes potential library members from the pool. Rather than working in MMR-deficient strains, which accumulate off-target mutations, or depending on temporary MMR inactivation, which requires additional steps, we eliminate this limitation by developing a pre-selection rule of genome-library-optimized-sequences (GLOS) that enables introducing large functional diversity into MMR-proficient strains with sequences that are no longer subject to MMR-processing. We implement several GLOS-libraries in Escherichia coli and show that GLOS-libraries indeed retain diversity during genome editing and that such libraries can be used in complex genome editing operations such as concomitant deletions. We argue that this approach allows for stable and efficient fine tuning of chromosomal functions with minimal effort.
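The idea of a library defined by "a single partially-degenerate sequence" can be illustrated by enumerating every concrete sequence that such a degenerate string encodes. The sketch below does only that enumeration; it does not implement the GLOS pre-selection rule or any mismatch-repair model from the paper, and the name `expand_degenerate` and the example sequence are hypothetical.

```python
from itertools import product

# Standard IUPAC nucleotide codes and the bases each one represents.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand_degenerate(sequence):
    """List every concrete DNA sequence encoded by a degenerate string."""
    choices = [IUPAC[symbol] for symbol in sequence]
    return ["".join(bases) for bases in product(*choices)]


# A made-up degenerate stretch, not an RBS design from the study.
library = expand_degenerate("AGGRNY")
library_size = len(library)
```

The library size is the product of the per-position degeneracies, so adding degenerate positions grows the library multiplicatively; any filter such as the rule described in the abstract would then act on this enumerated set.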
Phylogeny and active ingredients of artificial Ophiocordyceps lanpingensis ascomata

NASA Astrophysics Data System (ADS)

Chen, Zihong; Xu, Ling; Yu, Hong; Zeng, Wenbo; Dai, Yongdong; Wang, Yuanbing

2018-04-01

This study evaluated the morphological characters, phylogeny and functional components of artificial Ophiocordyceps lanpingensis, a species related to O. sinensis. The ascomata of O. lanpingensis were induced from its asexual strain, HLANY0707, and their microscopic features were described. Phylogeny was analyzed with ITS-5.8S sequences of HLANY0707, its cultured stroma, and 39 related sequences of Hirsutella and Ophiocordyceps, based on a maximum likelihood tree. Six nucleosides of artificial O. lanpingensis, natural O. lanpingensis and natural O. sinensis were compared by HPLC analysis. Artificial ascomata of O. lanpingensis could be produced in large quantities with HLANY0707 and had microscopic features similar to those of natural specimens. Phylogenetic analysis showed that both the artificial and natural O. lanpingensis were closely related to O. sinensis, O. xuefengensis, H. uncinata and O. robertsii, species for which mass-cultured ascomata have not been reported. The nucleosides of artificial O. lanpingensis were very similar to those of natural O. sinensis, implying a promising application prospect for artificial O. lanpingensis as an alternative to O. sinensis. The results point to a promising way to develop artificial O. lanpingensis and conserve the rare and endangered species O. sinensis.

Attomole-level Genomics with Single-molecule Direct DNA, cDNA and RNA Sequencing Technologies.

PubMed

Ozsolak, Fatih

2016-01-01

With the introduction of next-generation sequencing (NGS) technologies in 2005, the domination of microarrays in genomics quickly came to an end due to NGS's superior technical performance and cost advantages. By enabling genetic analysis capabilities that were not possible previously, NGS technologies have started to play an integral role in all areas of biomedical research. This chapter outlines the low-quantity DNA and cDNA sequencing capabilities and applications developed with the Helicos single molecule DNA sequencing technology.
A cDNA from a mouse pancreatic beta cell encoding a putative transcription factor of the insulin gene.

PubMed Central

Walker, M D; Park, C W; Rosen, A; Aronheim, A

1990-01-01

Cell specific expression of the insulin gene is achieved through transcriptional mechanisms operating on multiple DNA sequence elements located in the 5' flanking region of the gene. Of particular importance in the rat insulin I gene are two closely similar 9 bp sequences (IEB1 and IEB2): mutation of either of these leads to 5-10 fold reduction in transcriptional activity. We have screened an expression cDNA library derived from mouse pancreatic endocrine beta cells with a radioactive DNA probe containing multiple copies of the IEB1 sequence. A cDNA clone (A1) isolated by this procedure encodes a protein which shows efficient binding to the IEB1 probe, but much weaker binding to either an unrelated DNA probe or to a probe bearing a single base pair insertion within the recognition sequence. DNA sequence analysis indicates a protein belonging to the helix-loop-helix family of DNA-binding proteins. The ability of the protein encoded by clone A1 to recognize a number of wild type and mutant DNA sequences correlates closely with the ability of each sequence element to support transcription in vivo in the context of the insulin 5' flanking DNA. We conclude that the isolated cDNA may encode a transcription factor that participates in control of insulin gene expression. PMID:2181401
Highly multiplexed targeted DNA sequencing from single nuclei.

PubMed

Leung, Marco L; Wang, Yong; Kim, Charissa; Gao, Ruli; Jiang, Jerry; Sei, Emi; Navin, Nicholas E

2016-02-01

Single-cell DNA sequencing methods are challenged by poor physical coverage, high technical error rates and low throughput. To address these issues, we developed a single-cell DNA sequencing protocol that combines flow-sorting of single nuclei, time-limited multiple-displacement amplification (MDA), low-input library preparation, DNA barcoding, targeted capture and next-generation sequencing (NGS). This approach represents a major improvement over our previous single nucleus sequencing (SNS) Nature Protocols paper in terms of generating higher-coverage data (>90%), thereby enabling the detection of genome-wide variants in single mammalian cells at base-pair resolution. Furthermore, by pooling 48-96 single-cell libraries together for targeted capture, this approach can be used to sequence many single-cell libraries in parallel in a single reaction. This protocol greatly reduces the cost of single-cell DNA sequencing, and it can be completed in 5-6 d by advanced users. This single-cell DNA sequencing protocol has broad applications for studying rare cells and complex populations in diverse fields of biological research and medicine.
Synthesis and Evolution of a Threose Nucleic Acid Aptamer Bearing 7-Deaza-7-Substituted Guanosine Residues.

PubMed

Mei, Hui; Liao, Jen-Yu; Jimenez, Randi M; Wang, Yajun; Bala, Saikat; McCloskey, Cailen; Switzer, Christopher; Chaput, John C

2018-05-02

In vitro selection experiments carried out on artificial genetic polymers require robust and faithful methods for copying genetic information back and forth between DNA and xeno-nucleic acids (XNA). Previously, we have shown that Kod-RI, an engineered polymerase developed to transcribe DNA templates into threose nucleic acid (TNA), can function with high fidelity in the absence of manganese ions. However, the transcriptional efficiency of this enzyme diminishes greatly when individual templates are replaced with libraries of DNA sequences, indicating that manganese ions are still required for in vitro selection. Unfortunately, the presence of manganese ions in the transcription mixture leads to the misincorporation of tGTP nucleotides opposite dG residues in the templating strand, which are detected as G-to-C transversions when the TNA is reverse transcribed back into DNA. Here we report the synthesis and fidelity of TNA replication using 7-deaza-7-modified guanosine base analogues in the DNA template and incoming TNA nucleoside triphosphate. Our findings reveal that tGTP misincorporation occurs via a Hoogsteen base pair in which the incoming tGTP residue adopts a syn conformation with respect to the sugar. Substitution of tGTP for 7-deaza-7-phenyl tGTP enabled the synthesis of TNA polymers with >99% overall fidelity. A TNA library containing the 7-deaza-7-phenyl guanine analogue was used to evolve a biologically stable TNA aptamer that binds to HIV reverse transcriptase with low nanomolar affinity.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies

DOE Office of Scientific and Technical Information (OSTI.GOV)

Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Jr., Richard A.

This study characterized regions of DNA which remained unassembled by either PacBio or Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as the cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Furthermore, our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies

DOE PAGES

Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Jr., Richard A.; ...

2017-07-18

This study characterized regions of DNA which remained unassembled by either PacBio or Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as the cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Furthermore, our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies

PubMed Central

Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Richard A.; Brown, Steven D.

2017-01-01

This study characterized regions of DNA which remained unassembled by either PacBio or Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as the cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences. PMID:28769883
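GC content is one of the gap-sequence properties the abstract above lists. As an illustration only, the following minimal Python sketch computes GC content for a whole sequence and for sliding windows. The function names, the window and step parameters, and the choice to ignore ambiguous bases such as N are assumptions made here and are not taken from the study.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C among unambiguous A/C/G/T bases."""
    seq = seq.upper()
    counted = sum(seq.count(base) for base in "ACGT")
    if counted == 0:
        # No unambiguous bases (for example an all-N gap): report 0.0.
        return 0.0
    return (seq.count("G") + seq.count("C")) / counted


def windowed_gc(seq: str, window: int, step: int):
    """Yield (start, gc_fraction) for windows of the given size along seq."""
    last_start = max(len(seq) - window + 1, 1)
    for start in range(0, last_start, step):
        yield start, gc_content(seq[start:start + window])
```

A windowed profile of this kind is one simple way to see whether a gap region differs in composition from its flanking sequence.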
Mapping the binding site of aflatoxin B1 in DNA: systematic analysis of the reactivity of aflatoxin B1 with guanines in different DNA sequences

DOE Office of Scientific and Technical Information (OSTI.GOV)

Benasutti, M.; Ejadi, S.; Whitlow, M.D.

The mutagenic and carcinogenic chemical aflatoxin B1 (AFB1) reacts almost exclusively at the N(7)-position of guanine following activation to its reactive form, the 8,9-epoxide (AFB1 oxide). In general N(7)-guanine adducts yield DNA strand breaks when heated in base, a property that serves as the basis for the Maxam-Gilbert DNA sequencing reaction specific for guanine. Using DNA sequencing methods, other workers have shown that AFB1 oxide gives strand breaks at positions of guanines; however, the guanine bands varied in intensity. This phenomenon has been used to infer that AFB1 oxide prefers to react with guanines in some sequence contexts more than in others and has been referred to as sequence specificity of binding. Herein, data on the reaction of AFB1 oxide with several synthetic DNA polymers with different sequences are presented, and (following hydrolysis) adduct levels are determined by high-pressure liquid chromatography. These results reveal that for AFB1 oxide (1) the N(7)-guanine adduct is the major adduct found in all of the DNA polymers, (2) adduct levels vary in different sequences, and, thus, sequence specificity is also observed by this more direct method, and (3) the intensity of bands in DNA sequencing gels is likely to reflect adduct levels formed at the N(7)-position of guanine. Knowing this, a reinvestigation of the reactivity of guanines in different DNA sequences using DNA sequencing methods was undertaken. Methods are developed to determine the X (5'-side) base and the Y (3'-side) base are most influential in determining guanine reactivity. These rules in conjunction with molecular modeling studies were used to assess the binding sites that might be utilized by AFB1 oxide in its reaction with DNA.
Chromosome specific repetitive DNA sequences

DOEpatents

Moyzis, Robert K.; Meyne, Julianne

1991-01-01

A method is provided for determining specific nucleotide sequences useful in forming a probe which can identify specific chromosomes, preferably through in situ hybridization within the cell itself. In one embodiment, chromosome preferential nucleotide sequences are first determined from a library of recombinant DNA clones having families of repetitive sequences. Library clones are identified with a low homology with a sequence of repetitive DNA families to which the first clones respectively belong and variant sequences are then identified by selecting clones having a pattern of hybridization with genomic DNA dissimilar to the hybridization pattern shown by the respective families. In another embodiment, variant sequences are selected from a sequence of a known repetitive DNA family. The selected variant sequence is classified as chromosome specific, chromosome preferential, or chromosome nonspecific. Sequences which are classified as chromosome preferential are further sequenced and regions are identified having a low homology with other regions of the chromosome preferential sequence or with known sequences of other family members. This invention is the result of a contract with the Department of Energy (Contract No. W-7405-ENG-36).
Affordable Hands-On DNA Sequencing and Genotyping: An Exercise for Teaching DNA Analysis to Undergraduates

ERIC Educational Resources Information Center

Shah, Kushani; Thomas, Shelby; Stein, Arnold

2013-01-01

In this report, we describe a 5-week laboratory exercise for undergraduate biology and biochemistry students in which students learn to sequence DNA and to genotype their DNA for selected single nucleotide polymorphisms (SNPs). Students use miniaturized DNA sequencing gels that require approximately 8 min to run. The students perform G, A, T, C…
Synthetic spike-in standards for high-throughput 16S rRNA gene amplicon sequencing.

PubMed

Tourlousse, Dieter M; Yoshiike, Satowa; Ohashi, Akiko; Matsukura, Satoko; Noda, Naohiro; Sekiguchi, Yuji

2017-02-28

High-throughput sequencing of 16S rRNA gene amplicons (16S-seq) has become a widely deployed method for profiling complex microbial communities but technical pitfalls related to data reliability and quantification remain to be fully addressed. In this work, we have developed and implemented a set of synthetic 16S rRNA genes to serve as universal spike-in standards for 16S-seq experiments. The spike-ins represent full-length 16S rRNA genes containing artificial variable regions with negligible identity to known nucleotide sequences, permitting unambiguous identification of spike-in sequences in 16S-seq read data from any microbiome sample. Using defined mock communities and environmental microbiota, we characterized the performance of the spike-in standards and demonstrated their utility for evaluating data quality on a per-sample basis. Further, we showed that staggered spike-in mixtures added at the point of DNA extraction enable concurrent estimation of absolute microbial abundances suitable for comparative analysis. Results also underscored that template-specific Illumina sequencing artifacts may lead to biases in the perceived abundance of certain taxa. Taken together, the spike-in standards represent a novel bioanalytical tool that can substantially improve 16S-seq-based microbiome studies by enabling comprehensive quality control along with absolute quantification. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
An artificial intelligence approach fit for tRNA gene studies in the era of big sequence data.

PubMed

Iwasaki, Yuki; Abe, Takashi; Wada, Kennosuke; Wada, Yoshiko; Ikemura, Toshimichi

2017-09-12

Unsupervised data mining capable of extracting a wide range of knowledge from big data without prior knowledge or particular models is a timely application in the era of big sequence data accumulation in genome research. By handling oligonucleotide compositions as high-dimensional data, we have previously modified the conventional self-organizing map (SOM) for genome informatics and established BLSOM, which can analyze more than ten million sequences simultaneously. Here, we develop BLSOM specialized for tRNA genes (tDNAs) that can cluster (self-organize) more than one million microbial tDNAs according to their cognate amino acid solely depending on tetra- and pentanucleotide compositions. This unsupervised clustering can reveal combinatorial oligonucleotide motifs that are responsible for the amino acid-dependent clustering, as well as other functionally and structurally important consensus motifs, which have been evolutionarily conserved. BLSOM is also useful for identifying tDNAs as phylogenetic markers for special phylotypes. When we constructed BLSOM with 'species-unknown' tDNAs from metagenomic sequences plus 'species-known' microbial tDNAs, a large portion of metagenomic tDNAs self-organized with species-known tDNAs, yielding information on microbial communities in environmental samples. BLSOM can also enhance accuracy in the tDNA database obtained from big sequence data. This unsupervised data mining should become important for studying numerous functionally unclear RNAs obtained from a wide range of organisms.
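The abstract above treats oligonucleotide composition as high-dimensional data. As a hedged illustration of that input representation only (not of BLSOM itself, whose training procedure is not described here), a sequence can be turned into a fixed-length frequency vector. The function name and the decision to skip windows containing ambiguous bases are assumptions of this sketch.

```python
from collections import Counter
from itertools import product


def kmer_frequencies(seq: str, k: int = 4) -> list[float]:
    """Return the frequency of every A/C/G/T k-mer, in lexicographic order.

    With k=4 the vector has 4**4 = 256 dimensions (tetranucleotides);
    with k=5 it has 1024 (pentanucleotides).
    """
    seq = seq.upper()
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    # Windows containing other characters (e.g. N) are not in `kmers`
    # and therefore do not contribute to the total.
    total = sum(counts[km] for km in kmers)
    return [counts[km] / total if total else 0.0 for km in kmers]
```

Vectors of this kind, one per sequence, are the sort of input a self-organizing map or any other clustering method could consume.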
DNA Barcode Goes Two-Dimensions: DNA QR Code Web Server

PubMed Central

Li, Huan; Xing, Hang; Liang, Dong; Jiang, Kun; Pang, Xiaohui; Song, Jingyuan; Chen, Shilin

2012-01-01

The DNA barcoding technology uses a standard region of DNA sequence for species identification and discovery. At present, “DNA barcode” actually refers to DNA sequences, which are not amenable to information storage, recognition, and retrieval. Our aim is to identify the best symbology that can represent DNA barcode sequences in practical applications. A comprehensive set of sequences for five DNA barcode markers ITS2, rbcL, matK, psbA-trnH, and CO1 was used as the test data. Fifty-three different types of one-dimensional and ten two-dimensional barcode symbologies were compared based on different criteria, such as coding capacity, compression efficiency, and error detection ability. The quick response (QR) code was found to have the largest coding capacity and relatively high compression ratio. To facilitate the further usage of QR code-based DNA barcodes, a web server was developed and is accessible at http://qrfordna.dnsalias.org. The web server allows users to retrieve the QR code for a species of interests, convert a DNA sequence to and from a QR code, and perform species identification based on local and global sequence similarities. In summary, the first comprehensive evaluation of various barcode symbologies has been carried out. The QR code has been found to be the most appropriate symbology for DNA barcode sequences. A web server has also been constructed to allow biologists to utilize QR codes in practical DNA barcoding applications. PMID:22574113
High-resolution characterization of sequence signatures due to non-random cleavage of cell-free DNA.

PubMed

Chandrananda, Dineika; Thorne, Natalie P; Bahlo, Melanie

2015-06-17

High-throughput sequencing of cell-free DNA fragments found in human plasma has been used to non-invasively detect fetal aneuploidy, monitor organ transplants and investigate tumor DNA. However, many biological properties of this extracellular genetic material remain unknown. Research that further characterizes circulating DNA could substantially increase its diagnostic value by allowing the application of more sophisticated bioinformatics tools that lead to an improved signal to noise ratio in the sequencing data. In this study, we investigate various features of cell-free DNA in plasma using deep-sequencing data from two pregnant women (>70X, >50X) and compare them with matched cellular DNA. We utilize a descriptive approach to examine how the biological cleavage of cell-free DNA affects different sequence signatures such as fragment lengths, sequence motifs at fragment ends and the distribution of cleavage sites along the genome. We show that the size distributions of these cell-free DNA molecules are dependent on their autosomal and mitochondrial origin as well as the genomic location within chromosomes. DNA mapping to particular microsatellites and alpha repeat elements display unique size signatures. We show how cell-free fragments occur in clusters along the genome, localizing to nucleosomal arrays and are preferentially cleaved at linker regions by correlating the mapping locations of these fragments with ENCODE annotation of chromatin organization. Our work further demonstrates that cell-free autosomal DNA cleavage is sequence dependent. The region spanning up to 10 positions on either side of the DNA cleavage site show a consistent pattern of preference for specific nucleotides. This sequence motif is present in cleavage sites localized to nucleosomal cores and linker regions but is absent in nucleosome-free mitochondrial DNA. These background signals in cell-free DNA sequencing data stem from the non-random biological cleavage of these fragments. 
This sequence structure can be harnessed to improve bioinformatics algorithms, in particular for CNV and structural variant detection. Descriptive measures for cell-free DNA features developed here could also be used in biomarker analysis to monitor the changes that occur during different pathological conditions.
Candidate gene database and transcript map for peach, a model species for fruit trees.

PubMed

Horn, Renate; Lecouls, Anne-Claire; Callahan, Ann; Dandekar, Abhaya; Garay, Lilibeth; McCord, Per; Howad, Werner; Chan, Helen; Verde, Ignazio; Main, Doreen; Jung, Sook; Georgi, Laura; Forrest, Sam; Mook, Jennifer; Zhebentyayeva, Tatyana; Yu, Yeisoo; Kim, Hye Ran; Jesudurai, Christopher; Sosinski, Bryon; Arús, Pere; Baird, Vance; Parfitt, Dan; Reighard, Gregory; Scorza, Ralph; Tomkins, Jeffrey; Wing, Rod; Abbott, Albert Glenn

2005-05-01

Peach (Prunus persica) is a model species for the Rosaceae, which includes a number of economically important fruit tree species. To develop an extensive Prunus expressed sequence tag (EST) database for identifying and cloning the genes important to fruit and tree development, we generated 9,984 high-quality ESTs from a peach cDNA library of developing fruit mesocarp. After assembly and annotation, a putative peach unigene set consisting of 3,842 ESTs was defined. Gene ontology (GO) classification was assigned based on the annotation of the single "best hit" match against the Swiss-Prot database. No significant homology could be found in the GenBank nr databases for 24.3% of the sequences. Using core markers from the general Prunus genetic map, we anchored bacterial artificial chromosome (BAC) clones on the genetic map, thereby providing a framework for the construction of a physical and transcript map. A transcript map was developed by hybridizing 1,236 ESTs from the putative peach unigene set and an additional 68 peach cDNA clones against the peach BAC library. Hybridizing ESTs to genetically anchored BACs immediately localized 11.2% of the ESTs on the genetic map. ESTs showed a clustering of expressed genes in defined regions of the linkage groups. [The data were built into a regularly updated Genome Database for Rosaceae (GDR), available at (http://www.genome.clemson.edu/gdr/).].
Analysis of DNA Sequences by An Optical Time-Integrating Correlator: Proof-Of-Concept Experiments.

DTIC Science & Technology

1992-05-01

TABLES; LIST OF ABBREVIATIONS; 1.0 INTRODUCTION; 2.0 DNA ANALYSIS STRATEGY; 2.1 Representation of DNA Bases; 2.2 DNA Analysis Strategy; 3.0...Zehnder architecture. Figure 3: Short representations of the DNA bases where each base is represented by a 7-bit pseudorandom sequence. ... DNA bases where each base is represented by 7-bit pseudorandom sequences. Table 2: Long representations of the DNA bases with 255-bits maximum
SNP discovery through de novo deep sequencing using the next generation of DNA sequencers

USDA-ARS's Scientific Manuscript database

The production of high volumes of DNA sequence data using new technologies has permitted more efficient identification of single nucleotide polymorphisms in vertebrate genomes. This chapter presented practical methodology for production and analysis of DNA sequence data for SNP discovery....
A simple procedure for parallel sequence analysis of both strands of 5'-labeled DNA.

PubMed

Razvi, F; Gargiulo, G; Worcel, A

1983-08-01

Ligation of a 5'-labeled DNA restriction fragment results in a circular DNA molecule carrying the two 32Ps at the reformed restriction site. Double digestions of the circular DNA with the original enzyme and a second restriction enzyme cleavage near the labeled site allows direct chemical sequencing of one 5'-labeled DNA strand. Similar double digestions, using an isoschizomer that cleaves differently at the 32P-labeled site, allows direct sequencing of the now 3'-labeled complementary DNA strand. It is possible to directly sequence both strands of cloned DNA inserts by using the above protocol and a multiple cloning site vector that provides the necessary restriction sites. The simultaneous and parallel visualization of both DNA strands eliminates sequence ambiguities. In addition, the labeled circular molecules are particularly useful for single-hit DNA cleavage studies and DNA footprint analysis. As an example, we show here an analysis of the micrococcal nuclease-induced breaks on the two strands of the somatic 5S RNA gene of Xenopus borealis, which suggests that the enzyme may recognize and cleave small AT-containing palindromes along the DNA helix.
A Glimpse into the Satellite DNA Library in Characidae Fish (Teleostei, Characiformes)

PubMed Central

Utsunomia, Ricardo; Ruiz-Ruano, Francisco J.; Silva, Duílio M. Z. A.; Serrano, Érica A.; Rosa, Ivana F.; Scudeler, Patrícia E. S.; Hashimoto, Diogo T.; Oliveira, Claudio; Camacho, Juan Pedro M.; Foresti, Fausto

2017-01-01

Satellite DNA (satDNA) is an abundant fraction of repetitive DNA in eukaryotic genomes and plays an important role in genome organization and evolution. In general, satDNA sequences follow a concerted evolutionary pattern through the intragenomic homogenization of different repeat units. In addition, the satDNA library hypothesis predicts that related species share a series of satDNA variants descended from a common ancestor species, with differential amplification of different satDNA variants. The finding of the same satDNA family in species belonging to different genera within Characidae fish provided the opportunity to test both concerted evolution and library hypotheses. For this purpose, we analyzed here sequence variation and abundance of this satDNA family in ten species, by a combination of next generation sequencing (NGS), PCR and Sanger sequencing, and fluorescence in situ hybridization (FISH). We found extensive between-species variation for the number and size of pericentromeric FISH signals. At the genomic level, the analysis of thousands of DNA sequences obtained by Illumina sequencing and PCR amplification allowed defining 150 haplotypes which were linked in a common minimum spanning tree, where different patterns of concerted evolution were apparent. This also provided a glimpse into the satDNA library of this group of species. Consistent with the library hypothesis, different variants for this satDNA showed high differences in abundance between species, from highly abundant to simply relictual variants. PMID:28855916
Short, interspersed, and repetitive DNA sequences in Spiroplasma species.

PubMed

Nur, I; LeBlanc, D J; Tully, J G

1987-03-01

Small fragments of DNA from an 8-kbp plasmid, pRA1, from a plant pathogenic strain of Spiroplasma citri were shown previously to be present in the chromosomal DNA of at least two species of Spiroplasma. We describe here the shot-gun cloning of chromosomal DNA from S. citri Maroc and the identification of two distinct sequences exhibiting homology to pRA1. Further subcloning experiments provided specific molecular probes for the identification of these two sequences in chromosomal DNA from three distinct plant pathogenic species of Spiroplasma. The results of Southern blot hybridization indicated that each of the pRA1-associated sequences is present as multiple copies in short, dispersed, and repetitive sequences in the chromosomes of these three strains. None of the sequences was detectable in chromosomal DNA from an additional nine Spiroplasma strains examined.

Laser Desorption Mass Spectrometry for DNA Sequencing and Analysis

NASA Astrophysics Data System (ADS)

Chen, C. H. Winston; Taranenko, N. I.; Golovlev, V. V.; Isola, N. R.; Allman, S. L.

1998-03-01

Rapid DNA sequencing and/or analysis is critically important for biomedical research. In the past, gel electrophoresis has been the primary tool to achieve DNA analysis and sequencing. However, gel electrophoresis is a time-consuming and labor-intensive process. Recently, we have developed and used laser desorption mass spectrometry (LDMS) to achieve sequencing of ss-DNA longer than 100 nucleotides. With LDMS, we succeeded in sequencing DNA in seconds instead of the hours or days required by gel electrophoresis. In addition to sequencing, we also applied LDMS for the detection of DNA probes for hybridization. LDMS was also used to detect short tandem repeats for forensic applications. Clinical applications for disease diagnosis such as cystic fibrosis caused by base deletion and point mutation have also been demonstrated. Experimental details will be presented at the meeting.
Constructing DNA Barcode Sets Based on Particle Swarm Optimization.

PubMed

Wang, Bin; Zheng, Xuedong; Zhou, Shihua; Zhou, Changjun; Wei, Xiaopeng; Zhang, Qiang; Wei, Ziqi

2018-01-01

Following the completion of the human genome project, a large amount of high-throughput bio-data was generated. To analyze these data, massively parallel sequencing, namely next-generation sequencing, was rapidly developed. DNA barcodes are used to identify the ownership between sequences and samples when they are attached at the beginning or end of sequencing reads. Constructing DNA barcode sets provides the candidate DNA barcodes for this application. To increase the accuracy of DNA barcode sets, a particle swarm optimization (PSO) algorithm has been modified and used to construct the DNA barcode sets in this paper. Compared with the extant results, some lower bounds of DNA barcode sets are improved. The results show that the proposed algorithm is effective in constructing DNA barcode sets.
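The abstract above does not state which constraints its barcode sets must satisfy. Purely as an illustration of what constructing a barcode set can mean, the sketch below assumes a single constraint, a minimum pairwise Hamming distance, and uses a simple greedy scan rather than the particle swarm optimization the paper describes. The function names and parameters are hypothetical.

```python
from itertools import product


def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def greedy_barcodes(length: int, min_dist: int) -> list[str]:
    """Greedily collect barcodes whose pairwise Hamming distance is >= min_dist.

    Candidates are scanned in lexicographic order over A/C/G/T, so the
    result is a valid set but not necessarily a largest possible one.
    This is a baseline sketch, not the PSO method from the abstract.
    """
    chosen: list[str] = []
    for candidate in product("ACGT", repeat=length):
        code = "".join(candidate)
        if all(hamming(code, other) >= min_dist for other in chosen):
            chosen.append(code)
    return chosen
```

The size of the set such a baseline returns gives a lower bound for the chosen length and distance, which is the kind of quantity an optimization method would try to improve.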
Measurements of nonlinear Hall-driven reconnection in the reversed field pinch

NASA Astrophysics Data System (ADS)

Tharp, Timothy D.

Complex organisms are able to develop because of the complex regulatory systems that control their gene expression. The first step in this regulation, transcription initiation, is controlled by transcription factors. Transcription factors are modular proteins composed of two distinct domains, the DNA binding domain and the regulatory domain. These molecules are involved in a plethora of important biological processes including embryogenesis, development, cell health, and cancer. Tissue enriched transcription factors Nkx-2.5 and Gata4 are involved in cardiac development and cardiac health. In this thesis the DNA binding specificity of Nkx-2.5 will be analyzed using a high throughput double stranded DNA platform called Cognate Site Identifier (CSI) arrays (Chapter 2). The full DNA binding specificity of Nkx-2.5 and Nkx-2.5 mutants will be visualized using Sequence Specificity Landscapes (SSLs). In Chapter 3, the definition of binding specificity will be investigated by evaluating a number of different DNA binding folds by CSI and SSLs. CSI and SSLs will also be used to evaluate different pyrrole/imidazole hairpin polyamides in order to better characterize these small molecule DNA binding domains. CSI and SSL data will be applied to the genome in order to explain the biological function of an artificial transcription factor. Chapter 4 will discuss the mechanism of nonspecific DNA binding. The historical means of predicting DNA binding will be challenged by utilizing high throughput experiments. The effect of salt concentration on both specific and nonspecific binding will also be investigated. Finally, in Chapter 5, a generation of Protein DNA Dimerizer will be discussed. A PDD that regulates transcription on genomic DNA by binding cooperatively with the heart IF Gata4 will be characterized.
These studies provide understanding of, and a means to control, how transcription factors sample the endless sea of DNA in the genome in order to regulate gene expression with such wonderful specificity.
Winnowing DNA for rare sequences: highly specific sequence and methylation based enrichment.

PubMed

Thompson, Jason D; Shibahara, Gosuke; Rajan, Sweta; Pel, Joel; Marziali, Andre

2012-01-01

Rare mutations in cell populations are known to be hallmarks of many diseases and cancers. Similarly, differential DNA methylation patterns arise in rare cell populations with diagnostic potential such as fetal cells circulating in maternal blood. Unfortunately, the frequency of alleles with diagnostic potential, relative to wild-type background sequence, is often well below the frequency of errors in currently available methods for sequence analysis, including very high throughput DNA sequencing. We demonstrate a DNA preparation and purification method that through non-linear electrophoretic separation in media containing oligonucleotide probes, achieves 10,000 fold enrichment of target DNA with single nucleotide specificity, and 100 fold enrichment of unmodified methylated DNA differing from the background by the methylation of a single cytosine residue.
Pulling out the 1%: Whole-Genome Capture for the Targeted Enrichment of Ancient DNA Sequencing Libraries

PubMed Central

Carpenter, Meredith L.; Buenrostro, Jason D.; Valdiosera, Cristina; Schroeder, Hannes; Allentoft, Morten E.; Sikora, Martin; Rasmussen, Morten; Gravel, Simon; Guillén, Sonia; Nekhrizov, Georgi; Leshtakov, Krasimir; Dimitrova, Diana; Theodossiev, Nikola; Pettener, Davide; Luiselli, Donata; Sandoval, Karla; Moreno-Estrada, Andrés; Li, Yingrui; Wang, Jun; Gilbert, M. Thomas P.; Willerslev, Eske; Greenleaf, William J.; Bustamante, Carlos D.

2013-01-01

Most ancient specimens contain very low levels of endogenous DNA, precluding the shotgun sequencing of many interesting samples because of cost. Ancient DNA (aDNA) libraries often contain <1% endogenous DNA, with the majority of sequencing capacity taken up by environmental DNA. Here we present a capture-based method for enriching the endogenous component of aDNA sequencing libraries. By using biotinylated RNA baits transcribed from genomic DNA libraries, we are able to capture DNA fragments from across the human genome. We demonstrate this method on libraries created from four Iron Age and Bronze Age human teeth from Bulgaria, as well as bone samples from seven Peruvian mummies and a Bronze Age hair sample from Denmark. Prior to capture, shotgun sequencing of these libraries yielded an average of 1.2% of reads mapping to the human genome (including duplicates). After capture, this fraction increased substantially, with up to 59% of reads mapped to human and enrichment ranging from 6- to 159-fold. Furthermore, we maintained coverage of the majority of regions sequenced in the precapture library. Intersection with the 1000 Genomes Project reference panel yielded an average of 50,723 SNPs (range 3,062–147,243) for the postcapture libraries sequenced with 1 million reads, compared with 13,280 SNPs (range 217–73,266) for the precapture libraries, increasing resolution in population genetic analyses. Our whole-genome capture approach makes it less costly to sequence aDNA from specimens containing very low levels of endogenous DNA, enabling the analysis of larger numbers of samples. PMID:24568772
Biological nanopore MspA for DNA sequencing

NASA Astrophysics Data System (ADS)

Manrao, Elizabeth A.

Unlocking the information hidden in the human genome provides insight into the inner workings of complex biological systems and can be used to greatly improve health-care. In order to allow for widespread sequencing, new technologies are required that provide fast and inexpensive readings of DNA. Nanopore sequencing is a third generation DNA sequencing technology that is currently being developed to fulfill this need. In nanopore sequencing, a voltage is applied across a small pore in an electrolyte solution and the resulting ionic current is recorded. When DNA passes through the channel, the ionic current is partially blocked. If the DNA bases uniquely modulate the ionic current flowing through the channel, the time trace of the current can be related to the sequence of DNA passing through the pore. There are two main challenges to realizing nanopore sequencing: identifying a pore with sensitivity to single nucleotides and controlling the translocation of DNA through the pore so that the small single nucleotide current signatures are distinguishable from background noise. In this dissertation, I explore the use of Mycobacterium smegmatis porin A (MspA) for nanopore sequencing. In order to determine MspA's sensitivity to single nucleotides, DNA strands of various compositions are held in the pore as the resulting ionic current is measured. DNA is immobilized in MspA by attaching it to a large molecule which acts as an anchor. This technique confirms the single nucleotide resolution of the pore and additionally shows that MspA is sensitive to epigenetic modifications and single nucleotide polymorphisms. The forces from the electric field within MspA, the effective charge of nucleotides, and elasticity of DNA are estimated using a Freely Jointed Chain model of single stranded DNA. These results offer insight into the interactions of DNA within the pore. 
With the nucleotide sensitivity of MspA confirmed, a method is introduced to controllably pass DNA through the pore. Using a DNA polymerase, DNA strands are stepped through MspA one nucleotide at a time. The steps are observable as distinct levels on the ionic-current time-trace and are related to the DNA sequence. These experiments overcome the two fundamental challenges to realizing MspA nanopore sequencing and pave the way to the development of a commercial technology.
Effects of sequence on DNA wrapping around histones

NASA Astrophysics Data System (ADS)

Ortiz, Vanessa

2011-03-01

A central question in biophysics is whether the sequence of a DNA strand affects its mechanical properties. In epigenetics, these are thought to influence nucleosome positioning and gene expression. Theoretical and experimental attempts to answer this question have been hindered by an inability to directly resolve DNA structure and dynamics at the base-pair level. In our previous studies we used a detailed model of DNA to measure the effects of sequence on the stability of naked DNA under bending. Sequence was shown to influence DNA's ability to form kinks, which arise when certain motifs slide past others to form non-native contacts. Here, we have now included histone-DNA interactions to see if the results obtained for naked DNA are transferable to the problem of nucleosome positioning. Different DNA sequences interacting with the histone protein complex are studied, and their equilibrium and mechanical properties are compared among themselves and with the naked case. NLM training grant to the Computation and Informatics in Biology and Medicine Training Program (NLM T15LM007359).
A high-throughput and quantitative method to assess the mutagenic potential of translesion DNA synthesis

PubMed Central

Taggart, David J.; Camerlengo, Terry L.; Harrison, Jason K.; Sherrer, Shanen M.; Kshetry, Ajay K.; Taylor, John-Stephen; Huang, Kun; Suo, Zucai

2013-01-01

Cellular genomes are constantly damaged by endogenous and exogenous agents that covalently and structurally modify DNA to produce DNA lesions. Although most lesions are mended by various DNA repair pathways in vivo, a significant number of damage sites persist during genomic replication. Our understanding of the mutagenic outcomes derived from these unrepaired DNA lesions has been hindered by the low throughput of existing sequencing methods. Therefore, we have developed a cost-effective high-throughput short oligonucleotide sequencing assay that uses next-generation DNA sequencing technology for the assessment of the mutagenic profiles of translesion DNA synthesis catalyzed by any error-prone DNA polymerase. The vast amount of sequencing data produced were aligned and quantified by using our novel software. As an example, the high-throughput short oligonucleotide sequencing assay was used to analyze the types and frequencies of mutations upstream, downstream and at a site-specifically placed cis–syn thymidine–thymidine dimer generated individually by three lesion-bypass human Y-family DNA polymerases. PMID:23470999
A comparative study of retrotransposons in the centromeric regions of A and B chromosomes of maize.

PubMed

Theuri, J; Phelps-Durr, T; Mathews, S; Birchler, J

2005-01-01

Bacterial Artificial Chromosomes (BACs) derived from the B chromosome, based on homology with the B specific sequence, were subcloned and sequenced. Analysis of DNA sequence data indicated the presence of 23 common retroelements, as well as novel sequences of B chromosome origin. Generally, where the same retrotransposon type was observed in both A and B chromosomes, there were more copies per unit of sequence in the B centromeric region (the major site of B repeat) than in the A centromere, except for Huck-1. Based on previous estimates of the age of the major burst of transposition into the maize genome, the oldest retrotransposons (Ji-6 and Tekay, approximately 5.0 and 5.2 million years ago, respectively) were found in the B centromere region only, while the next two oldest (Huck-1 and Opie-1) were found in both the A and B sequences. Phylogenetic analysis of Opie retroelements from both A and B centromeres indicated that some of the B Opie centromeric sequences share a more recent common ancestor with A Opie retroelements than they do with other B Opie centromeric sequences. These results imply that the supernumerary maize B chromosome has coexisted with the A chromosomes during that period of transposition. They also support the hypothesis that the B chromosome had its origins from A chromosome elements, or that alternative origins, such as being donated to the maize genome in a wide species cross, preceded six million years ago, because the spectrum of retrotransposons in the two chromosomes is quite similar.
Sequences Of Amino Acids For Human Serum Albumin

NASA Technical Reports Server (NTRS)

Carter, Daniel C.

1992-01-01

Sequences of amino acids defined for use in making polypeptides one-third to one-sixth as large as parent human serum albumin molecule. Smaller, chemically stable peptides have diverse applications including service as artificial human serum and as active components of biosensors and chromatographic matrices. In applications involving production of artificial sera from new sequences, little or no concern about viral contaminants. Smaller genetically engineered polypeptides more easily expressed and produced in large quantities, making commercial isolation and production more feasible and profitable.
An extended sequence specificity for UV-induced DNA damage.

PubMed

Chung, Long H; Murray, Vincent

2018-01-01

The sequence specificity of UV-induced DNA damage was determined with a higher precision and accuracy than previously reported. UV light induces two major damage adducts: cyclobutane pyrimidine dimers (CPDs) and pyrimidine(6-4)pyrimidone photoproducts (6-4PPs). Employing capillary electrophoresis with laser-induced fluorescence and taking advantage of the distinct properties of the CPDs and 6-4PPs, we studied the sequence specificity of UV-induced DNA damage in a purified DNA sequence using two approaches: end-labelling and a polymerase stop/linear amplification assay. A mitochondrial DNA sequence that contained a random nucleotide composition was employed as the target DNA sequence. With previous methodology, the UV sequence specificity was determined at a dinucleotide or trinucleotide level; however, in this paper, we have extended the UV sequence specificity to a hexanucleotide level. With the end-labelling technique (for 6-4PPs), the consensus sequence was found to be 5'-GCTC*AC (where C* is the breakage site); while with the linear amplification procedure, it was 5'-TCTT*AC. With end-labelling, the dinucleotide frequency of occurrence was highest for 5'-TC*, 5'-TT* and 5'-CC*; whereas it was 5'-TT* for linear amplification. The influence of neighbouring nucleotides on the degree of UV-induced DNA damage was also examined. The core sequences consisted of pyrimidine nucleotides 5'-CTC* and 5'-CTT* while an A at position "1" and C at position "2" enhanced UV-induced DNA damage. Crown Copyright © 2017. Published by Elsevier B.V. All rights reserved.
Multiplex analysis of DNA

DOEpatents

Church, George M.; Kieffer-Higgins, Stephen

1992-01-01

This invention features vectors and a method for sequencing DNA. The method includes the steps of: a) ligating the DNA into a vector comprising a tag sequence, the tag sequence includes at least 15 bases, wherein the tag sequence will not hybridize to the DNA under stringent hybridization conditions and is unique in the vector, to form a hybrid vector, b) treating the hybrid vector in a plurality of vessels to produce fragments comprising the tag sequence, wherein the fragments differ in length and terminate at a fixed known base or bases, wherein the fixed known base or bases differs in each vessel, c) separating the fragments from each vessel according to their size, d) hybridizing the fragments with an oligonucleotide able to hybridize specifically with the tag sequence, and e) detecting the pattern of hybridization of the tag sequence, wherein the pattern reflects the nucleotide sequence of the DNA.
BiQ Analyzer HT: locus-specific analysis of DNA methylation by high-throughput bisulfite sequencing

PubMed Central

Lutsik, Pavlo; Feuerbach, Lars; Arand, Julia; Lengauer, Thomas; Walter, Jörn; Bock, Christoph

2011-01-01

Bisulfite sequencing is a widely used method for measuring DNA methylation in eukaryotic genomes. The assay provides single-base pair resolution and, given sufficient sequencing depth, its quantitative accuracy is excellent. High-throughput sequencing of bisulfite-converted DNA can be applied either genome wide or targeted to a defined set of genomic loci (e.g. using locus-specific PCR primers or DNA capture probes). Here, we describe BiQ Analyzer HT (http://biq-analyzer-ht.bioinf.mpi-inf.mpg.de/), a user-friendly software tool that supports locus-specific analysis and visualization of high-throughput bisulfite sequencing data. The software facilitates the shift from time-consuming clonal bisulfite sequencing to the more quantitative and cost-efficient use of high-throughput sequencing for studying locus-specific DNA methylation patterns. In addition, it is useful for locus-specific visualization of genome-wide bisulfite sequencing data. PMID:21565797
A DNA sequence analysis package for the IBM personal computer.

PubMed Central

Lagrimini, L M; Brentano, S T; Donelson, J E

1984-01-01

We present here a collection of DNA sequence analysis programs, called "PC Sequence" (PCS), which are designed to run on the IBM Personal Computer (PC). These programs are written in IBM PC compiled BASIC and take full advantage of the IBM PC's speed, error handling, and graphics capabilities. For a modest initial expense in hardware any laboratory can use these programs to quickly perform computer analysis on DNA sequences. They are written with the novice user in mind and require very little training or previous experience with computers. Also provided are a text editing program for creating and modifying DNA sequence files and a communications program which enables the PC to communicate with and collect information from mainframe computers and DNA sequence databases. PMID:6546433
Genomic sequencing of Pleistocene cave bears

DOE Office of Scientific and Technical Information (OSTI.GOV)

Noonan, James P.; Hofreiter, Michael; Smith, Doug

2005-04-01

Despite the information content of genomic DNA, ancient DNA studies to date have largely been limited to amplification of mitochondrial DNA due to technical hurdles such as contamination and degradation of ancient DNAs. In this study, we describe two metagenomic libraries constructed using unamplified DNA extracted from the bones of two 40,000-year-old extinct cave bears. Analysis of ~1 Mb of sequence from each library showed that, despite significant microbial contamination, 5.8 percent and 1.1 percent of clones in the libraries contain cave bear inserts, yielding 26,861 bp of cave bear genome sequence. Alignment of this sequence to the dog genome, the closest sequenced genome to cave bear in terms of evolutionary distance, revealed roughly the expected ratio of cave bear exons, repeats and conserved noncoding sequences. Only 0.04 percent of all clones sequenced were derived from contamination with modern human DNA. Comparison of cave bear with orthologous sequences from several modern bear species revealed the evolutionary relationship of these lineages. Using the metagenomic approach described here, we have recovered substantial quantities of mammalian genomic sequence more than twice as old as any previously reported, establishing the feasibility of ancient DNA genomic sequencing programs.
Characterization of North American Armillaria species: Genetic relationships determined by ribosomal DNA sequences and AFLP markers

Treesearch

M. -S. Kim; N. B. Klopfenstein; J. W. Hanna; G. I. McDonald

2006-01-01

Phylogenetic and genetic relationships among 10 North American Armillaria species were analysed using sequence data from ribosomal DNA (rDNA), including intergenic spacer (IGS-1), internal transcribed spacers with associated 5.8S (ITS + 5.8S), and nuclear large subunit rDNA (nLSU), and amplified fragment length polymorphism (AFLP) markers. Based on rDNA sequence data,...
Fractal landscape analysis of DNA walks

NASA Technical Reports Server (NTRS)

Peng, C. K.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Sciortino, F.; Simons, M.; Stanley, H. E.

1992-01-01

By mapping nucleotide sequences onto a "DNA walk", we uncovered remarkably long-range power law correlations [Nature 356 (1992) 168] that imply a new scale invariant property of DNA. We found such long-range correlations in intron-containing genes and in non-transcribed regulatory DNA sequences, but not in cDNA sequences or intron-less genes. In this paper, we present more explicit evidence to support our findings.
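The abstract does not spell out the mapping, so the following is only a minimal sketch of one common DNA-walk construction. The step rule (+1 for a pyrimidine, -1 for a purine) and the function names `dna_walk` and `rms_fluctuation` are assumptions made here for illustration, not details taken from the record above.

```python
from itertools import accumulate


def dna_walk(seq):
    """Cumulative displacement of a one-dimensional DNA walk.

    Assumed step rule: +1 for a pyrimidine (C or T), -1 for a purine (A or G).
    The input is assumed to contain only A, C, G and T.
    """
    steps = [1 if base in "CT" else -1 for base in seq.upper()]
    return list(accumulate(steps))


def rms_fluctuation(walk, window):
    """Standard deviation of the walk's displacement over a given window length."""
    deltas = [walk[i + window] - walk[i] for i in range(len(walk) - window)]
    mean = sum(deltas) / len(deltas)
    variance = sum((d - mean) ** 2 for d in deltas) / len(deltas)
    return variance ** 0.5


if __name__ == "__main__":
    print(dna_walk("ACGT"))
```

Under this construction, a power-law growth of the fluctuation with window length, with an exponent different from 0.5, would be the signature of long-range correlation; estimating that exponent requires much longer sequences than the toy input shown.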
[Genome-scale sequence data processing and epigenetic analysis of DNA methylation].

PubMed

Wang, Ting-Zhang; Shan, Gao; Xu, Jian-Hong; Xue, Qing-Zhong

2013-06-01

BS-Seq is a recently developed approach for detecting cytosine DNA methylation (mC) and profiling DNA methylation at the genome scale; it combines bisulfite conversion of genomic DNA with next-generation sequencing. The method not only provides insight into differences in genome-scale DNA methylation among organisms, but also reveals the conservation of DNA methylation in all sequence contexts and the nucleotide preference for different genomic regions, including genes, exons, and repetitive DNA sequences. It will help in understanding the epigenetic impact of cytosine DNA methylation on the regulation of gene expression and on maintaining the silence of repetitive sequences, such as transposable elements. In this paper, we introduce the preprocessing steps for DNA methylation data, in which cytosine (C) and guanine (G) in the reference sequence are converted to thymine (T) and adenine (A), respectively, and cytosine in reads is converted to thymine. We also comprehensively review the main content of DNA methylation analysis on the genomic scale: (1) cytosine methylation in different sequence contexts; (2) the distribution of genomic methylcytosine; (3) DNA methylation context and nucleotide preference; (4) DNA-protein interaction sites of DNA methylation; (5) the degree of cytosine methylation in the different structural elements of genes. DNA methylation analysis provides a powerful tool for studying the epigenome in human and other species and gene-environment interactions, and lays the theoretical basis for further development of disease diagnostics and therapeutics in human.
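The in silico conversion step described in this abstract can be sketched in a few lines. This is an illustrative sketch only: the function names are hypothetical, and the reading that the C-to-T copy of the reference serves reads from one strand while the G-to-A copy serves reads from the other is an assumption based on common practice, not something the abstract states.

```python
def convert_reference(reference):
    """Return two fully converted copies of a reference: (C->T copy, G->A copy)."""
    ref = reference.upper()
    return ref.replace("C", "T"), ref.replace("G", "A")


def convert_read(read):
    """Convert every cytosine in a read to thymine before alignment."""
    return read.upper().replace("C", "T")


if __name__ == "__main__":
    c_to_t, g_to_a = convert_reference("ACGT")
    print(c_to_t, g_to_a, convert_read("ACCG"))
```

After a converted read has been aligned, the original (unconverted) read would be compared with the original reference to decide which cytosines were protected from bisulfite conversion.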
Extracting DNA words based on the sequence features: non-uniform distribution and integrity.

PubMed

Li, Zhi; Cao, Hongyan; Cui, Yuehua; Zhang, Yanbo

2016-01-25

A DNA sequence can be viewed as an unknown language with words as its functional units. Given that most sequence alignment algorithms such as the motif discovery algorithms depend on the quality of background information about sequences, it is necessary to develop an ab initio algorithm for extracting the "words" based only on the DNA sequences. We considered that non-uniform distribution and integrity were two important features of a word, based on which we developed an ab initio algorithm to extract "DNA words" that have potential functional meaning. A Kolmogorov-Smirnov test was used to test whether DNA sequences are uniformly distributed, and the integrity was judged by the sequence and position alignment. Two random base sequences were adopted as negative controls, and an English book was used as a positive control to verify our algorithm. We applied our algorithm to the genomes of Saccharomyces cerevisiae and 10 strains of Escherichia coli to show the utility of the methods. The results provide strong evidence that the algorithm is a promising tool for building a DNA dictionary ab initio. Our method provides a fast way for large scale screening of important DNA elements and offers potential insights into the understanding of a genome.
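The abstract names a Kolmogorov-Smirnov test for uniformity but gives no procedure. As a rough sketch under that gap, the code below computes the one-sample KS statistic for the start positions of a candidate word against a uniform distribution over the sequence length. Testing word positions in this way, and the helper names `word_positions` and `ks_uniform_statistic`, are assumptions made here, not the authors' published method.

```python
def word_positions(seq, word):
    """Start positions of every (possibly overlapping) occurrence of word in seq."""
    positions = []
    start = seq.find(word)
    while start != -1:
        positions.append(start)
        start = seq.find(word, start + 1)
    return positions


def ks_uniform_statistic(positions, length):
    """One-sample KS statistic D of positions against Uniform(0, length)."""
    xs = sorted(p / length for p in positions)
    n = len(xs)
    if n == 0:
        raise ValueError("the word does not occur in the sequence")
    # Largest gap between the empirical CDF and the uniform CDF, checked on
    # both sides of each jump of the empirical CDF.
    return max(max((i + 1) / n - x, x - i / n) for i, x in enumerate(xs))


if __name__ == "__main__":
    sequence = "ATATAT"
    print(ks_uniform_statistic(word_positions(sequence, "ATA"), len(sequence)))
```

A large statistic relative to the critical value for the given number of occurrences would suggest that the word is not uniformly spread along the sequence.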
CpG PatternFinder: a Windows-based utility program for easy and rapid identification of the CpG methylation status of DNA.

PubMed

Xu, Yi-Hua; Manoharan, Herbert T; Pitot, Henry C

2007-09-01

The bisulfite genomic sequencing technique is one of the most widely used techniques to study sequence-specific DNA methylation because of its unambiguous ability to reveal DNA methylation status at single-nucleotide resolution. One characteristic feature of the bisulfite genomic sequencing technique is that a number of sample sequence files will be produced from a single DNA sample. The PCR products of bisulfite-treated DNA samples cannot be sequenced directly because they are heterogeneous in nature; therefore they should be cloned into suitable plasmids and then sequenced. This procedure generates an enormous number of sample DNA sequence files and also adds extra bases belonging to the plasmids to the sequence, which will cause problems in the final sequence comparison. Finding the methylation status for each CpG in each sample sequence is not an easy job. As a result, CpG PatternFinder was developed for this purpose. The main functions of the CpG PatternFinder are: (i) to analyze the reference sequence to obtain CpG and non-CpG-C residue position information; (ii) to tailor sample sequence files (delete insertions and mark deletions from the sample sequence files) based on a configuration of ClustalW multiple alignment; (iii) to align sample sequence files with a reference file to obtain bisulfite conversion efficiency and CpG methylation status; and (iv) to produce graphics, highlighted aligned sequence text and a summary report which can be easily exported to Microsoft Office suite. CpG PatternFinder is designed to operate cooperatively with BioEdit, a freeware program available on the internet. It can handle up to 100 files of sample DNA sequences simultaneously, and the total CpG pattern analysis process can be finished in minutes. CpG PatternFinder is an ideal software tool for DNA methylation studies to determine the differential methylation pattern in a large number of individuals in a population.
Previously we developed the CpG Analyzer program; CpG PatternFinder is our further effort to create software tools for DNA methylation studies.
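Functions (i) and (iii) listed in this abstract can be illustrated with a short sketch. It is not CpG PatternFinder's code: the function names are hypothetical, and the sample is assumed to be already aligned to the reference with no gaps and equal length, which sidesteps the ClustalW-based tailoring step.

```python
def cpg_positions(reference):
    """0-based indices of the C in every CpG dinucleotide of the reference."""
    ref = reference.upper()
    return [i for i in range(len(ref) - 1) if ref[i:i + 2] == "CG"]


def methylation_calls(reference, sample):
    """Call each reference CpG from a gap-free, equal-length bisulfite read.

    'M' = C retained (methylated), 'U' = read as T (unmethylated), '?' = other.
    """
    if len(reference) != len(sample):
        raise ValueError("sample must be aligned to the reference without gaps")
    sample = sample.upper()
    return {i: {"C": "M", "T": "U"}.get(sample[i], "?")
            for i in cpg_positions(reference)}


def conversion_efficiency(reference, sample):
    """Fraction of non-CpG cytosines in the reference that read as T in the sample."""
    ref, sample = reference.upper(), sample.upper()
    cpg = set(cpg_positions(ref))
    non_cpg = [i for i, base in enumerate(ref) if base == "C" and i not in cpg]
    if not non_cpg:
        return None  # no non-CpG cytosine available to estimate efficiency
    return sum(sample[i] == "T" for i in non_cpg) / len(non_cpg)


if __name__ == "__main__":
    print(methylation_calls("ACGTCCGA", "ACGTTTGA"))
    print(conversion_efficiency("ACGTCCGA", "ACGTTTGA"))
```

Using non-CpG cytosines as the efficiency control rests on the assumption that they are unmethylated in the sample, which holds for many but not all organisms and tissues.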

DNA-based watermarks using the DNA-Crypt algorithm.

PubMed

Heider, Dominik; Barnekow, Angelika

2007-05-29

The aim of this paper is to demonstrate the application of watermarks based on DNA sequences to identify the unauthorized use of genetically modified organisms (GMOs) protected by patents. Predicted mutations in the genome can be corrected by the DNA-Crypt program leaving the encrypted information intact. Existing DNA cryptographic and steganographic algorithms use synthetic DNA sequences to store binary information; however, although these sequences can be used for authentication, they may change the target DNA sequence when introduced into living organisms. The DNA-Crypt algorithm and image steganography are based on the same watermark-hiding principle, namely using the least significant base in the case of DNA-Crypt and the least significant bit in the case of image steganography. It can be combined with binary encryption algorithms like AES, RSA or Blowfish. DNA-Crypt is able to correct mutations in the target DNA with several mutation correction codes such as the Hamming-code or the WDH-code. Mutations, which can occur infrequently, may destroy the encrypted information; however, an integrated fuzzy controller decides on a set of heuristics based on three input dimensions, and recommends whether or not to use a correction code. These three input dimensions are the length of the sequence, the individual mutation rate and the stability over time, which is represented by the number of generations. In silico experiments using the Ypt7 in Saccharomyces cerevisiae show that the DNA watermarks produced by DNA-Crypt do not alter the translation of mRNA into protein. The program is able to store watermarks in living organisms and can maintain the original information by correcting mutations itself. Pairwise or multiple sequence alignments show that DNA-Crypt produces few mismatches between the sequences, similar to all steganographic algorithms.
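The abstract mentions the Hamming code as one of the mutation correction codes but gives no parameters. Purely as an illustration of how such a code repairs a single flipped bit, here is a sketch of the textbook Hamming(7,4) code. The choice of (7,4) and the function names are assumptions; how DNA-Crypt maps bits onto bases is not described in the abstract and is not modelled here.

```python
def hamming74_encode(data):
    """Encode 4 data bits [d1, d2, d3, d4] as [p1, p2, d1, p3, d2, d3, d4]."""
    d1, d2, d3, d4 = data
    p1 = d1 ^ d2 ^ d4  # parity over codeword positions 1, 3, 5, 7
    p2 = d1 ^ d3 ^ d4  # parity over codeword positions 2, 3, 6, 7
    p3 = d2 ^ d3 ^ d4  # parity over codeword positions 4, 5, 6, 7
    return [p1, p2, d1, p3, d2, d3, d4]


def hamming74_correct(codeword):
    """Return (corrected codeword, 1-based position of the flipped bit or 0)."""
    c = list(codeword)
    s1 = c[0] ^ c[2] ^ c[4] ^ c[6]
    s2 = c[1] ^ c[2] ^ c[5] ^ c[6]
    s3 = c[3] ^ c[4] ^ c[5] ^ c[6]
    position = s1 + 2 * s2 + 4 * s3  # the syndrome spells out the error position
    if position:
        c[position - 1] ^= 1
    return c, position


def hamming74_decode(codeword):
    """Correct at most one flipped bit and return the 4 data bits."""
    corrected, _ = hamming74_correct(codeword)
    return [corrected[2], corrected[4], corrected[5], corrected[6]]


if __name__ == "__main__":
    word = hamming74_encode([1, 0, 1, 1])
    word[4] ^= 1  # simulate a single point mutation in the stored bits
    print(hamming74_decode(word))
```

The code corrects one flipped bit per 7-bit block; two flips in the same block would be miscorrected, which is one reason a controller might weigh sequence length and mutation rate before choosing a code.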
DNA-based watermarks using the DNA-Crypt algorithm

PubMed Central

Heider, Dominik; Barnekow, Angelika

2007-01-01

Background The aim of this paper is to demonstrate the application of watermarks based on DNA sequences to identify the unauthorized use of genetically modified organisms (GMOs) protected by patents. Predicted mutations in the genome can be corrected by the DNA-Crypt program leaving the encrypted information intact. Existing DNA cryptographic and steganographic algorithms use synthetic DNA sequences to store binary information; however, although these sequences can be used for authentication, they may change the target DNA sequence when introduced into living organisms. Results The DNA-Crypt algorithm and image steganography are based on the same watermark-hiding principle, namely using the least significant base in the case of DNA-Crypt and the least significant bit in the case of image steganography. It can be combined with binary encryption algorithms like AES, RSA or Blowfish. DNA-Crypt is able to correct mutations in the target DNA with several mutation correction codes such as the Hamming-code or the WDH-code. Mutations, which can occur infrequently, may destroy the encrypted information; however, an integrated fuzzy controller decides on a set of heuristics based on three input dimensions, and recommends whether or not to use a correction code. These three input dimensions are the length of the sequence, the individual mutation rate and the stability over time, which is represented by the number of generations. In silico experiments using the Ypt7 in Saccharomyces cerevisiae show that the DNA watermarks produced by DNA-Crypt do not alter the translation of mRNA into protein. Conclusion The program is able to store watermarks in living organisms and can maintain the original information by correcting mutations itself. Pairwise or multiple sequence alignments show that DNA-Crypt produces few mismatches between the sequences, similar to all steganographic algorithms. PMID:17535434
Conserved Sequences at the Origin of Adenovirus DNA Replication

PubMed Central

Stillman, Bruce W.; Topp, William C.; Engler, Jeffrey A.

1982-01-01

The origin of adenovirus DNA replication lies within an inverted sequence repetition at either end of the linear, double-stranded viral DNA. Initiation of DNA replication is primed by a deoxynucleoside that is covalently linked to a protein, which remains bound to the newly synthesized DNA. We demonstrate that virion-derived DNA-protein complexes from five human adenovirus serological subgroups (A to E) can act as a template for both the initiation and the elongation of DNA replication in vitro, using nuclear extracts from adenovirus type 2 (Ad2)-infected HeLa cells. The heterologous template DNA-protein complexes were not as active as the homologous Ad2 DNA, most probably due to inefficient initiation by Ad2 replication factors. In an attempt to identify common features which may permit this replication, we have also sequenced the inverted terminal repeated DNA from human adenovirus serotypes Ad4 (group E), Ad9 and Ad10 (group D), and Ad31 (group A), and we have compared these to previously determined sequences from Ad2 and Ad5 (group C), Ad7 (group B), and Ad12 and Ad18 (group A) DNA. In all cases, the sequence around the origin of DNA replication can be divided into two structural domains: a proximal A · T-rich region which is partially conserved among these serotypes, and a distal G · C-rich region which is less well conserved. The G · C-rich region contains sequences similar to sequences present in papovavirus replication origins. The two domains may reflect a dual mechanism for initiation of DNA replication: adenovirus-specific protein priming of replication, and subsequent utilization of this primer by host replication factors for completion of DNA synthesis. Images PMID:7143575
Hardware Acceleration Of Multi-Deme Genetic Algorithm for DNA Codeword Searching

DTIC Science & Technology

2008-01-01

C and G are complementary to each other. A Watson-Crick complement of a DNA sequence is another DNA sequence which replaces all the A with T or vice...versa and replaces all the T with A or vice versa, and also switches the 5’ and 3’ ends. A DNA sequence binds most stably with its Watson-Crick ...bind with 5 Watson-Crick pairs. The length of the longest complementary sequence between two flexible DNA strands, A and B, is the same as the
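The Watson-Crick complement described in this excerpt (swap A with T and C with G, then switch the 5' and 3' ends) amounts to a reverse complement. A minimal sketch, with a function name chosen here for illustration:

```python
# Translation table pairing A with T and C with G, preserving letter case.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def watson_crick_complement(seq):
    """Complement every base, then reverse so the result also reads 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(watson_crick_complement("AACG"))
```

Applying the function twice returns the original sequence, which is a quick sanity check for any implementation. Characters other than A, C, G and T are passed through unchanged in this sketch.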
Combined subtraction hybridization and polymerase chain reaction amplification procedure for isolation of strain-specific Rhizobium DNA sequences.

PubMed Central

Bjourson, A J; Stone, C E; Cooper, J E

1992-01-01

A novel subtraction hybridization procedure, incorporating a combination of four separation strategies, was developed to isolate unique DNA sequences from a strain of Rhizobium leguminosarum bv. trifolii. Sau3A-digested DNA from this strain, i.e., the probe strain, was ligated to a linker and hybridized in solution with an excess of pooled subtracter DNA from seven other strains of the same biovar which had been restricted, ligated to a different, biotinylated, subtracter-specific linker, and amplified by polymerase chain reaction to incorporate dUTP. Subtracter DNA and subtracter-probe hybrids were removed by phenol-chloroform extraction of a streptavidin-biotin-DNA complex. NENSORB chromatography of the sequences remaining in the aqueous layer captured biotinylated subtracter DNA which may have escaped removal by phenol-chloroform treatment. Any traces of contaminating subtracter DNA were removed by digestion with uracil DNA glycosylase. Finally, remaining sequences were amplified by polymerase chain reaction with a probe strain-specific primer, labelled with 32P, and tested for specificity in dot blot hybridizations against total genomic target DNA from each strain in the subtracter pool. Two rounds of subtraction-amplification were sufficient to remove cross-hybridizing sequences and to give a probe which hybridized only with homologous target DNA. The method is applicable to the isolation of DNA and RNA sequences from both procaryotic and eucaryotic cells. Images PMID:1637166
Development of genomic resources for the narrow-leafed lupin (Lupinus angustifolius): construction of a bacterial artificial chromosome (BAC) library and BAC-end sequencing

PubMed Central

2011-01-01

Background Lupinus angustifolius L, also known as narrow-leafed lupin (NLL), is becoming an important grain legume crop that is valuable for sustainable farming and is becoming recognised as a potential human health food. Recent interest is being directed at NLL to improve grain production, disease and pest management and health benefits of the grain. However, studies have been hindered by a lack of extensive genomic resources for the species. Results A NLL BAC library was constructed consisting of 111,360 clones with an average insert size of 99.7 Kbp from cv Tanjil. The library has approximately 12 × genome coverage. Both ends of 9600 randomly selected BAC clones were sequenced to generate 13985 BAC end-sequences (BESs), covering approximately 1% of the NLL genome. These BESs permitted a preliminary characterisation of the NLL genome such as organisation and composition, with the BESs having approximately 39% G:C content, 16.6% repetitive DNA and 5.4% putative gene-encoding regions. From the BESs 9966 simple sequence repeat (SSR) motifs were identified and some of these are shown to be potential markers. Conclusions The NLL BAC library and BAC-end sequences are powerful resources for genetic and genomic research on lupin. These resources will provide a robust platform for future high-resolution mapping, map-based cloning, comparative genomics and assembly of whole-genome sequencing data for the species. PMID:22014081
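The coverage figure quoted in this abstract can be checked with simple arithmetic: total cloned DNA is the number of clones times the average insert size, and dividing by the stated coverage gives the genome size that the numbers imply. The sketch below uses only values from the abstract; the resulting genome size is a derived estimate, not a figure the abstract states, and the function names are hypothetical.

```python
def library_total_kbp(n_clones, avg_insert_kbp):
    """Total cloned DNA in the library, in kbp."""
    return n_clones * avg_insert_kbp


def implied_genome_size_mbp(n_clones, avg_insert_kbp, coverage):
    """Genome size (Mbp) implied by a library of the stated fold coverage."""
    return library_total_kbp(n_clones, avg_insert_kbp) / coverage / 1000


if __name__ == "__main__":
    total = library_total_kbp(111_360, 99.7)
    size = implied_genome_size_mbp(111_360, 99.7, 12)
    print(f"total cloned DNA: {total / 1e6:.2f} Gbp; implied genome: {size:.0f} Mbp")
```

With the abstract's numbers this gives roughly 11.1 Gbp of cloned DNA and an implied genome of about 925 Mbp, consistent with the stated "approximately 12 ×" coverage only if the genome is of that order.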
Sequence Dependent Interactions Between DNA and Single-Walled Carbon Nanotubes

NASA Astrophysics Data System (ADS)

Roxbury, Daniel

It is known that single-stranded DNA adopts a helical wrap around a single-walled carbon nanotube (SWCNT), forming a water-dispersible hybrid molecule. The ability to sort mixtures of SWCNTs based on chirality (electronic species) has recently been demonstrated using special short DNA sequences that recognize certain matching SWCNTs of specific chirality. This thesis investigates the intricacies of DNA-SWCNT sequence-specific interactions through both experimental and molecular simulation studies. The DNA-SWCNT binding strengths were experimentally quantified by studying the kinetics of DNA replacement by a surfactant on the surface of particular SWCNTs. Recognition ability was found to correlate strongly with measured binding strength, e.g. DNA sequence (TAT)4 was found to bind 20 times stronger to the (6,5)-SWCNT than sequence (TAT)4T. Next, using replica exchange molecular dynamics (REMD) simulations, equilibrium structures formed by (a) single-strands and (b) multiple-strands of 12-mer oligonucleotides adsorbed on various SWCNTs were explored. A number of structural motifs were discovered in which the DNA strand wraps around the SWCNT and 'stitches' to itself via hydrogen bonding. Great variability among equilibrium structures was observed and shown to be directly influenced by DNA sequence and SWCNT type. For example, the (6,5)-SWCNT DNA recognition sequence, (TAT)4, was found to wrap in a tight single-stranded right-handed helical conformation. In contrast, DNA sequence T12 forms a beta-barrel left-handed structure on the same SWCNT. These are the first theoretical indications that DNA-based SWCNT selectivity can arise on a molecular level. In a biomedical collaboration with the Mayo Clinic, pathways for DNA-SWCNT internalization into healthy human endothelial cells were explored. 
Through absorbance spectroscopy, TEM imaging, and confocal fluorescence microscopy, we showed that intracellular concentrations of SWCNTs far exceeded those of the incubation solution, which suggested an energy-dependent pathway. Additionally, by means of pharmacological inhibition and vector-induced gene knockout studies, the DNA-SWCNTs were shown to enter the cells via Rac1-mediated macropinocytosis.
Development of a Novel Technology for Label Free DNA Sequencing

DTIC Science & Technology

2012-05-21

of the C-H bond stretch vibrations in the planes of the corresponding DNA bases, and in the higher-frequency side, sequence-identifier region is...composed of the N-H bond stretch vibrations in the planes of the corresponding DNA bases. In addition, the sequence-identifier dividing region almost...regions are localized at the corresponding DNA bases and exhibit a definable dependence on the sequence form of the codons under study. Final
Flow cytometry for enrichment and titration in massively parallel DNA sequencing

PubMed Central

Sandberg, Julia; Ståhl, Patrik L.; Ahmadian, Afshin; Bjursell, Magnus K.; Lundeberg, Joakim

2009-01-01

Massively parallel DNA sequencing is revolutionizing genomics research throughout the life sciences. However, the reagent costs and labor requirements in current sequencing protocols are still substantial, although improvements are continuously being made. Here, we demonstrate an effective alternative to existing sample titration protocols for the Roche/454 system using Fluorescence Activated Cell Sorting (FACS) technology to determine the optimal DNA-to-bead ratio prior to large-scale sequencing. Our method, which eliminates the need for the costly pilot sequencing of samples during titration is capable of rapidly providing accurate DNA-to-bead ratios that are not biased by the quantification and sedimentation steps included in current protocols. Moreover, we demonstrate that FACS sorting can be readily used to highly enrich fractions of beads carrying template DNA, with near total elimination of empty beads and no downstream sacrifice of DNA sequencing quality. Automated enrichment by FACS is a simple approach to obtain pure samples for bead-based sequencing systems, and offers an efficient, low-cost alternative to current enrichment protocols. PMID:19304748
A DNA sequence obtained by replacement of the dopamine RNA aptamer bases is not an aptamer.

PubMed

Álvarez-Martos, Isabel; Ferapontova, Elena E

2017-08-05

The unique specificity of aptamer-ligand biorecognition and binding facilitates bioanalysis and biosensor development, contributing to discrimination of structurally related molecules, such as dopamine and other catecholamine neurotransmitters. The aptamer sequence capable of specific binding of dopamine is a 57-nucleotide-long RNA sequence reported in 1997 (Biochemistry, 1997, 36, 9726). Later, it was suggested that the DNA homologue of the RNA aptamer retains the specificity of dopamine binding (Biochem. Biophys. Res. Commun., 2009, 388, 732). Here, we show that the DNA sequence obtained by replacing the RNA aptamer bases with their DNA analogues is not capable of specific biorecognition of dopamine, in contrast to the original RNA aptamer sequence. This DNA sequence binds dopamine and structurally related catecholamine neurotransmitters non-specifically, as any DNA sequence does, and thus is not an aptamer and can be used neither for in vivo nor for in situ analysis of dopamine in the presence of structurally related neurotransmitters. Copyright © 2017 Elsevier Inc. All rights reserved.
Method for sequencing DNA base pairs

DOEpatents

Sessler, Andrew M.; Dawson, John

1993-01-01

The base pairs of a DNA structure are sequenced with the use of a scanning tunneling microscope (STM). The DNA structure is scanned by the STM probe tip, and, as it is being scanned, the DNA structure is separately subjected to a sequence of infrared radiation from four different sources, each source being selected to preferentially excite one of the four different bases in the DNA structure. Each particular base being scanned is subjected to such sequence of infrared radiation from the four different sources as that particular base is being scanned. The DNA structure as a whole is separately imaged for each subjection thereof to radiation from one only of each source.
Nuclear counterparts of the cytoplasmic mitochondrial 12S rRNA gene: a problem of ancient DNA and molecular phylogenies.

PubMed

van der Kuyl, A C; Kuiken, C L; Dekker, J T; Perizonius, W R; Goudsmit, J

1995-06-01

Monkey mummy bones and teeth originating from the North Saqqara Baboon Galleries (Egypt), soft tissue from a mummified baboon in a museum collection, and nineteenth/twentieth-century skin fragments from mangabeys were used for DNA extraction and PCR amplification of part of the mitochondrial 12S rRNA gene. Sequences aligning with the 12S rRNA gene were recovered but were only distantly related to contemporary monkey mitochondrial 12S rRNA sequences. However, many of these sequences were identical or closely related to human nuclear DNA sequences resembling mitochondrial 12S rRNA (isolated from a cell line depleted in mitochondria) and therefore have to be considered contamination. Subsequently, in a separate study, we were able to recover genuine mitochondrial 12S rRNA sequences from many extant species of nonhuman Old World primates and sequences closely resembling the human nuclear integrations. Analysis of all sequences by the neighbor-joining (NJ) method indicated that mitochondrial DNA sequences and their nuclear counterparts can be divided into two distinct clusters. One cluster contained all contemporary cytoplasmic mitochondrial DNA sequences and approximately half of the monkey nuclear mitochondrial-like sequences. A second cluster contained most human nuclear sequences and the other half of monkey nuclear sequences with a separate branch leading to human and gorilla mitochondrial and nuclear sequences. Sequences recovered from ancient materials were equally divided between the two clusters. These results constitute a warning when working with ancient DNA or performing phylogenetic analysis using mitochondrial DNA as a target sequence: nuclear counterparts of mitochondrial genes may lead to faulty interpretation of results.
A tailored biocatalyst achieved by the rational anchoring of imidazole groups on a natural polymer: furnishing a potential artificial nuclease by sustainable materials engineering.

PubMed

Ferreira, José G L; Grein-Iankovski, Aline; Oliveira, Marco A S; Simas-Tosin, Fernanda F; Riegel-Vidotti, Izabel C; Orth, Elisa S

2015-04-11

Foreseeing the development of artificial enzymes by sustainable materials engineering, we rationally anchored reactive imidazole groups on gum arabic, a natural biocompatible polymer. The tailored biocatalyst GAIMZ demonstrated catalytic activity (>10^5-fold) in dephosphorylation reactions with recyclable features and was effective in cleaving plasmid DNA, comprising a potential artificial nuclease.
Sequence independent amplification of DNA

DOEpatents

Bohlander, S.K.

1998-03-24

The present invention is a rapid sequence-independent amplification procedure (SIA). Even minute amounts of DNA from various sources can be amplified independent of any sequence requirements of the DNA or any a priori knowledge of any sequence characteristics of the DNA to be amplified. This method allows, for example, the sequence independent amplification of microdissected chromosomal material and the reliable construction of high quality fluorescent in situ hybridization (FISH) probes from YACs or from other sources. These probes can be used to localize YACs on metaphase chromosomes but also--with high efficiency--in interphase nuclei. 25 figs.
Sequence independent amplification of DNA

DOEpatents

Bohlander, Stefan K.

1998-01-01

The present invention is a rapid sequence-independent amplification procedure (SIA). Even minute amounts of DNA from various sources can be amplified independent of any sequence requirements of the DNA or any a priori knowledge of any sequence characteristics of the DNA to be amplified. This method allows, for example, the sequence independent amplification of microdissected chromosomal material and the reliable construction of high quality fluorescent in situ hybridization (FISH) probes from YACs or from other sources. These probes can be used to localize YACs on metaphase chromosomes but also--with high efficiency--in interphase nuclei.
UV-Visible Spectroscopy-Based Quantification of Unlabeled DNA Bound to Gold Nanoparticles.

PubMed

Baldock, Brandi L; Hutchison, James E

2016-12-20

DNA-functionalized gold nanoparticles have been increasingly applied as sensitive and selective analytical probes and biosensors. The DNA ligands bound to a nanoparticle dictate its reactivity, making it essential to know the type and number of DNA strands bound to the nanoparticle surface. Existing methods used to determine the number of DNA strands per gold nanoparticle (AuNP) require that the sequences be fluorophore-labeled, which may affect the DNA surface coverage and reactivity of the nanoparticle and/or require specialized equipment and other fluorophore-containing reagents. We report a UV-visible-based method to conveniently and inexpensively determine the number of DNA strands attached to AuNPs of different core sizes. When this method is used in tandem with a fluorescence dye assay, it is possible to determine the ratio of two unlabeled sequences of different lengths bound to AuNPs. Two sizes of citrate-stabilized AuNPs (5 and 12 nm) were functionalized with mixtures of short (5 base) and long (32 base) disulfide-terminated DNA sequences, and the ratios of sequences bound to the AuNPs were determined using the new method. The long DNA sequence was present as a lower proportion of the ligand shell than in the ligand exchange mixture, suggesting it had a lower propensity to bind the AuNPs than the short DNA sequence. The ratio of DNA sequences bound to the AuNPs was not the same for the large and small AuNPs, which suggests that the radius of curvature had a significant influence on the assembly of DNA strands onto the AuNPs.
Ancient DNA sequence revealed by error-correcting codes.

PubMed

Brandão, Marcelo M; Spoladore, Larissa; Faria, Luzinete C B; Rocha, Andréa S L; Silva-Filho, Marcio C; Palazzo, Reginaldo

2015-07-10

A previously described DNA sequence generator algorithm (DNA-SGA) using error-correcting codes has been employed as a computational tool to address the evolutionary pathway of the genetic code. The code-generated sequence alignment demonstrated that a residue mutation revealed by the code can be found in the same position in sequences of distantly related taxa. Furthermore, the code-generated sequences do not promote amino acid changes in the deviant genomes through codon reassignment. A Bayesian evolutionary analysis of both code-generated and homologous sequences of the Arabidopsis thaliana malate dehydrogenase gene indicates an approximately 1 MYA divergence time from the MDH code-generated sequence node to its paralogous sequences. The DNA-SGA helps to determine the plesiomorphic state of DNA sequences because a single nucleotide alteration often occurs in distantly related taxa and can be found in the alternative codon patterns of noncanonical genetic codes. As a consequence, the algorithm may reveal an earlier stage of the evolution of the standard code.
Ancient DNA sequence revealed by error-correcting codes

PubMed Central

Brandão, Marcelo M.; Spoladore, Larissa; Faria, Luzinete C. B.; Rocha, Andréa S. L.; Silva-Filho, Marcio C.; Palazzo, Reginaldo

2015-01-01

A previously described DNA sequence generator algorithm (DNA-SGA) using error-correcting codes has been employed as a computational tool to address the evolutionary pathway of the genetic code. The code-generated sequence alignment demonstrated that a residue mutation revealed by the code can be found in the same position in sequences of distantly related taxa. Furthermore, the code-generated sequences do not promote amino acid changes in the deviant genomes through codon reassignment. A Bayesian evolutionary analysis of both code-generated and homologous sequences of the Arabidopsis thaliana malate dehydrogenase gene indicates an approximately 1 MYA divergence time from the MDH code-generated sequence node to its paralogous sequences. The DNA-SGA helps to determine the plesiomorphic state of DNA sequences because a single nucleotide alteration often occurs in distantly related taxa and can be found in the alternative codon patterns of noncanonical genetic codes. As a consequence, the algorithm may reveal an earlier stage of the evolution of the standard code. PMID:26159228
Random sampling causes the low reproducibility of rare eukaryotic OTUs in Illumina COI metabarcoding.

PubMed

Leray, Matthieu; Knowlton, Nancy

2017-01-01

DNA metabarcoding, the PCR-based profiling of natural communities, is becoming the method of choice for biodiversity monitoring because it circumvents some of the limitations inherent to traditional ecological surveys. However, potential sources of bias that can affect the reproducibility of this method remain to be quantified. The interpretation of differences in patterns of sequence abundance and the ecological relevance of rare sequences remain particularly uncertain. Here we used one artificial mock community to explore the significance of abundance patterns and disentangle the effects of two potential biases on data reproducibility: indexed PCR primers and random sampling during Illumina MiSeq sequencing. We amplified a short fragment of the mitochondrial Cytochrome c Oxidase Subunit I (COI) for a single mock sample containing equimolar amounts of total genomic DNA from 34 marine invertebrates belonging to six phyla. We used seven indexed broad-range primers and sequenced the resulting library on two consecutive Illumina MiSeq runs. The total number of Operational Taxonomic Units (OTUs) was ∼4 times higher than expected based on the composition of the mock sample. Moreover, the total number of reads for the 34 components of the mock sample differed by up to three orders of magnitude. However, 79 out of 86 of the unexpected OTUs were represented by <10 sequences that did not appear consistently across replicates. Our data suggest that random sampling of rare OTUs (e.g., small associated fauna such as parasites) accounted for most of variation in OTU presence-absence, whereas biases associated with indexed PCRs accounted for a larger amount of variation in relative abundance patterns. These results suggest that random sampling during sequencing leads to the low reproducibility of rare OTUs. We suggest that the strategy for handling rare OTUs should depend on the objectives of the study. 
Systematic removal of rare OTUs may avoid inflating diversity based on common β descriptors but will exclude positive records of taxa that are functionally important. Our results further reinforce the need for technical replicates (parallel PCR and sequencing from the same sample) in metabarcoding experimental designs. Data reproducibility should be determined empirically as it will depend upon the sequencing depth, the type of sample, the sequence analysis pipeline, and the number of replicates. Moreover, estimating relative biomasses or abundances based on read counts remains elusive at the OTU level.
Multiplex picoliter-droplet digital PCR for quantitative assessment of DNA integrity in clinical samples.

PubMed

Didelot, Audrey; Kotsopoulos, Steve K; Lupo, Audrey; Pekin, Deniz; Li, Xinyu; Atochin, Ivan; Srinivasan, Preethi; Zhong, Qun; Olson, Jeff; Link, Darren R; Laurent-Puig, Pierre; Blons, Hélène; Hutchison, J Brian; Taly, Valerie

2013-05-01

Assessment of DNA integrity and quantity remains a bottleneck for high-throughput molecular genotyping technologies, including next-generation sequencing. In particular, DNA extracted from paraffin-embedded tissues, a major potential source of tumor DNA, varies widely in quality, leading to unpredictable sequencing data. We describe a picoliter droplet-based digital PCR method that enables simultaneous detection of DNA integrity and the quantity of amplifiable DNA. Using a multiplex assay, we detected 4 different target lengths (78, 159, 197, and 550 bp). Assays were validated with human genomic DNA fragmented to sizes of 170 bp to 3000 bp. The technique was validated with DNA quantities as low as 1 ng. We evaluated 12 DNA samples extracted from paraffin-embedded lung adenocarcinoma tissues. One sample contained no amplifiable DNA. The fractions of amplifiable DNA for the 11 other samples were between 0.05% and 10.1% for 78-bp fragments and ≤1% for longer fragments. Four samples were chosen for enrichment and next-generation sequencing. The quality of the sequencing data was in agreement with the results of the DNA-integrity test. Specifically, DNA with low integrity yielded sequencing results with lower levels of coverage and uniformity and had higher levels of false-positive variants. The development of DNA-quality assays will enable researchers to downselect samples or process more DNA to achieve reliable genome sequencing with the highest possible efficiency of cost and effort, as well as minimize the waste of precious samples. © 2013 American Association for Clinical Chemistry.

Detecting and Estimating Contamination of Human DNA Samples in Sequencing and Array-Based Genotype Data

PubMed Central

Jun, Goo; Flickinger, Matthew; Hetrick, Kurt N.; Romm, Jane M.; Doheny, Kimberly F.; Abecasis, Gonçalo R.; Boehnke, Michael; Kang, Hyun Min

2012-01-01

DNA sample contamination is a serious problem in DNA sequencing studies and may result in systematic genotype misclassification and false positive associations. Although methods exist to detect and filter out cross-species contamination, few methods to detect within-species sample contamination are available. In this paper, we describe methods to identify within-species DNA sample contamination based on (1) a combination of sequencing reads and array-based genotype data, (2) sequence reads alone, and (3) array-based genotype data alone. Analysis of sequencing reads allows contamination detection after sequence data is generated but prior to variant calling; analysis of array-based genotype data allows contamination detection prior to generation of costly sequence data. Through a combination of analysis of in silico and experimentally contaminated samples, we show that our methods can reliably detect and estimate levels of contamination as low as 1%. We evaluate the impact of DNA contamination on genotype accuracy and propose effective strategies to screen for and prevent DNA contamination in sequencing studies. PMID:23103226
Integrated sequencing of exome and mRNA of large-sized single cells.

PubMed

Wang, Lily Yan; Guo, Jiajie; Cao, Wei; Zhang, Meng; He, Jiankui; Li, Zhoufang

2018-01-10

With current approaches to single-cell DNA-RNA integrated sequencing, SNPs are difficult to call because a large amount of DNA and RNA is lost during DNA-RNA separation. Here, we performed simultaneous single-cell exome and transcriptome sequencing on individual mouse oocytes. Using microinjection, we kept the nuclei intact to avoid DNA loss, while retaining the cytoplasm inside the cell membrane, to maximize the amount of DNA and RNA captured from the single cell. We then conducted exome-sequencing on the isolated nuclei and mRNA-sequencing on the enucleated cytoplasm. For single oocytes, exome-seq can cover up to 92% of exome region with an average sequencing depth of 10+, while mRNA-sequencing reveals more than 10,000 expressed genes in enucleated cytoplasm, with similar performance for intact oocytes. This approach provides unprecedented opportunities to study DNA-RNA regulation, such as RNA editing at single nucleotide level in oocytes. In future, this method can also be applied to other large cells, including neurons, large dendritic cells and large tumour cells for integrated exome and transcriptome sequencing.
Genomics dataset of unidentified disclosed isolates.

PubMed

Rekadwad, Bhagwan N

2016-09-01

Analysis of DNA sequences is necessary for higher hierarchical classification of organisms. It gives clues about the characteristics of organisms and their taxonomic position. This dataset was chosen to find complexities in the unidentified DNA in the disclosed patents. A total of 17 unidentified DNA sequences were thoroughly analyzed. Quick response (QR) codes were generated, and the AT/GC content of the DNA sequences was analyzed. The QR codes are helpful for quick identification of isolates. AT/GC content is helpful for studying their stability at different temperatures. Additionally, a dataset on cleavage codes and enzyme codes from the restriction digestion study, which is helpful for performing studies using short DNA sequences, was reported. The dataset disclosed here is new data for the exploration of unique DNA sequences for evaluation, identification, comparison and analysis.
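The AT/GC content mentioned in this record can be computed with a few lines of Python. The following is a minimal sketch, not the dataset author's tool: the function name `at_gc_content` is hypothetical, and it assumes that ambiguous symbols such as N are simply ignored.

```python
from typing import Tuple


def at_gc_content(seq: str) -> Tuple[float, float]:
    """Return (AT fraction, GC fraction), counting only A, C, G and T.

    Assumption: ambiguous symbols (N, R, Y, ...) are skipped rather than
    distributed across bases.
    """
    s = seq.upper()
    at = s.count("A") + s.count("T")
    gc = s.count("G") + s.count("C")
    total = at + gc
    if total == 0:
        raise ValueError("sequence contains no A, C, G or T")
    return at / total, gc / total
```

A higher GC fraction generally corresponds to a more thermally stable duplex, which is the use the abstract points to.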
Winnowing DNA for Rare Sequences: Highly Specific Sequence and Methylation Based Enrichment

PubMed Central

Thompson, Jason D.; Shibahara, Gosuke; Rajan, Sweta; Pel, Joel; Marziali, Andre

2012-01-01

Rare mutations in cell populations are known to be hallmarks of many diseases and cancers. Similarly, differential DNA methylation patterns arise in rare cell populations with diagnostic potential such as fetal cells circulating in maternal blood. Unfortunately, the frequency of alleles with diagnostic potential, relative to wild-type background sequence, is often well below the frequency of errors in currently available methods for sequence analysis, including very high throughput DNA sequencing. We demonstrate a DNA preparation and purification method that through non-linear electrophoretic separation in media containing oligonucleotide probes, achieves 10,000 fold enrichment of target DNA with single nucleotide specificity, and 100 fold enrichment of unmodified methylated DNA differing from the background by the methylation of a single cytosine residue. PMID:22355378
Low-Energy Electron-Induced Strand Breaks in Telomere-Derived DNA Sequences-Influence of DNA Sequence and Topology.

PubMed

Rackwitz, Jenny; Bald, Ilko

2018-03-26

During cancer radiation therapy high-energy radiation is used to reduce tumour tissue. The irradiation produces a shower of secondary low-energy (<20 eV) electrons, which are able to damage DNA very efficiently by dissociative electron attachment. Recently, it was suggested that low-energy electron-induced DNA strand breaks strongly depend on the specific DNA sequence with a high sensitivity of G-rich sequences. Here, we use DNA origami platforms to expose G-rich telomere sequences to low-energy (8.8 eV) electrons to determine absolute cross sections for strand breakage and to study the influence of sequence modifications and topology of telomeric DNA on the strand breakage. We find that the telomeric DNA 5'-(TTA GGG)2 is more sensitive to low-energy electrons than an intermixed sequence 5'-(TGT GTG A)2, confirming the unique electronic properties resulting from G-stacking. With increasing length of the oligonucleotide (i.e., going from 5'-(GGG ATT)2 to 5'-(GGG ATT)4), both the variety of topology and the electron-induced strand break cross sections increase. Addition of K+ ions decreases the strand break cross section for all sequences that are able to fold G-quadruplexes or G-intermediates, whereas the strand break cross section for the intermixed sequence remains unchanged. These results indicate that telomeric DNA is rather sensitive towards low-energy electron-induced strand breakage suggesting significant telomere shortening that can also occur during cancer radiation therapy. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Improved multiple displacement amplification (iMDA) and ultraclean reagents.

PubMed

Motley, S Timothy; Picuri, John M; Crowder, Chris D; Minich, Jeremiah J; Hofstadler, Steven A; Eshoo, Mark W

2014-06-06

Next-generation sequencing sample preparation requires nanogram to microgram quantities of DNA; however, many relevant samples are comprised of only a few cells. Genomic analysis of these samples requires a whole genome amplification method that is unbiased and free of exogenous DNA contamination. To address these challenges we have developed protocols for the production of DNA-free consumables including reagents and have improved upon multiple displacement amplification (iMDA). A specialized ethylene oxide treatment was developed that renders free DNA and DNA present within Gram positive bacterial cells undetectable by qPCR. To reduce DNA contamination in amplification reagents, a combination of ion exchange chromatography, filtration, and lot testing protocols were developed. Our multiple displacement amplification protocol employs a second strand-displacing DNA polymerase, improved buffers, improved reaction conditions and DNA free reagents. The iMDA protocol, when used in combination with DNA-free laboratory consumables and reagents, significantly improved efficiency and accuracy of amplification and sequencing of specimens with moderate to low levels of DNA. The sensitivity and specificity of sequencing of amplified DNA prepared using iMDA was compared to that of DNA obtained with two commercial whole genome amplification kits using 10 fg (~1-2 bacterial cells worth) of bacterial genomic DNA as a template. Analysis showed >99% of the iMDA reads mapped to the template organism whereas only 0.02% of the reads from the commercial kits mapped to the template. To assess the ability of iMDA to achieve balanced genomic coverage, a non-stochastic amount of bacterial genomic DNA (1 pg) was amplified and sequenced, and data obtained were compared to sequencing data obtained directly from genomic DNA. 
The iMDA DNA and genomic DNA sequencing had comparable coverage 99.98% of the reference genome at ≥1X coverage and 99.9% at ≥5X coverage while maintaining both balance and representation of the genome. The iMDA protocol in combination with DNA-free laboratory consumables, significantly improved the ability to sequence specimens with low levels of DNA. iMDA has broad utility in metagenomics, diagnostics, ancient DNA analysis, pre-implantation embryo screening, single-cell genomics, whole genome sequencing of unculturable organisms, and forensic applications for both human and microbial targets.
Strong minor groove base conservation in sequence logos implies DNA distortion or base flipping during replication and transcription initiation.

PubMed

Schneider, T D

2001-12-01

The sequence logo for DNA binding sites of the bacteriophage P1 replication protein RepA shows unusually high sequence conservation (approximately 2 bits) at a minor groove that faces RepA. However, B-form DNA can support only 1 bit of sequence conservation via contacts into the minor groove. The high conservation in RepA sites therefore implies a distorted DNA helix with direct or indirect contacts to the protein. Here I show that a high minor groove conservation signature also appears in sequence logos of sites for other replication origin binding proteins (Rts1, DnaA, P4 alpha, EBNA1, ORC) and promoter binding proteins (sigma(70), sigma(D) factors). This finding implies that DNA binding proteins generally use non-B-form DNA distortion such as base flipping to initiate replication and transcription.
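The "bits" of conservation quoted here are the per-position information content used in sequence logos, commonly written as 2 - H, where H is the Shannon entropy of the base frequencies at that position. The sketch below illustrates that calculation under stated assumptions: the function name `conservation_bits` is hypothetical, the input is a list of aligned sites containing only A, C, G and T, and the small-sample correction usually applied to real logos is omitted.

```python
import math
from collections import Counter


def conservation_bits(sites):
    """Per-position information content (bits) for aligned DNA sites.

    Assumptions: sites are equal-length strings over A, C, G, T, and no
    small-sample correction is applied.
    """
    if not sites or len({len(s) for s in sites}) != 1:
        raise ValueError("sites must be non-empty and of equal length")
    bits = []
    for column in zip(*sites):
        n = len(column)
        counts = Counter(column)
        # Shannon entropy of the observed base frequencies in this column.
        entropy = -sum((c / n) * math.log2(c / n) for c in counts.values())
        bits.append(2.0 - entropy)
    return bits
```

A position that is always the same base scores 2 bits, while a position that is equally often one of two bases scores 1 bit, which is the ceiling the abstract gives for minor groove contacts in B-form DNA.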
Molecular design of sequence specific DNA alkylating agents.

PubMed

Minoshima, Masafumi; Bando, Toshikazu; Shinohara, Ken-ichi; Sugiyama, Hiroshi

2009-01-01

Sequence-specific DNA alkylating agents are of great interest as a novel approach to cancer chemotherapy. We designed conjugates between pyrrole (Py)-imidazole (Im) polyamides and a DNA-alkylating chlorambucil moiety attached at different positions. The sequence-specific DNA alkylation by the conjugates was investigated using high-resolution denaturing polyacrylamide gel electrophoresis (PAGE). The results showed that polyamide-chlorambucil conjugates alkylate DNA at flanking adenines in recognition sequences of Py-Im polyamides; however, the reactivities and alkylation sites were influenced by the positions of conjugation. In addition, we synthesized a conjugate between a Py-Im polyamide and another alkylating agent, 1-(chloromethyl)-5-hydroxy-1,2-dihydro-3H-benz[e]indole (seco-CBI). The DNA alkylation reactivities of both alkylating polyamides were almost comparable. In contrast, cytotoxicities against cell lines differed greatly. These comparative studies should promote the development of appropriate sequence-specific DNA alkylating polyamides against specific cancer cells.
The use of coded PCR primers enables high-throughput sequencing of multiple homolog amplification products by 454 parallel sequencing.

PubMed

Binladen, Jonas; Gilbert, M Thomas P; Bollback, Jonathan P; Panitz, Frank; Bendixen, Christian; Nielsen, Rasmus; Willerslev, Eske

2007-02-14

The invention of the Genome Sequence 20 DNA Sequencing System (454 parallel sequencing platform) has enabled the rapid and high-volume production of sequence data. Until now, however, individual emulsion PCR (emPCR) reactions and subsequent sequencing runs have been unable to combine template DNA from multiple individuals, as homologous sequences cannot be subsequently assigned to their original sources. We use conventional PCR with 5'-nucleotide tagged primers to generate homologous DNA amplification products from multiple specimens, followed by sequencing through the high-throughput Genome Sequence 20 DNA Sequencing System (GS20, Roche/454 Life Sciences). Each DNA sequence is subsequently traced back to its individual source through 5' tag analysis. We demonstrate that this new approach enables the assignment of virtually all the generated DNA sequences to the correct source once sequencing anomalies are accounted for (misassignment rate <0.4%). Therefore, the method enables accurate sequencing and assignment of homologous DNA sequences from multiple sources in a single high-throughput GS20 run. We observe a bias in the distribution of the differently tagged primers that is dependent on the 5' nucleotide of the tag. In particular, primers 5' labelled with a cytosine are heavily overrepresented among the final sequences, while those 5' labelled with a thymine are strongly underrepresented. A weaker bias also exists with regard to the distribution of the sequences as sorted by the second nucleotide of the dinucleotide tags. As the results are based on a single GS20 run, the general applicability of the approach requires confirmation. However, our experiments demonstrate that 5' primer tagging is a useful method in which the sequencing power of the GS20 can be applied to PCR-based assays of multiple homologous PCR products. 
The new approach will be of value to a broad range of research areas, such as those of comparative genomics, complete mitochondrial analyses, population genetics, and phylogenetics.
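Tracing each read back to its specimen by its 5' tag is, in essence, a prefix lookup. The sketch below shows the idea only and is not the authors' pipeline: the function name `assign_reads`, the tag-to-specimen mapping, and the choice to collect unrecognised tags under `None` are all assumptions. The default tag length of 2 follows the dinucleotide tags mentioned in the abstract.

```python
def assign_reads(reads, tag_to_source, tag_len=2):
    """Group reads by their 5' tag and strip the tag from each read.

    Assumptions: tags sit at the very start of the read and are matched
    exactly; reads whose tag is not in tag_to_source are grouped under None.
    """
    assigned = {}
    for read in reads:
        tag = read[:tag_len].upper()
        source = tag_to_source.get(tag)  # None when the tag is not recognised
        assigned.setdefault(source, []).append(read[tag_len:])
    return assigned
```

For example, with the hypothetical mapping `{"CA": "specimen_1", "TG": "specimen_2"}`, the read `CAACGT` would be filed under `specimen_1` as `ACGT`. Exact matching cannot recover reads with a sequencing error in the tag, so a real workflow would need an explicit policy for those.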
Anhidrotic ectodermal dysplasia gene region cloned in yeast artificial chromosomes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kere, J.; Grzeschik, K.H.; Limon, J.

1993-05-01

Anhidrotic ectodermal dysplasia (EDA), an X-chromosomal recessive disorder, is expressed in a few females with chromosomal translocations involving bands Xq12-q13. Using available DNA markers from the region and somatic cell hybrids the authors mapped the X-chromosomal breakpoints in two such translocations. The breakpoints were further mapped within a yeast artificial chromosome contig constructed by chromosome walking techniques. Genomic DNA markers that map between the two translocation breakpoints were recovered representing putative portions of the EDA gene. 32 refs., 3 figs., 1 tab.
Utility of 16S rDNA Sequencing for Identification of Rare Pathogenic Bacteria.

PubMed

Loong, Shih Keng; Khor, Chee Sieng; Jafar, Faizatul Lela; AbuBakar, Sazaly

2016-11-01

Phenotypic identification systems are established methods for laboratory identification of bacteria causing human infections. Here, the utility of phenotypic identification systems was compared against 16S rDNA identification method on clinical isolates obtained during a 5-year study period, with special emphasis on isolates that gave unsatisfactory identification. One hundred and eighty-seven clinical bacteria isolates were tested with commercial phenotypic identification systems and 16S rDNA sequencing. Isolate identities determined using phenotypic identification systems and 16S rDNA sequencing were compared for similarity at genus and species level, with 16S rDNA sequencing as the reference method. Phenotypic identification systems identified ~46% (86/187) of the isolates with identity similar to that identified using 16S rDNA sequencing. Approximately 39% (73/187) and ~15% (28/187) of the isolates showed different genus identity and could not be identified using the phenotypic identification systems, respectively. Both methods succeeded in determining the species identities of 55 isolates; however, only ~69% (38/55) of the isolates matched at species level. 16S rDNA sequencing could not determine the species of ~20% (37/187) of the isolates. The 16S rDNA sequencing is a useful method over the phenotypic identification systems for the identification of rare and difficult to identify bacteria species. The 16S rDNA sequencing method, however, does have limitation for species-level identification of some bacteria highlighting the need for better bacterial pathogen identification tools. © 2016 Wiley Periodicals, Inc.
Mapping the Space of Genomic Signatures

PubMed Central

Kari, Lila; Hill, Kathleen A.; Sayem, Abu S.; Karamichalis, Rallis; Bryans, Nathaniel; Davis, Katelyn; Dattani, Nikesh S.

2015-01-01

We propose a computational method to measure and visualize interrelationships among any number of DNA sequences allowing, for example, the examination of hundreds or thousands of complete mitochondrial genomes. An "image distance" is computed for each pair of graphical representations of DNA sequences, and the distances are visualized as a Molecular Distance Map: Each point on the map represents a DNA sequence, and the spatial proximity between any two points reflects the degree of structural similarity between the corresponding sequences. The graphical representation of DNA sequences utilized, Chaos Game Representation (CGR), is genome- and species-specific and can thus act as a genomic signature. Consequently, Molecular Distance Maps could inform species identification, taxonomic classifications and, to a certain extent, evolutionary history. The image distance employed, Structural Dissimilarity Index (DSSIM), implicitly compares the occurrences of oligomers of length up to k (herein k = 9) in DNA sequences. We computed DSSIM distances for more than 5 million pairs of complete mitochondrial genomes, and used Multi-Dimensional Scaling (MDS) to obtain Molecular Distance Maps that visually display the sequence relatedness in various subsets, at different taxonomic levels. This general-purpose method does not require DNA sequence alignment and can thus be used to compare similar or vastly different DNA sequences, genomic or computer-generated, of the same or different lengths. We illustrate potential uses of this approach by applying it to several taxonomic subsets: phylum Vertebrata, (super)kingdom Protista, classes Amphibia-Insecta-Mammalia, class Amphibia, and order Primates. This analysis of an extensive dataset confirms that the oligomer composition of full mtDNA sequences can be a source of taxonomic information. 
This method also correctly finds the mtDNA sequences most closely related to that of the anatomically modern human (the Neanderthal, the Denisovan, and the chimp), and that the sequence most different from it in this dataset belongs to a cucumber. PMID:26000734
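A Chaos Game Representation at resolution k can be viewed as a 2^k by 2^k grid in which each cell counts the occurrences of one k-mer. The sketch below builds such a grid with integer arithmetic, which is equivalent to repeatedly moving halfway toward the corner of each successive base. It is an illustration under stated assumptions, not the authors' code: the function name `cgr_counts` is hypothetical, the corner assignment (A, C, G, T at the four corners of the unit square) is one common convention, and the DSSIM image distance and MDS steps are not reproduced.

```python
# Corner of the unit square assigned to each base, as (x bit, y bit).
# This assignment is a convention assumed for illustration.
CORNER_BITS = {"A": (0, 0), "C": (0, 1), "G": (1, 1), "T": (1, 0)}


def cgr_counts(seq, k):
    """Return a 2^k x 2^k grid of k-mer counts laid out as a CGR image."""
    if k < 1:
        raise ValueError("k must be at least 1")
    size = 1 << k
    grid = [[0] * size for _ in range(size)]
    xi = yi = 0   # cell indices; the newest base is the most significant bit
    run = 0       # number of consecutive valid bases seen so far
    for base in seq.upper():
        bits = CORNER_BITS.get(base)
        if bits is None:
            run = 0  # an ambiguous symbol breaks every k-mer that spans it
            continue
        # Halving the coordinate and adding the new corner, at k-bit resolution.
        xi = (xi >> 1) | (bits[0] << (k - 1))
        yi = (yi >> 1) | (bits[1] << (k - 1))
        run += 1
        if run >= k:
            grid[yi][xi] += 1
    return grid
```

Because the grid depends only on k-mer counts, two sequences of different lengths yield images of the same size, which is what makes an alignment-free comparison possible.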
The number of reduced alignments between two DNA sequences

PubMed Central

2014-01-01

Background: In this study we consider DNA sequences as mathematical strings. Total and reduced alignments between two DNA sequences have been considered in the literature to measure their similarity. Results for explicit representations of some alignments have been already obtained. Results: We present exact, explicit and computable formulas for the number of different possible alignments between two DNA sequences and a new formula for a class of reduced alignments. Conclusions: A unified approach for a wide class of alignments between two DNA sequences has been provided. The formula is computable and, if complemented by software development, will provide a deeper insight into the theory of sequence alignment and give rise to new comparison methods. AMS Subject Classification: Primary 92B05, 33C20; secondary 39A14, 65Q30. PMID:24684679
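To give a sense of the quantities involved, the classical count of all alignments of two sequences of lengths m and n obeys the recurrence f(m, n) = f(m-1, n) + f(m, n-1) + f(m-1, n-1) with f(m, 0) = f(0, n) = 1, since the last column is either a gap in one sequence, a gap in the other, or a pair of residues. The sketch below evaluates this recurrence (the Delannoy numbers). It assumes this textbook definition of a total alignment, which may differ in detail from the paper's, and it does not reproduce the paper's formula for reduced alignments. The function name `count_alignments` is hypothetical.

```python
def count_alignments(m, n):
    """Number of alignments of sequences of lengths m and n.

    Uses f(m, n) = f(m-1, n) + f(m, n-1) + f(m-1, n-1), f(m, 0) = f(0, n) = 1.
    """
    if m < 0 or n < 0:
        raise ValueError("lengths must be non-negative")
    prev = [1] * (n + 1)  # row for m = 0
    for _ in range(m):
        cur = [1]  # f(i, 0) = 1
        for j in range(1, n + 1):
            cur.append(prev[j] + cur[j - 1] + prev[j - 1])
        prev = cur
    return prev[n]
```

The count grows very quickly: two sequences of length 3 already admit 63 alignments under this definition.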
Novel numerical and graphical representation of DNA sequences and proteins.

PubMed

Randić, M; Novic, M; Vikić-Topić, D; Plavsić, D

2006-12-01

We have introduced novel numerical and graphical representations of DNA, which offer a simple and unique characterization of DNA sequences. The numerical representation of a DNA sequence is given as a sequence of real numbers derived from a unique graphical representation of the standard genetic code. There is no loss of information on the primary structure of a DNA sequence associated with this numerical representation. The novel representations are illustrated with the coding sequences of the first exon of beta-globin gene of half a dozen species in addition to human. The method can be extended to proteins as is exemplified by humanin, a 24-aa peptide that has recently been identified as a specific inhibitor of neuronal cell death induced by familial Alzheimer's disease mutant genes.
Capillary electrophoresis of Big-Dye terminator sequencing reactions for human mtDNA Control Region haplotyping in the identification of human remains.

PubMed

Montesino, Marta; Prieto, Lourdes

2012-01-01

Cycle sequencing reaction with Big-Dye terminators provides the methodology to analyze mtDNA Control Region amplicons by means of capillary electrophoresis. DNA sequencing with ddNTPs or terminators was developed by (1). The progressive automation of the method by combining the use of fluorescent-dye terminators with cycle sequencing has made it possible to increase the sensitivity and efficiency of the method and hence has allowed its introduction into the forensic field. PCR-generated mitochondrial DNA products are the templates for sequencing reactions. Different sets of primers can be used to generate amplicons of different sizes according to the quality and quantity of the DNA extract, providing sequence data for different ranges inside the Control Region.
Gene Identification Algorithms Using Exploratory Statistical Analysis of Periodicity

NASA Astrophysics Data System (ADS)

Mukherjee, Shashi Bajaj; Sen, Pradip Kumar

2010-10-01

Studying periodic patterns would be expected to be a standard line of attack for recognizing genes in DNA sequences and for similar problems, yet surprisingly little significant work has been done in this direction. This paper studies statistical properties of the DNA sequences of a complete genome using a new technique. A DNA sequence is converted to a numeric sequence using various types of mappings, and the standard Fourier technique is applied to study the periodicity. Distinct statistical behaviour of periodicity parameters is found in coding and non-coding sequences, which can be used to distinguish between these parts. Here DNA sequences of Drosophila melanogaster were analyzed with significant accuracy.
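The abstract does not say which mappings or periodicity parameters were used. As a minimal sketch of the general idea, the code below assumes the common indicator mapping (one binary sequence per base) and evaluates the Fourier power at a single chosen period. The function names and the normalization by sequence length are illustrative choices, not the authors' method.

```python
import cmath


def indicator(seq: str, base: str) -> list:
    """Map a DNA string to a 0/1 sequence marking where `base` occurs."""
    return [1.0 if c == base else 0.0 for c in seq.upper()]


def power_at_period(seq: str, period: float) -> float:
    """Summed Fourier power of the four indicator sequences at one period.

    The result is divided by the sequence length so that sequences of
    different lengths are roughly comparable.
    """
    total = 0.0
    for base in "ACGT":
        x = indicator(seq, base)
        coeff = sum(
            v * cmath.exp(-2j * cmath.pi * k / period) for k, v in enumerate(x)
        )
        total += abs(coeff) ** 2
    return total / len(seq)


if __name__ == "__main__":
    toy = "ATG" * 30  # a perfectly period-3 toy sequence
    print(power_at_period(toy, 3), power_at_period(toy, 4))
```

On the toy sequence, the power at period 3 is far larger than at period 4. Real coding regions show a much weaker period-3 signal that has to be assessed statistically.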
Sequencing of adenine in DNA by scanning tunneling microscopy

NASA Astrophysics Data System (ADS)

Tanaka, Hiroyuki; Taniguchi, Masateru

2017-08-01

The development of DNA sequencing technology utilizing the detection of a tunnel current is important for next-generation sequencer technologies based on single-molecule analysis technology. Using a scanning tunneling microscope, we previously reported that dI/dV measurements and dI/dV mapping revealed that the guanine base (purine base) of DNA adsorbed onto the Cu(111) surface has a characteristic peak at V_s = -1.6 V. If, in addition to guanine, the other purine base of DNA, namely, adenine, can be distinguished, then by reading all the purine bases of each single strand of a DNA double helix, the entire base sequence of the original double helix can be determined due to the complementarity of the DNA base pair. Therefore, the ability to read adenine is important from the viewpoint of sequencing. Here, we report on the identification of adenine by STM topographic and spectroscopic measurements using a synthetic DNA oligomer and viral DNA.
Complete mitochondrial genome sequence of a Middle Pleistocene cave bear reconstructed from ultrashort DNA fragments.

PubMed

Dabney, Jesse; Knapp, Michael; Glocke, Isabelle; Gansauge, Marie-Theres; Weihmann, Antje; Nickel, Birgit; Valdiosera, Cristina; García, Nuria; Pääbo, Svante; Arsuaga, Juan-Luis; Meyer, Matthias

2013-09-24

Although an inverse relationship is expected in ancient DNA samples between the number of surviving DNA fragments and their length, ancient DNA sequencing libraries are strikingly deficient in molecules shorter than 40 bp. We find that a loss of short molecules can occur during DNA extraction and present an improved silica-based extraction protocol that enables their efficient retrieval. In combination with single-stranded DNA library preparation, this method enabled us to reconstruct the mitochondrial genome sequence from a Middle Pleistocene cave bear (Ursus deningeri) bone excavated at Sima de los Huesos in the Sierra de Atapuerca, Spain. Phylogenetic reconstructions indicate that the U. deningeri sequence forms an early diverging sister lineage to all Western European Late Pleistocene cave bears. Our results prove that authentic ancient DNA can be preserved for hundreds of thousands of years outside of permafrost. Moreover, the techniques presented enable the retrieval of phylogenetically informative sequences from samples in which virtually all DNA is diminished to fragments shorter than 50 bp.
Complete mitochondrial genome sequence of a Middle Pleistocene cave bear reconstructed from ultrashort DNA fragments

PubMed Central

Dabney, Jesse; Knapp, Michael; Glocke, Isabelle; Gansauge, Marie-Theres; Weihmann, Antje; Nickel, Birgit; Valdiosera, Cristina; García, Nuria; Pääbo, Svante; Arsuaga, Juan-Luis; Meyer, Matthias

2013-01-01

Although an inverse relationship is expected in ancient DNA samples between the number of surviving DNA fragments and their length, ancient DNA sequencing libraries are strikingly deficient in molecules shorter than 40 bp. We find that a loss of short molecules can occur during DNA extraction and present an improved silica-based extraction protocol that enables their efficient retrieval. In combination with single-stranded DNA library preparation, this method enabled us to reconstruct the mitochondrial genome sequence from a Middle Pleistocene cave bear (Ursus deningeri) bone excavated at Sima de los Huesos in the Sierra de Atapuerca, Spain. Phylogenetic reconstructions indicate that the U. deningeri sequence forms an early diverging sister lineage to all Western European Late Pleistocene cave bears. Our results prove that authentic ancient DNA can be preserved for hundreds of thousands of years outside of permafrost. Moreover, the techniques presented enable the retrieval of phylogenetically informative sequences from samples in which virtually all DNA is diminished to fragments shorter than 50 bp. PMID:24019490
Crystal structure of MboIIA methyltransferase.

PubMed

Osipiuk, Jerzy; Walsh, Martin A; Joachimiak, Andrzej

2003-09-15

DNA methyltransferases (MTases) are sequence-specific enzymes which transfer a methyl group from S-adenosyl-L-methionine (AdoMet) to the amino group of either cytosine or adenine within a recognized DNA sequence. Methylation of a base in a specific DNA sequence protects DNA from nucleolytic cleavage by restriction enzymes recognizing the same DNA sequence. We have determined at 1.74 Å resolution the crystal structure of a beta-class DNA MTase MboIIA (M.MboIIA) from the bacterium Moraxella bovis, the smallest DNA MTase determined to date. M.MboIIA methylates the 3' adenine of the pentanucleotide sequence 5'-GAAGA-3'. The protein crystallizes with two molecules in the asymmetric unit which we propose to resemble the dimer when M.MboIIA is not bound to DNA. The overall structure of the enzyme closely resembles that of M.RsrI. However, the cofactor-binding pocket in M.MboIIA forms a closed structure which is in contrast to the open-form structures of other known MTases.

Application of artificial neural networks to identify equilibration in computer simulations

NASA Astrophysics Data System (ADS)

Leibowitz, Mitchell H.; Miller, Evan D.; Henry, Michael M.; Jankowski, Eric

2017-11-01

Determining which microstates generated by a thermodynamic simulation are representative of the ensemble for which sampling is desired is a ubiquitous, underspecified problem. Artificial neural networks are one type of machine learning algorithm that can provide a reproducible way to apply pattern recognition heuristics to underspecified problems. Here we use the open-source TensorFlow machine learning library and apply it to the problem of identifying which hypothetical observation sequences from a computer simulation are “equilibrated” and which are not. We generate training populations and test populations of observation sequences with embedded linear and exponential correlations. We train a two-neuron artificial network to distinguish the correlated and uncorrelated sequences. We find that this simple network is good enough for > 98% accuracy in identifying exponentially-decaying energy trajectories from molecular simulations.
Genomics dataset on unclassified published organism (patent US 7547531).

PubMed

Khan Shawan, Mohammad Mahfuz Ali; Hasan, Md Ashraful; Hossain, Md Mozammel; Hasan, Md Mahmudul; Parvin, Afroza; Akter, Salina; Uddin, Kazi Rasel; Banik, Subrata; Morshed, Mahbubul; Rahman, Md Nazibur; Rahman, S M Badier

2016-12-01

Nucleotide (DNA) sequence analysis provides important clues regarding the characteristics and taxonomic position of an organism. DNA sequence analysis is therefore crucial for learning about the hierarchical classification of that organism. This dataset (patent US 7547531) was chosen to simplify the complex raw data buried in undisclosed DNA sequences, which helps to open doors for new collaborations. A total of 48 unidentified DNA sequences from patent US 7547531 were selected and their complete sequences were retrieved from the NCBI BioSample database. Quick response (QR) codes of those DNA sequences were constructed with the DNA BarID tool. The QR code is useful for the identification and comparison of isolates with other organisms. AT/GC content of the DNA sequences was determined using the ENDMEMO GC Content Calculator, which indicates their stability at different temperatures. The highest GC content was observed in GP445188 (62.5%), followed by GP445198 (61.8%) and GP445189 (59.44%), while the lowest was in GP445178 (24.39%). In addition, the New England BioLabs (NEB) database was used to identify the cleavage code indicating the 5', 3' and blunt ends, and the enzyme code indicating the methylation site of the DNA sequences was also shown. These data will be helpful for the construction of the organisms' hierarchical classification, determination of their phylogenetic and taxonomic position and revelation of their molecular characteristics.
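The GC percentages quoted above come from an online calculator whose exact rules are not described here. The computation itself can be sketched as follows. Ignoring ambiguous symbols such as N is an assumption of this sketch, and the function name is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Return the GC content of a DNA string as a percentage.

    Only unambiguous bases (A, C, G, T) are counted; anything else,
    such as N or whitespace, is ignored.
    """
    counted = [c for c in seq.upper() if c in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A, C, G or T")
    gc = sum(1 for c in counted if c in "GC")
    return 100.0 * gc / len(counted)


if __name__ == "__main__":
    print(gc_content("GGCCAATT"))  # half of the bases are G or C
```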
Fluorescent DNA-templated silver nanoclusters

NASA Astrophysics Data System (ADS)

Lin, Ruoqian

Because of the ultra-small size and biocompatibility of silver nanoclusters, they have attracted much research interest for their applications in biolabeling. Among the many ways of synthesizing silver nanoclusters, the DNA-templated method is particularly attractive: the high tunability of DNA sequences provides another degree of freedom for controlling the chemical and photophysical properties. However, systematic studies of how DNA sequences and concentrations control the photophysical properties are still lacking. The aim of this thesis is to investigate the binding mechanisms between silver clusters and single-stranded DNAs. We report the synthesis and characterization of DNA-templated silver nanoclusters and provide a systematic interrogation of the effects of DNA concentrations and sequences, including lengths and secondary structures. We performed a series of syntheses utilizing five different sequences to explore the optimal synthesis conditions. By characterizing samples with UV-vis and fluorescence spectroscopy, we identified the most suitable reactant ratio and synthesis conditions. Two of them were chosen for further concentration-dependence and sequence-dependence studies. We found that cytosine-rich sequences are more likely to produce silver nanoclusters with stronger fluorescence signals; however, sequences with hairpin secondary structures are more capable of stabilizing silver nanoclusters. In addition, the fluorescence peak emission intensities and wavelengths of the DNA-templated silver clusters have sequence-dependent fingerprints, which could potentially be applied to sequence sensing in the future. However, the current conclusions are not yet firmly established; it remains difficult to formulate general rules for DNA strand design and silver nanocluster production. Further investigation of more sequences could resolve these questions in the future.
Electrical responses of artificial DNA nanostructures on solution-processed In-Ga-Zn-O thin-film transistors with multistacked active layers.

PubMed

Jung, Joohye; Kim, Si Joon; Yoon, Doo Hyun; Kim, Byeonghoon; Park, Sung Ha; Kim, Hyun Jae

2013-01-01

We propose solution-processed In-Ga-Zn-O (IGZO) thin-film transistors (TFTs) with multistacked active layers for detecting artificial deoxyribonucleic acid (DNA). Enhanced sensing ability and stable electrical performance of TFTs were achieved through use of multistacked active layers. Our IGZO TFT had a turn-on voltage (V(on)) of -0.8 V and a subthreshold swing (SS) value of 0.48 V/decade. A dry-wet method was adopted to immobilize double-crossover DNA on the IGZO surface, after which an anomalous hump effect accompanying a significant decrease in V(on) (-13.6 V) and degradation of SS (1.29 V/decade) was observed. This sensing behavior was attributed to the middle interfaces of the multistacked active layers and the negatively charged phosphate groups on the DNA backbone, which generated a parasitic path in the TFT device. These results compared favorably with those reported for conventional field-effect transistor-based DNA sensors with remarkable sensitivity and stability.
Phylogeography of the endangered Cathaya argyrophylla (Pinaceae) inferred from sequence variation of mitochondrial and nuclear DNA.

PubMed

Wang, Hong-Wei; Ge, Song

2006-11-01

Cathaya argyrophylla is an endangered conifer restricted to subtropical mountains of China. To study the phylogeographical pattern and demographic history of C. argyrophylla, species-wide genetic variation was investigated using sequences of maternally inherited mtDNA and biparentally inherited nuclear DNA. Of 15 populations sampled from all four distinct regions, only three mitotypes were detected at two loci, with no single region having a mixed composition (G(ST) = 1). Average nucleotide diversity (theta(ws) = 0.0024; pi(s) = 0.0029) across eight nuclear loci is significantly lower than that found for other conifers (theta(ws) = 0.003-0.015; pi(s) = 0.002-0.012) based on estimates from multiple loci. Because it had the highest diversity among the eight nuclear loci and evolves neutrally, one locus (2009) was further used for phylogeographical studies, and eight haplotypes resulting from 12 polymorphic sites were obtained from 98 individuals. All four distinct regions had at least four haplotypes, with the Dalou region (DL) having the highest diversity and the Bamian region (BM) the lowest, paralleling the result of the eight nuclear loci. An AMOVA revealed a significant proportion of diversity attributable to differences among regions (13.4%) and among populations within regions (8.9%). F(ST) analysis also indicated significantly high differentiation among populations (F(ST) = 0.22) and between regions (F(ST) = 0.12-0.38). Non-overlapping distribution of mitotypes and high genetic differentiation among the distinct geographical groups suggest the existence of at least four separate glacial refugia. Based on network and mismatch distribution analyses, we do not find evidence of long distance dispersal and population expansion in C. argyrophylla. Ex situ conservation and artificial crossing are recommended for the management of this endangered species.
cDNA cloning, genomic organization and expression analysis during somatic embryogenesis of the translationally controlled tumor protein (TCTP) gene from Japanese larch (Larix leptolepis).

PubMed

Zhang, Li-Feng; Li, Wan-Feng; Han, Su-Ying; Yang, Wen-Hua; Qi, Li-Wang

2013-10-15

A full-length cDNA and genomic sequences of a translationally controlled tumor protein (TCTP) gene were isolated from Japanese larch (Larix leptolepis) and designated LaTCTP. The length of the cDNA was 1,043 bp and contained a 504 bp open reading frame that encodes a predicted protein of 167 amino acids, characterized by two signature sequences of the TCTP protein family. Analysis of the LaTCTP gene structure indicated four introns and five exons, and it is the largest of all currently known TCTP genes in plants. The 5'-flanking promoter region of LaTCTP was cloned using an improved TAIL-PCR technique. In this region we identified many important potential cis-acting elements, such as a Box-W1 (fungal elicitor responsive element), a CAT-box (cis-acting regulatory element related to meristem expression), a CGTCA-motif (cis-acting regulatory element involved in MeJA-responsiveness), a GT1-motif (light responsive element), a Skn-1-motif (cis-acting regulatory element required for endosperm expression) and a TGA-element (auxin-responsive element), suggesting that expression of LaTCTP is highly regulated. Expression analysis demonstrated ubiquitous localization of LaTCTP mRNA in the roots, stems and needles, high mRNA levels in the embryonal-suspensor mass (ESM), browning embryogenic cultures and mature somatic embryos, and low levels of mRNA at day five during somatic embryogenesis. We suggest that LaTCTP might participate in the regulation of somatic embryo development. These results provide a theoretical basis for understanding the molecular regulatory mechanism of LaTCTP and lay the foundation for artificial regulation of somatic embryogenesis. © 2013.
A novel mode for transcription inhibition mediated by PNA-induced R-loops with a model in vitro system.

PubMed

D'Souza, Alicia D; Belotserkovskii, Boris P; Hanawalt, Philip C

2018-02-01

The selective inhibition of transcription of a chosen gene by an artificial agent has numerous applications. Usually, these agents are designed to bind a specific nucleotide sequence in the promoter or within the transcribed region of the chosen gene. However, since optimal binding sites might not exist within the gene, it is of interest to explore the possibility of transcription inhibition when the agent is designed to bind at other locations. One of these possibilities arises when an additional transcription initiation site (e.g. secondary promoter) is present upstream from the primary promoter of the target gene. In this case, transcription inhibition might be achieved by inducing the formation of an RNA-DNA hybrid (R-loop) upon transcription from the secondary promoter. The R-loop could extend into the region of the primary promoter, to interfere with promoter recognition by RNA polymerase and thereby inhibit transcription. As a sequence-specific R-loop-inducing agent, a peptide nucleic acid (PNA) could be designed to facilitate R-loop formation by sequestering the non-template DNA strand. To investigate this mode for transcription inhibition, we have employed a model system in which a PNA binding site is localized between the T3 and T7 phage RNA polymerase promoters, which respectively assume the roles of primary and secondary promoters. In accord with our model, we have demonstrated that with PNA-bound DNA substrates, transcription from the T7 promoter reduces transcription from the T3 promoter by 30-fold, while in the absence of PNA binding there is no significant effect of T7 transcription upon T3 transcription. Copyright © 2018 Elsevier B.V. All rights reserved.
Cloning and sequence analysis of complementary DNA encoding an aberrantly rearranged human T-cell gamma chain.

PubMed Central

Dialynas, D P; Murre, C; Quertermous, T; Boss, J M; Leiden, J M; Seidman, J G; Strominger, J L

1986-01-01

Complementary DNA (cDNA) encoding a human T-cell gamma chain has been cloned and sequenced. At the junction of the variable and joining regions, there is an apparent deletion of two nucleotides in the human cDNA sequence relative to the murine gamma-chain cDNA sequence, resulting simultaneously in the generation of an in-frame stop codon and in a translational frameshift. For this reason, the sequence presented here encodes an aberrantly rearranged human T-cell gamma chain. There are several surprising differences between the deduced human and murine gamma-chain amino acid sequences. These include poor homology in the variable region, poor homology in a discrete segment of the constant region precisely bounded by the expected junctions of exon CII, and the presence in the human sequence of five potential sites for N-linked glycosylation. PMID:3458221
'DNA Strider': a 'C' program for the fast analysis of DNA and protein sequences on the Apple Macintosh family of computers.

PubMed Central

Marck, C

1988-01-01

DNA Strider is a new integrated DNA and Protein sequence analysis program written in the C language for the Macintosh Plus, SE and II computers. It has been designed as an easy to learn and use program as well as a fast and efficient tool for the day-to-day sequence analysis work. The program consists of a multi-window sequence editor and of various DNA and Protein analysis functions. The editor may use 4 different types of sequences (DNA, degenerate DNA, RNA and one-letter coded protein) and can handle simultaneously 6 sequences of any type up to 32.5 kB each. Negative numbering of the bases is allowed for DNA sequences. All classical restriction and translation analysis functions are present and can be performed in any order on any open sequence or part of a sequence. The main feature of the program is that the same analysis function can be repeated several times on different sequences, thus generating multiple windows on the screen. Many graphic capabilities have been incorporated, such as a graphic restriction map, a hydrophobicity profile and the CAI plot (codon adaptation index according to Sharp and Li). The restriction sites search uses a newly designed fast hexamer look-ahead algorithm. Typical runtime for the search of all sites with a library of 130 restriction endonucleases is 1 second per 10,000 bases. The circular graphic restriction map of the pBR322 plasmid can therefore be computed from its sequence and displayed on the Macintosh Plus screen within 2 seconds and its multiline restriction map obtained in a scrolling window within 5 seconds. PMID:2832831
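The hexamer look-ahead algorithm mentioned above is not described in the abstract, so the following is not a reconstruction of it. It is a plain substring scan that shows what a restriction site search produces. The three recognition sequences are standard; the function name, the 1-based reporting convention and the treatment of the sequence as linear are assumptions of this sketch.

```python
# Recognition sequences of three common restriction endonucleases.
ENZYMES = {
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "HindIII": "AAGCTT",
}


def find_sites(seq: str, enzymes: dict) -> dict:
    """Return 1-based start positions of each recognition site in `seq`.

    The sequence is treated as linear, so sites spanning the origin of a
    circular plasmid are not reported. Overlapping sites are reported.
    """
    seq = seq.upper()
    hits = {}
    for name, site in enzymes.items():
        positions = []
        start = seq.find(site)
        while start != -1:
            positions.append(start + 1)  # convert to 1-based numbering
            start = seq.find(site, start + 1)
        hits[name] = positions
    return hits


if __name__ == "__main__":
    print(find_sites("AAGAATTCTTGGATCCGAATTC", ENZYMES))
```

Because all three sites are palindromic, scanning one strand is sufficient. Non-palindromic sites would also need a scan of the reverse complement.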
Sequence-dependent DNA deformability studied using molecular dynamics simulations.

PubMed

Fujii, Satoshi; Kono, Hidetoshi; Takenaka, Shigeori; Go, Nobuhiro; Sarai, Akinori

2007-01-01

Proteins recognize specific DNA sequences not only through direct contact between amino acids and bases, but also indirectly based on the sequence-dependent conformation and deformability of the DNA (indirect readout). We used molecular dynamics simulations to analyze the sequence-dependent DNA conformations of all 136 possible tetrameric sequences sandwiched between CGCG sequences. The deformability of dimeric steps obtained from the simulations is consistent with that derived from crystal structures. The simulation results further showed that the conformation and deformability of the tetramers can depend strongly on the flanking base pairs. The xATx tetramers are the most rigid and their conformations are not affected by the flanking base pairs, whereas the xYRx tetramers show the greatest flexibility and change their conformations depending on the base pairs at both ends, suggesting that tetramers with the same central dimer can show different deformabilities. These results suggest that analysis of dimeric steps alone may overlook some conformational features of DNA and provide insight into the mechanism of indirect readout during protein-DNA recognition. Moreover, the sequence dependence of DNA conformation and deformability may be used to estimate the contribution of indirect readout to the specificity of protein-DNA recognition as well as nucleosome positioning and large-scale behavior of nucleic acids.
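The figure of 136 tetramers follows from counting double-stranded sequences: of the 4^4 = 256 four-base strings, each one describes the same duplex as its reverse complement, and 16 strings are their own reverse complement, giving (256 + 16) / 2 = 136. The short sketch below checks this by enumeration. The function names are illustrative and not taken from the paper.

```python
from itertools import product

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def unique_duplex_kmers(k: int) -> list:
    """List k-mers that are distinct as double-stranded sequences.

    Each k-mer is represented by the alphabetically smaller of itself
    and its reverse complement.
    """
    seen = set()
    for letters in product("ACGT", repeat=k):
        kmer = "".join(letters)
        seen.add(min(kmer, reverse_complement(kmer)))
    return sorted(seen)


if __name__ == "__main__":
    print(len(unique_duplex_kmers(2)), len(unique_duplex_kmers(4)))
```

The same enumeration gives the 10 unique dimeric steps commonly used in analyses of DNA deformability.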
CROSS-DISCIPLINARY PHYSICS AND RELATED AREAS OF SCIENCE AND TECHNOLOGY: Characteristics of alternating current hopping conductivity in DNA sequences

NASA Astrophysics Data System (ADS)

Ma, Song-Shan; Xu, Hui; Wang, Huan-You; Guo, Rui

2009-08-01

This paper presents a model to describe the alternating current (AC) conductivity of DNA sequences, in which DNA is considered as a one-dimensional (1D) disordered system and electrons are transported via hopping between localized states. The model shows that the AC conductivity in DNA sequences increases as the frequency of the external electric field rises, and that it takes the form $\sigma_{ac}(\omega) \sim \omega^2 \ln^2(1/\omega)$. The AC conductivity of DNA sequences also increases with increasing temperature, although the temperature dependence is weak. Meanwhile, the AC conductivity in an off-diagonally correlated case is much larger than that in the uncorrelated case of the Anderson limit at low temperatures, which indicates that the off-diagonal correlations in DNA sequences have a great effect on the AC conductivity, while at high temperature the off-diagonal correlations no longer play a vital role in electric transport. In addition, the proportion of nucleotide pairs p also plays an important role in AC electron transport of DNA sequences. For p < 0.5, the conductivity of a DNA sequence decreases with the increase of p, while for p >= 0.5, the conductivity increases with the increase of p.
Phylogenetic relationships of the Gomphales based on nuc-25S-rDNA, mit-12S-rDNA, and mit-atp6-DNA combined sequences

Treesearch

Admir J. Giachini; Kentaro Hosaka; Eduardo Nouhra; Joseph Spatafora; James M. Trappe

2010-01-01

Phylogenetic relationships among Geastrales, Gomphales, Hysterangiales, and Phallales were estimated via combined sequences: nuclear large subunit ribosomal DNA (nuc-25S-rDNA), mitochondrial small subunit ribosomal DNA (mit-12S-rDNA), and mitochondrial atp6 DNA (mit-atp6-DNA). Eighty-one taxa comprising 19 genera and 58 species...
Controlled assembly of artificial protein-protein complexes via DNA duplex formation.

PubMed

Płoskoń, Eliza; Wagner, Sara C; Ellington, Andrew D; Jewett, Michael C; O'Reilly, Rachel; Booth, Paula J

2015-03-18

DNA-protein conjugates have found a wide range of applications. This study demonstrates the formation of defined, non-native protein-protein complexes via the site specific labeling of two proteins of interest with complementary strands of single-stranded DNA in vitro. This study demonstrates that the affinity of two DNA-protein conjugates for one another may be tuned by the use of variable lengths of DNA allowing reversible control of complex formation.
Phylogeny of the Celastraceae inferred from 26S nuclear ribosomal DNA, phytochrome B, rbcL, atpB, and morphology.

PubMed

Simmons, M P; Savolainen, V; Clevinger, C C; Archer, R H; Davis, J I

2001-06-01

Phylogenetic relationships within Celastraceae (spindle-tree family) were inferred from nucleotide sequence characters from the 5' end of 26S nuclear ribosomal DNA (including expansion segments D1-D3; 84 species sampled), phytochrome B (58 species), rbcL (31 species), atpB (23 species), and morphology (94 species). Among taxa of questionable affinity, Forsellesia is a member of Crossosomataceae, and Goupia is excluded from Celastraceae. However, Brexia, Canotia, Lepuropetalon, Parnassia, Siphonodon, and Stackhousiaceae are supported as members of Celastraceae. Gymnosporia and Tricerma are distinct from Maytenus, Cassine is supported as distinct from Elaeodendron, and Dicarpellum is distinct from Salacia. Catha, Maytenus, and Pristimera are not resolved as natural genera. Hippocrateaceae (including Plagiopteron and Lophopetalum) are a clade nested within a paraphyletic Celastraceae. These data also suggest that Loesener's classification of Celastraceae sensu stricto and Hallé's classification of Hippocrateaceae are artificial. The diversification of the fruit and aril within Celastraceae appears to be complex, with multiple origins of most fruit and aril forms. Copyright 2001 Academic Press.
Engineering M13 for phage display.

PubMed

Sidhu, S S

2001-09-01

Phage display is achieved by fusing polypeptide libraries to phage coat proteins. The resulting phage particles display the polypeptides on their surfaces and they also contain the encoding DNA. Library members with particular functions can be isolated with simple selections and polypeptide sequences can be decoded from the encapsulated DNA. The technology's success depends on the efficiency with which polypeptides can be displayed on the phage surface, and significant progress has been made in engineering M13 bacteriophage coat proteins as improved phage display platforms. Functional display has been achieved with all five M13 coat proteins, with both N- and C-terminal fusions. Also, coat protein mutants have been designed and selected to improve the efficiency of heterologous protein display, and in the extreme case, completely artificial coat proteins have been evolved specifically as display platforms. These studies demonstrate that the M13 phage coat is extremely malleable, and this property can be used to engineer the phage particle specifically for phage display. These improvements expand the utility of phage display as a powerful tool in modern biotechnology.
Influence of major-groove chemical modifications of DNA on transcription by bacterial RNA polymerases.

PubMed

Raindlová, Veronika; Janoušková, Martina; Slavíčková, Michaela; Perlíková, Pavla; Boháčová, Soňa; Milisavljevič, Nemanja; Šanderová, Hana; Benda, Martin; Barvík, Ivan; Krásný, Libor; Hocek, Michal

2016-04-20

DNA templates containing a set of base modifications in the major groove (5-substituted pyrimidines or 7-substituted 7-deazapurines bearing H, methyl, vinyl, ethynyl or phenyl groups) were prepared by PCR using the corresponding base-modified 2'-deoxyribonucleoside triphosphates (dNTPs). The modified templates were used in an in vitro transcription assay using RNA polymerase from Bacillus subtilis and Escherichia coli. Some modified nucleobases bearing smaller modifications (H, Me in 7-deazapurines) were perfectly tolerated by both enzymes, whereas bulky modifications (Ph at any nucleobase) and, surprisingly, uracil blocked transcription. Some middle-sized modifications (vinyl or ethynyl) were partly tolerated, mostly by the E. coli enzyme. In all cases where the transcription proceeded, full length RNA product with correct sequence was obtained, indicating that the modifications of the template are not mutagenic and the inhibition is probably at the stage of initiation. The results are promising for the development of bioorthogonal reactions for artificial chemical switching of the transcription. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Method for performing site-specific affinity fractionation for use in DNA sequencing

DOEpatents

Mirzabekov, Andrei Darievich; Lysov, Yuri Petrovich; Dubley, Svetlana A.

1999-01-01

A method for fractionating and sequencing DNA via affinity interaction is provided comprising contacting cleaved DNA to a first array of oligonucleotide molecules to facilitate hybridization between said cleaved DNA and the molecules; extracting the hybridized DNA from the molecules; contacting said extracted hybridized DNA with a second array of oligonucleotide molecules, wherein the oligonucleotide molecules in the second array have specified base sequences that are complementary to said extracted hybridized DNA; and attaching labeled DNA to the second array of oligonucleotide molecules, wherein the labeled re-hybridized DNA have sequences that are complementary to the oligomers. The invention further provides a method for performing multi-step conversions of the chemical structure of compounds comprising supplying an array of polyacrylamide vessels separated by hydrophobic surfaces; immobilizing a plurality of reactants, such as enzymes, in the vessels so that each vessel contains one reactant; contacting the compounds to each of the vessels in a predetermined sequence and for a sufficient time to convert the compounds to a desired state; and isolating the converted compounds from said array.
Miniaturized reaction vessel system, method for performing site-specific biochemical reactions and affinity fractionation for use in DNA sequencing

DOEpatents

Mirzabekov, Andrei Darievich; Lysov, Yuri Petrovich; Dubley, Svetlana A.

2000-01-01

A method for fractionating and sequencing DNA via affinity interaction is provided comprising contacting cleaved DNA to a first array of oligonucleotide molecules to facilitate hybridization between said cleaved DNA and the molecules; extracting the hybridized DNA from the molecules; contacting said extracted hybridized DNA with a second array of oligonucleotide molecules, wherein the oligonucleotide molecules in the second array have specified base sequences that are complementary to said extracted hybridized DNA; and attaching labeled DNA to the second array of oligonucleotide molecules, wherein the labeled re-hybridized DNA have sequences that are complementary to the oligomers. The invention further provides a method for performing multi-step conversions of the chemical structure of compounds comprising supplying an array of polyacrylamide vessels separated by hydrophobic surfaces; immobilizing a plurality of reactants, such as enzymes, in the vessels so that each vessel contains one reactant; contacting the compounds to each of the vessels in a predetermined sequence and for a sufficient time to convert the compounds to a desired state; and isolating the converted compounds from said array.
DNABIT Compress - Genome compression algorithm.

PubMed

Rajarajeswari, Pothuraju; Apparao, Allam

2011-01-22

Data compression is concerned with how information is organized in data. Efficient storage means removal of redundancy from the data being stored in the DNA molecule. Data compression algorithms remove redundancy and are used to understand biologically important molecules. We present a compression algorithm, "DNABIT Compress", for DNA sequences, based on a novel scheme of assigning binary bits to smaller segments of DNA bases to compress both repetitive and non-repetitive DNA sequences. Our proposed algorithm achieves the best compression ratio for DNA sequences of larger genomes. Significantly better compression results show that the "DNABIT Compress" algorithm outperforms the other compression algorithms. While achieving the best compression ratios for DNA sequences (genomes), our new DNABIT Compress algorithm significantly improves the running time of all previous DNA compression programs. Assigning binary bits (a unique bit code) to exact-repeat and reverse-repeat fragments of a DNA sequence is also a concept introduced in this algorithm for the first time in DNA compression. The proposed algorithm achieves a compression ratio as low as 1.58 bits/base, whereas the best existing methods could not achieve a ratio below 1.72 bits/base.
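For context on the bits-per-base figures quoted above, the following is a minimal sketch of a fixed 2-bit code, which stores any A/C/G/T sequence at exactly 2 bits/base. It is a baseline for comparison only and is not the DNABIT Compress scheme; the names `CODE`, `pack`, `unpack`, and `bits_per_base` are hypothetical.

```python
# Hypothetical fixed-width baseline: every base costs exactly 2 bits.
# This is NOT the DNABIT Compress algorithm, only a reference point.
CODE = {"A": 0b00, "C": 0b01, "G": 0b10, "T": 0b11}
BASES = "ACGT"


def pack(seq: str) -> bytes:
    """Pack an A/C/G/T string into bytes, four bases per byte."""
    out = bytearray()
    for i in range(0, len(seq), 4):
        chunk = seq[i:i + 4]
        byte = 0
        for base in chunk:
            byte = (byte << 2) | CODE[base]
        # Left-align a short final chunk so unpacking reads from the top bits.
        byte <<= 2 * (4 - len(chunk))
        out.append(byte)
    return bytes(out)


def unpack(data: bytes, n_bases: int) -> str:
    """Invert pack(); n_bases is needed because the last byte may be padded."""
    bases = []
    for byte in data:
        for shift in (6, 4, 2, 0):
            bases.append(BASES[(byte >> shift) & 0b11])
    return "".join(bases[:n_bases])


def bits_per_base(n_bits: int, n_bases: int) -> float:
    """Compression ratio in the units used by the abstract."""
    return n_bits / n_bases
```

Any method reporting fewer than 2 bits/base, such as the 1.58 and 1.72 figures above, must therefore exploit redundancy (for example repeats) rather than rely on a fixed-width code.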
Method for performing site-specific affinity fractionation for use in DNA sequencing

DOEpatents

Mirzabekov, A.D.; Lysov, Y.P.; Dubley, S.A.

1999-05-18

A method for fractionating and sequencing DNA via affinity interaction is provided comprising contacting cleaved DNA to a first array of oligonucleotide molecules to facilitate hybridization between the cleaved DNA and the molecules; extracting the hybridized DNA from the molecules; contacting the extracted hybridized DNA with a second array of oligonucleotide molecules, wherein the oligonucleotide molecules in the second array have specified base sequences that are complementary to the extracted hybridized DNA; and attaching labeled DNA to the second array of oligonucleotide molecules, wherein the labeled re-hybridized DNA have sequences that are complementary to the oligomers. The invention further provides a method for performing multi-step conversions of the chemical structure of compounds comprising supplying an array of polyacrylamide vessels separated by hydrophobic surfaces; immobilizing a plurality of reactants, such as enzymes, in the vessels so that each vessel contains one reactant; contacting the compounds to each of the vessels in a predetermined sequence and for a sufficient time to convert the compounds to a desired state; and isolating the converted compounds from the array. 14 figs.

Partial DNA sequencing of Douglas-fir cDNAs used in RFLP mapping

Treesearch

K.D. Jermstad; D.L. Bassoni; C.S. Kinlaw; D.B. Neale

1998-01-01

DNA sequences from 87 Douglas-fir (Pseudotsuga menziesii [Mirb.] Franco) cDNA RFLP probes were determined. Sequences were submitted to the GenBank dbEST database and searched for similarity against nucleotide and protein databases using the BLASTn and BLASTx programs. Twenty-one sequences (24%) were assigned putative functions; 18 of which...
Entire plastid phylogeny of the carrot genus (Daucus, Apiaceae): Concordance with nuclear data and mitochondrial and nuclear DNA insertions to the plastid

USDA-ARS's Scientific Manuscript database

We explored the phylogenetic utility of entire plastid DNA sequences in Daucus and compared the results to prior phylogenetic results using plastid, nuclear, and mitochondrial DNA sequences. We obtained, using Illumina sequencing, full plastid sequences of 37 accessions of 20 Daucus taxa and outgrou...
Rhipicephalus microplus dataset of nonredundant raw sequence reads from 454 GS FLX sequencing of Cot-selected (Cot = 660) genomic DNA

USDA-ARS's Scientific Manuscript database

A reassociation kinetics-based approach was used to reduce the complexity of genomic DNA from the Deutsch laboratory strain of the cattle tick, Rhipicephalus microplus, to facilitate genome sequencing. Selected genomic DNA (Cot value = 660) was sequenced using 454 GS FLX technology, resulting in 356...
Discovery and information-theoretic characterization of transcription factor binding sites that act cooperatively.

PubMed

Clifford, Jacob; Adami, Christoph

2015-09-02

Transcription factor binding to the surface of DNA regulatory regions is one of the primary causes of regulating gene expression levels. A probabilistic approach to model protein-DNA interactions at the sequence level is through position weight matrices (PWMs) that estimate the joint probability of a DNA binding site sequence by assuming positional independence within the DNA sequence. Here we construct conditional PWMs that depend on the motif signatures in the flanking DNA sequence, by conditioning known binding site loci on the presence or absence of additional binding sites in the flanking sequence of each site's locus. Pooling known sites with similar flanking sequence patterns allows for the estimation of the conditional distribution function over the binding site sequences. We apply our model to the Dorsal transcription factor binding sites active in patterning the Dorsal-Ventral axis of Drosophila development. We find that those binding sites that cooperate with nearby Twist sites on average contain about 0.5 bits of information about the presence of Twist transcription factor binding sites in the flanking sequence. We also find that Dorsal binding site detectors conditioned on flanking sequence information make better predictions about what is a Dorsal site relative to background DNA than detection without information about flanking sequence features.
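The positional-independence assumption behind a PWM can be sketched as follows: the joint probability of a site is the product of per-position base probabilities, so a log-odds score is a sum over positions. This is an illustrative sketch only; the pseudocount, the uniform background of 0.25, and the function names are assumptions, and it does not implement the conditional PWMs described in the abstract.

```python
import math
from collections import Counter


def build_pwm(sites, pseudocount=1.0):
    """Estimate per-position base probabilities from aligned, equal-length sites."""
    length = len(sites[0])
    total = len(sites) + 4 * pseudocount
    pwm = []
    for pos in range(length):
        counts = Counter(site[pos] for site in sites)
        pwm.append({b: (counts[b] + pseudocount) / total for b in "ACGT"})
    return pwm


def log_odds_score(pwm, seq, background=0.25):
    """Sum of per-position log2 odds; valid only under positional independence."""
    return sum(math.log2(col[b] / background) for col, b in zip(pwm, seq))


def information_content(pwm):
    """Total information in bits relative to a uniform background (0 to 2 per position)."""
    return sum(
        2.0 + sum(p * math.log2(p) for p in col.values())
        for col in pwm
    )
```

A conditional model in the spirit of the abstract would, under this sketch, build one such matrix per flanking-sequence class (for example, sites with and without a nearby Twist site) and compare them.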
Real-Time DNA Sequencing in the Antarctic Dry Valleys Using the Oxford Nanopore Sequencer

PubMed Central

Johnson, Sarah S.; Zaikova, Elena; Goerlitz, David S.; Bai, Yu; Tighe, Scott W.

2017-01-01

The ability to sequence DNA outside of the laboratory setting has enabled novel research questions to be addressed in the field in diverse areas, ranging from environmental microbiology to viral epidemics. Here, we demonstrate the application of offline DNA sequencing of environmental samples using a hand-held nanopore sequencer in a remote field location: the McMurdo Dry Valleys, Antarctica. Sequencing was performed using a MK1B MinION sequencer from Oxford Nanopore Technologies (ONT; Oxford, United Kingdom) that was equipped with software to operate without internet connectivity. One-direction (1D) genomic libraries were prepared using portable field techniques on DNA isolated from desiccated microbial mats. By adequately insulating the sequencer and laptop, it was possible to run the sequencing protocol for up to 2½ h under arduous conditions. PMID:28337073
Species-Specific Detection of Mycosphaerella polygoni-cuspidati as a Biological Control Agent for Fallopia japonica by PCR Assay.

PubMed

Kurose, Daisuke; Furuya, Naruto; Saeki, Tetsuya; Tsuchiya, Kenichi; Tsushima, Seiya; Seier, Marion K

2016-10-01

The ascomycete fungus Mycosphaerella polygoni-cuspidati has been undergoing evaluation as a potential classical biological control agent for the invasive weed Fallopia japonica (Japanese knotweed), which has become troublesome in Europe and North America. In advance of the potential release of a biocontrol agent into a new environment, it is crucial to develop an effective monitoring system to enable the evaluation of agent establishment and dispersal within the target host population, as well as any potential attacks on non-target species. Therefore, a primer pair was designed for direct, rapid, and specific detection of the Japanese knotweed pathogen M. polygoni-cuspidati based on the sequences of the internal transcribed spacer regions including the 5.8S rDNA. A PCR product of approximately 298 bp was obtained only when the DNA extracted from mycelial fragments of M. polygoni-cuspidati was used. The lower limit of detection of the PCR method was 100 fg of genomic DNA. Using the specific primer pair, M. polygoni-cuspidati could be detected from both naturally and artificially infected Japanese knotweed plants. No amplification was observed for other Mycosphaerella spp. or fungal endophytes isolated from F. japonica. The designed primer pair is thus effective for the specific detection of M. polygoni-cuspidati in planta.
Multiplexed Sequence Encoding: A Framework for DNA Communication.

PubMed

Zakeri, Bijan; Carr, Peter A; Lu, Timothy K

2016-01-01

Synthetic DNA has great propensity for efficiently and stably storing non-biological information. With DNA writing and reading technologies rapidly advancing, new applications for synthetic DNA are emerging in data storage and communication. Traditionally, DNA communication has focused on the encoding and transfer of complete sets of information. Here, we explore the use of DNA for the communication of short messages that are fragmented across multiple distinct DNA molecules. We identified three pivotal points in a communication (data encoding, data transfer, and data extraction) and developed novel tools to enable communication via molecules of DNA. To address data encoding, we designed DNA-based individualized keyboards (iKeys) to convert plaintext into DNA, while reducing the occurrence of DNA homopolymers to improve synthesis and sequencing processes. To address data transfer, we implemented a secret-sharing system, Multiplexed Sequence Encoding (MuSE), that conceals messages between multiple distinct DNA molecules, requiring a combination key to reveal messages. To address data extraction, we achieved the first instance of chromatogram patterning through multiplexed sequencing, thereby enabling a new method for data extraction. We envision these approaches will enable more widespread communication of information via DNA.
An improved divergent synthesis of comb-type branched oligodeoxyribonucleotides (bDNA) containing multiple secondary sequences.

PubMed

Horn, T; Chang, C A; Urdea, M S

1997-12-01

The divergent synthesis of branched DNA (bDNA) comb structures is described. This new type of bDNA contains one unique oligonucleotide, the primary sequence, covalently attached through a comb-like branch network to many identical copies of a different oligonucleotide, the secondary sequence. The bDNA comb structures were assembled on a solid support and several synthesis parameters were investigated and optimized. The bDNA comb molecules were characterized by polyacrylamide gel electrophoretic methods and by controlled cleavage at periodate-cleavable moieties incorporated during synthesis. The developed chemistry allows synthesis of bDNA comb molecules containing multiple secondary sequences. In the accompanying article we describe the synthesis and characterization of large bDNA combs containing all four deoxynucleotides for use as signal amplifiers in nucleic acid quantification assays.
An improved divergent synthesis of comb-type branched oligodeoxyribonucleotides (bDNA) containing multiple secondary sequences.

PubMed Central

Horn, T; Chang, C A; Urdea, M S

1997-01-01

The divergent synthesis of branched DNA (bDNA) comb structures is described. This new type of bDNA contains one unique oligonucleotide, the primary sequence, covalently attached through a comb-like branch network to many identical copies of a different oligonucleotide, the secondary sequence. The bDNA comb structures were assembled on a solid support and several synthesis parameters were investigated and optimized. The bDNA comb molecules were characterized by polyacrylamide gel electrophoretic methods and by controlled cleavage at periodate-cleavable moieties incorporated during synthesis. The developed chemistry allows synthesis of bDNA comb molecules containing multiple secondary sequences. In the accompanying article we describe the synthesis and characterization of large bDNA combs containing all four deoxynucleotides for use as signal amplifiers in nucleic acid quantification assays. PMID:9365265
A 28,000 Years Old Cro-Magnon mtDNA Sequence Differs from All Potentially Contaminating Modern Sequences

PubMed Central

Caramelli, David; Milani, Lucio; Vai, Stefania; Modi, Alessandra; Pecchioli, Elena; Girardi, Matteo; Pilli, Elena; Lari, Martina; Lippi, Barbara; Ronchitelli, Annamaria; Mallegni, Francesco; Casoli, Antonella; Bertorelle, Giorgio; Barbujani, Guido

2008-01-01

Background: DNA sequences from ancient specimens may in fact result from undetected contamination of the ancient specimens by modern DNA, and the problem is particularly challenging in studies of human fossils. Doubts on the authenticity of the available sequences have so far hampered genetic comparisons between anatomically archaic (Neandertal) and early modern (Cro-Magnoid) Europeans. Methodology/Principal Findings: We typed the mitochondrial DNA (mtDNA) hypervariable region I in a 28,000 years old Cro-Magnoid individual from the Paglicci cave, in Italy (Paglicci 23) and in all the people who had contact with the sample since its discovery in 2003. The Paglicci 23 sequence, determined through the analysis of 152 clones, is the Cambridge reference sequence, and cannot possibly reflect contamination because it differs from all potentially contaminating modern sequences. Conclusions/Significance: The Paglicci 23 individual carried a mtDNA sequence that is still common in Europe, and which radically differs from those of the almost contemporary Neandertals, demonstrating a genealogical continuity across 28,000 years, from Cro-Magnoid to modern Europeans. Because all potential sources of modern DNA contamination are known, the Paglicci 23 sample will offer a unique opportunity to get insight for the first time into the nuclear genes of early modern Europeans. PMID:18628960
High-Throughput Block Optical DNA Sequence Identification.

PubMed

Sagar, Dodderi Manjunatha; Korshoj, Lee Erik; Hanson, Katrina Bethany; Chowdhury, Partha Pratim; Otoupal, Peter Britton; Chatterjee, Anushree; Nagpal, Prashant

2018-01-01

Optical techniques for molecular diagnostics or DNA sequencing generally rely on small molecule fluorescent labels, which utilize light with a wavelength of several hundred nanometers for detection. Developing a label-free optical DNA sequencing technique will require nanoscale focusing of light, a high-throughput and multiplexed identification method, and a data compression technique to rapidly identify sequences and analyze genomic heterogeneity for big datasets. Such a method should identify characteristic molecular vibrations using optical spectroscopy, especially in the "fingerprinting region" from ≈400-1400 cm⁻¹. Here, surface-enhanced Raman spectroscopy is used to demonstrate label-free identification of DNA nucleobases with multiplexed 3D plasmonic nanofocusing. While nanometer-scale mode volumes prevent identification of single nucleobases within a DNA sequence, the block optical technique can identify A, T, G, and C content in DNA k-mers. The content of each nucleotide in a DNA block can be a unique and high-throughput method for identifying sequences, genes, and other biomarkers as an alternative to single-letter sequencing. Additionally, coupling two complementary vibrational spectroscopy techniques (infrared and Raman) can improve block characterization. These results pave the way for developing a novel, high-throughput block optical sequencing method with lossy genomic data compression using k-mer identification from multiplexed optical data acquisition. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
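The idea of describing a sequence by the A, T, G, and C content of its k-mer blocks, rather than by the order of single letters, can be sketched in a few lines. This is a purely computational illustration under the assumption of non-overlapping blocks; the function name `block_content` is hypothetical and nothing here models the optical measurement itself.

```python
from collections import Counter


def block_content(seq, k):
    """Return (A, T, G, C) counts for each non-overlapping k-mer block of seq.

    Trailing bases that do not fill a whole block are ignored.
    """
    blocks = []
    for i in range(0, len(seq) - k + 1, k):
        counts = Counter(seq[i:i + k])
        blocks.append(tuple(counts[b] for b in "ATGC"))
    return blocks
```

The representation is lossy in the sense the abstract mentions: `"ATAT"` and `"TATA"` produce the same block content, so base order within a block is not recoverable from counts alone.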
Basic quantitative polymerase chain reaction using real-time fluorescence measurements.

PubMed

Ares, Manuel

2014-10-01

This protocol uses quantitative polymerase chain reaction (qPCR) to measure the number of DNA molecules containing a specific contiguous sequence in a sample of interest (e.g., genomic DNA or cDNA generated by reverse transcription). The sample is subjected to fluorescence-based PCR amplification and, theoretically, during each cycle, two new duplex DNA molecules are produced for each duplex DNA molecule present in the sample. The progress of the reaction during PCR is evaluated by measuring the fluorescence of dsDNA-dye complexes in real time. In the early cycles, DNA duplication is not detected because inadequate amounts of DNA are made. At a certain threshold cycle, DNA-dye complexes double each cycle for 8-10 cycles, until the DNA concentration becomes so high and the primer concentration so low that the reassociation of the product strands blocks efficient synthesis of new DNA and the reaction plateaus. There are two types of measurements: (1) the relative change of the target sequence compared to a reference sequence and (2) the determination of molecule number in the starting sample. The first requires a reference sequence, and the second requires a sample of the target sequence with known numbers of the molecules of sequence to generate a standard curve. By identifying the threshold cycle at which a sample first begins to accumulate DNA-dye complexes exponentially, an estimation of the numbers of starting molecules in the sample can be extrapolated. © 2014 Cold Spring Harbor Laboratory Press.
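The two measurement types described above can be sketched numerically. The example below assumes ideal doubling per cycle; it fits a standard curve of threshold cycle against log10 of known copy number, and it uses the commonly used 2^(-ΔΔCt) form for relative change, which the abstract does not name. All function names and the synthetic numbers in the usage note are hypothetical.

```python
import math


def fit_standard_curve(copies, ct_values):
    """Least-squares line Ct = slope * log10(copies) + intercept."""
    xs = [math.log10(c) for c in copies]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ct_values) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ct_values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def copies_from_ct(ct, slope, intercept):
    """Invert the standard curve to estimate starting copy number."""
    return 10 ** ((ct - intercept) / slope)


def relative_change(ct_target_sample, ct_ref_sample,
                    ct_target_control, ct_ref_control):
    """Fold change of target vs. reference, assuming perfect doubling per cycle."""
    ddct = ((ct_target_sample - ct_ref_sample)
            - (ct_target_control - ct_ref_control))
    return 2.0 ** (-ddct)
```

With perfect doubling the fitted slope is -1/log10(2), about -3.32 cycles per tenfold change in starting copies; real assays deviate from this, so the slope is usually taken as a check on amplification efficiency.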
Spiroplasma species share common DNA sequences among their viruses, plasmids and genomes.

PubMed

Ranhand, J M; Nur, I; Rose, D L; Tully, J G

1987-01-01

Alkaline-Southern-blot analyses showed that a spiroplasma plasmid, pRA1, obtained from Spiroplasma citri (Maroc-R8A2), contained DNA sequences that were homologous to spiroplasma type 3 viruses (SV3) obtained from S. citri (Maroc-R8A2), S. citri (608) and S. mirum (SMCA). In addition, pRA1 and SV3(608) DNA shared common, but not necessarily related, sequences with extrachromosomal DNA derived from 11 Spiroplasma species or strains. Furthermore, SV3(608) had DNA homology with the chromosome from 6 distinct spiroplasmas but not with chromosomal DNA from eight other Spiroplasma species or strains. The biological function of these common sequences is unknown.
Flow cytometric detection method for DNA samples

DOEpatents

Nasarabadi, Shanavaz [Livermore, CA; Langlois, Richard G [Livermore, CA; Venkateswaran, Kodumudi S [Round Rock, TX

2011-07-05

Disclosed herein are two methods for rapid multiplex analysis to determine the presence and identity of target DNA sequences within a DNA sample. Both methods use reporting DNA sequences, e.g., modified conventional TaqMan® probes, to combine multiplex PCR amplification with microsphere-based hybridization using flow cytometry means of detection. Real-time PCR detection can also be incorporated. The first method uses a cyanine dye, such as Cy3™, as the reporter linked to the 5' end of a reporting DNA sequence. The second method positions a reporter dye, e.g., FAM™, on the 3' end of the reporting DNA sequence and a quencher dye, e.g., TAMRA™, on the 5' end.
Flow cytometric detection method for DNA samples

DOEpatents

Nasarabadi, Shanavaz [Livermore, CA; Langlois, Richard G [Livermore, CA; Venkateswaran, Kodumudi S [Livermore, CA

2006-08-01

Disclosed herein are two methods for rapid multiplex analysis to determine the presence and identity of target DNA sequences within a DNA sample. Both methods use reporting DNA sequences, e.g., modified conventional TaqMan® probes, to combine multiplex PCR amplification with microsphere-based hybridization using flow cytometry means of detection. Real-time PCR detection can also be incorporated. The first method uses a cyanine dye, such as Cy3™, as the reporter linked to the 5' end of a reporting DNA sequence. The second method positions a reporter dye, e.g., FAM, on the 3' end of the reporting DNA sequence and a quencher dye, e.g., TAMRA, on the 5' end.
Method for sequencing DNA base pairs

DOEpatents

Sessler, A.M.; Dawson, J.

1993-12-14

The base pairs of a DNA structure are sequenced with the use of a scanning tunneling microscope (STM). The DNA structure is scanned by the STM probe tip, and, as it is being scanned, the DNA structure is separately subjected to a sequence of infrared radiation from four different sources, each source being selected to preferentially excite one of the four different bases in the DNA structure. Each particular base being scanned is subjected to such sequence of infrared radiation from the four different sources as that particular base is being scanned. The DNA structure as a whole is separately imaged for each subjection thereof to radiation from one only of each source. 6 figures.
Artificial-epitope mapping for CK-MB assay.

PubMed

Tai, Dar-Fu; Ho, Yi-Fang; Wu, Cheng-Hsin; Lin, Tzu-Chieh; Lu, Kuo-Hao; Lin, Kun-Shian

2011-06-07

A quantitative method using artificial antibody to detect creatine kinases was developed. Linear epitope sequences were selected based on an artificial-epitope mapping strategy. Nine different MIPs corresponding to the selected peptides were then fabricated on QCM chips. The subtle conformational changes were also recognized by these chips.
G-quadruplex and G-rich sequence stimulate Pif1p-catalyzed downstream duplex DNA unwinding through reducing waiting time at ss/dsDNA junction

PubMed Central

Zhang, Bo; Wu, Wen-Qiang; Liu, Na-Nv; Duan, Xiao-Lei; Li, Ming; Dou, Shuo-Xing; Hou, Xi-Miao; Xi, Xu-Guang

2016-01-01

Alternative DNA structures that deviate from B-form double-stranded DNA such as G-quadruplex (G4) DNA can be formed by G-rich sequences that are widely distributed throughout the human genome. We have previously shown that Pif1p not only unfolds G4, but also unwinds the downstream duplex DNA in a G4-stimulated manner. In the present study, we further characterized the G4-stimulated duplex DNA unwinding phenomenon by means of single-molecule fluorescence resonance energy transfer. It was found that Pif1p did not unwind the partial duplex DNA immediately after unfolding the upstream G4 structure, but rather, it would dwell at the ss/dsDNA junction with a ‘waiting time’. Further studies revealed that the waiting time was in fact related to a protein dimerization process that was sensitive to ssDNA sequence and would become rapid if the sequence is G-rich. Furthermore, we identified that the G-rich sequence, as the G4 structure, equally stimulates duplex DNA unwinding. The present work sheds new light on the molecular mechanism by which G4-unwinding helicase Pif1p resolves physiological G4/duplex DNA structures in cells. PMID:27471032
Continuous Influx of Genetic Material from Host to Virus Populations

PubMed Central

Gilbert, Clément; Peccoud, Jean; Chateigner, Aurélien; Moumen, Bouziane

2016-01-01

Many genes of large double-stranded DNA viruses have a cellular origin, suggesting that host-to-virus horizontal transfer (HT) of DNA is recurrent. Yet, the frequency of these transfers has never been assessed in viral populations. Here we used ultra-deep DNA sequencing of 21 baculovirus populations extracted from two moth species to show that a large diversity of moth DNA sequences (n = 86) can integrate into viral genomes during the course of a viral infection. The majority of the 86 different moth DNA sequences are transposable elements (TEs, n = 69) belonging to 10 superfamilies of DNA transposons and three superfamilies of retrotransposons. The remaining 17 sequences are moth sequences of unknown nature. In addition to bona fide DNA transposition, we uncover microhomology-mediated recombination as a mechanism explaining integration of moth sequences into viral genomes. Many sequences integrated multiple times at multiple positions along the viral genome. We detected a total of 27,504 insertions of moth sequences in the 21 viral populations and we calculate that on average, 4.8% of viruses harbor at least one moth sequence in these populations. Despite this substantial proportion, no insertion of moth DNA was maintained in any viral population after 10 successive infection cycles. Hence, there is a constant turnover of host DNA inserted into viral genomes each time the virus infects a moth. Finally, we found that at least 21 of the moth TEs integrated into viral genomes underwent repeated horizontal transfers between various insect species, including some lepidopterans susceptible to baculoviruses. Our results identify host DNA influx as a potent source of genetic diversity in viral populations. They also support a role for baculoviruses as vectors of DNA HT between insects, and call for an evaluation of possible gene or TE spread when using viruses as biopesticides or gene delivery vectors. PMID:26829124
Continuous Influx of Genetic Material from Host to Virus Populations.

PubMed

Gilbert, Clément; Peccoud, Jean; Chateigner, Aurélien; Moumen, Bouziane; Cordaux, Richard; Herniou, Elisabeth A

2016-02-01

Many genes of large double-stranded DNA viruses have a cellular origin, suggesting that host-to-virus horizontal transfer (HT) of DNA is recurrent. Yet, the frequency of these transfers has never been assessed in viral populations. Here we used ultra-deep DNA sequencing of 21 baculovirus populations extracted from two moth species to show that a large diversity of moth DNA sequences (n = 86) can integrate into viral genomes during the course of a viral infection. The majority of the 86 different moth DNA sequences are transposable elements (TEs, n = 69) belonging to 10 superfamilies of DNA transposons and three superfamilies of retrotransposons. The remaining 17 sequences are moth sequences of unknown nature. In addition to bona fide DNA transposition, we uncover microhomology-mediated recombination as a mechanism explaining integration of moth sequences into viral genomes. Many sequences integrated multiple times at multiple positions along the viral genome. We detected a total of 27,504 insertions of moth sequences in the 21 viral populations and we calculate that on average, 4.8% of viruses harbor at least one moth sequence in these populations. Despite this substantial proportion, no insertion of moth DNA was maintained in any viral population after 10 successive infection cycles. Hence, there is a constant turnover of host DNA inserted into viral genomes each time the virus infects a moth. Finally, we found that at least 21 of the moth TEs integrated into viral genomes underwent repeated horizontal transfers between various insect species, including some lepidopterans susceptible to baculoviruses. Our results identify host DNA influx as a potent source of genetic diversity in viral populations. They also support a role for baculoviruses as vectors of DNA HT between insects, and call for an evaluation of possible gene or TE spread when using viruses as biopesticides or gene delivery vectors.

A computer aided thermodynamic approach for predicting the formation of Z-DNA in naturally occurring sequences

NASA Technical Reports Server (NTRS)

Ho, P. S.; Ellison, M. J.; Quigley, G. J.; Rich, A.

1986-01-01

The ease with which a particular DNA segment adopts the left-handed Z-conformation depends largely on the sequence and on the degree of negative supercoiling to which it is subjected. We describe a computer program (Z-hunt) that is designed to search long sequences of naturally occurring DNA and retrieve those nucleotide combinations of up to 24 bp in length which show a strong propensity for Z-DNA formation. Incorporated into Z-hunt is a statistical mechanical model based on empirically determined energetic parameters for the B to Z transition accumulated to date. The Z-forming potential of a sequence is assessed by ranking its behavior as a function of negative superhelicity relative to the behavior of similar sized randomly generated nucleotide sequences assembled from over 80,000 combinations. The program makes it possible to compare directly the Z-forming potential of sequences with different base compositions and different sequence lengths. Using Z-hunt, we have analyzed the DNA sequences of the bacteriophage phi X174, plasmid pBR322, the animal virus SV40 and the replicative form of the eukaryotic adenovirus-2. The results are compared with those previously obtained by others from experiments designed to locate Z-DNA forming regions in these sequences using probes which show specificity for the left-handed DNA conformation.
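As a rough illustration of sequence scanning for Z-DNA candidates, the sketch below finds stretches in which purines and pyrimidines strictly alternate, a pattern generally associated with Z-DNA formation. This is a crude proxy chosen for illustration and is not the Z-hunt statistical mechanical model, which relies on empirically determined energetic parameters; the function name and the `min_len` threshold are assumptions.

```python
PURINES = set("AG")


def alternating_runs(seq, min_len=6):
    """Return (start, segment) for maximal purine/pyrimidine alternating runs.

    A run ends where two neighbouring bases fall in the same class
    (both purines or both pyrimidines). Runs shorter than min_len are dropped.
    """
    runs = []
    start = 0
    for i in range(1, len(seq) + 1):
        at_end = i == len(seq)
        if at_end or (seq[i] in PURINES) == (seq[i - 1] in PURINES):
            if i - start >= min_len:
                runs.append((start, seq[start:i]))
            start = i
    return runs
```

A scorer closer to the abstract's description would additionally weight each dinucleotide step by its B-to-Z transition energy and rank candidates against randomly generated sequences of similar size.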
Comparative Genomic and Transcriptomic Characterization of the Toxigenic Marine Dinoflagellate Alexandrium ostenfeldii

PubMed Central

Jaeckisch, Nina; Yang, Ines; Wohlrab, Sylke; Glöckner, Gernot; Kroymann, Juergen; Vogel, Heiko; Cembella, Allan; John, Uwe

2011-01-01

Many dinoflagellate species are notorious for the toxins they produce and ecological and human health consequences associated with harmful algal blooms (HABs). Dinoflagellates are particularly refractory to genomic analysis due to the enormous genome size, lack of knowledge about their DNA composition and structure, and peculiarities of gene regulation, such as spliced leader (SL) trans-splicing and mRNA transposition mechanisms. Alexandrium ostenfeldii is known to produce macrocyclic imine toxins, described as spirolides. We characterized the genome of A. ostenfeldii using a combination of transcriptomic data and random genomic clones for comparison with other dinoflagellates, particularly Alexandrium species. Examination of SL sequences revealed similar features as in other dinoflagellates, including Alexandrium species. SL sequences in decay indicate frequent retro-transposition of mRNA species. This probably contributes to overall genome complexity by generating additional gene copies. Sequencing of several thousand fosmid and bacterial artificial chromosome (BAC) ends yielded a wealth of simple repeats and tandemly repeated longer sequence stretches which we estimated to comprise more than half of the whole genome. Surprisingly, the repeats comprise a very limited set of 79–97 bp sequences; in part the genome is thus a relatively uniform sequence space interrupted by coding sequences. Our genomic sequence survey (GSS) represents the largest genomic data set of a dinoflagellate to date. Alexandrium ostenfeldii is a typical dinoflagellate with respect to its transcriptome and mRNA transposition but demonstrates Alexandrium-like stop codon usage. The large portion of repetitive sequences and the organization within the genome is in agreement with several other studies on dinoflagellates using different approaches. 
It remains to be determined whether this unusual composition is directly correlated to the exceptional genome organization of dinoflagellates with a low amount of histones and histone-like proteins. PMID:22164224
Recognition of platinum-DNA adducts by HMGB1a.

PubMed

Ramachandran, Srinivas; Temple, Brenda; Alexandrova, Anastassia N; Chaney, Stephen G; Dokholyan, Nikolay V

2012-09-25

Cisplatin (CP) and oxaliplatin (OX), platinum-based drugs used widely in chemotherapy, form adducts on intrastrand guanines (5'GG) in genomic DNA. DNA damage recognition proteins, transcription factors, mismatch repair proteins, and DNA polymerases discriminate between CP- and OX-GG DNA adducts, which could partly account for differences in the efficacy, toxicity, and mutagenicity of CP and OX. In addition, differential recognition of CP- and OX-GG adducts is highly dependent on the sequence context of the Pt-GG adduct. In particular, DNA binding protein domain HMGB1a binds to CP-GG DNA adducts with up to 53-fold greater affinity than to OX-GG adducts in the TGGA sequence context but shows much smaller differences in binding in the AGGC or TGGT sequence contexts. Here, simulations of the HMGB1a-Pt-DNA complex in the three sequence contexts revealed a higher number of interface contacts for the CP-DNA complex in the TGGA sequence context than in the OX-DNA complex. However, the number of interface contacts was similar in the TGGT and AGGC sequence contexts. The higher number of interface contacts in the CP-TGGA sequence context corresponded to a larger roll of the Pt-GG base pair step. Furthermore, geometric analysis of stacking of phenylalanine 37 in HMGB1a (Phe37) with the platinated guanines revealed more favorable stacking modes correlated with a larger roll of the Pt-GG base pair step in the TGGA sequence context. These data are consistent with our previous molecular dynamics simulations showing that the CP-TGGA complex was able to sample larger roll angles than the OX-TGGA complex or either CP- or OX-DNA complexes in the AGGC or TGGT sequences. We infer that the high binding affinity of HMGB1a for CP-TGGA is due to the greater flexibility of CP-TGGA compared to OX-TGGA and other Pt-DNA adducts. 
This increased flexibility is reflected in the ability of CP-TGGA to sample larger roll angles, which allows for a higher number of interface contacts between the Pt-DNA adduct and HMGB1a.
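The sequence contexts named in this abstract (TGGA, AGGC, TGGT) are each a GG step with one flanking base on either side. As a minimal illustrative sketch, and not part of the authors' simulation workflow, the following Python locates every GG step in a strand and tallies those three contexts. The function names and the example strand are hypothetical.

```python
import re

# Tetranucleotide contexts named in the abstract.
CONTEXTS = ("TGGA", "AGGC", "TGGT")

def find_gg_contexts(seq):
    """Return (index_of_first_G, context) for each GG step with both flanking bases."""
    seq = seq.upper()
    hits = []
    # A lookahead is used so that overlapping sites (e.g. in GGG runs) are all reported.
    for m in re.finditer(r"(?=([ACGT]GG[ACGT]))", seq):
        hits.append((m.start() + 1, m.group(1)))  # 0-based index of the 5' G
    return hits

def count_named_contexts(seq):
    """Count how often each context from the abstract occurs in seq."""
    counts = {c: 0 for c in CONTEXTS}
    for _, ctx in find_gg_contexts(seq):
        if ctx in counts:
            counts[ctx] += 1
    return counts

if __name__ == "__main__":
    example = "ATGGACAGGCTTGGT"  # made-up strand for illustration only
    print(find_gg_contexts(example))
    print(count_named_contexts(example))
```

Only one strand is scanned here; a real analysis would also need to consider the complementary strand.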
Gene Knockout by Targeted Mutagenesis in a Hemimetabolous Insect, the Two-Spotted Cricket Gryllus bimaculatus, using TALENs.

PubMed

Watanabe, Takahito; Noji, Sumihare; Mito, Taro

2016-01-01

Hemimetabolous, or incompletely metamorphosing, insects are phylogenetically basal. These insects include many deleterious species. The cricket, Gryllus bimaculatus, is an emerging model for hemimetabolous insects, based on the success of RNA interference (RNAi)-based gene-functional analyses and transgenic technology. Taking advantage of genome-editing technologies in this species would greatly promote functional genomics studies. Genome editing using transcription activator-like effector nucleases (TALENs) has proven to be an effective method for site-specific genome manipulation in various species. TALENs are artificial nucleases that are capable of inducing DNA double-strand breaks into specified target sequences. Here, we describe a protocol for TALEN-based gene knockout in G. bimaculatus, including a mutant selection scheme via mutation detection assays, for generating homozygous knockout organisms.
Characterization of proviruses cloned from mink cell focus-forming virus-infected cellular DNA.

PubMed Central

Khan, A S; Repaske, R; Garon, C F; Chan, H W; Rowe, W P; Martin, M A

1982-01-01

Two proviruses were cloned from EcoRI-digested DNA extracted from mink cells chronically infected with AKR mink cell focus-forming (MCF) 247 murine leukemia virus (MuLV), using a lambda phage host vector system. One cloned MuLV DNA fragment (designated MCF 1) contained sequences extending 6.8 kilobases from an EcoRI restriction site in the 5' long terminal repeat (LTR) to an EcoRI site located in the envelope (env) region and was indistinguishable by restriction endonuclease mapping for 5.1 kilobases (except for the EcoRI site in the LTR) from the 5' end of AKR ecotropic proviral DNA. The DNA segment extending from 5.1 to 6.8 kilobases contained several restriction sites that were not present in the AKR ecotropic provirus. A 0.5-kilobase DNA segment located at the 3' end of MCF 1 DNA contained sequences which hybridized to a xenotropic env-specific DNA probe but not to labeled ecotropic env-specific DNA. This dual character of MCF 1 proviral DNA was also confirmed by analyzing heteroduplex molecules by electron microscopy. The second cloned proviral DNA (designated MCF 2) was a 6.9-kilobase EcoRI DNA fragment which contained LTR sequences at each end and a 2.0-kilobase deletion encompassing most of the env region. The MCF 2 proviral DNA proved to be a useful reagent for detecting LTRs electron microscopically due to the presence of nonoverlapping, terminally located LTR sequences which effected its circularization with DNAs containing homologous LTR sequences. Nucleotide sequence analysis demonstrated the presence of a 104-base-pair direct repeat in the LTR of MCF 2 DNA. In contrast, only a single copy of the reiterated component of the direct repeat was present in MCF 1 DNA. PMID:6281459
Organization and evolution of highly repeated satellite DNA sequences in plant chromosomes.

PubMed

Sharma, S; Raina, S N

2005-01-01

A major component of the plant nuclear genome is constituted by different classes of repetitive DNA sequences. The structural, functional and evolutionary aspects of the satellite repetitive DNA families, and their organization in the chromosomes is reviewed. The tandem satellite DNA sequences exhibit characteristic chromosomal locations, usually at subtelomeric and centromeric regions. The repetitive DNA family(ies) may be widely distributed in a taxonomic family or a genus, or may be specific for a species, genome or even a chromosome. They may acquire large-scale variations in their sequence and copy number over an evolutionary time-scale. These features have formed the basis of extensive utilization of repetitive sequences for taxonomic and phylogenetic studies. Hybrid polyploids have especially proven to be excellent models for studying the evolution of repetitive DNA sequences. Recent studies explicitly show that some repetitive DNA families localized at the telomeres and centromeres have acquired important structural and functional significance. The repetitive elements are under different evolutionary constraints as compared to the genes. Satellite DNA families are thought to arise de novo as a consequence of molecular mechanisms such as unequal crossing over, rolling circle amplification, replication slippage and mutation that constitute "molecular drive". Copyright 2005 S. Karger AG, Basel.
First Molecular Identification and Phylogeny of Moroccan Anopheles sergentii (Diptera: Culicidae) Based on Second Internal Transcribed Spacer (ITS2) and Cytochrome c Oxidase I (COI) Sequences.

PubMed

Benabdelkrim Filali, Oumama; Kabine, Mostafa; El Hamouchi, Adil; Lemrani, Meryem; Debboun, Mustapha; Sarih, M'hammed

2018-06-05

Anopheles sergentii, known as the "oasis vector" or the "desert malaria vector", is considered the main vector of malaria in the southern parts of Morocco. Its presence in Morocco is confirmed for the first time through sequencing of mitochondrial DNA (mtDNA) cytochrome c oxidase subunit I (COI) barcodes and nuclear ribosomal DNA (rDNA) second internal transcribed spacer (ITS2) sequences and direct comparison with specimens of A. sergentii from other countries. The DNA barcodes (n = 39) obtained from A. sergentii collected in 2015 and 2016 showed more diversity, with 10 haplotypes, compared with 3 haplotypes obtained from ITS2 sequences (n = 59). Moreover, the comparison using the ITS2 sequences showed a closer evolutionary relationship between the Moroccan and Egyptian strains than with the Iranian strain. Nevertheless, genetic differences due to geographical segregation were also observed. This study provides the first report on the sequence of rDNA-ITS2 and mtDNA COI, which could be used to better understand the biodiversity of A. sergentii.
Does TATA matter? A structural exploration of the selectivity determinants in its complexes with TATA box-binding protein.

PubMed Central

Pastor, N; Pardo, L; Weinstein, H

1997-01-01

The binding of the TATA box-binding protein (TBP) to a TATA sequence in DNA is essential for eukaryotic basal transcription. TBP binds in the minor groove of DNA, causing a large distortion of the DNA helix. Given the apparent stereochemical equivalence of AT and TA basepairs in the minor groove, DNA deformability must play a significant role in binding site selection, because not all AT-rich sequences are bound effectively by TBP. To gain insight into the precise role that the properties of the TATA sequence have in determining the specificity of the DNA substrates of TBP, the solution structure and dynamics of seven DNA dodecamers have been studied by using molecular dynamics simulations. The analysis of the structural properties of basepair steps in these TATA sequences suggests a reason for the preference for alternating pyrimidine-purine (YR) sequences, but indicates that these properties cannot be the sole determinant of the sequence specificity of TBP. Rather, recognition depends on the interplay between the inherent deformability of the DNA and steric complementarity at the molecular interface. PMID:9251783
Competition between B-Z and B-L transitions in a single DNA molecule: Computational studies

NASA Astrophysics Data System (ADS)

Kwon, Ah-Young; Nam, Gi-Moon; Johner, Albert; Kim, Seyong; Hong, Seok-Cheol; Lee, Nam-Kyung

2016-02-01

Under negative torsion, DNA adopts left-handed helical forms, such as Z-DNA and L-DNA. Using the random copolymer model developed for a wormlike chain, we represent a single DNA molecule with structural heterogeneity as a helical chain consisting of monomers which can be characterized by different helical senses and pitches. By Monte Carlo simulation, in which we take bending and twist fluctuations into account explicitly, we study the sequence dependence of B-Z transitions under torsional stress and tension, focusing on the interaction with B-L transitions. We consider core sequences, (GC)n repeats or (TG)n repeats, which can interconvert between the right-handed B form and the left-handed Z form, embedded in a random sequence, which can convert to the left-handed L form with a different (tension-dependent) helical pitch. We show that Z-DNA formation from the (GC)n sequence is always supported by unwinding torsional stress, but Z-DNA formation from (TG)n sequences, which are more costly to convert but numerous, can be strongly influenced by the quenched disorder in the surrounding random sequence.
Extending the spectrum of DNA sequences retrieved from ancient bones and teeth

PubMed Central

Glocke, Isabelle; Meyer, Matthias

2017-01-01

The number of DNA fragments surviving in ancient bones and teeth is known to decrease with fragment length. Recent genetic analyses of Middle Pleistocene remains have shown that the recovery of extremely short fragments can prove critical for successful retrieval of sequence information from particularly degraded ancient biological material. Current sample preparation techniques, however, are not optimized to recover DNA sequences from fragments shorter than ∼35 base pairs (bp). Here, we show that much shorter DNA fragments are present in ancient skeletal remains but lost during DNA extraction. We present a refined silica-based DNA extraction method that not only enables efficient recovery of molecules as short as 25 bp but also doubles the yield of sequences from longer fragments due to improved recovery of molecules with single-strand breaks. Furthermore, we present strategies for monitoring inefficiencies in library preparation that may result from co-extraction of inhibitory substances during DNA extraction. The combination of DNA extraction and library preparation techniques described here substantially increases the yield of DNA sequences from ancient remains and provides access to a yet unexploited source of highly degraded DNA fragments. Our work may thus open the door for genetic analyses on even older material. PMID:28408382
Complementary DNA cloning and molecular evolution of opine dehydrogenases in some marine invertebrates.

PubMed

Kimura, Tomohiro; Nakano, Toshiki; Yamaguchi, Toshiyasu; Sato, Minoru; Ogawa, Tomohisa; Muramoto, Koji; Yokoyama, Takehiko; Kan-No, Nobuhiro; Nagahisa, Eizou; Janssen, Frank; Grieshaber, Manfred K

2004-01-01

The complete complementary DNA sequences of genes presumably coding for opine dehydrogenases from Arabella iricolor (sandworm), Haliotis discus hannai (abalone), and Patinopecten yessoensis (scallop) were determined, and partial cDNA sequences were derived for Meretrix lusoria (Japanese hard clam) and Spisula sachalinensis (Sakhalin surf clam). The primers ODH-9F and ODH-11R proved useful for amplifying the sequences for opine dehydrogenases from the 4 mollusk species investigated in this study. The sequence of the sandworm was obtained using primers constructed from the amino acid sequence of tauropine dehydrogenase, the main opine dehydrogenase in A. iricolor. The complete cDNA sequence of A. iricolor, H. discus hannai, and P. yessoensis encode 397, 400, and 405 amino acids, respectively. All sequences were aligned and compared with published databank sequences of Loligo opalescens, Loligo vulgaris (squid), Sepia officinalis (cuttlefish), and Pecten maximus (scallop). As expected, a high level of homology was observed for the cDNA from closely related species, such as for cephalopods or scallops, whereas cDNA from the other species showed lower-level homologies. A similar trend was observed when the deduced amino acid sequences were compared. Furthermore, alignment of these sequences revealed some structural motifs that are possibly related to the binding sites of the substrates. The phylogenetic trees derived from the nucleotide and amino acid sequences were consistent with the classification of species resulting from classical taxonomic analyses.
Artificial mismatch hybridization

DOEpatents

Guo, Zhen; Smith, Lloyd M.

1998-01-01

An improved nucleic acid hybridization process is provided which employs a modified oligonucleotide and improves the ability to discriminate a control nucleic acid target from a variant nucleic acid target containing a sequence variation. The modified probe contains at least one artificial mismatch relative to the control nucleic acid target in addition to any mismatch(es) arising from the sequence variation. The invention has direct and advantageous application to numerous existing hybridization methods, including applications that employ, for example, the Polymerase Chain Reaction, allele-specific nucleic acid sequencing methods, and diagnostic hybridization methods.
Fusion of GFP to the M.EcoKI DNA methyltransferase produces a new probe of Type I DNA restriction and modification enzymes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chen, Kai; Roberts, Gareth A.; Stephanou, Augoustinos S.

2010-07-23

Research highlights: Successful fusion of GFP to M.EcoKI DNA methyltransferase. GFP located at the C-terminus of the sequence specificity subunit does not alter enzyme activity. FRET confirms the structural model of M.EcoKI bound to DNA. Abstract: We describe the fusion of enhanced green fluorescent protein to the C-terminus of the HsdS DNA sequence-specificity subunit of the Type I DNA modification methyltransferase M.EcoKI. The fusion expresses well in vivo and assembles with the two HsdM modification subunits. The fusion protein functions as a sequence-specific DNA methyltransferase protecting DNA against digestion by the EcoKI restriction endonuclease. The purified enzyme shows Förster resonance energy transfer to fluorescently-labelled DNA duplexes containing the target sequence and to fluorescently-labelled ocr protein, a DNA mimic that binds to the M.EcoKI enzyme. Distances determined from the energy transfer experiments corroborate the structural model of M.EcoKI.
In silico evidence for sequence-dependent nucleosome sliding

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lequieu, Joshua; Schwartz, David C.; de Pablo, Juan J.

Nucleosomes represent the basic building block of chromatin and provide an important mechanism by which cellular processes are controlled. The locations of nucleosomes across the genome are not random but instead depend on both the underlying DNA sequence and the dynamic action of other proteins within the nucleus. These processes are central to cellular function, and the molecular details of the interplay between DNA sequence and nucleosome dynamics remain poorly understood. In this work, we investigate this interplay in detail by relying on a molecular model, which permits development of a comprehensive picture of the underlying free energy surfaces and the corresponding dynamics of nucleosome repositioning. The mechanism of nucleosome repositioning is shown to be strongly linked to DNA sequence and directly related to the binding energy of a given DNA sequence to the histone core. It is also demonstrated that chromatin remodelers can override DNA-sequence preferences by exerting torque, and the histone H4 tail is then identified as a key component by which DNA-sequence, histone modifications, and chromatin remodelers could in fact be coupled.
Dual signal amplification for highly sensitive electrochemical detection of uropathogens via enzyme-based catalytic target recycling.

PubMed

Su, Jiao; Zhang, Haijie; Jiang, Bingying; Zheng, Huzhi; Chai, Yaqin; Yuan, Ruo; Xiang, Yun

2011-11-15

We report an ultrasensitive electrochemical approach for the detection of a uropathogen sequence-specific DNA target. The sensing strategy involves a dual signal amplification process, which combines the signal enhancement by the enzymatic target recycling technique with the sensitivity improvement by the quantum dot (QD) layer-by-layer (LBL) assembled labels. The enzyme-based catalytic target DNA recycling process results in the use of each target DNA sequence multiple times and leads to direct amplification of the analytical signal. Moreover, the LBL assembled QD labels can further enhance the sensitivity of the sensing system. The coupling of these two effective signal amplification strategies thus leads to low femtomolar (5 fM) detection of the target DNA sequences. The proposed strategy also shows excellent discrimination between the target DNA and the single-base mismatch sequences. The advantageous intrinsic sequence-independent property of exonuclease III over other sequence-dependent enzymes makes our new dual signal amplification system a general sensing platform for monitoring ultralow levels of various types of target DNA sequences. Copyright © 2011 Elsevier B.V. All rights reserved.
Relations between Shannon entropy and genome order index in segmenting DNA sequences.

PubMed

Zhang, Yi

2009-04-01

Shannon entropy H and genome order index S are used in segmenting DNA sequences. Zhang [Phys. Rev. E 72, 041917 (2005)] found that the two schemes are equivalent when a DNA sequence is converted to a binary sequence of S (strong H bond) and W (weak H bond). They left the mathematical proof to mathematicians who are interested in this issue. In this paper, a possible mathematical explanation is given. Moreover, we find that Chargaff parity rule 2 is the necessary condition of the equivalence, and the equivalence disappears when a DNA sequence is regarded as a four-symbol sequence. At last, we propose that S − 2^(−H) may be related to species evolution.
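The two quantities compared in this abstract can be computed directly from symbol frequencies. The sketch below is illustrative only: it assumes the usual definitions, H = −Σ p log2 p and the genome order index as the sum of squared symbol frequencies, and the function names are hypothetical. It evaluates both for a sequence in its four-symbol form and in the binary strong/weak form (G and C strong, A and T weak).

```python
import math
from collections import Counter

def frequencies(symbols):
    """Relative frequency of each distinct symbol."""
    counts = Counter(symbols)
    total = sum(counts.values())
    return [n / total for n in counts.values()]

def shannon_entropy(symbols):
    """H = -sum(p * log2(p)) over the observed symbols, in bits."""
    return -sum(p * math.log2(p) for p in frequencies(symbols))

def order_index(symbols):
    """Assumed definition of the order index: sum of squared frequencies."""
    return sum(p * p for p in frequencies(symbols))

def to_strong_weak(seq):
    """Map G/C to 'S' (strong H bond) and A/T to 'W' (weak H bond)."""
    return ["S" if base in "GC" else "W" for base in seq.upper()]

if __name__ == "__main__":
    seq = "ATGCGGCATTAACG"  # made-up sequence for illustration only
    for label, symbols in (("four-symbol", list(seq)), ("binary S/W", to_strong_weak(seq))):
        print(label, round(shannon_entropy(symbols), 4), round(order_index(symbols), 4))
```

Note that the abstract uses S both for the order index and for the "strong" class; the code keeps them apart by name.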
Evaluating the role of coherent delocalized phonon-like modes in DNA cyclization

DOE PAGES

Alexandrov, Ludmil B.; Rasmussen, Kim Ø.; Bishop, Alan R.; ...

2017-08-29

The innate flexibility of a DNA sequence is quantified by the Jacobson-Stockmayer’s J-factor, which measures the propensity for DNA loop formation. Recent studies of ultra-short DNA sequences revealed a discrepancy of up to six orders of magnitude between experimentally measured and theoretically predicted J-factors. These large differences suggest that, in addition to the elastic moduli of the double helix, other factors contribute to loop formation. We develop a new theoretical model that explores how coherent delocalized phonon-like modes in DNA provide single-stranded “flexible hinges” to assist in loop formation. We also combine the Czapla-Swigon-Olson structural model of DNA with our extended Peyrard-Bishop-Dauxois model and, without changing any of the parameters of the two models, apply this new computational framework to 86 experimentally characterized DNA sequences. Our results demonstrate that the new computational framework can predict J-factors within an order of magnitude of experimental measurements for most ultra-short DNA sequences, while continuing to accurately describe the J-factors of longer sequences. Furthermore, we demonstrate that our computational framework can be used to describe the cyclization of DNA sequences that contain a base pair mismatch. Overall, our results support the conclusion that coherent delocalized phonon-like modes play an important role in DNA cyclization.
mtDNAmanager: a Web-based tool for the management and quality analysis of mitochondrial DNA control-region sequences

PubMed Central

Lee, Hwan Young; Song, Injee; Ha, Eunho; Cho, Sung-Bae; Yang, Woo Ick; Shin, Kyoung-Jin

2008-01-01

Background: For the past few years, scientific controversy has surrounded the large number of errors in forensic and literature mitochondrial DNA (mtDNA) data. However, recent research has shown that using mtDNA phylogeny and referring to known mtDNA haplotypes can be useful for checking the quality of sequence data. Results: We developed a Web-based bioinformatics resource "mtDNAmanager" that offers a convenient interface supporting the management and quality analysis of mtDNA sequence data. The mtDNAmanager performs computations on mtDNA control-region sequences to estimate the most-probable mtDNA haplogroups and retrieves similar sequences from a selected database. By the phased designation of the most-probable haplogroups (both expected and estimated haplogroups), mtDNAmanager enables users to systematically detect errors whilst allowing for confirmation of the presence of clear key diagnostic mutations and accompanying mutations. The query tools of mtDNAmanager also facilitate database screening with two options of "match" and "include the queried nucleotide polymorphism". In addition, mtDNAmanager provides Web interfaces for users to manage and analyse their own data in batch mode. Conclusion: The mtDNAmanager will provide systematic routines for mtDNA sequence data management and analysis via easily accessible Web interfaces, and thus should be very useful for population, medical and forensic studies that employ mtDNA analysis. mtDNAmanager can be accessed at . PMID:19014619
repDNA: a Python package to generate various modes of feature vectors for DNA sequences by incorporating user-defined physicochemical properties and sequence-order effects.

PubMed

Liu, Bin; Liu, Fule; Fang, Longyun; Wang, Xiaolong; Chou, Kuo-Chen

2015-04-15

In order to develop powerful computational predictors for identifying the biological features or attributes of DNAs, one of the most challenging problems is to find a suitable approach to effectively represent the DNA sequences. To facilitate the studies of DNAs and nucleotides, we developed a Python package called representations of DNAs (repDNA) for generating the widely used features reflecting the physicochemical properties and sequence-order effects of DNAs and nucleotides. There are three feature groups composed of 15 features. The first group calculates three nucleic acid composition features describing the local sequence information by means of k-mers; the second group calculates six autocorrelation features describing the level of correlation between two oligonucleotides along a DNA sequence in terms of their specific physicochemical properties; the third group calculates six pseudo nucleotide composition features, which can be used to represent a DNA sequence with a discrete model or vector yet still keep considerable sequence-order information via the physicochemical properties of its constituent oligonucleotides. In addition, these features can be easily calculated from both the built-in and user-defined properties using repDNA. The repDNA Python package is freely accessible to the public at http://bioinformatics.hitsz.edu.cn/repDNA/. Contact: bliu@insun.hit.edu.cn or kcchou@gordonlifescience.org. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
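The first feature group described above represents a sequence by its k-mer composition. The following plain-Python sketch illustrates that idea under simple assumptions; it does not use or reproduce the repDNA API, and the function name `kmer_composition` and its parameters are hypothetical.

```python
from itertools import product

def kmer_composition(seq, k=2, normalize=True):
    """Return a fixed-length vector of k-mer counts (or frequencies) for seq.

    The vector has 4**k entries, ordered lexicographically over A, C, G, T.
    """
    seq = seq.upper()
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    # Slide a window of width k along the sequence, one base at a time.
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if window in counts:  # windows containing N or other symbols are skipped
            counts[window] += 1
    total = sum(counts.values())
    if normalize and total:
        return [counts[km] / total for km in kmers]
    return [counts[km] for km in kmers]

if __name__ == "__main__":
    vector = kmer_composition("AACGTTGCA", k=2)  # made-up sequence
    print(len(vector), vector)
```

A fixed ordering of the 4^k k-mers is what makes vectors from different sequences comparable, which is why the k-mer list is generated rather than taken from the sequence itself.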
